Data Resource - Text File
About
A text file is a file data resource that contains characters data.
Attributes
A text file allows to specifies the following attributes.
| Name | Default Values | Description |
|---|---|---|
| endOfRecords | \r\n, \n or \r on Read \r\n (Windows) or \n (Linux) on Write | The end of record (EOR) are the characters that design the end of a record (by default the end of line) |
| characterSet | Detected on Read UTF-8 on Creation | The character set that maps bits data to characters. |
| columnName | Lines | The name of the column when loaded in a relational database. A text file is then processed as tabular data with one column of type text. |
Note also that the count common attribute gives you the number of record (line by default).
CharacterSet
The below values are the detected and most known characterSet values.
If a value entered is not supported, you will get the whole list of supported character set.
| Value | Languages (Description) |
|---|---|
| US-ASCII | Seven-bit ASCII, a.k.a. |
| UTF-16 | Sixteen-bit UCS Transformation Format, byte order identified by an optional byte-order mark |
| UTF-16BE | Sixteen-bit UCS Transformation Format, big-endian byte order |
| UTF-16LE | Sixteen-bit UCS Transformation Format, little-endian byte order |
| UTF-8 | Eight-bit UCS Transformation Format |
| UTF-32BE | |
| UTF-32LE | |
| Shift_JIS | Japanese |
| ISO-2022-JP | Japanese |
| ISO-2022-CN | Simplified Chinese |
| ISO-2022-KR | Korean |
| GB18030 | Chinese |
| Big5 | Traditional Chinese |
| EUC-JP | Japanese |
| EUC-KR | Korean |
| ISO-8859-1 | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish - ISO Latin Alphabet No |
| ISO-8859-2 | Czech, Hungarian, Polish, Romanian |
| ISO-8859-5 | Russian |
| ISO-8859-6 | Arabic |
| ISO-8859-7 | Greek |
| ISO-8859-8 | Hebrew |
| ISO-8859-9 | Turkish |
| windows-1250 | Czech, Hungarian, Polish, Romanian |
| windows-1251 | Russian |
| windows-1252 | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish |
| windows-1253 | Greek |
| windows-1254 | Turkish |
| windows-1255 | Hebrew |
| windows-1256 | Arabic |
| KOI8-R | Russian |
| IBM420 | Arabic |
| IBM424 | Hebrew |