A text file is a file data resource that contains characters data.
A text file allows to specifies the following attributes.
Name | Default Values | Description |
---|---|---|
endOfRecords | \r\n, \n or \r on Read \r\n (Windows) or \n (Linux) on Write | The end of record (EOR) are the characters that design the end of a record (by default the end of line) |
characterSet | Detected on Read UTF-8 on Creation | The character set that maps bits data to characters. |
columnName | Lines | The name of the column when loaded in a relational database. A text file is then processed as tabular data with one column of type text. |
Note also that the count common attribute gives you the number of record (line by default).
The below values are the detected and most known characterSet values.
If a value entered is not supported, you will get the whole list of supported character set.
Value | Languages (Description) |
---|---|
US-ASCII | Seven-bit ASCII, a.k.a. |
UTF-16 | Sixteen-bit UCS Transformation Format, byte order identified by an optional byte-order mark |
UTF-16BE | Sixteen-bit UCS Transformation Format, big-endian byte order |
UTF-16LE | Sixteen-bit UCS Transformation Format, little-endian byte order |
UTF-8 | Eight-bit UCS Transformation Format |
UTF-32BE | |
UTF-32LE | |
Shift_JIS | Japanese |
ISO-2022-JP | Japanese |
ISO-2022-CN | Simplified Chinese |
ISO-2022-KR | Korean |
GB18030 | Chinese |
Big5 | Traditional Chinese |
EUC-JP | Japanese |
EUC-KR | Korean |
ISO-8859-1 | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish - ISO Latin Alphabet No |
ISO-8859-2 | Czech, Hungarian, Polish, Romanian |
ISO-8859-5 | Russian |
ISO-8859-6 | Arabic |
ISO-8859-7 | Greek |
ISO-8859-8 | Hebrew |
ISO-8859-9 | Turkish |
windows-1250 | Czech, Hungarian, Polish, Romanian |
windows-1251 | Russian |
windows-1252 | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish |
windows-1253 | Greek |
windows-1254 | Turkish |
windows-1255 | Hebrew |
windows-1256 | Arabic |
KOI8-R | Russian |
IBM420 | Arabic |
IBM424 | Hebrew |