Categories


Popular topics

The Conversions drop down in the Advanced tab offers the following conversion options related to file encoding for the active file.

Conversion Description
Unix / Mac (legacy) to DOS Converts all occurrences of Unix line terminators (hex 0A, or LF), and all occurrences of Mac (legacy) line terminators (hex 0D, or CR) with Windows / DOS style line terminators (hex 0D 0A, or CR/LF).
DOS to Mac (legacy) Converts all occurrences of DOS style line terminators to Mac (legacy) line terminators.
DOS to Unix Converts all occurrences of DOS style line terminators to Unix line terminators.
EBCDIC to ASCII Converts the complete file from EBCDIC encoding to ASCII. For example, files from the IBM mainframe system will often be in EBCDIC format.
ASCII to EBCDIC Converts the complete file from ASCII encoding to EBCDIC.
OEM to ANSI Converts the complete file from OEM encoding to ANSI. This command assumes that the file's underlying encoding is using the OEM format. OEM is typically used with much older DOS systems.
OEM to EBCDIC Converts the complete file from OEM encoding to EBCDIC.
ASCII to Unicode Converts the complete file from ASCII encoding to Unicode (16-bit wide characters, also known as "UTF-16"). The conversion uses the active ASCII code page as the source encoding, which is visible in the status bar. Unless you've manually changed this, this will be the default ASCII code page for the operating system. After conversion, the status bar shows a "U-" in front of the line terminator type to indicate the file is Unicode.
UTF-8 to Unicode Converts the complete file from UTF-8 encoding to Unicode (16-bit wide characters, also known as "UTF-16"). After conversion, the status bar will show a "U-" in front of the line terminator type to indicate the file is Unicode.
Unicode to ASCII Converts the complete file from Unicode (UTF-16) encoding to ASCII. The conversion uses the active ASCII code page as the source encoding, which is visible in the status bar. Unless you've manually changed this, this will be the default ASCII code page for the operating system. Because the Unicode character set supports far more characters and glyphs than ASCII, it's possible not all characters can be converted. In these cases, a question mark "?" is substituted in for these characters. For Unicode files the status bar shows a "U-" in front of line terminator type to indicate the file is Unicode, but following this conversion, no indicator is present.
UTF-8 to ASCII Converts the complete file from UTF-8 encoding to ASCII. The conversion uses the active ASCII code page as the source encoding, which is visible in the status bar. Unless you've manually changed this, this will be the default ASCII code page for the operating system. Because the Unicode character set supports far more characters and glyphs than ASCII, it's possible not all characters can be converted. In these cases, a question mark "?" is substituted in for these characters. For UTF-8 files the status bar shows a "U8-" in front of line terminator type to indicate the file is UTF-8, but following this conversion, no indicator is present.
ASCII to UTF-8 Converts the complete file from ASCII encoding to UTF-8. After conversion, the status bar will show a "U8-" in front of the line terminator type to indicate the file is UTF-8.
Unicode/UTF-8 to UTF-8 (Unicode editing) Converts the complete file from Unicode (UTF-16) or UTF-8 (ASCII representation) encoding to UTF-8. After conversion, the status bar will show a "U8-" in front of the line terminator type to indicate the file is UTF-8.
Unicode/ASCII/UTF-8 to UTF-8 (Unicode editing) Converts the complete file from Unicode (UTF-16), ASCII, or UTF-8 (ASCII representation) encoding to UTF-8. The file will however not be internally handled as Unicode and as such the multi-byte UTF-8 characters will display as upper ASCII characters and not the UTF-8 character they represent. The status bar will not indicate this file format as it has no different structure from an ASCII file.
Unicode to Unicode big-endian Converts the complete file from Unicode (UTF-16) encoding to Unicode big-endian encoding. Endian refers to the order in which bytes are stored. On Windows, which uses the little endian format of Unicode, UTF-16LE is just called "Unicode" or "UTF-16" and UTF-16BE is called "Unicode (big endian)".
Unicode big-endian to Unicode Converts the complete file from Unicode (big-endian) encoding to Unicode (UTF-16) encoding. Endian refers to the order in which bytes are stored. On Windows, which are mostly little endian, UTF-16LE is just called "Unicode" or "UTF-16" and UTF-16BE is just called "Unicode (big endian)".
Unicode to ASCII-escaped Unicode Converts the complete file from Unicode (UTF-16) to ASCII-escaped Unicode, which allows the file to maintain its Unicode bytes while in ASCII format.
Unicode to ASCII-escaped Unicode Converts the complete file from ASCII-escaped Unicode to Unicode (UTF-16).