OpenI18N Standard Codeset Name alias table

(Last Update: 2003-03-11)

###
### notation used in this document:
### "###" indicates the following text is commentary
### "*" means the field is empty.
### "+" means the name is supported in the category as previously defined.
### "%" begins and ends the category keywords
###
### The purpose of this document is to identify the commonly used character
### set names on the Linux platform. This also includes the aliases/names
### that are ambiguous and inconsistent, therefore, not recommended. 
### There are problems associated with data conversion between native encoding
### and Unicode. Thus, to ensure the data integrity of your document,
### we recommend you create your document in UTF-8 encoding only. 
### For examples on issues regarding the data conversion, please see
### the W3C Note about Japanese text profiling information
### of XML document at http://www.w3.org/TR/japanese-xml/.
###

%Standard Name% %glibc charset names% %Preferred MIME name% %other IANA names% %Java names% %NOT-RECOMMENDED names% %NOTE%
ISO-646-US ANSI_X3.4-1986, ISO-IR-6, ANSI_X3.4-1968, ISO_646.IRV:1991, ASCII, ISO646-US, US-ASCII, IBM367, CP367 US-ASCII ANSI_X3.4-1968, iso-ir-6, ANSI_X3.4-1986, ISO_646.irv:1991, ASCII, ISO646-US, us, IBM367, cp367, csASCII ascii7, default, 646, iso_646.irv:1983, iso969-US
TW-BIG5 BIG5, BIG5-CP950 Big5 csBig5 big5
HKSCS-BIG5 BIG5-HKSCS, BIG5HKSCS * Big5-HKSCS big5hk, big5-hkscs:unicode 3.0
EUC-JP EUC-JP EUC-JP Extended_UNIX_Code_Packed_Format_for_Japanese, csEUCPkdFmtJapanese x-eucjp, x-euc-jp ujis (glibc) eucjis (java) ### These names are inconsistent and ambiguous, therefore not recommended.
UTF-8 UTF-8 UTF-8 * unicode-1-1-utf-8 eucjputf8 ### Ambiguous name, not recommended.
EUC-KR EUC-KR EUC-KR csEUCKR 5601, ksc-5601, ksc-5601-1987, ksc-5601_1987, ksc5601 (Java) ### The use of names ksc-5601[-1987] to refer to EUC-KR is not recommended.
EUC-TW EUC-TW * * cns11643, ibm-euctw
GB-18030 GB18030 * * ibm1392, ibm-1392, gb18030-2000
GB-2312 GB2312 GB2312 csGB2312 EUC_CN, euccn, euc-cn
GB-K GBK * * *
ISO-8859-1 ISO-8859-1, ISO-IR-100, ISO_8859-1:1987, ISO_8859-1, LATIN1, L1, IBM819, CP819 ISO-8859-1 ISO_8859-1:1987, iso-ir-100, ISO_8859-1, latin1, l1, IBM819, CP819, csISOLatin1 819, cp819, iso8859-1, 8859-1, iso8859_1, iso_8859_1
ISO-8859-2 ISO-8859-2, ISO-IR-101, ISO_8859-2:1987, ISO_8859-2, LATIN2, L2 ISO-8859-2 ISO_8859-2:1987, iso-ir-101, ISO_8859-2, latin2, l2, csISOLatin2 912, cp912, ibm-912, ibm912, iso8859-2, 8859-2, iso8859_2, iso_8859_2
ISO-8859-3 ISO-8859-3, ISO-IR-109, ISO_8859-3:1988, ISO_8859-3, LATIN3, L3 ISO-8859-3 ISO_8859-3:1988, iso-ir-109, ISO_8859-3, latin3, l3, csISOLatin3 913, cp913, ibm-913, ibm913, iso8859-3, 8859-3, iso8859_3, iso_8859_3
ISO-8859-4 ISO-8859-4, ISO-IR-110, ISO_8859-4:1988, ISO_8859-4, LATIN4, L4 ISO-8859-4 ISO_8859-4:1988, iso-ir-110, ISO_8859-4, latin4, l4, csISOLatin4 914, cp914, ibm-914, ibm914, iso8859-4, 8859-4, iso8859_4, iso_8859_4
ISO-8859-5 ISO-8859-5, ISO-IR-144, ISO_8859-5:1988, ISO_8859-5, CYRILLIC ISO-8859-5 ISO_8859-5:1988, iso-ir-144, ISO_8859-5, cyrillic, csISOLatinCyrillic 915, cp915, ibm-915, ibm915, iso8859-5, 8859-5, iso8859_5, iso_8859_5
ISO-8859-6 ISO-8859-6, ISO-IR-127, ISO_8859-6:1987, ISO_8859-6, ECMA-114, ASMO-708, ARABIC ISO-8859-6 ISO_8859-6:1987, iso-ir-127, ISO_8859-6, ECMA-114, ASMO-708, arabic, csISOLatinArabic 1089, cp1089, ibm-1089, ibm1089, iso8859-6, 8859-6, iso8859_6, iso_8859_6
ISO-8859-7 ISO-8859-7, ISO-IR-126, ISO_8859-7:1987, ISO_8859-7, ELOT_928, ECMA-118, GREEK, GREEK8 ISO-8859-7 ISO_8859-7:1987, iso-ir-126, ISO_8859-7, ELOT_928, ECMA-118, greek, greek8, csISOLatinGreek 813, cp813, ibm-813, ibm813, iso8859-7, 8859-7, iso8859_7, iso_8859_7
ISO-8859-8 ISO-8859-8, ISO-IR-138, ISO_8859-8:1988, ISO_8859-8, HEBREW ISO-8859-8 ISO_8859-8:1988, iso-ir-138, ISO_8859-8, hebrew, csISOLatinHebrew 916, cp916, ibm-916, ibm916, iso8859-8, 8859-8, iso8859_8, iso_8859_8
ISO-8859-9 ISO-8859-9, ISO-IR-148, ISO_8859-9:1989, ISO_8859-9, LATIN5, L5 ISO-8859-9 ISO_8859-9:1989, iso-ir-148, ISO_8859-9, latin5, l5, csISOLatin5 920, cp920, ibm-920, ibm920, iso8859-9, 8859-9, iso8859_9, iso_8859_9
ISO-8859-13 ISO-8859-13, ISO-IR-179, LATIN7, L7 * ISO-8859-13 iso_8859-13, iso8859-13, 8859-13, iso8859_13, iso_8859_13
ISO-8859-14 ISO-8859-14, LATIN8, L8 * ISO-8859-14, iso-ir-199, ISO_8859-14:1998, ISO_8859-14, latin8, iso-celtic, l8 *
ISO-8859-15 ISO-8859-15 * ISO-8859-15 csisolatin9, csisolatin0, latin9, latin0, 923, cp923, ibm-923, ibm923, iso8859-15, iso_8859-15, 8859-15, iso_8859-15_FDIS, L9
ISO-8859-16 ISO-8859-16, ISO-IR-226, LATIN10, L10 * * *
KOI-8-R KOI8-R KOI8-R csKOI8R koi8
KOI-8-U KOI8-U * KOI8-U *
KOI-8-T KOI8-T * * *
SHIFTJIS SHIFT_JIS, SJIS Shift_JIS MS_Kanji, csShiftJIS pck Some implementation may have a problem in implementing this as a locale.
VISCII viscii * * * Some implementation may have a problem in implementing this as a locale.
CP-437 IBM437, CP437, 437 * IBM437, cp437, 437, csPC8CodePage437 437, csPC8CodePage437, ibm-437
CP-850 IBM850, CP850, 850 * IBM850, cp850, 850, csPC850Multilingual 850, csPC850Multilingual, ibm-850
CP-851 IBM851, CP851, 851 * IBM851, cp851, 851, csIBM851 *
CP-852 IBM852, CP852, 852 * IBM852, cp852, 852, csPCp852 852, csPCp852, ibm-852
CP-855 IBM855, CP855, 855 * IBM855, cp855, 855, csIBM855 cspcp855, ibm-855
CP-857 IBM857, CP857, 857 * IBM857, cp857, 857, csIBM857 857, csIBM857, ibm-857
CP-860 IBM860, CP860, 860 * IBM860, cp860, 860, csIBM860 860, csIBM860, ibm-860
CP-861 IBM861, CP861, 861, CP-IS * IBM861, cp861, 861, cp-is, csIBM861 861, cp-is, csIBM861, ibm-861
CP-862 IBM862, CP862, 862 * IBM862, cp862, 862, csPC862LatinHebrew 862, csPC862LatinHebrew, ibm-862
CP-863 IBM863, CP863, 863 * IBM863, cp863, 863, csIBM863 863, csIBM863, ibm-863
CP-864 IBM864, CP864 * IBM864, cp864, csIBM864 csIBM864, ibm-864
CP-865 IBM865, CP865, 865 * IBM865, cp865, 865, csIBM865 865, csIBM865, ibm-865
CP-866 IBM866, CP866, 866 * IBM866, cp866, 866, csIBM866 866, csIBM866, ibm-866
CP-868 IBM868, CP868, CP-AR * IBM868, CP868, cp-ar, csIBM868 ibm-868
CP-869 IBM869, CP869, 869, CP-GR * IBM869, cp869, 869, cp-gr, csIBM869 *
CP-891 IBM891, CP891 * IBM891, cp891, csIBM891 *
CP-903 IBM903, CP903 * IBM903, cp903, csIBM903 *
CP-904 IBM904, CP904, 904 * IBM904, cp904, 904, csIBM904 *
CP-1251 CP1251, MS-CYRL * windows-1251 Cp1251
CP-1255 CP1255, MS-HEBR * windows-1255 Cp1255
TIS-620 TIS-620, TIS620, TIS620-0, TIS620.2529-1, TIS620.2533-0, ISO-IR-166 * TIS-620 TIS620.2533
GEORGIAN-PS GEORGIAN-PS * * *