/[webpac]/openisis/current/doc/encoding.txt
This is repository of my old source code which isn't updated any more. Go to git.rot13.org for current projects!
ViewVC logotype

Contents of /openisis/current/doc/encoding.txt

Parent Directory Parent Directory | Revision Log Revision Log


Revision 237 - (show annotations)
Mon Mar 8 17:43:12 2004 UTC (20 years ago) by dpavlin
File MIME type: text/plain
File size: 5282 byte(s)
initial import of openisis 0.9.0 vendor drop

1 95 single-byte and 45 multi-byte encodings as supported by Java rt+i18n.jar
2 $
3 ASCII American Standard Code for Information Interchange
4 Cp037 USA, Canada (Bilingual, French), Netherlands, Portugal, Brazil, Australia
5 Cp273 IBM Austria, Germany
6 Cp277 IBM Denmark, Norway
7 Cp278 IBM Finland, Sweden
8 Cp280 IBM Italy
9 Cp284 IBM Catalan/Spain, Spanish Latin America
10 Cp285 IBM United Kingdom, Ireland
11 Cp297 IBM France
12 Cp420 IBM Arabic
13 Cp424 IBM Hebrew
14 Cp437 MS-DOS United States, Australia, New Zealand, South Africa
15 Cp500 EBCDIC 500V1
16 Cp737 PC Greek
17 Cp775 PC Baltic
18 Cp838 IBM Thailand extended SBCS
19 Cp850 MS-DOS Latin-1
20 Cp852 MS-DOS Latin-2
21 Cp855 IBM Cyrillic
22 Cp856 IBM Hebrew
23 Cp857 IBM Turkish
24 Cp858 Variant of Cp850 with Euro character
25 Cp860 MS-DOS Portuguese
26 Cp861 MS-DOS Icelandic
27 Cp862 PC Hebrew
28 Cp863 MS-DOS Canadian French
29 Cp864 PC Arabic
30 Cp865 MS-DOS Nordic
31 Cp866 MS-DOS Russian
32 Cp868 MS-DOS Pakistan
33 Cp869 IBM Modern Greek
34 Cp870 IBM Multilingual Latin-2
35 Cp871 IBM Iceland
36 Cp874 IBM Thai
37 Cp875 IBM Greek
38 Cp918 IBM Pakistan (Urdu)
39 Cp921 IBM Latvia, Lithuania (AIX, DOS)
40 Cp922 IBM Estonia (AIX, DOS)
41 Cp1006 IBM AIX Pakistan (Urdu)
42 Cp1025 IBM Multilingual Cyrillic: Bulgaria, Bosnia, Herzegovinia, Macedonia (FYR)
43 Cp1026 IBM Latin-5, Turkey
44 Cp1046 IBM Arabic - Windows
45 Cp1097 IBM Iran (Farsi)/Persian
46 Cp1098 IBM Iran (Farsi)/Persian (PC)
47 Cp1112 IBM Latvia, Lithuania
48 Cp1122 IBM Estonia
49 Cp1123 IBM Ukraine
50 Cp1124 IBM AIX Ukraine
51 Cp1140 Variant of Cp037 with Euro character
52 Cp1141 Variant of Cp273 with Euro character
53 Cp1142 Variant of Cp277 with Euro character
54 Cp1143 Variant of Cp278 with Euro character
55 Cp1144 Variant of Cp280 with Euro character
56 Cp1145 Variant of Cp284 with Euro character
57 Cp1146 Variant of Cp285 with Euro character
58 Cp1147 Variant of Cp297 with Euro character
59 Cp1148 Variant of Cp500 with Euro character
60 Cp1149 Variant of Cp871 with Euro character
61 Cp1250 Windows Eastern European
62 Cp1251 Windows Cyrillic
63 Cp1252 Windows Latin-1
64 Cp1253 Windows Greek
65 Cp1254 Windows Turkish
66 Cp1255 Windows Hebrew
67 Cp1256 Windows Arabic
68 Cp1257 Windows Baltic
69 Cp1258 Windows Vietnamese
70 ISO8859_1 ISO 8859-1, Latin alphabet No. 1
71 ISO8859_2 ISO 8859-2, Latin alphabet No. 2
72 ISO8859_3 ISO 8859-3, Latin alphabet No. 3
73 ISO8859_4 ISO 8859-4, Latin alphabet No. 4
74 ISO8859_5 ISO 8859-5, Latin/Cyrillic alphabet
75 ISO8859_6 ISO 8859-6, Latin/Arabic alphabet
76 ISO8859_7 ISO 8859-7, Latin/Greek alphabet
77 ISO8859_8 ISO 8859-8, Latin/Hebrew alphabet
78 ISO8859_9 ISO 8859-9, Latin alphabet No. 5
79 ISO8859_13 ISO 8859-13, Latin alphabet No. 7
80 ISO8859_15_FDIS ISO 8859-15, Latin alphabet No. 9
81 KOI8_R KOI8-R, Russian
82 MacArabic Macintosh Arabic
83 MacCentralEurope Macintosh Latin-2
84 MacCroatian Macintosh Croatian
85 MacCyrillic Macintosh Cyrillic
86 MacDingbat Macintosh Dingbat
87 MacGreek Macintosh Greek
88 MacHebrew Macintosh Hebrew
89 MacIceland Macintosh Iceland
90 MacRoman Macintosh Roman
91 MacRomania Macintosh Romania
92 MacSymbol Macintosh Symbol
93 MacThai Macintosh Thai
94 MacTurkish Macintosh Turkish
95 MacUkraine Macintosh Ukraine
96 MS874 Windows Thai
97 TIS620 TIS620, Thai
98
99 Big5 Big5, Traditional Chinese
100 Cp930 Japanese Katakana-Kanji mixed with 4370 UDC, superset of 5026
101 Cp933 Korean Mixed with 1880 UDC, superset of 5029
102 Cp935 Simplified Chinese Host mixed with 1880 UDC, superset of 5031
103 Cp937 Traditional Chinese Host miexed with 6204 UDC, superset of 5033
104 Cp939 Japanese Latin Kanji mixed with 4370 UDC, superset of 5035
105 Cp942 IBM OS/2 Japanese, superset of Cp932
106 Cp942C Variant of Cp942
107 Cp943 IBM OS/2 Japanese, superset of Cp932 and Shift-JIS
108 Cp943C Variant of Cp943
109 Cp948 OS/2 Chinese (Taiwan) superset of 938
110 Cp949 PC Korean
111 Cp949C Variant of Cp949
112 Cp950 PC Chinese (Hong Kong, Taiwan)
113 Cp964 AIX Chinese (Taiwan)
114 Cp970 AIX Korean
115 Cp1381 IBM OS/2, DOS People's Republic of China (PRC)
116 Cp1383 IBM AIX People's Republic of China (PRC)
117 Cp33722 IBM-eucJP - Japanese (superset of 5050)
118 EUC_CN GB2312, EUC encoding, Simplified Chinese
119 EUC_JP JIS X 0201, 0208, 0212, EUC encoding, Japanese
120 EUC_KR KS C 5601, EUC encoding, Korean
121 EUC_TW CNS11643 (Plane 1-3), EUC encoding, Traditional Chinese
122 GBK GBK, Simplified Chinese
123 ISO2022CN ISO 2022 CN, Chinese (conversion to Unicode only)
124 ISO2022CN_CNS CNS 11643 in ISO 2022 CN form, Traditional Chinese (conversion from Unicode only)
125 ISO2022CN_GB GB 2312 in ISO 2022 CN form, Simplified Chinese (conversion from Unicode only)
126 ISO2022JP JIS X 0201, 0208 in ISO 2022 form, Japanese
127 ISO2022KR ISO 2022 KR, Korean
128 JIS0201 JIS X 0201, Japanese
129 JIS0208 JIS X 0208, Japanese
130 JIS0212 JIS X 0212, Japanese
131 JISAutoDetect Detects and converts from Shift-JIS, EUC-JP, ISO 2022 JP (conversion to Unicode only)
132 Johab Johab, Korean
133 MS932 Windows Japanese
134 MS936 Windows Simplified Chinese
135 MS949 Windows Korean
136 MS950 Windows Traditional Chinese
137 SJIS Shift-JIS, Japanese
138 UnicodeBig Sixteen-bit Unicode Transformation Format, big-endian byte order, with byte-order mark
139 UnicodeBigUnmarked Sixteen-bit Unicode Transformation Format, big-endian byte order
140 UnicodeLittle Sixteen-bit Unicode Transformation Format, little-endian byte order, with byte-order mark
141 UnicodeLittleUnmarked Sixteen-bit Unicode Transformation Format, little-endian byte order
142 UTF8 Eight-bit Unicode Transformation Format
143 UTF-16 Sixteen-bit Unicode Transformation Format, byte order specified by a mandatory initial byte-order mark
144 $

  ViewVC Help
Powered by ViewVC 1.1.26