/[webpac]/openisis/current/doc/encoding.txt
This is repository of my old source code which isn't updated any more. Go to git.rot13.org for current projects!
ViewVC logotype

Annotation of /openisis/current/doc/encoding.txt

Parent Directory Parent Directory | Revision Log Revision Log


Revision 237 - (hide annotations)
Mon Mar 8 17:43:12 2004 UTC (20 years, 1 month ago) by dpavlin
File MIME type: text/plain
File size: 5282 byte(s)
initial import of openisis 0.9.0 vendor drop

1 dpavlin 237 95 single-byte and 45 multi-byte encodings as supported by Java rt+i18n.jar
2     $
3     ASCII American Standard Code for Information Interchange
4     Cp037 USA, Canada (Bilingual, French), Netherlands, Portugal, Brazil, Australia
5     Cp273 IBM Austria, Germany
6     Cp277 IBM Denmark, Norway
7     Cp278 IBM Finland, Sweden
8     Cp280 IBM Italy
9     Cp284 IBM Catalan/Spain, Spanish Latin America
10     Cp285 IBM United Kingdom, Ireland
11     Cp297 IBM France
12     Cp420 IBM Arabic
13     Cp424 IBM Hebrew
14     Cp437 MS-DOS United States, Australia, New Zealand, South Africa
15     Cp500 EBCDIC 500V1
16     Cp737 PC Greek
17     Cp775 PC Baltic
18     Cp838 IBM Thailand extended SBCS
19     Cp850 MS-DOS Latin-1
20     Cp852 MS-DOS Latin-2
21     Cp855 IBM Cyrillic
22     Cp856 IBM Hebrew
23     Cp857 IBM Turkish
24     Cp858 Variant of Cp850 with Euro character
25     Cp860 MS-DOS Portuguese
26     Cp861 MS-DOS Icelandic
27     Cp862 PC Hebrew
28     Cp863 MS-DOS Canadian French
29     Cp864 PC Arabic
30     Cp865 MS-DOS Nordic
31     Cp866 MS-DOS Russian
32     Cp868 MS-DOS Pakistan
33     Cp869 IBM Modern Greek
34     Cp870 IBM Multilingual Latin-2
35     Cp871 IBM Iceland
36     Cp874 IBM Thai
37     Cp875 IBM Greek
38     Cp918 IBM Pakistan (Urdu)
39     Cp921 IBM Latvia, Lithuania (AIX, DOS)
40     Cp922 IBM Estonia (AIX, DOS)
41     Cp1006 IBM AIX Pakistan (Urdu)
42     Cp1025 IBM Multilingual Cyrillic: Bulgaria, Bosnia, Herzegovinia, Macedonia (FYR)
43     Cp1026 IBM Latin-5, Turkey
44     Cp1046 IBM Arabic - Windows
45     Cp1097 IBM Iran (Farsi)/Persian
46     Cp1098 IBM Iran (Farsi)/Persian (PC)
47     Cp1112 IBM Latvia, Lithuania
48     Cp1122 IBM Estonia
49     Cp1123 IBM Ukraine
50     Cp1124 IBM AIX Ukraine
51     Cp1140 Variant of Cp037 with Euro character
52     Cp1141 Variant of Cp273 with Euro character
53     Cp1142 Variant of Cp277 with Euro character
54     Cp1143 Variant of Cp278 with Euro character
55     Cp1144 Variant of Cp280 with Euro character
56     Cp1145 Variant of Cp284 with Euro character
57     Cp1146 Variant of Cp285 with Euro character
58     Cp1147 Variant of Cp297 with Euro character
59     Cp1148 Variant of Cp500 with Euro character
60     Cp1149 Variant of Cp871 with Euro character
61     Cp1250 Windows Eastern European
62     Cp1251 Windows Cyrillic
63     Cp1252 Windows Latin-1
64     Cp1253 Windows Greek
65     Cp1254 Windows Turkish
66     Cp1255 Windows Hebrew
67     Cp1256 Windows Arabic
68     Cp1257 Windows Baltic
69     Cp1258 Windows Vietnamese
70     ISO8859_1 ISO 8859-1, Latin alphabet No. 1
71     ISO8859_2 ISO 8859-2, Latin alphabet No. 2
72     ISO8859_3 ISO 8859-3, Latin alphabet No. 3
73     ISO8859_4 ISO 8859-4, Latin alphabet No. 4
74     ISO8859_5 ISO 8859-5, Latin/Cyrillic alphabet
75     ISO8859_6 ISO 8859-6, Latin/Arabic alphabet
76     ISO8859_7 ISO 8859-7, Latin/Greek alphabet
77     ISO8859_8 ISO 8859-8, Latin/Hebrew alphabet
78     ISO8859_9 ISO 8859-9, Latin alphabet No. 5
79     ISO8859_13 ISO 8859-13, Latin alphabet No. 7
80     ISO8859_15_FDIS ISO 8859-15, Latin alphabet No. 9
81     KOI8_R KOI8-R, Russian
82     MacArabic Macintosh Arabic
83     MacCentralEurope Macintosh Latin-2
84     MacCroatian Macintosh Croatian
85     MacCyrillic Macintosh Cyrillic
86     MacDingbat Macintosh Dingbat
87     MacGreek Macintosh Greek
88     MacHebrew Macintosh Hebrew
89     MacIceland Macintosh Iceland
90     MacRoman Macintosh Roman
91     MacRomania Macintosh Romania
92     MacSymbol Macintosh Symbol
93     MacThai Macintosh Thai
94     MacTurkish Macintosh Turkish
95     MacUkraine Macintosh Ukraine
96     MS874 Windows Thai
97     TIS620 TIS620, Thai
98    
99     Big5 Big5, Traditional Chinese
100     Cp930 Japanese Katakana-Kanji mixed with 4370 UDC, superset of 5026
101     Cp933 Korean Mixed with 1880 UDC, superset of 5029
102     Cp935 Simplified Chinese Host mixed with 1880 UDC, superset of 5031
103     Cp937 Traditional Chinese Host miexed with 6204 UDC, superset of 5033
104     Cp939 Japanese Latin Kanji mixed with 4370 UDC, superset of 5035
105     Cp942 IBM OS/2 Japanese, superset of Cp932
106     Cp942C Variant of Cp942
107     Cp943 IBM OS/2 Japanese, superset of Cp932 and Shift-JIS
108     Cp943C Variant of Cp943
109     Cp948 OS/2 Chinese (Taiwan) superset of 938
110     Cp949 PC Korean
111     Cp949C Variant of Cp949
112     Cp950 PC Chinese (Hong Kong, Taiwan)
113     Cp964 AIX Chinese (Taiwan)
114     Cp970 AIX Korean
115     Cp1381 IBM OS/2, DOS People's Republic of China (PRC)
116     Cp1383 IBM AIX People's Republic of China (PRC)
117     Cp33722 IBM-eucJP - Japanese (superset of 5050)
118     EUC_CN GB2312, EUC encoding, Simplified Chinese
119     EUC_JP JIS X 0201, 0208, 0212, EUC encoding, Japanese
120     EUC_KR KS C 5601, EUC encoding, Korean
121     EUC_TW CNS11643 (Plane 1-3), EUC encoding, Traditional Chinese
122     GBK GBK, Simplified Chinese
123     ISO2022CN ISO 2022 CN, Chinese (conversion to Unicode only)
124     ISO2022CN_CNS CNS 11643 in ISO 2022 CN form, Traditional Chinese (conversion from Unicode only)
125     ISO2022CN_GB GB 2312 in ISO 2022 CN form, Simplified Chinese (conversion from Unicode only)
126     ISO2022JP JIS X 0201, 0208 in ISO 2022 form, Japanese
127     ISO2022KR ISO 2022 KR, Korean
128     JIS0201 JIS X 0201, Japanese
129     JIS0208 JIS X 0208, Japanese
130     JIS0212 JIS X 0212, Japanese
131     JISAutoDetect Detects and converts from Shift-JIS, EUC-JP, ISO 2022 JP (conversion to Unicode only)
132     Johab Johab, Korean
133     MS932 Windows Japanese
134     MS936 Windows Simplified Chinese
135     MS949 Windows Korean
136     MS950 Windows Traditional Chinese
137     SJIS Shift-JIS, Japanese
138     UnicodeBig Sixteen-bit Unicode Transformation Format, big-endian byte order, with byte-order mark
139     UnicodeBigUnmarked Sixteen-bit Unicode Transformation Format, big-endian byte order
140     UnicodeLittle Sixteen-bit Unicode Transformation Format, little-endian byte order, with byte-order mark
141     UnicodeLittleUnmarked Sixteen-bit Unicode Transformation Format, little-endian byte order
142     UTF8 Eight-bit Unicode Transformation Format
143     UTF-16 Sixteen-bit Unicode Transformation Format, byte order specified by a mandatory initial byte-order mark
144     $

  ViewVC Help
Powered by ViewVC 1.1.26