1 |
* ASCII and Latin1 (ISO-8859-1) |
2 |
the ASCII table including C0 conrols (first 32 characters) |
3 |
and the Latin1 table created by Charset -tab -enc ISO-8859-1 |
4 |
(i.e. the identity table, looks like Latin1, |
5 |
if this is the encoding used to see this document). |
6 |
$ |
7 |
... 0 1 2 3 4 5 6 7 8 9 A B C D E F |
8 |
0 NUL STX SOT ETX EOT ENQ ACK BEL BS HT LF VT FF CR SO SI |
9 |
16 DLE DC1 DC2 DC3 DC4 NAK SYN ETB CAN EM SUB ESC FS GS RS US |
10 |
... 0 1 2 3 4 5 6 7 8 9 A B C D E F |
11 |
32 ! " # $ % & ' ( ) * + , - . / |
12 |
48 0 1 2 3 4 5 6 7 8 9 : ; < = > ? |
13 |
64 @ A B C D E F G H I J K L M N O |
14 |
80 P Q R S T U V W X Y Z [ \ ] ^ _ |
15 |
96 ` a b c d e f g h i j k l m n o |
16 |
112 p q r s t u v w x y z { | } ~ DEL |
17 |
128 (C1 controls ... |
18 |
140 ... ommited) |
19 |
160 ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ ® ¯ |
20 |
176 ° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿ |
21 |
192 À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï |
22 |
208 Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß |
23 |
224 à á â ã ä å æ ç è é ê ë ì í î ï |
24 |
240 ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ |
25 |
$ |
26 |
Note: there is a similar charset, Cp1252 (Windows Latin-1), |
27 |
which differs only in the range 128-159, where several |
28 |
special characters like the EURO sign and typograhic quotes |
29 |
(like `` ´´ ,,) are added. |
30 |
Note: character 173 is the soft-hyphen (shy) used in text-processing |
31 |
to indicate an optional hyphenation. Depending on how you read this text, |
32 |
it might not be displayed. |
33 |
|
34 |
* Cp437 and Cp850 (DOS LatinUS a.k.a. "OEM Charset" and Latin1) |
35 |
representation of Cp850 in Latin1 created by Charset -tab -enc Cp850. |
36 |
The blanks represent line-drawing and other characters which |
37 |
cannot be represented in Latin1 (java actually prints question marks). |
38 |
Position 240 has the shy, |
39 |
the last char (255) is the nonbreaking space (nbsp, Latin1 160). |
40 |
$ |
41 |
... 0 1 2 3 4 5 6 7 8 9 A B C D E F |
42 |
128 Ç ü é â ä à å ç ê ë è ï î ì Ä Å |
43 |
144 É æ Æ ô ö ò û ù ÿ Ö Ü ø £ Ø × |
44 |
160 á í ó ú ñ Ñ ª º ¿ ® ¬ ½ ¼ ¡ « » |
45 |
176 Á Â À © ¢ ¥ |
46 |
192 ã Ã ¤ |
47 |
208 ð Ð Ê Ë È Í Î Ï ¦ Ì |
48 |
224 Ó ß Ô Ò õ Õ µ þ Þ Ú Û Ù ý Ý ¯ ´ |
49 |
240 ± ¾ ¶ § ÷ ¸ ° ¨ · ¹ ³ ² |
50 |
$ |
51 |
|
52 |
According to the default ISISAC.TAB, ranges 128-154 and 160-165 are alpha. |
53 |
This fits Cp437, which is the same for these ranges, |
54 |
but contains even more linedrawing in the higher positions. |
55 |
Most confusing, it has some greek letters for technical use in row "224", |
56 |
with the beta just at position 225, where it was replaced by the similar |
57 |
looking german sz ligature. |
58 |
Less surprising, the default ISISUC.TAB is also made for Cp437. |
59 |
|
60 |
|
61 |
the unicode mapping of Cp850 as created by Charset -ctab |
62 |
$ |
63 |
199, 252, 233, 226, 228, 224, 229, 231, 234, 235, 232, 239, 238, 236, 196, 197, |
64 |
201, 230, 198, 244, 246, 242, 251, 249, 255, 214, 220, 248, 163, 216, 215, 402, |
65 |
225, 237, 243, 250, 241, 209, 170, 186, 191, 174, 172, 189, 188, 161, 171, 187, |
66 |
9617,9618,9619,9474,9508, 193, 194, 192, 169,9571,9553,9559,9565, 162, 165,9488, |
67 |
9492,9524,9516,9500,9472,9532, 227, 195,9562,9556,9577,9574,9568,9552,9580, 164, |
68 |
240, 208, 202, 203, 200, 305, 205, 206, 207,9496,9484,9608,9604, 166, 204,9600, |
69 |
211, 223, 212, 210, 245, 213, 181, 254, 222, 218, 219, 217, 253, 221, 175, 180, |
70 |
173, 177,8215, 190, 182, 167, 247, 184, 176, 168, 183, 185, 179, 178,9632, 160, |
71 |
$ |
72 |
|
73 |
See also a |
74 |
> encoding list of encodings |
75 |
supported by Java and some |
76 |
> /doc/charsets.html notes on the use of charsets with ISIS |
77 |
. |
78 |
Roman Czyborra compiled an |
79 |
> http://czyborra.com/charsets/codepages.html illustrated overview |
80 |
over the most commonly used codepages. |
81 |
--- |
82 |
$Id: CsTables.txt,v 1.3 2002/12/02 18:20:56 kripke Exp $ |