1 |
converting between character sets |
2 |
|
3 |
* overview |
4 |
|
5 |
Since Malete can represent both record and index data as plain text files, |
6 |
standard tools like |
7 |
> http://www.gnu.org/software/recode/recode.html recode |
8 |
or |
9 |
> http://www.gnu.org/software/libiconv/ iconv |
10 |
can be used for character set conversion. |
11 |
|
12 |
This can also be used to conver WinIsis or wwwisist files via |
13 |
> CDS cds import/export. |
14 |
|
15 |
|
16 |
* example |
17 |
|
18 |
To convert the standard WinIsis cds database to malete as is, |
19 |
without any character set conversion, use |
20 |
$ |
21 |
malete cdsimp -No850 cds |
22 |
$ |
23 |
|
24 |
Create a textual representation of the index by |
25 |
$ |
26 |
malete qdump cds 2>cds.mqt |
27 |
$ |
28 |
|
29 |
Convert the textfiles to UTF-8 use |
30 |
$ |
31 |
iconv -f CP850 -t UTF-8 <cds.mrd >cds.mrd-u |
32 |
iconv -f CP850 -t UTF-8 <cds.mqt >cds.mqt-u |
33 |
$ |
34 |
|
35 |
Replace the Malete database with the UTF-8 files |
36 |
$ |
37 |
rm cds.mq? cds.mr? |
38 |
mv cds.mrd-u cds.mrd |
39 |
malete qload cds <cds.mqt-u |
40 |
$ |
41 |
|
42 |
|
43 |
|
44 |
|
45 |
* notes |
46 |
|
47 |
Conversions that do not change the byte count like for example |
48 |
codepage 850 to Latin-1 can also be applied to IIF (ISO 2709) files. |
49 |
|
50 |
This will not work for conversions like codepage 850 to UTF-8 |
51 |
which change the length of data. |
52 |
|