(U-00026AE3): Compact IDS.
[chise/ids.git] / README.en
1 ;; -*- coding: utf-8 -*-
2 This package contains data and utilities about structures of
3 Han Ideographs (漢字).
4
5
6 The following files contains the data about structures of Han
7 Ideographs (漢字).
8     
9     IDS-UCS-Basic.txt   CJK Unified Ideographs (U+4E00 〜 U+9FA5)
10                         of ISO/IEC 10646-1:2000
11     IDS-UCS-Ext-A.txt   CJK Unified Ideographs Extension A
12                         (U+3400 〜 U+4DB5, U+FA1F and U+FA23)
13                         of ISO/IEC 10646-1:2000
14     IDS-UCS-Compat.txt  CJK Compatibility Ideographs
15                         (U+F900 〜 U+FA2D, except U+FA1F and U+FA23)
16                         of ISO/IEC 10646-1:2000
17     IDS-UCS-Ext-B-1.txt CJK Unified Ideographs Extension B [part 1]
18                         (U-00020000 〜 U-00021FFF)
19                         of ISO/IEC 10646-2:2001
20     IDS-UCS-Ext-B-2.txt CJK Unified Ideographs Extension B [part 2]
21                         (U-00022000 〜 U-00023FFF)
22                         of ISO/IEC 10646-2:2001
23     IDS-UCS-Ext-B-3.txt CJK Unified Ideographs Extension B [part 3]
24                         (U-00024000 〜 U-00025FFF)
25                         of ISO/IEC 10646-2:2001
26     IDS-UCS-Ext-B-4.txt CJK Unified Ideographs Extension B [part 4]
27                         (U-00026000 〜 U-00027FFF)
28                         of ISO/IEC 10646-2:2001
29     IDS-UCS-Ext-B-5.txt CJK Unified Ideographs Extension B [part 5]
30                         (U-00028000 〜 U-00029FFF)
31                         of ISO/IEC 10646-2:2001
32     IDS-UCS-Ext-B-6.txt CJK Unified Ideographs Extension B [part 6]
33                         (U-0002A000 〜 U-0002A6D6)
34                         of ISO/IEC 10646-2:2001
35     IDS-UCS-Compat-Supplement.txt
36                         CJK Compatibility Ideographs Supplement
37                         (U-0002F800 〜 U-0002FA1D)
38                         of ISO/IEC 10646-2:2001
39
40 These files are encoded by UTF-8.  The format of each line of the
41 files is:
42
43     <CODEPOINT><TAB><CHARACTER><TAB><IDS>
44
45 or
46
47     ;; <COMMENTS>
48
49 Each element means
50
51     <TAB>       <HORIZONTAL TABULATION> (U+0009)
52     <CODEPOINT> code point
53                 U+hhhh          Hex form of UCS code point
54                                 (U+0000 〜 U+FFFF)
55                 U-hhhhhhhh      Hex form of UCS code point
56                                 (U+00000000 〜 U+7FFFFFFF)
57     <CHARACTER> character corresponding with <CODEPOINT>
58     <IDS>       Ideographic Description Sequence
59                 (based on ISO/IEC 10646-1:2000 F.3.1; however
60                 Compatibility Ideographs and non-UCS Ideographs are
61                 also allowed)
62     <COMMENTS>  comment
63
64
65 The developing of the package was supported by the “Exploratory
66 Software Project” of Information-technology Promotion Agency, Japan.
67 Some data in the IDS-UCS* files are derived and expanded from the CDP
68 database developped by C.C. Hsieh and his team at Academia Sinica in
69 Taipei, Taiwan.
70
71 This package is free software; you can redistribute it and/or modify
72 it under the terms of the GNU General Public License as published by
73 the Free Software Foundation; either version 2, or (at your option)
74 any later version.
75
76 This package is distributed in the hope that it will be useful, but
77 WITHOUT ANY WARRANTY; without even the implied warranty of
78 MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
79 General Public License for more details.
80
81 You should have received a copy of the GNU General Public License
82 along with this package; see the file COPYING.  If not, write to
83 the Free Software Foundation, Inc., 59 Temple Place - Suite 330,
84 Boston, MA 02111-1307, USA.