(U-00020203): Delete incomplete `ideographic-structure'.
[chise/xemacs-chise.git-] / lisp / utf-2000 / README.en
1 CHISE character database format (S-expression version)
2
3 * Relations between Characters
4
5 Property-name `->foo' or `<-foo' mean relations between characters.
6
7 If there are a relationship `foo', property-name `->foo' means a value
8 of the property is a `foo' of the defined character.  Likewise,
9 property-name `<-foo' means the defined character is a `foo' of each
10 value of the property.
11
12 Example 1: If a character ?A has property `(->lowercase . (?a))', the
13 character ?a specified in the property is a lowercase of the defined
14 character ?A.
15
16 Example 2: If a character ?a has property `(<-lowercase . (?A))', the
17 defined character ?a is a lowercase of the character ?A specified in
18 the property.
19
20
21 ** Character Reference
22
23 Type of property `->foo' or `<-foo' are list of ``char-ref''.  The
24 type char-ref is defined by following definition:
25
26    char-ref = character | char-spec | char-property-list
27
28 The type ``char-property-list'' is a property-list.  In the
29 property-list, some keys are defined in the specification.  Following
30 list describes meaning of keys:
31
32   :char
33         Required.  It indicates a character.
34         [type] character | char-spec
35
36   :sources
37         Optional.  It indicates a source of the relation.
38         [type] list of symbols
39
40         Meaning of some symbols are defined.  Following list describes
41         meaning of symbols:
42
43                 zhonghua-dazidian (chuuka-daijiten)     \e$BCf\e$(Ga^DKGsL(\e(B
44                 tongwen-tongkao (doubun-tsuukou)        \e$BF1J8DL9M\e(B
45                 yupian (gyokuhen)                       \e$B6LJS\e(B
46                 pianhai (henkai)                        \e$BJS3$\e(B
47                 pianhai-leibian (henkai-ruihen)         \e$BJS3$\e$(CW>x:\e(B
48                 yunhai (inkai)                          \e$B1$2q\e(B
49                 yunhai-bei (inkaiho)                    \e$B1$2qJd\e(B
50                 zihui (jii)                             \e$B;zWC\e(B
51                 zihui-bei (jiiho)                       \e$B;zWCJd\e(B
52                 cihai (jikai)                           \e$B<-3$\e(B
53                 jiyun (shuuin)                          \e$B=81$\e(B
54                 haipian (kaihen)                        \e$B3$JS\e(B
55                 kangxi                                  \e$B9/_f;zE5\e(B
56                 guangyun (kouin)                        \e$B9-1$\e(B
57                 daikanwa (morohashi-daikanwa)           \e$BBg4AOB<-E5\e(B
58                 ruijumyougishou (ruishuu-meigishou)     \e$BN`f\L>5A>6\e(B
59                 zhengyun (seiin)                        \e$B@51$\e(B
60                 zhengzi-tong (seiji-tsuu)               \e$B@5;z\e$(CwW\e(B
61                 shuowen-tongxun-dingsheng
62                         (setsumon-tsuukun-teisei)       \e$B@bJ8DL71Djfa\e(B
63                 shuowen                                 \e$B@bJ8\e(B
64                 songyuan-yilai-suzipu
65                         (sougen-irai-zokujifu)          \e$BAW850JMhB/;zIh\e(B
66                 yuquan                                  \e$B6L@t\e(B
67                 chuanpian (senhen)                      \e$B@nJS\e(B
68                 kanjo                                   \e$B4A=q\e(B
69                 ronkou                                  \e$BO@9U\e(B
70
71 ** Properties
72
73 ->lowercase     indicates lowercase
74 ->uppercase     indicates uppercase
75 ->titlecase     indicates titlecase (uppercase + lowercase)
76
77 ->fullwidth     indicates ``fullwidth'' characters
78 <-fullwidth     a ``fullwidth'' character of
79
80 ->same-ideograph        indicates same characters (\e$BF1;z\e(B)
81 ->original-ideograph    indicates ``original'' characters (\e$BK\;z\e(B)
82 <-original-ideograph    an ``original'' character of
83 ->ancient-ideograph     indicates ``ancient'' characters (\e$B8E;z\e(B)
84 <-ancient-ideograph     an ``ancient'' character of
85 ->vulgar-ideograph      indicates ``vulgar'' characters (\e$BB/;z\e(B)
86 <-vulgar-ideograph      a ``vulgar'' character of
87 ->wrong-ideograph       indicates ``wrong'' characters (\e$Bkw;z\e(B)
88 <-wrong-ideograph       a ``wrong'' character of
89 ->simplified-ideograph  indicates simplified characters
90 <-simplified-ideograph  a simplified character of
91 ->ideographic-variants  indicates variants (\e$B0[BN;z\e(B)
92 ->synonyms              indicates synonyms
93
94 ->radical               indicates corresponding radical forms
95 <-radical               a radical form of
96
97 ->bopomofo              indicates bopomofo characters
98 <-bopomofo              a bopomofo character of
99
100 ->ideographic           indicates corresponding characters used with
101                         Ideographic script
102 <-ideographic           an Ideographic symbol of