知世 (CHISE) project: beyond UTF-2000
守岡 知彦 / MORIOKA Tomohiko
Document Information Center
for Chinese Studies, Kyōto University
知 (Knowledge, Information)
Not only across the world,
but also across time (ancient → future)
・CHISE (CHaracter Information
character information server
・TOMOYO (Text Object Manipulator
and Outfit for YOurself)
History (1): Before UTF-2000
represented by coded character sets
History (2): UTF-2000 (1)
represented by character objects
・All character-related information
is stored in the character database
- the system gets a character's properties
- users can add characters by defining them
→ users can apply their own unification rules
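The idea above, characters as objects whose properties live in a database and whose unification is decided by the user, can be sketched roughly as follows. This is a minimal illustration in Python, not the UTF-2000 implementation: the class names, the feature names (`ucs`, `radical`, `strokes`, `variant_of`) and the sample values are all assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class Character:
    """A character described by its features rather than by one code point."""
    name: str
    features: dict = field(default_factory=dict)


class CharacterDatabase:
    """In-memory stand-in for the character database (hypothetical)."""

    def __init__(self):
        self._chars = {}

    def define(self, name, **features):
        """Add a character by definition, i.e. a named set of features."""
        char = Character(name, dict(features))
        self._chars[name] = char
        return char

    def get_property(self, name, key, default=None):
        """Look up one property of a character."""
        return self._chars[name].features.get(key, default)

    def unify(self, rule):
        """Group character names by a user-supplied unification rule.

        `rule` maps a Character to a hashable key; characters that map
        to the same key are treated as unified.
        """
        groups = {}
        for char in self._chars.values():
            groups.setdefault(rule(char), []).append(char.name)
        return groups


db = CharacterDatabase()
# Illustrative values only; the feature names are invented for this sketch.
db.define("han-a", ucs=0x8AAA, radical=149, strokes=14)
db.define("han-b", ucs=0x8AAC, radical=149, strokes=14)
db.define("my-variant", radical=149, strokes=14, variant_of="han-a")

# One possible user rule: unify characters sharing radical and stroke count.
groups = db.unify(lambda c: (c.features.get("radical"), c.features.get("strokes")))
```

Note that `my-variant` has no code point at all: it exists only as a definition, which is the point of moving from coded character sets to character objects.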
・sample implementation of UTF-2000
Problems of XEmacs UTF-2000
・Requires too much memory
→ external database + lazy loading
・There are no UTF-2000-based
external representations
+ application/char-info for MIME
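The "external database + lazy loading" remedy for the memory problem might look roughly like the sketch below. It assumes Python's standard `dbm` module as the external store and JSON as the on-disk encoding; both are choices made for this example, and the class name, keys and values are hypothetical.

```python
import dbm
import json
import os
import tempfile


class LazyCharacterTable:
    """Load a character's properties from disk only on first access."""

    def __init__(self, path):
        self._path = path
        self._cache = {}      # properties already loaded into memory
        self.disk_reads = 0   # counter kept only for demonstration

    def properties(self, name):
        if name not in self._cache:
            # Not in memory yet: read this one entry from the external database.
            with dbm.open(self._path, "r") as db:
                raw = db[name.encode("utf-8")]
            self._cache[name] = json.loads(raw.decode("utf-8"))
            self.disk_reads += 1
        return self._cache[name]


# Build a tiny example database in a temporary directory.
path = os.path.join(tempfile.mkdtemp(), "chars")
with dbm.open(path, "c") as db:
    db["char-1".encode("utf-8")] = json.dumps({"strokes": 3}).encode("utf-8")
    db["char-2".encode("utf-8")] = json.dumps({"strokes": 8}).encode("utf-8")

table = LazyCharacterTable(path)
first = table.properties("char-1")   # read from disk
again = table.properties("char-1")   # served from the in-memory cache
```

Only characters that are actually used occupy memory; the rest stay on disk until requested.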
(1) private character database
based on a simple dbm-like database
(2) local character database server
(based on PostgreSQL?)
(3) distributed server system
- Check for conflicts and report them
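The conflict check in step (3) could be approached along the following lines. This sketch assumes each database is a plain mapping from character name to a property dictionary, and that a conflict means the same property having different values in two databases; the slides do not define either, so both are assumptions, as are the function names and sample data.

```python
def find_conflicts(local, remote):
    """Return (character, property, local_value, remote_value) tuples
    for every property that the two databases define differently."""
    conflicts = []
    for name in sorted(local.keys() & remote.keys()):
        shared_keys = local[name].keys() & remote[name].keys()
        for key in sorted(shared_keys):
            if local[name][key] != remote[name][key]:
                conflicts.append((name, key, local[name][key], remote[name][key]))
    return conflicts


def report(conflicts):
    """Format conflicts as human-readable lines."""
    return [f"{name}: {key} is {a!r} locally but {b!r} remotely"
            for name, key, a, b in conflicts]


# Hypothetical data: two servers disagree about one property of one character.
local_db = {"char-1": {"strokes": 3, "radical": 1}, "char-2": {"strokes": 8}}
remote_db = {"char-1": {"strokes": 4, "radical": 1}, "char-3": {"strokes": 5}}

conflicts = find_conflicts(local_db, remote_db)
lines = report(conflicts)
```

Characters or properties present on only one side are not treated as conflicts here; they would simply be candidates for merging.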
(0) Complete UTF-2000
(a) complete XEmacs UTF-2000
to xemacs-patches :-)
(b) implement UTF-2000 for GNU Emacs 21
(1) Multiple representations in one system
(2) Character definition editor
(3) Network representation
・Develop high-quality character data
that does not depend on any character code
・Integrate glyph, shape and
typesetting information
into the character database system
・Searchable image-based document database
(especially for classical