1 <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
4 <title>CHaracter Information Service Environment</title>
5 <link rel=stylesheet href="chise.css" type="text/css">
15 <b><a href="index.html.ja.iso-2022-jp">[Japanese page]</a></b>
20 <a href="http://cvs.m17n.org/chise/"><img
21 alt="m17n.org" src="images/tomura-s.png" align="middle"></a>
22 <a href="http://www.kanji.zinbun.kyoto-u.ac.jp/projects/chise/"><img
23 alt="kanji.zinbun.kyoto-u.ac.jp" src="images/diccs-s.jpg" align="middle"></a>
24 <a href="http://mousai.as.wakwak.ne.jp/projects/chise/"><img
25 alt="mousai.as.wakwak.ne.jp" src="images/egret-pond-s.jpg"
32 <h1>CHISE project</h1>
36 <!--<b><a href="index.html.ja.iso-2022-jp"><img
37 src="images/japanese-page.png">
41 <h2>About the CHISE Project</h2>
43 The CHISE (CHaracter Information Service Environment) project attempts
44 to collect and organize into a Knowledge-Base information about
45 characters in the scripts of the world. A new processing environment
46 based on this architecture is currently under development.
51 <li>Koichi Kamichi has published
52 <a href="http://fonts.jp/chise_linkmap/">chise_linkmap
53 (a visualization system for CHISE character database)</a>,
54 <a href="http://fonts.jp/chise_swig_perl/">chise_swig_perl
55 (a libchise wrapper for perl 5)</a> and
56 <a href="http://fonts.jp/makettf/">makettf
57 (simple TTF binder)</a>, which were results of
58 <a href="News/20051013-15.html">CHISE Conference 2005
59 and CodeFest Kyoto 2005</a>.</li>
60 <li><a href="News/20051013-15.html">CHISE Conference
61 2005</a> has been held this October 13 (Thu), 14 (Fri)
62 at <a href="http://www.kcif.or.jp/en/">Kyoto International
63 Community House</a>.</li>
64 <li><a href="http://mousai.kanji.zinbun.kyoto-u.ac.jp/ids-find">
65 CHISE-IDS Hanzi/Hanja/Kanji Searcher
66 </a>has been published.</li>
67 <!-- <li>2004-06-09 (Wed)
68 Tomohiko Morioka will make a presentation on CHISE Project in
69 <a href="http://kura.hanazono.ac.jp/kanji/20040609symposium.html"
70 >Symposium: <i>Frontier of Character Information Processing:
71 Past, Presenta and Future</i></a>.</li>
73 A presentation on CHISE Project was made in
74 <a href="http://www.sigch.soken.ac.jp/2004.05/">the 62nd meeting of
75 the IPSJ SIG Computers and the Humanities</a>.</li>
76 <li>2003-11-28 (Fri), 29 (Sat)
77 <a href="http://coe21.zinbun.kyoto-u.ac.jp/ws-type-2003">Glyph
78 and Typesetting Workshop</a> was held at
79 <a href="http://www.kcif.or.jp/jp/footer/05.html"
80 >Kyoto City International Foundation</a>.
82 <!-- <li>2003-10-31 (Fri) -->
83 <!-- Presentations on the CHISE project were made in -->
84 <!-- <a href="http://lc.linux.or.jp/lc2003/index.html">Linux Conference -->
93 The CHISE project is the aggregate of the following sub-projects.
97 <li>Development of a character processing architecture based on a
98 character knowledge base
99 <!--文字知識データベースに基づく文字処理アーキテクチャの開発-->
101 <li><a href="xemacs/index.html">XEmacs CHISE</a>
102 <li><a href="ruby/index.html">Ruby/CHISE</a>
103 <li><a href="perl/index.html">Perl/CHISE</a>
106 <li><a href="topicmaps/index.html">A TopicMaps based approach to a
108 <!--TopicMapsによる文字知識データベース・システムの開発--></a></li>
109 <li><a href="char-data/">Database of features of characters
110 <!--文字に関するさまざまな知識のデータベース化--></a>
112 <li><a href="ids/index.html">Database of the component structure of
113 Chinese Characters<!--漢字構造情報データベース--></a></li>
114 <li><a href="glyph/index.html">Intgegration and Composition of
115 Character Glyphs and Styles<!--グリフ・字形情報の統合と合成--></a></li>
118 <li><a href="visualization/index.html">Mathematical analysis and visualation
119 of character knowledge<!--文字知識情報の数理的解析と可視化--></a></li>
120 <li><a href="omega/index.html">Omega/CHISE: Typesetting System in cooperation
121 with character knowledge database
122 <!--文字データベースと連携した組版システム--></a></li>
126 <h2>Development of a character processing architecture based on a
127 character knowledge base</h2>
128 <h3><a name="xemacs/">XEmacs UTF-2000</a></h3> <p>
129 It is now possible to load character
130 attributes from a external database on demand ("lazy loading"). On
131 Intel 32 bit processor architectures, the size of the executable file
132 thus shrinks from the 30 MB required with the traditional built to
133 just about 15 MB. This can now be downloaded from <a
134 href="http://www.kanji.zinbun.kyoto-u.ac.jp/projects/chise/dist/XEmacs/xemacs-utf-2000-0.19.tar.gz">
135 XEmacs UTF-2000 0.19 (Koriyama)</a>. In addtion, there is a UTF-2000
136 branch of the XEmacs tree at cvs.m17n.org in /cvs/root, this can be
137 accessed by anonymous CVS </p>
139 <h2>A <a name="topicmaps">
140 <a href="http://www.topicmaps.org">TopicMaps</a> based approach to a
144 In 2001 the prototype of a Topic Map engine has been developed based
145 on <a href="http://www.zope.org/">Zope</a>. This proved less than
146 ideal for this purpose, so the focus for this year is to port this
147 engine to a relational database backend. Currently development
148 continued with PostgreSQL. It is planned to enable Topic Map editing
149 within XEmacs UTF-2000, but also to allow multiple clients in addtion
153 <h2>Database of features of characters</h2>
155 <h3>Database of the component structure of Chinese Characters</h3>
158 Based on the Ideographic Description Characters (IDS) in
159 ISO/IEC 10646-1:2000 and Unicode, we are now developping a database
160 that expresses the structure of Chinese Characters using this syntax.
161 At the moment, we are using the characters in the Unicode tables as a
162 reference. The basic <emph>CJK Unified Ideographs</emph>, as well as
163 <emph>Extension A</emph> and <emph>Extension B</epmph>, together more
164 than 70000 characters are currently covered.
168 <a href="images/ids-ext-b-1.png">
169 <img align="ids" src="images/ids-ext-b-1-s.png">
171 Table of the component structure database
176 The following tables are currently available via anonymous CVS from <a
177 href="http://cvs.m17n.org/">cvs.m17n.org</a> at <a
178 href="http://cvs.m17n.org/cgi-bin/viewcvs/?cvsroot=chise">/cvs/chise</a>
180 href="http://cvs.m17n.org/cgi-bin/viewcvs/ids/?cvsroot=chise">ids:</a>
186 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Basic.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
189 <dd>CJK Unified Ideographs (U+4E00 〜 U+9FA5) of ISO/IEC
193 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-A.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
196 <dd>CJK Unified Ideographs Extension A (U+3400 〜 U+4DB5, U+FA1F and
197 U+FA23) of ISO/IEC 10646-1:2000
200 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Compat.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
203 <dd>CJK Compatibility Ideographs (U+F900 〜 U+FA2D, except U+FA1F
204 and U+FA23) of ISO/IEC 10646-1:2000
207 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-1.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
210 <dd>CJK Unified Ideographs Extension B [part 1] (U-00020000 〜
211 U-00021FFF) of ISO/IEC 10646-2:2001
214 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-2.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
217 <dd>CJK Unified Ideographs Extension B [part 2] (U-00022000 〜
218 U-00023FFF) of ISO/IEC 10646-2:2001
220 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-3.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
223 <dd>CJK Unified Ideographs Extension B [part 3] (U-00024000 〜
224 U-00025FFF) of ISO/IEC 10646-2:2001
226 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-4.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
229 <dd>CJK Unified Ideographs Extension B [part 4] (U-00026000 〜
230 U-00027FFF) of ISO/IEC 10646-2:2001
232 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-5.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
235 <dd>CJK Unified Ideographs Extension B [part 5] (U-00028000 〜
236 U-00029FFF) of ISO/IEC 10646-2:2001
238 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-6.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
241 <dd>CJK Unified Ideographs Extension B [part 6] (U-0002A000 〜
242 U-0002A6D6) of ISO/IEC 10646-2:2001
244 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Compat-Supplement.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
245 IDS-UCS-Compat-Supplement.txt
247 <dd>CJK Compatibility Ideographs Supplement (U-0002F800 〜
248 U-0002FA1D) of ISO/IEC 10646-2:2001
250 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-01.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
253 <dd>Morohashi: Daikanwa Jiten, Volume 1
255 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-02.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
258 <dd>Morohashi: Daikanwa Jiten, Volume 2
260 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-03.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
263 <dd>Morohashi: Daikanwa Jiten, Volume 3
265 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-04.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
268 <dd>Morohashi: Daikanwa Jiten, Volume 4
270 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-05.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
273 <dd>Morohashi: Daikanwa Jiten, Volume 5
275 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-06.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
278 <dd>Morohashi: Daikanwa Jiten, Volume 6
280 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-07.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
283 <dd>Morohashi: Daikanwa Jiten, Volume 7
285 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-08.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
288 <dd>Morohashi: Daikanwa Jiten, Volume 8
290 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-09.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
293 <dd>Morohashi: Daikanwa Jiten, Volume 9
295 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-10.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
298 <dd>Morohashi: Daikanwa Jiten, Volume 10
300 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-11.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
303 <dd>Morohashi: Daikanwa Jiten, Volume 11
305 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-12.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
308 <dd>Morohashi: Daikanwa Jiten, Volume 12
310 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-dx.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
313 <dd>Morohashi: Daikanwa Jiten, Additions
315 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-ho.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
318 <dd>Morohashi: Daikanwa Jiten, Appendix
320 href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-CBETA.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
323 <dd>Characters encountered by the <a href="http://www.cbeta.org/">Chinese Buddhist Electronic Text
324 Association (CBETA)</a>
329 <li><a href="http://web.sfc.keio.ac.jp/~kamichi/">Koichi KAMICHI</a>
330 (<a href="http://www.fonts.jp/">
331 Forum for development of on-the-fly generation of Kanji Fonts
333 <a href="http://www.fonts.jp/search.html">
334 Analytic tool for Kanji Fonts (in Japanese)
338 <h3><a name="glyph">Intgegration and Composition of Character Glyphs
339 and Styles</a></h3> <p> In the character database is information about
340 character glyphs and styles collected. This allows to use this
341 information together with the other knowledge about a character in the
342 database to built a system that uses the <a href="#ids">component
343 structure information </a> to assemble the font for a character
344 depending on the contextual requirements from its components. With
345 this system, occurrences of mismatches based on erroneous association
346 or insufficient contextual information are excluded, and it will be
347 possible easily display and print character forms that have not been codified and for
348 which no fonts exists .
351 <a href="http://www.fonts.jp/">
352 Forum for development of on-the-fly generation of Kanji Fonts
357 <h3><a name="network">Mathematical analysis and visualation of
358 character knowledge</a></h3>
360 <li>Yoshi Fujiwara, Yasuhiro Suzuki, Tomohiko
362 href="http://www2.crl.go.jp/jt/a134/yoshi/pc/kanji/nw.ps">
363 Network of Words</a>”, <a href="http://arob.cc.oita-u.ac.jp/">
364 Artificial Life and Robotics 2002</a>
365 (<a href="http://www2.crl.go.jp/jt/a134/yoshi/pc/kanji/index.html">
366 Presentation material
368 <li>Model for the relation of Kanji characters that share a component
371 href="http://www2.crl.go.jp/jt/a134/yoshi/pc/kanji/mage1.jpg">
373 src="images/mage1-s.jpg"><br>Image 1</a>
375 <a href="http://www2.crl.go.jp/jt/a134/yoshi/pc/kanji/mage2.jpg">
377 src="images/mage2-s.jpg"><br>Image 2</a>
382 <h2>CVS Repository</h2>
384 <a href="http://cvs.m17n.org/cgi-bin/viewcvs/?cvsroot=chise">/cvs/chise</a>
388 <h2>Mailing List</h2>
390 Discussion about the CHISE Project occur in the CHISE-{ja|en} mailing list.
392 Anybody who would like to take part in the discussion about and
393 development of the CHISE Project, has ideas or questions about the
394 implementation or wishes for new features is welcome to join either
395 the English, or the Japanese or both lists.
397 To become a member in the CHISE mailing, send a message to the
401 <dd><a href="mailto:chise-ja-ctl@m17n.org">
402 chise-ja-ctl@m17n.org</a>
405 <dd><a href="mailto:chise-en-ctl@m17n.org">
406 chise-en-ctl@m17n.org</a>
410 <blockquote>subscribe Your Name</blockquote>
411 in the body of the message. You will then receive a conformation
412 message with the line
415 confirm PASSWORD Your Name
416 </blockquote> You will have to reply to this message to become a member.
420 <h2>Papers and Presentations</h2>
422 <li><a href="xemacs/#presentation">
423 About XEmacs UTF-2000</a>
424 <li><a href="#network">About mathematical analysis of Character Information</a>
427 <li><a href="papers/u2k-plan.ja/">
428 “Model and Implementation of a Next Generation Multilingual
429 Processing System”
430 </a> (in Japanese. October 1999)
431 <li><a href="http://www.kanji.zinbun.kyoto-u.ac.jp/~wittern/">WITTERN, Christian</a>,
432 “Non-system characters in XML documents”, in:
433 <i>The Frontier of Asian Information Processing</i>
434 [Seminar Series of the National Documentation and
435 Information Centers in Humanities] No. 10, November 2000
436 <li><a href="http://www.kanji.zinbun.kyoto-u.ac.jp/~tomo/">MORIOKA Tomohiko</a>,
437 “The UTF-2000 Project”, in:
439 href="http://www.kanji.zinbun.kyoto-u.ac.jp/publications/kanji-and-info-2.pdf">
440 Kanji and Information, No.2</a>, March 2001 (in Japanese)
441 <li><a href="http://www.kanji.zinbun.kyoto-u.ac.jp/~tomo/">MORIOKA Tomohiko</a>,
442 “CHISE project &emdash; beyond the UTF-2000”,
443 <a href="http://www.m17n.org/m17n2001/">
444 m17n2001: the Fifth International Symposium on Multilingual
445 Information Processing and Open Source Software
447 <li><a href="http://www.kanji.zinbun.kyoto-u.ac.jp/~tomo/">MORIOKA Tomohiko</a>,
448 “A Short Introduction to UTF-2000 Project”,
449 the First TEI Character Set Issues Working Group (October 2001,
450 University of California, Berkeley, USA).
451 <li><a href="http://www.kanji.zinbun.kyoto-u.ac.jp/~wittern/">WITTERN, Christian</a>,
452 “What is Digitisation?”, in:
454 href="http://www.kanji.zinbun.kyoto-u.ac.jp/publications/kanji-and-info-3.pdf">
455 Kanji and Information, No.3</a>, October 2001 (in Japanese).
456 <li><a href="http://www.ya.sakura.ne.jp/~moro/">MORO, Shigeki</a>,
457 “The meaning of 'beyond character codes'”, in:
459 href="http://www.kanji.zinbun.kyoto-u.ac.jp/publications/kanji-and-info-3.pdf">
460 Kanji and Information, No.3</a>, October 2001 (in Japanese).
461 <li><a href="http://www.kanji.zinbun.kyoto-u.ac.jp/~wittern/">WITTERN, Christian</a>,
462 “Some thoughts on the digitization of Kanji”,
463 <i>Information Technology and the Humanities</i>
464 [Seminar Series of the National Documentation and
465 Information Centers in Humanities] No. 11, November 2001.
466 <li><a href="http://web.sfc.keio.ac.jp/~kamichi/">KAMICHI, Koichi</a>,
467 “Building KAGE (Kanji-font Automatic Generating Engine):
468 The Next Gerenation of Kanji Processing beyond the Character Code Model”
469 in <a href="http://www.jaet.gr.jp/jj/3.html"><i>Journal of Japan Association for
470 East Asian Text Processing (JAET)</i> No. 3</a>, October 2002 (in Japanese).
471 <li><a href="http://www.ya.sakura.ne.jp/~moro/">MORO, Shigeki</a>,
472 “Software Review: CHISE Project,”
473 in <a href="http://www.jaet.gr.jp/jj/3.html"><i>Journal of Japan Association for
474 East Asian Text Processing (JAET)</i> No. 3</a>, October 2002 (in Japanese).
475 <!-- <li><a href="http://www.kanji.zinbun.kyoto-u.ac.jp/~tomo/">MORIOKA, Tomohiko</a>,
476 <a href="papers/dc2002.pdf">
477 「ポスト文字コード時代の文書処理技術に関する展望」</a>、
479 (全国文献・情報センター人文社会科学学術セミナーシリーズ No.12),
481 <li><a href="http://www.kanji.zinbun.kyoto-u.ac.jp/~tomo/">Morioka, Tomohiko</a>,
482 <a href="http://ya.sakura.ne.jp/~moro/">Moro, Shigeki</a>.
483 “Moji-sosei ni motozuku moji-shori
484 (Character Processing based on Character Features).”
485 <cite><a href="http://www.ipsj.or.jp/members/SIGNotes/Jpn/17/2004/062/"
486 >IPSJ SIG Technical Report Vol. 2004, No. 58 (2004-CH-62)</a></cite>.
487 May, 2004. pp. 53-60. (in Japanese)</li>
492 <h2><a href="history">History</a></h2>
494 This project was assisted by <a
495 href="http://www.ipa.go.jp/NBP/13nendo/13mito/koubo13.html">IPA Exploratory
496 Software Project, 2001</a>.
501 <b>[<a href="../">Above</a>]</b>
503 <p><img SRC="images/dragon.jpg" height=146 width=198></center>
508 <a href="http://www.kanji.zinbun.kyoto-u.ac.jp/">Documentation and Information Center for Chinese Studies (DICCS)</a>,
509 <a href="http://www.zinbun.kyoto-u.ac.jp/">Institute for Research in the Humanities</a>,
510 <a href="http://www.kyoto-u.ac.jp/">Kyoto University</a>
513 <a href="http://www.m17n.org/">m17n.org (the Organization for Multilingualization)</a>
514 <a href="http://www.aist.go.jp/">(National Institute of Advanced Industrial Science and Technology)</a>
518 <a href="http://www.hanazono.ac.jp/">Hanazono University</a>
521 <a href="http://www.aist.go.jp/">National Institute of Advanced Industrial Science and Technology</a>
524 <a href="http://bioinfo.tmd.ac.jp/">Dept. of Bioinformatics</a>,
525 <a href="http://www.tmd.ac.jp/mri/mri.html">Medical Research Institute</a>,
526 <a href="http://www.tmd.ac.jp/">Tokyo Medical and Dental University</a>
532 Last modified: Mon May 17 02:42:17 JST 2004
534 <a href="http://www.aurora.dti.ne.jp/~zom/Counter/index.html">
536 src="http://mousai.as.wakwak.ne.jp/cgi-bin/counterp.cgi?projects_chise-en.log"
542 <!-- Keep this comment at the end of the file
546 time-stamp-line-limit:40