</table>
</h1>
<p>
-<!-- hhmts start -->
-Last modified: Fri Sep 27 00:30:59 JST 2002
-<!-- hhmts end -->
<br>
<b><a href="index.html.ja.iso-2022-jp"><img
src="images/japanese-page.png">
<hr>
<!--
-<h2>\e$BJ8;zCN<1%G!<%?%Y!<%9$K4p$E$/J8;z=hM}%"!<%-%F%/%A%c$N3+H/\e(B</h2>
+<h2>文字知識データベースに基づく文字処理アーキテクチャの開発</h2>
-->
<h2>Development of a character processing architecture based on a
character knowledge base</h2>
-<h3><a name="xemacs/">XEmacs UTF-2000</a></h3> <p> <!-- \e$B30ItJ8;z%G!<%?\e(B
-\e$B%Y!<%9$+$iJ8;zB0@-$r\e(B lazy-loading \e$B2DG=$K$J$j$^$7$?!#\e(BIA32 \e$B%"!<%-%F%/%A%c\e(B
-\e$B$G<B9T7A<0$NBg$-$5$,=>MhLs\e(B 30 MB \e$B$@$C$?$N$,Ls\e(B 15 MB \e$B$K$J$j$^$7$?!#8=:_!"\e(B
-cvs.m17n.org \e$B$N\e(B /cvs/root \e$B$N\e(BXEmacs \e$B%b%8%e!<%k$N\e(B utf-2000 \e$B;^$G$+$i\e(B
-anonymous CVS \e$B$GF~<j2DG=$G$9!#\e(B--> It is now possible to load character
+<h3><a name="xemacs/">XEmacs UTF-2000</a></h3> <p> <!-- 外部文字データ
+ベースから文字属性を lazy-loading 可能になりました。IA32 アーキテクチャ
+で実行形式の大きさが従来約 30 MB だったのが約 15 MB になりました。現在、
+cvs.m17n.org の /cvs/root のXEmacs モジュールの utf-2000 枝でから
+anonymous CVS で入手可能です。--> It is now possible to load character
attributes from a external database on demand ("lazy loading"). On
Intel 32 bit processor architectures, the size of the executable file
thus shrinks from the 30 MB required with the traditional built to
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Basic.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
IDS-UCS-Basic.txt
</a>
- <dd>CJK Unified Ideographs (U+4E00 \e$B!A\e(B U+9FA5) of ISO/IEC
+ <dd>CJK Unified Ideographs (U+4E00 〜 U+9FA5) of ISO/IEC
10646-1:2000
<dt><a
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-A.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
IDS-UCS-Ext-A.txt
</a>
- <dd>CJK Unified Ideographs Extension A (U+3400 \e$B!A\e(B U+4DB5, U+FA1F and
+ <dd>CJK Unified Ideographs Extension A (U+3400 〜 U+4DB5, U+FA1F and
U+FA23) of ISO/IEC 10646-1:2000
<dt><a
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Compat.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
IDS-UCS-Compat.txt
</a>
- <dd>CJK Compatibility Ideographs (U+F900 \e$B!A\e(B U+FA2D, except U+FA1F
+ <dd>CJK Compatibility Ideographs (U+F900 〜 U+FA2D, except U+FA1F
and U+FA23) of ISO/IEC 10646-1:2000
<dt><a
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-1.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
IDS-UCS-Ext-B-1.txt
</a>
- <dd>CJK Unified Ideographs Extension B [part 1] (U-00020000 \e$B!A\e(B
+ <dd>CJK Unified Ideographs Extension B [part 1] (U-00020000 〜
U-00021FFF) of ISO/IEC 10646-2:2001
<dt><a
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-2.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
IDS-UCS-Ext-B-2.txt
</a>
- <dd>CJK Unified Ideographs Extension B [part 2] (U-00022000 \e$B!A\e(B
+ <dd>CJK Unified Ideographs Extension B [part 2] (U-00022000 〜
U-00023FFF) of ISO/IEC 10646-2:2001
<dt><a
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-3.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
IDS-UCS-Ext-B-3.txt
</a>
- <dd>CJK Unified Ideographs Extension B [part 3] (U-00024000 \e$B!A\e(B
+ <dd>CJK Unified Ideographs Extension B [part 3] (U-00024000 〜
U-00025FFF) of ISO/IEC 10646-2:2001
<dt><a
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-4.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
IDS-UCS-Ext-B-4.txt
</a>
- <dd>CJK Unified Ideographs Extension B [part 4] (U-00026000 \e$B!A\e(B
+ <dd>CJK Unified Ideographs Extension B [part 4] (U-00026000 〜
U-00027FFF) of ISO/IEC 10646-2:2001
<dt><a
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-5.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
IDS-UCS-Ext-B-5.txt
</a>
- <dd>CJK Unified Ideographs Extension B [part 5] (U-00028000 \e$B!A\e(B
+ <dd>CJK Unified Ideographs Extension B [part 5] (U-00028000 〜
U-00029FFF) of ISO/IEC 10646-2:2001
<dt><a
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-6.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
IDS-UCS-Ext-B-6.txt
</a>
- <dd>CJK Unified Ideographs Extension B [part 6] (U-0002A000 \e$B!A\e(B
+ <dd>CJK Unified Ideographs Extension B [part 6] (U-0002A000 〜
U-0002A6D6) of ISO/IEC 10646-2:2001
<dt><a
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Compat-Supplement.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
IDS-UCS-Compat-Supplement.txt
</a>
- <dd>CJK Compatibility Ideographs Supplement (U-0002F800 \e$B!A\e(B
+ <dd>CJK Compatibility Ideographs Supplement (U-0002F800 〜
U-0002FA1D) of ISO/IEC 10646-2:2001
<dt><a
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-01.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
character knowledge</a></h3>
<ul>
<li>Yoshi Fujiwara, Yasuhiro Suzuki, Tomohiko
- Morioka, \e$B!H\e(B<a
+ Morioka, “<a
href="http://www2.crl.go.jp/jt/a134/yoshi/pc/kanji/nw.ps">
- Network of Words</a>\e$B!I\e(B, <a href="http://arob.cc.oita-u.ac.jp/">
+ Network of Words</a>”, <a href="http://arob.cc.oita-u.ac.jp/">
Artificial Life and Robotics 2002</a>
(<a href="http://www2.crl.go.jp/jt/a134/yoshi/pc/kanji/index.html">
Presentation material
src="images/mage1-s.jpg"><br>Image 1</a>
<br>
<a href="http://www2.crl.go.jp/jt/a134/yoshi/pc/kanji/mage2.jpg">
- <img alt="\e$BLO<0?^\e(B2"
+ <img alt="模式図2"
src="images/mage2-s.jpg"><br>Image 2</a>
</ul>
<!-- <h2>TOMOYO Project</h2> -->
<!-- <p> -->
<!-- TOMOYO (Text Operation Models and Outfits for Your Objects) -->
-<!-- \e$B%W%m%8%'%/%H$O!"=>Mh!V\e(BUTF-2000 \e$B%W%m%8%'%/%H!W$H8F$s$G$$$?$b$N$G!"\e(B -->
-<!-- \e$BJ8;zCN<1%G!<%?%Y!<%9$K4p$E$/\e(B -->
-<!-- \e$BJ8;z=hM}%"!<%-%F%/%A%c$r3+H/$9$k$?$a$N%W%m%8%'%/%H$G$9!#\e(B -->
+<!-- プロジェクトは、従来「UTF-2000 プロジェクト」と呼んでいたもので、 -->
+<!-- 文字知識データベースに基づく -->
+<!-- 文字処理アーキテクチャを開発するためのプロジェクトです。 -->
<!-- </p> -->
"Model and Implementation of a Next Generation Multilingual
Processing System"
</a> (October 1999)
- <li>WITTERN, Christian, \e$B!H\e(BNon-system characters in XML documents\e$B!I\e(B, in:
+ <li>WITTERN, Christian, “Non-system characters in XML documents”, in:
<i>The Frontier of Asian Information Processing</i>
[Seminar Series of the National Documentation and
Information Centers in Humanities] No. 10, November 2000
- <li>MORIOKA Tomohiko, \e$B!V\e(BThe UTF-2000 Project\e$B!W\e(B, in:
+ <li>MORIOKA Tomohiko, 「The UTF-2000 Project」, in:
<a
href="http://www.kanji.zinbun.kyoto-u.ac.jp/publications/kanji-and-info-2.pdf">
Kanji and Information, No.2</a>, March 2001
- <li>MORIOKA Tomohiko,\e$B!H\e(BCHISE project &emdash; beyond the UTF-2000\e$B!I\e(B,
+ <li>MORIOKA Tomohiko,“CHISE project &emdash; beyond the UTF-2000”,
<a href="http://www.m17n.org/m17n2001/">
m17n2001: the Fifth International Symposium on Multilingual
Information Processing and Open Source Software
</a>.
- <li>MORIOKA Tomohiko, \e$B!H\e(BA Short Introduction to UTF-2000 Project\e$B!I\e(B,
+ <li>MORIOKA Tomohiko, “A Short Introduction to UTF-2000 Project”,
the First TEI Character Set Issues Working Group (October 2001,
University of California, Berkeley, USA).
<li>WITTERN, Christian, "What is Digitisation?", in:
<a
href="http://www.kanji.zinbun.kyoto-u.ac.jp/publications/kanji-and-info-3.pdf">
Kanji and Information, No.3</a>, October 2001
- <li>WITTERN, Christian, \e$B!H\e(BSome thoughts on the digitization of Kanji\e$B!I\e(B,
+ <li>WITTERN, Christian, “Some thoughts on the digitization of Kanji”,
<i>Information Technology and the Humanities</i>
[Seminar Series of the National Documentation and
Information Centers in Humanities] No. 11, November 2001
</a>]</b>
<p><img SRC="images/dragon.jpg" height=146 width=198></center>
+<hr>
+
+<!-- hhmts start -->
+Last modified: Wed Oct 9 03:33:25 JST 2002
+<!-- hhmts end -->.
+<a href="http://www.aurora.dti.ne.jp/~zom/Counter/index.html">
+<img
+ src="http://mousai.as.wakwak.ne.jp/cgi-bin/counterp.cgi?projects_chise-en.log"
+ alt="counter"></a>
+since Oct 9 2002.
+
</body>
</html>
<!-- Keep this comment at the end of the file