<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
"http://www.w3.org/TR/html4/loose.dtd">
-<html lang="ja">
+<html lang="en">
<head>
<title>CHaracter Information Service Environment</title>
</head>
</table>
</h1>
<p>
-<!-- hhmts start -->
-Time-stamp: "2002-09-26 17:51:22 JST chris"
-<!-- hhmts end -->
<br>
<b><a href="index.html.ja.iso-2022-jp"><img
-src="../../images/japanese-page.gif">
+src="images/japanese-page.png">
</a></b><br>
<hr>
href="http://www.kanji.zinbun.kyoto-u.ac.jp/~tomo/">Tomohiko
MORIOKA</a> and
<a href="http://www.kanji.zinbun.kyoto-u.ac.jp/~wittern/">
- Christian WITTERN</a> are presenting at the <a
+ Christian WITTERN</a> made a presentation at the <a
href="http://pnc-ecai.oiu.ac.jp/prog2.htm">
PNC Annual Conference and Joint Meetings 2002
</a>.
<li>2002-09-19 <a
href="http://www.kanji.zinbun.kyoto-u.ac.jp/~tomo/"
- >Tomohiko MORIOKA</a> is presenting at the <a href="http://lc.linux.or.jp/lc2002/">
+ >Tomohiko MORIOKA</a> gave a presentation at the <a href="http://lc.linux.or.jp/lc2002/">
Linux Conference 2002</a>
<li>2002-08-21 <a href="http://www.kanji.zinbun.kyoto-u.ac.jp/projects/chise/dist/XEmacs/xemacs-utf-2000-0.19.tar.gz">
XEmacs UTF-2000 0.19 (Koriyama)
<hr>
<!--
-<h2>\e$BJ8;zCN<1%G!<%?%Y!<%9$K4p$E$/J8;z=hM}%"!<%-%F%/%A%c$N3+H/\e(B</h2>
+<h2>文字知識データベースに基づく文字処理アーキテクチャの開発</h2>
-->
<h2>Development of a character processing architecture based on a
character knowledge base</h2>
-<h3><a name="xemacs/">XEmacs UTF-2000</a></h3> <p> <!-- \e$B30ItJ8;z%G!<%?\e(B
-\e$B%Y!<%9$+$iJ8;zB0@-$r\e(B lazy-loading \e$B2DG=$K$J$j$^$7$?!#\e(BIA32 \e$B%"!<%-%F%/%A%c\e(B
-\e$B$G<B9T7A<0$NBg$-$5$,=>MhLs\e(B 30 MB \e$B$@$C$?$N$,Ls\e(B 15 MB \e$B$K$J$j$^$7$?!#8=:_!"\e(B
-cvs.m17n.org \e$B$N\e(B /cvs/root \e$B$N\e(BXEmacs \e$B%b%8%e!<%k$N\e(B utf-2000 \e$B;^$G$+$i\e(B
-anonymous CVS \e$B$GF~<j2DG=$G$9!#\e(B--> It is now possible to load character
+<h3><a name="xemacs/">XEmacs UTF-2000</a></h3> <p> <!-- 外部文字データ
+ベースから文字属性を lazy-loading 可能になりました。IA32 アーキテクチャ
+で実行形式の大きさが従来約 30 MB だったのが約 15 MB になりました。現在、
+cvs.m17n.org の /cvs/root のXEmacs モジュールの utf-2000 枝でから
+anonymous CVS で入手可能です。--> It is now possible to load character
attributes from a external database on demand ("lazy loading"). On
Intel 32 bit processor architectures, the size of the executable file
thus shrinks from the 30 MB required with the traditional built to
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Basic.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
IDS-UCS-Basic.txt
</a>
- <dd>CJK Unified Ideographs (U+4E00 \e$B!A\e(B U+9FA5) of ISO/IEC
+ <dd>CJK Unified Ideographs (U+4E00 〜 U+9FA5) of ISO/IEC
10646-1:2000
<dt><a
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-A.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
IDS-UCS-Ext-A.txt
</a>
- <dd>CJK Unified Ideographs Extension A (U+3400 \e$B!A\e(B U+4DB5, U+FA1F and
+ <dd>CJK Unified Ideographs Extension A (U+3400 〜 U+4DB5, U+FA1F and
U+FA23) of ISO/IEC 10646-1:2000
<dt><a
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Compat.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
IDS-UCS-Compat.txt
</a>
- <dd>CJK Compatibility Ideographs (U+F900 \e$B!A\e(B U+FA2D, except U+FA1F
+ <dd>CJK Compatibility Ideographs (U+F900 〜 U+FA2D, except U+FA1F
and U+FA23) of ISO/IEC 10646-1:2000
<dt><a
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-1.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
IDS-UCS-Ext-B-1.txt
</a>
- <dd>CJK Unified Ideographs Extension B [part 1] (U-00020000 \e$B!A\e(B
+ <dd>CJK Unified Ideographs Extension B [part 1] (U-00020000 〜
U-00021FFF) of ISO/IEC 10646-2:2001
<dt><a
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-2.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
IDS-UCS-Ext-B-2.txt
</a>
- <dd>CJK Unified Ideographs Extension B [part 2] (U-00022000 \e$B!A\e(B
+ <dd>CJK Unified Ideographs Extension B [part 2] (U-00022000 〜
U-00023FFF) of ISO/IEC 10646-2:2001
<dt><a
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-3.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
IDS-UCS-Ext-B-3.txt
</a>
- <dd>CJK Unified Ideographs Extension B [part 3] (U-00024000 \e$B!A\e(B
+ <dd>CJK Unified Ideographs Extension B [part 3] (U-00024000 〜
U-00025FFF) of ISO/IEC 10646-2:2001
<dt><a
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-4.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
IDS-UCS-Ext-B-4.txt
</a>
- <dd>CJK Unified Ideographs Extension B [part 4] (U-00026000 \e$B!A\e(B
+ <dd>CJK Unified Ideographs Extension B [part 4] (U-00026000 〜
U-00027FFF) of ISO/IEC 10646-2:2001
<dt><a
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-5.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
IDS-UCS-Ext-B-5.txt
</a>
- <dd>CJK Unified Ideographs Extension B [part 5] (U-00028000 \e$B!A\e(B
+ <dd>CJK Unified Ideographs Extension B [part 5] (U-00028000 〜
U-00029FFF) of ISO/IEC 10646-2:2001
<dt><a
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-6.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
IDS-UCS-Ext-B-6.txt
</a>
- <dd>CJK Unified Ideographs Extension B [part 6] (U-0002A000 \e$B!A\e(B
+ <dd>CJK Unified Ideographs Extension B [part 6] (U-0002A000 〜
U-0002A6D6) of ISO/IEC 10646-2:2001
<dt><a
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Compat-Supplement.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
IDS-UCS-Compat-Supplement.txt
</a>
- <dd>CJK Compatibility Ideographs Supplement (U-0002F800 \e$B!A\e(B
+ <dd>CJK Compatibility Ideographs Supplement (U-0002F800 〜
U-0002FA1D) of ISO/IEC 10646-2:2001
<dt><a
href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-01.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
character knowledge</a></h3>
<ul>
<li>Yoshi Fujiwara, Yasuhiro Suzuki, Tomohiko
- Morioka, \e$B!H\e(B<a
+ Morioka, “<a
href="http://www2.crl.go.jp/jt/a134/yoshi/pc/kanji/nw.ps">
- Network of Words</a>\e$B!I\e(B, <a href="http://arob.cc.oita-u.ac.jp/">
+ Network of Words</a>”, <a href="http://arob.cc.oita-u.ac.jp/">
Artificial Life and Robotics 2002</a>
(<a href="http://www2.crl.go.jp/jt/a134/yoshi/pc/kanji/index.html">
Presentation material
src="images/mage1-s.jpg"><br>Image 1</a>
<br>
<a href="http://www2.crl.go.jp/jt/a134/yoshi/pc/kanji/mage2.jpg">
- <img alt="\e$BLO<0?^\e(B2"
+ <img alt="模式図2"
src="images/mage2-s.jpg"><br>Image 2</a>
</ul>
<!-- <h2>TOMOYO Project</h2> -->
<!-- <p> -->
<!-- TOMOYO (Text Operation Models and Outfits for Your Objects) -->
-<!-- \e$B%W%m%8%'%/%H$O!"=>Mh!V\e(BUTF-2000 \e$B%W%m%8%'%/%H!W$H8F$s$G$$$?$b$N$G!"\e(B -->
-<!-- \e$BJ8;zCN<1%G!<%?%Y!<%9$K4p$E$/\e(B -->
-<!-- \e$BJ8;z=hM}%"!<%-%F%/%A%c$r3+H/$9$k$?$a$N%W%m%8%'%/%H$G$9!#\e(B -->
+<!-- プロジェクトは、従来「UTF-2000 プロジェクト」と呼んでいたもので、 -->
+<!-- 文字知識データベースに基づく -->
+<!-- 文字処理アーキテクチャを開発するためのプロジェクトです。 -->
<!-- </p> -->
<h2>Papers and Presentations</h2>
<ul>
<li><a href="xemacs/#presentation">
- About XEmacs</a>
+ About XEmacs UTF-2000</a>
<li><a href="#network">About mathematical analysis of Character Information</a>
<li>Other
<ul>
- <li><a href="papers/u2k-plan.ja/">
- "Model and Implementation of a Next Generation Multilingual
- Processing System"
- </a> (October 1999)
- <li>WITTERN, Christian, \e$B!H\e(BNon-system characters in XML documents\e$B!I\e(B, in:
- <i>The Frontier of Asian Information Processing</i>
- [Seminar Series of the National Documentation and
- Information Centers in Humanities] No. 10, November 2000
- <li>MORIOKA Tomohiko, \e$B!V\e(BThe UTF-2000 Project\e$B!W\e(B, in:
- <a
- href="http://www.kanji.zinbun.kyoto-u.ac.jp/publications/kanji-and-info-2.pdf">
- Kanji and Information, No.2</a>, March 2001
- <li>MORIOKA Tomohiko,\e$B!H\e(BCHISE project &emdash; beyond the UTF-2000\e$B!I\e(B,
- <a href="http://www.m17n.org/m17n2001/">
- m17n2001: the Fifth International Symposium on Multilingual
- Information Processing and Open Source Software
- </a>.
- <li>MORIOKA Tomohiko, \e$B!H\e(BA Short Introduction to UTF-2000 Project\e$B!I\e(B,
- the First TEI Character Set Issues Working Group (October 2001,
- University of California, Berkeley, USA).
- <li>WITTERN, Christian, "What is Digitisation?", in:
- <a
- href="http://www.kanji.zinbun.kyoto-u.ac.jp/publications/kanji-and-info-3.pdf">
- Kanji and Information, No.3</a>, October 2001
- <li>MORO Shigeki, "The meaning of 'beyond character codes'", in:
- <a
- href="http://www.kanji.zinbun.kyoto-u.ac.jp/publications/kanji-and-info-3.pdf">
- Kanji and Information, No.3</a>, October 2001
- <li>WITTERN, Christian, \e$B!H\e(BSome thoughts on the digitization of Kanji\e$B!I\e(B,
- <i>Information Technology and the Humanities</i>
- [Seminar Series of the National Documentation and
- Information Centers in Humanities] No. 11, November 2001
+ <li><a href="papers/u2k-plan.ja/">
+ “Model and Implementation of a Next Generation Multilingual
+ Processing System”
+ </a> (in Japanese. October 1999)
+ <li><a href="http://www.kanji.zinbun.kyoto-u.ac.jp/~wittern/">WITTERN, Christian</a>,
+ “Non-system characters in XML documents”, in:
+ <i>The Frontier of Asian Information Processing</i>
+ [Seminar Series of the National Documentation and
+ Information Centers in Humanities] No. 10, November 2000
+ <li><a href="http://www.kanji.zinbun.kyoto-u.ac.jp/~tomo/">MORIOKA Tomohiko</a>,
+ “The UTF-2000 Project”, in:
+ <a
+ href="http://www.kanji.zinbun.kyoto-u.ac.jp/publications/kanji-and-info-2.pdf">
+ Kanji and Information, No.2</a>, March 2001 (in Japanese)
+ <li><a href="http://www.kanji.zinbun.kyoto-u.ac.jp/~tomo/">MORIOKA Tomohiko</a>,
+ “CHISE project &emdash; beyond the UTF-2000”,
+ <a href="http://www.m17n.org/m17n2001/">
+ m17n2001: the Fifth International Symposium on Multilingual
+ Information Processing and Open Source Software
+ </a>.
+ <li><a href="http://www.kanji.zinbun.kyoto-u.ac.jp/~tomo/">MORIOKA Tomohiko</a>,
+ “A Short Introduction to UTF-2000 Project”,
+ the First TEI Character Set Issues Working Group (October 2001,
+ University of California, Berkeley, USA).
+ <li><a href="http://www.kanji.zinbun.kyoto-u.ac.jp/~wittern/">WITTERN, Christian</a>,
+ “What is Digitisation?”, in:
+ <a
+ href="http://www.kanji.zinbun.kyoto-u.ac.jp/publications/kanji-and-info-3.pdf">
+ Kanji and Information, No.3</a>, October 2001 (in Japanese).
+ <li><a href="http://www.ya.sakura.ne.jp/~moro/">MORO, Shigeki</a>,
+ “The meaning of 'beyond character codes'”, in:
+ <a
+ href="http://www.kanji.zinbun.kyoto-u.ac.jp/publications/kanji-and-info-3.pdf">
+ Kanji and Information, No.3</a>, October 2001 (in Japanese).
+ <li><a href="http://www.kanji.zinbun.kyoto-u.ac.jp/~wittern/">WITTERN, Christian</a>,
+ “Some thoughts on the digitization of Kanji”,
+ <i>Information Technology and the Humanities</i>
+ [Seminar Series of the National Documentation and
+ Information Centers in Humanities] No. 11, November 2001.
+ <li><a href="http://web.sfc.keio.ac.jp/~kamichi/">KAMICHI, Koichi</a>,
+ “Building KAGE (Kanji-font Automatic Generating Engine):
+ The Next Gerenation of Kanji Processing beyond the Character Code Model”
+ in <a href="http://www.jaet.gr.jp/jj/3.html"><i>Journal of Japan Association for
+ East Asian Text Processing (JAET)</i> No. 3</a>, October 2002 (in Japanese).
+ <li><a href="http://www.ya.sakura.ne.jp/~moro/">MORO, Shigeki</a>,
+ “Software Review: CHISE Project,”
+ in <a href="http://www.jaet.gr.jp/jj/3.html"><i>Journal of Japan Association for
+ East Asian Text Processing (JAET)</i> No. 3</a>, October 2002 (in Japanese).
+ <!-- <li><a href="http://www.kanji.zinbun.kyoto-u.ac.jp/~tomo/">MORIOKA, Tomohiko</a>,
+ <a href="papers/dc2002.pdf">
+ 「ポスト文字コード時代の文書処理技術に関する展望」</a>、
+ 「データベースの活用と人文社会科学」
+ (全国文献・情報センター人文社会科学学術セミナーシリーズ No.12),
+ 2002年11月 -->
</ul>
</ul>
</a>]</b>
<p><img SRC="images/dragon.jpg" height=146 width=198></center>
+<hr>
+
+<!-- hhmts start -->
+Last modified: Wed Oct 9 03:33:25 JST 2002
+<!-- hhmts end -->.
+<a href="http://www.aurora.dti.ne.jp/~zom/Counter/index.html">
+<img
+ src="http://mousai.as.wakwak.ne.jp/cgi-bin/counterp.cgi?projects_chise-en.log"
+ alt="counter"></a>
+since Oct 9 2002.
+
</body>
</html>
<!-- Keep this comment at the end of the file