English translation of the CHISE toppage.
authorchris <chris>
Thu, 26 Sep 2002 08:53:20 +0000 (08:53 +0000)
committerchris <chris>
Thu, 26 Sep 2002 08:53:20 +0000 (08:53 +0000)
index.html.en [new file with mode: 0644]

diff --git a/index.html.en b/index.html.en
new file mode 100644 (file)
index 0000000..694fc68
--- /dev/null
@@ -0,0 +1,433 @@
+<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
+            "http://www.w3.org/TR/html4/loose.dtd">
+<html lang="ja">
+<head>
+<title>CHaracter Information Service Environment</title>
+</head>
+<body>
+<p>
+[<a href="http://cvs.m17n.org/chise/">m17n.org</a>]
+[<a href="http://www.kanji.zinbun.kyoto-u.ac.jp/projects/chise/">
+Kyoto University, Institute for Research in Humanities, Documentation
+            and Information Center for Chinese Studies
+</a>]
+</p>
+
+<h1>
+<table cellspacing="8">
+<tr><th align="center" valign="top">
+<img alt="DICCS" src="images/cm450118-s.jpg">
+<td align="center" valign="middle">
+<font size="+3">CHISE project</font>
+</table>
+</h1>
+<p>
+<!-- hhmts start -->
+Time-stamp: "2002-09-26 17:44:00 JST chris"
+<!-- hhmts end -->
+<br>
+<b><a href="index.html.ja.iso-2022-jp"><img
+src="../../images/japanese-page.gif">
+</a></b><br>
+<hr>
+
+<h2>About the CHISE Project</h2>
+<p>
+The CHISE (CHaracter Information Service Environment) project attempts
+to collect and organize into a Knowledge-Base information about
+characters in the scripts of the world.  A new processing environment
+based on this architecture is currently under development.
+</p>
+
+
+<h2>News</h2>
+<ul>
+        <li>2002-09-19 <a
+       href="http://www.kanji.zinbun.kyoto-u.ac.jp/~tomo/"
+       >Tomohiko MORIOKA</a> is presenting at the <a href="http://lc.linux.or.jp/lc2002/">
+       Linux Conference 2002</a>
+   <li>2002-09-20 to 22 <a
+       href="http://www.kanji.zinbun.kyoto-u.ac.jp/~tomo/">Tomohiko
+                        MORIOKA</a> and 
+       <a href="http://www.kanji.zinbun.kyoto-u.ac.jp/~wittern/">
+       Christian WITTERN</a> are presenting at the <a
+       href="http://pnc-ecai.oiu.ac.jp/prog2.htm">
+       PNC Annual Conference and Joint Meetings 2002
+       </a>.
+   <li><a href="http://www.kanji.zinbun.kyoto-u.ac.jp/projects/chise/dist/XEmacs/xemacs-utf-2000-0.19.tar.gz">
+       XEmacs UTF-2000 0.19 (Koriyama)
+       </a> has been released.
+</ul>
+
+<hr>
+<!--
+<h2>\e$BJ8;zCN<1%G!<%?%Y!<%9$K4p$E$/J8;z=hM}%"!<%-%F%/%A%c$N3+H/\e(B</h2>
+-->
+<h2>Development of a character processing architecture based on a
+character knowledge base</h2>
+
+<h3><a name="xemacs/">XEmacs UTF-2000</a></h3> <p> <!-- \e$B30ItJ8;z%G!<%?\e(B
+\e$B%Y!<%9$+$iJ8;zB0@-$r\e(B lazy-loading \e$B2DG=$K$J$j$^$7$?!#\e(BIA32 \e$B%"!<%-%F%/%A%c\e(B
+\e$B$G<B9T7A<0$NBg$-$5$,=>MhLs\e(B 30 MB \e$B$@$C$?$N$,Ls\e(B 15 MB \e$B$K$J$j$^$7$?!#8=:_!"\e(B
+cvs.m17n.org \e$B$N\e(B /cvs/root \e$B$N\e(BXEmacs \e$B%b%8%e!<%k$N\e(B utf-2000 \e$B;^$G$+$i\e(B 
+anonymous CVS \e$B$GF~<j2DG=$G$9!#\e(B--> It is now possible to load character
+attributes from a external database on demand ("lazy loading").  On
+Intel 32 bit processor architectures, the size of the executable file
+thus shrinks from the 30 MB required with the traditional built to
+just about 15 MB. This can now be downloaded from <a
+href="http://www.kanji.zinbun.kyoto-u.ac.jp/projects/chise/dist/XEmacs/xemacs-utf-2000-0.19.tar.gz">
+XEmacs UTF-2000 0.19 (Koriyama)</a>. In addtion, there is a UTF-2000
+branch of the XEmacs tree at cvs.m17n.org in /cvs/root, this can be
+accessed by anonymous CVS </p>
+
+
+<h2>A <a name="topicmaps">
+<a href="http://www.topicmaps.org">TopicMaps</a> based approach to a
+character dababase 
+</a></h2>
+<p>
+In 2001 the prototype of a Topic Map engine has been developed based
+on <a href="http://www.zope.org/">Zope</a>.  This proved less than
+ideal for this purpose, so the focus for this year is to port this
+engine to a relational database backend.  Currently development
+continued with PostgreSQL. It is planned to enable Topic Map editing
+within  XEmacs UTF-2000, but also to allow multiple clients in addtion
+to this.
+</p>
+
+
+
+<h2>Database of features of characters</h2>
+
+<h3>Database of the component structure of Chinese Characters</h3>
+
+<p>
+Based on the Ideographic Description Characters (IDS) in 
+ISO/IEC 10646-1:2000 and Unicode, we are now developping a database
+that expresses the structure of Chinese Characters using this syntax. 
+At the moment, we are using the characters in the Unicode tables as a
+reference.  The basic <emph>CJK Unified Ideographs</emph>, as well as
+<emph>Extension A</emph> and <emph>Extension B</epmph>, together more
+than 70000 characters are currently covered.
+</p>
+
+<p>
+<a href="images/ids-ext-b-1.png">
+<img align="ids" src="images/ids-ext-b-1-s.png">
+<br>
+Table of the component structure database
+</a>
+</p>
+
+<p>
+The following tables are currently available via anonymous CVS from <a
+href="http://cvs.m17n.org/">cvs.m17n.org</a> at <a
+href="http://cvs.m17n.org/cgi-bin/viewcvs/?cvsroot=chise">/cvs/chise</a> 
+as module <a
+href="http://cvs.m17n.org/cgi-bin/viewcvs/ids/?cvsroot=chise">ids:</a> 
+</p>
+
+<blockquote>
+<dl compact>
+  <dt><a
+href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Basic.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+      IDS-UCS-Basic.txt
+      </a>
+  <dd>CJK Unified Ideographs (U+4E00 \e$B!A\e(B U+9FA5) of ISO/IEC
+      10646-1:2000
+
+  <dt><a
+href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-A.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+      IDS-UCS-Ext-A.txt
+      </a>
+  <dd>CJK Unified Ideographs Extension A (U+3400 \e$B!A\e(B U+4DB5, U+FA1F and
+      U+FA23) of ISO/IEC 10646-1:2000
+
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Compat.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+      IDS-UCS-Compat.txt
+      </a>
+  <dd>CJK Compatibility Ideographs (U+F900 \e$B!A\e(B U+FA2D, except U+FA1F
+      and U+FA23) of ISO/IEC 10646-1:2000
+
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-1.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+       IDS-UCS-Ext-B-1.txt
+      </a>
+  <dd>CJK Unified Ideographs Extension B [part 1] (U-00020000 \e$B!A\e(B 
+      U-00021FFF) of ISO/IEC 10646-2:2001
+
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-2.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+       IDS-UCS-Ext-B-2.txt
+      </a>
+  <dd>CJK Unified Ideographs Extension B [part 2] (U-00022000 \e$B!A\e(B 
+      U-00023FFF) of ISO/IEC 10646-2:2001
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-3.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+       IDS-UCS-Ext-B-3.txt
+      </a>
+  <dd>CJK Unified Ideographs Extension B [part 3] (U-00024000 \e$B!A\e(B 
+      U-00025FFF) of ISO/IEC 10646-2:2001
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-4.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+       IDS-UCS-Ext-B-4.txt
+      </a>
+  <dd>CJK Unified Ideographs Extension B [part 4] (U-00026000 \e$B!A\e(B
+      U-00027FFF) of ISO/IEC 10646-2:2001
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-5.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+       IDS-UCS-Ext-B-5.txt
+      </a>
+  <dd>CJK Unified Ideographs Extension B [part 5] (U-00028000 \e$B!A\e(B
+      U-00029FFF) of ISO/IEC 10646-2:2001
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Ext-B-6.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+       IDS-UCS-Ext-B-6.txt
+      </a>
+  <dd>CJK Unified Ideographs Extension B [part 6] (U-0002A000 \e$B!A\e(B
+      U-0002A6D6) of ISO/IEC 10646-2:2001
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-UCS-Compat-Supplement.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+      IDS-UCS-Compat-Supplement.txt
+      </a>
+  <dd>CJK Compatibility Ideographs Supplement (U-0002F800 \e$B!A\e(B 
+      U-0002FA1D) of ISO/IEC 10646-2:2001
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-01.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+      IDS-Daikanwa-01.txt
+      </a>
+  <dd>Morohashi: Daikanwa Jiten, Volume 1
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-02.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+      IDS-Daikanwa-02.txt
+      </a>
+  <dd>Morohashi: Daikanwa Jiten, Volume 2
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-03.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+      IDS-Daikanwa-03.txt
+      </a>
+  <dd>Morohashi: Daikanwa Jiten, Volume 3
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-04.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+      IDS-Daikanwa-04.txt
+      </a>
+  <dd>Morohashi: Daikanwa Jiten, Volume 4
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-05.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+      IDS-Daikanwa-05.txt
+      </a>
+  <dd>Morohashi: Daikanwa Jiten, Volume 5
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-06.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+      IDS-Daikanwa-06.txt
+      </a>
+  <dd>Morohashi: Daikanwa Jiten, Volume 6
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-07.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+      IDS-Daikanwa-07.txt
+      </a>
+  <dd>Morohashi: Daikanwa Jiten, Volume 7
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-08.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+      IDS-Daikanwa-08.txt
+      </a>
+  <dd>Morohashi: Daikanwa Jiten, Volume 8
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-09.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+      IDS-Daikanwa-09.txt
+      </a>
+  <dd>Morohashi: Daikanwa Jiten, Volume 9
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-10.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+      IDS-Daikanwa-10.txt
+      </a>
+  <dd>Morohashi: Daikanwa Jiten, Volume 10
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-11.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+      IDS-Daikanwa-11.txt
+      </a>
+  <dd>Morohashi: Daikanwa Jiten, Volume 11
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-12.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+      IDS-Daikanwa-12.txt
+      </a>
+  <dd>Morohashi: Daikanwa Jiten, Volume 12
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-dx.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+      IDS-Daikanwa-dx.txt
+      </a>
+  <dd>Morohashi: Daikanwa Jiten, Additions
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-Daikanwa-ho.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+      IDS-Daikanwa-ho.txt
+      </a>
+  <dd>Morohashi: Daikanwa Jiten, Appendix
+  <dt><a
+      href="http://cvs.m17n.org/cgi-bin/viewcvs/*checkout*/ids/IDS-CBETA.txt?rev=HEAD&cvsroot=chise&content-type=text/plain">
+      IDS-CBETA.txt
+      </a>
+  <dd>Characters encountered by the <a href="http://www.cbeta.org/">Chinese Buddhist Electronic Text
+      Association (CBETA)</a>
+</dl>
+</blockquote>
+
+<ul>
+  <li><a href="http://web.sfc.keio.ac.jp/~kamichi/">Koichi KAMICHI</a>
+      (<a href="http://www.fonts.jp/">
+      Forum for development of on-the-fly generation of Kanji Fonts
+      </a>)
+      <a href="http://www.fonts.jp/search.html">
+                       Analytic tool for Kanji Fonts (in Japanese)
+      </a>
+</ul>
+
+
+<h3><a name="glyph">Intgegration and Composition of Character Glyphs
+and Styles</a></h3> <p> In the character database is information about
+character glyphs and styles collected.  This allows to use this
+information together with the other knowledge about a character in the
+database to built a system that uses the <a href="#ids">component
+structure information </a> to assemble the font for a character
+depending on the contextual requirements from its components.  With
+this system, occurrences of mismatches based on erroneous association
+or insufficient contextual information are excluded, and it will be
+possible easily display and print character forms that have not been codified and for
+which no fonts exists .
+<ul>
+  <li>
+      <a href="http://www.fonts.jp/">
+      Forum for development of on-the-fly generation of Kanji Fonts
+      </a>
+</ul>
+
+
+<h3><a name="network">Mathematical analysis and visualation of
+character knowledge</a></h3>
+<ul>
+  <li>Yoshi Fujiwara, Yasuhiro Suzuki, Tomohiko
+      Morioka, \e$B!H\e(B<a
+      href="http://www2.crl.go.jp/jt/a134/yoshi/pc/kanji/nw.ps">
+      Network of Words</a>\e$B!I\e(B, <a href="http://arob.cc.oita-u.ac.jp/">
+      Artificial Life and Robotics 2002</a>
+      (<a href="http://www2.crl.go.jp/jt/a134/yoshi/pc/kanji/index.html">
+                       Presentation material
+      </a>)
+  <li>Model for the relation of Kanji characters that share a component
+      <br>
+      <a
+                       href="http://www2.crl.go.jp/jt/a134/yoshi/pc/kanji/mage1.jpg">
+      <img alt="Image 1"
+      src="images/mage1-s.jpg"><br>Image 1</a>
+&nbsp;<br>
+      <a href="http://www2.crl.go.jp/jt/a134/yoshi/pc/kanji/mage2.jpg">
+      <img alt="\e$BLO<0?^\e(B2"
+      src="images/mage2-s.jpg"><br>Image 2</a>
+</ul>
+
+<!--  <h2>TOMOYO Project</h2> -->
+<!--  <p> -->
+<!--  TOMOYO (Text Operation Models and Outfits for Your Objects) -->
+<!--  \e$B%W%m%8%'%/%H$O!"=>Mh!V\e(BUTF-2000 \e$B%W%m%8%'%/%H!W$H8F$s$G$$$?$b$N$G!"\e(B -->
+<!--  \e$BJ8;zCN<1%G!<%?%Y!<%9$K4p$E$/\e(B -->
+<!--  \e$BJ8;z=hM}%"!<%-%F%/%A%c$r3+H/$9$k$?$a$N%W%m%8%'%/%H$G$9!#\e(B -->
+<!--  </p> -->
+
+
+<hr>
+<h2>Mailing List</h2>
+<p>
+Discussion about the CHISE Project occur in the CHISE-{ja|en} mailing list.
+<p>
+Anybody who would like to take part in the discussion about and
+development of the CHISE Project, has ideas or questions about the
+implementation or wishes for new features is welcome to join either
+the English, or the Japanese or both lists.
+<p>
+To become a member in the CHISE mailing, send a message to the
+following adress:
+<dl compact>
+  <dt>For Japanese:
+  <dd><a href="mailto:chise-ja-ctl@m17n.org">
+      chise-ja-ctl@m17n.org</a>
+
+  <dt>For English:
+  <dd><a href="mailto:chise-en-ctl@m17n.org">
+      chise-en-ctl@m17n.org</a>
+</dl>
+
+with the word 
+<blockquote>subscribe Your Name</blockquote>
+in the body of the message.  You will then receive a conformation
+message with the line
+
+<blockquote>
+confirm PASSWORD Your Name
+</blockquote> You will have to reply to this message to become a member.
+
+
+<hr>
+
+<h2>Papers and Presentations</h2>
+<ul>
+  <li><a href="xemacs/#presentation">
+      About XEmacs</a>
+  <li><a href="#network">About mathematical analysis of Character Information</a>
+  <li>Other
+      <ul>
+       <li><a href="papers/u2k-plan.ja/">
+       "Model and Implementation of a Next Generation Multilingual
+       Processing System"
+           </a> (October 1999)
+       <li>WITTERN, Christian, \e$B!H\e(BNon-system characters in XML documents\e$B!I\e(B, in:
+       <i>The Frontier of Asian Information Processing</i>
+       [Seminar Series of the National Documentation and
+                                                       Information Centers in Humanities] No. 10, November 2000
+       <li>MORIOKA Tomohiko, \e$B!V\e(BThe UTF-2000 Project\e$B!W\e(B, in:
+           <a
+           href="http://www.kanji.zinbun.kyoto-u.ac.jp/publications/kanji-and-info-2.pdf">
+           Kanji and Information, No.2</a>, March 2001
+       <li>MORIOKA Tomohiko,\e$B!H\e(BCHISE project &emdash; beyond the UTF-2000\e$B!I\e(B,
+           <a href="http://www.m17n.org/m17n2001/">
+           m17n2001: the Fifth International Symposium on Multilingual
+           Information Processing and Open Source Software
+           </a>.
+       <li>MORIOKA Tomohiko, \e$B!H\e(BA Short Introduction to UTF-2000 Project\e$B!I\e(B,
+           the First TEI Character Set Issues Working Group (October 2001,
+           University of California, Berkeley, USA).
+       <li>WITTERN, Christian, "What is Digitisation?", in:
+           <a
+           href="http://www.kanji.zinbun.kyoto-u.ac.jp/publications/kanji-and-info-3.pdf">
+           Kanji and Information, No.3</a>, October 2001
+       <li>MORO Shigeki, "The meaning of 'beyond character codes'", in:
+           <a
+           href="http://www.kanji.zinbun.kyoto-u.ac.jp/publications/kanji-and-info-3.pdf">
+           Kanji and Information, No.3</a>, October 2001
+       <li>WITTERN, Christian, \e$B!H\e(BSome thoughts on the digitization of Kanji\e$B!I\e(B,
+       <i>Information Technology and the Humanities</i>
+       [Seminar Series of the National Documentation and
+                                                       Information Centers in Humanities] No. 11, November 2001
+      </ul>
+</ul>
+
+<h2><a name="history">History</a></h2>
+
+<hr>
+
+<br>
+<b>[<a href="http://www.kanji.zinbun.kyoto-u.ac.jp/">Documentation and Information Center for Chinese Studies</a> at the 
+<a href="http://www.zinbun.kyoto-u.ac.jp/">
+Institute for Research in the Humanities</a>&nbsp;
+<a href="http://www.kanji.zinbun.kyoto-u.ac.jp/projects/">
+Related Projects
+</a>]</b>
+<p><img SRC="images/dragon.jpg" height=146 width=198></center>
+
+</body>
+</html>
+<!-- Keep this comment at the end of the file
+Local variables:
+mode: text
+tab-width: 8
+time-stamp-line-limit:40
+-->
\ No newline at end of file