中文摘要 |
XML 是新一代網際網路語言標準,使用者可以自行定義網頁中之標籤( Tag),經過網頁瀏覽器或程式解讀,可以讓全球的 XML 網頁互相溝通及進行資料交換,是近代一項網際網路之重大改革。本研究使用包裝器( Wrapper)/中介工具( Mediator)之系統架設觀念來建構,系統基本上可以分為三層:第一層為中醫古籍文獻整合架構,使用文獻萃取( InformationExtraction)技術,將不同來源的自由文字( Free Text)檔轉換成規格統一之資料庫表格檔,本文選擇類證治裁作為系統開發範本。第二層為建構中醫古籍文獻資料庫儲存所( Repository),本研究以樹狀結構( Tree Structure)建置類證治裁模型( Model),並運用微軟 SQL 2000 伺服器,將文獻模型整合至資料庫伺服器中,第三層為 XML 結構描述( Schema),透過 SQLXML虛擬目錄( Virtual Directory)將資料庫儲存所內的資料以 XML 文件方式送至網際網路上供瀏覽,經由 Xpath Query 技術網友可以隨意搜尋樹狀結構中任何節點中之資料。本研究初期以建立三層系統架構為目的,日後尚有眾多研究課題有待繼續進行。XML is rapidly emerging as the new standard for data representation and exchange on the internet. Tags on the XML homepage can be defined by the developer and this kind of document can be interpreted by the internet browser or a snippet of coding so that XML is being considered a newly standardized language as well as a revolution on the world-wide-web. Architecture of this system is based on the Wrapper/Mediator concepts, it can be subdivided into 3 layers: Layer 1 is the information extraction layer which integrating heterogeneous sources from Classical Chinese Medicinal Literatures and converting the free text files into tabular form in the relational database. In this study, Lei-Zheng-Zhi-Cai, a Chinese medicinal text from the Ming dynasty, is used. Layer 2 is the database repository of the Chinese literature which is developed by the Microsoft SQL 2000 server. Relational tables of Lei-Zheng-Zhi-Cai retreival model in form of tree structure which are stored in the repository. Layer 3 is the XML Schema which retrieving information stored in the repository through the SQLXML Virtual directory. Partial or complete filtration of the tree structure from the XML document can be generated by applying the Xpath Query on the Internet. In this study, we focus on developing the three layer architecture and further research topics are expected in the future. |