Sense Extraction and Disambiguation for Chinese Words from Bilingual Terminology Bank

Bai, Ming-hong; Keh-Jiann Chen; Chang, Jason S.

月旦知識庫會員登入｜元照網路書店｜月旦品評家

熱門：

首頁

臺灣期刊 法律公行政治醫事相關財經社會學教育其他

大陸期刊 核心重要期刊

DOI文章

	本站僅提供期刊文獻檢索。　　【月旦知識庫】是否收錄該篇全文，敬請【登入】查詢為準。最新【購點活動】
篇名	Sense Extraction and Disambiguation for Chinese Words from Bilingual Terminology Bank
作者	Bai, Ming-hong (Bai, Ming-hong)、Keh-Jiann Chen (Keh-Jiann Chen)、Chang, Jason S. (Chang, Jason S.)
中文摘要	Using lexical semantic knowledge to solve natural language processing problems has been getting popular in recent years. Because semantic processing relies heavily on lexical semantic knowledge, the construction of lexical semantic databases has become urgent. WordNet is the most famous English semantic knowledge database at present; many researches of word sense disambiguation adopt it as a standard. Because of the success of WordNet, there is a trend to construct WordNet in different languages. In this paper, we propose a methodology for constructing Chinese WordNet by extracting information from a bilingual terminology bank. We developed an algorithm of word-to-word alignment to extract the English-Chinese translation-equivalent word pairs first. Then, the algorithm disambiguates word senses and maps Chinese word senses to WordNet synsets to achieve the goal. In the word-to-word alignment experiment, this alignment algorithm achieves the f-score of 98.4%. In the word sense disambiguation experiment, the extracted senses cover 36.89% of WordNet synsets and the accuracy of the three proposed disambiguation rules achieve the accuracies of 80%, 83% and 87%, respectively.
起訖頁	223-243
關鍵詞	Word alignment、Word sense disambiguation、WordNet、EM algorithm、Sense tagging
刊名	中文計算語言學期刊
期數	200609 (11:3期)
出版單位	中華民國計算語言學學會
該期刊-上一篇	An Empirical Study of Word Error Minimization Approaches for Mandarin Large Vocabulary Continuous Speech Recognition
該期刊-下一篇	A Probe into Ambiguities of Determinative-Measure Compounds