A Model for Word Sense Disambiguation

Li, Juanzi; Huang, Chang-ning

Title	A Model for Word Sense Disambiguation
Authors	Li, Juanzi; Huang, Chang-ning
Abstract	Word sense disambiguation is one of the most difficult problems in natural language processing. This paper proposes a model that maps the structural semantic space of a thesaurus into a multi-dimensional, real-valued vector space, and presents a word sense disambiguation method based on this mapping. Because the model acquires its disambiguation knowledge through unsupervised learning, it avoids extensive manual work and makes it possible to sense-tag a large number of content words. First, the Chinese thesaurus Cilin and a very large corpus are used to construct the structure of the semantic space. Then, a dynamic disambiguation model is developed that disambiguates an ambiguous word according to the vectors of the monosemous words in each of its possible categories. To address the problem of data sparseness, a method is proposed that makes the model more robust. Test results show that the model performs relatively well and can also be applied to other languages.
Pages	1-19
Keywords	Natural language processing, Word sense disambiguation, Unsupervised learning, Vector space, Language modeling
Journal	中文計算語言學期刊
Issue	August 1999 (Vol. 4, No. 2)
Publisher	中華民國計算語言學學會
Next article in this issue	Resolving Translation Ambiguity and Target Polysemy in Cross-Language Information Retrieval
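
The abstract describes disambiguating an ambiguous word by comparing it with the vectors of the monosemous words in each of its candidate thesaurus categories. The following is a minimal sketch of that general idea only, not the authors' model. The toy English thesaurus and corpus, the raw co-occurrence counts, the cosine similarity measure, and every function name are assumptions made for illustration; the record does not describe the paper's actual mapping or its treatment of data sparseness.

```python
from collections import Counter
from math import sqrt

# Hypothetical toy thesaurus: category -> member words (not taken from Cilin).
THESAURUS = {
    "finance": ["loan", "deposit", "bank"],
    "river": ["shore", "stream", "bank"],
}

# Hypothetical toy corpus of tokenized sentences.
CORPUS = [
    ["loan", "interest", "money"],
    ["deposit", "money", "account"],
    ["shore", "water", "sand"],
    ["stream", "water", "fish"],
]


def monosemous_words(thesaurus):
    """Return the words that belong to exactly one category."""
    counts = Counter(w for words in thesaurus.values() for w in words)
    return {w for w, n in counts.items() if n == 1}


def category_vectors(thesaurus, corpus):
    """Build one context-count vector per category.

    Only monosemous members contribute, so no sense labels are needed.
    """
    mono = monosemous_words(thesaurus)
    vectors = {cat: Counter() for cat in thesaurus}
    for cat, words in thesaurus.items():
        members = mono.intersection(words)
        for sentence in corpus:
            for target in members.intersection(sentence):
                vectors[cat].update(t for t in sentence if t != target)
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse count vectors (assumed measure)."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def disambiguate(word, context, thesaurus, vectors):
    """Pick the candidate category whose vector best matches the context."""
    candidates = [cat for cat, words in thesaurus.items() if word in words]
    ctx = Counter(t for t in context if t != word)
    return max(candidates, key=lambda cat: cosine(ctx, vectors[cat]))


vectors = category_vectors(THESAURUS, CORPUS)
print(disambiguate("bank", ["bank", "money", "account"], THESAURUS, vectors))
```

In this sketch the ambiguous word "bank" never contributes to any category vector, which mirrors the unsupervised setting described in the abstract: all evidence comes from words whose category is unambiguous.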