Quantitative Criteria for Computational Chinese Lexicography: A Study Based on a Standard Reference Lexicon for Chinese NLP

Chu-Ren Huang; Zhao-ming Gao; Claude C.C. Shen; Keh-Jiann Chen

Title	Quantitative Criteria for Computational Chinese Lexicography: A Study Based on a Standard Reference Lexicon for Chinese NLP
Authors	Chu-Ren Huang, Zhao-ming Gao, Claude C.C. Shen, Keh-Jiann Chen
Abstract	The construction of a standard reference lexicon for Chinese NLP involves two fundamental issues in computational linguistics: the definition of a word and the principled delimitation of the lexicon. We argue that such reference lexicons must be judged by their cross-domain portability, expressive adequacy, and reusability. Principles for lexical selection must therefore also be driven by these criteria. This paper reports the approach and results of our construction of a standard reference lexicon for Chinese NLP, which also serves as the empirical basis for a segmentation standard. Our approach uses a mixture of stochastic and heuristic steps. First, a reference corpus is selected and lexical entries are automatically extracted from it based on a statistically significant threshold. Second, the coverage of the automatically extracted lexicon is enhanced by conceptual primes as well as by comparative studies of machine-readable dictionaries (MRDs) from different Chinese-speaking communities. We show the satisfactory coverage of the resulting lexicon by testing it on randomly accessed texts from the web.
Pages	87-108
Journal	Proceedings of ROCLING
Issue	1998
Publisher	Graduate Institute of Guidance and Counseling, National Kaohsiung Normal University
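
The two quantitative steps named in the abstract, threshold-based extraction from a reference corpus and coverage testing on unseen text, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the function names `extract_lexicon` and `token_coverage`, the use of a raw frequency count as the threshold statistic, the `min_count` value, and the toy pre-segmented sentences.

```python
from collections import Counter


def extract_lexicon(segmented_corpus, min_count=2):
    """Keep every word whose corpus frequency reaches min_count.

    A raw count is a stand-in (assumption) for whatever statistical
    threshold the authors actually applied.
    """
    counts = Counter(word for sentence in segmented_corpus for word in sentence)
    return {word for word, n in counts.items() if n >= min_count}


def token_coverage(lexicon, segmented_text):
    """Return the fraction of tokens in segmented_text found in lexicon."""
    tokens = [word for sentence in segmented_text for word in sentence]
    if not tokens:
        return 0.0
    return sum(1 for word in tokens if word in lexicon) / len(tokens)


# Toy, hand-segmented data (hypothetical, for illustration only).
corpus = [
    ["我們", "喜歡", "語言"],
    ["我們", "研究", "語言"],
    ["他們", "研究", "詞典"],
]
lexicon = extract_lexicon(corpus, min_count=2)

test_text = [["我們", "研究", "詞典"]]
coverage = token_coverage(lexicon, test_text)
print(sorted(lexicon), round(coverage, 3))
```

In this toy setting the word 詞典 falls below the threshold and is missed in the test text, which is the kind of gap the abstract's second, heuristic step (conceptual primes and comparison of dictionaries) is described as closing.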