Segmentation Standard for Chinese Natural Language Processing

Title	Segmentation Standard for Chinese Natural Language Processing
Authors	Huang, Chu-ren; Chen, Keh-Jiann; Chen, Feng-yi; Chang, Li-li
Abstract	This paper proposes a segmentation standard for Chinese natural language processing. The standard is designed to achieve linguistic felicity, computational feasibility, and data uniformity. Linguistic felicity is maintained by a definition of the segmentation unit that is equivalent to the theoretical definition of a word, together with a set of segmentation principles that are equivalent to a functional definition of a word. Computational feasibility is ensured because these functional definitions are procedural in nature and can be converted into segmentation algorithms, and because the standard includes implementable heuristic guidelines that deal with specific linguistic categories. Data uniformity is achieved by stratifying the standard itself and by defining a standard lexicon as part of the standard.
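Pages	47-62
Keywords	Chinese natural language
Journal	中文計算語言學期刊 (Computational Linguistics and Chinese Language Processing)
Issue	August 1997 (Vol. 2, No. 2)
Publisher	中華民國計算語言學學會 (The Association for Computational Linguistics and Chinese Language Processing)
Previous article in this issue	Longest Tokenization
Next article in this issue	Aligning More Words with High Precision for Small Bilingual Corpora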