Chinese Main Verb Identification: From Specification to Realization

Ding, Bing-gong; Huang, Chang-ning; Huang, De-gen

熱門：

首頁

臺灣期刊 法律公行政治醫事相關財經社會學教育其他

大陸期刊 核心重要期刊

DOI文章

	本站僅提供期刊文獻檢索。　　【月旦知識庫】是否收錄該篇全文，敬請【登入】查詢為準。最新【購點活動】
篇名	Chinese Main Verb Identification: From Specification to Realization
作者	Ding, Bing-gong (Ding, Bing-gong)、Huang, Chang-ning (Huang, Chang-ning)、Huang, De-gen (Huang, De-gen)
中文摘要	Main verb identification is the task of automatically identifying the predicate-verb in a sentence. It is useful for many applications in Chinese Natural Language Processing. Although most studies have focused on the model used to identify the main verb, the definition of the main verb should not be overlooked. In our specification design, we have found many complicated issues that still need to be resolved since they haven’t been well discussed in previous works. Thus, the first novel aspect of our work is that we carefully design a specification for annotating the main verb and investigate various complicated cases. We hope this discussion will help to uncover the difficulties involved in this problem. Secondly, we present an approach to realizing main verb identification based on the use of chunk information, which leads to better results than the approach based on part-of-speech. Finally, based on careful observation of the studied corpus, we propose new local and contextual features for main verb identification. According to our specification, we annotate a corpus and then use a Support Vector Machine (SVM) to integrate all the features we propose. Our model, which was trained on our annotated corpus, achieved a promising F score of 92.8%. Furthermore, we show that main verb identification can improve the performance of the Chinese Sentence Breaker, one of the applications of main verb identification, by 2.4%.
起訖頁	53-93
關鍵詞	Chinese main verb identification、Text analysis、Natural language processing、SVM
刊名	中文計算語言學期刊
期數	200503 (10:1期)
出版單位	中華民國計算語言學學會
該期刊-上一篇	Automated Alignment and Extraction of a Bilingual Ontology for Cross-Language Domain-Specific Applications
該期刊-下一篇	Aligning Parallel Bilingual Corpora Statistically with Punctuation Criteria