Improve Parsing Performance by Self-Learning

Hsieh, Yu-ming; Yang, Duen-chi; Keh-Jiann Chen

月旦知識庫會員登入｜元照網路書店｜月旦品評家

熱門：

首頁

臺灣期刊 法律公行政治醫事相關財經社會學教育其他

大陸期刊 核心重要期刊

DOI文章

	本站僅提供期刊文獻檢索。　　【月旦知識庫】是否收錄該篇全文，敬請【登入】查詢為準。最新【購點活動】
篇名	Improve Parsing Performance by Self-Learning
作者	Hsieh, Yu-ming (Hsieh, Yu-ming)、Yang, Duen-chi (Yang, Duen-chi)、Keh-Jiann Chen (Keh-Jiann Chen)
中文摘要	There are many methods to improve performance of statistical parsers. Resolving structural ambiguities is a major task of these methods. In the proposed approach, the parser produces a set of n-best trees based on a feature-extended PCFG grammar and then selects the best tree structure based on association strengths of dependency word-pairs. However, there is no sufficiently large Treebank producing reliable statistical distributions of all word-pairs. This paper aims to provide a self-learning method to resolve the problems. The word association strengths were automatically extracted and learned by parsing a giga-word corpus. Although the automatically learned word associations were not perfect, the constructed structure evaluation model improved the bracketed f-score from 83.09% to 86.59%. We believe that the above iterative learning processes can improve parsing performances automatically by learning word-dependence information continuously from web.
起訖頁	195-216
關鍵詞	Parsing、Word association、Knowledge extraction、PCFG、PoS tagging、Semantic
刊名	中文計算語言學期刊
期數	200706 (12:2期)
出版單位	中華民國計算語言學學會
該期刊-上一篇	MiniJudge: Software for Small-Scale Experimental Syntax
該期刊-下一篇	A Comparative Study of Histogram Equalization (HEQ) for Robust Speech Recognition