
Unsupervised Overlapping Feature Selection for Conditional Random Fields Learning in Chinese Word Segmentation
Authors: Ting-Hao Yang, Tian-Jian Jiang, Chan-Hung Kuo, Richard Tzong-Han Tsai, Wen-Lian Hsu
This work presents several unsupervised feature selection methods based on frequent strings that help improve a conditional random fields (CRF) model for Chinese word segmentation (CWS). The features include character-based N-grams (CNG), Accessor Variety based strings (AVS), and Term Contributed Frequency (TCF), combined with a specific manner of boundary overlapping. In the experiments, the baseline is the 6-tag scheme, a state-of-the-art labeling scheme for CRF-based CWS, and the data set is taken from the SIGHAN CWS bakeoff 2005. The experimental results show that all of these features improve our system's F1 measure (F) and recall of out-of-vocabulary words (ROOV). In particular, feature collections that contain the AVS feature outperform other types of features in terms of F, whereas feature collections containing TCB/TCF information have better ROOV.
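As a rough illustration of the Accessor Variety idea behind the AVS feature, the sketch below counts, for every substring up to a given length, how many distinct characters appear immediately to its left and to its right in an unlabeled corpus, and takes the smaller of the two counts. This follows the usual definition of accessor variety and is not the authors' implementation. The function name `accessor_variety`, the `max_len` parameter, and the choice to treat every sentence boundary as its own distinct neighbor are assumptions made for this example.

```python
from collections import defaultdict


def accessor_variety(corpus, max_len=2):
    """Return {substring: AV} for all substrings of up to max_len characters.

    AV(s) = min(distinct left neighbors of s, distinct right neighbors of s).
    Assumption: each sentence start/end counts as a separate neighbor.
    """
    left = defaultdict(set)   # substring -> set of preceding characters
    right = defaultdict(set)  # substring -> set of following characters
    for sent_id, sent in enumerate(corpus):
        n = len(sent)
        for i in range(n):
            for j in range(i + 1, min(i + max_len, n) + 1):
                s = sent[i:j]
                # Tag boundary markers with the sentence id so that
                # boundaries from different sentences stay distinct.
                left[s].add(sent[i - 1] if i > 0 else ("<s>", sent_id))
                right[s].add(sent[j] if j < n else ("</s>", sent_id))
    return {s: min(len(left[s]), len(right[s])) for s in left}


if __name__ == "__main__":
    # Tiny hypothetical corpus of unsegmented sentences.
    scores = accessor_variety(["我喜歡自然語言", "他研究自然語言處理"], max_len=4)
    for s in ("自然", "自然語言", "然語"):
        print(s, scores[s])
```

Strings that occur in many different contexts get a high score and are more likely to be words; a feature set in the spirit of the abstract would then expose such scores to the CRF as additional character-level features.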
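Pages: 109-122
Keywords: Word Segmentation; Unsupervised Feature Selection; Conditional Random Fields
Publication: ROCLING論文集 (ROCLING Proceedings)
Issue: 2011
Publisher: 中華民國計算語言學學會 (The Association for Computational Linguistics and Chinese Language Processing)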



