月旦知識庫
 
  1. 熱門:
 
首頁 臺灣期刊   法律   公行政治   醫事相關   財經   社會學   教育   其他 大陸期刊   核心   重要期刊 DOI文章
中文計算語言學期刊 本站僅提供期刊文獻檢索。
  【月旦知識庫】是否收錄該篇全文,敬請【登入】查詢為準。
最新【購點活動】


篇名
Chinese Word Segmentation as Character Tagging
作者 Xue, Nianwen (Xue, Nianwen)
中文摘要
In this paper we report results of a supervised machine-learning approach to Chinese word segmentation. A maximum entropy tagger is trained on manually annotated data to automatically assign to Chinese characters, or hanzi, tags that indicate the position of a hanzi within a word. The tagged output is then converted into segmented text for evaluation. Preliminary results show that this approach is competitive against other supervised machine-learning segmenters reported in previous studies, achieving precision and recall rates of 95.01% and 94.94% respectively, trained on a 237K-word training set.
起訖頁 29-47
關鍵詞 Chinese word segmentationSupervised machine-learningMaximum entropyCharacter tagging
刊名 中文計算語言學期刊  
期數 200302 (8:1期)
出版單位 中華民國計算語言學學會
該期刊-上一篇 Customizable Segmentation of Morphologically Derived Words in Chinese
該期刊-下一篇 Measuring and Comparing the Productivity of Mandarin Chinese Suffixes
 

新書閱讀



最新影音


優惠活動




讀者服務專線:+886-2-23756688 傳真:+886-2-23318496
地址:臺北市館前路28 號 7 樓 客服信箱
Copyright © 元照出版 All rights reserved. 版權所有,禁止轉貼節錄