月旦知識庫
 
  1. 熱門:
 
首頁 臺灣期刊   法律   公行政治   醫事相關   財經   社會學   教育   其他 大陸期刊   核心   重要期刊 DOI文章
中文計算語言學期刊 本站僅提供期刊文獻檢索。
  【月旦知識庫】是否收錄該篇全文,敬請【登入】查詢為準。
最新【購點活動】


篇名
Integrating Dictionary and Web N-grams for Chinese Spell Checking
作者 Jian-Cheng Wu (Jian-Cheng Wu)Hsun-wen Chiu (Hsun-wen Chiu)Jason S. Chang (Jason S. Chang)
中文摘要
Chinese spell checking is an important component of many NLP applications, including word processors, search engines, and automatic essay rating. Nevertheless, compared to spell checkers for alphabetical languages (e.g., English or French), Chinese spell checkers are more difficult to develop because there are no word boundaries in the Chinese writing system and errors may be caused by various Chinese input methods. In this paper, we propose a novel method for detecting and correcting Chinese typographical errors. Our approach involves word segmentation, detection rules, and phrase-based machine translation. The error detection module detects errors by segmenting words and checking word and phrase frequency based on compiled and Web corpora. The phonological or morphological typographical errors found then are corrected by running a decoder based on the statistical machine translation model (SMT). The results show that the proposed system achieves significantly better accuracy in error detection and more satisfactory performance in error correction than the state-of-the-art systems.
起訖頁 17-29
關鍵詞 Chinese Spelling DetectionChinese Spelling CorrectionChinese Similar CharactersNgramLanguage ModelMachine Translation
刊名 中文計算語言學期刊  
期數 201312 (18:4期)
出版單位 中華民國計算語言學學會
該期刊-上一篇 蘊涵句型分析於改進中文文字蘊涵識別系統
該期刊-下一篇 Correcting Serial Grammatical Errors based on N-grams and Syntax
 

新書閱讀



最新影音


優惠活動




讀者服務專線:+886-2-23756688 傳真:+886-2-23318496
地址:臺北市館前路28 號 7 樓 客服信箱
Copyright © 元照出版 All rights reserved. 版權所有,禁止轉貼節錄