
A Study on Consistency Checking Method of Part-Of-Speech Tagging for Chinese Corpora
Authors: Hu Zhang, Jiaheng Zheng
Ensuring consistency of part-of-speech (POS) tagging plays an important role in the construction of high-quality Chinese corpora. After analyzing the POS tagging of multi-category words in large-scale corpora, we propose a novel classification-based method for checking the consistency of POS tagging. The method builds a vector model of the context of each multi-category word and uses the k-NN algorithm to classify context vectors constructed from POS tagging sequences and thereby judge their consistency. The method is evaluated on our 1.5M-word corpus. The experimental results indicate that the proposed method is feasible and effective.
Pages: 157-169
Keywords: Multi-Category Words; Consistency Checking; Part of Speech Tagging; Chinese Corpus; Classification
Journal: 中文計算語言學期刊
Issue: June 2008 (Vol. 13, No. 2)
Publisher: 中華民國計算語言學學會
Previous article in this issue: Multiple Document Summarization Using Principal Component Analysis Incorporating Semantic Vector Space Model
Next article in this issue: Constructing a Temporal Relation Tagged Corpus of Chinese Based on Dependency Structure Analysis
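
The abstract above describes classifying context vectors built from POS tagging sequences with k-NN in order to judge tagging consistency. The following is a minimal sketch of that general idea, not the authors' implementation. The window size, the positional features, the overlap similarity, the tag names, the toy data, and every function name here are assumptions made for illustration; the abstract does not specify how the vectors are built or weighted.

```python
from collections import Counter


def context_features(tags, index, window=2):
    """Positional POS features around tags[index], e.g. (-1, 'd').

    Assumption: the context is the tags within `window` positions of the
    target word; positions outside the sentence are padded.
    """
    feats = set()
    for offset in range(-window, window + 1):
        if offset == 0:
            continue  # the target word's own tag is what we are checking
        pos = index + offset
        tag = tags[pos] if 0 <= pos < len(tags) else "<PAD>"
        feats.add((offset, tag))
    return feats


def similarity(a, b):
    """Number of context positions whose POS tags agree (assumed measure)."""
    return len(a & b)


def knn_predict(features, training, k=3):
    """Majority tag among the k training contexts most similar to `features`."""
    ranked = sorted(training,
                    key=lambda item: similarity(features, item[0]),
                    reverse=True)
    votes = Counter(tag for _, tag in ranked[:k])
    return votes.most_common(1)[0][0]


def is_consistent(tags, index, training, k=3):
    """True if the assigned tag matches the k-NN prediction from context."""
    predicted = knn_predict(context_features(tags, index), training, k)
    return predicted == tags[index]


# Hypothetical occurrences of one multi-category word (always at index 2),
# tagged either 'v' (verb) or 'n' (noun) depending on its context.
samples = [
    (["d", "d", "v", "n", "n"], 2),
    (["n", "d", "v", "u", "n"], 2),
    (["p", "d", "v", "n", "u"], 2),
    (["v", "u", "n", "u", "n"], 2),
    (["a", "u", "n", "v", "n"], 2),
    (["v", "a", "n", "v", "u"], 2),
]
training = [(context_features(tags, i), tags[i]) for tags, i in samples]

# A verb-like context tagged 'v' agrees with its neighbours.
print(is_consistent(["n", "d", "v", "n", "n"], 2, training))  # True
# The same context tagged 'n' is flagged for manual review.
print(is_consistent(["n", "d", "n", "n", "n"], 2, training))  # False
```

In this sketch a flagged occurrence is only a candidate inconsistency: the k-NN vote says that similar contexts were tagged differently, and a human annotator would still decide which tag is correct.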



