Chinese Word Segmentation and Part-of-Speech Tagging in One Step

臺灣期刊 法律公行政治醫事相關財經社會學教育其他

大陸期刊 核心重要期刊

DOI文章

	本站僅提供期刊文獻檢索。　　【月旦知識庫】是否收錄該篇全文，敬請【登入】查詢為準。最新【購點活動】
篇名	Chinese Word Segmentation and Part-of-Speech Tagging in One Step
作者	Tom B.Y. Lai (Tom B.Y. Lai)、Maosong Sun (Maosong Sun)、Benjamin K. T'sou (Benjamin K. T'sou)、S. Caesar Lun (S. Caesar Lun)
英文摘要	In Chinese natural language processing, word segmentation and part-of-speech tagging is generally carried out as two separate steps. Earlier, the authors introduced a tag-based Markov-model approach to word segmentation. As the tags are of a syntactic nature, this is effectively doing word segmentation and part-of-speech tagging simultaneously. We have used a best-first algorithm with empirical results showing the search for the best solution to be efficient for inputs of reasonable length. In this paper, we will see that the job can be done using an O(n2) algorithm. In our experiments, we actually had the algorithm reduced to O(n) by setting a maximum number of character for words in Chinese to a constant. We also show that performing word segmentation and part-of-speech tagging in one step will bring about improvement in accurracy.
起訖頁	229-236
刊名	ROCLING論文集
期數	1997 (1997期)
出版單位	國立高雄師範大學輔導與諮商研究所
該期刊-上一篇	The Role of Shared Attention in Human-Computer Conversation
該期刊-下一篇	Corpus-Based Chinese Text Summarization System