
A Three-Phase System for Chinese Named Entity Recognition
Authors: Conrad Chen, Hsi-Jian Lee
The handling of out-of-vocabulary (OOV) words is one of the keys to high-performance lexical analysis in natural language processing. Among all OOV words, named entities (NE) are the most productive. They generally constitute the most meaningful parts of sentences (persons, affairs, time, places, and objects). In this paper, we propose a three-phase "generation, filtering, and recovery" system to address the named entity recognition (NER) problem. A set of stochastic models is first used to generate all possible NE candidates. We then treat candidate filtering as an ambiguity resolution problem and resolve the ambiguities with a lexical analyzer driven by maximal-matching rules. Last, a pattern matching method is applied to detect and recover abnormalities in the results of the previous two phases. Our system exploits only lexical information. We obtain a high recall of 96% for personal names (PER) and satisfactory recalls of 88%, 89%, and 80% for transliteration names (TRA), location names (LOC), and organization names (ORG), respectively. The overall precision and excluding rate are over 90% and 99%, respectively.
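The filtering phase described above can be illustrated with a minimal sketch. The abstract does not spell out the analyzer's actual rules, so the code below assumes the simplest variant, greedy forward maximal matching, and treats the generated NE candidates as extra vocabulary entries. The function names `max_match` and `filter_candidates`, the toy lexicon, and the candidate set are all hypothetical and chosen only for illustration.

```python
def max_match(text, vocab, max_len=None):
    """Greedy forward maximal matching (an assumed simplification).

    At each position, take the longest entry of `vocab` that matches;
    fall back to a single character when nothing matches.
    """
    if max_len is None:
        max_len = max((len(w) for w in vocab), default=1)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest window first, shrinking down to one character.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in vocab:
                tokens.append(piece)
                i += size
                break
    return tokens


def filter_candidates(text, lexicon, candidates):
    """Keep only the NE candidates that survive segmentation.

    Candidates compete with ordinary lexicon words; a candidate is kept
    only if the segmenter actually selects it as a token.
    """
    candidates = set(candidates)
    vocab = set(lexicon) | candidates
    tokens = max_match(text, vocab)
    kept = [t for t in tokens if t in candidates]
    return tokens, kept


# Hypothetical toy data: an over-generated candidate set for one sentence.
lexicon = {"在", "台北", "工作"}
candidates = {"王小明", "小明", "明在"}
tokens, kept = filter_candidates("王小明在台北工作", lexicon, candidates)
print(tokens)
print(kept)
```

In this toy run, the overlapping candidates "小明" and "明在" are discarded because the longer candidate "王小明" wins the match at the start of the sentence. A real system would need richer disambiguation rules than a single greedy pass, plus the recovery phase that the abstract mentions.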
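Pages: 1-10
Journal: ROCLING Proceedings (ROCLING論文集)
Issue: 2004
Publisher: The Association for Computational Linguistics and Chinese Language Processing (中華民國計算語言學學會)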