English Abstract |
Named-entity phrases in free text represent a formidable challenge to text analysis. Translating named entities is important for Cross-Language Information Retrieval and Question Answering. However, neither task is easy to handle, because named entities found in free text are often not listed in monolingual or bilingual dictionaries. Although it is possible to identify and translate named entities on the fly without a list of proper names and transliterations, an extensive list will certainly ensure a high accuracy rate in text analysis. We use a list of proper names and transliterations to train a Machine Transliteration Model. With this model, it is possible to extract proper names and their transliterations from a bilingual corpus with high average precision and recall rates. |