
Authors: Tai-Hsuan Ho, Kae-Cherng Yang, Juei-Sung Lin, Lin-Shan Lee
This paper presents a phoneme-to-text conversion system for Chinese that uses long-distance language modeling. First, we employ extended bigrams (Huang 1993) with window size d to capture long-distance dependencies in Chinese: d bigram tables are estimated independently from the training data, one for each distance from 1 to d. Each bigram table is associated with a mixture weight, which can be optimized on held-out data using the deleted interpolation algorithm (Ney 1994). The system then performs a tree-trellis search (Soong 1991) to generate N-best sentence hypotheses and integrates the extended bigram probabilities at the sentence level. In our experiments, we generate the 200 best sentence hypotheses, and integrating the long-distance bigrams reduces the error rate by about 11% compared with using the word bigram language model alone. Second, to reduce the number of parameters, we merge the extended bigram tables for distances 2 to d into a single long-distance bigram table, disregarding the effect of distance. Because the model complexity is significantly reduced, we derive a very efficient stack decoding algorithm for integrating this long-distance information. Experiments show that the error rate matches that of the d extended bigrams with the N-best search algorithm, while search efficiency is significantly improved.
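The first stage described in the abstract (one bigram table per distance, combined through mixture weights and applied to N-best hypotheses at the sentence level) might be sketched as follows. This is a minimal illustration and not the authors' implementation: the function names, the add-alpha smoothing, the linear form of the mixture, the renormalization of weights near the sentence start, and the hand-picked weights are all assumptions. The abstract states that the weights are optimized by deleted interpolation on held-out data, which is not shown here.

```python
import math
from collections import Counter


def train_distance_bigrams(sentences, d):
    """Estimate one count table per distance k = 1..d (hypothetical helper)."""
    pair_counts = [Counter() for _ in range(d)]
    left_counts = [Counter() for _ in range(d)]
    for words in sentences:
        for i, w in enumerate(words):
            for k in range(1, d + 1):
                if i - k >= 0:
                    pair_counts[k - 1][(words[i - k], w)] += 1
                    left_counts[k - 1][words[i - k]] += 1
    return pair_counts, left_counts


def distance_prob(tables, k, prev, w, vocab_size, alpha=1.0):
    """P_k(w | prev) at distance k, with add-alpha smoothing (an assumption)."""
    pair_counts, left_counts = tables
    num = pair_counts[k - 1][(prev, w)] + alpha
    den = left_counts[k - 1][prev] + alpha * vocab_size
    return num / den


def sentence_log_score(words, tables, weights, vocab_size):
    """Sentence-level log score under a linear mixture of distance bigrams.

    Assumption: near the sentence start, only the distances that have a
    predecessor are used, and their weights are renormalized.
    """
    total = 0.0
    for i in range(1, len(words)):
        ks = [k for k in range(1, len(weights) + 1) if i - k >= 0]
        norm = sum(weights[k - 1] for k in ks)
        p = sum(
            weights[k - 1] * distance_prob(tables, k, words[i - k], words[i], vocab_size)
            for k in ks
        ) / norm
        total += math.log(p)
    return total


def rescore_nbest(hypotheses, tables, weights, vocab_size):
    """Return the hypothesis with the highest mixture score."""
    return max(
        hypotheses,
        key=lambda h: sentence_log_score(h, tables, weights, vocab_size),
    )


# Toy data, purely illustrative; the weights are hand-picked, not optimized.
training = [["a", "b", "c"], ["a", "b", "c"], ["a", "x", "c"]]
vocab = {w for s in training for w in s}
tables = train_distance_bigrams(training, d=2)
weights = [0.7, 0.3]
nbest = [["a", "c", "b"], ["a", "b", "c"]]
print(rescore_nbest(nbest, tables, weights, len(vocab)))
```

In this sketch the N-best list is given directly. In the system the abstract describes, it would come from the tree-trellis search, and the long-distance score would be combined with the word bigram score rather than used alone.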
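Pages: 287-299
Publication: ROCLING Proceedings (ROCLING論文集)
Issue: 1997
Publisher: Graduate Institute of Guidance and Counseling, National Kaohsiung Normal University (國立高雄師範大學輔導與諮商研究所)
Previous article in this publication: Attributive Clauses in Chinese: Theory and Implementation
Next article in this publication: Automatic Speaker Identification Based on Fuzzy Theory and Neural network Using Genetic Algorithm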



