Modeling Cantonese Pronunciation Variations for Large-Vocabulary Continuous Speech Recognition

Lee, Tan; Kam, Patgi; Soong, Frank K.

Title	Modeling Cantonese Pronunciation Variations for Large-Vocabulary Continuous Speech Recognition
Authors	Lee, Tan; Kam, Patgi; Soong, Frank K.
Abstract	This paper presents different methods of handling pronunciation variations in Cantonese large-vocabulary continuous speech recognition (LVCSR). In an LVCSR system, three knowledge sources are involved: a pronunciation lexicon, acoustic models and language models. In addition, a decoding algorithm is used to search for the most likely word sequence. Pronunciation variation can be handled by explicitly modifying the knowledge sources or improving the decoding method. Two types of pronunciation variations are defined, namely, phone changes and sound changes. A phone change means that one phoneme is realized as another phoneme. A sound change happens when the acoustic realization is ambiguous between two phonemes. Phone changes are handled by constructing a pronunciation variation dictionary to include alternative pronunciations at the lexical level or by dynamically expanding the search space to include those pronunciation variants. Sound changes are handled by adjusting the acoustic models through sharing or adaptation of the Gaussian mixture components. Experimental results show that the use of a pronunciation variation dictionary and the method of dynamic search space expansion can improve speech recognition performance substantially. The methods of acoustic model refinement were found to be relatively less effective in our experiments.
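Pages	17-35
Keywords	Automatic speech recognition, Pronunciation variation, Cantonese
Journal	中文計算語言學期刊 (International Journal of Computational Linguistics and Chinese Language Processing)
Issue	March 2006 (Vol. 11, No. 1)
Publisher	中華民國計算語言學學會 (Association for Computational Linguistics and Chinese Language Processing)
Previous article in this issue	Using Duration Information in Cantonese Connected-Digit Recognition
Next article in this issue	A Maximum Entropy Approach for Semantic Language Modeling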
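
The abstract describes handling phone changes by building a pronunciation variation dictionary that lists alternative pronunciations at the lexical level. A minimal sketch of that idea follows; it is not the authors' implementation. The rule table `INITIAL_CHANGES`, the helper names, the (initial, final) syllable representation and the example entry are all hypothetical and chosen only for illustration; the record does not say which phone changes the paper actually models.

```python
from itertools import product

# Hypothetical phone-change rules on syllable initials (illustrative only,
# not taken from the paper): base initial -> possible surface realizations.
INITIAL_CHANGES = {
    "n": ["l"],    # assumed example: initial n- realized as l-
    "gw": ["g"],   # assumed example: initial gw- realized as g-
}


def syllable_variants(syllable):
    """Return the base syllable followed by its phone-change variants."""
    initial, final = syllable
    alternatives = [initial] + INITIAL_CHANGES.get(initial, [])
    return [(alt, final) for alt in alternatives]


def expand_lexicon(lexicon):
    """Map each word to every combination of per-syllable variants.

    The baseline pronunciation always comes first in each list.
    """
    expanded = {}
    for word, syllables in lexicon.items():
        per_syllable = [syllable_variants(s) for s in syllables]
        expanded[word] = [tuple(combo) for combo in product(*per_syllable)]
    return expanded


# Hypothetical baseline lexicon: word -> list of (initial, final) syllables.
baseline = {"nei5hou2": [("n", "ei5"), ("h", "ou2")]}
pvd = expand_lexicon(baseline)
```

Enumerating every combination keeps the sketch short, but the number of variants grows multiplicatively with word length, so a practical dictionary would presumably need some form of pruning; how the paper selects its variants is not stated in this record.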