
Automatic Recognition of Cantonese-English Code-Mixing Speech
Authors: Joyce Y. C. Chan, Houwei Cao, P. C. Ching, Tan Lee
Code-mixing is a common phenomenon in bilingual societies. It refers to intra-sentential switching between two languages within a spoken utterance. This paper presents the first study on automatic recognition of Cantonese-English code-mixing speech, which is common in Hong Kong. The study begins with the design and compilation of code-mixing speech and text corpora. The problems of acoustic modeling, language modeling, and language boundary detection are then investigated. Subsequently, a large-vocabulary code-mixing speech recognition system is developed based on a two-pass decoding algorithm. For acoustic modeling, cross-lingual acoustic models are shown to be more appropriate than language-dependent models. The language models are character tri-grams in which the embedded English words are grouped into a small number of classes. Language boundary detection is performed either by exploiting the phonological and lexical differences between the two languages or by using the results of cross-lingual speech recognition. The language boundary information is used to re-score the hypothesized syllables or words in the decoding process. The proposed code-mixing speech recognition system attains accuracies of 56.4% for Cantonese syllables and 53.0% for English words in code-mixing utterances.
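The class-based character tri-gram idea mentioned in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the authors' implementation: the class labels (`<EN_NOUN>`, `<EN_VERB>`, `<EN_OTHER>`), the word-to-class map, the toy sentences, and the unsmoothed maximum-likelihood estimate are all hypothetical. The abstract does not say how the English classes were defined or how the model was smoothed.

```python
from collections import Counter

# Hypothetical word-to-class map; the abstract does not list the actual classes.
ENGLISH_CLASSES = {
    "meeting": "<EN_NOUN>",
    "email": "<EN_NOUN>",
    "check": "<EN_VERB>",
    "book": "<EN_VERB>",
}


def to_class_tokens(tokens, class_map, default="<EN_OTHER>"):
    """Replace each embedded English word with its class label.

    Chinese characters pass through unchanged. A token counts as English
    here if it consists only of ASCII letters (a simplifying assumption).
    """
    out = []
    for tok in tokens:
        if tok.isascii() and tok.isalpha():
            out.append(class_map.get(tok.lower(), default))
        else:
            out.append(tok)
    return out


def count_trigrams(sentences, class_map):
    """Count tri-grams over class-mapped token sequences."""
    counts = Counter()
    for tokens in sentences:
        padded = ["<s>", "<s>"] + to_class_tokens(tokens, class_map) + ["</s>"]
        for i in range(len(padded) - 2):
            counts[tuple(padded[i:i + 3])] += 1
    return counts


def trigram_prob(counts, w1, w2, w3):
    """Unsmoothed maximum-likelihood estimate of P(w3 | w1, w2)."""
    context_total = sum(c for (a, b, _), c in counts.items() if (a, b) == (w1, w2))
    return counts[(w1, w2, w3)] / context_total if context_total else 0.0


# Toy code-mixing sentences, one token per Chinese character or English word.
corpus = [
    ["我", "今", "日", "要", "meeting"],
    ["我", "今", "日", "要", "email", "你"],
]
counts = count_trigrams(corpus, ENGLISH_CLASSES)
print(trigram_prob(counts, "日", "要", "<EN_NOUN>"))
```

Because "meeting" and "email" share one class in this toy map, both sentences contribute to the same tri-gram count, which is the motivation for grouping sparse embedded English words into a small number of classes.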
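Pages: 281-304
Keywords: Automatic Speech Recognition, Code-mixing, Acoustic Modeling, Language Modeling
Journal: International Journal of Computational Linguistics and Chinese Language Processing (中文計算語言學期刊)
Issue: September 2009 (Vol. 14, No. 3)
Publisher: The Association for Computational Linguistics and Chinese Language Processing (中華民國計算語言學學會)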



