Automatic Recognition of Cantonese-English Code-Mixing Speech

Joyce Y. C. Chan; Houwei Cao; P. C. Ching; Tan Lee

Title	Automatic Recognition of Cantonese-English Code-Mixing Speech
Authors	Joyce Y. C. Chan, Houwei Cao, P. C. Ching, Tan Lee
Abstract	Code-mixing is a common phenomenon in bilingual societies. It refers to the intra-sentential switching between two languages within a spoken utterance. This paper presents the first study on automatic recognition of Cantonese-English code-mixing speech, which is common in Hong Kong. The study starts with the design and compilation of code-mixing speech and text corpora. The problems of acoustic modeling, language modeling, and language boundary detection are investigated. Subsequently, a large-vocabulary code-mixing speech recognition system is developed based on a two-pass decoding algorithm. For acoustic modeling, it is shown that cross-lingual acoustic models are more appropriate than language-dependent models. The language models used are character tri-grams, in which the embedded English words are grouped into a small number of classes. Language boundary detection is done either by exploiting the phonological and lexical differences between the two languages or based on the result of cross-lingual speech recognition. The language boundary information is used to re-score the hypothesized syllables or words in the decoding process. The proposed code-mixing speech recognition system attains accuracies of 56.4% and 53.0% for the Cantonese syllables and English words in code-mixing utterances, respectively.
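Pages	281-304
Keywords	Automatic Speech Recognition, Code-mixing, Acoustic Modeling, Language Modeling
Journal	中文計算語言學期刊 (International Journal of Computational Linguistics and Chinese Language Processing)
Issue	September 2009 (Vol. 14, No. 3)
Publisher	中華民國計算語言學學會 (The Association for Computational Linguistics and Chinese Language Processing)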
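
The abstract describes character tri-gram language models in which embedded English words are grouped into a small number of classes. The following is a minimal sketch of that idea only, not the authors' implementation: the class table `ENGLISH_CLASSES`, the class labels, the fallback `DEFAULT_CLASS`, and the helper names are all hypothetical, since the abstract does not state how the classes were defined. It shows how replacing each English word with a class token before counting lets different English words share tri-gram statistics.

```python
from collections import Counter

# Hypothetical word-to-class table; the actual classes used in the paper
# are not given in the abstract.
ENGLISH_CLASSES = {
    "meeting": "<EN_NOUN>",
    "email": "<EN_NOUN>",
    "check": "<EN_VERB>",
}
DEFAULT_CLASS = "<EN_OTHER>"  # assumed fallback for unlisted English words


def is_english(token):
    """Treat purely ASCII alphabetic tokens as embedded English words."""
    return token.isascii() and token.isalpha()


def to_class_sequence(tokens):
    """Replace each English word with its class token; keep Chinese characters."""
    return [
        ENGLISH_CLASSES.get(t.lower(), DEFAULT_CLASS) if is_english(t) else t
        for t in tokens
    ]


def trigram_counts(sentences):
    """Count tri-grams over class-mapped sequences with sentence padding."""
    counts = Counter()
    for tokens in sentences:
        seq = ["<s>", "<s>"] + to_class_sequence(tokens) + ["</s>"]
        counts.update(zip(seq, seq[1:], seq[2:]))
    return counts


if __name__ == "__main__":
    corpus = [
        ["我", "要", "check", "email"],
        ["我", "要", "send", "email"],
    ]
    for trigram, n in trigram_counts(corpus).items():
        print(trigram, n)
```

Because the English vocabulary in code-mixing text is sparse, pooling words into classes in this way is one common remedy for data sparsity; a full model would also need smoothing and a per-class word distribution, which this sketch omits.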