Automatic Segmentation and Labeling for Mandarin Chinese Speech Corpora for Concatenation-based TTS

Lin, Cheng-yuan; Jang, Roger Jyh-shing; Chen, Kuan-ting

月旦知識庫會員登入｜元照網路書店｜月旦品評家

熱門：

首頁

臺灣期刊 法律公行政治醫事相關財經社會學教育其他

大陸期刊 核心重要期刊

DOI文章

	本站僅提供期刊文獻檢索。　　【月旦知識庫】是否收錄該篇全文，敬請【登入】查詢為準。最新【購點活動】
篇名	Automatic Segmentation and Labeling for Mandarin Chinese Speech Corpora for Concatenation-based TTS
作者	Lin, Cheng-yuan (Lin, Cheng-yuan)、Jang, Roger Jyh-shing (Jang, Roger Jyh-shing)、Chen, Kuan-ting (Chen, Kuan-ting)
中文摘要	Precise phone/syllable boundary labeling of the utterances in a speech corpus plays an important role in constructing a corpus-based TTS (text-to-speech) system. However, automatic labeling based on Viterbi forced alignment does not always produce satisfactory results. Moreover, a suitable labeling method for one language does not necessarily produce desirable results for another language. Hence in this paper, we propose a new procedure for refining the boundaries of utterances in a Mandarin speech corpus. This procedure employs different sets of acoustic features for four different phonetic categories. In addition, a new scheme is proposed to deal with the “periodic voiced + periodic voiced” case, which produced most of the segmentation errors in our experiment. Several experiments were conducted to demonstrate the feasibility of the proposed approach.
起訖頁	145-166
關鍵詞	Speech assessment methods phonetic alphabet、Speech corpus、Sequential forward selection、K-nearest neighbor rule、Leave-one-out、Speaker-adapted model、Context-dependent hidden Markov model、HMM
刊名	中文計算語言學期刊
期數	200506 (10:2期)
出版單位	中華民國計算語言學學會
該期刊-下一篇	The Formosan Language Archive: Linguistic Analysis and Language Processing