中英文語碼轉換語音合成系統開發

練欣柔; 黃立宇; 陳嘉平

月旦知識庫會員登入｜元照網路書店｜月旦品評家

熱門：

首頁

臺灣期刊 法律公行政治醫事相關財經社會學教育其他

大陸期刊 核心重要期刊

DOI文章

	本站僅提供期刊文獻檢索。　　【月旦知識庫】是否收錄該篇全文，敬請【登入】查詢為準。最新【購點活動】
篇名	中英文語碼轉換語音合成系統開發
並列篇名	Development of Mandarin-English Code-switching Speech Synthesis System
作者	練欣柔、黃立宇、陳嘉平
中文摘要	本論文提出中英文語碼轉換語音合成系統。為了使系統可專注於學習不同語言間的內容，利用已統一語者風格的多語言人工資料集進行訓練。之後在合成器中加入語言向量，以增加系統對多語言的掌握。此外對輸入的中、英文分別進行不同的前處理，將中文進行斷詞且轉為漢語拼音，藉此增加語音的自然度，且減輕學習時的複雜度，也透過數字正規化判斷句子中的阿拉伯數字，是否需要加上數字單位。英文部份則對複雜的頭字語進行讀音判斷與轉換。
英文摘要	In this paper, the Mandarin-English codeswitching speech synthesis system has been proposed. To focus on learning the content information between two languages, the training dataset is multilingual artificial dataset whose speaker style is unified. Adding language embedding into the system helps it be more adaptive to multilingual dataset. Besides, text preprocessing is applied and be used in different way which depends on the languages. Word segmentation and text-to-pinyin are the text preprocessing for Mandarin, which not only improves the fluency but also reduces the learning complexity. Number normalization decides whether the arabic numerals in sentence needs to add the digits. The preprocessing for English is acronym conversion which decides the pronunciation of acronym.
起訖頁	47-56
關鍵詞	語音合成、語碼轉換、資料前處理、Speech Synthesize、Codeswitching、Text Preprocessing
刊名	中文計算語言學期刊
期數	202212 (27:2期)
出版單位	中華民國計算語言學學會
該期刊-上一篇	探討語者驗證系統中特徵處理模組與注意力機制
該期刊-下一篇	Analyzing Discourse Functions with Acoustic Features and Phone Embeddings: Non-lexical Items in Taiwan Mandarin