應用文脈分析於中英夾雜語音合成系統

洪翌翔; 黃奕欽; 鄧廣豐

月旦知識庫會員登入｜元照網路書店｜月旦品評家

熱門：

首頁

臺灣期刊 法律公行政治醫事相關財經社會學教育其他

大陸期刊 核心重要期刊

DOI文章

	本站僅提供期刊文獻檢索。　　【月旦知識庫】是否收錄該篇全文，敬請【登入】查詢為準。最新【購點活動】
篇名	應用文脈分析於中英夾雜語音合成系統
並列篇名	Linguistic Analysis for English/Mandarin Speech Synthesis System
作者	洪翌翔、黃奕欽、鄧廣豐
中文摘要	本論文將藉由文脈分析的處理，實作出一套中英夾雜的語音系統。在語音模型的建模上，採取統計式模型中的隱藏式馬可夫模型（Hidden Markov Model）做為基礎針對中文以及英文進行處理。在系統的實作中，首先在合成語音前先將文字做前語言處理切割成中文和英文的部分，接著將中文與英文分別已預先訓練好的的中文／英文之語音模型分別進行合成，最終將各自合成的部份進行語音段的串接。其中，由於中文以及英文為不同的語言，為了維持整段話的連貫性，若整個句子以中文句當作主體，並且將此中英夾雜句中的英文字的部份，透過其詞性分析（POS Analysis）找出其詞性後，將此英文字置換成與其詞性相同的中文字（Substitute Word，縮寫為SW），使其與原英文字的詞性相同，在中文主體句中，則透過置換過後的中文句來進行文脈分析，挑選合適的中文語音模型，並用來為合成整段中文句子，並且將合成好的英文部分替換回該句中完成中英文夾雜的句子。透過實驗分析顯示，透過文脈的分析，能夠幫助合成的句子的語流較為順暢，因而提升中英夾雜句的何成語音更為自然。
英文摘要	In this study, we analysis the effect of the linguistic information for the English/Mandarin speech synthesis system. In order to construct the acoustic models for both languages, we adopted the Hidden Markov Model. For the system implementation, we firstly detected the language segments for each language of the input bilingual sentence, and then independently generate the feature sequences for each language. However, for generating fluent synthesized speech, the linguistic information should be taken into account. Here, if the bilingual sentence is mainly written in Mandarin with a few English words, we firstly analyze the Part-Of-Speech information for the English words. Then, we adopted some substitute words (SW) to translate the English parts into Mandarin which have the same POS tags as their corresponding English words. Finally, The entire sentence consists of only one language and could be analyzed linguistically and keep its context information. Finally, the synthesized speech should be more fluent since the contextual linguistic information is used for choosing the suitable acoustic model sequence. In order to construct the original bilingual speech utterance, the English segment is substituted back to the synthesized speech. Experimental results showed that adding the contextual linguistic information is indeed helpful for generating fluent speech for the bilingual sentences.
起訖頁	368-377
關鍵詞	中英夾雜句、隱藏式馬可夫模型、文脈分析、語音串接、語音合成、English/Mandarin bilingual sentence、Hidden Markov Model、Linguistic analysis、Speech concatenation、Speech synthesis
刊名	ROCLING論文集
期數	2019 (2019期)
出版單位	中華民國計算語言學學會
該期刊-上一篇	基於Seq2Seq模型的中文文法錯誤診斷系統
該期刊-下一篇	基於有向圖與爭論導向摘要的網路辯論之爭論元素辨識

新書閱讀

元照讀書館

優惠活動

月旦品評家

元照讀書館

．研討會新訊

月旦知識庫

月旦法律分析庫
月旦醫事法網
月旦會計財稅網

期刊數位服務

社群平台

讀者服務

關於元照

讀者服務專線：+886-2-23756688　傳真：+886-2-23318496
地址：臺北市館前路28 號 7 樓　客服信箱