Data Driven Approaches to Phonetic Transcription with Integration of Automatic Speech Recognition and Grapheme-to-Phoneme for Spoken Buddhist Sutra

Min-Siong Liang; Ren-Yuan Lyu; Yuang-Chin Chiang

月旦知識庫會員登入｜元照網路書店｜月旦品評家

熱門：

首頁

臺灣期刊 法律公行政治醫事相關財經社會學教育其他

大陸期刊 核心重要期刊

DOI文章

	本站僅提供期刊文獻檢索。　　【月旦知識庫】是否收錄該篇全文，敬請【登入】查詢為準。最新【購點活動】
篇名	Data Driven Approaches to Phonetic Transcription with Integration of Automatic Speech Recognition and Grapheme-to-Phoneme for Spoken Buddhist Sutra
作者	Min-Siong Liang (Min-Siong Liang)、Ren-Yuan Lyu (Ren-Yuan Lyu)、Yuang-Chin Chiang (Yuang-Chin Chiang)
中文摘要	We propose a new approach for performing phonetic transcription of text that utilizes automatic speech recognition (ASR) to help traditional grapheme-to-phoneme (G2P) techniques. This approach was applied to transcribe Chinese text into Taiwanese phonetic symbols. By augmenting the text with speech and using automatic speech recognition with a sausage searching net constructed from multiple pronunciations of text, we are able to reduce the error rate of phonetic transcription. Using a pronunciation lexicon with multiple pronunciations for each item, a transcription error rate of 12.74% was achieved. Further improvement can be achieved by adapting the pronunciation lexicon with pronunciation variation (PV) rules derived manually from corrected transcription in a speech corpus. The PV rules can be categorized into two kinds: knowledge-based and data-driven rules. By incorporating the PV rules, an error rate of 10.56% could be achieved. Although this technique was developed for Taiwanese speech, it could easily be adapted to other Chinese spoken languages or dialects.
起訖頁	233-253
關鍵詞	Automatic Phonetic Transcription、Phone Recognition、Grapheme-to-Phoneme (G2P)、Pronunciation Variation、Chinese Text、Taiwanese (Min-Nan)、Dialect、Buddhist Sutra
刊名	中文計算語言學期刊
期數	200806 (13:2期)
出版單位	中華民國計算語言學學會
該期刊-上一篇	A Cross-Linguistic Study of Voice Onset Time in Stop Consonant Productions