Development and Testing Transcription Software for a Southern Min Spoken Corpus

Jia-Cing Ruan; Chiung-Wen Hsu; James Myers; Jane S. Tsay

月旦知識庫會員登入｜元照網路書店｜月旦品評家

熱門：

首頁

臺灣期刊 法律公行政治醫事相關財經社會學教育其他

大陸期刊 核心重要期刊

DOI文章

	本站僅提供期刊文獻檢索。　　【月旦知識庫】是否收錄該篇全文，敬請【登入】查詢為準。最新【購點活動】
篇名	Development and Testing Transcription Software for a Southern Min Spoken Corpus
作者	Jia-Cing Ruan (Jia-Cing Ruan)、Chiung-Wen Hsu (Chiung-Wen Hsu)、James Myers (James Myers)、Jane S. Tsay (Jane S. Tsay)
中文摘要	The usual challenges of transcribing spoken language are compounded for Southern Min (Taiwanese) because it lacks a generally accepted orthography. This study reports the development and testing of software tools for assisting such transcription. Three tools are compared, each representing a different type of interface with our corpus-based Southern Min lexicon (Tsay, 2007): our original Chinese character-based tool (Segmentor), the first version of a romanization-based lexicon entry tool called Adult-Corpus Romanization Input Program (ACRIP 1.0), and a revised version of ACRIP that accepts both character and romanization inputs and integrates them with sound files (ACRIP 2.0). In two experiments, naive native speakers of Southern Min were asked to transcribe passages from our corpus of adult spoken Southern Min (Tsay and Myers, in progress), using one or more of these tools. Experiment 1 showed no disadvantage for romanization-based compared with character-based transcription even for untrained transcribers. Experiment 2 showed significant advantages of the new mixed-system tool (ACRIP 2.0) over both Segmentor and ACRIP 1.0, in both speed and accuracy of transcription. Experiment 2 also showed that only minimal additional training brought dramatic improvements in both speed and accuracy. These results suggest that the transcription of non-Mandarin Sinitic languages benefits from flexible, integrated software tools.
起訖頁	1-26
關鍵詞	Speech Transcription、Southern Min、Taiwanese、Romanization、Key-in Systems
刊名	中文計算語言學期刊
期數	201203 (17:1期)
出版單位	中華民國計算語言學學會
該期刊-下一篇	可變速中文文字轉語音系統