The NCCU Corpus of Spoken Chinese: Mandarin, Hakka, and Southern Min

Chui, Kawai; Lai, Huei-ling

月旦知識庫會員登入｜元照網路書店｜月旦品評家

熱門：

首頁

臺灣期刊 法律公行政治醫事相關財經社會學教育其他

大陸期刊 核心重要期刊

DOI文章

首頁

臺灣期刊

其他

Taiwan Journal of Linguistics

200812 (6:2期)

	本站僅提供期刊文獻檢索。　　【月旦知識庫】是否收錄該篇全文，敬請【登入】查詢為準。最新【購點活動】
篇名	The NCCU Corpus of Spoken Chinese: Mandarin, Hakka, and Southern Min
作者	Chui, Kawai (Chui, Kawai)、Lai, Huei-ling (Lai, Huei-ling)
英文摘要	In Taiwan, most people speak Mandarin, Southern Min, or Hakka. Not only are the three Chinese dialects undergoing linguistic changes, but the population of Southern Min and Hakka is also diminishing. The NCCU Corpus of Spoken Chinese is thus a project of language documentation whereby open online access to Mandarin, Hakka, and Southern Min data is provided for non-profit-making research. As a language documentation project, the NCCU spoken corpus focuses on collecting and archiving spoken forms of various types. It consists of three sub-corpora, namely the Corpus of Spoken Mandarin, the Corpus of Spoken Hakka, and the Corpus of Spoken Southern Min. The three corpora share a common scheme for the collection of spoken data, mostly in the form of spontaneous face-to-face conversations. The infrastructure of the corpus is designed in a simple yet user-friendly way, so that data can be processed efficiently in the database, and users can browse the spoken data directly from the web. We hope that our work can encourage more people to engage in building up spoken corpora from different perspectives and for different purposes.
起訖頁	119-144
刊名	Taiwan Journal of Linguistics
期數	200812 (6:2期)
出版單位	文鶴出版有限公司
該期刊-上一篇	Developing an Online Corpus of Formosan Languages