TAICAR--The Collection and Annotation of an In-Car Speech Database Created in Taiwan

Wang, Hsien-chang; Yang, Chung-hsien; Wang, Jhing-fa; Wu, Chung-hsien; Chien, Jen-tzung

月旦知識庫會員登入｜元照網路書店｜月旦品評家

熱門：

首頁

臺灣期刊 法律公行政治醫事相關財經社會學教育其他

大陸期刊 核心重要期刊

DOI文章

	本站僅提供期刊文獻檢索。　　【月旦知識庫】是否收錄該篇全文，敬請【登入】查詢為準。最新【購點活動】
篇名	TAICAR--The Collection and Annotation of an In-Car Speech Database Created in Taiwan
作者	Wang, Hsien-chang (Wang, Hsien-chang)、Yang, Chung-hsien (Yang, Chung-hsien)、Wang, Jhing-fa (Wang, Jhing-fa)、Wu, Chung-hsien (Wu, Chung-hsien)、Chien, Jen-tzung (Chien, Jen-tzung)
中文摘要	This paper describes a project that aims to create a Mandarin speech database for the automobile setting (TAICAR). A group of researchers from several universities and research institutes in Taiwan have participated in the project. The goal is to generate a corpus for the development and testing of various speech-processing techniques. There are six recording sites in this project. Various words, sentences, and spontaneously queries uttered in the vehicular navigation setting have been collected in this project. A preliminary corpus of utterances from 192 speakers was created from utterances generated in different vehicles. The database contains more than 163,000 files, occupying 16.8 gigabytes of disk space.
起訖頁	237-249
關鍵詞	TAICAR、In-car speech、Speech database、Multi-channel recording、Corpus collection and annotation
刊名	中文計算語言學期刊
期數	200506 (10:2期)
出版單位	中華民國計算語言學學會
該期刊-上一篇	MATBN: A Mandarin Chinese Broadcast News Corpus
該期刊-下一篇	Design and Development of a Bilingual Reading Comprehension Corpus