Nepali Speech Recognition Using CNN, GRU and CTC

Bharat Bhatta; Basanta Joshi; Ram Krishna Maharjhan

熱門：

首頁

臺灣期刊 法律公行政治醫事相關財經社會學教育其他

大陸期刊 核心重要期刊

DOI文章

	本站僅提供期刊文獻檢索。　　【月旦知識庫】是否收錄該篇全文，敬請【登入】查詢為準。最新【購點活動】
篇名	Nepali Speech Recognition Using CNN, GRU and CTC
並列篇名	Nepali Speech Recognition Using CNN, GRU and CTC
作者	Bharat Bhatta (Bharat Bhatta)、Basanta Joshi (Basanta Joshi)、Ram Krishna Maharjhan (Ram Krishna Maharjhan)
英文摘要	Communication is an important part of life. To use communication technology efficiently we need to know how to use them or how to instruct these devices to perform tasks. Automatic speech recognition plays an important role in interaction with the technology. Nepali speech recognition involves in conversion of Nepali speech to its correct Nepali transcriptions. The purposed model consists of CNN, GRU and CTC network. The feature in the raw audio is extracted by using MFCC algorithm. CNN is for learning high level features. GRU is responsible for constructing the acoustic model. CTC is responsible for decoding. The dataset consists of 18 female speakers. It is provided by Open Speech and Language Resources. The build model can predict the with the WER of 11%.
起訖頁	1-9
關鍵詞	Nepali Speech Recognition、Automatic Speech Recognition、Gated Recurrent Unit (GRU)、Convolution Neural Network (CNN)
刊名	ROCLING論文集
期數	2020 (2020期)
出版單位	中華民國計算語言學學會
該期刊-上一篇	「忙」的語意及認知概念分析──以語料庫為本
該期刊-下一篇	上下文語言模型化技術於常見問答檢索之研究