語音增強基於小腦模型控制器

朱皓駿; 李仲溪; 方士豪; 林志民; 張雲帆; 曹昱

熱門：

首頁

臺灣期刊 法律公行政治醫事相關財經社會學教育其他

大陸期刊 核心重要期刊

DOI文章

	本站僅提供期刊文獻檢索。　　【月旦知識庫】是否收錄該篇全文，敬請【登入】查詢為準。最新【購點活動】
篇名	語音增強基於小腦模型控制器
作者	朱皓駿、李仲溪、方士豪、林志民、張雲帆、曹昱
中文摘要	本文提出了一個小腦模型控制器（Cerebellar Model Articulation Controller, CMAC）應用於語音增強系統（Speech Enhancement System），所提出的CMAC使用歸一化梯度下降法（Normalized Gradient Descent Method）增加CMAC參數的自適應學習速度，具有比傳統類神經網路方法更快的學習速度、體積小且良好的泛化，因此更適合做高速的訊號處理。實驗方面，使用CMAC與MMSE做比較，為了比較性能，我們用了三種語音評估方法來做CMAC消除雜音及MMSE消除雜音後的數值比較，分別為（Perceptual Evaluation of Speech Quality, PESQ）、（Segmental Signal-to-Noise Ratio, SSNR）以及（Speech Distortion Index, SDI）。由實驗結果可知，在三種評估方法，CMAC皆能達到較佳的結果。
英文摘要	Traditionally, cerebellar model articulation controller (CMAC) is used in motor control, inverted pendulum robot, and nonlinear channel equalization. In this study, we investigate the capability of CMAC for speech enhancement. We construct a CMAC-based supervised speech enhancement system, which includes offline and online phases. In the offline phase, a paired noisy-clean speech dataset is prepared and used to train the parameters in a CMAC model. In the online phase, the trained CMAC model transforms the input noisy speech signals to enhanced speech signals with reduced noise components. To test the CMAC-based speech enhancement system, this study adopted three speech objective evaluation metrics, including perceptual evaluation of speech quality (PESQ), segmental signal-to-noise ratio (SSNR) and speech distortion index (SDI). A well-known traditional speech enhancement approach, minimum mean-square-error (MMSE) algorithm, was also tested performance for comparison. Experimental results demonstrated that CMAC provides superior performances to the MMSE method for all of the three objective evaluation metrics.
起訖頁	123-135
關鍵詞	小腦模型控制器、語音增強、最小均方誤差、CMAC、Speech Enhancement、MMSE
刊名	ROCLING論文集
期數	2015 (2015期)
出版單位	中華民國計算語言學學會
該期刊-上一篇	透過語音特徵建構基於堆疊稀疏自編碼器演算法之婚姻治療中夫妻互動行為量表自動化評分系統
該期刊-下一篇	類神經網路訓練結合環境群集及專家混合系統於強健性語音辨識