Integrating Complementary Features from Vocal Source and Vocal Tract for Speaker Identification

Nengheng Zheng; Tan Lee; Ning Wang; P. C. Ching

月旦知識庫會員登入｜元照網路書店｜月旦品評家

熱門：

首頁

臺灣期刊 法律公行政治醫事相關財經社會學教育其他

大陸期刊 核心重要期刊

DOI文章

	本站僅提供期刊文獻檢索。　　【月旦知識庫】是否收錄該篇全文，敬請【登入】查詢為準。最新【購點活動】
篇名	Integrating Complementary Features from Vocal Source and Vocal Tract for Speaker Identification
作者	Nengheng Zheng (Nengheng Zheng)、Tan Lee (Tan Lee)、Ning Wang (Ning Wang)、P. C. Ching (P. C. Ching )
中文摘要	This paper describes a speaker identification system that uses complementary acoustic features derived from the vocal source excitation and the vocal tract system. Conventional speaker recognition systems typically adopt the cepstral coefficients, e.g., Mel-frequency cepstral coefficients (MFCC) and linear predictive cepstral coefficients (LPCC), as the representative features. The cepstral features aim at characterizing the formant structure of the vocal tract system. This study proposes a new feature set, named the wavelet octave coefficients of residues (WOCOR), to characterize the vocal source excitation signal. WOCOR is derived by wavelet transformation of the linear predictive (LP) residual signal and is capable of capturing the spectro-temporal properties of vocal source excitation. WOCOR and MFCC contain complementary information for speaker recognition since they characterize two physiologically distinct components of speech production. The complementary contributions of MFCC and WOCOR in speaker identification are investigated. A confidence measure based score-level fusion technique is proposed to take full advantage of these two complementary features for speaker identification. Experiments show that an identification system using both MFCC and WOCOR significantly outperforms one using MFCC only. In comparison with the identification error rate of 6.8% obtained with MFCC-based system, an error rate of 4.1% is obtained with the proposed confidence measure based integrating system.
起訖頁	273-290
關鍵詞	Speaker Identification、Vocal Source Feature、Vocal Tract Feature、Information Fusion、Confidence Measure
刊名	中文計算語言學期刊
期數	200709 (12:3期)
出版單位	中華民國計算語言學學會
該期刊-上一篇	A Novel Characterization of the Alternative Hypothesis Using Kernel Discriminant Analysis for LLR-Based Speaker Verification
該期刊-下一篇	Performance of Discriminative HMM Training in Noise