Investigation of Feature Processing Modules and Attention Mechanisms in Speaker Verification System

陳廷威; 林威廷; 陳嘉平; 呂仲理; 詹博丞; 鄭羽涵; 莊向峰; 陳威妤

Title	探討語者驗證系統中特徵處理模組與注意力機制
Parallel title	Investigation of Feature Processing Modules and Attention Mechanisms in Speaker Verification System
Authors	陳廷威; 林威廷; 陳嘉平; 呂仲理; 詹博丞; 鄭羽涵; 莊向峰; 陳威妤
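Abstract (translated from Chinese)	This paper builds and swaps in different audio feature front-end modules and attention mechanisms to improve a speaker verification system. We take an improved model based on ECAPA-TDNN as the baseline, then replace and combine different front-end modules and attention mechanisms, compare the results, and select the best combination as the final proposed model. We train on the VoxCeleb 2 dataset and evaluate the models on several test sets. On the VoxSRC2022 validation set, the final model achieves a 16% improvement over the baseline, giving better speaker verification performance.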
Abstract (English)	In this paper, we use several combinations of feature front-end modules and attention mechanisms to improve the performance of our speaker verification system. An updated version of ECAPA-TDNN is chosen as the baseline. We replace and combine different feature front-end and attention modules, compare the results, and select the most effective design as our final system. We train on the VoxCeleb 2 dataset and evaluate our models on several test sets. On the VoxSRC2022 validation set, our final model achieves a 16% improvement over the baseline, giving better speaker verification performance.
Pages	31-45
Keywords	Speaker Verification, Frontend Module, Attention Mechanism, Time Delay Neural Network
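Journal	中文計算語言學期刊 (International Journal of Computational Linguistics and Chinese Language Processing)
Issue	December 2022 (Vol. 27, No. 2)
Publisher	中華民國計算語言學學會 (Association for Computational Linguistics and Chinese Language Processing)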
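Previous article in this issue	Aligning Sentences in a Paragraph-Paraphrased Corpus with New Embedding-based Similarity Measures
Next article in this issue	中英文語碼轉換語音合成系統開發 (Development of a Mandarin-English Code-Switching Speech Synthesis System)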