結構性重參數化VGG架構之輕量化聲音事件偵測模型

Chia-Chuan Liu; Sung-Jen Huang; Chia-Ping Chen; Chung-Li Lu; Bo-Cheng Chan; Yu-Han Cheng; Hsiang-Feng Chuang; Wei-Yu Chen

月旦知識庫會員登入｜元照網路書店｜月旦品評家

熱門：

首頁

臺灣期刊 法律公行政治醫事相關財經社會學教育其他

大陸期刊 核心重要期刊

DOI文章

	本站僅提供期刊文獻檢索。　　【月旦知識庫】是否收錄該篇全文，敬請【登入】查詢為準。最新【購點活動】
篇名	結構性重參數化VGG架構之輕量化聲音事件偵測模型
並列篇名	Lightweight Sound Event Detection Model with RepVGG Architecture
作者	Chia-Chuan Liu (Chia-Chuan Liu)、Sung-Jen Huang (Sung-Jen Huang)、Chia-Ping Chen (Chia-Ping Chen)、Chung-Li Lu (Chung-Li Lu)、Bo-Cheng Chan (Bo-Cheng Chan)、Yu-Han Cheng (Yu-Han Cheng)、Hsiang-Feng Chuang (Hsiang-Feng Chuang)、Wei-Yu Chen (Wei-Yu Chen)
中文摘要	本文中提出以模型輕量化為目標的聲音事件偵測RepVGGRNN模型。其於卷積層使用RepVGG卷積塊，透過殘差連接的網路結構使模型達到良好的效能，並於模型訓練完畢後透過結構重參數化使得卷積參數得以縮減。此外，其於訓練階段合併使用知識蒸餾及均值教師模型之訓練方法進一步提昇輕量化模型之預測準確度。RepVGGRNN在DCASE 2022Task4驗證集中，PSDS(Polyphonic soundevent detection score)-scenario 1, 2分別以40.8%, 67.7%優於官方baseline系統所達到的34.4%, 57.2%，並在模型參數量上，RepVGGRNN使用的參數量約為49.6萬，僅baseline系統之44.6%。
英文摘要	In this paper, we proposed RepVGGRNN, which is a light weight sound event detection model. We use RepVGG convolution blocks in the convolution part to improve performance, and re-parameterize the RepVGG blocks after the model is trained to reduce the parameters of the convolution layers. To further improve the accuracy of the model, we incorporated both the mean teacher method and knowledge distillation to train the lightweight model. The proposed system achieves PSDS (Polyphonic sound event detection score)-scenario 1, 2 of 40.8% and 67.7% outperforms the baseline system of 34.4% and 57.2% on the DCASE 2022 Task4 validation dataset. The quantity of the parameters in the proposed system is about 49.6K, only 44.6% of the baseline system.
起訖頁	129-137
關鍵詞	聲音事件偵測、輕量化模型、知識蒸餾、Sound event detection、Light weight model、Knowledge distillation
刊名	ROCLING論文集
期數	202212 (2022期)
出版單位	中華民國計算語言學學會
該期刊-上一篇	老年扶養費請求案件之准駁及扶養金額預測
該期刊-下一篇	A Dimensional Valence-Arousal-Irony Dataset for Chinese Sentence and Context

新書閱讀

元照讀書館

優惠活動

月旦品評家

元照讀書館

．研討會新訊

月旦知識庫

月旦法律分析庫
月旦醫事法網
月旦會計財稅網

期刊數位服務

社群平台

讀者服務

關於元照

讀者服務專線：+886-2-23756688　傳真：+886-2-23318496
地址：臺北市館前路28 號 7 樓　客服信箱