改善多細粒度的發音評測上資料不平衡的問題

林孟欣; 王馨偉; 羅天宏; 陳柏琳; 趙偉成

月旦知識庫會員登入｜元照網路書店｜月旦品評家

熱門：

首頁

臺灣期刊 法律公行政治醫事相關財經社會學教育其他

大陸期刊 核心重要期刊

DOI文章

	本站僅提供期刊文獻檢索。　　【月旦知識庫】是否收錄該篇全文，敬請【登入】查詢為準。最新【購點活動】
篇名	改善多細粒度的發音評測上資料不平衡的問題
並列篇名	Addressing the issue of Data Imbalance in Multi-granularity Pronunciation Assessment
作者	林孟欣、王馨偉、羅天宏、陳柏琳、趙偉成
中文摘要	自動發音評測(Automatic Pronunciation Assessment, APA)是在量化非母語(L2)學習者在某種語言中發音的熟練程度。然而隨著技術的發展APA已經可以評測多個發音細粒度如音素層級、單字層級和語句層級及發音準確度、流利度、重音等多個面向。然而目前的APA方法使用均方誤差(Mean Squard Error, MSE)損失函數，但在每個細粒度的標籤都存在資料高度不平衡的問題，這會影響模型的泛化能力和公平性，MSE會低估稀有的標籤，但現有的研究卻很少涉及數據不平衡的問題。因此在本研究中，我們參考了在視覺分類建模中使用的類平衡損失函數，使用重新採樣的方式及加入一個可訓練的變數，縮小了在不平衡的回歸任務中，訓練集和測試集不匹配的程度。而我們在speechocean762資料集上評估我們的方法，這個資料集上字詞層級顯示出明顯不平衡的標籤，而我們的實驗結果顯示，在這個不平衡的資料集上，我們實驗的結果明顯獲得改善。
英文摘要	Automatic Pronunciation Assessment (APA) aims to quantify non-native (L2) learners' pronunciation proficiency in a specific language. With technological advancements, APA now evaluates various aspects of pronunciation, from phoneme level to sentence level, including accuracy, fluency, stress, and more. However, current APA methods rely on the Mean Squared Error (MSE) loss function, which struggles with imbalanced labels across different levels of granularity. This imbalance affects model generalizability and fairness, as MSE tends to underestimate rare labels. Despite these issues, existing research has not adequately addressed data imbalance. To address this gap, we draw inspiration from class-balanced loss functions in visual classification. Our approach involves resampling and introducing a trainable variable to narrow the gap between training and testing sets in imbalanced regression tasks, aiming to alleviate label imbalance effects in APA. Evaluating our method on the Speechocean762 dataset, known for significant word-level label imbalance, we observe remarkable enhancements in performance. Our proposed approach shows promise in tackling challenges stemming from imbalanced data in automatic pronunciation assessment.
起訖頁	134-140
關鍵詞	自動發音評測、資料不平衡、回歸損失函數、Automatic Pronunciation Assessment、data imbalanced、regression loss function
刊名	ROCLING論文集
期數	202310 (2023期)
出版單位	中華民國計算語言學學會
該期刊-上一篇	Is GPT-4 a Good Islamic Expert for Answering Quran Questions?
該期刊-下一篇	Category Mapping for Zero-shot Text Classification