月旦知識庫
 
  1. 熱門:
 
首頁 臺灣期刊   法律   公行政治   醫事相關   財經   社會學   教育   其他 大陸期刊   核心   重要期刊 DOI文章
ROCLING論文集 本站僅提供期刊文獻檢索。
  【月旦知識庫】是否收錄該篇全文,敬請【登入】查詢為準。
最新【購點活動】


篇名
運用響應式知識蒸餾機制增進中文多標籤文本分類效能
並列篇名
Enhancing Chinese Multi-Label Text Classification Performance with Response-based Knowledge Distillation
作者 Szu-Chi Huang (Szu-Chi Huang)Cheng-Fu Cao (Cheng-Fu Cao)Po-Hsun Liao (Po-Hsun Liao)Lung-Hao Lee (Lung-Hao Lee)Po-Lei Lee (Po-Lei Lee)Kuo-Kai Shyu (Kuo-Kai Shyu)
中文摘要
資料類別不平衡存在長尾標籤問題,單獨的多標籤分類模型一次預測所有類別,針對個別標籤的最佳化十分困難,對於出現次數較少的長尾標籤效能通常不佳。本論文提出一種響應式知識蒸餾機制,將多個最佳化的二元模型作為教師網路,單一多標籤模型做為學生網路,改善多標籤模型在非平衡標籤的資料集分類效能。實驗資料來自2,724個中文健康照護文本,人工標記文章內容橫跨9個類別,總共標籤數量是8,731,平均每個樣本有3.2個標籤。實驗設定採用5折交互驗證,比較TextRNN、TextCNN、HAN和GRU-att模型,使用知識蒸餾機制與否的效能差異,結果顯示透過知識蒸餾機制能夠顯著提升單一多標籤分類模型的micro-F1約2至3 %、macro-F1約4至6 %、weighted-F1約3至4 %,以及subset accuracy約1至2 %。
英文摘要
It’s difficult to optimize individual label performance of multi-label text classification, especially in those imbalanced data containing long-tailed labels. Therefore, this study proposes a response-based knowledge distillation mechanism comprising a teacher model that optimizes binary classifiers of the corresponding labels and a student model that is a standalone multi-label classifier learning from distilled knowledge passed by the teacher model. A total of 2,724 Chinese healthcare texts were collected and manually annotated across nine defined labels, resulting in 8731 labels, each containing an average of 3.2 labels. We used 5-fold cross-validation to compare the performance of several multi-label models, including TextRNN, TextCNN, HAN, and GRU-att. Experimental results indicate that using the proposed knowledge distillation mechanism effectively improved the performance no matter which model was used, about 2-3% of micro-F1, 4-6% of macro-F1, 3-4% of weighted-F1 and 1-2% of subset accuracy for performance enhancement.
起訖頁 25-31
關鍵詞 多標籤分類長尾標籤二元相關知識蒸餾Multi-label classificationlongtailed labelsbinary relevanceknowledge distillation
刊名 ROCLING論文集  
期數 202212 (2022期)
出版單位 中華民國計算語言學學會
該期刊-上一篇 Analyzing discourse functions with acoustic features and phone embeddings: non-lexical items in Taiwan Mandarin
該期刊-下一篇 Analyzing discourse functions with acoustic features and phone embeddings: non-lexical items in Taiwan Mandarin
 

新書閱讀



最新影音


優惠活動




讀者服務專線:+886-2-23756688 傳真:+886-2-23318496
地址:臺北市館前路28 號 7 樓 客服信箱
Copyright © 元照出版 All rights reserved. 版權所有,禁止轉貼節錄