Category Mapping for Zero-shot Text Classification

Qiu-Xia Zhang; Te-Yu Chi; Te-Lun Yang; Yu-Meng Tang; Ta-Lin Chen; Jyh-Shing Roger Jang

Title	Category Mapping for Zero-shot Text Classification
Authors	Qiu-Xia Zhang; Te-Yu Chi; Te-Lun Yang; Yu-Meng Tang; Ta-Lin Chen; Jyh-Shing Roger Jang
Abstract	The existing method of using large pre-trained models with prompts for zero-shot text classification possesses powerful representation ability and scalability. However, its commercial availability is relatively limited. The approach of employing class labels and existing datasets to fine-tune smaller models for zero-shot classification is comparatively straightforward, yet it might lead to weaker model generalization ability. This paper introduces three methods to enhance the accuracy and generalization capability of pre-trained models in zero-shot text classification tasks: 1) utilizing pre-trained language models and structuring inputs into a standardized multiple-choice format; 2) creating a text classification training dataset from Wikipedia text data and fine-tuning the pre-trained model on it; and 3) proposing a zero-shot category mapping technique based on GloVe text similarity, in which Wikipedia categories replace the textual categories. Remarkably, without employing labeled samples for fine-tuning, the proposed method achieves results comparable to the best models fine-tuned with labeled samples.
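Pages	141-156
Keywords	Natural Language Processing; Pre-trained Language Models; Zero-shot Text Classification; Classification; GloVe
Journal	ROCLING Proceedings
Issue	202310 (2023 issue)
Publisher	The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)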
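
The abstract names two steps that are easy to picture with a small example: casting the input as a multiple-choice question, and mapping each target label to its most similar Wikipedia category. The sketch below is only an illustration under stated assumptions, not the authors' implementation. The record does not give the prompt template, the similarity measure, or the way multi-word labels are embedded, so the template, the averaging of word vectors, and the use of cosine similarity are all assumptions here. The three-dimensional vectors in TOY_VECTORS are made-up stand-ins for real GloVe embeddings, and the function names are hypothetical.

```python
import math

# Made-up 3-dimensional vectors standing in for real GloVe embeddings (assumption).
TOY_VECTORS = {
    "sports": [0.9, 0.1, 0.0],
    "football": [0.8, 0.2, 0.1],
    "politics": [0.1, 0.9, 0.1],
    "government": [0.2, 0.8, 0.0],
    "science": [0.0, 0.1, 0.9],
    "physics": [0.1, 0.0, 0.8],
}


def embed(phrase, vectors):
    """Average the vectors of the known words in a phrase; None if no word is known."""
    found = [vectors[w] for w in phrase.lower().split() if w in vectors]
    if not found:
        return None
    dim = len(found[0])
    return [sum(v[i] for v in found) / len(found) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def map_category(label, wiki_categories, vectors):
    """Return the Wikipedia category most similar to the label, or None if unmappable."""
    label_vec = embed(label, vectors)
    if label_vec is None:
        return None
    best, best_score = None, -1.0
    for category in wiki_categories:
        cat_vec = embed(category, vectors)
        if cat_vec is None:
            continue
        score = cosine(label_vec, cat_vec)
        if score > best_score:
            best, best_score = category, score
    return best


def format_multiple_choice(text, options):
    """Lay out a text and its candidate categories as lettered options (hypothetical template)."""
    lines = [f"Text: {text}", "Options:"]
    lines += [f"({chr(ord('A') + i)}) {opt}" for i, opt in enumerate(options)]
    return "\n".join(lines)


if __name__ == "__main__":
    wiki_categories = ["Sports", "Politics", "Science"]
    mapped = [map_category(lbl, wiki_categories, TOY_VECTORS)
              for lbl in ["football", "government", "physics"]]
    print(format_multiple_choice("The team won the final match.", mapped))
```

With real embeddings, TOY_VECTORS would be replaced by a dictionary loaded from a GloVe vector file, and the formatted string would be passed to whichever pre-trained model scores the options.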