| 英文摘要 |
This study experiments with both Encoder and Decoder architectures of pre-trained language models to determine their effectiveness in identifying hate speech related to body shaming. Previous research has largely focused on discussing and mitigating the automatic generation of discriminatory language in generative models within LLMs. However, there hasn’t been research investigating the further application of generative models in automatically classifying and identifying discriminatory language. Therefore, this study employs a zero-shot classification approach and provides a comprehensive definition of body shaming to examine whether Decoder-focused generative models (ChatGLM-6B and Chinese-Alpaca-Plus-7B) are suitable for automatically identifying discriminatory language. Furthermore, to gain a more comprehensive understanding of how different architectures within LLMs perform in hate speech detection, a BERT model with an Encoder architecture is also employed for classification. The results from both architectures are then further analyzed and compared. BERT shows good performance with minimal finetuning data, while generative models struggle with zero-shot classification. Thus we aim to explore the potential for improving the performance of generative models by providing detailed explanations for sentences with various structures. |