Sentiment Analysis on Social Network: Using Emoticon Characteristics for Twitter Polarity Classification

Chia-Ping Chen; Tzu-Hsuan Tseng; Tzu-Hsuan Yang

Title	Sentiment Analysis on Social Network: Using Emoticon Characteristics for Twitter Polarity Classification
Authors	Chia-Ping Chen, Tzu-Hsuan Tseng, Tzu-Hsuan Yang
Abstract	In this paper, we describe a sentiment analysis system implemented for the semantic-evaluation task of message polarity classification for English on Twitter. Our system contains modules for data pre-processing, word embedding, and sentiment classification. To decrease data complexity and increase the coverage of the word vector model for better learning, we perform a series of data pre-processing tasks, including emoticon normalization, specific suffix splitting, and hashtag segmentation. For word embedding, we use the pre-trained word vectors provided by GloVe. We believe that emojis in tweets are important characteristics for Twitter sentiment classification, but most pre-trained sets of word vectors contain few or no emoji representations. Thus, we propose embedding emojis into the vector space with neural network models. We train the emoji vectors with relevant words that contain the descriptions and contexts of emojis. Long short-term memory (LSTM) and convolutional neural network (CNN) models are used as our sentiment classifiers. The proposed emoji embedding is evaluated on the SemEval 2017 tasks. Using emoji embedding, we achieved recall rates of 0.652 with the LSTM classifier and 0.640 with the CNN classifier.
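Pages	1-18
Keywords	Sentiment Analysis, Polarity Classification, Machine Learning, Neural Network, Word Embedding
Journal	中文計算語言學期刊 (International Journal of Computational Linguistics and Chinese Language Processing)
Issue	June 2018 (Vol. 23, No. 1)
Publisher	中華民國計算語言學學會 (Association for Computational Linguistics and Chinese Language Processing)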
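
The abstract names three pre-processing steps: emoticon normalization, specific suffix splitting, and hashtag segmentation. This record does not give the paper's actual rules, so the following is only a minimal sketch of what such a pipeline might look like. The emoticon table, the suffix list, the toy vocabulary, the greedy longest-match strategy, and all function names are hypothetical and chosen for illustration.

```python
# Hypothetical emoticon table: maps text emoticons to one canonical token
# so that variants share a single embedding.
EMOTICON_MAP = {
    ":)": "<smile>", ":-)": "<smile>", ":D": "<smile>",
    ":(": "<sad>", ":-(": "<sad>",
}

# Hypothetical suffixes to split off, so the remaining stem is more likely
# to be found in a pre-trained word vector vocabulary.
SUFFIXES = ("n't", "'s", "'re", "'ll")

# Toy vocabulary used only for hashtag segmentation in this sketch.
VOCAB = {"best", "day", "ever", "so", "happy", "love", "this"}


def split_suffix(token):
    """Split a known suffix off the end of a token, e.g. "today's"."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) > len(suffix):
            return [token[: -len(suffix)], suffix]
    return [token]


def segment_hashtag(tag, vocab=VOCAB):
    """Greedy longest-match segmentation of a hashtag body.

    Falls back to the unsegmented body when some part is not in vocab.
    """
    body = tag.lstrip("#").lower()
    words, i = [], 0
    while i < len(body):
        for j in range(len(body), i, -1):
            if body[i:j] in vocab:
                words.append(body[i:j])
                i = j
                break
        else:
            return [body]  # no vocabulary word starts at position i
    return words


def preprocess(tweet):
    """Apply the three steps to a whitespace-tokenized tweet."""
    out = []
    for token in tweet.split():
        if token in EMOTICON_MAP:
            out.append(EMOTICON_MAP[token])
        elif token.startswith("#") and len(token) > 1:
            out.extend(segment_hashtag(token))
        else:
            out.extend(split_suffix(token.lower()))
    return out


print(preprocess("Today's the #BestDayEver :)"))
```

Under these assumptions the call yields the tokens today, 's, the, best, day, ever, and <smile>. A real system would likely need a much larger emoticon table and a frequency-based segmenter rather than greedy matching.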