英文摘要 |
"Online news has become one of the major sources for investors to make investment decisions. However, it has become a great challenge for investors to judge and verify the current market sentiment from a huge amount of financial news generated by financial news websites. Consequently, using financial news collected from Taiwanese stock market news of Anue website (www.cnyes.com), this paper employs three text classification models (naive Bayes classifier, fastText, and deep learning model) with four feature extraction and selection techniques of text mining (N-gram, term frequency–inverse document frequency [TF-IDF], Chi-square test, and mutual information) to classify news sentiments. The empirical results demonstrate that (1) N-gram feature extraction can improve the accuracy of all classification models, especially for the naive Bayes classifier which can effectively overcome the shortcomings of independence assumptions of text features. (2) In spite of the long training time, multi-layer perceptron with Chi-square test and mutual information feature selection techniques can provide the best performance for two and three classification with the accuracy up to 90% and 80%, respectively. (3) Compared to multi-layer perceptron, fastText model with Chi-square test and mutual information feature selection techniques can reduce the training speed without loss of the accuracy of classification." |