英文摘要 |
The relationship between Taiwan’s national legislative election results and web‐based news is exploredthrough multiple regression models applied to news content taken from the online version of one ofTaiwan’s major daily newspapers from January 1, 2002 to December 31, 2009. The 2008 electionresults were used as training data, while that of 2012 was used for testing to evaluate the predictivevalue of online news content for election results. The best results featured an MAE of about 7% with aPearson correlation coefficient of about 0.5. Although these results lack precision, they can still serveas a reference for online political opinion. The models are constructed using natural languageprocessing and machine learning, and address the sparse matrix problem with feature selection. Futurework will integrate sentiment analysis to improve model performance. |