英文摘要 |
This research proposes the probability mapping approach for a product name to attempt to resolve the synonym identification problem, which is usually ignored in the field of Named Entity Recognition. Using the same name to describe a product or service may effectively improve the results of opinion mining or sentiment analysis. However, as WOM is a user generated content (UGC), different names may be used by the same or different users. Besides, there is no unified naming rule when writing the WOM. Even though the authors are the same or different, they may use different names to describe the same products. In this case, searching or organizing the WOM article without the consideration of the naming issue may lead to the problem of information loss. Thus, we propose the probability mapping approach via the co-occurrence naming dataset and the Word2vect language model in order to reduce the naming issue. According to our initially experimental results, the probability mapping approach for a product name present its potential in the naming issue. |