英文摘要 |
As the World Wide Web prevails nowadays, many companies tend to disseminate information through their commercial Web sites. From a strategic planning viewpoint, identifying a company's external environment helps to create its business values. Therefore, it becomes essential for companies to identify its external environment through the Web. Traditional approaches such as analysis of access log or registration suffer from limited or incorrect data collection. In contrast, Web content classification can be used for a company's external environment identification. Furthermore, relationships among Web pages resemble social interactions in the real world and contribute to classifying the external environment. We, therefore, propose a classifier, CNB-HI, that utilizes Web contents and hyperlink structure to identify the roles of a company's external environment. Two experiments are conducted to examine the performance. In the first experiment, we compare CNB with variants of Naive Bayes classifiers, and conclude that CNB achieves a better performance. The second experiment further shows that the performance of CNB-HI improves markedly compared to CNB. The feasibility of our proposed approach is thus justified. |