Web Mining for Unsupervised Classification

Wei-Yen Day; Chun-Yi Chi; Ruey-Cheng Chen; Pu-Jen Cheng; Pei-Sen Liu

月旦知識庫會員登入｜元照網路書店｜月旦品評家

熱門：

首頁

臺灣期刊 法律公行政治醫事相關財經社會學教育其他

大陸期刊 核心重要期刊

DOI文章

	本站僅提供期刊文獻檢索。　　【月旦知識庫】是否收錄該篇全文，敬請【登入】查詢為準。最新【購點活動】
篇名	Web Mining for Unsupervised Classification
並列篇名	Web Mining for Unsupervised Classification
作者	Wei-Yen Day (Wei-Yen Day)、Chun-Yi Chi (Chun-Yi Chi)、Ruey-Cheng Chen (Ruey-Cheng Chen)、Pu-Jen Cheng (Pu-Jen Cheng)、Pei-Sen Liu (Pei-Sen Liu)
英文摘要	Data acquisition is a major concern in text classification. The excessive human efforts required by conventional methods to build up quality training collection might not always be available to research workers. In this paper, we look into possibilities to automatically collect training data by sampling the Web with a set of given class names. The basic idea is to populate appropriate keywords and submit them as queries to search engines for acquiring training data. Two methods are presented in this study: One method is based on sampling the common concepts among the classes, and the other based on sampling the discriminative concepts for each class. A series of experiments were carried out independently on two different datasets, and the result shows that the proposed methods significantly improve classifier performance even without using manually labeled training data. Our strategy for retrieving Web samples, we find that, is substantially helpful in conventional document classification in terms of accuracy and efficiency.
起訖頁	53-67
關鍵詞	Unsupervised classification、text classification、Web mining
刊名	ROCLING論文集
期數	2009 (2009期)
出版單位	中華民國計算語言學學會
該期刊-上一篇	強健性語音辨識中分頻段調變頻譜補償之研究
該期刊-下一篇	Query Formulation by Selecting Good Terms

新書閱讀

元照讀書館

優惠活動

月旦品評家

元照讀書館

．研討會新訊

月旦知識庫

月旦法律分析庫
月旦醫事法網
月旦會計財稅網

期刊數位服務

社群平台

讀者服務

關於元照

讀者服務專線：+886-2-23756688　傳真：+886-2-23318496
地址：臺北市館前路28 號 7 樓　客服信箱