Unsupervised Approach for Automatic Keyword Extraction from Arabic Documents

Arafat Awajan

月旦知識庫會員登入｜元照網路書店｜月旦品評家

熱門：

首頁

臺灣期刊 法律公行政治醫事相關財經社會學教育其他

大陸期刊 核心重要期刊

DOI文章

	本站僅提供期刊文獻檢索。　　【月旦知識庫】是否收錄該篇全文，敬請【登入】查詢為準。最新【購點活動】
篇名	Unsupervised Approach for Automatic Keyword Extraction from Arabic Documents
並列篇名	Unsupervised Approach for Automatic Keyword Extraction from Arabic Documents
作者	Arafat Awajan (Arafat Awajan )
英文摘要	In this paper, we present an unsupervised two-phase approach to extract keywords from Arabic documents that combines statistical analysis and linguistic information. The first phase detects all the N-grams that may be considered keywords. In the second phase, the N-grams are analyzed using a morphological analyzer to replace the words of the N-grams with their base forms that are the roots for the derived words and the stems for the non-derivative words. The N-grams that have the same base forms are regrouped and their counts accumulated. The ones that appear more frequently are then selected as keywords. An experiment is conducted to evaluate the proposed approach by comparing the extracted keywords with those manually selected. The results show that the proposed approach achieved an average precision of 0.51.
起訖頁	175-184
關鍵詞	Keyword extraction、Keyphrase extraction、Arabic Language、N-gram
刊名	ROCLING論文集
期數	2014 (2014期)
出版單位	中華民國計算語言學學會
該期刊-上一篇	以二維共振峰分布建立語者音色模型及其在語者驗證上之應用
該期刊-下一篇	Testing Distributional Hypothesis in Patent Translation

新書閱讀

元照讀書館

優惠活動

月旦品評家

元照讀書館

．研討會新訊

月旦知識庫

月旦法律分析庫
月旦醫事法網
月旦會計財稅網

期刊數位服務

社群平台

讀者服務

關於元照

讀者服務專線：+886-2-23756688　傳真：+886-2-23318496
地址：臺北市館前路28 號 7 樓　客服信箱