A Light Weight Stemmer in Kokborok

Braja Gopal Patra; Khumbar Debbarma; Swapan Debbarma

Title	A Light Weight Stemmer in Kokborok
Authors	Braja Gopal Patra, Khumbar Debbarma, Swapan Debbarma
Abstract	Stemming has long played a significant role in natural language processing applications such as information retrieval (IR), machine translation (MT), morphological analysis, and part-of-speech (POS) tagging. Stemmers have been developed for many languages, including Indian languages; however, no such work has been done for Kokborok, a native language of Tripura. In this paper, we design a simple rule-based stemmer for Kokborok using an affix stripping algorithm. The stemmer reduces inflected words to their stem or root form by stripping affixes and applying boundary rules where needed. The stemming algorithm was tested on a corpus of 32,578 words, of which 13,044 were unique, and achieved an overall accuracy of 80.02% with the minimum suffix stripping algorithm and 85.13% with the maximum suffix stripping algorithm.
Pages	318-325
Keywords	Stemming, part of speech (POS), Kokborok, suffix, prefix
Journal	Proceedings of ROCLING
Issue	2012
Publisher	Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
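The abstract contrasts minimum and maximum suffix stripping without giving the rules themselves. The sketch below shows one plausible reading of that contrast: among all suffixes that match the end of a word, the minimum variant strips the shortest and the maximum variant strips the longest. The suffix list, the function name `strip_suffix`, and the minimum-stem-length boundary rule are all assumptions made for illustration; they are English-like placeholders, not the Kokborok affixes or boundary rules used in the paper.

```python
from typing import Iterable

# Hypothetical placeholder suffixes (English-like); NOT the paper's Kokborok list.
TOY_SUFFIXES = ("s", "es", "ing", "ings")

# Assumed boundary rule: never leave a stem shorter than this many characters.
MIN_STEM_LEN = 2


def strip_suffix(word: str, suffixes: Iterable[str], longest: bool) -> str:
    """Strip one matching suffix from `word`.

    longest=False removes the shortest matching suffix (minimum stripping);
    longest=True removes the longest matching suffix (maximum stripping).
    """
    # Keep only suffixes that match and leave a long enough stem.
    candidates = [
        s for s in suffixes
        if word.endswith(s) and len(word) - len(s) >= MIN_STEM_LEN
    ]
    if not candidates:
        return word  # nothing to strip
    chosen = max(candidates, key=len) if longest else min(candidates, key=len)
    return word[: len(word) - len(chosen)]


if __name__ == "__main__":
    for w in ("readings", "boxes", "is"):
        print(w,
              strip_suffix(w, TOY_SUFFIXES, longest=False),
              strip_suffix(w, TOY_SUFFIXES, longest=True))
```

With these toy suffixes, the two variants disagree whenever one matching suffix is contained in another, which is the kind of case where the choice between minimum and maximum stripping could change the measured accuracy.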
