English Abstract
We report a series of experiments with different semantic models on top of various statistical models for extractive text summarization. Though statistical models may better capture word co-occurrences and distribution across the text, they fail to capture the context and sense of sentences/words as a whole. Semantic models help us gain better insight into the context of sentences. We show how tuning the weights between different models can help us achieve significant results on various benchmarks. Further training the pre-trained vectors used in the semantic models on the given corpus yields an additional gain in performance. Applying weighting techniques among the various statistical models refines our results further. For statistical models, we used TF-IDF, TextRank, and Jaccard/Cosine similarities. For semantic models, we used a WordNet-based model and proposed two models based on GloVe vectors and Facebook's InferSent. We tested our approach on the DUC 2004 dataset, generating 100-word summaries. We discuss the system, algorithms, and analysis, and also propose and test possible improvements. ROUGE scores were used to compare our summarizer with others.
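To make the weighted-combination idea concrete, the following is a minimal sketch (not the authors' actual pipeline) in which a statistical sentence score (mean TF-IDF weight) and a semantic score (similarity of a sentence embedding to the document centroid) are mixed with a single tunable weight before extracting sentences up to a 100-word budget. The function name, the weight value, and the greedy selection are illustrative assumptions.

```python
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

def summarize(sentences, sent_vectors, weight=0.6, word_budget=100):
    """sentences: list of str; sent_vectors: (n, d) array of sentence
    embeddings (e.g. averaged GloVe or InferSent vectors).
    `weight` mixes the statistical and semantic scores (assumed value)."""
    # Statistical score: mean TF-IDF weight of the words in each sentence.
    tfidf = TfidfVectorizer().fit_transform(sentences)
    stat_scores = np.asarray(tfidf.mean(axis=1)).ravel()

    # Semantic score: similarity of each sentence to the document centroid.
    centroid = sent_vectors.mean(axis=0, keepdims=True)
    sem_scores = cosine_similarity(sent_vectors, centroid).ravel()

    # Normalize both score vectors, then combine with the tunable weight.
    stat_scores /= stat_scores.max() + 1e-9
    sem_scores /= sem_scores.max() + 1e-9
    combined = weight * stat_scores + (1 - weight) * sem_scores

    # Greedily pick the highest-scoring sentences until ~100 words.
    picked, words = [], 0
    for i in np.argsort(-combined):
        n = len(sentences[i].split())
        if words + n > word_budget:
            continue
        picked.append(i)
        words += n
    return [sentences[i] for i in sorted(picked)]  # restore document order
```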