月旦知識庫
 
  1. 熱門:
 
首頁 臺灣期刊   法律   公行政治   醫事相關   財經   社會學   教育   其他 大陸期刊   核心   重要期刊 DOI文章
ROCLING論文集 本站僅提供期刊文獻檢索。
  【月旦知識庫】是否收錄該篇全文,敬請【登入】查詢為準。
最新【購點活動】


篇名
可讀性預測於中小學國語文教科書及優良課外讀物之研究
並列篇名
A Study of Readability Prediction on Elementary and Secondary Chinese Textbooks and Excellent Extracurricular Reading
作者 劉憶年陳冠宇曾厚強陳柏琳
中文摘要
可讀性(Readability)是指閱讀材料能夠被讀者理解的程度。可讀性高的文章較容易被讀者理解。文章的可讀性與很多因素有關,如:文長、字詞難度、句法結構、內容是否符合讀者的先備知識等,然而表淺的語言特徵無法反映這些複雜的成分。本論文以先前的研究為基礎,更深入的探討不同種類的特徵,包括句法分析(Syntactic Analysis)、詞性標記(Part-of-Speech, POS)、詞表示法(Word Embedding)、語意資訊(Semantic Information)與寫作程度(Well-written)等特徵,分析比對不同類型的特徵與可讀性高低的關聯性。實驗資料分為二部分:其一為中小學國語文教科書,選自98年度台灣三大出版社所出版的1~9年級(共18冊)審定版國中小國語文教科書;其二為優良課外讀物,選自文化部歷屆「中小學生優良課外讀物」獲選書籍。本論文嘗試透過逐步迴歸與支持向量機等兩種方式建立可讀性模型,比較兩者之效能優劣;最後,再將兩者加以結合,以提升預測之正確率。實驗結果顯示,本論文所提出的可讀性特徵相較於傳統所使用的表淺特徵,在文本難易度評估的任務中,能有顯著的效能提升。
英文摘要
Readability is basically concerned with readers' comprehension of given textual materials: the higher the readability of a document, the easier the document can be understood. It may be affected by various factors, such as document length, word difficulty, sentence structure and whether the content of a document meets the prior knowledge of a reader or not. However, simple surface linguistic features cannot always account for these factors in an appropriate manner. To cater for this, we explore in this study a variety of extra features, including syntactic analysis, parts of speech, word embedding, semantic role features and well-written features. The experimental datasets are composed of two parts: one is textbooks of the Chinese language for elementary and junior high schools (K1 to K9) in Taiwan, compiled from three publishers in the academic year of 2009; the other is excellent extracurricular reading materials for students of elementary and junior high schools, collected by the Ministry of Culture in Taiwan. Two readability prediction models, viz. stepwise regression and support vector machine, are evaluated and compared, while the combination of these two models is also investigated so as to further enhance the accuracy of readability prediction. Experimental results reveal that our proposed approach can yield consistently better performance than traditional ones merely with simple surface linguistic features in evaluating text difficulty.
起訖頁 71-86
關鍵詞 可讀性文本特徵逐步迴歸支持向量機ReadabilityTextual FeaturesStepwise RegressionSupport Vector Machine
刊名 ROCLING論文集  
期數 2015 (2015期)
出版單位 中華民國計算語言學學會
該期刊-上一篇 Explanation Generation for a Math Word Problem Solver
該期刊-下一篇 基於貝氏定理自動分析語料庫與標定文步
 

新書閱讀



最新影音


優惠活動




讀者服務專線:+886-2-23756688 傳真:+886-2-23318496
地址:臺北市館前路28 號 7 樓 客服信箱
Copyright © 元照出版 All rights reserved. 版權所有,禁止轉貼節錄