月旦知識庫
月旦知識庫 會員登入元照網路書店月旦品評家
 
 
  1. 熱門:
首頁 臺灣期刊   法律   公行政治   醫事相關   財經   社會學   教育   其他 大陸期刊   核心   重要期刊 DOI文章
創新科技期刊 本站僅提供期刊文獻檢索。
  【月旦知識庫】是否收錄該篇全文,敬請【登入】查詢為準。
最新【購點活動】


篇名
Enhanced Siamese MaLSTM with ELMo: Incorporating Squared Euclidean Distance and Feature Engineering
並列篇名
Enhanced Siamese MaLSTM with ELMo: Incorporating Squared Euclidean Distance and Feature Engineering
作者 Munazza Nida (Munazza Nida)Khurram Khan (Khurram Khan)Zahid Halim (Zahid Halim)
英文摘要
Identifying duplicate sentences remains a significant challenge in NLP, which is utilised in question-answering and paraphrase detection systems. One such platform is Quora, where users can post questions and answers. Due to the large number of users, it is commonly seen that most of the inquiries that people post are the same. This makes it challenging to ask and answer the same question multiple times in distinct ways. High-quality answers can be obtained by identifying such repeated requests, which could improve the user experience. One of the already existing approaches, which has employed the Siamese MaLSTM Model and ELMo Word Embedding for Quora Questions Detection, utilized the Manhattan Distance for sentence similarity measurement in the Quora Question pairs dataset available on Kaggle. In this paper, we have proposed an enhancement model by incorporating Squared Eu¬clidean Distance alongside Manhattan Distance. Feature engineering is also used to generate additional features, such as sentence length difference and cosine similarity between ELMo embeddings. In addition, a few preprocessing techniques are also applied to improve the effectiveness of data samples. Due to computational constraints, we utilized a subset of the dataset, and the findings showed that the proposed model outperformed the existing one by 2%. Hence, the suggested model has made a substantial contribu¬tion to the detection of duplicate questions. For comparison, we have used multiple transformer-based models from HuggingFace.
起訖頁 55-64
關鍵詞 ELMOQuora Question PairsSquared Euclidean distanceFeature engineeringDuplicate Question Detection
刊名 創新科技期刊  
期數 202509 (7:2期)
出版單位 國立雲林科技大學
該期刊-上一篇 Noise-Induced Hearing Loss Among Medical Students: Evaluating the Impact of Personal Music Player Use and Earbud Dependency
 

新書閱讀



最新影音


優惠活動




讀者服務專線:+886-2-23756688 傳真:+886-2-23318496
地址:臺北市館前路28 號 7 樓 客服信箱
Copyright © 元照出版 All rights reserved. 版權所有,禁止轉貼節錄