Enhanced Siamese MaLSTM with ELMo: Incorporating Squared Euclidean Distance and Feature Engineering

Munazza Nida; Khurram Khan; Zahid Halim

月旦知識庫會員登入｜元照網路書店｜月旦品評家

熱門：

首頁

臺灣期刊 法律公行政治醫事相關財經社會學教育其他

大陸期刊 核心重要期刊

DOI文章

	本站僅提供期刊文獻檢索。　　【月旦知識庫】是否收錄該篇全文，敬請【登入】查詢為準。最新【購點活動】
篇名	Enhanced Siamese MaLSTM with ELMo: Incorporating Squared Euclidean Distance and Feature Engineering
並列篇名	Enhanced Siamese MaLSTM with ELMo: Incorporating Squared Euclidean Distance and Feature Engineering
作者	Munazza Nida (Munazza Nida)、Khurram Khan (Khurram Khan)、Zahid Halim (Zahid Halim)
英文摘要	Identifying duplicate sentences remains a significant challenge in NLP, which is utilised in question-answering and paraphrase detection systems. One such platform is Quora, where users can post questions and answers. Due to the large number of users, it is commonly seen that most of the inquiries that people post are the same. This makes it challenging to ask and answer the same question multiple times in distinct ways. High-quality answers can be obtained by identifying such repeated requests, which could improve the user experience. One of the already existing approaches, which has employed the Siamese MaLSTM Model and ELMo Word Embedding for Quora Questions Detection, utilized the Manhattan Distance for sentence similarity measurement in the Quora Question pairs dataset available on Kaggle. In this paper, we have proposed an enhancement model by incorporating Squared Eu¬clidean Distance alongside Manhattan Distance. Feature engineering is also used to generate additional features, such as sentence length difference and cosine similarity between ELMo embeddings. In addition, a few preprocessing techniques are also applied to improve the effectiveness of data samples. Due to computational constraints, we utilized a subset of the dataset, and the findings showed that the proposed model outperformed the existing one by 2%. Hence, the suggested model has made a substantial contribu¬tion to the detection of duplicate questions. For comparison, we have used multiple transformer-based models from HuggingFace.
起訖頁	55-64
關鍵詞	ELMO、Quora Question Pairs、Squared Euclidean distance、Feature engineering、Duplicate Question Detection
刊名	創新科技期刊
期數	202509 (7:2期)
出版單位	國立雲林科技大學
該期刊-上一篇	Noise-Induced Hearing Loss Among Medical Students: Evaluating the Impact of Personal Music Player Use and Earbud Dependency

新書閱讀

元照讀書館

優惠活動

月旦品評家

元照讀書館

．研討會新訊

月旦知識庫

月旦法律分析庫
月旦醫事法網
月旦會計財稅網

期刊數位服務

社群平台

讀者服務

關於元照

讀者服務專線：+886-2-23756688　傳真：+886-2-23318496
地址：臺北市館前路28 號 7 樓　客服信箱