月旦知識庫
 
  1. 熱門:
 
首頁 臺灣期刊   法律   公行政治   醫事相關   財經   社會學   教育   其他 大陸期刊   核心   重要期刊 DOI文章
電子商務學報 本站僅提供期刊文獻檢索。
  【月旦知識庫】是否收錄該篇全文,敬請【登入】查詢為準。
最新【購點活動】


篇名
幾個快速挖掘關聯規則的資料探勘方法
並列篇名
Several Improved Data Mining Algorithms For Finding Association Rules
作者 陳彥良趙書榮陳禹辰
中文摘要
關聯規則的挖掘,是目前最重要的資料挖掘問題之一,它的目的是要從銷售的交易資料庫中,發現商品項目間的關聯。在過去已經有相當多挖掘關聯規則演算法被提出來,當中FP-tree演算法可說是最主要的演算法之一,並以高執行效率著稱。它的主要概念是不產生candidate itemsets,而將資料庫壓縮在FP-tree的結構中以避免多次的高成本資料庫掃瞄。在本文中,我們針對原本的FP tree演算法,更進一步改進其所用的資料結構以提高挖掘效率。在本文中共建立了三種資料結構:(一)FP-tree_tail演算法,也就是在head table中增加一個tail欄位,(二) FP-treel hash演算法,乃是以hash function計算出每個node所在位置方式建立FP-tree,(三)FP-treel hash+tail演算法,為結合(一)、(二)之優點,所完成之演算法。作者將以上三個演算法與傳統FP_tree演算法一起比較以找出各演算法之優缺點。經由各種實驗數據發現,傳統FP_tree演算法所需花費之時間,為三個改良FP-tree演算法的數十倍。
英文摘要
Mining association rules is one of the most important problems in data mining. Its aim is to discover the associations between items in a large database of sales transactions. In the past, a large number of algorithms for mining association rules have been proposed, and the FP-tree algorithm is one of the most famous ones, known for its efficiency. Unlike the traditional approach that requires many phases of candidate itemsets generation and database scan, the FP-tree algorithm compresses and stores the entire database into a sophisticated tree structure, called FP-tree, by which all the associations can be found by two database scans. In this paper, we attempt to further improve the standard structure of the FP-tree such that the mining performance can be improved. To this end, three variants of the improved FP-tree algorithm are proposed. The first variant is called FP-tree+tail, which adds a tail pointer into the head table of the original FP-tree structure. The second is named as FP-tree hash, which adds a hash table into every node of the FP-tree. Finally, we call the last FP-treel1ash+tail, which is a combination of the first two improvements. Finally, a performance evaluation is done to compare their performances. The result indicates that the three proposed algorithms are about 20-50 times faster than the original FP-tree algorithm.
起訖頁 1-16
關鍵詞 資料挖掘關聯規則交易資料庫Data miningAssociation rule Transaction database
刊名 電子商務學報  
期數 200309 (5:2期)
出版單位 中華企業資源規劃學會
該期刊-下一篇 以資料挖礦法則預測網頁更新規則之研究
 

新書閱讀



最新影音


優惠活動




讀者服務專線:+886-2-23756688 傳真:+886-2-23318496
地址:臺北市館前路28 號 7 樓 客服信箱
Copyright © 元照出版 All rights reserved. 版權所有,禁止轉貼節錄