- Turkish Journal of Electrical Engineering and Computer Science
- Volume:25 Issue:3
- Smart frequent itemsets mining algorithm based on FP-tree and DIFFset data structures
Smart frequent itemsets mining algorithm based on FP-tree and DIFFset data structures
Authors : George GATUHA, Tao JIANG
Pages : 2096-2107
View : 14 | Download : 8
Publication Date : 0000-00-00
Article Type : Research Paper
Abstract :Association rule data mining is an important technique for finding important relationships in large datasets. Several frequent itemsets mining techniques have been proposed using a prefix-tree structure, FP-tree, a compressed data structure for database representation. The DIFFset data structure has also been shown to significantly reduce the run time and memory utilization of some data mining algorithms. Experimental results have demonstrated the efficiency of the two data structures in frequent itemsets mining. This work proposes FDM, a new algorithm based on FP-tree and DIFFset data structures for efficiently discovering frequent patterns in data. FDM can adapt its characteristics to efficiently mine long and short patterns from both dense and sparse datasets. Several optimization techniques are also outlined to increase the efficiency of FDM. An evaluation of FDM against three frequent itemset data mining algorithms, dEclat, FP-growth, and FDM* insert ignore into journalissuearticles values(FDM without optimization);, was performed using datasets having both long and short frequent patterns. The experimental results show significant improvement in performance compared to the FP-growth, dEclat, and FDM* algorithms.Keywords : Association rule data mining, FP tree, Eclat, FP growth, frequent itemsets