GA-Apriori: Combining apriori heuristic and genetic algorithms for solving the frequent itemsets mining problem

Scholarworks@UNIST

UNIST Library

There are no files associated with this item.

Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)

Related Researcher

MarcoComuzzi

Read More

Cited time in webofscience

Cited time in scopus

Metadata Downloads

GA-Apriori: Combining apriori heuristic and genetic algorithms for solving the frequent itemsets mining problem

Citation: 21st Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2017, v.10526 LNAI, pp.138 - 148

Abstract: Finding frequent itemsets is a popular data mining problem, aiming to extract hidden patterns from a transactional database. Several bio-inspired approaches to solve this problem have been proposed to overcome the poor performance of exact algorithms, such as Apriori and FPGrowth. Approaches based on genetic algorithms are among the most efficient ones from the point of view of runtime performance, but they are still inefficient in terms of solution’s quality, i.e., the number of frequent itemsets discovered. To deal with this issue, we propose in this paper a new genetic algorithm for finding frequent itemsets called GA-Apriori, in which the crossover and mutation operators are defined by taking into account the Apriori heuristic principle. The results of our evaluation show that GA-Apriori outperforms other approaches to frequent itemset mining based on genetic algorithms, especially when dealing with large instances. The experiments also show that GA-Apriori is competitive with exact approaches in terms of the number of frequent itemsets discovered.

Publisher: 21st Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2017

qrcode

Tel : 052-217-1403 / Email : scholarworks@unist.ac.kr

ScholarWorks@UNIST was established as an OAK Project for the National Library of Korea.