A COMPARATIVE STUDY ON VARIOUS PARALLEL COMPUTING TECHNIQUES USING APRIORI ALGORITHM

Main Article Content

Sai Kiran Chintalapudi
Harshavardhan Metla
Yeshwanth Kamisetty
Nalluri Rahul
Manikandan K
Siddharth Kolagatla

Abstract

A popular Association Rule Mining algorithm called Apriori algorithm helps in finding various frequent itemsets in the database. The constraints for finding these itemsets are given by the user in terms of support - measured by the proportion of transactions in which an itemset appears, and confidence - measured by the proportion of transactions with an itemset, in which another itemset also appears. The problem with this algorithm is that it is highly iterative and thus its efficiency rapidly decreases with increase in size or dimension of the dataset. Our project increases its efficiency with the help of openMP threads. We use data decomposition to split the transaction database into various parts, each taken by a thread to find the support count of all the candidate itemsets for all the transactions assigned to that particular thread. To give an example of the application, this project is used to determine the probability of the occurrence of a forest fire. Here, the transaction database can consist of various occurrences of natural phenomena, in which a few transactions also have the forest fire phenomenon, which means that it has occurred in the presence of the other itemsets in the transaction. Hence, if a new transaction is taken from the user, then the probability (or confidence) that a forest fire occurs, given this transaction, is calculated.

Downloads

Download data is not yet available.

Article Details

Section
Articles

References

Hu, L., Zhuo, G., & Qiu, Y. (2009, August). Application of Apriori algorithm to the data mining of the wildfire. In Fuzzy Systems and Knowledge Discovery, 2009. FSKD'09. Sixth International Conference on (Vol. 2, pp. 426-429). IEEE.

Korde, N. S., & Shende, S. W. (2014). Parallel Implementation of Apriori Algorithm. IOSR Journal of Computer Science, 01-04.

Chai, S., Yang, J., & Cheng, Y. (2007, June). The research of improved apriori algorithm for mining association rules. In Service Systems and Service Management, 2007 International Conference on (pp. 1-4). IEEE.

Changsheng, Z., Zhongyue, L., & Dongsong, Z. (2009, March). An improved algorithm for apriori. In Education Technology and Computer Science, 2009. ETCS'09. First International Workshop on (Vol. 1, pp. 995-998). IEEE.

Spandana, K., Sirisha, D., & Shahida, S. (2016). Parallelizing Apriori Algorithm on GPU. International Journal of Computer Applications, 155(10).

Rathee, S., Kaul, M., & Kashyap, A. (2015, October). R-Apriori: an efficient apriori based algorithm on spark. In Proceedings of the 8th Workshop on Ph. D. Workshop in Information and Knowledge Management (pp. 27-34). ACM.

Parsania, V., Kamani, G., & Ghodasara, Y. R. (2014). Mining Frequent Itemset Using Parallel Computing Apriori Algorithm.

Wang, G., Yu, X., Peng, D., Cui, Y., & Li, Q. (2010, June). Research of data mining based on Apriori algorithm in cutting database. In Mechanic Automation and Control Engineering (MACE), 2010 International Conference on (pp. 3765-3768). IEEE.

Li, N., Zeng, L., He, Q., & Shi, Z. (2012, August). Parallel implementation of apriori algorithm based on mapreduce. In Software Engineering, Artificial Intelligence, Networking and Parallel & Distributed Computing (SNPD), 2012 13th ACIS International Conference on (pp. 236-241). IEEE.

Shah, A. (2016, July). Association rule mining with modified apriori algorithm using top down approach. In Applied and Theoretical Computing and Communication Technology (iCATccT), 2016 2nd International Conference on (pp. 747-752). IEEE.

Harikumar, S., & Dilipkumar, D. U. (2016, August). Apriori algorithm for association rule mining in high dimensional data. In Data Science and Engineering (ICDSE), 2016 International Conference on (pp. 1-6). IEEE.