A COMPARATIVE STUDY ON VARIOUS PARALLEL COMPUTING TECHNIQUES USING APRIORI ALGORITHM
Abstract
Apriori, a popular association rule mining algorithm, finds frequent itemsets in a transaction database. The user constrains the search with two thresholds: support, the proportion of transactions in which an itemset appears, and confidence, the proportion of transactions containing one itemset in which another itemset also appears. The algorithm is highly iterative, so its efficiency drops rapidly as the size or dimensionality of the dataset grows. Our project improves its efficiency using OpenMP threads. We use data decomposition to split the transaction database into parts; each part is assigned to a thread, which computes the support counts of all candidate itemsets over the transactions assigned to it. As an example application, the project is used to estimate the probability of a forest fire. Here the transaction database consists of recorded occurrences of natural phenomena, and some transactions also include the forest fire phenomenon, meaning that a fire occurred in the presence of the other items in that transaction. Given a new transaction from the user, the probability (or confidence) that a forest fire occurs under those conditions is then calculated.
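The data decomposition described in the abstract might look roughly like the following C sketch. It is an illustration under stated assumptions, not the authors' implementation: the bitmask encoding, the type `itemset_t`, and the functions `count_support`, `support`, and `confidence` are all hypothetical names introduced here. The transaction loop is divided among OpenMP threads, each thread accumulates counts privately, and the per-thread counts are summed by an array reduction (OpenMP 4.5 or later; compile with `-fopenmp`, otherwise the pragma is ignored and the loop runs serially).

```c
#include <assert.h>

/* Assumed encoding (not from the paper): a transaction or candidate
   itemset is a bitmask, with bit i set when item i is present. */
typedef unsigned int itemset_t;

/* For every candidate itemset, count the transactions that contain it.
   schedule(static) gives each thread one contiguous block of
   transactions; the reduction clause gives each thread a private copy
   of counts[0..n_cand-1] and adds the copies together at the end. */
void count_support(const itemset_t *transactions, int n_trans,
                   const itemset_t *candidates, int n_cand,
                   int *counts)
{
    assert(transactions && candidates && counts && n_cand > 0);

    for (int c = 0; c < n_cand; ++c)
        counts[c] = 0;

    #pragma omp parallel for schedule(static) reduction(+:counts[:n_cand])
    for (int t = 0; t < n_trans; ++t) {
        for (int c = 0; c < n_cand; ++c) {
            /* The candidate is contained when all of its bits are set. */
            if ((transactions[t] & candidates[c]) == candidates[c])
                counts[c]++;
        }
    }
}

/* Support: fraction of all transactions that contain the itemset. */
double support(int count, int n_trans)
{
    return n_trans > 0 ? (double)count / n_trans : 0.0;
}

/* Confidence of the rule A => B: among transactions containing A,
   the fraction that contain both A and B. */
double confidence(int count_a_and_b, int count_a)
{
    return count_a > 0 ? (double)count_a_and_b / count_a : 0.0;
}
```

For the forest fire example, one could assume bits 0 to 3 stand for dry weather, wind, heat, and fire. With the five transactions `{0xB, 0x3, 0xF, 0x4, 0x9}` and the candidates `0x3` (dry, wind) and `0xB` (dry, wind, fire), the counts come out as 3 and 2, so the confidence that fire accompanies dry weather and wind is 2/3. The reduction avoids locking a shared counter array, at the cost of one private copy of the counts per thread.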