TWO-PHASE STACKING ENSEMBLE TO EFFECTIVELY HANDLE DATA IMBALANCES IN CLASSIFICATION PROBLEMS

Main Article Content

Madasamy Kaliappan
M. Ramaswami
M. Ramaswami

Abstract

Increase in generation of real-time data resulted in need of more processing requirements. However, processing of such data has several challenges associated with it. One of the major challenges in processing real-time data is to handle the implicit data imbalance. This paper proposes a two-phase stacking ensemble method to handle data imbalances more effectively during classification process. The proposed model utilizes multiple classifier algorithms in the first phase to predict data. The predicted data is used as input for the second phase. The second phase is a meta-learner, operating on predictions rather than the actual data. Experiments were conducted on data with varied imbalance levels. Obtained results indicate high efficiency of the proposed model in predicting with imbalanced data. A comparison with state-of-the-art model indicates improved performance.

Downloads

Download data is not yet available.

Article Details

Section
Articles