A New Hybrid Hard-Fuzzy (K-MFCM) Data Clustering Method for Finding Cluster Centroid

Main Article Content

O. A. Mohamed Jafar

Abstract

Data mining is a collection of methods used to extract useful information from large data bases. Cluster Analysis refers to the grouping of a set of data points into clusters. Most widely used partitioning methods are K-means and Fuzzy c-means (FCM) algorithms. However, they suffer from the difficulties such as random selection of initial centre values and handling outlier data points. Most of the existing clustering methods use the Euclidean distance metric. The modified fuzzy c-means algorithm (MFCM) is efficient in handling outlier data points. In this paper, a new hybrid algorithm is proposed to solve the limitations of the traditional clustering methods. The hybrid K-MFCM algorithm is tested on four real world bench mark data sets from UCI machine learning repository with various distance metrics including Euclidean, City Block and Chessboard. The cluster centroid values of hybrid algorithm are calculated for various data sets. The experimental results show that the hybrid algorithm gives good results in terms of objective function value and better fuzzy cluster validity results for chessboard distance metric than other distance metrics.

Downloads

Download data is not yet available.

Article Details

Section
Articles
Author Biography

O. A. Mohamed Jafar, Associate Professor, PG & Research Department of Computer Science Jamal Mohamed College (Autonomous) Tiruchirappalli, Tamil Nadu, India mdjafar2021@gmail.com

Associate Professor, PG & Research Department of Computer Science Jamal Mohamed College (Autonomous) Tiruchirappalli, Tamil Nadu, India mdjafar2021@gmail.com