A Bootstrap Aggregating Technique on Link-Based Cluster Ensemble Approach for Categorical Data Clustering

INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY

View Publication Info
 
 
Field Value
 
Title A Bootstrap Aggregating Technique on Link-Based Cluster Ensemble Approach for Categorical Data Clustering
 
Creator Reddy, S Pavan Kumar
Sesadri, U
 
Subject Computer Science; Information Technology
BSA; Clustering; categorical data; cluster ensembles; link-based similarity; data mining.
 
Description Although attempts have been made to solve the problem of clustering categorical data via cluster ensembles, with the results being competitive to conventional algorithms, it is observed that these techniques unfortunately generate a final data partition based on incomplete information. The underlying ensemble-information matrix presents only cluster-data point relations, with many entries being left unknown. The paper presents an analysis that suggests this problem degrades the quality of the clustering result, and it presents a BSA (Bootstrap Aggregation) is a machine learning ensemble meta-algorithm designed to improve the stability and accuracy along with a new link-based approach, which improves the conventional matrix by discovering unknown entries through similarity between clusters in an ensemble. In particular, an efficient BSA and link-based algorithm is proposed for the underlying similarity assessment. Afterward, to obtain the final clustering result, a graph partitioning technique is applied to a weighted bipartite graph that is formulated from the refined matrix. Experimental results on multiple real data sets suggest that the proposed link-based method almost always outperforms both conventional clustering algorithms for categorical data and well-known cluster ensemble techniques.
 
Publisher CIRWORLD
 
Date 2011-08-30
 
Type info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
Peer-reviewed Article
 
Format application/pdf
 
Identifier http://cirworld.com/index.php/ijct/article/view/1468
 
Source INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY; Vol 10 No 8; 1913-1921
2277-3061
 
Language eng
 
Relation http://cirworld.com/index.php/ijct/article/view/1468/1432
 
Rights Copyright (c) 2016 INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY
http://creativecommons.org/licenses/by/4.0
 

Contact Us

The PKP Index is an initiative of the Public Knowledge Project.

For PKP Publishing Services please use the PKP|PS contact form.

For support with PKP software we encourage users to consult our wiki for documentation and search our support forums.

For any other correspondence feel free to contact us using the PKP contact form.

Find Us

Twitter

Copyright © 2015-2018 Simon Fraser University Library