Publication:
An efficient preprocessing stage for the relationship-based clustering framework

dc.contributor.authorsBilgin, Turgay Tugay; Camurcu, Ali Yilmaz
dc.date.accessioned2022-03-12T17:48:14Z
dc.date.accessioned2026-01-11T11:07:07Z
dc.date.available2022-03-12T17:48:14Z
dc.date.issued2010
dc.description.abstractThe goal of this study was to develop an efficient clustering framework for processing high-dimensional datasets with reasonable memory and computing power requirements. Strehl and Ghosh proposed a novel clustering approach and developed a framework which is called relationship-based clustering framework [1]. In this study, a preprocessing system has been implemented on top of their approach and it has been integrated into the relationship-based clustering framework. Three different benchmark datasets were used to evaluate its efficiency. The results are presented in various tables and charts, and in addition CLUSION graphs are plotted to enable visual evaluation of cluster quality. It is demonstrated that CPU and memory usage has been substantially decreased compared with Strehl and Ghosh's framework 1, without any noticeable decrease in clustering quality. This fact enables the use of the relationship-based clustering framework for much larger datasets than was heretofore possible, and also increases its scalability with respect to number of dimensions.
dc.identifier.doi10.3233/IDA-2010-0449
dc.identifier.eissn1571-4128
dc.identifier.issn1088-467X
dc.identifier.urihttps://hdl.handle.net/11424/229921
dc.identifier.wosWOS:000284213700008
dc.language.isoeng
dc.publisherIOS PRESS
dc.relation.ispartofINTELLIGENT DATA ANALYSIS
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.subjectData mining
dc.subjectclustering
dc.subjecthigh dimensional data
dc.subjectstratified sampling
dc.titleAn efficient preprocessing stage for the relationship-based clustering framework
dc.typearticle
dspace.entity.typePublication
oaire.citation.endPage748
oaire.citation.issue6
oaire.citation.startPage731
oaire.citation.titleINTELLIGENT DATA ANALYSIS
oaire.citation.volume14

Files