Please use this identifier to cite or link to this item: https://dspace.ctu.edu.vn/jspui/handle/123456789/100262
Title: An ensemble model approach for many-feature data clustering
Authors: Le, Thi Cam Binh
Ngo, Thanh Long
Pham, Van Nha
Pham, The Long
Keywords: Clustering
Classification
Ensemble model
Feature reduction
Many-feature
Big-data
Issue Date: 2021
Series/Report no.: Tạp chí Khoa học Công nghệ Thông tin và Truyền thông; No. 03(CS.01), pp. 04-12
Abstract: Big data processing is attracting the attention of researchers amid the globalization of the fourth industrial revolution. A fundamental property of big data is that it has many features, so powerful knowledge discovery tools are needed to handle it. In this paper, we propose a clustering model for many-feature data that uses advanced machine learning techniques. We call it the ensemble feature-reduction clustering (EFRC) model. The EFRC model consists of three stages. First, the number of features is reduced using a random projection. Then, the data is divided into subsets based on the potential for noise quantification and overlap, and different clustering techniques are used to cluster the subsets. Finally, the results of the clustering modules are merged into a consensus using a classification technique to produce the final clustering result. Experiments were conducted on benchmark datasets. The experimental results demonstrate the superior performance of the EFRC model compared to previous models.
URI: https://dspace.ctu.edu.vn/jspui/handle/123456789/100262
ISSN: 2525-2224
Appears in Collections: Khoa học Công nghệ Thông tin và Truyền thông
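
The abstract names random projection as the feature-reduction step of the first EFRC stage. The sketch below only illustrates that general technique (a Gaussian random projection) and is not the authors' implementation. The function name `random_projection`, the parameters `target_dim` and `seed`, and the toy data are all hypothetical, and the later stages (subset division, clustering, consensus by classification) are not shown.

```python
import math
import random


def random_projection(rows, target_dim, seed=0):
    """Project each row from d features down to target_dim features.

    Illustrative only: a plain Gaussian random projection, not the
    specific variant used in the EFRC paper.
    """
    if not rows:
        return []
    d = len(rows[0])
    rng = random.Random(seed)
    # Gaussian projection matrix (d x target_dim), scaled by
    # 1/sqrt(target_dim) so squared distances are preserved in expectation.
    scale = 1.0 / math.sqrt(target_dim)
    matrix = [
        [rng.gauss(0.0, 1.0) * scale for _ in range(target_dim)]
        for _ in range(d)
    ]
    # Multiply every row by the projection matrix.
    return [
        [sum(row[i] * matrix[i][j] for i in range(d)) for j in range(target_dim)]
        for row in rows
    ]


# Hypothetical toy data: 5 samples with 50 features each.
data_rng = random.Random(1)
data = [[data_rng.random() for _ in range(50)] for _ in range(5)]
reduced = random_projection(data, target_dim=8)
print(len(reduced), len(reduced[0]))
```

The same seed yields the same projection matrix, so repeated calls reduce the data in a reproducible way. A real many-feature dataset would more likely be handled with a numerical library than with nested lists.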

Files in This Item:
File: _file_ (Restricted Access)
Size: 2.91 MB
Format: Adobe PDF

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.