Please use this identifier to cite or link to this item:
https://dspace.ctu.edu.vn/jspui/handle/123456789/100262
Title: | An ensemble model approach for many-feature data clustering |
Authors: | Le, Thi Cam Binh Ngo, Thanh Long Pham, Van Nha Pham, The Long |
Keywords: | Clustering Classification Ensemble model Lecture reduction Many-feature Big-data |
Issue Date: | 2021 |
Series/Report no.: | Tạp chí Khoa học Công nghệ Thông tin và Truyền thông;Số 03(CS.01) .- Tr.04-12 |
Abstract: | Big data processing is attracting the attention of researchers in the context of the globalization of the fourth industrial revolution. A fundamental property of big data is that data has many features. To deal with big data, it is necessary to use powerful tools for knowledge discovery. In this paper, we propose a many-feature data clustering model using advanced machine learning techniques. We call the ensemble feature-reduction clustering model - EFRC. The EFRC model consists of three stages. First, data is reduced-feature using a random projection. Then, data is divided into subsets based on the potential for noise quantification and overlap. Different clustering techniques are then used to cluster the subset of data. Finally, the results of clustering modules are consensus using a classification technique to produce the final clustering result. Some experiments were conducted on benchmark datasets. Experimental results demonstrate the superior performance of the EFRC model compared to the previous models. |
URI: | https://dspace.ctu.edu.vn/jspui/handle/123456789/100262 |
ISSN: | 2525-2224 |
Appears in Collections: | Khoa học Công nghệ Thông tin và Truyền thông |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
_file_ Restricted Access | 2.91 MB | Adobe PDF | ||
Your IP: 52.15.35.129 |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.