Please use this identifier to cite or link to this item:
https://dspace.ctu.edu.vn/jspui/handle/123456789/45238
Title: | STUDENT INFORMATION MANAGEMENT USING CLOUDERA FRAMEWORK |
Authors: | Thái, Minh Tuấn Nguyễn, Thị Anh Thư |
Keywords: | CÔNG NGHỆ THÔNG TIN |
Issue Date: | 2021 |
Publisher: | Trường Đại Học Cần Thơ |
Abstract: | In recent years, facing a dramatic increase in the number and variety of data types. The problem of storing and processing huge amounts of data is making it is difficult for organizations in general; training centers and educational institutions in particular. As a result, this thesis will present the process of applying the technologies of the Cloudera framework to address such a problem. To be more specific, we use Cloudera components to restructure the data of a training center, which is in different formats, into a unified structure data warehouse. After that, we create a website that exploits the data warehouse for managing the student information of the center. Specifically, we use Sqoop to process structured data from DBMSs, e.g., SQL Server and MySQL, while Solr is utilized to process unstructured and semistructured data such as word and excel files. The processed data is then stored and aggregated on HDFS in a unified form. Finally, the student management website is developed using the Flask framework. |
Description: | 54 Tr |
URI: | https://dspace.ctu.edu.vn/jspui/handle/123456789/45238 |
Appears in Collections: | Trường Công nghệ Thông tin & Truyền thông |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
_file_ Restricted Access | 1.93 MB | Adobe PDF | ||
Your IP: 18.117.170.226 |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.