Please use this identifier to cite or link to this item:
https://dspace.ctu.edu.vn/jspui/handle/123456789/124158Full metadata record
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.advisor | Phạm, Thế Phi | - |
| dc.contributor.author | Hà, Nhựt Tuấn | - |
| dc.date.accessioned | 2026-01-10T03:20:54Z | - |
| dc.date.available | 2026-01-10T03:20:54Z | - |
| dc.date.issued | 2025 | - |
| dc.identifier.other | B2112021 | - |
| dc.identifier.uri | https://dspace.ctu.edu.vn/jspui/handle/123456789/124158 | - |
| dc.description | 134 Tr | vi_VN |
| dc.description.abstract | Malware poses a serious threat to modern computer systems and user information, causing significant economic losses and security risks. Therefore, developing effective and scalable malware detection methods remains a critical challenge. Among existing approaches, static analysis based on Portable Executable (PE) file features is widely adopted due to its safety, efficiency, and suitability for large-scale deployment. This thesis proposes a static malware classification system utilizing tree-based machine learning models, including LightGBM, XGBoost, CatBoost, and Random Forest. Experiments are conducted on three datasets with distinct static characteristics: EMBER2018, MalwareBazaar, and a multi-class custom dataset. Model performance is evaluated using standard metrics such as Accuracy, Precision, Recall, F1-score, ROC-AUC, and inference time. In addition to in-domain evaluation, cross-domain experiments are conducted to assess the models' generalization under domain-shift conditions. The experimental results show that while the models achieve high accuracy and low inference latency in in-domain scenarios, their performance degrades significantly in cross-dataset evaluations, particularly when feature distributions or class structures differ across datasets. These findings highlight the substantial impact of domain shift and underscore the need for additional training strategies, fine-tuning, or domain adaptation techniques to ensure robust malware detection in real-world deployments. Keywords: Malware detection, Static analysis, Portable Executable, Machine learning, Domain shift, Cross-dataset evaluation. | vi_VN |
| dc.language.iso | en | vi_VN |
| dc.publisher | Trường Đại Học Cần Thơ | vi_VN |
| dc.subject | CÔNG NGHỆ THÔNG TIN - CHẤT LƯỢNG CAO | vi_VN |
| dc.title | STATIC FEATURE-BASED MALWARE CLASSIFICATION | vi_VN |
| dc.title.alternative | PHÂN LOẠI MÃ ĐỘC SỬ DỤNG ĐẶC TRƯNG TĨNH | vi_VN |
| dc.type | Thesis | vi_VN |
| Appears in Collections: | Trường Công nghệ Thông tin & Truyền thông | |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| _file_ Restricted Access | 4.41 MB | Adobe PDF | ||
| Your IP: 216.73.216.105 |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.