Please use this identifier to cite or link to this item: https://dspace.ctu.edu.vn/jspui/handle/123456789/124166
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorThái, Minh Tuấn-
dc.contributor.authorPhan, Thị Hồng Nguyên-
dc.date.accessioned2026-01-10T03:34:01Z-
dc.date.available2026-01-10T03:34:01Z-
dc.date.issued2025-
dc.identifier.otherB2105679-
dc.identifier.urihttps://dspace.ctu.edu.vn/jspui/handle/123456789/124166-
dc.description53 Trvi_VN
dc.description.abstractIn the era of rapid digital transformation, online communication platforms have become integral to daily life in Vietnam, accompanied by a significant surge in sophisticated cyber fraud. Scammers continuously adapt their linguistic patterns to bypass traditional keyword-based filters, rendering reactive security measures ineffective. Consequently, there is a critical need for advanced Natural Language Processing (NLP) methodologies capable of deeply understanding the semantic nuances of the Vietnamese language to identify these evolving threats. Addressing this challenge, this thesis conducts a comparative analysis of two distinct state-of-the-art NLP architectures: PhoBERT and a fine-tuned Gemini model. PhoBERT, a pre-trained language model optimized for Vietnamese, is utilized to assess the efficacy of discriminative modeling in extracting contextual features for robust text classification. Complementing this, the research investigates the advanced reasoning capabilities of Large Language Models (LLMs) by fine-tuning Google’s Gemini to analyze complex narrative structures and psychological triggers, such as urgency and authority impersonation, that traditional models often miss. The models are evaluated on a curated dataset of real-world scam messages designed to distinguish malicious intent from legitimate communications. The experimental results provide empirical evidence on the trade-offs between the syntactic precision of discriminative models and the semantic reasoning of generative models. Ultimately, this study offers critical insights into the applicability of deep learning approaches for proactive scam detection, highlighting the potential of semantic analysis in mitigating 'zero-day' vulnerabilities inherent in conventional defense mechanisms.vi_VN
dc.language.isoenvi_VN
dc.publisherTrường Đại Học Cần Thơvi_VN
dc.subjectCÔNG NGHỆ THÔNG TIN - CHẤT LƯỢNG CAOvi_VN
dc.titleDETECTING SCAM CONTENT USING PHOBERT AND FINE-TUNED GEMINIvi_VN
dc.title.alternativePHÁT TRIỂN MÔ HÌNH NHẬN BIẾT CÁC NỘI DUNG CÓ TÍNH LỪA ĐẢOvi_VN
dc.typeThesisvi_VN
Appears in Collections:Trường Công nghệ Thông tin & Truyền thông

Files in This Item:
File Description SizeFormat 
_file_
  Restricted Access
1.01 MBAdobe PDF
Your IP: 216.73.216.63


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.