DETECTING SCAM CONTENT USING PHOBERT AND FINE-TUNED GEMINI

Phan, Thị Hồng Nguyên

Vui lòng dùng định danh này để trích dẫn hoặc liên kết đến tài liệu này: https://dspace.ctu.edu.vn/jspui/handle/123456789/124166

Nhan đề:	DETECTING SCAM CONTENT USING PHOBERT AND FINE-TUNED GEMINI
Nhan đề khác:	PHÁT TRIỂN MÔ HÌNH NHẬN BIẾT CÁC NỘI DUNG CÓ TÍNH LỪA ĐẢO
Tác giả:	Thái, Minh Tuấn Phan, Thị Hồng Nguyên
Từ khoá:	CÔNG NGHỆ THÔNG TIN - CHẤT LƯỢNG CAO
Năm xuất bản:	2025
Nhà xuất bản:	Trường Đại Học Cần Thơ
Tóm tắt:	In the era of rapid digital transformation, online communication platforms have become integral to daily life in Vietnam, accompanied by a significant surge in sophisticated cyber fraud. Scammers continuously adapt their linguistic patterns to bypass traditional keyword-based filters, rendering reactive security measures ineffective. Consequently, there is a critical need for advanced Natural Language Processing (NLP) methodologies capable of deeply understanding the semantic nuances of the Vietnamese language to identify these evolving threats. Addressing this challenge, this thesis conducts a comparative analysis of two distinct state-of-the-art NLP architectures: PhoBERT and a fine-tuned Gemini model. PhoBERT, a pre-trained language model optimized for Vietnamese, is utilized to assess the efficacy of discriminative modeling in extracting contextual features for robust text classification. Complementing this, the research investigates the advanced reasoning capabilities of Large Language Models (LLMs) by fine-tuning Google’s Gemini to analyze complex narrative structures and psychological triggers, such as urgency and authority impersonation, that traditional models often miss. The models are evaluated on a curated dataset of real-world scam messages designed to distinguish malicious intent from legitimate communications. The experimental results provide empirical evidence on the trade-offs between the syntactic precision of discriminative models and the semantic reasoning of generative models. Ultimately, this study offers critical insights into the applicability of deep learning approaches for proactive scam detection, highlighting the potential of semantic analysis in mitigating 'zero-day' vulnerabilities inherent in conventional defense mechanisms.
Mô tả:	53 Tr
Định danh:	https://dspace.ctu.edu.vn/jspui/handle/123456789/124166
Bộ sưu tập:	Trường Công nghệ Thông tin & Truyền thông

Các tập tin trong tài liệu này:

Tập tin	Mô tả	Kích thước	Định dạng
_file_ Giới hạn truy cập		1.01 MB	Adobe PDF
Your IP: 216.73.216.105

Hiển thị đầy đủ biểu ghi tài liệu Xem thống kê

Khi sử dụng các tài liệu trong Thư viện số phải tuân thủ Luật bản quyền.

Thư viện số DSPACE

Thư viện số cho phép quản lý các nguồn tài liệu số như: Văn bản, hình ảnh, âm thanh, phim ảnh...