Please use this identifier to cite or link to this item:
https://dspace.ctu.edu.vn/jspui/handle/123456789/125270| Title: | BUILDING AN ENGLISH VOCABULARY LEARNING APPLICATION WITH LISTENING AND PRONUNCIATION PRACTICE |
| Other Titles: | XÂY DỰNG ỨNG DỤNG HỌC TỪ VỰNG, LUYỆN NGHE VÀ PHÁT ÂM TIẾNG ANH |
| Authors: | Bùi, Võ Quốc Bảo Châu, Đình Thông |
| Keywords: | CÔNG NGHỆ THÔNG TIN - CHẤT LƯỢNG CAO |
| Issue Date: | 2025 |
| Publisher: | Trường Đại Học Cần Thơ |
| Abstract: | Currently, searching and cross-referencing information in Vietnamese administrative documents is mainly done manually, which is time-consuming, prone to missing important content, and carries the risk of misapplying regulations, especially when handling multiple complex hierarchical documents simultaneously. The thesis develops a chatbot system for querying administrative documents based on the Retrieval-Augmented Generation (RAG) architecture. The system supports uploading multiple documents, automatically extracting text, performing logical chunking suitable for regulatory documents, generating semantic embeddings, indexing, and conducting efficient semantic search. Upon receiving natural language queries, the system retrieves relevant passages, refines result ranking, and generates responses entirely based on the original content, eliminating the hallucination phenomenon. The system runs locally with a user-friendly web interface, integrating features for user management, document management, chat history, and data backup. Testing results on real administrative documents demonstrate fast response times, high accuracy, low hardware requirements, and suitability for practical deployment. |
| Description: | 95 Tr |
| URI: | https://dspace.ctu.edu.vn/jspui/handle/123456789/125270 |
| Appears in Collections: | Trường Công nghệ Thông tin & Truyền thông |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| _file_ Restricted Access | 4.41 MB | Adobe PDF | ||
| Your IP: 216.73.216.55 |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.