Vui lòng dùng định danh này để trích dẫn hoặc liên kết đến tài liệu này:
https://dspace.ctu.edu.vn/jspui/handle/123456789/74475
Nhan đề: | BUILDING A PLAGIARISM DETECTION SYSTEM MODULE: INDEXING AND VECTOR DATABASE |
Tác giả: | Trần, Công Án Lý, Hiểu Sang |
Từ khoá: | CÔNG NGHỆ THÔNG TIN-CHẤT LƯỢNG CAO |
Năm xuất bản: | 2021 |
Nhà xuất bản: | Trường Đại Học Cần Thơ |
Tóm tắt: | With the rapid development of the Internet, there are thousands of published scientific articles and technical guides, which has led to plagiarism becoming more common in the academic environment. This shows that a plagiarism detection system is absolutely necessary. However, the challenge in detecting plagiarism is the very high search time. Fortunately, in recent years, with the great advances in the field of ML, there have been many technological solutions that can be used to solve this problem. This study deals with Milvus and Faiss. Milvus is a vector database and Faiss is a library with indexing support. These two technologies are very important in solving the similarity search problem. In this report, the rationale for using these two technologies in the plagiarism detection system is presented as well as an overview of the architecture of Milvus, Faiss indexing and its performance. To this end, information on related studies was researched and summarized, the theory of the vector database and index was examined, and finally the results were tested and evaluated. In summary, this report has presented the application of Milvus and Faiss for plagiarism detection systems with the aim of improving the time for similarity search and providing the theoretical basis of Milvus and Faiss as a basis for other relevant studies. |
Mô tả: | 35 Tr |
Định danh: | https://dspace.ctu.edu.vn/jspui/handle/123456789/74475 |
Bộ sưu tập: | Trường Công nghệ Thông tin & Truyền thông |
Các tập tin trong tài liệu này:
Tập tin | Mô tả | Kích thước | Định dạng | |
---|---|---|---|---|
_file_ Giới hạn truy cập | 796.66 kB | Adobe PDF | ||
Your IP: 3.148.115.187 |
Khi sử dụng các tài liệu trong Thư viện số phải tuân thủ Luật bản quyền.