IMAGE CAPTIONING USING EFFICIENTNETB2 AND DEEP TRANSFORMER

Trần, Dương Mỹ Thuận

Please use this identifier to cite or link to this item: https://dspace.ctu.edu.vn/jspui/handle/123456789/94521

Title:	IMAGE CAPTIONING USING EFFICIENTNETB2 AND DEEP TRANSFORMER
Other title:	XÂY DỰNG MÔ HÌNH SINH CÂU MÔ TẢ ẢNH SỬ DỤNG MÔ HÌNH EFFICIENTNETB2 VÀ DEEP TRANSFORMER
Authors:	Lâm, Nhựt Khang; Trần, Dương Mỹ Thuận
Keywords:	INFORMATION TECHNOLOGY - HIGH QUALITY
Year of publication:	2023
Publisher:	Can Tho University
Abstract:	Image captioning is a field at the intersection of computer vision and natural language processing (NLP) within artificial intelligence. Its fundamental objective is to automatically generate natural descriptive sentences that express the content of an image. Image captioning models often integrate convolutional neural networks (CNNs) for image feature extraction with recurrent neural networks (RNNs) or transformers for generating coherent sentences. In this thesis, we combine EfficientNetB2 and a Deep Transformer to generate a natural descriptive sentence from an image. We perform experiments on both the English and Vietnamese Flickr8k datasets and assess the model's performance using the BLEU metric. The experimental results show that the combination of EfficientNetB2 and Deep Transformer effectively generates captions for images in Vietnamese. The BLEU-1, BLEU-2, BLEU-3, and BLEU-4 scores are 0.510, 0.211, 0.083, and 0.030 on the English Flickr8k dataset, and 0.532, 0.282, 0.155, and 0.075 on the Vietnamese Flickr8k dataset.
Description:	40 pages
Identifier:	https://dspace.ctu.edu.vn/jspui/handle/123456789/94521
Collection:	College of Information and Communication Technology
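
The abstract reports BLEU-1 through BLEU-4 scores. As a rough illustration of what those numbers measure, the following is a minimal pure-Python sketch of cumulative sentence-level BLEU (clipped n-gram precision, uniform weights, brevity penalty, no smoothing). The record does not say which BLEU implementation, tokenization, or smoothing the thesis used, so the function names and the example sentences here are hypothetical and the sketch is not expected to reproduce the reported scores.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams (as tuples) in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def modified_precision(candidate, references, n):
    """Clipped n-gram precision of a candidate against its references."""
    cand_counts = ngrams(candidate, n)
    if not cand_counts:
        return 0.0
    # A candidate n-gram is credited at most as many times as it occurs
    # in the reference where it is most frequent.
    max_ref = Counter()
    for ref in references:
        for gram, count in ngrams(ref, n).items():
            max_ref[gram] = max(max_ref[gram], count)
    clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
    return clipped / sum(cand_counts.values())


def brevity_penalty(candidate, references):
    """Penalize candidates shorter than the closest reference length."""
    c = len(candidate)
    if c == 0:
        return 0.0
    # Closest reference length; ties are broken toward the shorter reference.
    r = min((abs(len(ref) - c), len(ref)) for ref in references)[1]
    return 1.0 if c > r else math.exp(1 - r / c)


def bleu(candidate, references, max_n):
    """Cumulative BLEU-max_n with uniform weights and no smoothing."""
    precisions = [modified_precision(candidate, references, n)
                  for n in range(1, max_n + 1)]
    if min(precisions) == 0.0:
        return 0.0  # the geometric mean is zero if any precision is zero
    log_mean = sum(math.log(p) for p in precisions) / max_n
    return brevity_penalty(candidate, references) * math.exp(log_mean)


if __name__ == "__main__":
    # Hypothetical caption and references, not taken from Flickr8k.
    candidate = "a dog runs on the grass".split()
    references = ["a dog runs across the grass".split(),
                  "the dog is running on grass".split()]
    for n in range(1, 5):
        print(f"BLEU-{n}: {bleu(candidate, references, n):.3f}")
```

Because higher-order n-grams match less often, cumulative scores fall from BLEU-1 to BLEU-4, which is the same pattern seen in the reported results. Published evaluations usually aggregate counts over the whole test set (corpus-level BLEU) rather than averaging per-sentence scores, so this sketch is only meant to show the mechanics.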

Files in this item:

File	Description	Size	Format
_file_ Restricted access		1.41 MB	Adobe PDF
