Please use this identifier to cite or link to this item: https://dspace.ctu.edu.vn/jspui/handle/123456789/94521
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorLâm, Nhựt Khang-
dc.contributor.authorTrần, Dương Mỹ Thuận-
dc.date.accessioned2024-01-10T07:18:50Z-
dc.date.available2024-01-10T07:18:50Z-
dc.date.issued2023-
dc.identifier.otherB1809723-
dc.identifier.urihttps://dspace.ctu.edu.vn/jspui/handle/123456789/94521-
dc.description40 Trvi_VN
dc.description.abstractImage captioning is a fascinating field that converges computer vision and natural language processing (NLP) within the vast field of artificial intelligence. The fundamental objective is to automatically generate natural descriptive sentences that express the content of an image. Various image captioning models often using integrating convolutional neural networks (CNNs) for image feature extraction and recurrent neural networks (RNNs) or transformers for generating coherent sentences. In this thesis, we employ a combination of EfficientNetB2 and Deep Transformer to generate a natural descriptive sentence from image. We perform experiments on both the English and Vietnamese Flickr8k dataset, then assessing the performance of this through the BLEU metric. The experimental results show that a combination of EfficientNetB2 and Deep Transformer effectively generates captions for images in Vietnamese. The BLEU-1,2,3, and 4 scores of the models on the English and Vietnamese Flickr8k datasets are 0.510, 0.211, 0.083, 0.030; and 0.532, 0.282, 0.155, 0.075, respectively.vi_VN
dc.language.isoenvi_VN
dc.publisherTrường Đại Học Cần Thơvi_VN
dc.subjectCÔNG NGHỆ THÔNG TIN - CHẤT LƯỢNG CAOvi_VN
dc.titleIMAGE CAPTIONING USING EFFICIENTNETB2 AND DEEP TRANSFORMERvi_VN
dc.title.alternativeXÂY DỰNG MÔ HÌNH SINH CÂU MÔ TẢ ẢNH SỬ DỤNG MÔ HÌNH EFFICIENTNETB2 VÀ DEEP TRANSFORMERvi_VN
dc.typeThesisvi_VN
Appears in Collections:Trường Công nghệ Thông tin & Truyền thông

Files in This Item:
File Description SizeFormat 
_file_
  Restricted Access
1.41 MBAdobe PDF
Your IP: 3.144.95.22


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.