Please use this identifier to cite or link to this item:
https://dspace.ctu.edu.vn/jspui/handle/123456789/73742
Title: | VIETNAMESE ABSTRACTIVE TEXT SUMMARIZATION USING TRANSFORMER |
Authors: | Lâm, Nhựt Khang; Huỳnh, Thanh Bảo |
Keywords: | INFORMATION TECHNOLOGY - HIGH QUALITY |
Issue Date: | 2021 |
Publisher: | Trường Đại Học Cần Thơ |
Abstract: | In recent years, the amount of text data from a variety of sources has exploded. This volume of text is an invaluable source of information and knowledge, but it must be effectively summarized to be useful, so a text summarization technique that reduces the amount of text is essential. With the evolution of machine learning, many Natural Language Processing (NLP) models have been introduced. This thesis applies the Transformer, a state-of-the-art technique, together with a pre-trained model to build a new machine learning model that takes long sentences as input and rewrites them as shorter ones with the same meaning. The model is trained and tested on both Vietnamese and English online articles. Many methods exist for evaluating the quality of summaries; ROUGE is the most commonly used in the NLP field, so it is used in this thesis. After evaluation, the results of the model are considered good and acceptable compared to other text summarization models. The result of the thesis is an abstractive Vietnamese summarization model that lets users generate a summary preserving the original meaning of the source text at the sentence level. |
Description: | 41 pages |
URI: | https://dspace.ctu.edu.vn/jspui/handle/123456789/73742 |
Appears in Collections: | Trường Công nghệ Thông tin & Truyền thông |
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
_file_ (Restricted Access) | | 1.1 MB | Adobe PDF |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.