Please use this identifier to cite or link to this item: https://dspace.ctu.edu.vn/jspui/handle/123456789/119557
Title: Graph-based and generative approaches to multi-document summarization
Authors: Thanh, Tam Doan
Nguyen, Tan Minh
Nguyen, Thai Binh
Nguyen, Hoang Trung
Nguyen, Hai Long
Tran, Mai Vu
Ha, Quang Thuy
Nguyen, Ha Thanh
Keywords: Multi-document summarization
Abstractive summarization
NLP
Graph-based
Generative models
Issue Date: 2024
Series/Report no.: Journal of Computer Science and Cybernetics;Vol.40, No.03 .- P.203-217
Abstract: Multi-document summarization is a challenging problem in the Natural Language Processing field that has drawn a lot of interest from the research community. In this paper, we propose a two-phase pipeline to tackle the Vietnamese abstractive multi-document summarization task. The initial phase of the pipeline involves an extractive summarization stage including two different systems. The first system employs a hybrid model based on the TextRank algorithm and a text correlation consideration mechanism. The second system is a modified version of SummPip - an unsupervised graph-based method for multi-document summarization. The second phase of the pipeline is abstractive summarization models. Particularly, generative models are applied to produce abstractive summaries from previous phase outputs. The proposed method achieves competitive results as we surpassed many strong research teams to finish the first rank in the AbMusu task - Vietnamese abstractive multi-document summarization, organized in the VLSP 2022 workshop.
URI: https://dspace.ctu.edu.vn/jspui/handle/123456789/119557
ISSN: 1813-9663
Appears in Collections:Tin học và Điều khiển học (Journal of Computer Science and Cybernetics)

Files in This Item:
File Description SizeFormat 
_file_
  Restricted Access
947.07 kBAdobe PDF
Your IP: 216.73.216.220


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.