Show simple item record

dc.contributor.advisor  Ruocco, Massimiliano
dc.contributor.advisor  Aune, Erlend
dc.contributor.author  Lie, Borgar Rannem
dc.contributor.author  Kalmar, Alf Niklas Håkonsen
dc.date.accessioned  2018-11-08T15:00:35Z
dc.date.available  2018-11-08T15:00:35Z
dc.date.created  2018-06-28
dc.date.issued  2018
dc.identifier  ntnudaim:19457
dc.identifier.uri  http://hdl.handle.net/11250/2571695
dc.description.abstract  News articles, papers, and encyclopedias, among other texts, can be time-consuming to digest. Often, you are not interested in reading all the material, but only some of it. Summaries can be useful for getting a grasp of what such texts are about. Generating a summary is itself time-consuming, because you need to read the text and understand which parts are important. This makes it very attractive to generate summaries automatically using a computer program. Abstractive text summarization has gained a lot of attention in recent years, and the standard supervised learning approach has shown promising results when used to train abstractive text summarization models. However, such models are limited by the assumption that the ground truth is provided at each time step during training. This is not the case at test time, where the previously generated word is provided instead. This creates a gap between training and testing, also known as "exposure bias" (see the sketch after this record). In this thesis, we explore how to improve an abstractive text summarization model by employing reinforcement learning and generative adversarial networks, which do not assume that the ground truth is provided during training. As a base model to improve upon, we implement a variation of the Pointer-Generator Network [See et al., 2017]. Many implementation details and parameter choices are important for training stability and convergence, but they are mostly left out of research papers. In this regard, we conduct an extensive study of how different training strategies, parameters, and objective functions affect training stability and convergence, as well as the generated summaries. Another problem with training abstractive text summarization models is that it is generally very time-consuming. To cope with this, we propose code optimization techniques that help speed up training. We show improvements over the base model in terms of ROUGE scores, as well as differences in the generated summaries, using the objective functions ROUGE-1, ROUGE-2, a discriminator (adversarial training), and a combination of ROUGE-2 and the discriminator.
dc.language  eng
dc.publisher  NTNU
dc.subject  Datateknologi (2 årig), Kunstig intelligens
dc.title  Deep Reinforcement Learning and Generative Adversarial Networks for Abstractive Text Summarization
dc.type  Master thesis
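
The abstract above refers to "exposure bias": during training the decoder is fed the ground-truth word at every step (teacher forcing), while at test time it is fed its own previous prediction. The sketch below is an illustrative toy example of that difference, not the thesis implementation; PyTorch, the helper name decode, and the toy dimensions are all assumptions made for illustration.

import torch
import torch.nn as nn

vocab_size, emb_dim, hidden_dim, steps = 1000, 64, 128, 10
embed = nn.Embedding(vocab_size, emb_dim)      # hypothetical toy decoder
cell = nn.GRUCell(emb_dim, hidden_dim)
out_proj = nn.Linear(hidden_dim, vocab_size)

def decode(h, first_token, target=None):
    # If `target` is given, apply teacher forcing (ground truth at each step);
    # otherwise feed back the model's own greedy prediction, as at test time.
    token, logits_per_step = first_token, []
    for t in range(steps):
        h = cell(embed(token), h)              # one decoder step
        logits = out_proj(h)                   # scores over the vocabulary
        logits_per_step.append(logits)
        token = target[:, t] if target is not None else logits.argmax(dim=-1)
    return torch.stack(logits_per_step, dim=1)

batch = 2
h0 = torch.zeros(batch, hidden_dim)
start = torch.zeros(batch, dtype=torch.long)          # toy <start> token id 0
gold = torch.randint(0, vocab_size, (batch, steps))   # toy reference summary ids

train_logits = decode(h0, start, target=gold)  # training: ground truth fed in
test_logits = decode(h0, start)                # testing: own predictions fed in

At test time the model is conditioned on its own predictions, which it never saw during training; the reinforcement learning and adversarial objectives explored in the thesis avoid this by training directly on generated sequences.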


Associated file(s)


This item appears in the following collection(s)
