
dc.contributor.advisor: Ruocco, Massimiliano
dc.contributor.author: Havikbotn, Eivind Tveita
dc.date.accessioned: 2019-09-11T10:55:39Z
dc.date.created: 2018-06-19
dc.date.issued: 2018
dc.identifier: ntnudaim:18121
dc.identifier.uri: http://hdl.handle.net/11250/2615790
dc.description.abstract: Neural machine translation models based on attention and pointer mechanisms have in recent studies been successfully applied to the task of abstractive summarization of long documents such as news articles. Although state-of-the-art architectures exhibit abstractive capabilities, it has been observed that these models mostly copy large fragments from the source, even in scenarios where they should paraphrase and use novel word combinations. In this thesis we explore the possibility of improving the novelty of model-generated summaries. After training strong baseline models by combining architectural components from state-of-the-art systems, we attempt to improve novelty by (1) selective data sampling, (2) adding a novel extraction loss component, and (3) engineering reward functions that capture novelty for use in optimization by reinforcement learning. We explore multiple parameters for each approach and present quantitative scores, in terms of relative ROUGE increase, alongside qualitative output from each model. For our reinforcement learning experiments we demonstrate higher ROUGE scores compared to previous work using a joint policy gradient loss and a single model architecture. However, the textual quality of our generated summaries remains to be determined.
dc.language: eng
dc.publisher: NTNU
dc.subject: Datateknologi, Kunstig intelligens (Computer Technology, Artificial Intelligence)
dc.title: Tuning Abstractive Summarization Models Towards Increased Novelty
dc.type: Master thesis
dc.source.pagenumber: 91
dc.contributor.department: Norges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi og elektroteknikk, Institutt for datateknologi og informatikk
dc.date.embargoenddate: 10000-01-01
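
The abstract's third approach, reward functions that capture novelty for reinforcement learning, can be illustrated with a minimal sketch. The thesis text of the reward is not reproduced here; the function names, the n-gram novelty measure, and the alpha weighting below are all illustrative assumptions, not the author's actual formulation.

    # A hedged sketch (not the thesis's actual reward): blend a quality
    # score (e.g., ROUGE against the reference summary) with the fraction
    # of summary n-grams that do not appear in the source document.
    # All names and the weighting scheme are illustrative assumptions.

    def ngrams(tokens, n):
        """Return the set of n-grams in a token list."""
        return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}

    def novel_ngram_fraction(summary_tokens, source_tokens, n=3):
        """Fraction of summary n-grams absent from the source."""
        summary_ngrams = ngrams(summary_tokens, n)
        if not summary_ngrams:
            return 0.0
        source_ngrams = ngrams(source_tokens, n)
        return len(summary_ngrams - source_ngrams) / len(summary_ngrams)

    def novelty_reward(summary_tokens, source_tokens, quality_score, alpha=0.5):
        """Reward for policy-gradient training: alpha trades off
        quality against n-gram novelty relative to the source."""
        novelty = novel_ngram_fraction(summary_tokens, source_tokens)
        return (1 - alpha) * quality_score + alpha * novelty

In a policy-gradient setup such a scalar reward would weight the log-likelihood of sampled summaries; raising alpha pushes the model toward novel phrasings at some cost in overlap-based quality, which mirrors the quality-versus-novelty trade-off the abstract describes.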

