Generating Semantically Similar and Human-Readable Summaries With Generative Adversarial Networks

The application of neural networks in natural language processing, including abstractive text summarization, is increasingly attractive in recent years. However, teaching a neural network to generate a human-readable summary that reflects the core idea of the original source text (i.e., semantically...

Full description

Bibliographic Details
Main Authors: Haojie Zhuang, Weibin Zhang
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8910526/
id doaj-dc595e68cae1458ebac48ba281bfd270
record_format Article
spelling doaj-dc595e68cae1458ebac48ba281bfd2702021-03-30T00:26:55ZengIEEEIEEE Access2169-35362019-01-01716942616943310.1109/ACCESS.2019.29550878910526Generating Semantically Similar and Human-Readable Summaries With Generative Adversarial NetworksHaojie Zhuang0https://orcid.org/0000-0001-6426-1814Weibin Zhang1Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, ChinaSchool of Electronic and Information Engineering, South China University of Technology (SCUT), Guangzhou, ChinaThe application of neural networks in natural language processing, including abstractive text summarization, is increasingly attractive in recent years. However, teaching a neural network to generate a human-readable summary that reflects the core idea of the original source text (i.e., semantically similar) remains a challenging problem. In this paper, we explore using generative adversarial networks to solve this problem. The proposed model contains three components: a generator that encodes the long input text into a shorter representation; a discriminator to teach the generator to create human-readable summaries and another discriminator to restrict the output of the generator to reflect the core idea of the input text. The main training process can be carried out in an adversarial learning process. To solve the non-differentiable problem caused by the words sampling process, we use the policy gradient algorithm to optimize the generator. We evaluate the proposed model on the CNN/Daily Mail summarization task. The experimental results show that the model outperforms previous state-of-the-art models.https://ieeexplore.ieee.org/document/8910526/Abstractive text summarizationgenerative adversarial networksnatural language processing
collection DOAJ
language English
format Article
sources DOAJ
author Haojie Zhuang
Weibin Zhang
spellingShingle Haojie Zhuang
Weibin Zhang
Generating Semantically Similar and Human-Readable Summaries With Generative Adversarial Networks
IEEE Access
Abstractive text summarization
generative adversarial networks
natural language processing
author_facet Haojie Zhuang
Weibin Zhang
author_sort Haojie Zhuang
title Generating Semantically Similar and Human-Readable Summaries With Generative Adversarial Networks
title_short Generating Semantically Similar and Human-Readable Summaries With Generative Adversarial Networks
title_full Generating Semantically Similar and Human-Readable Summaries With Generative Adversarial Networks
title_fullStr Generating Semantically Similar and Human-Readable Summaries With Generative Adversarial Networks
title_full_unstemmed Generating Semantically Similar and Human-Readable Summaries With Generative Adversarial Networks
title_sort generating semantically similar and human-readable summaries with generative adversarial networks
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description The application of neural networks in natural language processing, including abstractive text summarization, is increasingly attractive in recent years. However, teaching a neural network to generate a human-readable summary that reflects the core idea of the original source text (i.e., semantically similar) remains a challenging problem. In this paper, we explore using generative adversarial networks to solve this problem. The proposed model contains three components: a generator that encodes the long input text into a shorter representation; a discriminator to teach the generator to create human-readable summaries and another discriminator to restrict the output of the generator to reflect the core idea of the input text. The main training process can be carried out in an adversarial learning process. To solve the non-differentiable problem caused by the words sampling process, we use the policy gradient algorithm to optimize the generator. We evaluate the proposed model on the CNN/Daily Mail summarization task. The experimental results show that the model outperforms previous state-of-the-art models.
topic Abstractive text summarization
generative adversarial networks
natural language processing
url https://ieeexplore.ieee.org/document/8910526/
work_keys_str_mv AT haojiezhuang generatingsemanticallysimilarandhumanreadablesummarieswithgenerativeadversarialnetworks
AT weibinzhang generatingsemanticallysimilarandhumanreadablesummarieswithgenerativeadversarialnetworks
_version_ 1724188290998861824