Generating Semantically Similar and Human-Readable Summaries With Generative Adversarial Networks
The application of neural networks in natural language processing, including abstractive text summarization, is increasingly attractive in recent years. However, teaching a neural network to generate a human-readable summary that reflects the core idea of the original source text (i.e., semantically...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8910526/ |
id |
doaj-dc595e68cae1458ebac48ba281bfd270 |
---|---|
record_format |
Article |
spelling |
doaj-dc595e68cae1458ebac48ba281bfd2702021-03-30T00:26:55ZengIEEEIEEE Access2169-35362019-01-01716942616943310.1109/ACCESS.2019.29550878910526Generating Semantically Similar and Human-Readable Summaries With Generative Adversarial NetworksHaojie Zhuang0https://orcid.org/0000-0001-6426-1814Weibin Zhang1Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, ChinaSchool of Electronic and Information Engineering, South China University of Technology (SCUT), Guangzhou, ChinaThe application of neural networks in natural language processing, including abstractive text summarization, is increasingly attractive in recent years. However, teaching a neural network to generate a human-readable summary that reflects the core idea of the original source text (i.e., semantically similar) remains a challenging problem. In this paper, we explore using generative adversarial networks to solve this problem. The proposed model contains three components: a generator that encodes the long input text into a shorter representation; a discriminator to teach the generator to create human-readable summaries and another discriminator to restrict the output of the generator to reflect the core idea of the input text. The main training process can be carried out in an adversarial learning process. To solve the non-differentiable problem caused by the words sampling process, we use the policy gradient algorithm to optimize the generator. We evaluate the proposed model on the CNN/Daily Mail summarization task. The experimental results show that the model outperforms previous state-of-the-art models.https://ieeexplore.ieee.org/document/8910526/Abstractive text summarizationgenerative adversarial networksnatural language processing |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Haojie Zhuang Weibin Zhang |
spellingShingle |
Haojie Zhuang Weibin Zhang Generating Semantically Similar and Human-Readable Summaries With Generative Adversarial Networks IEEE Access Abstractive text summarization generative adversarial networks natural language processing |
author_facet |
Haojie Zhuang Weibin Zhang |
author_sort |
Haojie Zhuang |
title |
Generating Semantically Similar and Human-Readable Summaries With Generative Adversarial Networks |
title_short |
Generating Semantically Similar and Human-Readable Summaries With Generative Adversarial Networks |
title_full |
Generating Semantically Similar and Human-Readable Summaries With Generative Adversarial Networks |
title_fullStr |
Generating Semantically Similar and Human-Readable Summaries With Generative Adversarial Networks |
title_full_unstemmed |
Generating Semantically Similar and Human-Readable Summaries With Generative Adversarial Networks |
title_sort |
generating semantically similar and human-readable summaries with generative adversarial networks |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
The application of neural networks in natural language processing, including abstractive text summarization, is increasingly attractive in recent years. However, teaching a neural network to generate a human-readable summary that reflects the core idea of the original source text (i.e., semantically similar) remains a challenging problem. In this paper, we explore using generative adversarial networks to solve this problem. The proposed model contains three components: a generator that encodes the long input text into a shorter representation; a discriminator to teach the generator to create human-readable summaries and another discriminator to restrict the output of the generator to reflect the core idea of the input text. The main training process can be carried out in an adversarial learning process. To solve the non-differentiable problem caused by the words sampling process, we use the policy gradient algorithm to optimize the generator. We evaluate the proposed model on the CNN/Daily Mail summarization task. The experimental results show that the model outperforms previous state-of-the-art models. |
topic |
Abstractive text summarization generative adversarial networks natural language processing |
url |
https://ieeexplore.ieee.org/document/8910526/ |
work_keys_str_mv |
AT haojiezhuang generatingsemanticallysimilarandhumanreadablesummarieswithgenerativeadversarialnetworks AT weibinzhang generatingsemanticallysimilarandhumanreadablesummarieswithgenerativeadversarialnetworks |
_version_ |
1724188290998861824 |