Evaluation of pooling operations in convolutional architectures for drug-drug interaction extraction

Abstract Background Deep Neural Networks (DNN), in particular, Convolutional Neural Networks (CNN), has recently achieved state-of-art results for the task of Drug-Drug Interaction (DDI) extraction. Most CNN architectures incorporate a pooling layer to reduce the dimensionality of the convolution la...

Full description

Bibliographic Details
Main Authors:	Víctor Suárez-Paniagua, Isabel Segura-Bedmar
Format:	Article
Language:	English
Published:	BMC 2018-06-01
Series:	BMC Bioinformatics
Subjects:	Deep learning Convolutional neural network Pooling Attention model Drug-drug interaction extraction
Online Access:	http://link.springer.com/article/10.1186/s12859-018-2195-1

id	doaj-dba792d88a2843b1b309771fc836fc90
record_format	Article
spelling	doaj-dba792d88a2843b1b309771fc836fc902020-11-24T22:17:20ZengBMCBMC Bioinformatics1471-21052018-06-0119S8394710.1186/s12859-018-2195-1Evaluation of pooling operations in convolutional architectures for drug-drug interaction extractionVíctor Suárez-Paniagua0Isabel Segura-Bedmar1Computer Science Department, Carlos III University of MadridComputer Science Department, Carlos III University of MadridAbstract Background Deep Neural Networks (DNN), in particular, Convolutional Neural Networks (CNN), has recently achieved state-of-art results for the task of Drug-Drug Interaction (DDI) extraction. Most CNN architectures incorporate a pooling layer to reduce the dimensionality of the convolution layer output, preserving relevant features and removing irrelevant details. All the previous CNN based systems for DDI extraction used max-pooling layers. Results In this paper, we evaluate the performance of various pooling methods (in particular max-pooling, average-pooling and attentive pooling), as well as their combination, for the task of DDI extraction. Our experiments show that max-pooling exhibits a higher performance in F1-score (64.56%) than attentive pooling (59.92%) and than average-pooling (58.35%). Conclusions Max-pooling outperforms the others alternatives because is the only one which is invariant to the special pad tokens that are appending to the shorter sentences known as padding. Actually, the combination of max-pooling and attentive pooling does not improve the performance as compared with the single max-pooling technique.http://link.springer.com/article/10.1186/s12859-018-2195-1Deep learningConvolutional neural networkPoolingAttention modelDrug-drug interaction extraction
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Víctor Suárez-Paniagua Isabel Segura-Bedmar
spellingShingle	Víctor Suárez-Paniagua Isabel Segura-Bedmar Evaluation of pooling operations in convolutional architectures for drug-drug interaction extraction BMC Bioinformatics Deep learning Convolutional neural network Pooling Attention model Drug-drug interaction extraction
author_facet	Víctor Suárez-Paniagua Isabel Segura-Bedmar
author_sort	Víctor Suárez-Paniagua
title	Evaluation of pooling operations in convolutional architectures for drug-drug interaction extraction
title_short	Evaluation of pooling operations in convolutional architectures for drug-drug interaction extraction
title_full	Evaluation of pooling operations in convolutional architectures for drug-drug interaction extraction
title_fullStr	Evaluation of pooling operations in convolutional architectures for drug-drug interaction extraction
title_full_unstemmed	Evaluation of pooling operations in convolutional architectures for drug-drug interaction extraction
title_sort	evaluation of pooling operations in convolutional architectures for drug-drug interaction extraction
publisher	BMC
series	BMC Bioinformatics
issn	1471-2105
publishDate	2018-06-01
description	Abstract Background Deep Neural Networks (DNN), in particular, Convolutional Neural Networks (CNN), has recently achieved state-of-art results for the task of Drug-Drug Interaction (DDI) extraction. Most CNN architectures incorporate a pooling layer to reduce the dimensionality of the convolution layer output, preserving relevant features and removing irrelevant details. All the previous CNN based systems for DDI extraction used max-pooling layers. Results In this paper, we evaluate the performance of various pooling methods (in particular max-pooling, average-pooling and attentive pooling), as well as their combination, for the task of DDI extraction. Our experiments show that max-pooling exhibits a higher performance in F1-score (64.56%) than attentive pooling (59.92%) and than average-pooling (58.35%). Conclusions Max-pooling outperforms the others alternatives because is the only one which is invariant to the special pad tokens that are appending to the shorter sentences known as padding. Actually, the combination of max-pooling and attentive pooling does not improve the performance as compared with the single max-pooling technique.
topic	Deep learning Convolutional neural network Pooling Attention model Drug-drug interaction extraction
url	http://link.springer.com/article/10.1186/s12859-018-2195-1
work_keys_str_mv	AT victorsuarezpaniagua evaluationofpoolingoperationsinconvolutionalarchitecturesfordrugdruginteractionextraction AT isabelsegurabedmar evaluationofpoolingoperationsinconvolutionalarchitecturesfordrugdruginteractionextraction
_version_	1725785382826541056

Evaluation of pooling operations in convolutional architectures for drug-drug interaction extraction

Similar Items