Information Extraction Using Distant Supervision and Semantic Similarities

Information extraction is one of the main research tasks in natural language processing and text mining that extracts useful information from unstructured sentences. Information extraction techniques include named entity recognition, relation extraction, and co-reference resolution. Among them, rel...

Full description

Bibliographic Details
Main Authors: PARK, Y., KANG, S., SEO, J.
Format: Article
Language:English
Published: Stefan cel Mare University of Suceava 2016-02-01
Series:Advances in Electrical and Computer Engineering
Subjects:
Online Access:http://dx.doi.org/10.4316/AECE.2016.01002
id doaj-6e65559863cb42d19582dbeced12e52f
record_format Article
spelling doaj-6e65559863cb42d19582dbeced12e52f2020-11-24T21:20:15ZengStefan cel Mare University of SuceavaAdvances in Electrical and Computer Engineering1582-74451844-76002016-02-01161111810.4316/AECE.2016.01002Information Extraction Using Distant Supervision and Semantic SimilaritiesPARK, Y.KANG, S.SEO, J.Information extraction is one of the main research tasks in natural language processing and text mining that extracts useful information from unstructured sentences. Information extraction techniques include named entity recognition, relation extraction, and co-reference resolution. Among them, relation extraction refers to a task that extracts semantic relations between entities such as personal and geographic names in documents. This is an important research area, which is used in knowledge base construction and question and answering systems. This study presents relation extraction using a distant supervision learning technique among semi-supervised learning methods, which have been spotlighted in recent years to reduce human manual work and costs required for supervised learning. That is, this study proposes a method that can improve relation extraction by improving a distant supervision learning technique by applying a clustering method to create a learning corpus and semantic analysis for relation extraction that is difficult to identify using existing distant supervision. Through comparison experiments of various semantic similarity comparison methods, similarity calculation methods that are useful to relation extraction using distant supervision are searched, and a large number of accurate relation triples can be extracted using the proposed structural advantages and semantic similarity comparison.http://dx.doi.org/10.4316/AECE.2016.01002relation extractionunsupervised learningdistant supervisioninformation extractionnatural language processing
collection DOAJ
language English
format Article
sources DOAJ
author PARK, Y.
KANG, S.
SEO, J.
spellingShingle PARK, Y.
KANG, S.
SEO, J.
Information Extraction Using Distant Supervision and Semantic Similarities
Advances in Electrical and Computer Engineering
relation extraction
unsupervised learning
distant supervision
information extraction
natural language processing
author_facet PARK, Y.
KANG, S.
SEO, J.
author_sort PARK, Y.
title Information Extraction Using Distant Supervision and Semantic Similarities
title_short Information Extraction Using Distant Supervision and Semantic Similarities
title_full Information Extraction Using Distant Supervision and Semantic Similarities
title_fullStr Information Extraction Using Distant Supervision and Semantic Similarities
title_full_unstemmed Information Extraction Using Distant Supervision and Semantic Similarities
title_sort information extraction using distant supervision and semantic similarities
publisher Stefan cel Mare University of Suceava
series Advances in Electrical and Computer Engineering
issn 1582-7445
1844-7600
publishDate 2016-02-01
description Information extraction is one of the main research tasks in natural language processing and text mining that extracts useful information from unstructured sentences. Information extraction techniques include named entity recognition, relation extraction, and co-reference resolution. Among them, relation extraction refers to a task that extracts semantic relations between entities such as personal and geographic names in documents. This is an important research area, which is used in knowledge base construction and question and answering systems. This study presents relation extraction using a distant supervision learning technique among semi-supervised learning methods, which have been spotlighted in recent years to reduce human manual work and costs required for supervised learning. That is, this study proposes a method that can improve relation extraction by improving a distant supervision learning technique by applying a clustering method to create a learning corpus and semantic analysis for relation extraction that is difficult to identify using existing distant supervision. Through comparison experiments of various semantic similarity comparison methods, similarity calculation methods that are useful to relation extraction using distant supervision are searched, and a large number of accurate relation triples can be extracted using the proposed structural advantages and semantic similarity comparison.
topic relation extraction
unsupervised learning
distant supervision
information extraction
natural language processing
url http://dx.doi.org/10.4316/AECE.2016.01002
work_keys_str_mv AT parky informationextractionusingdistantsupervisionandsemanticsimilarities
AT kangs informationextractionusingdistantsupervisionandsemanticsimilarities
AT seoj informationextractionusingdistantsupervisionandsemanticsimilarities
_version_ 1726003209700376576