Financial News Mining: : Extracting useful Information from Continuous Streams of Text

Online financial news sources continuously publish information about actors involved in the Norwegian financial market. These are often short messages describing temporal relations. However, the amount of information is overwhelming and it requires a great effort to stay up to date on both the lates...

Full description

Bibliographic Details
Main Authors: Lægreid, Tarjei, Sandal, Paul Christian
Format: Others
Language:English
Published: Norges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskap 2006
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-10091
id ndltd-UPSALLA1-oai-DiVA.org-ntnu-10091
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-ntnu-100912013-01-08T13:26:43ZFinancial News Mining: : Extracting useful Information from Continuous Streams of TextengLægreid, TarjeiSandal, Paul ChristianNorges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskapNorges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskapInstitutt for datateknikk og informasjonsvitenskap2006ntnudaimSIF2 datateknikkProgram- og informasjonssystemerOnline financial news sources continuously publish information about actors involved in the Norwegian financial market. These are often short messages describing temporal relations. However, the amount of information is overwhelming and it requires a great effort to stay up to date on both the latest news and historical relations. Therefore it would have been advantageous to automatically analyse the information. In this report we present a framework for identifying actors and relations between them. Text mining techniques are employed to extract the relations and how they evolve over time. Techniques such as part of speech tagging, named entity identification, along with traditional information retrieval and information extraction methods are employed. Features extracted from the news articles are represented as vectors in a vector space. The framework employs the feature vectors to identify and describe relations between entities in the financial market. A qualitative evaluation of the framework shows that the approach has promising results. Our main finding is that vector representations of features have potential for detecting relations between actors, and how these relations evolve. We also found that the approach taken is dependent on an accurate identification of named entities. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-10091Local ntnudaim:1436application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic ntnudaim
SIF2 datateknikk
Program- og informasjonssystemer
spellingShingle ntnudaim
SIF2 datateknikk
Program- og informasjonssystemer
Lægreid, Tarjei
Sandal, Paul Christian
Financial News Mining: : Extracting useful Information from Continuous Streams of Text
description Online financial news sources continuously publish information about actors involved in the Norwegian financial market. These are often short messages describing temporal relations. However, the amount of information is overwhelming and it requires a great effort to stay up to date on both the latest news and historical relations. Therefore it would have been advantageous to automatically analyse the information. In this report we present a framework for identifying actors and relations between them. Text mining techniques are employed to extract the relations and how they evolve over time. Techniques such as part of speech tagging, named entity identification, along with traditional information retrieval and information extraction methods are employed. Features extracted from the news articles are represented as vectors in a vector space. The framework employs the feature vectors to identify and describe relations between entities in the financial market. A qualitative evaluation of the framework shows that the approach has promising results. Our main finding is that vector representations of features have potential for detecting relations between actors, and how these relations evolve. We also found that the approach taken is dependent on an accurate identification of named entities.
author Lægreid, Tarjei
Sandal, Paul Christian
author_facet Lægreid, Tarjei
Sandal, Paul Christian
author_sort Lægreid, Tarjei
title Financial News Mining: : Extracting useful Information from Continuous Streams of Text
title_short Financial News Mining: : Extracting useful Information from Continuous Streams of Text
title_full Financial News Mining: : Extracting useful Information from Continuous Streams of Text
title_fullStr Financial News Mining: : Extracting useful Information from Continuous Streams of Text
title_full_unstemmed Financial News Mining: : Extracting useful Information from Continuous Streams of Text
title_sort financial news mining: : extracting useful information from continuous streams of text
publisher Norges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskap
publishDate 2006
url http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-10091
work_keys_str_mv AT lægreidtarjei financialnewsminingextractingusefulinformationfromcontinuousstreamsoftext
AT sandalpaulchristian financialnewsminingextractingusefulinformationfromcontinuousstreamsoftext
_version_ 1716520322613641216