Exploring Financial Risk via Text Mining Approaches

碩士 === 國立政治大學 === 資訊科學學系 === 103 === In recent years, there have been some studies using machine learning techniques to predict stock tendency and investment risks in finance. There have also been some applications that analyze the textual information in fi- nancial reports, financial news, or even...

Full description

Bibliographic Details
Main Author: 劉澤
Other Authors: 蔡銘峰
Format: Others
Language:en_US
Online Access:http://ndltd.ncl.edu.tw/handle/v8a7f8
id ndltd-TW-103NCCU5394022
record_format oai_dc
spelling ndltd-TW-103NCCU53940222019-05-15T22:17:24Z http://ndltd.ncl.edu.tw/handle/v8a7f8 Exploring Financial Risk via Text Mining Approaches 以文字探勘為基礎之財務風險分析方法研究 劉澤 碩士 國立政治大學 資訊科學學系 103 In recent years, there have been some studies using machine learning techniques to predict stock tendency and investment risks in finance. There have also been some applications that analyze the textual information in fi- nancial reports, financial news, or even twitters on social network to provide useful information for stock investors. In this paper, we focus on the problem that uses the textual information in financial reports and numerical informa- tion of companies to predict the financial risk. We use the textual information in financial report of companies to predict the financial risk in the following year. We utilize stock volatility to measure financial risk. In the first part of the thesis, we use a finance-specific sentiment lexicon to improve the pre- diction models that are trained only textual information of financial reports. Then we also provide a sentiment analysis to the results. In the second part of the thesis, we attempt to combine the textual information and the numeri- cal information, such as stock returns to further improve the performance of the prediction models. In specific, in the proposed approach each company instance associated with its financial textual information will be weighted by its stock returns by using the cost-sensitive learning techniques. Our experi- mental results show that, finance-specific sentiment lexicon models conduct comparable performance to those on the original texts, which confirms the importance of financial sentiment words on risk prediction. More impor- tantly, the learned models suggest strong correlations between financial sen- timent words and risk of companies. In addition, our cost-sensitive results significantly improve the cost-insensitive results. As a result, these findings identify the impact of sentiment words in financial reports, and the numerical information can be utilized as the cost weights of learning techniques. 蔡銘峰 學位論文 ; thesis 31 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立政治大學 === 資訊科學學系 === 103 === In recent years, there have been some studies using machine learning techniques to predict stock tendency and investment risks in finance. There have also been some applications that analyze the textual information in fi- nancial reports, financial news, or even twitters on social network to provide useful information for stock investors. In this paper, we focus on the problem that uses the textual information in financial reports and numerical informa- tion of companies to predict the financial risk. We use the textual information in financial report of companies to predict the financial risk in the following year. We utilize stock volatility to measure financial risk. In the first part of the thesis, we use a finance-specific sentiment lexicon to improve the pre- diction models that are trained only textual information of financial reports. Then we also provide a sentiment analysis to the results. In the second part of the thesis, we attempt to combine the textual information and the numeri- cal information, such as stock returns to further improve the performance of the prediction models. In specific, in the proposed approach each company instance associated with its financial textual information will be weighted by its stock returns by using the cost-sensitive learning techniques. Our experi- mental results show that, finance-specific sentiment lexicon models conduct comparable performance to those on the original texts, which confirms the importance of financial sentiment words on risk prediction. More impor- tantly, the learned models suggest strong correlations between financial sen- timent words and risk of companies. In addition, our cost-sensitive results significantly improve the cost-insensitive results. As a result, these findings identify the impact of sentiment words in financial reports, and the numerical information can be utilized as the cost weights of learning techniques.
author2 蔡銘峰
author_facet 蔡銘峰
劉澤
author 劉澤
spellingShingle 劉澤
Exploring Financial Risk via Text Mining Approaches
author_sort 劉澤
title Exploring Financial Risk via Text Mining Approaches
title_short Exploring Financial Risk via Text Mining Approaches
title_full Exploring Financial Risk via Text Mining Approaches
title_fullStr Exploring Financial Risk via Text Mining Approaches
title_full_unstemmed Exploring Financial Risk via Text Mining Approaches
title_sort exploring financial risk via text mining approaches
url http://ndltd.ncl.edu.tw/handle/v8a7f8
work_keys_str_mv AT liúzé exploringfinancialriskviatextminingapproaches
AT liúzé yǐwénzìtànkānwèijīchǔzhīcáiwùfēngxiǎnfēnxīfāngfǎyánjiū
_version_ 1719127097353437184