Summary: | 碩士 === 國立屏東科技大學 === 資訊管理系所 === 102 === The data loading of internet keeps growing with the evolution of information technology, these data have lots of implied knowledge, but unstructured type and noise could be a problem, so we study to refine "Text Mining" for analysis process.
Considering about quality and quantity of data, valid convenience, and time cost, we subject financial news to predict the trend of stock market for research, through HTML parser and CKIP for bag-of-words processing and WEKA for machine learning.
Base on the importance of document itself, we propose a context oriented feature assign method by fluctuation of each stock, and investigate the feasibility by sampling test.
|