Performance Analysis of Statistical and Supervised Learning Techniques in Stock Data Mining
Nowadays, overwhelming stock data is available, which areonly of use if it is properly examined and mined. In this paper, the last twelve years of ICICI Bank’s stock data have been extensively examined using statistical and supervised learning techniques. This study may be of great interes...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2018-11-01
|
Series: | Data |
Subjects: | |
Online Access: | https://www.mdpi.com/2306-5729/3/4/54 |
id |
doaj-6faafdb910d1485d9b7d713deeb21bc0 |
---|---|
record_format |
Article |
spelling |
doaj-6faafdb910d1485d9b7d713deeb21bc02020-11-24T23:31:41ZengMDPI AGData2306-57292018-11-01345410.3390/data3040054data3040054Performance Analysis of Statistical and Supervised Learning Techniques in Stock Data MiningManik Sharma0Samriti Sharma1Gurvinder Singh2Department of Computer Science and Applications, DAV University, Jalandhar 144401, IndiaDepartment of Computer Science, Guru Nanak Dev University, Amritar 143001, IndiaDepartment of Computer Science, Guru Nanak Dev University, Amritar 143001, IndiaNowadays, overwhelming stock data is available, which areonly of use if it is properly examined and mined. In this paper, the last twelve years of ICICI Bank’s stock data have been extensively examined using statistical and supervised learning techniques. This study may be of great interest for those who wish to mine or study the stock data of banks or any financial organization. Different statistical measures have been computed to explore the nature, range, distribution, and deviation of data. The different descriptive statistical measures assist in finding different valuable metrics such as mean, variance, skewness, kurtosis, <i>p</i>-value, a-squared, and 95% confidence mean interval level of ICICI Bank’s stock data. Moreover, daily percentage changes occurring over the last 12 years have also been recorded and examined. Additionally, the intraday stock status has been mined using ten different classifiers. The performance of different classifiers has been evaluated on the basis of various parameters such as accuracy, misclassification rate, precision, recall, specificity, and sensitivity. Based upon different parameters, the predictive results obtained using logistic regression are more acceptable than the outcomes of other classifiers, whereas naïve Bayes, C4.5, random forest, linear discriminant, and cubic support vector machine (SVM) merely act as a random guessing machine. The outstanding performance of logistic regression has been validated using TOPSIS (technique for order preference by similarity to ideal solution) and WSA (weighted sum approach).https://www.mdpi.com/2306-5729/3/4/54stock forecastingnaïve BayesC4.5random forestlogistic regressionsupport vector machine |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Manik Sharma Samriti Sharma Gurvinder Singh |
spellingShingle |
Manik Sharma Samriti Sharma Gurvinder Singh Performance Analysis of Statistical and Supervised Learning Techniques in Stock Data Mining Data stock forecasting naïve Bayes C4.5 random forest logistic regression support vector machine |
author_facet |
Manik Sharma Samriti Sharma Gurvinder Singh |
author_sort |
Manik Sharma |
title |
Performance Analysis of Statistical and Supervised Learning Techniques in Stock Data Mining |
title_short |
Performance Analysis of Statistical and Supervised Learning Techniques in Stock Data Mining |
title_full |
Performance Analysis of Statistical and Supervised Learning Techniques in Stock Data Mining |
title_fullStr |
Performance Analysis of Statistical and Supervised Learning Techniques in Stock Data Mining |
title_full_unstemmed |
Performance Analysis of Statistical and Supervised Learning Techniques in Stock Data Mining |
title_sort |
performance analysis of statistical and supervised learning techniques in stock data mining |
publisher |
MDPI AG |
series |
Data |
issn |
2306-5729 |
publishDate |
2018-11-01 |
description |
Nowadays, overwhelming stock data is available, which areonly of use if it is properly examined and mined. In this paper, the last twelve years of ICICI Bank’s stock data have been extensively examined using statistical and supervised learning techniques. This study may be of great interest for those who wish to mine or study the stock data of banks or any financial organization. Different statistical measures have been computed to explore the nature, range, distribution, and deviation of data. The different descriptive statistical measures assist in finding different valuable metrics such as mean, variance, skewness, kurtosis, <i>p</i>-value, a-squared, and 95% confidence mean interval level of ICICI Bank’s stock data. Moreover, daily percentage changes occurring over the last 12 years have also been recorded and examined. Additionally, the intraday stock status has been mined using ten different classifiers. The performance of different classifiers has been evaluated on the basis of various parameters such as accuracy, misclassification rate, precision, recall, specificity, and sensitivity. Based upon different parameters, the predictive results obtained using logistic regression are more acceptable than the outcomes of other classifiers, whereas naïve Bayes, C4.5, random forest, linear discriminant, and cubic support vector machine (SVM) merely act as a random guessing machine. The outstanding performance of logistic regression has been validated using TOPSIS (technique for order preference by similarity to ideal solution) and WSA (weighted sum approach). |
topic |
stock forecasting naïve Bayes C4.5 random forest logistic regression support vector machine |
url |
https://www.mdpi.com/2306-5729/3/4/54 |
work_keys_str_mv |
AT maniksharma performanceanalysisofstatisticalandsupervisedlearningtechniquesinstockdatamining AT samritisharma performanceanalysisofstatisticalandsupervisedlearningtechniquesinstockdatamining AT gurvindersingh performanceanalysisofstatisticalandsupervisedlearningtechniquesinstockdatamining |
_version_ |
1725536502897704960 |