Performance Analysis of Statistical and Supervised Learning Techniques in Stock Data Mining

Nowadays, overwhelming stock data is available, which areonly of use if it is properly examined and mined. In this paper, the last twelve years of ICICI Bank’s stock data have been extensively examined using statistical and supervised learning techniques. This study may be of great interes...

Full description

Bibliographic Details
Main Authors: Manik Sharma, Samriti Sharma, Gurvinder Singh
Format: Article
Language:English
Published: MDPI AG 2018-11-01
Series:Data
Subjects:
Online Access:https://www.mdpi.com/2306-5729/3/4/54
id doaj-6faafdb910d1485d9b7d713deeb21bc0
record_format Article
spelling doaj-6faafdb910d1485d9b7d713deeb21bc02020-11-24T23:31:41ZengMDPI AGData2306-57292018-11-01345410.3390/data3040054data3040054Performance Analysis of Statistical and Supervised Learning Techniques in Stock Data MiningManik Sharma0Samriti Sharma1Gurvinder Singh2Department of Computer Science and Applications, DAV University, Jalandhar 144401, IndiaDepartment of Computer Science, Guru Nanak Dev University, Amritar 143001, IndiaDepartment of Computer Science, Guru Nanak Dev University, Amritar 143001, IndiaNowadays, overwhelming stock data is available, which areonly of use if it is properly examined and mined. In this paper, the last twelve years of ICICI Bank&#8217;s stock data have been extensively examined using statistical and supervised learning techniques. This study may be of great interest for those who wish to mine or study the stock data of banks or any financial organization. Different statistical measures have been computed to explore the nature, range, distribution, and deviation of data. The different descriptive statistical measures assist in finding different valuable metrics such as mean, variance, skewness, kurtosis, <i>p</i>-value, a-squared, and 95% confidence mean interval level of ICICI Bank&#8217;s stock data. Moreover, daily percentage changes occurring over the last 12 years have also been recorded and examined. Additionally, the intraday stock status has been mined using ten different classifiers. The performance of different classifiers has been evaluated on the basis of various parameters such as accuracy, misclassification rate, precision, recall, specificity, and sensitivity. Based upon different parameters, the predictive results obtained using logistic regression are more acceptable than the outcomes of other classifiers, whereas na&#239;ve Bayes, C4.5, random forest, linear discriminant, and cubic support vector machine (SVM) merely act as a random guessing machine. The outstanding performance of logistic regression has been validated using TOPSIS (technique for order preference by similarity to ideal solution) and WSA (weighted sum approach).https://www.mdpi.com/2306-5729/3/4/54stock forecastingnaïve BayesC4.5random forestlogistic regressionsupport vector machine
collection DOAJ
language English
format Article
sources DOAJ
author Manik Sharma
Samriti Sharma
Gurvinder Singh
spellingShingle Manik Sharma
Samriti Sharma
Gurvinder Singh
Performance Analysis of Statistical and Supervised Learning Techniques in Stock Data Mining
Data
stock forecasting
naïve Bayes
C4.5
random forest
logistic regression
support vector machine
author_facet Manik Sharma
Samriti Sharma
Gurvinder Singh
author_sort Manik Sharma
title Performance Analysis of Statistical and Supervised Learning Techniques in Stock Data Mining
title_short Performance Analysis of Statistical and Supervised Learning Techniques in Stock Data Mining
title_full Performance Analysis of Statistical and Supervised Learning Techniques in Stock Data Mining
title_fullStr Performance Analysis of Statistical and Supervised Learning Techniques in Stock Data Mining
title_full_unstemmed Performance Analysis of Statistical and Supervised Learning Techniques in Stock Data Mining
title_sort performance analysis of statistical and supervised learning techniques in stock data mining
publisher MDPI AG
series Data
issn 2306-5729
publishDate 2018-11-01
description Nowadays, overwhelming stock data is available, which areonly of use if it is properly examined and mined. In this paper, the last twelve years of ICICI Bank&#8217;s stock data have been extensively examined using statistical and supervised learning techniques. This study may be of great interest for those who wish to mine or study the stock data of banks or any financial organization. Different statistical measures have been computed to explore the nature, range, distribution, and deviation of data. The different descriptive statistical measures assist in finding different valuable metrics such as mean, variance, skewness, kurtosis, <i>p</i>-value, a-squared, and 95% confidence mean interval level of ICICI Bank&#8217;s stock data. Moreover, daily percentage changes occurring over the last 12 years have also been recorded and examined. Additionally, the intraday stock status has been mined using ten different classifiers. The performance of different classifiers has been evaluated on the basis of various parameters such as accuracy, misclassification rate, precision, recall, specificity, and sensitivity. Based upon different parameters, the predictive results obtained using logistic regression are more acceptable than the outcomes of other classifiers, whereas na&#239;ve Bayes, C4.5, random forest, linear discriminant, and cubic support vector machine (SVM) merely act as a random guessing machine. The outstanding performance of logistic regression has been validated using TOPSIS (technique for order preference by similarity to ideal solution) and WSA (weighted sum approach).
topic stock forecasting
naïve Bayes
C4.5
random forest
logistic regression
support vector machine
url https://www.mdpi.com/2306-5729/3/4/54
work_keys_str_mv AT maniksharma performanceanalysisofstatisticalandsupervisedlearningtechniquesinstockdatamining
AT samritisharma performanceanalysisofstatisticalandsupervisedlearningtechniquesinstockdatamining
AT gurvindersingh performanceanalysisofstatisticalandsupervisedlearningtechniquesinstockdatamining
_version_ 1725536502897704960