Mid-price prediction based on machine learning methods with technical and quantitative indicators.
Stock price prediction is a challenging task, in which machine learning methods have recently been successfully used. In this paper, we extract over 270 hand-crafted features (factors) inspired by technical indicators and quantitative analysis and test their validity on short-term mid-price movement...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2020-01-01
|
Series: | PLoS ONE |
Online Access: | https://doi.org/10.1371/journal.pone.0234107 |
id |
doaj-55a519a4b9a144e59ba5a2b2747395d5 |
---|---|
record_format |
Article |
spelling |
doaj-55a519a4b9a144e59ba5a2b2747395d52021-03-03T21:51:51ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-01156e023410710.1371/journal.pone.0234107Mid-price prediction based on machine learning methods with technical and quantitative indicators.Adamantios NtakarisJuho KanniainenMoncef GabboujAlexandros IosifidisStock price prediction is a challenging task, in which machine learning methods have recently been successfully used. In this paper, we extract over 270 hand-crafted features (factors) inspired by technical indicators and quantitative analysis and test their validity on short-term mid-price movement prediction for Nordic TotalView-ITCH stocks. The suggested feature list represents one of the most extensive studies in the field of financial feature engineering. We focus on a wrapper feature selection method using entropy, least-mean squares, and linear discriminant analysis. We also introduce a novel quantitative feature based on adaptive logistic regression for online learning. The proposed feature is consistently selected as the first feature among a large number of indicators used in this study. We further examine the best combinations of features using a high-frequency limit order book Nordic database. Our results suggest that sorting methods and classifiers can be used in such a way that one can reach the best classification performance with a combination of only a few advanced hand-crafted features.https://doi.org/10.1371/journal.pone.0234107 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Adamantios Ntakaris Juho Kanniainen Moncef Gabbouj Alexandros Iosifidis |
spellingShingle |
Adamantios Ntakaris Juho Kanniainen Moncef Gabbouj Alexandros Iosifidis Mid-price prediction based on machine learning methods with technical and quantitative indicators. PLoS ONE |
author_facet |
Adamantios Ntakaris Juho Kanniainen Moncef Gabbouj Alexandros Iosifidis |
author_sort |
Adamantios Ntakaris |
title |
Mid-price prediction based on machine learning methods with technical and quantitative indicators. |
title_short |
Mid-price prediction based on machine learning methods with technical and quantitative indicators. |
title_full |
Mid-price prediction based on machine learning methods with technical and quantitative indicators. |
title_fullStr |
Mid-price prediction based on machine learning methods with technical and quantitative indicators. |
title_full_unstemmed |
Mid-price prediction based on machine learning methods with technical and quantitative indicators. |
title_sort |
mid-price prediction based on machine learning methods with technical and quantitative indicators. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2020-01-01 |
description |
Stock price prediction is a challenging task, in which machine learning methods have recently been successfully used. In this paper, we extract over 270 hand-crafted features (factors) inspired by technical indicators and quantitative analysis and test their validity on short-term mid-price movement prediction for Nordic TotalView-ITCH stocks. The suggested feature list represents one of the most extensive studies in the field of financial feature engineering. We focus on a wrapper feature selection method using entropy, least-mean squares, and linear discriminant analysis. We also introduce a novel quantitative feature based on adaptive logistic regression for online learning. The proposed feature is consistently selected as the first feature among a large number of indicators used in this study. We further examine the best combinations of features using a high-frequency limit order book Nordic database. Our results suggest that sorting methods and classifiers can be used in such a way that one can reach the best classification performance with a combination of only a few advanced hand-crafted features. |
url |
https://doi.org/10.1371/journal.pone.0234107 |
work_keys_str_mv |
AT adamantiosntakaris midpricepredictionbasedonmachinelearningmethodswithtechnicalandquantitativeindicators AT juhokanniainen midpricepredictionbasedonmachinelearningmethodswithtechnicalandquantitativeindicators AT moncefgabbouj midpricepredictionbasedonmachinelearningmethodswithtechnicalandquantitativeindicators AT alexandrosiosifidis midpricepredictionbasedonmachinelearningmethodswithtechnicalandquantitativeindicators |
_version_ |
1714814661011963904 |