Mid-price prediction based on machine learning methods with technical and quantitative indicators.

Stock price prediction is a challenging task, in which machine learning methods have recently been successfully used. In this paper, we extract over 270 hand-crafted features (factors) inspired by technical indicators and quantitative analysis and test their validity on short-term mid-price movement...

Full description

Bibliographic Details
Main Authors: Adamantios Ntakaris, Juho Kanniainen, Moncef Gabbouj, Alexandros Iosifidis
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0234107
id doaj-55a519a4b9a144e59ba5a2b2747395d5
record_format Article
spelling doaj-55a519a4b9a144e59ba5a2b2747395d52021-03-03T21:51:51ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-01156e023410710.1371/journal.pone.0234107Mid-price prediction based on machine learning methods with technical and quantitative indicators.Adamantios NtakarisJuho KanniainenMoncef GabboujAlexandros IosifidisStock price prediction is a challenging task, in which machine learning methods have recently been successfully used. In this paper, we extract over 270 hand-crafted features (factors) inspired by technical indicators and quantitative analysis and test their validity on short-term mid-price movement prediction for Nordic TotalView-ITCH stocks. The suggested feature list represents one of the most extensive studies in the field of financial feature engineering. We focus on a wrapper feature selection method using entropy, least-mean squares, and linear discriminant analysis. We also introduce a novel quantitative feature based on adaptive logistic regression for online learning. The proposed feature is consistently selected as the first feature among a large number of indicators used in this study. We further examine the best combinations of features using a high-frequency limit order book Nordic database. Our results suggest that sorting methods and classifiers can be used in such a way that one can reach the best classification performance with a combination of only a few advanced hand-crafted features.https://doi.org/10.1371/journal.pone.0234107
collection DOAJ
language English
format Article
sources DOAJ
author Adamantios Ntakaris
Juho Kanniainen
Moncef Gabbouj
Alexandros Iosifidis
spellingShingle Adamantios Ntakaris
Juho Kanniainen
Moncef Gabbouj
Alexandros Iosifidis
Mid-price prediction based on machine learning methods with technical and quantitative indicators.
PLoS ONE
author_facet Adamantios Ntakaris
Juho Kanniainen
Moncef Gabbouj
Alexandros Iosifidis
author_sort Adamantios Ntakaris
title Mid-price prediction based on machine learning methods with technical and quantitative indicators.
title_short Mid-price prediction based on machine learning methods with technical and quantitative indicators.
title_full Mid-price prediction based on machine learning methods with technical and quantitative indicators.
title_fullStr Mid-price prediction based on machine learning methods with technical and quantitative indicators.
title_full_unstemmed Mid-price prediction based on machine learning methods with technical and quantitative indicators.
title_sort mid-price prediction based on machine learning methods with technical and quantitative indicators.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2020-01-01
description Stock price prediction is a challenging task, in which machine learning methods have recently been successfully used. In this paper, we extract over 270 hand-crafted features (factors) inspired by technical indicators and quantitative analysis and test their validity on short-term mid-price movement prediction for Nordic TotalView-ITCH stocks. The suggested feature list represents one of the most extensive studies in the field of financial feature engineering. We focus on a wrapper feature selection method using entropy, least-mean squares, and linear discriminant analysis. We also introduce a novel quantitative feature based on adaptive logistic regression for online learning. The proposed feature is consistently selected as the first feature among a large number of indicators used in this study. We further examine the best combinations of features using a high-frequency limit order book Nordic database. Our results suggest that sorting methods and classifiers can be used in such a way that one can reach the best classification performance with a combination of only a few advanced hand-crafted features.
url https://doi.org/10.1371/journal.pone.0234107
work_keys_str_mv AT adamantiosntakaris midpricepredictionbasedonmachinelearningmethodswithtechnicalandquantitativeindicators
AT juhokanniainen midpricepredictionbasedonmachinelearningmethodswithtechnicalandquantitativeindicators
AT moncefgabbouj midpricepredictionbasedonmachinelearningmethodswithtechnicalandquantitativeindicators
AT alexandrosiosifidis midpricepredictionbasedonmachinelearningmethodswithtechnicalandquantitativeindicators
_version_ 1714814661011963904