Decision Tree Classification and Forecasting of Pricing Time Series Data

Many companies today, in different fields of operations and sizes, have access to a vast amount of data which was not available only a couple of years ago. This situation gives rise to questions regarding how to organize and use the data in the best way possible. In this thesis a large database of p...

Full description

Bibliographic Details
Main Author: Lundkvist, Emil
Format: Others
Language:English
Published: KTH, Reglerteknik 2014
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-151017
id ndltd-UPSALLA1-oai-DiVA.org-kth-151017
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-kth-1510172014-09-13T04:46:57ZDecision Tree Classification and Forecasting of Pricing Time Series DataengLundkvist, EmilKTH, Reglerteknik2014Many companies today, in different fields of operations and sizes, have access to a vast amount of data which was not available only a couple of years ago. This situation gives rise to questions regarding how to organize and use the data in the best way possible. In this thesis a large database of pricing data for products within various market segments is analysed. The pricing data is from both external and internal sources and is therefore confidential. Because of the confidentiality, the labels from the database are in this thesis substituted with generic ones and the company is not referred to by name, but the analysis is carried out on the real data set. The data is from the beginning unstructured and difficult to overlook. Therefore, it is first classified. This is performed by feeding some manual training data into an algorithm which builds a decision tree. The decision tree is used to divide the rest of the products in the database into classes. Then, for each class, a multivariate time series model is built and each product’s future price within the class can be predicted. In order to interact with the classification and price prediction, a front end is also developed. The results show that the classification algorithm both is fast enough to operate in real time and performs well. The time series analysis shows that it is possible to use the information within each class to do predictions, and a simple vector autoregressive model used to perform it shows good predictive results. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-151017application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
description Many companies today, in different fields of operations and sizes, have access to a vast amount of data which was not available only a couple of years ago. This situation gives rise to questions regarding how to organize and use the data in the best way possible. In this thesis a large database of pricing data for products within various market segments is analysed. The pricing data is from both external and internal sources and is therefore confidential. Because of the confidentiality, the labels from the database are in this thesis substituted with generic ones and the company is not referred to by name, but the analysis is carried out on the real data set. The data is from the beginning unstructured and difficult to overlook. Therefore, it is first classified. This is performed by feeding some manual training data into an algorithm which builds a decision tree. The decision tree is used to divide the rest of the products in the database into classes. Then, for each class, a multivariate time series model is built and each product’s future price within the class can be predicted. In order to interact with the classification and price prediction, a front end is also developed. The results show that the classification algorithm both is fast enough to operate in real time and performs well. The time series analysis shows that it is possible to use the information within each class to do predictions, and a simple vector autoregressive model used to perform it shows good predictive results.
author Lundkvist, Emil
spellingShingle Lundkvist, Emil
Decision Tree Classification and Forecasting of Pricing Time Series Data
author_facet Lundkvist, Emil
author_sort Lundkvist, Emil
title Decision Tree Classification and Forecasting of Pricing Time Series Data
title_short Decision Tree Classification and Forecasting of Pricing Time Series Data
title_full Decision Tree Classification and Forecasting of Pricing Time Series Data
title_fullStr Decision Tree Classification and Forecasting of Pricing Time Series Data
title_full_unstemmed Decision Tree Classification and Forecasting of Pricing Time Series Data
title_sort decision tree classification and forecasting of pricing time series data
publisher KTH, Reglerteknik
publishDate 2014
url http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-151017
work_keys_str_mv AT lundkvistemil decisiontreeclassificationandforecastingofpricingtimeseriesdata
_version_ 1716714307518988288