Modeling of fuel consumption in a road network

The fuel consumption accounts for a large portion of operational cost for logistics companies and hence building an accurate fuel prediction model is of the key importance. Recently machine learning methods have been widely used in this area, and data like historical GPS data, road conditions, weath...

Full description

Bibliographic Details
Main Author:	Chen, Zehua
Format:	Others
Language:	English
Published:	KTH, Skolan för elektroteknik och datavetenskap (EECS) 2020
Subjects:	Computer and Information Sciences Data- och informationsvetenskap
Online Access:	http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-281955

id	ndltd-UPSALLA1-oai-DiVA.org-kth-281955
record_format	oai_dc
spelling	ndltd-UPSALLA1-oai-DiVA.org-kth-2819552020-09-30T05:49:08ZModeling of fuel consumption in a road networkengChen, ZehuaKTH, Skolan för elektroteknik och datavetenskap (EECS)2020Computer and Information SciencesData- och informationsvetenskapThe fuel consumption accounts for a large portion of operational cost for logistics companies and hence building an accurate fuel prediction model is of the key importance. Recently machine learning methods have been widely used in this area, and data like historical GPS data, road conditions, weather conditions are involved when building such models. This study aims at investigating the possibility to replace the road condition features with an index that is constructed by aggregating the data collected and maintained by Scania. Two normalization methods are used for building fuel consumption index, some commonly used models including Support Vector Machine, Random Forest and Gradient Boosted Machines are trained and evaluated both with and without it. The experimental results show that the Random Forest model outperforms the others in most cases. By comparing the results with the previous studies, we can see that replacing road conditions by a fuel consumption index can lead to almost the same performance of machine learning models. To guarantee the reliability of this index, approximately 4000 to 5000 samples are expected for each road segment, this is however not realistic for many of them. When predicting fuel consumption for a given route, the more road segments with adequate samples it contains, the higher predictive accuracy we can expect. För logistikföretag står bränsleförbrukningen för en stor del av driftskostnaden och därför är det viktigt att bygga en exakt modell för bränsleprognos. Under senare år har maskininlärningsmetoder använts i stor utsträckning inom detta område, och data som historiska GPS-positioner, vägförhållanden och väderförhållanden är involverade när man bygger dessa modeller. Denna studie syftar till att undersöka möjligheten att ersätta karrakteristika beskrivande vägtillståndet med ett index som är konstruerat genom att använda de uppgifter som samlas in och underhålls av Scania. Två normaliseringsmetoder används för att bygga bränsleförbrukningsindex. En uppsättning vanliga modeller inkluderande supportvektormaskinen, Random Forest och Gradient Boosted Machines tränas och utvärderas både med och utan normalisering. De experimentella resultaten visar att Random Forest-modellen överträffar de andra i de flesta fall. Genom att jämföra resultaten med tidigare studier kan vi se att byte av vägförhållandekarrakteristika mot ett bränsleförbrukningsindex kan leda till nästan samma prestanda som maskininlärningsmodeller. För att garantera tillförlitligheten för detta index behövs cirka 4000 till 5000 prover för varje vägsegment, men detta är dock inte realistiskt för många av segmenten. När vi predicerar bränsleförbrukningen för en given rutt blir förväntad prediktionsnoggranhet högre desto fler vägsegment med adekvata dataprover den innehåller. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-281955TRITA-EECS-EX ; 2020:618application/pdfinfo:eu-repo/semantics/openAccess
collection	NDLTD
language	English
format	Others
sources	NDLTD
topic	Computer and Information Sciences Data- och informationsvetenskap
spellingShingle	Computer and Information Sciences Data- och informationsvetenskap Chen, Zehua Modeling of fuel consumption in a road network
description	The fuel consumption accounts for a large portion of operational cost for logistics companies and hence building an accurate fuel prediction model is of the key importance. Recently machine learning methods have been widely used in this area, and data like historical GPS data, road conditions, weather conditions are involved when building such models. This study aims at investigating the possibility to replace the road condition features with an index that is constructed by aggregating the data collected and maintained by Scania. Two normalization methods are used for building fuel consumption index, some commonly used models including Support Vector Machine, Random Forest and Gradient Boosted Machines are trained and evaluated both with and without it. The experimental results show that the Random Forest model outperforms the others in most cases. By comparing the results with the previous studies, we can see that replacing road conditions by a fuel consumption index can lead to almost the same performance of machine learning models. To guarantee the reliability of this index, approximately 4000 to 5000 samples are expected for each road segment, this is however not realistic for many of them. When predicting fuel consumption for a given route, the more road segments with adequate samples it contains, the higher predictive accuracy we can expect. === För logistikföretag står bränsleförbrukningen för en stor del av driftskostnaden och därför är det viktigt att bygga en exakt modell för bränsleprognos. Under senare år har maskininlärningsmetoder använts i stor utsträckning inom detta område, och data som historiska GPS-positioner, vägförhållanden och väderförhållanden är involverade när man bygger dessa modeller. Denna studie syftar till att undersöka möjligheten att ersätta karrakteristika beskrivande vägtillståndet med ett index som är konstruerat genom att använda de uppgifter som samlas in och underhålls av Scania. Två normaliseringsmetoder används för att bygga bränsleförbrukningsindex. En uppsättning vanliga modeller inkluderande supportvektormaskinen, Random Forest och Gradient Boosted Machines tränas och utvärderas både med och utan normalisering. De experimentella resultaten visar att Random Forest-modellen överträffar de andra i de flesta fall. Genom att jämföra resultaten med tidigare studier kan vi se att byte av vägförhållandekarrakteristika mot ett bränsleförbrukningsindex kan leda till nästan samma prestanda som maskininlärningsmodeller. För att garantera tillförlitligheten för detta index behövs cirka 4000 till 5000 prover för varje vägsegment, men detta är dock inte realistiskt för många av segmenten. När vi predicerar bränsleförbrukningen för en given rutt blir förväntad prediktionsnoggranhet högre desto fler vägsegment med adekvata dataprover den innehåller.
author	Chen, Zehua
author_facet	Chen, Zehua
author_sort	Chen, Zehua
title	Modeling of fuel consumption in a road network
title_short	Modeling of fuel consumption in a road network
title_full	Modeling of fuel consumption in a road network
title_fullStr	Modeling of fuel consumption in a road network
title_full_unstemmed	Modeling of fuel consumption in a road network
title_sort	modeling of fuel consumption in a road network
publisher	KTH, Skolan för elektroteknik och datavetenskap (EECS)
publishDate	2020
url	http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-281955
work_keys_str_mv	AT chenzehua modelingoffuelconsumptioninaroadnetwork
_version_	1719346900996456448

Modeling of fuel consumption in a road network

Similar Items