Performance Prediction for Enabling Intelligent Resource Management on Big Data Processing Workflows

Mobile cloud computing offers an augmented infrastructure that allows resource-constrained devices to use remote computational resources as an enabler for highly intensive computation, thus improving end users experience. Being able to efficiently manage cloud elasticity represents a big challenge f...

Full description

Bibliographic Details
Main Author: Obeso Duque, Aleksandra
Format: Others
Language:English
Published: Uppsala universitet, Institutionen för informationsteknologi 2018
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-372178
id ndltd-UPSALLA1-oai-DiVA.org-uu-372178
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-uu-3721782019-01-07T05:54:06ZPerformance Prediction for Enabling Intelligent Resource Management on Big Data Processing WorkflowsengObeso Duque, AleksandraUppsala universitet, Institutionen för informationsteknologi2018Engineering and TechnologyTeknik och teknologierMobile cloud computing offers an augmented infrastructure that allows resource-constrained devices to use remote computational resources as an enabler for highly intensive computation, thus improving end users experience. Being able to efficiently manage cloud elasticity represents a big challenge for dynamic resource scaling on-demand. In this sense, the development of intelligent tools that could ease the understanding of the behavior of a highly dynamic system and to detect resource bottlenecks given certain service level constrains represents an interesting case of study. In this project, a comparative study has been carried out for different distributed services taking into account the tools that are available for load generation, benchmarking and sensing of key performance indicators. Based on that, the big data processing framework Hadoop Mapreduce, has been deployed as a virtualized service on top of a distributed environment. Experiments for different cluster setups using different benchmarks have been conducted on this testbed in order to collect traces for both resource usage statistics at the infrastructure level and performance metrics at the platform level. Different machine learning approaches have been applied on the collected traces, thus generating prediction and classification models whose performance is then evaluated and compared. The highly accurate results, namely a Normalized Mean Absolute Error below 10.3% for the regressor and an accuracy score above 99.9% for the classifier, show the feasibility of the prediction models generated for service performance prediction and resource bottleneck detection that could be further used to trigger auto-scaling processes on cloud environments under dynamic loads in order to fulfill service level requirements. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-372178IT ; 18056application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic Engineering and Technology
Teknik och teknologier
spellingShingle Engineering and Technology
Teknik och teknologier
Obeso Duque, Aleksandra
Performance Prediction for Enabling Intelligent Resource Management on Big Data Processing Workflows
description Mobile cloud computing offers an augmented infrastructure that allows resource-constrained devices to use remote computational resources as an enabler for highly intensive computation, thus improving end users experience. Being able to efficiently manage cloud elasticity represents a big challenge for dynamic resource scaling on-demand. In this sense, the development of intelligent tools that could ease the understanding of the behavior of a highly dynamic system and to detect resource bottlenecks given certain service level constrains represents an interesting case of study. In this project, a comparative study has been carried out for different distributed services taking into account the tools that are available for load generation, benchmarking and sensing of key performance indicators. Based on that, the big data processing framework Hadoop Mapreduce, has been deployed as a virtualized service on top of a distributed environment. Experiments for different cluster setups using different benchmarks have been conducted on this testbed in order to collect traces for both resource usage statistics at the infrastructure level and performance metrics at the platform level. Different machine learning approaches have been applied on the collected traces, thus generating prediction and classification models whose performance is then evaluated and compared. The highly accurate results, namely a Normalized Mean Absolute Error below 10.3% for the regressor and an accuracy score above 99.9% for the classifier, show the feasibility of the prediction models generated for service performance prediction and resource bottleneck detection that could be further used to trigger auto-scaling processes on cloud environments under dynamic loads in order to fulfill service level requirements.
author Obeso Duque, Aleksandra
author_facet Obeso Duque, Aleksandra
author_sort Obeso Duque, Aleksandra
title Performance Prediction for Enabling Intelligent Resource Management on Big Data Processing Workflows
title_short Performance Prediction for Enabling Intelligent Resource Management on Big Data Processing Workflows
title_full Performance Prediction for Enabling Intelligent Resource Management on Big Data Processing Workflows
title_fullStr Performance Prediction for Enabling Intelligent Resource Management on Big Data Processing Workflows
title_full_unstemmed Performance Prediction for Enabling Intelligent Resource Management on Big Data Processing Workflows
title_sort performance prediction for enabling intelligent resource management on big data processing workflows
publisher Uppsala universitet, Institutionen för informationsteknologi
publishDate 2018
url http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-372178
work_keys_str_mv AT obesoduquealeksandra performancepredictionforenablingintelligentresourcemanagementonbigdataprocessingworkflows
_version_ 1718806646168223744