Using Machine Learning to Manage Resources in Datacenters with Diverse Computing Requirements
碩士 === 國立清華大學 === 資訊工程學系所 === 106 === Apache Mesos has become a popular cluster resource management tool with the emergence of various new cluster computing applications, such as Big Data analytics and deep learning. Resource offer mechanism of Mesos gives framework schedulers the ability to choose...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2017
|
Online Access: | http://ndltd.ncl.edu.tw/handle/jrgj47 |
id |
ndltd-TW-106NTHU5392002 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-106NTHU53920022019-05-16T00:00:23Z http://ndltd.ncl.edu.tw/handle/jrgj47 Using Machine Learning to Manage Resources in Datacenters with Diverse Computing Requirements 運用機器學習解決資料中心多樣化計算要求的資源管理問題 Lee, Chin-Feng 李青峰 碩士 國立清華大學 資訊工程學系所 106 Apache Mesos has become a popular cluster resource management tool with the emergence of various new cluster computing applications, such as Big Data analytics and deep learning. Resource offer mechanism of Mesos gives framework schedulers the ability to choose the best resources based on their own constraints and preferences. The default hierarchical DRF allocator gives near optimal results for simple task placement preferences and resource requirements under large resource pool running mostly short-living jobs. However, if these properties do not hold, higher offer rejection rate is expected, which leads to degraded overall performance. Moreover, in scenarios where the overall system throughput is the main concern, improving allocator has more chance for optimization instead of passively waiting for desirable resource offer to be given to frameworks. Therefore, we propose to use machine learning techniques to improve offer quality. We consider the problem of actively improving the quality of resource offers with limited information and interactions to users. In this work, we propose a quality-aware allocator with a pre-defined quality function for optimizing job execution time. In addition, we implemented an emulation environment to evaluate the performance of proposed allocator under various synthetic batch-processing workloads. Our evaluation shows up to 2x improvement in total completion time, 33% higher residual capacity, 46% less rejection rate and 70% better allocation placement with data locality. Chou, Jerry 周志遠 2017 學位論文 ; thesis 32 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立清華大學 === 資訊工程學系所 === 106 === Apache Mesos has become a popular cluster resource management tool with the emergence of various new cluster computing applications, such as Big Data analytics and deep learning. Resource offer mechanism of Mesos gives framework schedulers the ability to choose the best resources based on their own constraints and preferences. The default hierarchical DRF allocator gives near optimal results for simple task placement preferences and resource requirements under large resource pool running mostly short-living jobs. However, if these properties do not hold, higher offer rejection rate is expected, which leads to degraded overall performance. Moreover, in scenarios where the overall system throughput is the main concern, improving allocator has more chance for optimization instead of passively waiting for desirable resource offer to be given to frameworks.
Therefore, we propose to use machine learning techniques to improve offer quality. We consider the problem of actively improving the quality of resource offers with limited information and interactions to users.
In this work, we propose a quality-aware allocator with a pre-defined quality function for optimizing job execution time. In addition, we implemented an emulation environment to evaluate the performance of proposed allocator under various synthetic batch-processing workloads.
Our evaluation shows up to 2x improvement in total completion time, 33% higher residual capacity, 46% less rejection rate and 70% better allocation placement with data locality.
|
author2 |
Chou, Jerry |
author_facet |
Chou, Jerry Lee, Chin-Feng 李青峰 |
author |
Lee, Chin-Feng 李青峰 |
spellingShingle |
Lee, Chin-Feng 李青峰 Using Machine Learning to Manage Resources in Datacenters with Diverse Computing Requirements |
author_sort |
Lee, Chin-Feng |
title |
Using Machine Learning to Manage Resources in Datacenters with Diverse Computing Requirements |
title_short |
Using Machine Learning to Manage Resources in Datacenters with Diverse Computing Requirements |
title_full |
Using Machine Learning to Manage Resources in Datacenters with Diverse Computing Requirements |
title_fullStr |
Using Machine Learning to Manage Resources in Datacenters with Diverse Computing Requirements |
title_full_unstemmed |
Using Machine Learning to Manage Resources in Datacenters with Diverse Computing Requirements |
title_sort |
using machine learning to manage resources in datacenters with diverse computing requirements |
publishDate |
2017 |
url |
http://ndltd.ncl.edu.tw/handle/jrgj47 |
work_keys_str_mv |
AT leechinfeng usingmachinelearningtomanageresourcesindatacenterswithdiversecomputingrequirements AT lǐqīngfēng usingmachinelearningtomanageresourcesindatacenterswithdiversecomputingrequirements AT leechinfeng yùnyòngjīqìxuéxíjiějuézīliàozhōngxīnduōyànghuàjìsuànyàoqiúdezīyuánguǎnlǐwèntí AT lǐqīngfēng yùnyòngjīqìxuéxíjiějuézīliàozhōngxīnduōyànghuàjìsuànyàoqiúdezīyuánguǎnlǐwèntí |
_version_ |
1719158164448870400 |