Critical Task Scheduling for Data Stream Computing on Heterogeneous Clouds
碩士 === 國立臺中教育大學 === 資訊工程學系 === 105 === Recently, Internet of Things is widely used in many different fields, such as logistics, medical care, science and urban development. Sensor data of these fields could be generated and collected through sensors. In the meantime, sensors created large and contin...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2017
|
Online Access: | http://ndltd.ncl.edu.tw/handle/p7fd7m |
id |
ndltd-TW-105NTCT0394012 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-105NTCT03940122019-05-15T23:24:51Z http://ndltd.ncl.edu.tw/handle/p7fd7m Critical Task Scheduling for Data Stream Computing on Heterogeneous Clouds 異質雲中資料串流計算之關鍵任務排程 KUO, YEN-HSUAN 郭晏瑄 碩士 國立臺中教育大學 資訊工程學系 105 Recently, Internet of Things is widely used in many different fields, such as logistics, medical care, science and urban development. Sensor data of these fields could be generated and collected through sensors. In the meantime, sensors created large and continuous data stream, which is called Big Data. In the application of Internet of things, the data collected through the sensor is continuously generated. This way of generating data is called Data Stream. Data stream processing usually requires a lot of calculation and immediate response, so how effective and immediate processing of large-scale streaming data is a new research topic. This study selects Apache Spark system to process data stream, this system is the use of In-Memory Computing calculation approach, eliminating the need for data access I/O time to speed up the data processing time. However, the scale of big data is too large, there is a lot of processing resources required in terms of calculation or storage. Cloud computing technology could be support enough computing capability. In Spark system, the stage execution order use FIFO algorithm to sort, the stage submit order is used as execution order. But the method not concern that the stage order change was affected the execution performance. In the worker node of the heterogeneous resources(offer), using Random approach to sort offer, but the offer of poor performance may be given priority to run job resulting in the impact of execution performance. So, this study proposes a Proactive Task Scheduling on Heterogeneous Resources (PTSHR) approach about these problem, the dependence of stages is generated by the profiling system, reorder the stages order and concern heterogeneous resources of the offers to sort, let better performance offer have priority to run task and tried to improve overall processing time. LAI, KUAN-CHOU 賴冠州 2017 學位論文 ; thesis 64 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺中教育大學 === 資訊工程學系 === 105 === Recently, Internet of Things is widely used in many different fields, such as logistics, medical care, science and urban development. Sensor data of these fields could be generated and collected through sensors. In the meantime, sensors created large and continuous data stream, which is called Big Data. In the application of Internet of things, the data collected through the sensor is continuously generated. This way of generating data is called Data Stream. Data stream processing usually requires a lot of calculation and immediate response, so how effective and immediate processing of large-scale streaming data is a new research topic.
This study selects Apache Spark system to process data stream, this system is the use of In-Memory Computing calculation approach, eliminating the need for data access I/O time to speed up the data processing time. However, the scale of big data is too large, there is a lot of processing resources required in terms of calculation or storage. Cloud computing technology could be support enough computing capability.
In Spark system, the stage execution order use FIFO algorithm to sort, the stage submit order is used as execution order. But the method not concern that the stage order change was affected the execution performance. In the worker node of the heterogeneous resources(offer), using Random approach to sort offer, but the offer of poor performance may be given priority to run job resulting in the impact of execution performance.
So, this study proposes a Proactive Task Scheduling on Heterogeneous Resources (PTSHR) approach about these problem, the dependence of stages is generated by the profiling system, reorder the stages order and concern heterogeneous resources of the offers to sort, let better performance offer have priority to run task and tried to improve overall processing time.
|
author2 |
LAI, KUAN-CHOU |
author_facet |
LAI, KUAN-CHOU KUO, YEN-HSUAN 郭晏瑄 |
author |
KUO, YEN-HSUAN 郭晏瑄 |
spellingShingle |
KUO, YEN-HSUAN 郭晏瑄 Critical Task Scheduling for Data Stream Computing on Heterogeneous Clouds |
author_sort |
KUO, YEN-HSUAN |
title |
Critical Task Scheduling for Data Stream Computing on Heterogeneous Clouds |
title_short |
Critical Task Scheduling for Data Stream Computing on Heterogeneous Clouds |
title_full |
Critical Task Scheduling for Data Stream Computing on Heterogeneous Clouds |
title_fullStr |
Critical Task Scheduling for Data Stream Computing on Heterogeneous Clouds |
title_full_unstemmed |
Critical Task Scheduling for Data Stream Computing on Heterogeneous Clouds |
title_sort |
critical task scheduling for data stream computing on heterogeneous clouds |
publishDate |
2017 |
url |
http://ndltd.ncl.edu.tw/handle/p7fd7m |
work_keys_str_mv |
AT kuoyenhsuan criticaltaskschedulingfordatastreamcomputingonheterogeneousclouds AT guōyànxuān criticaltaskschedulingfordatastreamcomputingonheterogeneousclouds AT kuoyenhsuan yìzhìyúnzhōngzīliàochuànliújìsuànzhīguānjiànrènwùpáichéng AT guōyànxuān yìzhìyúnzhōngzīliàochuànliújìsuànzhīguānjiànrènwùpáichéng |
_version_ |
1719147309168590848 |