Critical Task Scheduling for Data Stream Computing on Heterogeneous Clouds

碩士 === 國立臺中教育大學 === 資訊工程學系 === 105 === Recently, Internet of Things is widely used in many different fields, such as logistics, medical care, science and urban development. Sensor data of these fields could be generated and collected through sensors. In the meantime, sensors created large and contin...

Full description

Bibliographic Details
Main Authors: KUO, YEN-HSUAN, 郭晏瑄
Other Authors: LAI, KUAN-CHOU
Format: Others
Language:zh-TW
Published: 2017
Online Access:http://ndltd.ncl.edu.tw/handle/p7fd7m
id ndltd-TW-105NTCT0394012
record_format oai_dc
spelling ndltd-TW-105NTCT03940122019-05-15T23:24:51Z http://ndltd.ncl.edu.tw/handle/p7fd7m Critical Task Scheduling for Data Stream Computing on Heterogeneous Clouds 異質雲中資料串流計算之關鍵任務排程 KUO, YEN-HSUAN 郭晏瑄 碩士 國立臺中教育大學 資訊工程學系 105 Recently, Internet of Things is widely used in many different fields, such as logistics, medical care, science and urban development. Sensor data of these fields could be generated and collected through sensors. In the meantime, sensors created large and continuous data stream, which is called Big Data. In the application of Internet of things, the data collected through the sensor is continuously generated. This way of generating data is called Data Stream. Data stream processing usually requires a lot of calculation and immediate response, so how effective and immediate processing of large-scale streaming data is a new research topic. This study selects Apache Spark system to process data stream, this system is the use of In-Memory Computing calculation approach, eliminating the need for data access I/O time to speed up the data processing time. However, the scale of big data is too large, there is a lot of processing resources required in terms of calculation or storage. Cloud computing technology could be support enough computing capability. In Spark system, the stage execution order use FIFO algorithm to sort, the stage submit order is used as execution order. But the method not concern that the stage order change was affected the execution performance. In the worker node of the heterogeneous resources(offer), using Random approach to sort offer, but the offer of poor performance may be given priority to run job resulting in the impact of execution performance. So, this study proposes a Proactive Task Scheduling on Heterogeneous Resources (PTSHR) approach about these problem, the dependence of stages is generated by the profiling system, reorder the stages order and concern heterogeneous resources of the offers to sort, let better performance offer have priority to run task and tried to improve overall processing time. LAI, KUAN-CHOU 賴冠州 2017 學位論文 ; thesis 64 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺中教育大學 === 資訊工程學系 === 105 === Recently, Internet of Things is widely used in many different fields, such as logistics, medical care, science and urban development. Sensor data of these fields could be generated and collected through sensors. In the meantime, sensors created large and continuous data stream, which is called Big Data. In the application of Internet of things, the data collected through the sensor is continuously generated. This way of generating data is called Data Stream. Data stream processing usually requires a lot of calculation and immediate response, so how effective and immediate processing of large-scale streaming data is a new research topic. This study selects Apache Spark system to process data stream, this system is the use of In-Memory Computing calculation approach, eliminating the need for data access I/O time to speed up the data processing time. However, the scale of big data is too large, there is a lot of processing resources required in terms of calculation or storage. Cloud computing technology could be support enough computing capability. In Spark system, the stage execution order use FIFO algorithm to sort, the stage submit order is used as execution order. But the method not concern that the stage order change was affected the execution performance. In the worker node of the heterogeneous resources(offer), using Random approach to sort offer, but the offer of poor performance may be given priority to run job resulting in the impact of execution performance. So, this study proposes a Proactive Task Scheduling on Heterogeneous Resources (PTSHR) approach about these problem, the dependence of stages is generated by the profiling system, reorder the stages order and concern heterogeneous resources of the offers to sort, let better performance offer have priority to run task and tried to improve overall processing time.
author2 LAI, KUAN-CHOU
author_facet LAI, KUAN-CHOU
KUO, YEN-HSUAN
郭晏瑄
author KUO, YEN-HSUAN
郭晏瑄
spellingShingle KUO, YEN-HSUAN
郭晏瑄
Critical Task Scheduling for Data Stream Computing on Heterogeneous Clouds
author_sort KUO, YEN-HSUAN
title Critical Task Scheduling for Data Stream Computing on Heterogeneous Clouds
title_short Critical Task Scheduling for Data Stream Computing on Heterogeneous Clouds
title_full Critical Task Scheduling for Data Stream Computing on Heterogeneous Clouds
title_fullStr Critical Task Scheduling for Data Stream Computing on Heterogeneous Clouds
title_full_unstemmed Critical Task Scheduling for Data Stream Computing on Heterogeneous Clouds
title_sort critical task scheduling for data stream computing on heterogeneous clouds
publishDate 2017
url http://ndltd.ncl.edu.tw/handle/p7fd7m
work_keys_str_mv AT kuoyenhsuan criticaltaskschedulingfordatastreamcomputingonheterogeneousclouds
AT guōyànxuān criticaltaskschedulingfordatastreamcomputingonheterogeneousclouds
AT kuoyenhsuan yìzhìyúnzhōngzīliàochuànliújìsuànzhīguānjiànrènwùpáichéng
AT guōyànxuān yìzhìyúnzhōngzīliàochuànliújìsuànzhīguānjiànrènwùpáichéng
_version_ 1719147309168590848