Adaptive Heterogeneous Computing using Runtime Dispatching and Translation Techniques

碩士 === 國立交通大學 === 資訊科學與工程研究所 === 101 === This thesis is to quip adaptability for heterogeneous computing in OpenCL framework. The implementation is that first we obtain the statuses of devices at runtime and then determine on which device kernels execute. If the kernel is dispatched into CPU, in ord...

Full description

Bibliographic Details
Main Authors:	Tsai, Yi-Pu, 蔡怡璞
Other Authors:	Hsu, Wei-Chung
Format:	Others
Language:	en_US
Published:	2013
Online Access:	http://ndltd.ncl.edu.tw/handle/36023886441426475476

id	ndltd-TW-101NCTU5394153
record_format	oai_dc
spelling	ndltd-TW-101NCTU53941532016-07-02T04:20:29Z http://ndltd.ncl.edu.tw/handle/36023886441426475476 Adaptive Heterogeneous Computing using Runtime Dispatching and Translation Techniques 使用執行期配置與轉譯技術達成可適性異質多核心計算 Tsai, Yi-Pu 蔡怡璞碩士國立交通大學資訊科學與工程研究所 101 This thesis is to quip adaptability for heterogeneous computing in OpenCL framework. The implementation is that first we obtain the statuses of devices at runtime and then determine on which device kernels execute. If the kernel is dispatched into CPU, in order to enhance the performance on CPU, it will be translated to SIMD instructions by the translation techniques provided by Whole Function Vectorization. Moreover, for the sake of more intelligent dispatch, we utilized the profile history to predict the loads of devices. With the improvement, experiments indicate that the average improvement of execution time is 65% on the samples provided by AMD APP SDK. Hsu, Wei-Chung 徐慰中 2013 學位論文 ; thesis 28 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	碩士 === 國立交通大學 === 資訊科學與工程研究所 === 101 === This thesis is to quip adaptability for heterogeneous computing in OpenCL framework. The implementation is that first we obtain the statuses of devices at runtime and then determine on which device kernels execute. If the kernel is dispatched into CPU, in order to enhance the performance on CPU, it will be translated to SIMD instructions by the translation techniques provided by Whole Function Vectorization. Moreover, for the sake of more intelligent dispatch, we utilized the profile history to predict the loads of devices. With the improvement, experiments indicate that the average improvement of execution time is 65% on the samples provided by AMD APP SDK.
author2	Hsu, Wei-Chung
author_facet	Hsu, Wei-Chung Tsai, Yi-Pu 蔡怡璞
author	Tsai, Yi-Pu 蔡怡璞
spellingShingle	Tsai, Yi-Pu 蔡怡璞 Adaptive Heterogeneous Computing using Runtime Dispatching and Translation Techniques
author_sort	Tsai, Yi-Pu
title	Adaptive Heterogeneous Computing using Runtime Dispatching and Translation Techniques
title_short	Adaptive Heterogeneous Computing using Runtime Dispatching and Translation Techniques
title_full	Adaptive Heterogeneous Computing using Runtime Dispatching and Translation Techniques
title_fullStr	Adaptive Heterogeneous Computing using Runtime Dispatching and Translation Techniques
title_full_unstemmed	Adaptive Heterogeneous Computing using Runtime Dispatching and Translation Techniques
title_sort	adaptive heterogeneous computing using runtime dispatching and translation techniques
publishDate	2013
url	http://ndltd.ncl.edu.tw/handle/36023886441426475476
work_keys_str_mv	AT tsaiyipu adaptiveheterogeneouscomputingusingruntimedispatchingandtranslationtechniques AT càiyípú adaptiveheterogeneouscomputingusingruntimedispatchingandtranslationtechniques AT tsaiyipu shǐyòngzhíxíngqīpèizhìyǔzhuǎnyìjìshùdáchéngkěshìxìngyìzhìduōhéxīnjìsuàn AT càiyípú shǐyòngzhíxíngqīpèizhìyǔzhuǎnyìjìshùdáchéngkěshìxìngyìzhìduōhéxīnjìsuàn
_version_	1718331500284346368

Adaptive Heterogeneous Computing using Runtime Dispatching and Translation Techniques

Similar Items