Automatic Pipelining and Vectorization of Scientific Code for FPGAs

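There is a large body of legacy scientific code in use today that could benefit from execution on accelerator devices like GPUs and FPGAs. Manual translation of such legacy code into device-specific parallel code requires significant effort and is a major obstacle to wider FPGA adoption. We are developing an automated optimizing compiler, TyTra, to overcome this obstacle. The TyTra flow aims to compile legacy Fortran code automatically for FPGA-based acceleration while applying suitable optimizations. We present the flow with a focus on two key optimizations: automatic pipelining and vectorization. Our compiler frontend extracts patterns from legacy Fortran code that can be pipelined and vectorized. The backend first creates fine- and coarse-grained pipelines and then automatically vectorizes both the memory access and the datapath based on a cost model, generating an OpenCL-HDL hybrid working solution for FPGA targets on the Amazon cloud. Our results show up to 4.2× performance improvement over baseline OpenCL code.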

Bibliographic Details
Main Authors:	Syed Waqar Nabi, Wim Vanderbauwhede
Format:	Article
Language:	English
Published:	Hindawi Limited 2019-01-01
Series:	International Journal of Reconfigurable Computing
Online Access:	http://dx.doi.org/10.1155/2019/7348013

id	doaj-50e03b9b7115497eb5d8c89dbf6d5a93
record_format	Article
spelling	doaj-50e03b9b7115497eb5d8c89dbf6d5a93; 2020-11-25T01:20:37Z; eng; Hindawi Limited; International Journal of Reconfigurable Computing; 1687-7195; 1687-7209; 2019-01-01; 2019; 10.1155/2019/7348013; 7348013; Automatic Pipelining and Vectorization of Scientific Code for FPGAs; Syed Waqar Nabi (0); Wim Vanderbauwhede (1); (0) School of Computing Science, University of Glasgow, Glasgow, UK; (1) School of Computing Science, University of Glasgow, Glasgow, UK; There is a large body of legacy scientific code in use today that could benefit from execution on accelerator devices like GPUs and FPGAs. Manual translation of such legacy code into device-specific parallel code requires significant manual effort and is a major obstacle to wider FPGA adoption. We are developing an automated optimizing compiler TyTra to overcome this obstacle. The TyTra flow aims to compile legacy Fortran code automatically for FPGA-based acceleration, while applying suitable optimizations. We present the flow with a focus on two key optimizations, automatic pipelining and vectorization. Our compiler frontend extracts patterns from legacy Fortran code that can be pipelined and vectorized. The backend first creates fine and coarse-grained pipelines and then automatically vectorizes both the memory access and the datapath based on a cost model, generating an OpenCL-HDL hybrid working solution for FPGA targets on the Amazon cloud. Our results show up to 4.2× performance improvement over baseline OpenCL code.; http://dx.doi.org/10.1155/2019/7348013
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Syed Waqar Nabi Wim Vanderbauwhede
spellingShingle	Syed Waqar Nabi Wim Vanderbauwhede Automatic Pipelining and Vectorization of Scientific Code for FPGAs International Journal of Reconfigurable Computing
author_facet	Syed Waqar Nabi Wim Vanderbauwhede
author_sort	Syed Waqar Nabi
title	Automatic Pipelining and Vectorization of Scientific Code for FPGAs
title_short	Automatic Pipelining and Vectorization of Scientific Code for FPGAs
title_full	Automatic Pipelining and Vectorization of Scientific Code for FPGAs
title_fullStr	Automatic Pipelining and Vectorization of Scientific Code for FPGAs
title_full_unstemmed	Automatic Pipelining and Vectorization of Scientific Code for FPGAs
title_sort	automatic pipelining and vectorization of scientific code for fpgas
publisher	Hindawi Limited
series	International Journal of Reconfigurable Computing
issn	1687-7195 1687-7209
publishDate	2019-01-01
description	There is a large body of legacy scientific code in use today that could benefit from execution on accelerator devices like GPUs and FPGAs. Manual translation of such legacy code into device-specific parallel code requires significant manual effort and is a major obstacle to wider FPGA adoption. We are developing an automated optimizing compiler TyTra to overcome this obstacle. The TyTra flow aims to compile legacy Fortran code automatically for FPGA-based acceleration, while applying suitable optimizations. We present the flow with a focus on two key optimizations, automatic pipelining and vectorization. Our compiler frontend extracts patterns from legacy Fortran code that can be pipelined and vectorized. The backend first creates fine and coarse-grained pipelines and then automatically vectorizes both the memory access and the datapath based on a cost model, generating an OpenCL-HDL hybrid working solution for FPGA targets on the Amazon cloud. Our results show up to 4.2× performance improvement over baseline OpenCL code.
url	http://dx.doi.org/10.1155/2019/7348013
work_keys_str_mv	AT syedwaqarnabi automaticpipeliningandvectorizationofscientificcodeforfpgas AT wimvanderbauwhede automaticpipeliningandvectorizationofscientificcodeforfpgas
_version_	1725133123318972416

Similar Items