Performance Analysis of a Scalable Algorithm for 3D Linear Transforms on Supercomputer with Intel Processors/Co-Processors

Practical realizations of 3D forward/inverse separable discrete transforms, such as Fourier transform, cosine/sine transform, etc. are frequently the principal limiters that prevent many practical applications from scaling to a large number of processors. Existing approaches, which are based primari...

Full description

Bibliographic Details
Published in:Cybernetics and Information Technologies
Main Author: Lirkov Ivan
Format: Article
Language:English
Published: Sciendo 2020-12-01
Subjects:
Online Access:https://doi.org/10.2478/cait-2020-0064
_version_ 1852790098541674496
author Lirkov Ivan
author_facet Lirkov Ivan
author_sort Lirkov Ivan
collection DOAJ
container_title Cybernetics and Information Technologies
description Practical realizations of 3D forward/inverse separable discrete transforms, such as Fourier transform, cosine/sine transform, etc. are frequently the principal limiters that prevent many practical applications from scaling to a large number of processors. Existing approaches, which are based primarily on 1D or 2D data decompositions, prevent the 3D transforms from effectively scaling to the maximum (possible/available) number of computer nodes. A highly scalable approach to realize forward/inverse 3D transforms has been proposed. It is based on a 3D decomposition of data and geared towards a torus network of computer nodes. The proposed algorithms requires compute-and-roll time-steps, where each step consists of an execution of multiple GEMM operations and concurrent movement of cubical data blocks between nearest neighbors. The aim of this paper is to present an experimental performance study of an implementation on high performance computer architecture.
format Article
id doaj-art-b10c4e152c834aa38a84fbcd55cbe4d4
institution Directory of Open Access Journals
issn 1314-4081
language English
publishDate 2020-12-01
publisher Sciendo
record_format Article
spelling doaj-art-b10c4e152c834aa38a84fbcd55cbe4d42025-08-19T20:44:12ZengSciendoCybernetics and Information Technologies1314-40812020-12-012069410410.2478/cait-2020-0064Performance Analysis of a Scalable Algorithm for 3D Linear Transforms on Supercomputer with Intel Processors/Co-ProcessorsLirkov Ivan0Institute of Information and Communication, Technologies, Bulgarian Academy of Sciences,1113Sofia, BulgariaPractical realizations of 3D forward/inverse separable discrete transforms, such as Fourier transform, cosine/sine transform, etc. are frequently the principal limiters that prevent many practical applications from scaling to a large number of processors. Existing approaches, which are based primarily on 1D or 2D data decompositions, prevent the 3D transforms from effectively scaling to the maximum (possible/available) number of computer nodes. A highly scalable approach to realize forward/inverse 3D transforms has been proposed. It is based on a 3D decomposition of data and geared towards a torus network of computer nodes. The proposed algorithms requires compute-and-roll time-steps, where each step consists of an execution of multiple GEMM operations and concurrent movement of cubical data blocks between nearest neighbors. The aim of this paper is to present an experimental performance study of an implementation on high performance computer architecture.https://doi.org/10.2478/cait-2020-00643d linear transformsparallel implementationintel processors/ co-processors
spellingShingle Lirkov Ivan
Performance Analysis of a Scalable Algorithm for 3D Linear Transforms on Supercomputer with Intel Processors/Co-Processors
3d linear transforms
parallel implementation
intel processors/ co-processors
title Performance Analysis of a Scalable Algorithm for 3D Linear Transforms on Supercomputer with Intel Processors/Co-Processors
title_full Performance Analysis of a Scalable Algorithm for 3D Linear Transforms on Supercomputer with Intel Processors/Co-Processors
title_fullStr Performance Analysis of a Scalable Algorithm for 3D Linear Transforms on Supercomputer with Intel Processors/Co-Processors
title_full_unstemmed Performance Analysis of a Scalable Algorithm for 3D Linear Transforms on Supercomputer with Intel Processors/Co-Processors
title_short Performance Analysis of a Scalable Algorithm for 3D Linear Transforms on Supercomputer with Intel Processors/Co-Processors
title_sort performance analysis of a scalable algorithm for 3d linear transforms on supercomputer with intel processors co processors
topic 3d linear transforms
parallel implementation
intel processors/ co-processors
url https://doi.org/10.2478/cait-2020-0064
work_keys_str_mv AT lirkovivan performanceanalysisofascalablealgorithmfor3dlineartransformsonsupercomputerwithintelprocessorscoprocessors