OpenCNN: A Winograd Minimal Filtering Algorithm Implementation in CUDA

Improving the performance of the convolution operation has become a key target for High Performance Computing (HPC) developers due to its prevalence in deep learning applied mainly to video processing. The improvement is being pushed by algorithmic and implementation innovations. Algorithmically, th...

Full description

Bibliographic Details
Main Authors: Roberto L. Castro, Diego Andrade, Basilio B. Fraguela
Format: Article
Language:English
Published: MDPI AG 2021-08-01
Series:Mathematics
Subjects:
Online Access:https://www.mdpi.com/2227-7390/9/17/2033