A Binary Translator to Accelerate Development of Deep Learning Processing Library for AArch64 CPU

To accelerate deep learning (DL) processes on the supercomputer Fugaku, the authors have ported and optimized oneDNN for Fugaku's CPU, the Fujitsu A64FX. oneDNN is an open-source DL processing library developed by Intel for the x86 64 architecture. The A64FX CPU is based on the Armv8-A architec...

Full description

Bibliographic Details
Main Authors: Fukumoto, N. (Author), Honda, T. (Author), Kawakami, K. (Author), Kurihara, K. (Author), Yamazaki, M. (Author)
Format: Article
Language:English
Published: Institute of Electronics Information Communication Engineers 2022
Subjects:
Online Access:View Fulltext in Publisher