A Binary Translator to Accelerate Development of Deep Learning Processing Library for AArch64 CPU

To accelerate deep learning (DL) processes on the supercomputer Fugaku, the authors have ported and optimized oneDNN for Fugaku's CPU, the Fujitsu A64FX. oneDNN is an open-source DL processing library developed by Intel for the x86_64 architecture. The A64FX CPU is based on the Armv8-A architec...

Bibliographic Details
Main Authors:	Fukumoto, N. (Author), Honda, T. (Author), Kawakami, K. (Author), Kurihara, K. (Author), Yamazaki, M. (Author)
Format:	Article
Language:	English
Published:	Institute of Electronics, Information and Communication Engineers, 2022
Subjects:	AArch64; binary translator; deep learning; just-in-time assembler; oneDNN
Online Access:	View Fulltext in Publisher

Similar Items