A Binary Translator to Accelerate Development of Deep Learning Processing Library for AArch64 CPU
To accelerate deep learning (DL) processes on the supercomputer Fugaku, the authors have ported and optimized oneDNN for Fugaku's CPU, the Fujitsu A64FX. oneDNN is an open-source DL processing library developed by Intel for the x86 64 architecture. The A64FX CPU is based on the Armv8-A architec...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Institute of Electronics Information Communication Engineers
2022
|
Subjects: | |
Online Access: | View Fulltext in Publisher |