Neuron-by-Neuron Quantization for Efficient Low-Bit QNN Training

Quantized neural networks (QNNs) are widely used to achieve computationally efficient solutions to recognition problems. Overall, eight-bit QNNs have almost the same accuracy as full-precision networks, but working several times faster. However, the networks with lower quantization levels demonstrat...

Full description

Bibliographic Details
Main Authors: Arlazarov, V.V (Author), Limonova, E. (Author), Nikolaev, D. (Author), Sher, A. (Author), Trusov, A. (Author)
Format: Article
Language:English
Published: MDPI 2023
Subjects:
Online Access:View Fulltext in Publisher
View in Scopus