Learning Sparse Low-Precision Neural Networks With Learnable Regularization
We consider learning deep neural networks (DNNs) that consist of low-precision weights and activations for efficient inference with fixed-point operations. In training low-precision networks, gradient descent in the backward pass is performed with high-precision weights, while quantized low-precision weights...
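To make the training scheme the abstract describes concrete, here is a minimal PyTorch sketch of quantization-aware training with a straight-through estimator (STE): the forward pass uses quantized weights, and the gradient is applied to a high-precision master copy. This is an illustrative assumption, not the paper's code; the class name QuantizeSTE, the symmetric 4-bit uniform quantizer, and the toy loss are hypothetical, and the learnable regularization for sparsity from the paper's title is omitted.

```python
import torch

class QuantizeSTE(torch.autograd.Function):
    """Uniform weight quantizer with a straight-through estimator (STE).

    Forward: replace high-precision weights with quantized values.
    Backward: pass the gradient through unchanged, so the update is
    applied to the high-precision (master) weights.
    """

    @staticmethod
    def forward(ctx, w, num_bits):
        qmax = 2 ** (num_bits - 1) - 1                 # e.g. 7 for 4 bits
        scale = w.abs().max().clamp(min=1e-8) / qmax   # per-tensor step size
        return torch.round(w / scale).clamp(-qmax - 1, qmax) * scale

    @staticmethod
    def backward(ctx, grad_output):
        # Identity gradient w.r.t. w; None corresponds to num_bits.
        return grad_output, None

# Toy usage: low-precision forward pass, high-precision update.
w = torch.randn(8, 8, requires_grad=True)       # high-precision master weights
x = torch.randn(4, 8)
loss = (x @ QuantizeSTE.apply(w, 4).t()).pow(2).mean()
loss.backward()                                  # gradient lands on w via STE
w.data -= 0.1 * w.grad                           # SGD step on master weights
```

Keeping the master weights in high precision lets many small gradient steps accumulate even though each forward pass sees only the coarse quantized values.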
Main Authors: Yoojin Choi, Mostafa El-Khamy, Jungwon Lee
Format: Article
Language: English
Published: IEEE, 2020-01-01
Series: IEEE Access
Online Access: https://ieeexplore.ieee.org/document/9098870/
Similar Items
- Exploring Accumulated Gradient-Based Quantization and Compression for Deep Neural Networks
  by: Gaopande, Meghana Laxmidhar
  Published: (2020)
- Learning Sparse Convolutional Neural Network via Quantization With Low Rank Regularization
  by: Xin Long, et al.
  Published: (2019-01-01)
- Optimized Compression for Implementing Convolutional Neural Networks on FPGA
  by: Min Zhang, et al.
  Published: (2019-03-01)
- Ps and Qs: Quantization-Aware Pruning for Efficient Low Latency Neural Network Inference
  by: Benjamin Hawks, et al.
  Published: (2021-07-01)
- Deep Learning Models Compression for Agricultural Plants
  by: Arnauld Nzegha Fountsop, et al.
  Published: (2020-09-01)