Convergence of a Relaxed Variable Splitting Coarse Gradient Descent Method for Learning Sparse Weight Binarized Activation Neural Network

Sparsification of neural networks is one of the effective complexity reduction methods to improve efficiency and generalizability. Binarized activation offers an additional computational saving for inference. Due to vanishing gradient issue in training networks with binarized activation, coarse grad...

Full description

Bibliographic Details
Main Authors: Thu Dinh, Jack Xin
Format: Article
Language:English
Published: Frontiers Media S.A. 2020-05-01
Series:Frontiers in Applied Mathematics and Statistics
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fams.2020.00013/full