Conferences >2017 IEEE International Confe...

Centered Weight Normalization in Accelerating Training of Deep Neural Networks

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Training deep neural networks is difficult for the pathological curvature problem. Re-parameterization is an effective way to relieve the problem by learning the curvatur...Show More

Metadata

Abstract:

Training deep neural networks is difficult for the pathological curvature problem. Re-parameterization is an effective way to relieve the problem by learning the curvature approximately or constraining the solutions of weights with good properties for optimization. This paper proposes to reparameterize the input weight of each neuron in deep neural networks by normalizing it with zero-mean and unit-norm, followed by a learnable scalar parameter to adjust the norm of the weight. This technique effectively stabilizes the distribution implicitly. Besides, it improves the conditioning of the optimization problem and thus accelerates the training of deep neural networks. It can be wrapped as a linear module in practice and plugged in any architecture to replace the standard linear module. We highlight the benefits of our method on both multi-layer perceptrons and convolutional neural networks, and demonstrate its scalability and efficiency on SVHN, CIFAR-10, CIFAR-100 and ImageNet datasets.

Published in: 2017 IEEE International Conference on Computer Vision (ICCV)

Date of Conference: 22-29 October 2017

Date Added to IEEE Xplore: 25 December 2017

ISBN Information:

Electronic ISSN: 2380-7504

DOI: 10.1109/ICCV.2017.305

Conference Location: Venice, Italy

Contents

References is not available for this document.

Centered Weight Normalization in Accelerating Training of Deep Neural Networks

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Centered Weight Normalization in Accelerating Training of Deep Neural Networks

Alerts

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?