Deconstructing the generalization gap

Gromov, Andrey

doi:10.1038/s42256-023-00766-7

News & Views
Published: 18 December 2023

Neural networks

Andrey Gromov^1,2

Nature Machine Intelligence volume 5, pages 1340–1341 (2023)

New research reveals a duality between neural network weights and neuron activities that enables a geometric decomposition of the generalization gap. The framework provides a way to interpret the effects of regularization schemes such as stochastic gradient descent and dropout on generalization — and to improve upon these methods.

**Fig. 1: Two methods to reduce the generalization gap.**

References

Neyshabur, B., Tomioka, R. & Srebro, N. Preprint at https://doi.org/10.48550/arXiv.1412.6614 (2014).
Zhang, C., Bengio, S., Hardt, M., Recht, B. & Vinyals, O. In Int. Conf. Learning Representations (ICLR) 2017 https://openreview.net/forum?id=Sy8gdB9xx (2022).
Feng, Y., Zhang, W. & Tu, Y. Nat. Mach. Intell. 5, 908–918 (2023).

Author information

Authors and Affiliations

Condensed Matter Theory Center, University of Maryland, College Park, MD, USA
Andrey Gromov
Department of Physics, University of Maryland, College Park, MD, USA
Andrey Gromov

Corresponding author

Correspondence to Andrey Gromov.

Ethics declarations

Competing interests

The author declares no competing interests.

About this article

Cite this article

Gromov, A. Deconstructing the generalization gap. Nat Mach Intell 5, 1340–1341 (2023). https://doi.org/10.1038/s42256-023-00766-7

Issue Date: December 2023
DOI: https://doi.org/10.1038/s42256-023-00766-7
