Original Paper | Open Access | Just Accepted

Coevolutionary SGD for Improving Generalization

Dan Su, Chunhua Yang, Weihua Gui

School of Automation, Central South University, Changsha, Hunan 410083, China

Abstract

Stochastic gradient descent (SGD) is one of the most widely used optimization methods in neural network training. However, it has known limitations, such as converging to sharp minima and overfitting. The coevolutionary neural-based optimization algorithm is recognized for its robust global search capability and effectiveness in reducing overfitting. Additionally, previous work on the relationship between loss landscape geometry and generalization suggests that the squared gradient norm can be used as a criterion for identifying flat loss landscapes, thereby improving model generalization. In this work, we propose a coevolutionary SGD (CSGD) algorithm that integrates the coevolutionary neural-based optimization approach with the squared gradient norm as a comparison criterion. This algorithm aims to minimize both the loss values and the sharpness of the loss landscape, thereby addressing both sources of poor generalization simultaneously. We analyze the convergence of the proposed algorithm and report experimental results with multiple neural networks on benchmark datasets, which demonstrate the advantages of the proposed method with respect to model generalization and the local minima phenomenon.
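
To illustrate why the squared gradient norm can serve as a flatness-aware comparison criterion, the following is a minimal sketch and not the CSGD algorithm itself. It assumes two toy quadratic basins, and the names `quadratic_loss`, `quadratic_grad`, `score`, and the weight `lam` are hypothetical choices made for this example; the abstract does not state how the loss and the gradient norm are combined, so the weighted sum used here is an assumption. For a quadratic basin with curvature a, a point with loss L has squared gradient norm 2aL, so two candidates with equal loss are separated by the criterion according to how sharp their basins are.

```python
import math


def quadratic_loss(w, center, curvature):
    """Toy loss 0.5 * curvature * ||w - center||^2 (hypothetical stand-in for a network loss)."""
    return 0.5 * curvature * sum((wi - ci) ** 2 for wi, ci in zip(w, center))


def quadratic_grad(w, center, curvature):
    """Analytic gradient of quadratic_loss with respect to w."""
    return [curvature * (wi - ci) for wi, ci in zip(w, center)]


def squared_grad_norm(grad):
    """Squared Euclidean norm of a gradient vector."""
    return sum(g * g for g in grad)


def score(loss, grad, lam):
    """Assumed comparison criterion: loss plus lam times the squared gradient norm."""
    return loss + lam * squared_grad_norm(grad)


center = [0.0, 0.0]
target_loss = 0.01
sharp_curvature, flat_curvature = 50.0, 0.5

# Place each candidate so that both have the same loss in their own basin.
sharp_w = [math.sqrt(2 * target_loss / sharp_curvature), 0.0]
flat_w = [math.sqrt(2 * target_loss / flat_curvature), 0.0]

sharp_loss = quadratic_loss(sharp_w, center, sharp_curvature)
flat_loss = quadratic_loss(flat_w, center, flat_curvature)

sharp_sq = squared_grad_norm(quadratic_grad(sharp_w, center, sharp_curvature))
flat_sq = squared_grad_norm(quadratic_grad(flat_w, center, flat_curvature))

lam = 0.1  # assumed trade-off weight, not taken from the paper
scores = {
    "sharp": score(sharp_loss, quadratic_grad(sharp_w, center, sharp_curvature), lam),
    "flat": score(flat_loss, quadratic_grad(flat_w, center, flat_curvature), lam),
}

# The candidate with the lower score is preferred.
best = min(scores, key=scores.get)
print(best, scores)
```

Comparing by loss alone cannot separate the two candidates here, whereas the gradient-norm term favors the one in the flatter basin. In a real network the gradient would come from backpropagation on a mini-batch rather than from a closed-form expression.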

Tsinghua Science and Technology

Cite this article:
Su D, Yang C, Gui W. Coevolutionary SGD for Improving Generalization. Tsinghua Science and Technology, 2025, https://doi.org/10.26599/TST.2025.9010126

Views: 320 | Downloads: 20 | Crossref: 0 | Web of Science: 0 | Scopus: 0 | CSCD: 0

Received: 01 April 2025
Revised: 08 June 2025
Accepted: 29 July 2025
Available online: 30 July 2025

© The author(s) 2025

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).