Abstract
Stochastic gradient descent (SGD) is one of the most widely used optimization methods in neural network training. However, it suffers from certain limitations, such as converging to sharp minima and overfitting. The coevolutionary neural-based optimization algorithm is recognized for its robust global search capability and its effectiveness in reducing overfitting. Additionally, building on previous work exploring the relationship between loss landscape geometry and generalization, the squared gradient norm can serve as a criterion for identifying flat regions of the loss landscape, thereby improving model generalization. In this work, we propose a coevolutionary SGD (CSGD) algorithm that integrates the coevolutionary neural-based optimization approach with the squared gradient norm as a comparison criterion. The algorithm aims to minimize both the loss value and the sharpness of the loss landscape, thereby simultaneously addressing poor generalization and the tendency to settle in sharp local minima. We analyze the convergence of the proposed algorithm, and we present experimental results with multiple neural networks on benchmark datasets that demonstrate the advantages of the proposed method with respect to model generalization and the local minima phenomenon.
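The abstract does not give implementation details, but the comparison criterion it describes can be sketched as follows. This is a minimal PyTorch sketch under stated assumptions: the helper names `sharpness_score` and `select_survivor` are hypothetical, and the lexicographic loss-then-sharpness ordering is one plausible way to combine the two objectives, not necessarily the paper's.

```python
import torch

def sharpness_score(model, loss_fn, x, y):
    """Squared gradient norm of the loss at the current parameters,
    used as a proxy for loss-landscape sharpness (hypothetical helper)."""
    loss = loss_fn(model(x), y)
    params = [p for p in model.parameters() if p.requires_grad]
    grads = torch.autograd.grad(loss, params)
    sq_norm = sum(g.pow(2).sum() for g in grads)
    return loss, sq_norm

def select_survivor(candidates, loss_fn, x, y):
    """Compare candidate networks in a coevolving population:
    prefer lower loss, breaking ties by lower squared gradient norm
    (i.e., flatter minima). Ordering is an assumption for illustration."""
    scored = []
    for model in candidates:
        loss, sq_norm = sharpness_score(model, loss_fn, x, y)
        scored.append((loss.item(), sq_norm.item(), model))
    scored.sort(key=lambda t: (t[0], t[1]))  # lexicographic comparison
    return scored[0][2]
```

In a population-based training loop, each candidate would take its own SGD steps, with `select_survivor` applied periodically so that individuals in flatter regions of the landscape are retained.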