AI Chat Paper

Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.

Chat more with AI

| Sign up

Browse by Subject

Search for peer-reviewed journals with full access.

Journals A - Z

About Us

Discover the SciOpen Platform and Achieve Your Research Goals with Ease.

About Us

Publish with Us

Support

Search articles, authors, keywords, DOl and etc.

Published Date

Reset Search

{{expandStatus?'Exit ':''}}Advanced Search

Journals A - Z

About Us

Publish with Us

Support

Article Link

Cite

EndNote(RIS) BibTeX

Collect

Submit Manuscript

Show Outline

Outline

Show full outline

Hide outline

Outline

Show full outline

Hide outline

Research | Open Access

ED2: environment dynamics decomposition world models for continuous control

Yifu Yuan^¹

, Hongyao Tang^¹

, Cong Wang^¹

, Yan Zheng^¹, Jianye Hao^¹

(

)

College of Intelligence and Computing, Tianjin University, Tianjin 300350, China

Show Author Information

Abstract

Model-based reinforcement learning (MBRL) achieves significant sample efficiency in practice in comparison to model-free RL, but its performance is often limited by the existence of model prediction error. To reduce the model error, standard MBRL approaches train a single well-designed network to fit the entire environment dynamics, but this wastes rich information on multiple sub-dynamics, which can be modeled separately, allowing us to construct the world model more accurately. In this paper, we propose environment dynamics decomposition (ED2), a novel world model construction framework that models the environment in a decomposing manner. ED2 contains two key components: sub-dynamics discovery (SD2) and dynamics decomposition prediction (D2P). SD2 discovers the sub-dynamics in an environment automatically and D2P constructs the decomposed world model following the sub-dynamics. ED2 can be easily combined with the existing MBRL algorithms and empirical results show that ED2 significantly reduces the model error, increases the sample efficiency, and achieves higher asymptotic performance when combined with the state-of-the-art MBRL algorithms on various continuous control tasks.

Keywords

Reinforcement learning (RL)Model-based reinforcement learning (MBRL)Visual world model Continuous control

References

【1】

Crossref Google Scholar

Visual Intelligence

Volume 3,
2025

Article number: 23

DOI: 10.1007/s44267-025-00094-x

	{{item.num}}
{{version.versionName}} Author Response
{{version.versionName}} Review comment

Comments on this article

Go to comment

< Back to all reports

Review Status: {{reviewData.commendedNum}} Commended , {{reviewData.revisionRequiredNum}} Revision Required , {{reviewData.notCommendedNum}} Not Commended Under Peer Review

Review Comment

Cite this Report

. . , {{reviewData.reportCite.doi}}

Cite this article:

Yuan Y, Tang H, Wang C, et al. ED2: environment dynamics decomposition world models for continuous control. Visual Intelligence, 2025, 3: 23. https://doi.org/10.1007/s44267-025-00094-x

379

Views

Crossref

Google Scholar
Citation

Received: 10 April 2025

Revised: 16 October 2025

Accepted: 21 October 2025

Published: 03 December 2025

This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.