To improve the simulation performance of machine learning models for groundwater levels, four feature selection methods, namely partial correlation analysis, the Pearson correlation coefficient, maximum relevance-minimum redundancy (mRMR), and random forest (RF), were employed to screen input parameters for three groundwater level machine learning models in the Mihuaishun Area. The simulation results before and after parameter feature selection were compared. The results show that different parameters require different feature selection methods. Groundwater level and its lagged values can be screened using partial correlation analysis, while artificial recharge and its lagged values, as well as precipitation and its lagged values, require a combination of the mRMR and RF methods. Specifically, the mRMR method is more effective for selecting precipitation and its lagged values, whereas the RF method is better suited for screening artificial recharge and its lagged values. Feature selection significantly improved the simulation accuracy of the extreme learning machine (ELM) and RF models while enhancing the computational speed of the nonlinear autoregressive neural network with exogenous inputs (NARX) model. When applied to the three groundwater level machine learning models in the Mihuaishun Area, parameter feature selection led to notable improvements: the ELM model showed a 63% reduction in root mean square error (RMSE), a 98% increase in the Nash-Sutcliffe efficiency coefficient (NSE), and a 45% improvement in the coefficient of determination (R2); the RF model achieved a 49% reduction in RMSE, a 6% increase in NSE, and a 2% improvement in R2; and the NARX model demonstrated an 11-fold increase in computational speed.
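As an illustration of one of the feature-selection routes described above, the sketch below ranks candidate lagged inputs by random-forest importance and computes the RMSE and NSE skill scores used in the evaluation. All series, lag depths, and coefficients here are synthetic assumptions for demonstration, not data or parameters from the study.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

# Synthetic drivers: monthly precipitation and artificial recharge.
rng = np.random.default_rng(0)
n = 300
precip = rng.gamma(2.0, 5.0, n)
recharge = rng.gamma(1.5, 3.0, n)

# Build a groundwater-level series that depends mainly on its own lag-1
# value, lag-2 precipitation, and lag-1 recharge (assumed structure).
gwl = np.zeros(n)
for t in range(2, n):
    gwl[t] = (0.8 * gwl[t - 1] + 0.05 * precip[t - 2]
              + 0.1 * recharge[t - 1] + rng.normal(0.0, 0.1))

def lagged(x, k):
    """Lag-k slice aligned to the common window t = 3 .. n-1."""
    return x[3 - k : n - k]

# Candidate features: lags 1-3 of groundwater level, precipitation, recharge.
series = {"gwl": gwl, "precip": precip, "recharge": recharge}
names = [f"{s}_lag{k}" for s in series for k in (1, 2, 3)]
X = np.column_stack([lagged(x, k) for x in series.values() for k in (1, 2, 3)])
y = gwl[3:]

# RF feature importance screens the lagged inputs.
rf = RandomForestRegressor(n_estimators=200, random_state=0).fit(X, y)
ranking = sorted(zip(names, rf.feature_importances_), key=lambda p: -p[1])
for name, imp in ranking[:4]:
    print(f"{name}: {imp:.3f}")

# Skill scores cited in the abstract.
def rmse(obs, sim):
    return float(np.sqrt(np.mean((obs - sim) ** 2)))

def nse(obs, sim):  # Nash-Sutcliffe efficiency coefficient
    return float(1.0 - np.sum((obs - sim) ** 2)
                 / np.sum((obs - obs.mean()) ** 2))

sim = rf.predict(X)
print(f"RMSE = {rmse(y, sim):.3f}, NSE = {nse(y, sim):.3f}")
```

In this synthetic setup the lagged groundwater level dominates the importance ranking, mirroring the strong autocorrelation that makes partial correlation analysis suitable for screening that parameter in practice.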