AI Chat Paper
Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.
{{lang === 'zh_CN' ? '文章概述' : 'Summary'}}
{{lang === 'en_US' ? '中' : 'Eng'}}
Chat more with AI
PDF (2.2 MB)
Collect
Submit Manuscript AI Chat Paper
Show Outline
Outline
Show full outline
Hide outline
Outline
Show full outline
Hide outline
Publishing Language: Chinese

Feature selection in machine learning models of groundwater level and its application effect analysis

Minli GUO1( )Tianhang LIU2Erping BI3Xiaobin HU4Ying XIAO4Chunshi LIU4
Beijing Water Science & Technology Institute, Beijing 100048, China
College of Humanities and Development Studies, China Agricultural University, Beijing 100083, China
School of Water Resources and Environment, China University of Geosciences, Beijing 100083, China
Beijing Water Resources Dispatching Center, Beijing 100195, China
Show Author Information

Abstract

To improve the simulation performance of machine learning models for groundwater levels, four feature selection methods, including partial correlation analysis, Pearson correlation coefficient, maximum relevance-minimum redundancy (mRMR), and random forest (RF) methods, were employed to screen input parameters for three groundwater level machine learning models in the Mihuaishun Area. The simulation results before and after parameter feature selection were compared. The results show that different parameters require different feature selection methods. Groundwater level and its lagged values can be determined using partial correlation analysis, while artificial recharge and its lagged values, as well as precipitation and its lagged values, require a combination of mRMR and RF methods. Specifically, the mRMR method is more effective for selecting precipitation and its lagged values, whereas the RF method is better suited for screening artificial recharge and its lagged values. Feature selection significantly improved the simulation accuracy of the extreme learning machine (ELM) and RF models while enhancing the computational speed of the nonlinear autoregressive neural network with exogenous inputs (NARX) model. When applied to the three groundwater level machine learning models in the Mihuaishun Area, the parameter feature selection led to notable improvements that the ELM model showed a 63% reduction in root mean square error (RMSE), a 98% increase in the Nash-Sutcliffe efficiency coefficient (NSE), and a 45% improvement in the coefficient of determination (R2). The RF model achieved a 49% reduction in RMSE, a 6% increase in NSE, and a 2% improvement in R2, while the NARX model demonstrated an 11-fold increase in computational speed.

CLC number: TV213.4 Document code: A Article ID: 1004-6933(2025)03-0179-08

References

【1】
【1】
 
 
Water Resources Protection
Pages 179-186

{{item.num}}

Comments on this article

Go to comment

< Back to all reports

Review Status: {{reviewData.commendedNum}} Commended , {{reviewData.revisionRequiredNum}} Revision Required , {{reviewData.notCommendedNum}} Not Commended Under Peer Review

Review Comment

Close
Close
Cite this article:
GUO M, LIU T, BI E, et al. Feature selection in machine learning models of groundwater level and its application effect analysis. Water Resources Protection, 2025, 41(3): 179-186. https://doi.org/10.3880/j.issn.1004-6933.2025.03.021

643

Views

29

Downloads

0

Crossref

0

Web of Science

1

Scopus

0

CSCD

Received: 16 July 2024
Published: 20 May 2025
© Journal of Water Resources Protection