Article | Open Access

Design and application of a semantic-driven geospatial modeling knowledge graph based on large language models

Jianyuan Liang (a), Shuyang Hou (a), Anqi Zhao (a), Qingyang Xu (a), Longgang Xiang (a), Rui Li (a), Huayi Wu (a, b)
State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan, China
Collaborative Innovation Center of Geospatial Technology, Wuhan University, Wuhan, China

Abstract

While leveraging large language models (LLMs) for intelligent geospatial modeling has garnered significant attention, the limited domain-specific knowledge of LLMs often leads to inefficient or unreliable geo-analysis model generation. Crowdsourced geoprocessing scripts encapsulate extensive expert knowledge for different geospatial modeling tasks, where code snippets are strategically combined into functional steps to build application-specific modeling processes. However, extracting these modeling processes from heterogeneous geoprocessing scripts and integrating them for reuse remains challenging due to the complexity of code interdependencies, the heterogeneity of scripting approaches, and the need for domain-specific customization. To address this, we propose S-GMKG, a knowledge graph that systematically extracts and integrates modeling processes from scripts as structured semantic units. Two strategies are introduced: a skeleton-based extraction method and a knowledge-enhanced chain of thought (CoT) approach, which facilitate automated modeling process extraction for S-GMKG via prompt engineering. Furthermore, a self-canonicalization and knowledge augmentation process is proposed to refine the S-GMKG. Consequently, S-GMKG serves as a robust external knowledge source to provide interpretable, graph-based modeling solutions and synergizes with LLMs for geospatial tasks. We implemented the S-GMKG using 4820 geoprocessing scripts and evaluated it across various LLMs. Results indicate that most scripts in the S-GMKG can be represented as modeling processes with 3–7 functional steps, with the proposed strategies achieving 3.2%–14.5% higher recall rates in relationship identification for these functional steps. Case studies in two distinct scenarios demonstrate the practicality of S-GMKG, particularly in collaborating with LLMs to generate code for geospatial modeling.

Geo-Spatial Information Science
Pages 2927-2946

Cite this article:
Liang J, Hou S, Zhao A, et al. Design and application of a semantic-driven geospatial modeling knowledge graph based on large language models. Geo-Spatial Information Science, 2025, 28(6): 2927-2946. https://doi.org/10.1080/10095020.2025.2483884

Views: 173 | Crossref: 10 | Web of Science: 12 | Scopus: 12 | CSCD: 0

Received: 12 September 2024
Accepted: 19 March 2025
Published: 07 April 2025
© 2025 Wuhan University.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The terms on which this article has been published allow the posting of the Accepted Manuscript in a repository by the author(s) or with their consent.