AI Chat Paper
Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.
{{lang === 'zh_CN' ? '文章概述' : 'Summary'}}
{{lang === 'en_US' ? '中' : 'Eng'}}
Chat more with AI
PDF (2 MB)
Collect
Submit Manuscript AI Chat Paper
Show Outline
Outline
Show full outline
Hide outline
Outline
Show full outline
Hide outline
Open Access

A Knowledge Graph Enhanced Pre-Trained Large Language Model for Predicting MicroRNA-circRNA Interactions

School of Computer Science, Northwestern Polytechnical University, Xi’an 710129, China
College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China

Jiren Zhou and Rui Niu contribute equally to this work.

Show Author Information

Abstract

The interactions between circular RNAs (circRNAs) and microRNAs are one of the key mechanisms determining the functions of non-coding RNAs (ncRNAs) in biological processes such as DNA methylation and RNA-induced silencing. Studying these relationships can deepen our understanding of the function of these RNAs’ roles in developing cancer vaccines and designing treatments. Therefore, we propose a knowledge graph enhanced pre-trained Large Language Model (LLM) for predicting circRNA-microRNA interactions. Our approach employs graph contrastive learning to represent a knowledge graph consisting of circRNA and microRNA entities from multi-views. The features of these entities are derived by fine-tuning a sequential LLM by two types of ncRNAs separately. At the final, the embedding is fed into classifier for prediction. We employ an independent testing set to evaluate the model’s performance and against our model with recently reported models on two datasets. Our model achieves approximately a 3% improvement in Area Under the Receiver Operating Characteristic Curve (AUROC), reaching 93.77% and 93.07%, respectively. The stability of our model is tested by performing 10-fold cross-validation on the remaining training set where our model performs the best stability. In ablation study, we comprehensively compare strategies for sequence processing and effectiveness of independent module. Finally, on a case study dataset derived from real-world scenarios, the model assign scores to all candidates and rank them accordingly. Among the top 10 highest-scoring results, 7 have been validated by wet-lab experiments, highlighting the model’s strong generalization capability.

References

【1】
【1】
 
 
Big Data Mining and Analytics
Pages 1405-1417

{{item.num}}

Comments on this article

Go to comment

< Back to all reports

Review Status: {{reviewData.commendedNum}} Commended , {{reviewData.revisionRequiredNum}} Revision Required , {{reviewData.notCommendedNum}} Not Commended Under Peer Review

Review Comment

Close
Close
Cite this article:
Zhou J, Niu R, Ji B, et al. A Knowledge Graph Enhanced Pre-Trained Large Language Model for Predicting MicroRNA-circRNA Interactions. Big Data Mining and Analytics, 2025, 8(6): 1405-1417. https://doi.org/10.26599/BDMA.2025.9020014

1174

Views

101

Downloads

0

Crossref

0

Web of Science

0

Scopus

0

CSCD

Received: 22 April 2024
Revised: 21 December 2024
Accepted: 24 January 2025
Published: 19 September 2025
© The author(s) 2025.

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).