Original Paper | Open Access | Just Accepted

SpaViT: Self-supervised Prediction of High-Resolution Spatial Transcriptomics with Vision Transformer

Wenwen Min1, Shuailin Xue1, Fangfang Zhu2, Taosheng Xu3, Changmiao Wang4, Hong-Bo Xie5

1 School of Information Science and Engineering, Yunnan University, Kunming 650500, China

2 School of Health and Nursing, Yunnan Open University, Kunming 650599, China

3 Institute of Intelligent Machines, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, China

4 Shenzhen Research Institute of Big Data, Shenzhen, China

5 Centre for Data Science, School of Mathematical Sciences, Queensland University of Technology

Abstract

The spatiotemporal specificity of gene expression highlights the importance of integrating cellular spatial information to better understand the specific functions of cells within tissues. The advent of spatial transcriptomics (ST) technologies has made it possible to quantitatively measure gene expression in cells while also pinpointing their exact locations within tissues. However, widely used ST techniques are frequently limited by low resolution, which can prevent researchers from fully understanding gene expression patterns, cell type distribution, and their interactions. Here we propose SpaViT, a self-supervised method based on the Transformer architecture for predicting high-resolution gene expression. SpaViT leverages customized self-supervised proxy tasks to learn the continuous patterns of gene expression within tissues and to predict high-resolution gene expression profiles. We evaluate the performance of SpaViT on diverse datasets from different platforms with different technologies. The results indicate superior performance of SpaViT in enhancing spatial resolution and predicting gene expression in unmeasured areas compared to other deep learning and traditional interpolation methods. Additionally, SpaViT enhances the spatial patterns of gene expression, aiding researchers in identifying biologically significant differentially expressed genes and pathways. Our source code and all datasets used in this study are available at https://github.com/wenwenmin/SpaViT and https://zenodo.org/records/14160324.
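
The abstract compares SpaViT against traditional interpolation methods and describes training without high-resolution ground truth. The sketch below illustrates one generic way such a comparison can be set up, and it is not taken from the SpaViT code: the measured spot grid for a single gene is downsampled, a bilinear interpolation baseline reconstructs the full grid, and the error is measured on the held-out spots. The function names (`downsample`, `bilinear_upsample`), the factor of 2, and the synthetic 9 x 9 grid are all assumptions made for illustration.

```python
import numpy as np


def downsample(grid, factor=2):
    """Keep every `factor`-th spot along both spatial axes (hypothetical helper)."""
    return grid[::factor, ::factor]


def bilinear_upsample(grid, out_shape):
    """Bilinearly interpolate a 2D grid onto `out_shape`, aligning the corners."""
    h, w = grid.shape
    out_h, out_w = out_shape
    # Fractional coordinates of each output spot in the coarse grid.
    ys = np.linspace(0, h - 1, out_h)
    xs = np.linspace(0, w - 1, out_w)
    y0 = np.floor(ys).astype(int)
    x0 = np.floor(xs).astype(int)
    y1 = np.minimum(y0 + 1, h - 1)
    x1 = np.minimum(x0 + 1, w - 1)
    wy = (ys - y0)[:, None]  # vertical interpolation weights
    wx = (xs - x0)[None, :]  # horizontal interpolation weights
    top = grid[y0][:, x0] * (1 - wx) + grid[y0][:, x1] * wx
    bottom = grid[y1][:, x0] * (1 - wx) + grid[y1][:, x1] * wx
    return top * (1 - wy) + bottom * wy


# Synthetic expression map for one gene: a smooth linear gradient (assumed data).
rows, cols = np.mgrid[0:9, 0:9]
measured = (rows + 2 * cols).astype(float)

# Proxy task: hide spots by downsampling, then reconstruct the full grid.
coarse = downsample(measured)
recon = bilinear_upsample(coarse, measured.shape)

# Score only the spots that were hidden from the baseline.
held_out = np.ones(measured.shape, dtype=bool)
held_out[::2, ::2] = False
mse = float(np.mean((recon[held_out] - measured[held_out]) ** 2))
print(f"held-out MSE: {mse:.3g}")
```

On a linear gradient the bilinear baseline is exact, so the held-out error is zero. Real expression maps are sparse and non-linear, which is the regime where the abstract reports that a learned model outperforms interpolation.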

Tsinghua Science and Technology

Cite this article:
Min W, Xue S, Zhu F, et al. SpaViT: Self-supervised Prediction of High-Resolution Spatial Transcriptomics with Vision Transformer. Tsinghua Science and Technology, 2025, https://doi.org/10.26599/TST.2025.9010087

Views: 6260 | Downloads: 85 | Crossref: 1 | Web of Science: 0 | Scopus: 0 | CSCD: 0

Received: 21 November 2024
Revised: 22 January 2025
Accepted: 21 April 2025
Available online: 25 August 2025

© The author(s) 2025

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).