AI Chat Paper
Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.
{{lang === 'zh_CN' ? '文章概述' : 'Summary'}}
{{lang === 'en_US' ? '中' : 'Eng'}}
Chat more with AI
PDF (886.5 KB)
Collect
Submit Manuscript AI Chat Paper
Show Outline
Outline
Show full outline
Hide outline
Outline
Show full outline
Hide outline
Open Access | Just Accepted

A predictive algorithm for DNA binding sites with combined feature encoding and ensemble model

Zhendong Liu1( )Jiamin Jiang1Yanjie Wei2Chuanle Xiao3Jianxin Xue1Hengyang Wu1Shaojing Song1Tong Wang1

1 School of Computer and Information Engineering at Shanghai Second Polytechnic University

2 High-Performance Computing Research Center at the Shenzhen Institutes of Advanced Technology

3 Sun Yat-Sen University

Show Author Information

Abstract

The identification of latent DNA binding domains presents both significant scientific value and analytical complexity, given the extensive diversity within biological compound datasets. To address this challenge, our research introduces WeighDF, a novel computational framework integrating hybrid feature representation with adaptive multi-granularity scanning analysis. This approach dynamically weights features across scanning windows using learnable attenuation coefficients, which amplifies key sequence patterns and suppresses background noise. For comprehensive prediction of diverse DNA binding patterns, we further develop DecLPABS, an ensemble architecture combining WeighDF's adaptive scanning with meta-learner integration strategies. This dual-phase system demonstrates superior versatility in handling both categorical classification and continuous regression problems. Empirical validation across heterogeneous datasets reveals DecLPABS's enhanced predictive capability, achieving 0.8979 accuracy through optimized feature-space partitioning.

Tsinghua Science and Technology
Cite this article:
Liu Z, Jiang J, Wei Y, et al. A predictive algorithm for DNA binding sites with combined feature encoding and ensemble model. Tsinghua Science and Technology, 2025, https://doi.org/10.26599/TST.2025.9010115

25

Views

4

Downloads

0

Crossref

0

Web of Science

0

Scopus

0

CSCD

Altmetrics

Received: 19 March 2025
Revised: 08 May 2025
Accepted: 01 July 2025
Available online: 04 July 2025

© The author(s) 2025

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).

Return