Decoding the Structural Keywords in Protein Structure Universe

Wessam Elhefnawy; Min Li; Jian-Xin Wang; Yaohang Li

doi:10.1007/s11390-019-1895-y

AI Chat Paper

Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.

Chat more with AI

| Sign up

Browse by Subject

Search for peer-reviewed journals with full access.

Journals A - Z

About Us

Discover the SciOpen Platform and Achieve Your Research Goals with Ease.

About Us

Publish with Us

Support

Search articles, authors, keywords, DOl and etc.

Published Date

Reset Search

{{expandStatus?'Exit ':''}}Advanced Search

Journals A - Z

About Us

Publish with Us

Support

Article Link

Cite

EndNote(RIS) BibTeX

Collect

Submit Manuscript

Show Outline

Outline

Show full outline

Hide outline

Outline

Show full outline

Hide outline

Regular Paper

Decoding the Structural Keywords in Protein Structure Universe

Wessam Elhefnawy^¹, Min Li^², Jian-Xin Wang^², Yaohang Li^¹(

)

Department of Computer Science, Old Dominion University, Norfolk, VA 23452, U.S.A.

Department of Computer Science, Central South University, Changsha 410083, China

Show Author Information

Abstract

Although the protein sequence-structure gap continues to enlarge due to the development of high-throughput sequencing tools, the protein structure universe tends to be complete without proteins with novel structural folds deposited in the protein data bank (PDB) recently. In this work, we identify a protein structural dictionary (Frag-K) composed of a set of backbone fragments ranging from 4 to 20 residues as the structural “keywords” that can effectively distinguish between major protein folds. We firstly apply randomized spectral clustering and random forest algorithms to construct representative and sensitive protein fragment libraries from a large scale of high-quality, non-homologous protein structures available in PDB. We analyze the impacts of clustering cut-offs on the performance of the fragment libraries. Then, the Frag-K fragments are employed as structural features to classify protein structures in major protein folds defined by SCOP (Structural Classification of Proteins). Our results show that a structural dictionary with ~400 4- to 20-residue Frag-K fragments is capable of classifying major SCOP folds with high accuracy.

Keywords

protein fragment fold recognition protein structure universe

Electronic Supplementary Material

Download File(s)

jcst-34-1-3-Highlights.pdf (721.9 KB)

References

【1】

Crossref Google Scholar

Journal of Computer Science and Technology

Volume 34 Issue 1,
January 2019

Pages 3-15

DOI: 10.1007/s11390-019-1895-y

	{{item.num}}
{{version.versionName}} Author Response
{{version.versionName}} Review comment

Comments on this article

Go to comment

< Back to all reports

Review Status: {{reviewData.commendedNum}} Commended , {{reviewData.revisionRequiredNum}} Revision Required , {{reviewData.notCommendedNum}} Not Commended Under Peer Review

Review Comment

Cite this Report

. . , , {{reviewData.reportCite.doi}}

Cite this article:

Elhefnawy W, Li M, Wang J-X, et al. Decoding the Structural Keywords in Protein Structure Universe. Journal of Computer Science and Technology, 2019, 34(1): 3-15. https://doi.org/10.1007/s11390-019-1895-y

854

Views

Crossref

N/A

Web of Science

Scopus

CSCD

Google Scholar
Citation

Received: 13 July 2018

Revised: 04 December 2018

Published: 18 January 2019