Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Jung J, Lee B. Protein structure alignment using environmental profiles. Protein Eng 2000;13:535-43. [PMID: 10964982 DOI: 10.1093/protein/13.8.535] [Citation(s) in RCA: 95] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]

Number

Cited by Other Article(s)

Choi DH, Kang SK, Lee KE, Jung J, Kim EJ, Kim WH, Kwon YG, Kim KP, Jo I, Park YS, Park SI. Nitrosylation of β2-Tubulin Promotes Microtubule Disassembly and Differentiated Cardiomyocyte Beating in Ischemic Mice. Tissue Eng Regen Med 2023;20:921-937. [PMID: 37679590 PMCID: PMC10519925 DOI: 10.1007/s13770-023-00582-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 05/04/2023] [Accepted: 05/10/2023] [Indexed: 09/09/2023] Open

Dixit H, Kulharia M, Verma SK. Metalloproteome of human-infective RNA viruses: a study towards understanding the role of metal ions in virology. Pathog Dis 2023;81:ftad020. [PMID: 37653445 DOI: 10.1093/femspd/ftad020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 08/07/2023] [Accepted: 08/29/2023] [Indexed: 09/02/2023] Open

SeqCP: A sequence-based algorithm for searching circularly permuted proteins. Comput Struct Biotechnol J 2022;21:185-201. [PMID: 36582435 PMCID: PMC9763678 DOI: 10.1016/j.csbj.2022.11.024] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 11/10/2022] [Accepted: 11/10/2022] [Indexed: 11/16/2022] Open

Liu C, Boland S, Scholle MD, Bardiot D, Marchand A, Chaltin P, Blatt LM, Beigelman L, Symons JA, Raboisson P, Gurard-Levin ZA, Vandyck K, Deval J. Dual inhibition of SARS-CoV-2 and human rhinovirus with protease inhibitors in clinical development. Antiviral Res 2021;187:105020. [PMID: 33515606 PMCID: PMC7839511 DOI: 10.1016/j.antiviral.2021.105020] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2020] [Revised: 01/05/2021] [Accepted: 01/17/2021] [Indexed: 12/14/2022]

Wen Z, He J, Huang SY. Topology-independent and global protein structure alignment through an FFT-based algorithm. Bioinformatics 2020;36:478-486. [PMID: 31384919 DOI: 10.1093/bioinformatics/btz609] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2019] [Revised: 07/22/2019] [Accepted: 08/02/2019] [Indexed: 12/12/2022] Open

Benchmarking Methods of Protein Structure Alignment. J Mol Evol 2020;88:575-597. [PMID: 32725409 DOI: 10.1007/s00239-020-09960-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Accepted: 07/10/2020] [Indexed: 10/23/2022]

Frank M, Beccati D, Leeflang BR, Vliegenthart JFG. C-Mannosylation Enhances the Structural Stability of Human RNase 2. iScience 2020;23:101371. [PMID: 32739833 PMCID: PMC7399192 DOI: 10.1016/j.isci.2020.101371] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2020] [Revised: 06/22/2020] [Accepted: 07/13/2020] [Indexed: 12/25/2022] Open

Koo N, Shin AY, Oh S, Kim H, Hong S, Park SJ, Sim YM, Byeon I, Kim KY, Lim YP, Kwon SY, Kim YM. Comprehensive analysis of Translationally Controlled Tumor Protein (TCTP) provides insights for lineage-specific evolution and functional divergence. PLoS One 2020;15:e0232029. [PMID: 32374732 PMCID: PMC7202613 DOI: 10.1371/journal.pone.0232029] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2019] [Accepted: 04/06/2020] [Indexed: 12/28/2022] Open

Abstract

BACKGROUND

Translationally controlled tumor protein (TCTP) is a conserved, multifunctional protein involved in numerous cellular processes in eukaryotes. Although the functions of TCTP have been investigated sporadically in animals, invertebrates, and plants, few lineage-specific activities of this molecule, have been reported. An exception is in Arabidopsis thaliana, in which TCTP (AtTCTP1) functions in stomatal closuer by regulating microtubule stability. Further, although the development of next-generation sequencing technologies has facilitated the analysis of many eukaryotic genomes in public databases, inter-kingdom comparative analyses using available genome information are comparatively scarce.

METHODOLOGY

To carry out inter-kingdom comparative analysis of TCTP, TCTP genes were identified from 377 species. Then phylogenetic analysis, prediction of protein structure, molecular docking simulation and molecular dynamics analysis were performed to investigate the evolution of TCTP genes and their binding proteins.

RESULTS

A total of 533 TCTP genes were identified from 377 eukaryotic species, including protozoa, fungi, invertebrates, vertebrates, and plants. Phylogenetic and secondary structure analyses reveal lineage-specific evolution of TCTP, and inter-kingdom comparisons highlight the lineage-specific emergence of, or changes in, secondary structure elements in TCTP proteins from different kingdoms. Furthermore, secondary structure comparisons between TCTP proteins within each kingdom, combined with measurements of the degree of sequence conservation, suggest that TCTP genes have evolved to conserve protein secondary structures in a lineage-specific manner. Additional tertiary structure analysis of TCTP-binding proteins and their interacting partners and docking simulations between these proteins further imply that TCTP gene variation may influence the tertiary structures of TCTP-binding proteins in a lineage-specific manner.

CONCLUSIONS

Our analysis suggests that TCTP has undergone lineage-specific evolution and that structural changes in TCTP proteins may correlate with the tertiary structure of TCTP-binding proteins and their binding partners in a lineage-specific manner.

Collapse

Affiliation(s)

Namjin Koo Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
Ah-Young Shin Plant Systems Engineering Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
Sangho Oh Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
Hyeongmin Kim Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea Department of Biomedical Informatics, Center for Genome Science, National Institute of Health, KCDC, Choongchung-Buk-do, Republic of Korea
Seongmin Hong Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea Molecular Genetics and Genomics Laboratory, Department of Horticulture, College of Agriculture and Life Science, Chungnam National University, Daejeon, Korea
Seong-Jin Park Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
Young Mi Sim Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
Iksu Byeon Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
Kye Young Kim Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
Yong Pyo Lim Molecular Genetics and Genomics Laboratory, Department of Horticulture, College of Agriculture and Life Science, Chungnam National University, Daejeon, Korea
Suk-Yoon Kwon Plant Systems Engineering Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
Yong-Min Kim Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea

Collapse

Fallaize CJ, Green PJ, Mardia KV, Barber S. Bayesian protein sequence and structure alignment. J R Stat Soc Ser C Appl Stat 2020. [DOI: 10.1111/rssc.12394] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]

Saidi R, Dhifli W, Maddouri M, Mephu Nguifo E. Efficiently Mining Recurrent Substructures from Protein Three-Dimensional Structure Graphs. J Comput Biol 2019;26:561-571. [DOI: 10.1089/cmb.2018.0171] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Antlion optimization algorithm for pairwise structural alignment with bi-objective functions. Neural Comput Appl 2019. [DOI: 10.1007/s00521-019-04176-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

Lee J, Son A, Kim P, Kwon SB, Yu JE, Han G, Seong BL. RNA‐dependent chaperone (chaperna) as an engineered pro‐region for the folding of recombinant microbial transglutaminase. Biotechnol Bioeng 2019;116:490-502. [DOI: 10.1002/bit.26879] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2018] [Revised: 11/15/2018] [Accepted: 11/22/2018] [Indexed: 12/14/2022]

Kim P, Jang YH, Kwon SB, Lee CM, Han G, Seong BL. Glycosylation of Hemagglutinin and Neuraminidase of Influenza A Virus as Signature for Ecological Spillover and Adaptation among Influenza Reservoirs. Viruses 2018;10:v10040183. [PMID: 29642453 PMCID: PMC5923477 DOI: 10.3390/v10040183] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2018] [Revised: 03/25/2018] [Accepted: 04/05/2018] [Indexed: 12/12/2022] Open

Aronsson A, Güler F, Petoukhov MV, Crennell SJ, Svergun DI, Linares-Pastén JA, Nordberg Karlsson E. Structural insights of Rm Xyn10A – A prebiotic-producing GH10 xylanase with a non-conserved aglycone binding region. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2018;1866:292-306. [DOI: 10.1016/j.bbapap.2017.11.006] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/20/2017] [Revised: 10/05/2017] [Accepted: 11/12/2017] [Indexed: 02/02/2023]

Dhifli W, Diallo AB. ProtNN: fast and accurate protein 3D-structure classification in structural and topological space. BioData Min 2016;9:30. [PMID: 27688811 PMCID: PMC5034655 DOI: 10.1186/s13040-016-0108-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2016] [Accepted: 08/22/2016] [Indexed: 11/30/2022] Open

Identification of amino acid networks governing catalysis in the closed complex of class I terpene synthases. Proc Natl Acad Sci U S A 2016;113:E958-67. [PMID: 26842837 DOI: 10.1073/pnas.1519680113] [Citation(s) in RCA: 49] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Gutiérrez FI, Rodriguez-Valenzuela F, Ibarra IL, Devos DP, Melo F. Efficient and automated large-scale detection of structural relationships in proteins with a flexible aligner. BMC Bioinformatics 2016;17:20. [PMID: 26732380 PMCID: PMC4702403 DOI: 10.1186/s12859-015-0866-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2015] [Accepted: 12/21/2015] [Indexed: 12/01/2022] Open

Abstract

Background

The total number of known three-dimensional protein structures is rapidly increasing. Consequently, the need for fast structural search against complete databases without a significant loss of accuracy is increasingly demanding. Recently, TopSearch, an ultra-fast method for finding rigid structural relationships between a query structure and the complete Protein Data Bank (PDB), at the multi-chain level, has been released. However, comparable accurate flexible structural aligners to perform efficient whole database searches of multi-domain proteins are not yet available. The availability of such a tool is critical for a sustainable boosting of biological discovery.

Results

Here we report on the development of a new method for the fast and flexible comparison of protein structure chains. The method relies on the calculation of 2D matrices containing a description of the three-dimensional arrangement of secondary structure elements (angles and distances). The comparison involves the matching of an ensemble of substructures through a nested-two-steps dynamic programming algorithm. The unique features of this new approach are the integration and trade-off balancing of the following: 1) speed, 2) accuracy and 3) global and semiglobal flexible structure alignment by integration of local substructure matching. The comparison, and matching with competitive accuracy, of one medium sized (250-aa) query structure against the complete PDB database (216,322 protein chains) takes about 8 min using an average desktop computer. The method is at least 2–3 orders of magnitude faster than other tested tools with similar accuracy. We validate the performance of the method for fold and superfamily assignment in a large benchmark set of protein structures. We finally provide a series of examples to illustrate the usefulness of this method and its application in biological discovery.

Conclusions

The method is able to detect partial structure matching, rigid body shifts, conformational changes and tolerates substantial structural variation arising from insertions, deletions and sequence divergence, as well as structural convergence of unrelated proteins.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0866-8) contains supplementary material, which is available to authorized users.

Collapse

Stamm M, Forrest LR. Structure alignment of membrane proteins: Accuracy of available tools and a consensus strategy. Proteins 2015;83:1720-32. [PMID: 26178143 DOI: 10.1002/prot.24857] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2015] [Revised: 05/07/2015] [Accepted: 06/07/2015] [Indexed: 12/31/2022]

Zhao C, Sacan A. UniAlign: protein structure alignment meets evolution. Bioinformatics 2015;31:3139-46. [PMID: 26059715 DOI: 10.1093/bioinformatics/btv354] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2015] [Accepted: 06/02/2015] [Indexed: 11/15/2022] Open

Carugo O. Protomers of protein hetero-oligomers tend to resemble each other more than expected. SPRINGERPLUS 2014;3:680. [PMID: 26034682 PMCID: PMC4447755 DOI: 10.1186/2193-1801-3-680] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/16/2014] [Accepted: 11/14/2014] [Indexed: 11/26/2022]

Micale G, Pulvirenti A, Giugno R, Ferro A. Proteins comparison through probabilistic optimal structure local alignment. Front Genet 2014;5:302. [PMID: 25228906 PMCID: PMC4151033 DOI: 10.3389/fgene.2014.00302] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2014] [Accepted: 08/12/2014] [Indexed: 11/13/2022] Open

Nicholls RA, Fischer M, McNicholas S, Murshudov GN. Conformation-independent structural comparison of macromolecules with ProSMART. ACTA CRYSTALLOGRAPHICA. SECTION D, BIOLOGICAL CRYSTALLOGRAPHY 2014;70:2487-99. [PMID: 25195761 PMCID: PMC4157452 DOI: 10.1107/s1399004714016241] [Citation(s) in RCA: 150] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/09/2014] [Accepted: 07/12/2014] [Indexed: 12/05/2023]

Ma J, Wang S. Algorithms, Applications, and Challenges of Protein Structure Alignment. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2014;94:121-75. [DOI: 10.1016/b978-0-12-800168-4.00005-6] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]

Protein structure alignment beyond spatial proximity. Sci Rep 2013;3:1448. [PMID: 23486213 PMCID: PMC3596798 DOI: 10.1038/srep01448] [Citation(s) in RCA: 98] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2012] [Accepted: 02/25/2013] [Indexed: 11/08/2022] Open

Topham CM, Rouquier M, Tarrat N, André I. Adaptive Smith-Waterman residue match seeding for protein structural alignment. Proteins 2013;81:1823-39. [DOI: 10.1002/prot.24327] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2013] [Revised: 04/22/2013] [Accepted: 05/15/2013] [Indexed: 12/30/2022]

Khan MB, Sponder G, Sjöblom B, Svidová S, Schweyen RJ, Carugo O, Djinović-Carugo K. Structural and functional characterization of the N-terminal domain of the yeast Mg2+channel Mrs2. ACTA CRYSTALLOGRAPHICA SECTION D: BIOLOGICAL CRYSTALLOGRAPHY 2013;69:1653-64. [DOI: 10.1107/s0907444913011712] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/09/2013] [Accepted: 04/29/2013] [Indexed: 01/08/2023]

Cheraghi R, Hosseinkhani S, Davoodi J, Nazari M, Amini-Bayat Z, Karimi H, Shamseddin M, Gheidari F. Structural and functional effects of circular permutation on firefly luciferase: In vitro assay of caspase 3/7. Int J Biol Macromol 2013;58:336-42. [DOI: 10.1016/j.ijbiomac.2013.04.015] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2012] [Revised: 03/28/2013] [Accepted: 04/08/2013] [Indexed: 02/08/2023]

Wang JJY, Bensmail H, Gao X. Multiple graph regularized protein domain ranking. BMC Bioinformatics 2012;13:307. [PMID: 23157331 PMCID: PMC3583823 DOI: 10.1186/1471-2105-13-307] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2012] [Accepted: 10/29/2012] [Indexed: 11/10/2022] Open

Ritchie DW, Ghoorah AW, Mavridis L, Venkatraman V. Fast protein structure alignment using Gaussian overlap scoring of backbone peptide fragment similarity. Bioinformatics 2012;28:3274-81. [DOI: 10.1093/bioinformatics/bts618] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Bonnel N, Marteau PF. LNA: fast protein structural comparison using a Laplacian characterization of tertiary structure. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2012;9:1451-1458. [PMID: 22547433 DOI: 10.1109/tcbb.2012.64] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]

Ho HK, Gange G, Kuiper MJ, Ramamohanarao K. BetaSearch: a new method for querying β-residue motifs. BMC Res Notes 2012;5:391. [PMID: 22839199 PMCID: PMC3532365 DOI: 10.1186/1756-0500-5-391] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2012] [Accepted: 06/15/2012] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Searching for structural motifs across known protein structures can be useful for identifying unrelated proteins with similar function and characterising secondary structures such as β-sheets. This is infeasible using conventional sequence alignment because linear protein sequences do not contain spatial information. β-residue motifs are β-sheet substructures that can be represented as graphs and queried using existing graph indexing methods, however, these approaches are designed for general graphs that do not incorporate the inherent structural constraints of β-sheets and require computationally-expensive filtering and verification procedures. 3D substructure search methods, on the other hand, allow β-residue motifs to be queried in a three-dimensional context but at significant computational costs.

FINDINGS

We developed a new method for querying β-residue motifs, called BetaSearch, which leverages the natural planar constraints of β-sheets by indexing them as 2D matrices, thus avoiding much of the computational complexities involved with structural and graph querying. BetaSearch exhibits faster filtering, verification, and overall query time than existing graph indexing approaches whilst producing comparable index sizes. Compared to 3D substructure search methods, BetaSearch achieves 33 and 240 times speedups over index-based and pairwise alignment-based approaches, respectively. Furthermore, we have presented case-studies to demonstrate its capability of motif matching in sequentially dissimilar proteins and described a method for using BetaSearch to predict β-strand pairing.

CONCLUSIONS

We have demonstrated that BetaSearch is a fast method for querying substructure motifs. The improvements in speed over existing approaches make it useful for efficiently performing high-volume exploratory querying of possible protein substructural motifs or conformations. BetaSearch was used to identify a nearly identical β-residue motif between an entirely synthetic (Top7) and a naturally-occurring protein (Charcot-Leyden crystal protein), as well as identifying structural similarities between biotin-binding domains of avidin, streptavidin and the lipocalin gamma subunit of human C8.

Collapse

Shealy P, Valafar H. Multiple structure alignment with msTALI. BMC Bioinformatics 2012;13:105. [PMID: 22607234 PMCID: PMC3473313 DOI: 10.1186/1471-2105-13-105] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2011] [Accepted: 04/18/2012] [Indexed: 11/10/2022] Open

Wang J, Gao X, Wang Q, Li Y. ProDis-ContSHC: learning protein dissimilarity measures and hierarchical context coherently for protein-protein comparison in protein database retrieval. BMC Bioinformatics 2012;13 Suppl 7:S2. [PMID: 22594999 PMCID: PMC3348016 DOI: 10.1186/1471-2105-13-s7-s2] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open

Abstract

BACKGROUND

The need to retrieve or classify protein molecules using structure or sequence-based similarity measures underlies a wide range of biomedical applications. Traditional protein search methods rely on a pairwise dissimilarity/similarity measure for comparing a pair of proteins. This kind of pairwise measures suffer from the limitation of neglecting the distribution of other proteins and thus cannot satisfy the need for high accuracy of the retrieval systems. Recent work in the machine learning community has shown that exploiting the global structure of the database and learning the contextual dissimilarity/similarity measures can improve the retrieval performance significantly. However, most existing contextual dissimilarity/similarity learning algorithms work in an unsupervised manner, which does not utilize the information of the known class labels of proteins in the database.

RESULTS

In this paper, we propose a novel protein-protein dissimilarity learning algorithm, ProDis-ContSHC. ProDis-ContSHC regularizes an existing dissimilarity measure dij by considering the contextual information of the proteins. The context of a protein is defined by its neighboring proteins. The basic idea is, for a pair of proteins (i, j), if their context N(i) and N(j) is similar to each other, the two proteins should also have a high similarity. We implement this idea by regularizing dij by a factor learned from the context N(i) and N(j).Moreover, we divide the context to hierarchial sub-context and get the contextual dissimilarity vector for each protein pair. Using the class label information of the proteins, we select the relevant (a pair of proteins that has the same class labels) and irrelevant (with different labels) protein pairs, and train an SVM model to distinguish between their contextual dissimilarity vectors. The SVM model is further used to learn a supervised regularizing factor. Finally, with the new Supervised learned Dissimilarity measure, we update the Protein Hierarchial Context Coherently in an iterative algorithm--ProDis-ContSHC.We test the performance of ProDis-ContSHC on two benchmark sets, i.e., the ASTRAL 1.73 database and the FSSP/DALI database. Experimental results demonstrate that plugging our supervised contextual dissimilarity measures into the retrieval systems significantly outperforms the context-free dissimilarity/similarity measures and other unsupervised contextual dissimilarity measures that do not use the class label information.

CONCLUSIONS

Using the contextual proteins with their class labels in the database, we can improve the accuracy of the pairwise dissimilarity/similarity measures dramatically for the protein retrieval tasks. In this work, for the first time, we propose the idea of supervised contextual dissimilarity learning, resulting in the ProDis-ContSHC algorithm. Among different contextual dissimilarity learning approaches that can be used to compare a pair of proteins, ProDis-ContSHC provides the highest accuracy. Finally, ProDis-ContSHC compares favorably with other methods reported in the recent literature.

Collapse

SALEM SAEED, ZAKI MOHAMMEDJ, BYSTROFF CHRISTOPHER. ITERATIVE NON-SEQUENTIAL PROTEIN STRUCTURAL ALIGNMENT. J Bioinform Comput Biol 2011;7:571-96. [PMID: 19507290 DOI: 10.1142/s0219720009004205] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2008] [Revised: 11/05/2008] [Accepted: 11/06/2008] [Indexed: 11/18/2022]

Daniluk P, Lesyng B. A novel method to compare protein structures using local descriptors. BMC Bioinformatics 2011;12:344. [PMID: 21849047 PMCID: PMC3179968 DOI: 10.1186/1471-2105-12-344] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2011] [Accepted: 08/17/2011] [Indexed: 11/15/2022] Open

Shen YF, Li B, Liu ZP. Protein structure alignment based on internal coordinates. Interdiscip Sci 2010;2:308-19. [DOI: 10.1007/s12539-010-0019-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2008] [Revised: 01/05/2010] [Accepted: 01/06/2010] [Indexed: 10/18/2022]

Chu CH, Lo WC, Wang HW, Hsu YC, Hwang JK, Lyu PC, Pai TW, Tang CY. Detection and alignment of 3D domain swapping proteins using angle-distance image-based secondary structural matching techniques. PLoS One 2010;5:e13361. [PMID: 20976204 PMCID: PMC2955075 DOI: 10.1371/journal.pone.0013361] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2010] [Accepted: 09/13/2010] [Indexed: 11/18/2022] Open

Abstract

This work presents a novel detection method for three-dimensional domain swapping (DS), a mechanism for forming protein quaternary structures that can be visualized as if monomers had “opened” their “closed” structures and exchanged the opened portion to form intertwined oligomers. Since the first report of DS in the mid 1990s, an increasing number of identified cases has led to the postulation that DS might occur in a protein with an unconstrained terminus under appropriate conditions. DS may play important roles in the molecular evolution and functional regulation of proteins and the formation of depositions in Alzheimer's and prion diseases. Moreover, it is promising for designing auto-assembling biomaterials. Despite the increasing interest in DS, related bioinformatics methods are rarely available. Owing to a dramatic conformational difference between the monomeric/closed and oligomeric/open forms, conventional structural comparison methods are inadequate for detecting DS. Hence, there is also a lack of comprehensive datasets for studying DS. Based on angle-distance (A-D) image transformations of secondary structural elements (SSEs), specific patterns within A-D images can be recognized and classified for structural similarities. In this work, a matching algorithm to extract corresponding SSE pairs from A-D images and a novel DS score have been designed and demonstrated to be applicable to the detection of DS relationships. The Matthews correlation coefficient (MCC) and sensitivity of the proposed DS-detecting method were higher than 0.81 even when the sequence identities of the proteins examined were lower than 10%. On average, the alignment percentage and root-mean-square distance (RMSD) computed by the proposed method were 90% and 1.8Å for a set of 1,211 DS-related pairs of proteins. The performances of structural alignments remain high and stable for DS-related homologs with less than 10% sequence identities. In addition, the quality of its hinge loop determination is comparable to that of manual inspection. This method has been implemented as a web-based tool, which requires two protein structures as the input and then the type and/or existence of DS relationships between the input structures are determined according to the A-D image-based structural alignments and the DS score. The proposed method is expected to trigger large-scale studies of this interesting structural phenomenon and facilitate related applications.

Collapse

Cagnoli C, Stevanin G, Brussino A, Barberis M, Mancini C, Margolis RL, Holmes SE, Nobili M, Forlani S, Padovan S, Pappi P, Zaros C, Leber I, Ribai P, Pugliese L, Assalto C, Brice A, Migone N, Dürr A, Brusco A. Missense mutations in the AFG3L2 proteolytic domain account for ∼1.5% of European autosomal dominant cerebellar ataxias. Hum Mutat 2010;31:1117-24. [PMID: 20725928 DOI: 10.1002/humu.21342] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]

Stivala AD, Stuckey PJ, Wirth AI. Fast and accurate protein substructure searching with simulated annealing and GPUs. BMC Bioinformatics 2010;11:446. [PMID: 20813068 PMCID: PMC2944279 DOI: 10.1186/1471-2105-11-446] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2010] [Accepted: 09/03/2010] [Indexed: 11/10/2022] Open

Wohlers I, Domingues FS, Klau GW. Towards optimal alignment of protein structure distance matrices. Bioinformatics 2010;26:2273-80. [PMID: 20639543 DOI: 10.1093/bioinformatics/btq420] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open

The -galactosidase type A gene aglA from Aspergillus niger encodes a fully functional -N-acetylgalactosaminidase. Glycobiology 2010;20:1410-9. [DOI: 10.1093/glycob/cwq105] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

The challenge of annotating protein sequences: The tale of eight domains of unknown function in Pfam. Comput Biol Chem 2010;34:210-4. [PMID: 20537955 DOI: 10.1016/j.compbiolchem.2010.04.001] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2010] [Revised: 04/09/2010] [Accepted: 04/25/2010] [Indexed: 11/21/2022]

Hong KW, Jin HS, Lim JE, Cho YS, Go MJ, Jung J, Lee JE, Choi J, Shin C, Hwang SY, Lee SH, Park HK, Oh B. Non-synonymous single-nucleotide polymorphisms associated with blood pressure and hypertension. J Hum Hypertens 2010;24:763-74. [PMID: 20147969 DOI: 10.1038/jhh.2010.9] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]

Kairys V, Gilson MK, Lather V, Schiffer CA, Fernandes MX. Toward the design of mutation-resistant enzyme inhibitors: further evaluation of the substrate envelope hypothesis. Chem Biol Drug Des 2009;74:234-45. [PMID: 19703025 DOI: 10.1111/j.1747-0285.2009.00851.x] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]

PAUL: protein structural alignment using integer linear programming and Lagrangian relaxation. BMC Bioinformatics 2009. [PMCID: PMC2764133 DOI: 10.1186/1471-2105-10-s13-p2] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Micheletti C, Orland H. MISTRAL: a tool for energy-based multiple structural alignment of proteins. ACTA ACUST UNITED AC 2009;25:2663-9. [PMID: 19692555 DOI: 10.1093/bioinformatics/btp506] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]

Chi PH, Pang B, Korkin D, Shyu CR. Efficient SCOP-fold classification and retrieval using index-based protein substructure alignments. ACTA ACUST UNITED AC 2009;25:2559-65. [PMID: 19667079 DOI: 10.1093/bioinformatics/btp474] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

Kim C, Tai CH, Lee B. Iterative refinement of structure-based sequence alignments by Seed Extension. BMC Bioinformatics 2009;10:210. [PMID: 19589133 PMCID: PMC2753854 DOI: 10.1186/1471-2105-10-210] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2009] [Accepted: 07/09/2009] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Accurate sequence alignment is required in many bioinformatics applications but, when sequence similarity is low, it is difficult to obtain accurate alignments based on sequence similarity alone. The accuracy improves when the structures are available, but current structure-based sequence alignment procedures still mis-align substantial numbers of residues. In order to correct such errors, we previously explored the possibility of replacing the residue-based dynamic programming algorithm in structure alignment procedures with the Seed Extension algorithm, which does not use a gap penalty. Here, we describe a new procedure called RSE (Refinement with Seed Extension) that iteratively refines a structure-based sequence alignment.

RESULTS

RSE uses SE (Seed Extension) in its core, which is an algorithm that we reported recently for obtaining a sequence alignment from two superimposed structures. The RSE procedure was evaluated by comparing the correctly aligned fractions of residues before and after the refinement of the structure-based sequence alignments produced by popular programs. CE, DaliLite, FAST, LOCK2, MATRAS, MATT, TM-align, SHEBA and VAST were included in this analysis and the NCBI's CDD root node set was used as the reference alignments. RSE improved the average accuracy of sequence alignments for all programs tested when no shift error was allowed. The amount of improvement varied depending on the program. The average improvements were small for DaliLite and MATRAS but about 5% for CE and VAST. More substantial improvements have been seen in many individual cases. The additional computation times required for the refinements were negligible compared to the times taken by the structure alignment programs.

CONCLUSION

RSE is a computationally inexpensive way of improving the accuracy of a structure-based sequence alignment. It can be used as a standalone procedure following a regular structure-based sequence alignment or to replace the traditional iterative refinement procedures based on residue-level dynamic programming algorithm in many structure alignment programs.

Collapse

Margraf T, Schenk G, Torda AE. The SALAMI protein structure search server. Nucleic Acids Res 2009;37:W480-4. [PMID: 19465380 PMCID: PMC2703935 DOI: 10.1093/nar/gkp431] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open

Thompson KE, Wang Y, Madej T, Bryant SH. Improving protein structure similarity searches using domain boundaries based on conserved sequence information. BMC STRUCTURAL BIOLOGY 2009;9:33. [PMID: 19454035 PMCID: PMC2694201 DOI: 10.1186/1472-6807-9-33] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/07/2008] [Accepted: 05/19/2009] [Indexed: 11/10/2022]