Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Drew K, Winters P, Butterfoss GL, Berstis V, Uplinger K, Armstrong J, Riffle M, Schweighofer E, Bovermann B, Goodlett DR, Davis TN, Shasha D, Malmström L, Bonneau R. The Proteome Folding Project: proteome-scale prediction of structure and function. Genome Res 2011;21:1981-94. [PMID: 21824995 DOI: 10.1101/gr.121475.111] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

For:	Drew K, Winters P, Butterfoss GL, Berstis V, Uplinger K, Armstrong J, Riffle M, Schweighofer E, Bovermann B, Goodlett DR, Davis TN, Shasha D, Malmström L, Bonneau R. The Proteome Folding Project: proteome-scale prediction of structure and function. Genome Res 2011;21:1981-94. [PMID: 21824995 DOI: 10.1101/gr.121475.111] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

Number

Cited by Other Article(s)

Gracia B, Montes P, Gutierrez AM, Arun B, Karras GI. Protein-folding chaperones predict structure-function relationships and cancer risk in BRCA1 mutation carriers. Cell Rep 2024;43:113803. [PMID: 38368609 PMCID: PMC10941025 DOI: 10.1016/j.celrep.2024.113803] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 12/28/2023] [Accepted: 02/01/2024] [Indexed: 02/20/2024] Open

Du K, Huang H. Development of anti-PD-L1 antibody based on structure prediction of AlphaFold2. Front Immunol 2023;14:1275999. [PMID: 37942332 PMCID: PMC10628240 DOI: 10.3389/fimmu.2023.1275999] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Accepted: 10/11/2023] [Indexed: 11/10/2023] Open

Gracia B, Montes P, Gutierrez AM, Arun B, Karras GI. Protein-Folding Chaperones Predict Structure-Function Relationships and Cancer Risk in BRCA1 Mutation Carriers. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.14.557795. [PMID: 37745493 PMCID: PMC10515940 DOI: 10.1101/2023.09.14.557795] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]

Reuss JM, Alonso-Gamo L, Garcia-Aranda M, Reuss D, Albi M, Albi B, Vilaboa D, Vilaboa B. Oral Mucosa in Cancer Patients-Putting the Pieces Together: A Narrative Review and New Perspectives. Cancers (Basel) 2023;15:3295. [PMID: 37444405 DOI: 10.3390/cancers15133295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 06/15/2023] [Accepted: 06/18/2023] [Indexed: 07/15/2023] Open

Kuang D, Issakova D, Kim J. Learning Proteome Domain Folding Using LSTMs in an Empirical Kernel Space. J Mol Biol 2022;434:167686. [PMID: 35716781 DOI: 10.1016/j.jmb.2022.167686] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Revised: 06/08/2022] [Accepted: 06/10/2022] [Indexed: 11/30/2022]

Tunyasuvunakool K, Adler J, Wu Z, Green T, Zielinski M, Žídek A, Bridgland A, Cowie A, Meyer C, Laydon A, Velankar S, Kleywegt GJ, Bateman A, Evans R, Pritzel A, Figurnov M, Ronneberger O, Bates R, Kohl SAA, Potapenko A, Ballard AJ, Romera-Paredes B, Nikolov S, Jain R, Clancy E, Reiman D, Petersen S, Senior AW, Kavukcuoglu K, Birney E, Kohli P, Jumper J, Hassabis D. Highly accurate protein structure prediction for the human proteome. Nature 2021;596:590-596. [PMID: 34293799 PMCID: PMC8387240 DOI: 10.1038/s41586-021-03828-1] [Citation(s) in RCA: 1378] [Impact Index Per Article: 459.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Accepted: 07/16/2021] [Indexed: 02/07/2023]

Pliss A, Kuzmin AN, Lita A, Kumar R, Celiku O, Atilla-Gokcumen GE, Gokcumen O, Chandra D, Larion M, Prasad PN. A Single-Organelle Optical Omics Platform for Cell Science and Biomarker Discovery. Anal Chem 2021;93:8281-8290. [PMID: 34048235 DOI: 10.1021/acs.analchem.1c01131] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Pechmann S. Programmed Trade-offs in Protein Folding Networks. Structure 2020;28:1361-1375.e4. [PMID: 33053320 DOI: 10.1016/j.str.2020.09.009] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2020] [Revised: 07/25/2020] [Accepted: 09/23/2020] [Indexed: 12/14/2022]

Koehler Leman J, Weitzner BD, Renfrew PD, Lewis SM, Moretti R, Watkins AM, Mulligan VK, Lyskov S, Adolf-Bryfogle J, Labonte JW, Krys J, Bystroff C, Schief W, Gront D, Schueler-Furman O, Baker D, Bradley P, Dunbrack R, Kortemme T, Leaver-Fay A, Strauss CEM, Meiler J, Kuhlman B, Gray JJ, Bonneau R. Better together: Elements of successful scientific software development in a distributed collaborative community. PLoS Comput Biol 2020;16:e1007507. [PMID: 32365137 PMCID: PMC7197760 DOI: 10.1371/journal.pcbi.1007507] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open

Affiliation(s)

Julia Koehler Leman Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, United States of America Dept of Biology, New York University, New York, NY, United States of America
Brian D. Weitzner Dept of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States of America Dept of Biochemistry, University of Washington, Seattle, WA, United States of America Institute for Protein Design, University of Washington, Seattle, WA, United States of America Lyell Immunopharma, Seattle, WA, United States of America
P. Douglas Renfrew Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, United States of America
Steven M. Lewis Dept of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States of America Dept of Biochemistry, Duke University, Durham, NC, United States of America Cyrus Biotechnology, Seattle, WA United States of America
Rocco Moretti Dept of Chemistry, Vanderbilt University, Nashville, TN, United States of America
Andrew M. Watkins Dept of Biochemistry, Stanford University School of Medicine, Stanford CA, United States of America
Vikram Khipple Mulligan Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, United States of America Dept of Biochemistry, University of Washington, Seattle, WA, United States of America Institute for Protein Design, University of Washington, Seattle, WA, United States of America
Sergey Lyskov Dept of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States of America
Jared Adolf-Bryfogle Dept of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA, United States of America
Jason W. Labonte Dept of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States of America Dept of Chemistry, Franklin & Marshall College, Lancaster, PA, United States of America
Justyna Krys Dept of Chemistry, University of Warsaw, Warsaw, Poland
RosettaCommons Consortium
Christopher Bystroff Dept of Biological Sciences, Rensselaer Polytechnic Institute, Troy, NY, United States of America
William Schief Dept of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA, United States of America
Dominik Gront Dept of Chemistry, University of Warsaw, Warsaw, Poland
Ora Schueler-Furman Dept of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
David Baker Dept of Biochemistry, University of Washington, Seattle, WA, United States of America Institute for Protein Design, University of Washington, Seattle, WA, United States of America
Philip Bradley Fred Hutchinson Cancer Research Center, Seattle, WA, United States of America
Roland Dunbrack Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia PA, United States of America
Tanja Kortemme Dept of Bioengineering and Therapeutic Sciences, University of California San Francisco, CA, United States of America
Andrew Leaver-Fay Dept of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States of America
Charlie E. M. Strauss Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States of America
Jens Meiler Depts of Chemistry, Pharmacology and Biomedical Informatics, Vanderbilt University, Nashville, TN, United States of America Center for Structural Biology, Vanderbilt University, Nashville, TN, United States of America Institute for Chemical Biology, Vanderbilt University, Nashville, TN, United States of America Institute for Drug Discovery, Leipzig University, Leipzig, Germany
Brian Kuhlman Dept of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States of America
Jeffrey J. Gray Dept of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States of America
Richard Bonneau Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, United States of America Dept of Biology, New York University, New York, NY, United States of America Dept of Computer Science, New York University, New York, NY, United States of America Center for Data Science, New York University, New York, NY, United States of America

Collapse

Langmead B, Nellore A. Cloud computing for genomic data analysis and collaboration. Nat Rev Genet 2018;19:208-219. [PMID: 29379135 PMCID: PMC6452449 DOI: 10.1038/nrg.2017.113] [Citation(s) in RCA: 103] [Impact Index Per Article: 17.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]

Nardo AE, Añón MC, Parisi G. Large-scale mapping of bioactive peptides in structural and sequence space. PLoS One 2018;13:e0191063. [PMID: 29351315 PMCID: PMC5774755 DOI: 10.1371/journal.pone.0191063] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2017] [Accepted: 12/27/2017] [Indexed: 12/11/2022] Open

Monzon AM, Zea DJ, Marino-Buslje C, Parisi G. Homology modeling in a dynamical world. Protein Sci 2017;26:2195-2206. [PMID: 28815769 DOI: 10.1002/pro.3274] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2017] [Revised: 08/09/2017] [Accepted: 08/09/2017] [Indexed: 12/31/2022]

Bao W, Wang D, Chen Y. Classification of Protein Structure Classes on Flexible Neutral Tree. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017;14:1122-1133. [PMID: 28113983 DOI: 10.1109/tcbb.2016.2610967] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]

Middleton SA, Illuminati J, Kim J. Complete fold annotation of the human proteome using a novel structural feature space. Sci Rep 2017;7:46321. [PMID: 28406174 PMCID: PMC5390313 DOI: 10.1038/srep46321] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2017] [Accepted: 03/14/2017] [Indexed: 11/11/2022] Open

Van Holle S, Rougé P, Van Damme EJM. Evolution and structural diversification of Nictaba-like lectin genes in food crops with a focus on soybean (Glycine max). ANNALS OF BOTANY 2017;119:901-914. [PMID: 28087663 PMCID: PMC5379587 DOI: 10.1093/aob/mcw259] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2016] [Revised: 10/24/2016] [Accepted: 11/17/2016] [Indexed: 05/10/2023]

Abstract

Background and Aims

The Nictaba family groups all proteins that show homology to Nictaba, the tobacco lectin. So far, Nictaba and an Arabidopsis thaliana homologue have been shown to be implicated in the plant stress response. The availability of more than 50 sequenced plant genomes provided the opportunity for a genome-wide identification of Nictaba -like genes in 15 species, representing members of the Fabaceae, Poaceae, Solanaceae, Musaceae, Arecaceae, Malvaceae and Rubiaceae. Additionally, phylogenetic relationships between the different species were explored. Furthermore, this study included domain organization analysis, searching for orthologous genes in the legume family and transcript profiling of the Nictaba -like lectin genes in soybean.

Methods

Using a combination of BLASTp, InterPro analysis and hidden Markov models, the genomes of Medicago truncatula , Cicer arietinum , Lotus japonicus , Glycine max , Cajanus cajan , Phaseolus vulgaris , Theobroma cacao , Solanum lycopersicum , Solanum tuberosum , Coffea canephora , Oryza sativa , Zea mays, Sorghum bicolor , Musa acuminata and Elaeis guineensis were searched for Nictaba -like genes. Phylogenetic analysis was performed using RAxML and additional protein domains in the Nictaba-like sequences were identified using InterPro. Expression analysis of the soybean Nictaba -like genes was investigated using microarray data.

Key Results

Nictaba -like genes were identified in all studied species and analysis of the duplication events demonstrated that both tandem and segmental duplication contributed to the expansion of the Nictaba gene family in angiosperms. The single-domain Nictaba protein and the multi-domain F-box Nictaba architectures are ubiquitous among all analysed species and microarray analysis revealed differential expression patterns for all soybean Nictaba-like genes.

Conclusions

Taken together, the comparative genomics data contributes to our understanding of the Nictaba -like gene family in species for which the occurrence of Nictaba domains had not yet been investigated. Given the ubiquitous nature of these genes, they have probably acquired new functions over time and are expected to take on various roles in plant development and defence.

Collapse

Sheu MJ, Hsieh MJ, Chou YE, Wang PH, Yeh CB, Yang SF, Lee HL, Liu YF. Effects of ADAMTS14 genetic polymorphism and cigarette smoking on the clinicopathologic development of hepatocellular carcinoma. PLoS One 2017;12:e0172506. [PMID: 28231306 PMCID: PMC5322915 DOI: 10.1371/journal.pone.0172506] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2016] [Accepted: 02/05/2017] [Indexed: 01/12/2023] Open

Taghipour S, Zarrineh P, Ganjtabesh M, Nowzari-Dalini A. Improving protein complex prediction by reconstructing a high-confidence protein-protein interaction network of Escherichia coli from different physical interaction data sources. BMC Bioinformatics 2017;18:10. [PMID: 28049415 PMCID: PMC5209909 DOI: 10.1186/s12859-016-1422-x] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2016] [Accepted: 12/12/2016] [Indexed: 11/10/2022] Open

Jing R, Sun J, Wang Y, Li M. Domain position prediction based on sequence information by using fuzzy mean operator. Proteins 2015;83:1462-9. [PMID: 26009844 DOI: 10.1002/prot.24833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2015] [Revised: 04/23/2015] [Accepted: 05/17/2015] [Indexed: 11/09/2022]

Secondary and Tertiary Structure Prediction of Proteins: A Bioinformatic Approach. COMPLEX SYSTEM MODELLING AND CONTROL THROUGH INTELLIGENT SOFT COMPUTATIONS 2015. [DOI: 10.1007/978-3-319-12883-2_19] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]

Zou T, Williams N, Ozkan SB, Ghosh K. Proteome folding kinetics is limited by protein halflife. PLoS One 2014;9:e112701. [PMID: 25393560 PMCID: PMC4231061 DOI: 10.1371/journal.pone.0112701] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2014] [Accepted: 10/10/2014] [Indexed: 12/29/2022] Open

Mahajan S, de Brevern AG, Sanejouand YH, Srinivasan N, Offmann B. Use of a structural alphabet to find compatible folds for amino acid sequences. Protein Sci 2014;24:145-53. [PMID: 25297700 DOI: 10.1002/pro.2581] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2014] [Accepted: 10/06/2014] [Indexed: 01/01/2023]

Joseph AP, de Brevern AG. From local structure to a global framework: recognition of protein folds. J R Soc Interface 2014;11:20131147. [PMID: 24740960 DOI: 10.1098/rsif.2013.1147] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open

Rhee SY, Mutwil M. Towards revealing the functions of all genes in plants. TRENDS IN PLANT SCIENCE 2014;19:212-21. [PMID: 24231067 DOI: 10.1016/j.tplants.2013.10.006] [Citation(s) in RCA: 146] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/06/2013] [Revised: 10/10/2013] [Accepted: 10/16/2013] [Indexed: 05/19/2023]

Abrusán G, Zhang Y, Szilágyi A. Structure prediction and analysis of DNA transposon and LINE retrotransposon proteins. J Biol Chem 2013;288:16127-38. [PMID: 23530042 PMCID: PMC3668768 DOI: 10.1074/jbc.m113.451500] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2013] [Revised: 03/21/2013] [Indexed: 01/15/2023] Open

Youngs N, Penfold-Brown D, Drew K, Shasha D, Bonneau R. Parametric Bayesian priors and better choice of negative examples improve protein function prediction. Bioinformatics 2013;29:1190-8. [PMID: 23511543 PMCID: PMC3634187 DOI: 10.1093/bioinformatics/btt110] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open

Tertiary model of a plant cellulose synthase. Proc Natl Acad Sci U S A 2013;110:7512-7. [PMID: 23592721 DOI: 10.1073/pnas.1301027110] [Citation(s) in RCA: 121] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open

Brylinski M. The utility of artificially evolved sequences in protein threading and fold recognition. J Theor Biol 2013;328:77-88. [PMID: 23542050 DOI: 10.1016/j.jtbi.2013.03.018] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2012] [Revised: 01/24/2013] [Accepted: 03/18/2013] [Indexed: 12/23/2022]

Abstract

Template-based protein structure prediction plays an important role in Functional Genomics by providing structural models of gene products, which can be utilized by structure-based approaches to function inference. From a systems level perspective, the high structural coverage of gene products in a given organism is critical. Despite continuous efforts towards the development of more sensitive threading approaches, confident structural models cannot be constructed for a considerable fraction of proteins due to difficulties in recognizing low-sequence identity templates with a similar fold to the target. Here we introduce a new modeling stratagem, which employs a library of synthetic sequences to improve template ranking in fold recognition by sequence profile-based methods. We developed a new method for the optimization of generic protein-like amino acid sequences to stabilize the respective structures using a combined empirical scoring function, which is compatible with these commonly used in protein threading and fold recognition. We show that the artificially evolved sequences, whose average sequence identity to the wild-type sequences is as low as 13.8%, have significant capabilities to recognize the correct structures. Importantly, the quality of the corresponding threading alignments is comparable to these constructed using conventional wild-type approaches (the average TM-score is 0.48 and 0.54, respectively). Fold recognition that uses data fusion to combine ranks calculated for both wild-type and synthetic template libraries systematically improves the detection of structural analogs. Depending on the threading algorithm used, it yields on average 4-16% higher recognition rates than using the wild-type template library alone. Synthetic sequences artificially evolved for the template structures provide an orthogonal source of signal that could be exploited to detect these templates unrecognized by standard modeling techniques. It opens up new directions in the development of more sensitive threading methods with the enhanced capabilities of targeting difficult, midnight zone templates.

Collapse

Fang H, Gough J. A domain-centric solution to functional genomics via dcGO Predictor. BMC Bioinformatics 2013;14 Suppl 3:S9. [PMID: 23514627 PMCID: PMC3584936 DOI: 10.1186/1471-2105-14-s3-s9] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open

Abstract

Background

Computational/manual annotations of protein functions are one of the first routes to making sense of a newly sequenced genome. Protein domain predictions form an essential part of this annotation process. This is due to the natural modularity of proteins with domains as structural, evolutionary and functional units. Sometimes two, three, or more adjacent domains (called supra-domains) are the operational unit responsible for a function, e.g. via a binding site at the interface. These supra-domains have contributed to functional diversification in higher organisms. Traditionally functional ontologies have been applied to individual proteins, rather than families of related domains and supra-domains. We expect, however, to some extent functional signals can be carried by protein domains and supra-domains, and consequently used in function prediction and functional genomics.

Results

Here we present a domain-centric Gene Ontology (dcGO) perspective. We generalize a framework for automatically inferring ontological terms associated with domains and supra-domains from full-length sequence annotations. This general framework has been applied specifically to primary protein-level annotations from UniProtKB-GOA, generating GO term associations with SCOP domains and supra-domains. The resulting 'dcGO Predictor', can be used to provide functional annotation to protein sequences. The functional annotation of sequences in the Critical Assessment of Function Annotation (CAFA) has been used as a valuable opportunity to validate our method and to be assessed by the community. The functional annotation of all completely sequenced genomes has demonstrated the potential for domain-centric GO enrichment analysis to yield functional insights into newly sequenced or yet-to-be-annotated genomes. This generalized framework we have presented has also been applied to other domain classifications such as InterPro and Pfam, and other ontologies such as mammalian phenotype and disease ontology. The dcGO and its predictor are available at http://supfam.org/SUPERFAMILY/dcGO including an enrichment analysis tool.

Conclusions

As functional units, domains offer a unique perspective on function prediction regardless of whether proteins are multi-domain or single-domain. The 'dcGO Predictor' holds great promise for contributing to a domain-centric functional understanding of genomes in the next generation sequencing era.

Collapse

Fey P, Dodson RJ, Basu S, Chisholm RL. One stop shop for everything Dictyostelium: dictyBase and the Dicty Stock Center in 2012. Methods Mol Biol 2013;983:59-92. [PMID: 23494302 DOI: 10.1007/978-1-62703-302-2_4] [Citation(s) in RCA: 120] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

The mRNA-bound proteome and its global occupancy profile on protein-coding transcripts. Mol Cell 2012;46:674-90. [PMID: 22681889 DOI: 10.1016/j.molcel.2012.05.021] [Citation(s) in RCA: 877] [Impact Index Per Article: 73.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2012] [Revised: 05/14/2012] [Accepted: 05/17/2012] [Indexed: 01/17/2023]

Pentony MM, Winters P, Penfold-Brown D, Drew K, Narechania A, DeSalle R, Bonneau R, Purugganan MD. The plant proteome folding project: structure and positive selection in plant protein families. Genome Biol Evol 2012;4:360-71. [PMID: 22345424 PMCID: PMC3318447 DOI: 10.1093/gbe/evs015] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open

Ashworth J, Wurtmann EJ, Baliga NS. Reverse engineering systems models of regulation: discovery, prediction and mechanisms. Curr Opin Biotechnol 2011;23:598-603. [PMID: 22209016 DOI: 10.1016/j.copbio.2011.12.005] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2011] [Accepted: 12/08/2011] [Indexed: 10/14/2022]