Reference Citation Analysis

For: Wilkins A, Erdin S, Lua R, Lichtarge O. Evolutionary trace for prediction and redesign of protein functional sites. Methods Mol Biol 2012;819:29-42. [PMID: 22183528 DOI: 10.1007/978-1-61779-465-0_3] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Cited by Other Article(s)

Amalfitano A, Stocchi N, Atencio HM, Villarreal F, Ten Have A. Seqrutinator: scrutiny of large protein superfamily sequence datasets for the identification and elimination of non-functional homologues. Genome Biol 2024;25:230. [PMID: 39187866 PMCID: PMC11346255 DOI: 10.1186/s13059-024-03371-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Accepted: 08/13/2024] [Indexed: 08/28/2024] Open

Pomarici ND, Cacciato R, Kokot J, Fernández-Quintero ML, Liedl KR. Evolution of the Immunoglobulin Isotypes-Variations of Biophysical Properties among Animal Classes. Biomolecules 2023;13:801. [PMID: 37238671 PMCID: PMC10216798 DOI: 10.3390/biom13050801] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 05/03/2023] [Accepted: 05/05/2023] [Indexed: 05/28/2023] Open

Huh E, Agosto MA, Wensel TG, Lichtarge O. Coevolutionary signals in metabotropic glutamate receptors capture residue contacts and long-range functional interactions. J Biol Chem 2023;299:103030. [PMID: 36806686 PMCID: PMC10060750 DOI: 10.1016/j.jbc.2023.103030] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 02/09/2023] [Accepted: 02/10/2023] [Indexed: 02/18/2023] Open

Walther D. Specifics of Metabolite-Protein Interactions and Their Computational Analysis and Prediction. Methods Mol Biol 2023;2554:179-197. [PMID: 36178627 DOI: 10.1007/978-1-0716-2624-5_12] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]

Wang L, Guo S, Zeng B, Wang S, Chen Y, Cheng S, Liu B, Wang C, Wang Y, Meng Q. Draft Genome Assembly and Annotation for Cutaneotrichosporon dermatis NICC30027, an Oleaginous Yeast Capable of Simultaneous Glucose and Xylose Assimilation. Mycobiology 2022;50:69-81. [PMID: 35291590 PMCID: PMC8890563 DOI: 10.1080/12298093.2022.2038844] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 01/10/2022] [Accepted: 02/02/2022] [Indexed: 06/14/2023]

Affiliation(s)

Laiyou Wang: School of Biological and Chemical Engineering, Nanyang Institute of Technology, Nanyang, China; Henan Key Laboratory of Industrial Microbial Resources and Fermentation Technology, Nanyang Institute of Technology, Nanyang, China
Shuxian Guo: School of Biological and Chemical Engineering, Nanyang Institute of Technology, Nanyang, China; Henan Key Laboratory of Industrial Microbial Resources and Fermentation Technology, Nanyang Institute of Technology, Nanyang, China
Bo Zeng: School of Biological and Chemical Engineering, Nanyang Institute of Technology, Nanyang, China; Henan Key Laboratory of Industrial Microbial Resources and Fermentation Technology, Nanyang Institute of Technology, Nanyang, China
Shanshan Wang: School of Biological and Chemical Engineering, Nanyang Institute of Technology, Nanyang, China; Henan Key Laboratory of Industrial Microbial Resources and Fermentation Technology, Nanyang Institute of Technology, Nanyang, China
Yan Chen: School of Biological and Chemical Engineering, Nanyang Institute of Technology, Nanyang, China; Henan Key Laboratory of Industrial Microbial Resources and Fermentation Technology, Nanyang Institute of Technology, Nanyang, China
Shuang Cheng: School of Biological and Chemical Engineering, Nanyang Institute of Technology, Nanyang, China; Henan Key Laboratory of Industrial Microbial Resources and Fermentation Technology, Nanyang Institute of Technology, Nanyang, China
Bingbing Liu: School of Biological and Chemical Engineering, Nanyang Institute of Technology, Nanyang, China; Henan Key Laboratory of Industrial Microbial Resources and Fermentation Technology, Nanyang Institute of Technology, Nanyang, China
Chunyan Wang: School of Biological and Chemical Engineering, Nanyang Institute of Technology, Nanyang, China; Henan Key Laboratory of Industrial Microbial Resources and Fermentation Technology, Nanyang Institute of Technology, Nanyang, China
Yu Wang: College of Biological Science and Engineering, Jiangxi Agricultural University, Nanchang, China
Qingshan Meng: State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China

Functional Classification and Characterization of the Fungal Glycoside Hydrolase 28 Protein Family. J Fungi (Basel) 2022;8:jof8030217. [PMID: 35330219 PMCID: PMC8952511 DOI: 10.3390/jof8030217] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2022] [Revised: 02/13/2022] [Accepted: 02/15/2022] [Indexed: 02/01/2023] Open

Extracting phylogenetic dimensions of coevolution reveals hidden functional signals. Sci Rep 2022;12:820. [PMID: 35039514 PMCID: PMC8764114 DOI: 10.1038/s41598-021-04260-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Accepted: 12/17/2021] [Indexed: 11/08/2022] Open

Lehnart SE, Wehrens XHT. The Role of Junctophilin Proteins in Cellular Function. Physiol Rev 2022;102:1211-1261. [PMID: 35001666 PMCID: PMC8934682 DOI: 10.1152/physrev.00024.2021] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open

Tsutakawa SE, Bacolla A, Katsonis P, Bralić A, Hamdan SM, Lichtarge O, Tainer JA, Tsai CL. Decoding Cancer Variants of Unknown Significance for Helicase-Nuclease-RPA Complexes Orchestrating DNA Repair During Transcription and Replication. Front Mol Biosci 2021;8:791792. [PMID: 34966786 PMCID: PMC8710748 DOI: 10.3389/fmolb.2021.791792] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2021] [Accepted: 11/16/2021] [Indexed: 01/13/2023] Open

Abstract

All tumors have DNA mutations, and a predictive understanding of those mutations could inform clinical treatments. However, 40% of the mutations are variants of unknown significance (VUS), with the challenge being to objectively predict whether a VUS is pathogenic and supports the tumor or whether it is benign. To objectively decode VUS, we mapped cancer sequence data and evolutionary trace (ET) scores onto crystallography and cryo-electron microscopy structures with variant impacts quantitated by evolutionary action (EA) measures. As tumors depend on helicases and nucleases to deal with transcription/replication stress, we targeted helicase–nuclease–RPA complexes: (1) XPB-XPD (within TFIIH), XPF-ERCC1, XPG, and RPA for transcription and nucleotide excision repair pathways and (2) BLM, EXO5, and RPA plus DNA2 for stalled replication fork restart. As validation, EA scoring predicts severe effects for most disease mutations, but disease mutants with low ET scores not only are likely destabilizing but also disrupt sophisticated allosteric mechanisms. For sites of disease mutations and VUS predicted to be severe, we found strong co-localization to ordered regions. Rare discrepancies highlighted the different survival requirements between disease and tumor mutations, as well as the value of examining proteins within complexes. In a genome-wide analysis of 33 cancer types, we found correlation between the number of mutations in each tumor and which pathways or functional processes in which the mutations occur, revealing different mutagenic routes to tumorigenesis. We also found upregulation of ancient genes including BLM, which supports a non-random and concerted cancer process: reversion to a unicellular, proliferation-uncontrolled, status by breaking multicellular constraints on cell division. 
Together, these genes and global analyses challenge the binary “driver” and “passenger” mutation paradigm, support a gradient impact as revealed by EA scoring from moderate to severe at a single gene level, and indicate reduced regulation as well as activity. The objective quantitative assessment of VUS scoring and gene overexpression in the context of functional interactions and pathways provides insights for biology, oncology, and precision medicine.

The Functional Differences between the GroEL Chaperonin of Escherichia coli and the HtpB Chaperonin of Legionella pneumophila Can Be Mapped to Specific Amino Acid Residues. Biomolecules 2021;12:biom12010059. [PMID: 35053207 PMCID: PMC8774168 DOI: 10.3390/biom12010059] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 12/26/2021] [Accepted: 12/28/2021] [Indexed: 11/17/2022] Open

Sinelnikov IG, Siedhoff NE, Chulkin AM, Zorov IN, Schwaneberg U, Davari MD, Sinitsyna OA, Shcherbakova LA, Sinitsyn AP, Rozhkova AM. Expression and Refolding of the Plant Chitinase From Drosera capensis for Applications as a Sustainable and Integrated Pest Management. Front Bioeng Biotechnol 2021;9:728501. [PMID: 34621729 PMCID: PMC8490864 DOI: 10.3389/fbioe.2021.728501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Accepted: 09/08/2021] [Indexed: 11/13/2022] Open

Abstract

Recently, the study of chitinases has become an important target of numerous research projects due to their potential for applications, such as biocontrol pest agents. Plant chitinases from carnivorous plants of the genus Drosera are most aggressive against a wide range of phytopathogens. However, low solubility or insolubility of the target protein hampered application of chitinases as biofungicides. To obtain plant chitinase from carnivorous plants of the genus Drosera in soluble form in E. coli expression strains, three different renaturation approaches were tested: dialysis, rapid dilution, and refolding on Ni-NTA agarose. The developed "rapid dilution" protocol, with renaturation buffer supplemented by 10% glycerol and 2 M arginine in combination with the redox pair of reduced/oxidized glutathione, increased the yield of active soluble protein to 9.5 mg per 1 g of wet biomass. A structure-based removal of free cysteines in the core domain, based on homology modeling of the structure, was carried out in order to improve the solubility of chitinase. One improved chitinase variant (C191A/C231S/C286T) was identified which shows improved expression and solubility in E. coli expression systems compared to wild type. Computational analyses of the wild-type and the improved variant revealed overall higher fluctuations of the structure while maintaining a global protein stability. It was shown that free cysteines on the surface of the protein globule which are not involved in the formation of inner disulfide bonds contribute to the insolubility of chitinase from Drosera capensis. The functional characteristics showed that chitinase exhibits high activity against colloidal chitin (360 units/g) and high fungicidal properties of recombinant chitinases against Parastagonospora nodorum. The latter highlights the application of chitinase from D. capensis as a promising enzyme for the control of fungal pathogens in agriculture.

Das S, Scholes HM, Sen N, Orengo C. CATH functional families predict functional sites in proteins. Bioinformatics 2021;37:1099-1106. [PMID: 33135053 PMCID: PMC8150129 DOI: 10.1093/bioinformatics/btaa937] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2020] [Revised: 09/30/2020] [Accepted: 10/27/2020] [Indexed: 01/12/2023] Open

Porzio E, Faraone Mennella MR, Manco G. DING Proteins Extend to the Extremophilic World. Int J Mol Sci 2021;22:2035. [PMID: 33670786 PMCID: PMC7922408 DOI: 10.3390/ijms22042035] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Revised: 02/04/2021] [Accepted: 02/16/2021] [Indexed: 11/16/2022] Open

Evolutionary History of Alzheimer Disease-Causing Protein Family Presenilins with Pathological Implications. J Mol Evol 2020;88:674-688. [DOI: 10.1007/s00239-020-09966-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2019] [Accepted: 09/22/2020] [Indexed: 12/14/2022]

Serçinoğlu O, Ozbek P. Sequence-structure-function relationships in class I MHC: A local frustration perspective. PLoS One 2020;15:e0232849. [PMID: 32421728 PMCID: PMC7233585 DOI: 10.1371/journal.pone.0232849] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Accepted: 04/22/2020] [Indexed: 12/22/2022] Open

Novikov IB, Wilkins AD, Lichtarge O. An Evolutionary Trace method defines functionally important bases and sites common to RNA families. PLoS Comput Biol 2020;16:e1007583. [PMID: 32208421 PMCID: PMC7092961 DOI: 10.1371/journal.pcbi.1007583] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2018] [Accepted: 11/27/2019] [Indexed: 11/18/2022] Open

Abstract

Functional non-coding (fnc)RNAs are nucleotide sequences of varied lengths, structures, and mechanisms that ubiquitously influence gene expression and translation, genome stability and dynamics, and human health and disease. Here, to shed light on their functional determinants, we seek to exploit the evolutionary record of variation and divergence read from sequence comparisons. The approach follows the phylogenetic Evolutionary Trace (ET) paradigm, first developed and extensively validated on proteins. We assigned a relative rank of importance to every base in a study of 1070 functional RNAs, including the ribosome, and observed evolutionary patterns strikingly similar to those seen in proteins, namely, (1) the top-ranked bases clustered in secondary and tertiary structures. (2) In turn, these clusters mapped functional regions for catalysis, binding proteins and drugs, post-transcriptional modification, and deleterious mutations. (3) Moreover, the quantitative quality of these clusters correlated with the identification of functional regions. (4) As a result of this correlation, smoother structural distributions of evolutionarily important nucleotides improved functional site predictions. Thus, in practice, phylogenetic analysis can broadly identify functional determinants in RNA sequences and functional sites in RNA structures, and reveal details on the basis of RNA molecular functions. As an example of application, we report several previously undocumented and potentially functional ET nucleotide clusters in the ribosome. This work is broadly relevant to studies of structure-function in ribonucleic acids. Additionally, this generalization of ET shows that evolutionary constraints among sequence, structure, and function are similar in structured RNA and proteins. RNA ET is currently available as part of the ET command-line package, and will be available as a web-server.

Traditionally, RNA has been relegated to the role of an intermediate between DNA and proteins. However, we now recognize that RNAs are broadly functional beyond their role in translation, and that a number of diverse classes exist. Because functional, non-coding RNAs are prevalent in biology and impact human health, it is important to better understand their functional determinants. However, the classical solution to this problem, targeted mutagenesis, is time-consuming and scales poorly. We propose an alternative computational approach to this problem, the Evolutionary Trace method. Previously developed and validated for proteins, Evolutionary Trace examines evolutionary history of a molecule and predicts evolutionarily important residues in the sequence. We apply Evolutionary Trace to a set of diverse RNAs, and find that the evolutionarily important nucleotides cluster on the three-dimensional structure, and that these clusters closely overlap functional sites. We also find that the clustering property can be used to refine and improve predictions. These findings are in close agreement with our observations of Evolutionary Trace in proteins, and suggest that structured functional RNAs and proteins evolve under similar constraints. In practice, the approach is to be used by RNA researchers seeking insight into their molecule of interest, and the Evolutionary Trace program, along with a working example, is available at https://github.com/LichtargeLab/RNA_ET_ms.

Deep Analysis of Residue Constraints (DARC): identifying determinants of protein functional specificity. Sci Rep 2020;10:1691. [PMID: 32015389 PMCID: PMC6997377 DOI: 10.1038/s41598-019-55118-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2019] [Accepted: 11/23/2019] [Indexed: 01/03/2023] Open

White MA, Tsalkova T, Mei FC, Cheng X. Conformational States of Exchange Protein Directly Activated by cAMP (EPAC1) Revealed by Ensemble Modeling and Integrative Structural Biology. Cells 2019;9:cells9010035. [PMID: 31877746 PMCID: PMC7016869 DOI: 10.3390/cells9010035] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2019] [Revised: 12/16/2019] [Accepted: 12/18/2019] [Indexed: 02/08/2023] Open

Split intein-mediated selection of cells containing two plasmids using a single antibiotic. Nat Commun 2019;10:4967. [PMID: 31672972 PMCID: PMC6823396 DOI: 10.1038/s41467-019-12911-1] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2019] [Accepted: 10/07/2019] [Indexed: 11/08/2022] Open

Hamm MO, Moss BL, Leydon AR, Gala HP, Lanctot A, Ramos R, Klaeser H, Lemmex AC, Zahler ML, Nemhauser JL, Wright RC. Accelerating structure-function mapping using the ViVa webtool to mine natural variation. Plant Direct 2019;3:e00147. [PMID: 31372596 PMCID: PMC6658840 DOI: 10.1002/pld3.147] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2018] [Revised: 04/20/2019] [Accepted: 04/29/2019] [Indexed: 05/13/2023]

Ban X, Lahiri P, Dhoble AS, Li D, Gu Z, Li C, Cheng L, Hong Y, Li Z, Kaustubh B. Evolutionary Stability of Salt Bridges Hints Its Contribution to Stability of Proteins. Comput Struct Biotechnol J 2019;17:895-903. [PMID: 31333816 PMCID: PMC6620738 DOI: 10.1016/j.csbj.2019.06.022] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2019] [Revised: 06/19/2019] [Accepted: 06/20/2019] [Indexed: 11/18/2022] Open

Feltes BC, Grisci BI, Poloni JDF, Dorn M. Perspectives and applications of machine learning for evolutionary developmental biology. Mol Omics 2018;14:289-306. [PMID: 30168572 DOI: 10.1039/c8mo00111a] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Butler BM, Kazan IC, Kumar A, Ozkan SB. Coevolving residues inform protein dynamics profiles and disease susceptibility of nSNVs. PLoS Comput Biol 2018;14:e1006626. [PMID: 30496278 PMCID: PMC6289467 DOI: 10.1371/journal.pcbi.1006626] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2018] [Revised: 12/11/2018] [Accepted: 11/09/2018] [Indexed: 11/18/2022] Open

Abstract

The conformational dynamics of proteins is rarely used in methodologies used to predict the impact of genetic mutations due to the paucity of three-dimensional protein structures as compared to the vast number of available sequences. Until now a three-dimensional (3D) structure has been required to predict the conformational dynamics of a protein. We introduce an approach that estimates the conformational dynamics of a protein, without relying on structural information. This de novo approach utilizes coevolving residues identified from a multiple sequence alignment (MSA) using Potts models. These coevolving residues are used as contacts in a Gaussian network model (GNM) to obtain protein dynamics. B-factors calculated using sequence-based GNM (Seq-GNM) are in agreement with crystallographic B-factors as well as theoretical B-factors from the original GNM that utilizes the 3D structure. Moreover, we demonstrate the ability of the calculated B-factors from the Seq-GNM approach to discriminate genomic variants according to their phenotypes for a wide range of proteins. These results suggest that protein dynamics can be approximated based on sequence information alone, making it possible to assess the phenotypes of nSNVs in cases where a 3D structure is unknown. We hope this work will promote the use of dynamics information in genetic disease prediction at scale by circumventing the need for 3D structures.

Proteins are dynamic machines that undergo atomic fluctuations, side chain rotations, and collective domain movements that are required for biological function. There is, therefore, a need for quantitative metrics that capture the dynamic fluctuations per position to understand the critical role of protein dynamics in shaping biological functions. A limiting factor in incorporating structural dynamics information in the classification of non-synonymous single nucleotide variants (nSNVs) is the limited number of known 3D structures compared to the vast number of available sequences. We have developed a new sequence-based GNM method, termed Seq-GNM, which uses co-evolving amino acid positions based on the multiple sequence alignment of a given query sequence to estimate the thermal motions of C-alpha atoms. In this paper, we have demonstrated that the predicted thermal motions using Seq-GNM are in reasonable agreement with experimental B-factors as well as B-factors computed using 3D crystal structures. We also provide evidence that B-factors predicted by Seq-GNM are capable of distinguishing between disease-associated and neutral nSNVs.

Han M, Song Y, Qian J, Ming D. Sequence-based prediction of physicochemical interactions at protein functional sites using a function-and-interaction-annotated domain profile database. BMC Bioinformatics 2018;19:204. [PMID: 29859055 PMCID: PMC5984826 DOI: 10.1186/s12859-018-2206-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2017] [Accepted: 05/15/2018] [Indexed: 01/16/2023] Open

Karasev DA, Veselovsky AV, Lagunin AA, Filimonov DA, Sobolev BN. Determination of Amino Acid Residues Responsible for Specific Interaction of Protein Kinases with Small Molecule Inhibitors. Mol Biol 2018. [DOI: 10.1134/s002689331802005x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Neuwald AF, Aravind L, Altschul SF. Inferring joint sequence-structural determinants of protein functional specificity. eLife 2018;7. [PMID: 29336305 PMCID: PMC5770160 DOI: 10.7554/elife.29880] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2017] [Accepted: 12/22/2017] [Indexed: 01/05/2023] Open

Gallion J, Koire A, Katsonis P, Schoenegge A, Bouvier M, Lichtarge O. Predicting phenotype from genotype: Improving accuracy through more robust experimental and computational modeling. Hum Mutat 2017;38:569-580. [PMID: 28230923 PMCID: PMC5516182 DOI: 10.1002/humu.23193] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2016] [Revised: 01/25/2017] [Accepted: 02/04/2017] [Indexed: 11/11/2022]

Neuwald AF, Altschul SF. Inference of Functionally-Relevant N-acetyltransferase Residues Based on Statistical Correlations. PLoS Comput Biol 2016;12:e1005294. [PMID: 28002465 PMCID: PMC5225019 DOI: 10.1371/journal.pcbi.1005294] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2016] [Revised: 01/10/2017] [Accepted: 12/08/2016] [Indexed: 11/25/2022] Open

Abstract

Over evolutionary time, members of a superfamily of homologous proteins sharing a common structural core diverge into subgroups filling various functional niches. At the sequence level, such divergence appears as correlations that arise from residue patterns distinct to each subgroup. Such a superfamily may be viewed as a population of sequences corresponding to a complex, high-dimensional probability distribution. Here we model this distribution as hierarchical interrelated hidden Markov models (hiHMMs), which describe these sequence correlations implicitly. By characterizing such correlations one may hope to obtain information regarding functionally-relevant properties that have thus far evaded detection. To do so, we infer a hiHMM distribution from sequence data using Bayes’ theorem and Markov chain Monte Carlo (MCMC) sampling, which is widely recognized as the most effective approach for characterizing a complex, high dimensional distribution. Other routines then map correlated residue patterns to available structures with a view to hypothesis generation. When applied to N-acetyltransferases, this reveals sequence and structural features indicative of functionally important, yet generally unknown biochemical properties. Even for sets of proteins for which nothing is known beyond unannotated sequences and structures, this can lead to helpful insights. We describe, for example, a putative coenzyme-A-induced-fit substrate binding mechanism mediated by arginine residue switching between salt bridge and π-π stacking interactions. A suite of programs implementing this approach is available (psed.igs.umaryland.edu).

Protein sequence data, when gathered in great quantity, contain important but implicit biological information manifest as statistical correlations. Here we describe an approach to access this information by comprehensively modeling and characterizing the distribution of sequences belonging to a major protein superfamily. This approach takes as input a large set of unaligned sequences belonging to the superfamily. By applying the minimum description length principle, it seeks the statistical model that best explains the sequences while avoiding over-fitting the data. It concurrently aligns the sequences and, to model evolutionary divergence, partitions them into subgroups that are hierarchically-arranged based upon correlated residue patterns. Auxiliary routines create PyMOL scripts to visualize the locations of correlated residues within available structures. Because these correlations likely arise from structural and biochemical constraints, they can help elucidate protein properties important for functional specificity. Comparing and contrasting sequence and structural features in this way may therefore suggest, in the light of published studies, plausible biological hypotheses for experimental investigation. We illustrate this approach with N-acetyltransferases.

Gallion J, Wilkins AD, Lichtarge O. Human Kinases Display Mutational Hotspots at Cognate Positions Within Cancer. Pacific Symposium on Biocomputing 2016;22:414-425. [PMID: 27896994 DOI: 10.1142/9789813207813_0039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Protein stabilization improves STAT3 function in autosomal dominant hyper-IgE syndrome. Blood 2016;128:3061-3072. [PMID: 27799162 DOI: 10.1182/blood-2016-02-702373] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2016] [Accepted: 10/19/2016] [Indexed: 12/17/2022] Open

Farhoodi R, Akbal-Delibas B, Haspel N. Machine Learning Approaches for Predicting Protein Complex Similarity. J Comput Biol 2016;24:40-51. [PMID: 27748625 DOI: 10.1089/cmb.2016.0137] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Akbal-Delibas B, Pomplun M, Haspel N. Accurate Prediction of Docked Protein Structure Similarity. J Comput Biol 2016;22:892-904. [PMID: 26335807 DOI: 10.1089/cmb.2015.0114] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Indexed: 12/23/2022] Open

Akbal-Delibas B, Farhoodi R, Pomplun M, Haspel N. Accurate refinement of docked protein complexes using evolutionary information and deep learning. J Bioinform Comput Biol 2015;14:1642002. [PMID: 26846813 DOI: 10.1142/s0219720016420026] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Indexed: 12/24/2022]

Lua RC, Wilson SJ, Konecki DM, Wilkins AD, Venner E, Morgan DH, Lichtarge O. UET: a database of evolutionarily-predicted functional determinants of protein sequences that cluster as functional sites in protein structures. Nucleic Acids Res 2015;44:D308-12. [PMID: 26590254 PMCID: PMC4702906 DOI: 10.1093/nar/gkv1279] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Received: 08/21/2015] [Accepted: 11/02/2015] [Indexed: 02/07/2023] Open

Assessing the genetic diversity of Cu resistance in mine tailings through high-throughput recovery of full-length copA genes. Sci Rep 2015;5:13258. [PMID: 26286020 PMCID: PMC4541151 DOI: 10.1038/srep13258] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Received: 04/16/2015] [Accepted: 06/16/2015] [Indexed: 11/17/2022] Open

Elucidation of G-protein and β-arrestin functional selectivity at the dopamine D2 receptor. Proc Natl Acad Sci U S A 2015;112:7097-102. [PMID: 25964346 DOI: 10.1073/pnas.1502742112] [Citation(s) in RCA: 70] [Impact Index Per Article: 7.8] [Indexed: 12/16/2022] Open

Aumentado-Armstrong TT, Istrate B, Murgita RA. Algorithmic approaches to protein-protein interaction site prediction. Algorithms Mol Biol 2015;10:7. [PMID: 25713596 PMCID: PMC4338852 DOI: 10.1186/s13015-015-0033-9] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Received: 08/23/2014] [Accepted: 01/07/2015] [Indexed: 12/19/2022] Open

Donegan RK, Hill SE, Freeman DM, Nguyen E, Orwig SD, Turnage KC, Lieberman RL. Structural basis for misfolding in myocilin-associated glaucoma. Hum Mol Genet 2014;24:2111-24. [PMID: 25524706 DOI: 10.1093/hmg/ddu730] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.2] [Indexed: 11/13/2022] Open

Lua RC, Marciano DC, Katsonis P, Adikesavan AK, Wilkins AD, Lichtarge O. Prediction and redesign of protein-protein interactions. Prog Biophys Mol Biol 2014;116:194-202. [PMID: 24878423 DOI: 10.1016/j.pbiomolbio.2014.05.004] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Received: 02/25/2014] [Revised: 05/02/2014] [Accepted: 05/17/2014] [Indexed: 12/14/2022]

Pelé J, Moreau M, Abdi H, Rodien P, Castel H, Chabbert M. Comparative analysis of sequence covariation methods to mine evolutionary hubs: Examples from selected GPCR families. Proteins 2014;82:2141-56. [DOI: 10.1002/prot.24570] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Received: 07/26/2013] [Revised: 03/11/2014] [Accepted: 03/19/2014] [Indexed: 01/26/2023]

Young E, Zheng ZY, Wilkins AD, Jeong HT, Li M, Lichtarge O, Chang EC. Regulation of Ras localization and cell transformation by evolutionarily conserved palmitoyltransferases. Mol Cell Biol 2014;34:374-85. [PMID: 24248599 PMCID: PMC3911504 DOI: 10.1128/mcb.01248-13] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Received: 09/23/2013] [Revised: 10/16/2013] [Accepted: 11/09/2013] [Indexed: 01/06/2023] Open

Homan EP, Lietman C, Grafe I, Lennington J, Morello R, Napierala D, Jiang MM, Munivez EM, Dawson B, Bertin TK, Chen Y, Lua R, Lichtarge O, Hicks J, Weis MA, Eyre D, Lee BHL. Differential effects of collagen prolyl 3-hydroxylation on skeletal tissues. PLoS Genet 2014;10:e1004121. [PMID: 24465224 PMCID: PMC3900401 DOI: 10.1371/journal.pgen.1004121] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Received: 04/30/2013] [Accepted: 12/04/2013] [Indexed: 02/04/2023] Open

Abstract

Mutations in the genes encoding cartilage associated protein (CRTAP) and prolyl 3-hydroxylase 1 (P3H1 encoded by LEPRE1) were the first identified causes of recessive Osteogenesis Imperfecta (OI). These proteins, together with cyclophilin B (encoded by PPIB), form a complex that 3-hydroxylates a single proline residue on the α1(I) chain (Pro986) and has cis/trans isomerase (PPIase) activity essential for proper collagen folding. Recent data suggest that prolyl 3-hydroxylation of Pro986 is not required for the structural stability of collagen; however, the absence of this post-translational modification may disrupt protein-protein interactions integral for proper collagen folding and lead to collagen over-modification. P3H1 and CRTAP stabilize each other and absence of one results in degradation of the other. Hence, hypomorphic or loss of function mutations of either gene cause loss of the whole complex and its associated functions. The relative contribution of losing this complex's 3-hydroxylation versus PPIase and collagen chaperone activities to the phenotype of recessive OI is unknown. To distinguish between these functions, we generated knock-in mice carrying a single amino acid substitution in the catalytic site of P3h1 (Lepre1^H662A). This substitution abolished P3h1 activity but retained ability to form a complex with Crtap and thus the collagen chaperone function. Knock-in mice showed absence of prolyl 3-hydroxylation at Pro986 of the α1(I) and α1(II) collagen chains but no significant over-modification at other collagen residues. They were normal in appearance, had no growth defects and normal cartilage growth plate histology but showed decreased trabecular bone mass. This new mouse model recapitulates elements of the bone phenotype of OI but not the cartilage and growth phenotypes caused by loss of the prolyl 3-hydroxylation complex. Our observations suggest differential tissue consequences due to selective inactivation of P3H1 hydroxylase activity versus complete ablation of the prolyl 3-hydroxylation complex.

The prolyl 3-hydroxylase complex serves to hydroxylate a single residue in type I collagen and also serves as a collagen chaperone. The complex is comprised of prolyl 3-hydroxylase 1, cartilage associated protein, and cyclophilin B. Mutations have been identified in the genes encoding the complex members in patients with recessive Osteogenesis Imperfecta. Recent data suggest that prolyl 3-hydroxylation of collagen does not alter the stability of collagen but may rather mediate protein-protein interactions. Additionally, the collagen chaperoning function of the complex is an important rate limiting step in the modification of type I collagen. Irrespective of whether patients with mutations in the genes encoding the members of the prolyl 3-hydroxylase complex have hypomorphic or complete loss of function alleles, either circumstance leads to the loss of both functions of the prolyl 3-hydroxylase complex. Thus, it is unknown how collagen chaperoning and/or hydroxylation affect bone and cartilage homeostasis. In this study, we generated a mouse model lacking the prolyl 3-hydroxylation activity of the complex while maintaining the chaperoning function. We found that the hydroxylase mutant mice have normal cartilage and normal cortical bone but decreased trabecular bone, suggesting that there is a differential requirement for hydroxylation in different tissues.

Affiliation(s)

Erica P. Homan: Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
Caressa Lietman: Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
Ingo Grafe: Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
Jennifer Lennington: Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
Roy Morello: Department of Physiology and Biophysics, University of Arkansas for Medical Sciences, Little Rock, Arkansas, United States of America
Dobrawa Napierala: Department of Oral and Maxillofacial Surgery, School of Dentistry, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
Ming-Ming Jiang: Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America; Howard Hughes Medical Institute, Baylor College of Medicine, Houston, Texas, United States of America
Elda M. Munivez: Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
Brian Dawson: Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America; Howard Hughes Medical Institute, Baylor College of Medicine, Houston, Texas, United States of America
Terry K. Bertin: Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
Yuqing Chen: Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America; Howard Hughes Medical Institute, Baylor College of Medicine, Houston, Texas, United States of America
Rhonald Lua: Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
Olivier Lichtarge: Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
John Hicks: Department of Pathology, Texas Children's Hospital, Baylor College of Medicine, Houston, Texas, United States of America
Mary Ann Weis: Department of Orthopaedics and Sports Medicine, University of Washington, Seattle, Washington, United States of America
David Eyre: Department of Orthopaedics and Sports Medicine, University of Washington, Seattle, Washington, United States of America
Brendan H. L. Lee: Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America; Howard Hughes Medical Institute, Baylor College of Medicine, Houston, Texas, United States of America * E-mail:

Erdin S, Venner E, Lisewski AM, Lichtarge O. Function prediction from networks of local evolutionary similarity in protein structure. BMC Bioinformatics 2013;14 Suppl 3:S6. [PMID: 23514548 PMCID: PMC3584919 DOI: 10.1186/1471-2105-14-s3-s6] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Indexed: 11/15/2022] Open

Skolnick J, Zhou H, Gao M. Are predicted protein structures of any value for binding site prediction and virtual ligand screening? Curr Opin Struct Biol 2013;23:191-7. [PMID: 23415854 DOI: 10.1016/j.sbi.2013.01.009] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Received: 12/19/2012] [Revised: 01/04/2013] [Accepted: 01/23/2013] [Indexed: 01/03/2023]

Engel AS, Johnson LR, Porter ML. Arsenite oxidase gene diversity among Chloroflexi and Proteobacteria from El Tatio Geyser Field, Chile. FEMS Microbiol Ecol 2012;83:745-56. [PMID: 23066664 DOI: 10.1111/1574-6941.12030] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Received: 07/20/2012] [Revised: 10/05/2012] [Accepted: 10/07/2012] [Indexed: 11/29/2022] Open

Bowen DM, Lewis JA, Lu W, Schein CH. Simplifying complex sequence information: a PCP-consensus protein binds antibodies against all four Dengue serotypes. Vaccine 2012;30:6081-7. [PMID: 22863657 DOI: 10.1016/j.vaccine.2012.07.042] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Received: 04/09/2012] [Revised: 07/13/2012] [Accepted: 07/18/2012] [Indexed: 12/15/2022]