1
|
Lu H, Twan WK, Ikawa Y, Khare V, Mukherjee I, Schou KB, Chua KX, Aqasha A, Chakrabarti S, Hamada H, Roy S. Localisation and function of key axonemal microtubule inner proteins and dynein docking complex members reveal extensive diversity among vertebrate motile cilia. Development 2024; 151:dev202737. [PMID: 39007638 DOI: 10.1242/dev.202737] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2024] [Accepted: 06/25/2024] [Indexed: 07/16/2024]
Abstract
Vertebrate motile cilia are classified as (9+2) or (9+0), based on the presence or absence of the central pair apparatus, respectively. Cryogenic electron microscopy analyses of (9+2) cilia have uncovered an elaborate axonemal protein composition. The extent to which these features are conserved in (9+0) cilia remains unclear. CFAP53, a key axonemal filamentous microtubule inner protein (fMIP) and a centriolar satellites component, is essential for motility of (9+0), but not (9+2) cilia. Here, we show that in (9+2) cilia, CFAP53 functions redundantly with a paralogous fMIP, MNS1. MNS1 localises to ciliary axonemes, and combined loss of both proteins in zebrafish and mice caused severe outer dynein arm loss from (9+2) cilia, significantly affecting their motility. Using immunoprecipitation, we demonstrate that, whereas MNS1 can associate with itself and CFAP53, CFAP53 is unable to self-associate. We also show that additional axonemal dynein-interacting proteins, two outer dynein arm docking (ODAD) complex members, show differential localisation between types of motile cilia. Together, our findings clarify how paralogous fMIPs, CFAP53 and MNS1, function in regulating (9+2) versus (9+0) cilia motility, and further emphasise extensive structural diversity among these organelles.
Collapse
Affiliation(s)
- Hao Lu
- Institute of Molecular and Cell Biology, Agency for Science, Technology and Research, Proteos, 61 Biopolis Drive, Singapore138673
| | - Wang Kyaw Twan
- Laboratory for Organismal Patterning, RIKEN Centre for Biosystems Dynamics Research, 2-2-3 Minatojima-minamimachi, Chuo-Ku, Kobe 650-0005, Japan
| | - Yayoi Ikawa
- Laboratory for Organismal Patterning, RIKEN Centre for Biosystems Dynamics Research, 2-2-3 Minatojima-minamimachi, Chuo-Ku, Kobe 650-0005, Japan
| | - Vani Khare
- Institute of Molecular and Cell Biology, Agency for Science, Technology and Research, Proteos, 61 Biopolis Drive, Singapore138673
| | - Ishita Mukherjee
- Translational Research Unit of Excellence, Structural Biology and Bioinformatics Division, Council for Scientific and Industrial Research - Indian Institute of Chemical Biology, Kolkata 700091, India
| | - Kenneth Bødtker Schou
- The Danish Cancer Society Research Centre, Danish Cancer Institute, Strandboulevarden 49, 2100 Copenhagen, Denmark
| | - Kai Xin Chua
- Institute of Molecular and Cell Biology, Agency for Science, Technology and Research, Proteos, 61 Biopolis Drive, Singapore138673
| | - Adam Aqasha
- Institute of Molecular and Cell Biology, Agency for Science, Technology and Research, Proteos, 61 Biopolis Drive, Singapore138673
| | - Saikat Chakrabarti
- Translational Research Unit of Excellence, Structural Biology and Bioinformatics Division, Council for Scientific and Industrial Research - Indian Institute of Chemical Biology, Kolkata 700091, India
| | - Hiroshi Hamada
- Laboratory for Organismal Patterning, RIKEN Centre for Biosystems Dynamics Research, 2-2-3 Minatojima-minamimachi, Chuo-Ku, Kobe 650-0005, Japan
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, Bellary Road, Bengaluru 560065, India
- Trivedi School of Biosciences, Ashoka University, Sonepat, 131029, India
| | - Sudipto Roy
- Institute of Molecular and Cell Biology, Agency for Science, Technology and Research, Proteos, 61 Biopolis Drive, Singapore138673
- Department of Paediatrics, Yong Loo Ling School of Medicine, National University of Singapore, 1E Kent Ridge Road, Singapore119288
| |
Collapse
|
2
|
Lupo U, Sgarbossa D, Bitbol AF. Pairing interacting protein sequences using masked language modeling. Proc Natl Acad Sci U S A 2024; 121:e2311887121. [PMID: 38913900 PMCID: PMC11228504 DOI: 10.1073/pnas.2311887121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2023] [Accepted: 12/18/2023] [Indexed: 06/26/2024] Open
Abstract
Predicting which proteins interact together from amino acid sequences is an important task. We develop a method to pair interacting protein sequences which leverages the power of protein language models trained on multiple sequence alignments (MSAs), such as MSA Transformer and the EvoFormer module of AlphaFold. We formulate the problem of pairing interacting partners among the paralogs of two protein families in a differentiable way. We introduce a method called Differentiable Pairing using Alignment-based Language Models (DiffPALM) that solves it by exploiting the ability of MSA Transformer to fill in masked amino acids in multiple sequence alignments using the surrounding context. MSA Transformer encodes coevolution between functionally or structurally coupled amino acids within protein chains. It also captures inter-chain coevolution, despite being trained on single-chain data. Relying on MSA Transformer without fine-tuning, DiffPALM outperforms existing coevolution-based pairing methods on difficult benchmarks of shallow multiple sequence alignments extracted from ubiquitous prokaryotic protein datasets. It also outperforms an alternative method based on a state-of-the-art protein language model trained on single sequences. Paired alignments of interacting protein sequences are a crucial ingredient of supervised deep learning methods to predict the three-dimensional structure of protein complexes. Starting from sequences paired by DiffPALM substantially improves the structure prediction of some eukaryotic protein complexes by AlphaFold-Multimer. It also achieves competitive performance with using orthology-based pairing.
Collapse
Affiliation(s)
- Umberto Lupo
- Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne, Lausanne CH-1015, Switzerland
- SIB Swiss Institute of Bioinformatics, Lausanne CH-1015, Switzerland
| | - Damiano Sgarbossa
- Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne, Lausanne CH-1015, Switzerland
- SIB Swiss Institute of Bioinformatics, Lausanne CH-1015, Switzerland
| | - Anne-Florence Bitbol
- Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne, Lausanne CH-1015, Switzerland
- SIB Swiss Institute of Bioinformatics, Lausanne CH-1015, Switzerland
| |
Collapse
|
3
|
Lee YHG, Cerf NT, Shalaby N, Montes MR, Clarke RJ. Bioinformatic Study of Possible Acute Regulation of Acid Secretion in the Stomach. J Membr Biol 2024; 257:79-89. [PMID: 38436710 PMCID: PMC11006737 DOI: 10.1007/s00232-024-00310-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2023] [Accepted: 02/21/2024] [Indexed: 03/05/2024]
Abstract
The gastric H+,K+-ATPase is an integral membrane protein which derives energy from the hydrolysis of ATP to transport H+ ions from the parietal cells of the gastric mucosa into the stomach in exchange for K+ ions. It is responsible for the acidic environment of the stomach, which is essential for digestion. Acid secretion is regulated by the recruitment of the H+,K+-ATPase from intracellular stores into the plasma membrane on the ingestion of food. The similar amino acid sequences of the lysine-rich N-termini α-subunits of the H+,K+- and Na+,K+-ATPases, suggests similar acute regulation mechanisms, specifically, an electrostatic switch mechanism involving an interaction of the N-terminal tail with the surface of the surrounding membrane and a modulation of the interaction via regulatory phosphorylation by protein kinases. From a consideration of sequence alignment of the H+,K+-ATPase and an analysis of its coevolution with protein kinase C and kinases of the Src family, the evidence points towards a phosphorylation of tyrosine-7 of the N-terminus by either Lck or Yes in all vertebrates except cartilaginous fish. The results obtained will guide and focus future experimental research.
Collapse
Affiliation(s)
- Yan Hay Grace Lee
- School of Chemistry, University of Sydney, Sydney, NSW, 2006, Australia
| | - Nicole T Cerf
- Instituto de Química y Fisicoquímica Biológica (IQUIFIB), CONICET, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Nicholas Shalaby
- School of Chemistry, University of Sydney, Sydney, NSW, 2006, Australia
| | - Mónica R Montes
- Instituto de Química y Fisicoquímica Biológica (IQUIFIB), CONICET, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Ronald J Clarke
- School of Chemistry, University of Sydney, Sydney, NSW, 2006, Australia.
- The University of Sydney Nano Institute, Sydney, NSW, 2006, Australia.
| |
Collapse
|
4
|
Durham J, Zhang J, Humphreys IR, Pei J, Cong Q. Recent advances in predicting and modeling protein-protein interactions. Trends Biochem Sci 2023; 48:527-538. [PMID: 37061423 DOI: 10.1016/j.tibs.2023.03.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 03/03/2023] [Accepted: 03/17/2023] [Indexed: 04/17/2023]
Abstract
Protein-protein interactions (PPIs) drive biological processes, and disruption of PPIs can cause disease. With recent breakthroughs in structure prediction and a deluge of genomic sequence data, computational methods to predict PPIs and model spatial structures of protein complexes are now approaching the accuracy of experimental approaches for permanent interactions and show promise for elucidating transient interactions. As we describe here, the key to this success is rich evolutionary information deciphered from thousands of homologous sequences that coevolve in interacting partners. This covariation signal, revealed by sophisticated statistical and machine learning (ML) algorithms, predicts physiological interactions. Accurate artificial intelligence (AI)-based modeling of protein structures promises to provide accurate 3D models of PPIs at a proteome-wide scale.
Collapse
Affiliation(s)
- Jesse Durham
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX, USA; Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, USA; Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Jing Zhang
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX, USA; Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, USA; Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Ian R Humphreys
- Department of Biochemistry, University of Washington, Seattle, WA, USA; Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Jimin Pei
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX, USA; Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, USA; Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Qian Cong
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX, USA; Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, USA; Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX, USA.
| |
Collapse
|
5
|
Gandarilla-Pérez CA, Pinilla S, Bitbol AF, Weigt M. Combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins. PLoS Comput Biol 2023; 19:e1011010. [PMID: 36996234 PMCID: PMC10089317 DOI: 10.1371/journal.pcbi.1011010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Revised: 04/11/2023] [Accepted: 03/08/2023] [Indexed: 04/01/2023] Open
Abstract
Predicting protein-protein interactions from sequences is an important goal of computational biology. Various sources of information can be used to this end. Starting from the sequences of two interacting protein families, one can use phylogeny or residue coevolution to infer which paralogs are specific interaction partners within each species. We show that these two signals can be combined to improve the performance of the inference of interaction partners among paralogs. For this, we first align the sequence-similarity graphs of the two families through simulated annealing, yielding a robust partial pairing. We next use this partial pairing to seed a coevolution-based iterative pairing algorithm. This combined method improves performance over either separate method. The improvement obtained is striking in the difficult cases where the average number of paralogs per species is large or where the total number of sequences is modest.
Collapse
Affiliation(s)
- Carlos A Gandarilla-Pérez
- Facultad de Física, Universidad de la Habana, San Lázaro y L, Vedado, Habana, Cuba
- Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratoire de Biologie Computationnelle et Quantitative (LCQB, UMR 7238), Paris, France
| | - Sergio Pinilla
- Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratoire de Biologie Computationnelle et Quantitative (LCQB, UMR 7238), Paris, France
- Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratoire Jean Perrin (UMR 8237), Paris, France
| | - Anne-Florence Bitbol
- Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Martin Weigt
- Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratoire de Biologie Computationnelle et Quantitative (LCQB, UMR 7238), Paris, France
| |
Collapse
|
6
|
Li W, Wang Z, Cao J, Dong Y, Chen Y. Perfecting the Life Clock: The Journey from PTO to TTFL. Int J Mol Sci 2023; 24:ijms24032402. [PMID: 36768725 PMCID: PMC9916482 DOI: 10.3390/ijms24032402] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2022] [Revised: 01/20/2023] [Accepted: 01/21/2023] [Indexed: 01/27/2023] Open
Abstract
The ubiquity of biological rhythms in life implies that it results from selection in the evolutionary process. The origin of the biological clock has two possible hypotheses: the selective pressure hypothesis of the oxidative stress cycle and the light evasion hypothesis. Moreover, the biological clock gives life higher adaptability. Two biological clock mechanisms have been discovered: the negative feedback loop of transcription-translation (TTFL) and the post-translational oscillation mechanism (PTO). The TTFL mechanism is the most classic and relatively conservative circadian clock oscillation mechanism, commonly found in eukaryotes. We have introduced the TTFL mechanism of the classical model organisms. However, the biological clock of prokaryotes is based on the PTO mechanism. The Peroxiredoxin (PRX or PRDX) protein-based PTO mechanism circadian clock widely existing in eukaryotic and prokaryotic life is considered a more conservative oscillation mechanism. The coexistence of the PTO and TTFL mechanisms in eukaryotes prompted us to explain the relationship between the two. Finally, we speculated that there might be a driving force for the evolution of the biological clock. The biological clock may have an evolutionary trend from the PTO mechanism to the TTFL mechanism, resulting from the evolution of organisms adapting to the environment.
Collapse
Affiliation(s)
- Weitian Li
- College of Veterinary Medicine, China Agricultural University, Haidian, Beijing 100193, China
| | - Zixu Wang
- College of Veterinary Medicine, China Agricultural University, Haidian, Beijing 100193, China
| | - Jing Cao
- College of Veterinary Medicine, China Agricultural University, Haidian, Beijing 100193, China
| | - Yulan Dong
- College of Veterinary Medicine, China Agricultural University, Haidian, Beijing 100193, China
| | - Yaoxing Chen
- College of Veterinary Medicine, China Agricultural University, Haidian, Beijing 100193, China
- Department of Nutrition and Health, China Agricultural University, Haidian, Beijing 100193, China
- Correspondence: ; Tel.: +86-10-62733778
| |
Collapse
|
7
|
Murakami Y, Mizuguchi K. Recent developments of sequence-based prediction of protein-protein interactions. Biophys Rev 2022; 14:1393-1411. [PMID: 36589735 PMCID: PMC9789376 DOI: 10.1007/s12551-022-01038-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/08/2022] [Indexed: 12/25/2022] Open
Abstract
The identification of protein-protein interactions (PPIs) can lead to a better understanding of cellular functions and biological processes of proteins and contribute to the design of drugs to target disease-causing PPIs. In addition, targeting host-pathogen PPIs is useful for elucidating infection mechanisms. Although several experimental methods have been used to identify PPIs, these methods can yet to draw complete PPI networks. Hence, computational techniques are increasingly required for the prediction of potential PPIs, which have never been seen experimentally. Recent high-performance sequence-based methods have contributed to the construction of PPI networks and the elucidation of pathogenetic mechanisms in specific diseases. However, the usefulness of these methods depends on the quality and quantity of training data of PPIs. In this brief review, we introduce currently available PPI databases and recent sequence-based methods for predicting PPIs. Also, we discuss key issues in this field and present future perspectives of the sequence-based PPI predictions.
Collapse
Affiliation(s)
- Yoichi Murakami
- grid.440890.10000 0004 0640 9413Tokyo University of Information Sciences, 4-1 Onaridai, Wakaba-Ku, Chiba, 265-8501 Japan
| | - Kenji Mizuguchi
- grid.136593.b0000 0004 0373 3971Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita-Shi, Osaka, 565-0871 Japan ,grid.482562.fNational Institutes of Biomedical Innovation, Health and Nutrition, 7-6-8 Saito Asagi, Ibaraki, Osaka 567-0085 Japan
| |
Collapse
|
8
|
Bioinformatic Analysis of Na +, K +-ATPase Regulation through Phosphorylation of the Alpha-Subunit N-Terminus. Int J Mol Sci 2022; 24:ijms24010067. [PMID: 36613508 PMCID: PMC9820343 DOI: 10.3390/ijms24010067] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2022] [Revised: 12/01/2022] [Accepted: 12/17/2022] [Indexed: 12/24/2022] Open
Abstract
The Na+, K+-ATPase is an integral membrane protein which uses the energy of ATP hydrolysis to pump Na+ and K+ ions across the plasma membrane of all animal cells. It plays crucial roles in numerous physiological processes, such as cell volume regulation, nutrient reabsorption in the kidneys, nerve impulse transmission, and muscle contraction. Recent data suggest that it is regulated via an electrostatic switch mechanism involving the interaction of its lysine-rich N-terminus with the cytoplasmic surface of its surrounding lipid membrane, which can be modulated through the regulatory phosphorylation of the conserved serine and tyrosine residues on the protein's N-terminal tail. Prior data indicate that the kinases responsible for phosphorylation belong to the protein kinase C (PKC) and Src kinase families. To provide indications of which particular enzyme of these families might be responsible, we analysed them for evidence of coevolution via the mirror tree method, utilising coevolution as a marker for a functional interaction. The results obtained showed that the most likely kinase isoforms to interact with the Na+, K+-ATPase were the θ and η isoforms of PKC and the Src kinase itself. These theoretical results will guide the direction of future experimental studies.
Collapse
|
9
|
Wu C, Guo D. Computational Docking Reveals Co-Evolution of C4 Carbon Delivery Enzymes in Diverse Plants. Int J Mol Sci 2022; 23:12688. [PMID: 36293547 PMCID: PMC9604239 DOI: 10.3390/ijms232012688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Revised: 10/14/2022] [Accepted: 10/19/2022] [Indexed: 11/16/2022] Open
Abstract
Proteins are modular functionalities regulating multiple cellular activities in prokaryotes and eukaryotes. As a consequence of higher plants adapting to arid and thermal conditions, C4 photosynthesis is the carbon fixation process involving multi-enzymes working in a coordinated fashion. However, how these enzymes interact with each other and whether they co-evolve in parallel to maintain interactions in different plants remain elusive to date. Here, we report our findings on the global protein co-evolution relationship and local dynamics of co-varying site shifts in key C4 photosynthetic enzymes. We found that in most of the selected key C4 photosynthetic enzymes, global pairwise co-evolution events exist to form functional couplings. Besides, protein-protein interactions between these enzymes may suggest their unknown functionalities in the carbon delivery process. For PEPC and PPCK regulation pairs, pocket formation at the interactive interface are not necessary for their function. This feature is distinct from another well-known regulation pair in C4 photosynthesis, namely, PPDK and PPDK-RP, where the pockets are necessary. Our findings facilitate the discovery of novel protein regulation types and contribute to expanding our knowledge about C4 photosynthesis.
Collapse
Affiliation(s)
| | - Dianjing Guo
- State Key Laboratory of Agrobiotechnology, School of Life Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong SAR, China
| |
Collapse
|
10
|
Hephzibah Cathryn R, Udhaya Kumar S, Younes S, Zayed H, George Priya Doss C. A review of bioinformatics tools and web servers in different microarray platforms used in cancer research. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2022; 131:85-164. [PMID: 35871897 DOI: 10.1016/bs.apcsb.2022.05.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Over the past decade, conventional lab work strategies have gradually shifted from being limited to a laboratory setting towards a bioinformatics era to help manage and process the vast amounts of data generated by omics technologies. The present work outlines the latest contributions of bioinformatics in analyzing microarray data and their application to cancer. We dissect different microarray platforms and their use in gene expression in cancer models. We highlight how computational advances empowered the microarray technology in gene expression analysis. The study on protein-protein interaction databases classified into primary, derived, meta-database, and prediction databases describes the strategies to curate and predict novel interaction networks in silico. In addition, we summarize the areas of bioinformatics where neural graph networks are currently being used, such as protein functions, protein interaction prediction, and in silico drug discovery and development. We also discuss the role of deep learning as a potential tool in the prognosis, diagnosis, and treatment of cancer. Integrating these resources efficiently, practically, and ethically is likely to be the most challenging task for the healthcare industry over the next decade; however, we believe that it is achievable in the long term.
Collapse
Affiliation(s)
- R Hephzibah Cathryn
- Laboratory of Integrative Genomics, Department of Integrative Biology, School of Biosciences and Technology, Vellore Institute of Technology, Vellore, India
| | - S Udhaya Kumar
- Laboratory of Integrative Genomics, Department of Integrative Biology, School of Biosciences and Technology, Vellore Institute of Technology, Vellore, India
| | - Salma Younes
- Department of Biomedical Sciences, College of Health and Sciences, Qatar University, QU Health, Doha, Qatar
| | - Hatem Zayed
- Department of Biomedical Sciences, College of Health and Sciences, Qatar University, QU Health, Doha, Qatar
| | - C George Priya Doss
- Laboratory of Integrative Genomics, Department of Integrative Biology, School of Biosciences and Technology, Vellore Institute of Technology, Vellore, India.
| |
Collapse
|
11
|
Gerardos A, Dietler N, Bitbol AF. Correlations from structure and phylogeny combine constructively in the inference of protein partners from sequences. PLoS Comput Biol 2022; 18:e1010147. [PMID: 35576238 PMCID: PMC9135348 DOI: 10.1371/journal.pcbi.1010147] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 05/26/2022] [Accepted: 04/27/2022] [Indexed: 11/19/2022] Open
Abstract
Inferring protein-protein interactions from sequences is an important task in computational biology. Recent methods based on Direct Coupling Analysis (DCA) or Mutual Information (MI) allow to find interaction partners among paralogs of two protein families. Does successful inference mainly rely on correlations from structural contacts or from phylogeny, or both? Do these two types of signal combine constructively or hinder each other? To address these questions, we generate and analyze synthetic data produced using a minimal model that allows us to control the amounts of structural constraints and phylogeny. We show that correlations from these two sources combine constructively to increase the performance of partner inference by DCA or MI. Furthermore, signal from phylogeny can rescue partner inference when signal from contacts becomes less informative, including in the realistic case where inter-protein contacts are restricted to a small subset of sites. We also demonstrate that DCA-inferred couplings between non-contact pairs of sites improve partner inference in the presence of strong phylogeny, while deteriorating it otherwise. Moreover, restricting to non-contact pairs of sites preserves inference performance in the presence of strong phylogeny. In a natural data set, as well as in realistic synthetic data based on it, we find that non-contact pairs of sites contribute positively to partner inference performance, and that restricting to them preserves performance, evidencing an important role of phylogeny.
Collapse
Affiliation(s)
- Andonis Gerardos
- Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Nicola Dietler
- Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Anne-Florence Bitbol
- Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
12
|
Varassas SP, Kouvelis VN. Mitochondrial Transcription of Entomopathogenic Fungi Reveals Evolutionary Aspects of Mitogenomes. Front Microbiol 2022; 13:821638. [PMID: 35387072 PMCID: PMC8979003 DOI: 10.3389/fmicb.2022.821638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Accepted: 02/22/2022] [Indexed: 11/13/2022] Open
Abstract
Entomopathogenic fungi and more specifically genera Beauveria and Metarhizium have been exploited for the biological control of pests. Genome analyses are important to understand better their mode of action and thus, improve their efficacy against their hosts. Until now, the sequences of their mitochondrial genomes were studied, but not at the level of transcription. Except of yeasts and Neurospora crassa, whose mt gene transcription is well described, in all other Ascomycota, i.e., Pezizomycotina, related information is extremely scarce. In this work, mt transcription and key enzymes of this function were studied. RT-PCR experiments and Northern hybridizations reveal the transcriptional map of the mt genomes of B. bassiana and M. brunneum species. The mt genes are transcribed in six main transcripts and undergo post-transcriptional modifications to create single gene transcripts. Promoters were determined in both mt genomes with a comparative in silico analysis, including all known information from other fungal mt genomes. The promoter consensus sequence is 5'-ATAGTTATTAT-3' which is in accordance with the definition of the polycistronic transcripts determined with the experiments described above. Moreover, 5'-RACE experiments in the case of premature polycistronic transcript nad1-nad4-atp8-atp6 revealed the 5' end of the RNA transcript immediately after the in silico determined promoter, as also found in other fungal species. Since several conserved elements were retrieved from these analyses compared to the already known data from yeasts and N. crassa, the phylogenetic analyses of mt RNA polymerase (Rpo41) and its transcriptional factor (Mtf1) were performed in order to define their evolution. As expected, it was found that fungal Rpo41 originate from the respective polymerase of T7/T3 phages, while the ancestor of Mtf1 is of alpha-proteobacterial origin. Therefore, this study presents insights about the fidelity of the mt single-subunit phage-like RNA polymerase during transcription, since the correct identification of mt promoters from Rpo41 requires an ortholog to bacterial sigma factor, i.e., Mtf1. Thus, a previously proposed hypothesis of a phage infected alpha-proteobacterium as the endosymbiotic progenitor of mitochondrion is confirmed in this study and further upgraded by the co-evolution of the bacterial (Mtf1) and viral (Rpo41) originated components in one functional unit.
Collapse
Affiliation(s)
| | - Vassili N. Kouvelis
- Department of Genetics and Biotechnology, Faculty of Biology, National and Kapodistrian University of Athens, Athens, Greece
| |
Collapse
|
13
|
The S100A7 nuclear interactors in autoimmune diseases: a coevolutionary study in mammals. Immunogenetics 2022; 74:271-284. [PMID: 35174412 DOI: 10.1007/s00251-022-01256-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2021] [Accepted: 02/10/2022] [Indexed: 11/05/2022]
Abstract
S100A7, a member of the S100A family of Ca2+-binding proteins, is considered a key effector in immune response. In particular, S100A7 dysregulation has been associated with several diseases, including autoimmune disorders. At the nuclear level, S100A7 interacts with several protein-binding partners which are involved in transcriptional regulation and DNA repair. By using the BioGRID and GAAD databases, S100A7 nuclear interactors with a putative involvement in autoimmune diseases were retrieved. We selected fatty acid-binding protein 5 (FABP5), autoimmune regulator (AIRE), cystic fibrosis transmembrane conductance regulator (CFTR), chromodomain helicase DNA-binding protein 4 (CHD4), epidermal growth factor receptor (EGFR), estrogen receptor 1 (ESR1), histone deacetylase 2 (HDAC2), v-myc avian myelocytomatosis viral oncogene homolog (MYC), protection of telomeres protein 1 (POT1), telomeric repeat-binding factor (NIMA-interacting) 1 (TERF1), telomeric repeat-binding factor 2 (TERF2), and Zic family member 1 (ZIC1). Linear correlation coefficients between interprotein distances were calculated with MirrorTree. Coevolution clusters were also identified with the use of a recent version of the Blocks in Sequences (BIS2) algorithm implemented in the BIS2Analyzer web server. Analysis of pair positions identified interprotein coevolving clusters between S100A7 and the binding partners CFTR and TERF1. Such findings could guide further analysis to better elucidate the function of S100A7 and its binding partners and to design drugs targeting for these molecules in autoimmune diseases.
Collapse
|
14
|
Wangchuk J, Chatterjee A, Patil S, Madugula SK, Kondabagil K. The coevolution of large and small terminases of bacteriophages is a result of purifying selection leading to phenotypic stabilization. Virology 2021; 564:13-25. [PMID: 34598064 DOI: 10.1016/j.virol.2021.09.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Revised: 09/14/2021] [Accepted: 09/14/2021] [Indexed: 10/20/2022]
Abstract
Genome packaging in many dsDNA phages requires a series of precisely coordinated actions of two phage-coded proteins, namely, large terminase (TerL) and small terminase (TerS) with DNA and ATP, and with each other. Despite the strict functional conservation, TerL and TerS homologs exhibit large sequence variations. We investigated the sequence variability across eight phage types and observed a coevolutionary framework wherein the genealogy of TerL homologs mirrored that of the corresponding TerS homologs. Furthermore, a high purifying selection observed (dN/dS«1) indicated strong structural constraints on both TerL and TerS, and identify coevolving residues in TerL and TerS of phage T4 and lambda. Using the highly coevolving (correlation coefficient of 0.99) TerL and TerS of phage N4, we show that their biochemical features are similar to the phylogenetically divergent phage λ terminases. We also demonstrate using the Surface Plasma Resonance (SPR) technique that phage N4 TerL transiently interacts with TerS.
Collapse
Affiliation(s)
- Jigme Wangchuk
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai, India
| | - Anirvan Chatterjee
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai, India
| | - Supriya Patil
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai, India
| | - Santhosh Kumar Madugula
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai, India
| | - Kiran Kondabagil
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai, India.
| |
Collapse
|
15
|
Bhattacharya T, Rice DW, Crawford JM, Hardy RW, Newton ILG. Evidence of Adaptive Evolution in Wolbachia-Regulated Gene DNMT2 and Its Role in the Dipteran Immune Response and Pathogen Blocking. Viruses 2021; 13:1464. [PMID: 34452330 PMCID: PMC8402854 DOI: 10.3390/v13081464] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Revised: 06/09/2021] [Accepted: 07/09/2021] [Indexed: 12/23/2022] Open
Abstract
Eukaryotic nucleic acid methyltransferase (MTase) proteins are essential mediators of epigenetic and epitranscriptomic regulation. DNMT2 belongs to a large, conserved family of DNA MTases found in many organisms, including holometabolous insects such as fruit flies and mosquitoes, where it is the lone MTase. Interestingly, despite its nomenclature, DNMT2 is not a DNA MTase, but instead targets and methylates RNA species. A growing body of literature suggests that DNMT2 mediates the host immune response against a wide range of pathogens, including RNA viruses. Curiously, although DNMT2 is antiviral in Drosophila, its expression promotes virus replication in mosquito species. We, therefore, sought to understand the divergent regulation, function, and evolution of these orthologs. We describe the role of the Drosophila-specific host protein IPOD in regulating the expression and function of fruit fly DNMT2. Heterologous expression of these orthologs suggests that DNMT2's role as an antiviral is host-dependent, indicating a requirement for additional host-specific factors. Finally, we identify and describe potential evidence of positive selection at different times throughout DNMT2 evolution within dipteran insects. We identify specific codons within each ortholog that are under positive selection and find that they are restricted to four distinct protein domains, which likely influence substrate binding, target recognition, and adaptation of unique intermolecular interactions. Collectively, our findings highlight the evolution of DNMT2 in Dipteran insects and point to structural, regulatory, and functional differences between mosquito and fruit fly homologs.
Collapse
Affiliation(s)
- Tamanash Bhattacharya
- Department of Biology, Indiana University Bloomington, Bloomington, IN 47405, USA; (T.B.); (D.W.R.); (J.M.C.)
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Danny W. Rice
- Department of Biology, Indiana University Bloomington, Bloomington, IN 47405, USA; (T.B.); (D.W.R.); (J.M.C.)
| | - John M. Crawford
- Department of Biology, Indiana University Bloomington, Bloomington, IN 47405, USA; (T.B.); (D.W.R.); (J.M.C.)
| | - Richard W. Hardy
- Department of Biology, Indiana University Bloomington, Bloomington, IN 47405, USA; (T.B.); (D.W.R.); (J.M.C.)
| | - Irene L. G. Newton
- Department of Biology, Indiana University Bloomington, Bloomington, IN 47405, USA; (T.B.); (D.W.R.); (J.M.C.)
| |
Collapse
|
16
|
Mukherjee I, Chakrabarti S. Co-evolutionary landscape at the interface and non-interface regions of protein-protein interaction complexes. Comput Struct Biotechnol J 2021; 19:3779-3795. [PMID: 34285778 PMCID: PMC8271121 DOI: 10.1016/j.csbj.2021.06.039] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Revised: 06/22/2021] [Accepted: 06/22/2021] [Indexed: 11/16/2022] Open
Abstract
Proteins involved in interactions throughout the course of evolution tend to co-evolve and compensatory changes may occur in interacting proteins to maintain or refine such interactions. However, certain residue pair alterations may prove to be detrimental for functional interactions. Hence, determining co-evolutionary pairings that could be structurally or functionally relevant for maintaining the conservation of an inter-protein interaction is important. Inter-protein co-evolution analysis in several complexes utilizing multiple existing methodologies suggested that co-evolutionary pairings can occur in spatially proximal and distant regions in inter-protein interactions. Subsequently, the Co-Var (Correlated Variation) method based on mutual information and Bhattacharyya coefficient was developed, validated, and found to perform relatively better than CAPS and EV-complex. Interestingly, while applying the Co-Var measure and EV-complex program on a set of protein-protein interaction complexes, co-evolutionary pairings were obtained in interface and non-interface regions in protein complexes. The Co-Var approach involves determining high degree co-evolutionary pairings that include multiple co-evolutionary connections between particular co-evolved residue positions in one protein with multiple residue positions in the binding partner. Detailed analyses of high degree co-evolutionary pairings in protein-protein complexes involved in cancer metastasis suggested that most of the residue positions forming such co-evolutionary connections mainly occurred within functional domains of constituent proteins and substitution mutations were also common among these positions. The physiological relevance of these predictions suggested that Co-Var can predict residues that could be crucial for preserving functional protein-protein interactions. Finally, Co-Var web server (http://www.hpppi.iicb.res.in/ishi/covar/index.html) that implements this methodology identifies co-evolutionary pairings in intra and inter-protein interactions.
Collapse
Affiliation(s)
- Ishita Mukherjee
- Structural Biology and Bioinformatics Division, Council for Scientific and Industrial Research (CSIR) - Indian Institute of Chemical Biology (IICB), Kolkata, West Bengal 700032, India
| | - Saikat Chakrabarti
- Structural Biology and Bioinformatics Division, Council for Scientific and Industrial Research (CSIR) - Indian Institute of Chemical Biology (IICB), Kolkata, West Bengal 700032, India
| |
Collapse
|
17
|
Gai Z, Wang Y, Tian L, Gong G, Zhao J. Whole Genome Level Analysis of the Wnt and DIX Gene Families in Mice and Their Coordination Relationship in Regulating Cardiac Hypertrophy. Front Genet 2021; 12:608936. [PMID: 34168671 PMCID: PMC8217762 DOI: 10.3389/fgene.2021.608936] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Accepted: 05/17/2021] [Indexed: 12/27/2022] Open
Abstract
The Wnt signaling pathway is an evolutionarily conserved signaling pathway that plays essential roles in embryonic development, organogenesis, and many other biological activities. Both Wnt proteins and DIX proteins are important components of Wnt signaling. Systematic studies of Wnt and DIX families at the genome-wide level may provide a comprehensive landscape to elucidate their functions and demonstrate their relationships, but they are currently lacking. In this report, we describe the correlations between mouse Wnt and DIX genes in family expansion, molecular evolution, and expression levels in cardiac hypertrophy at the genome-wide scale. We observed that both the Wnt and DIX families underwent more expansion than the overall average in the evolutionarily early stage. In addition, mirrortree analyses suggested that Wnt and DIX were co-evolved protein families. Collectively, these results would help to elucidate the evolutionary characters of Wnt and DIX families and demonstrate their correlations in mediating cardiac hypertrophy.
Collapse
Affiliation(s)
- Zhongchao Gai
- School of Food and Biological Engineering, Shaanxi University of Science and Technology, Xi'an, China
| | - Yujiao Wang
- School of Food and Biological Engineering, Shaanxi University of Science and Technology, Xi'an, China
| | - Lu Tian
- School of Food and Biological Engineering, Shaanxi University of Science and Technology, Xi'an, China
| | - Guoli Gong
- School of Food and Biological Engineering, Shaanxi University of Science and Technology, Xi'an, China
| | - Jieqiong Zhao
- Department of Cardiology, The Second Affiliated Hospital of Air Force Medical University, Xi'an, China
| |
Collapse
|
18
|
Xu C, Chang X, Hou Z, Zhang Z, Zhu Z, Zhong B. The origin of SPA reveals the divergence and convergence of light signaling in Archaeplastida. Mol Phylogenet Evol 2021; 161:107175. [PMID: 33862251 DOI: 10.1016/j.ympev.2021.107175] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Revised: 03/28/2021] [Accepted: 04/06/2021] [Indexed: 01/02/2023]
Abstract
Plants have evolved various photoreceptors to adapt to changing light environments, and photoreceptors can inactivate the large CONSTITUTIVE PHOTOMORPHOGENIC/DE-ETIOLATED/FUSCA (COP/DET/FUS) protein complex to release their repression of photoresponsive transcription factors. Here, we tracked the origin and evolution of COP/DET/FUS in Archaeplastida and found that most components of COP/DET/FUS were highly conserved. Intriguingly, the COP1-SUPPRESSOR OF PHYA-105 (SPA) protein originated in Chlorophyta but subsequently underwent a distinct evolutionary history in Viridiplantae. SPA experienced duplication events in the ancestors of specific clades after the colonization of land by plants and was divided into two clades (clades A and B) within euphyllophytes (ferns and seed plants). Our phylogenetic and experimental evidences support a new evolutionary model to clarify the divergence and convergence of light signaling during plant evolution.
Collapse
Affiliation(s)
- Chenjie Xu
- College of Life Sciences, Nanjing Normal University, 210046 Nanjing, China
| | - Xin Chang
- College of Life Sciences, Nanjing Normal University, 210046 Nanjing, China
| | - Zheng Hou
- College of Life Sciences, Nanjing Normal University, 210046 Nanjing, China
| | - Zhenhua Zhang
- College of Life Sciences, Nanjing Normal University, 210046 Nanjing, China
| | - Ziqiang Zhu
- College of Life Sciences, Nanjing Normal University, 210046 Nanjing, China
| | - Bojian Zhong
- College of Life Sciences, Nanjing Normal University, 210046 Nanjing, China.
| |
Collapse
|
19
|
The Phosphoprotein Synapsin Ia Regulates the Kinetics of Dense-Core Vesicle Release. J Neurosci 2021; 41:2828-2841. [PMID: 33632727 DOI: 10.1523/jneurosci.2593-19.2021] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2019] [Revised: 02/02/2021] [Accepted: 02/04/2021] [Indexed: 12/19/2022] Open
Abstract
Common fusion machinery mediates the Ca2+-dependent exocytosis of synaptic vesicles (SVs) and dense-core vesicles (DCVs). Previously, Synapsin Ia (Syn Ia) was found to localize to SVs, essential for mobilizing SVs to the plasma membrane through phosphorylation. However, whether (or how) the phosphoprotein Syn Ia plays a role in regulating DCV exocytosis remains unknown. To answer these questions, we measured the dynamics of DCV exocytosis by using single-vesicle amperometry in PC12 cells (derived from the pheochromocytoma of rats of unknown sex) overexpressing wild-type or phosphodeficient Syn Ia. We found that overexpression of phosphodeficient Syn Ia decreased the DCV secretion rate, specifically via residues previously shown to undergo calmodulin-dependent kinase (CaMK)-mediated phosphorylation (S9, S566, and S603). Moreover, the fusion pore kinetics during DCV exocytosis were found to be differentially regulated by Syn Ia and two phosphodeficient Syn Ia mutants (Syn Ia-S62A and Syn Ia-S9,566,603A). Kinetic analysis suggested that Syn Ia may regulate the closure and dilation of DCV fusion pores via these sites, implying the potential interactions of Syn Ia with certain DCV proteins involved in the regulation of fusion pore dynamics. Furthermore, we predicted the interaction of Syn Ia with several DCV proteins, including Synaptophysin (Syp) and soluble N-ethylmaleimide-sensitive factor attachment receptor (SNARE) proteins. By immunoprecipitation, we found that Syn Ia interacted with Syp via phosphorylation. Moreover, a proximity ligation assay (PLA) confirmed their phosphorylation-dependent, in situ interaction on DCVs. Together, these findings reveal a phosphorylation-mediated regulation of DCV exocytosis by Syn Ia.SIGNIFICANCE STATEMENT Although they exhibit distinct exocytosis dynamics upon stimulation, synaptic vesicles (SVs) and dense-core vesicles (DCVs) may undergo co-release in neurons and neuroendocrine cells through an undefined molecular mechanism. Synapsin Ia (Syn Ia) is known to recruit SVs to the plasma membrane via phosphorylation. Here, we examined whether Syn Ia also affects the dynamics of DCV exocytosis. We showed that Syn Ia regulates the DCV secretion rate and fusion pore kinetics during DCV exocytosis. Moreover, Syn Ia-mediated regulation of DCV exocytosis depends on phosphorylation. We further found that Syn Ia interacts with Synaptophysin (Syp) on DCVs in a phosphorylation-dependent manner. Thus, these results suggest that Syn Ia may regulate the release of DCVs via phosphorylation.
Collapse
|
20
|
D'Amico F, Candido S, Libra M. Interaction between matrix metalloproteinase-9 (MMP-9) and neutrophil gelatinase-associated lipocalin (NGAL): A recent evolutionary event in primates. DEVELOPMENTAL AND COMPARATIVE IMMUNOLOGY 2021; 116:103933. [PMID: 33245981 DOI: 10.1016/j.dci.2020.103933] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/10/2020] [Revised: 10/30/2020] [Accepted: 11/18/2020] [Indexed: 06/11/2023]
Abstract
Matrix metalloproteases are known to represent an early step in the evolution of the immune system. Similarly, neutrophil gelatinase-associated lipocalin is known to be a key effector in immune response. MMP-9 interacts with NGAL, but their interaction mechanisms remain unclear. Functional interaction between proteins is ensured by coevolution. Protein coevolution was inferred by calculating the linear correlation coefficients between inter-protein distance matrices using MirrorTree. Among examined mammal species, we found a robust signal of MMP-9/NGAL coevolution exclusively within Primates (R = 0.96, p < 1e-06). Owing to the high conservation of these proteins among Mammals, we chose to utilize a recent version of Blocks in Sequences (BIS2) algorithm implemented in BIS2Analyzer webserver. Coevolution clusters between the two proteins were identified in MMP-9 fibronectin and hemopexin domains. Our results suggest that MMP-9/NGAL interaction is a recent evolutionary acquisition in Primates. Furthermore, MMP-9 hemopexin domain would represent a promising target for drug design against these molecules.
Collapse
Affiliation(s)
- Fabio D'Amico
- Department of Biomedical and Biotechnological Sciences, University of Catania, Italy.
| | - Saverio Candido
- Department of Biomedical and Biotechnological Sciences, University of Catania, Italy; Research Center for Prevention, Diagnosis and Treatment of Cancer, University of Catania, 95123, Catania, Italy
| | - Massimo Libra
- Department of Biomedical and Biotechnological Sciences, University of Catania, Italy; Research Center for Prevention, Diagnosis and Treatment of Cancer, University of Catania, 95123, Catania, Italy
| |
Collapse
|
21
|
New Insights into the Co-Occurrences of Glycoside Hydrolase Genes among Prokaryotic Genomes through Network Analysis. Microorganisms 2021; 9:microorganisms9020427. [PMID: 33669523 PMCID: PMC7922503 DOI: 10.3390/microorganisms9020427] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2021] [Revised: 02/06/2021] [Accepted: 02/14/2021] [Indexed: 12/21/2022] Open
Abstract
Glycoside hydrolase (GH) represents a crucial category of enzymes for carbohydrate utilization in most organisms. A series of glycoside hydrolase families (GHFs) have been classified, with relevant information deposited in the CAZy database. Statistical analysis indicated that most GHFs (134 out of 154) were prone to exist in bacteria rather than archaea, in terms of both occurrence frequencies and average gene numbers. Co-occurrence analysis suggested the existence of strong or moderate-strong correlations among 63 GHFs. A combination of network analysis by Gephi and functional classification among these GHFs demonstrated the presence of 12 functional categories (from group A to L), with which the corresponding microbial collections were subsequently labeled, respectively. Interestingly, a progressive enrichment of particular GHFs was found among several types of microbes, and type-L as well as type-E microbes were deemed as functional intensified species which formed during the microbial evolution process toward efficient decomposition of lignocellulose as well as pectin, respectively. Overall, integrating network analysis and enzymatic functional classification, we were able to provide a new angle of view for GHs from known prokaryotic genomes, and thus this study is likely to guide the selection of GHs and microbes for efficient biomass utilization.
Collapse
|
22
|
Salmanian S, Pezeshk H, Sadeghi M. Inter-protein residue covariation information unravels physically interacting protein dimers. BMC Bioinformatics 2020; 21:584. [PMID: 33334319 PMCID: PMC7745481 DOI: 10.1186/s12859-020-03930-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Accepted: 12/09/2020] [Indexed: 01/04/2023] Open
Abstract
BACKGROUND Predicting physical interaction between proteins is one of the greatest challenges in computational biology. There are considerable various protein interactions and a huge number of protein sequences and synthetic peptides with unknown interacting counterparts. Most of co-evolutionary methods discover a combination of physical interplays and functional associations. However, there are only a handful of approaches which specifically infer physical interactions. Hybrid co-evolutionary methods exploit inter-protein residue coevolution to unravel specific physical interacting proteins. In this study, we introduce a hybrid co-evolutionary-based approach to predict physical interplays between pairs of protein families, starting from protein sequences only. RESULTS In the present analysis, pairs of multiple sequence alignments are constructed for each dimer and the covariation between residues in those pairs are calculated by CCMpred (Contacts from Correlated Mutations predicted) and three mutual information based approaches for ten accessible surface area threshold groups. Then, whole residue couplings between proteins of each dimer are unified into a single Frobenius norm value. Norms of residue contact matrices of all dimers in different accessible surface area thresholds are fed into support vector machine as single or multiple feature models. The results of training the classifiers by single features show no apparent different accuracies in distinct methods for different accessible surface area thresholds. Nevertheless, mutual information product and context likelihood of relatedness procedures may roughly have an overall higher and lower performances than other two methods for different accessible surface area cut-offs, respectively. The results also demonstrate that training support vector machine with multiple norm features for several accessible surface area thresholds leads to a considerable improvement of prediction performance. In this context, CCMpred roughly achieves an overall better performance than mutual information based approaches. The best accuracy, sensitivity, specificity, precision and negative predictive value for that method are 0.98, 1, 0.962, 0.96, and 0.962, respectively. CONCLUSIONS In this paper, by feeding norm values of protein dimers into support vector machines in different accessible surface area thresholds, we demonstrate that even small number of proteins in pairs of multiple alignments could allow one to accurately discriminate between positive and negative dimers.
Collapse
Affiliation(s)
- Sara Salmanian
- Department of Bioinformatics, Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | - Hamid Pezeshk
- School of Mathematics, Statistics and Computer Science, College of Science, University of Tehran, Tehran, Iran
- Present Address: Department of Mathematics and Statistics, Concordia University, Montreal, Canada
- School of Biological Sciences, Institute for Research in Fundamental Sciences, Tehran, Iran
| | - Mehdi Sadeghi
- National Institute of Genetic Engineering and Biotechnology, Tehran, Iran
| |
Collapse
|
23
|
Arcas A, Wilkinson DG, Nieto MÁ. The Evolutionary History of Ephs and Ephrins: Toward Multicellular Organisms. Mol Biol Evol 2020; 37:379-394. [PMID: 31589243 PMCID: PMC6993872 DOI: 10.1093/molbev/msz222] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Eph receptor (Eph) and ephrin signaling regulate fundamental developmental processes through both forward and reverse signaling triggered upon cell–cell contact. In vertebrates, they are both classified into classes A and B, and some representatives have been identified in many metazoan groups, where their expression and functions have been well studied. We have extended previous phylogenetic analyses and examined the presence of Eph and ephrins in the tree of life to determine their origin and evolution. We have found that 1) premetazoan choanoflagellates may already have rudimental Eph/ephrin signaling as they have an Eph-/ephrin-like pair and homologs of downstream-signaling genes; 2) both forward- and reverse-downstream signaling might already occur in Porifera since sponges have most genes involved in these types of signaling; 3) the nonvertebrate metazoan Eph is a type-B receptor that can bind ephrins regardless of their membrane-anchoring structure, glycosylphosphatidylinositol, or transmembrane; 4) Eph/ephrin cross-class binding is specific to Gnathostomata; and 5) kinase-dead Eph receptors can be traced back to Gnathostomata. We conclude that Eph/ephrin signaling is of older origin than previously believed. We also examined the presence of protein domains associated with functional characteristics and the appearance and conservation of downstream-signaling pathways to understand the original and derived functions of Ephs and ephrins. We find that the evolutionary history of these gene families points to an ancestral function in cell–cell interactions that could contribute to the emergence of multicellularity and, in particular, to the required segregation of cell populations.
Collapse
Affiliation(s)
- Aida Arcas
- Instituto de Neurociencias (CSIC-UMH), Avda, San Juan de Alicante, Spain
| | - David G Wilkinson
- Neural Development Laboratory, The Francis Crick Institute, London, United Kingdom
| | - M Ángela Nieto
- Instituto de Neurociencias (CSIC-UMH), Avda, San Juan de Alicante, Spain
| |
Collapse
|
24
|
Gandarilla-Pérez CA, Mergny P, Weigt M, Bitbol AF. Statistical physics of interacting proteins: Impact of dataset size and quality assessed in synthetic sequences. Phys Rev E 2020; 101:032413. [PMID: 32290011 DOI: 10.1103/physreve.101.032413] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2019] [Accepted: 03/04/2020] [Indexed: 11/07/2022]
Abstract
Identifying protein-protein interactions is crucial for a systems-level understanding of the cell. Recently, algorithms based on inverse statistical physics, e.g., direct coupling analysis (DCA), have allowed to use evolutionarily related sequences to address two conceptually related inference tasks: finding pairs of interacting proteins and identifying pairs of residues which form contacts between interacting proteins. Here we address two underlying questions: How are the performances of both inference tasks related? How does performance depend on dataset size and the quality? To this end, we formalize both tasks using Ising models defined over stochastic block models, with individual blocks representing single proteins and interblock couplings protein-protein interactions; controlled synthetic sequence data are generated by Monte Carlo simulations. We show that DCA is able to address both inference tasks accurately when sufficiently large training sets of known interaction partners are available and that an iterative pairing algorithm allows to make predictions even without a training set. Noise in the training data deteriorates performance. In both tasks we find a quadratic scaling relating dataset quality and size that is consistent with noise adding in square-root fashion and signal adding linearly when increasing the dataset. This implies that it is generally good to incorporate more data even if their quality are imperfect, thereby shedding light on the empirically observed performance of DCA applied to natural protein sequences.
Collapse
Affiliation(s)
- Carlos A Gandarilla-Pérez
- Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratoire de Biologie Computationnelle et Quantitative (LCQB, UMR 7238), F-75005 Paris, France.,Facultad de Física, Universidad de la Habana, San Lázaro y L, Vedado, Habana 4, CP-10400, Cuba
| | - Pierre Mergny
- Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratoire de Biologie Computationnelle et Quantitative (LCQB, UMR 7238), F-75005 Paris, France.,Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratoire Jean Perrin (LJP, UMR 8237), F-75005 Paris, France
| | - Martin Weigt
- Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratoire de Biologie Computationnelle et Quantitative (LCQB, UMR 7238), F-75005 Paris, France
| | - Anne-Florence Bitbol
- Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratoire Jean Perrin (LJP, UMR 8237), F-75005 Paris, France.,Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), CH-1015 Lausanne, Switzerland
| |
Collapse
|
25
|
Begum Y, Mondal SK. Comprehensive study of the genes involved in chlorophyll synthesis and degradation pathways in some monocot and dicot plant species. J Biomol Struct Dyn 2020; 39:2387-2414. [PMID: 32292132 DOI: 10.1080/07391102.2020.1748717] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Chlorophyll (Chl) biosynthesis is one of the most important cellular processes essential for plant photosynthesis. Chl degradation pathway is also important catabolic process occurs during leaf senescence, fruit ripening and under biotic or abiotic stress conditions. Here we have systematically investigated the molecular evolution, gene structure, compositional analysis along with ENc plot, correspondence analysis and codon usage bias of the proteins and encoded genes involved in Chl metabolism from monocots and dicots. The gene and species specific phylogenetic trees using amino acid sequences showed clear clustering formation of the selected species based on monocots and dicots but not supported by 18S rRNA. Nucleotide composition of the encoding genes showed that average GC%, GC1%, GC2% and GC3% were higher in monocots. RSCU analysis depicts that genes from monocots for both pathways and genes for synthesis pathway from dicots only biased to G/C-ending synonymous codons but in degradation pathway most optimal codons (except UUG) in dicots biased to A/U-ending synonymous codons. We found strong evidence of episodic diversifying selection at several amino acid sites in all genes investigated. Conserved domain and gene structures were observed for the genes with varying lengths of introns and exons, involved in Chl metabolism along with some intronless genes within synthesis pathway. ENc and correspondence analyses suggested the mutational or selection constraint on the genes to shape the codon usage. These comprehensive studies may be helpful in further research in molecular phylogenetics and genomics and to better understand the evolutionary dynamics of Chl metabolic pathway.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Yasmin Begum
- Department of Biophysics, Molecular Biology and Bioinformatics, University of Calcutta, Kolkata, West Bengal, India.,Center of Excellence in Systems Biology and Biomedical Engineering (TEQIP Phase-II), University of Calcutta, Kolkata, West Bengal, India
| | - Sunil Kanti Mondal
- Department of Biotechnology, The University of Burdwan, Burdwan, West Bengal, India
| |
Collapse
|
26
|
D'Amico F, Nadalin F, Libra M. S100A7/Ran-binding protein 9 coevolution in mammals. Immunogenetics 2020; 72:155-164. [PMID: 32043173 DOI: 10.1007/s00251-020-01155-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2019] [Accepted: 01/13/2020] [Indexed: 10/25/2022]
Abstract
S100A7 has been suggested to interact with Ran-binding protein 9. Both proteins are nowadays considered key effectors in immune response. Functional interaction between proteins is ensured by coevolution. The mechanisms of vertebrate coevolution between S100A7 and RanBP9 remain unclear. Several approaches for studying coevolution have been developed. Protein coevolution was inferred by calculating the linear correlation coefficients between inter-protein distance matrices using Mirrortree. We found an overall moderate correlation value (R = 0.53, p < 1e-06). Moreover, owing to the high conservation of RanBP9 protein among vertebrates, we chose to utilize a recent version of Blocks in Sequences (BIS2) algorithm implemented in BIS2Analyzer webserver. A coevolution cluster was identified between the two proteins (p < 8.10e-05). In conclusion, our coevolutionary analysis suggests that amino acid variations may modulate S100A7/RanBP9 interaction with potential pathogenic effects. Such findings could guide further analysis to better elucidate the function of S100A7 and RanBP9 and to design drugs targeting for these molecules in diseases.
Collapse
Affiliation(s)
- Fabio D'Amico
- Department of Biomedical and Biotechnological Sciences, University of Catania, Catania, Italy.
| | - Francesca Nadalin
- Laboratoire de Biologie Computationnelle et Quantitative (LCQB) - UMR 7238, Sorbonne Université, Univ P6, CNRS, IBPS, Paris, France
| | - Massimo Libra
- Department of Biomedical and Biotechnological Sciences, University of Catania, Catania, Italy
| |
Collapse
|
27
|
Dyachkova MS, Chekalin EV, Danilenko VN. Positive Selection in Bifidobacterium Genes Drives Species-Specific Host-Bacteria Communication. Front Microbiol 2019; 10:2374. [PMID: 31681231 PMCID: PMC6803598 DOI: 10.3389/fmicb.2019.02374] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2019] [Accepted: 09/30/2019] [Indexed: 12/15/2022] Open
Abstract
Bifidobacteria are commensal microorganisms that inhabit a wide range of hosts, including insects, birds and mammals. The mechanisms responsible for the adaptation of bifidobacteria to various hosts during the evolutionary process remain poorly understood. Previously, we reported that the species-specific PFNA gene cluster is present in the genomes of various species of the Bifidobacterium genus. The cluster contains signal transduction and adhesion genes that are presumably involved in the communication between bifidobacteria and their hosts. The genes in the PFNA cluster show high sequence divergence between bifidobacterial species, which may be indicative of rapid evolution that drives species-specific adaptation to the host organism. We used the maximum likelihood approach to detect positive selection in the PFNA genes. We tested for both pervasive and episodic positive selection to identify codons that experienced adaptive evolution in all and individual branches of the Bifidobacterium phylogenetic tree, respectively. Our results provide evidence that episodic positive selection has played an important role in the divergence process and molecular evolution of sequences of the species-specific PFNA genes in most bifidobacterial species. Moreover, we found the signatures of pervasive positive selection in the molecular evolution of the tgm gene in all branches of the Bifidobacterium phylogenetic tree. These results are consistent with the suggested role of PFNA gene cluster in the process of specific adaptation of bifidobacterial species to various hosts.
Collapse
Affiliation(s)
- Marina S Dyachkova
- Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia
| | - Evgeny V Chekalin
- Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia
| | - Valery N Danilenko
- Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia
| |
Collapse
|
28
|
Phylogenetic correlations can suffice to infer protein partners from sequences. PLoS Comput Biol 2019; 15:e1007179. [PMID: 31609984 PMCID: PMC6812855 DOI: 10.1371/journal.pcbi.1007179] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2019] [Revised: 10/24/2019] [Accepted: 09/25/2019] [Indexed: 12/30/2022] Open
Abstract
Determining which proteins interact together is crucial to a systems-level understanding of the cell. Recently, algorithms based on Direct Coupling Analysis (DCA) pairwise maximum-entropy models have allowed to identify interaction partners among paralogous proteins from sequence data. This success of DCA at predicting protein-protein interactions could be mainly based on its known ability to identify pairs of residues that are in contact in the three-dimensional structure of protein complexes and that coevolve to remain physicochemically complementary. However, interacting proteins possess similar evolutionary histories. What is the role of purely phylogenetic correlations in the performance of DCA-based methods to infer interaction partners? To address this question, we employ controlled synthetic data that only involve phylogeny and no interactions or contacts. We find that DCA accurately identifies the pairs of synthetic sequences that share evolutionary history. While phylogenetic correlations confound the identification of contacting residues by DCA, they are thus useful to predict interacting partners among paralogs. We find that DCA performs as well as phylogenetic methods to this end, and slightly better than them with large and accurate training sets. Employing DCA or phylogenetic methods within an Iterative Pairing Algorithm (IPA) allows to predict pairs of evolutionary partners without a training set. We further demonstrate the ability of these various methods to correctly predict pairings among real paralogous proteins with genome proximity but no known direct physical interaction, illustrating the importance of phylogenetic correlations in natural data. However, for physically interacting and strongly coevolving proteins, DCA and mutual information outperform phylogenetic methods. We finally discuss how to distinguish physically interacting proteins from proteins that only share a common evolutionary history. Many biologically important protein-protein interactions are conserved over evolutionary time scales. This leads to two different signals that can be used to computationally predict interactions between protein families and to identify specific interaction partners. First, the shared evolutionary history leads to highly similar phylogenetic relationships between interacting proteins of the two families. Second, the need to keep the interaction surfaces of partner proteins biophysically compatible causes a correlated amino-acid usage of interface residues. Employing simulated data, we show that the shared history alone can be used to detect partner proteins. Similar accuracies are achieved by algorithms comparing phylogenetic relationships and by methods based on Direct Coupling Analysis (DCA), which are primarily known for their ability to detect the second type of signal. Using natural sequence data, we show that in cases with shared evolutionary history but without known physical interactions, both methods work with similar accuracy, while for some physically interacting systems, DCA and mutual information outperform phylogenetic methods. We propose methods allowing both to predict interactions between protein families and to find interacting partners among paralogs.
Collapse
|
29
|
Liao Y, Tham DKL, Liang FX, Chang J, Wei Y, Sudhir PR, Sall J, Ren SJ, Chicote JU, Arnold LL, Hu CCA, Romih R, Andrade LR, Rindler MJ, Cohen SM, DeSalle R, Garcia-España A, Ding M, Wu XR, Sun TT. Mitochondrial lipid droplet formation as a detoxification mechanism to sequester and degrade excessive urothelial membranes. Mol Biol Cell 2019; 30:2969-2984. [PMID: 31577526 PMCID: PMC6857570 DOI: 10.1091/mbc.e19-05-0284] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
The apical surface of the terminally differentiated mammalian urothelial umbrella cell is mechanically stable and highly impermeable, in part due to its coverage by urothelial plaques consisting of 2D crystals of uroplakin particles. The mechanism for regulating the uroplakin/plaque level is unclear. We found that genetic ablation of the highly tissue-specific sorting nexin Snx31, which localizes to plaques lining the multivesicular bodies (MVBs) in urothelial umbrella cells, abolishes MVBs suggesting that Snx31 plays a role in stabilizing the MVB-associated plaques by allowing them to achieve a greater curvature. Strikingly, Snx31 ablation also induces a massive accumulation of uroplakin-containing mitochondria-derived lipid droplets (LDs), which mediate uroplakin degradation via autophagy/lipophagy, leading to the loss of apical and fusiform vesicle plaques. These results suggest that MVBs play an active role in suppressing the excessive/wasteful endocytic degradation of uroplakins. Failure of this suppression mechanism triggers the formation of mitochondrial LDs so that excessive uroplakin membranes can be sequestered and degraded. Because mitochondrial LD formation, which occurs at a low level in normal urothelium, can also be induced by disturbance in uroplakin polymerization due to individual uroplakin knockout and by arsenite, a bladder carcinogen, this pathway may represent an inducible, versatile urothelial detoxification mechanism.
Collapse
Affiliation(s)
- Yi Liao
- Department of Cell Biology, New York University School of Medicine, New York, NY10016
| | - Daniel K L Tham
- Department of Cell Biology, New York University School of Medicine, New York, NY10016
| | - Feng-Xia Liang
- Department of Cell Biology, New York University School of Medicine, New York, NY10016
| | - Jennifer Chang
- Department of Cell Biology, New York University School of Medicine, New York, NY10016
| | - Yuan Wei
- Department of Cell Biology, New York University School of Medicine, New York, NY10016
| | - Putty-Reddy Sudhir
- Department of Cell Biology, New York University School of Medicine, New York, NY10016
| | - Joseph Sall
- Department of Cell Biology, New York University School of Medicine, New York, NY10016
| | - Sarah J Ren
- Department of Cell Biology, New York University School of Medicine, New York, NY10016
| | - Javier U Chicote
- Research Unit, Hospital Joan XXIII, Institut de Investigacio Sanitaria Pere Virgili (IISPV), Universitat Rovira i Virgili, Tarragona 43007, Spain
| | - Lora L Arnold
- Department of Pathology and Microbiology, University of Nebraska Medical Center, Omaha, NE 68198
| | - Chih-Chi Andrew Hu
- The Wistar Institute, University of Pennsylvania, Philadelphia, PA 19104
| | - Rok Romih
- Institute of Cell Biology, Faculty of Medicine, University of Ljubljana, SI-1000 Ljubljana, Slovenia
| | | | - Michael J Rindler
- Department of Cell Biology, New York University School of Medicine, New York, NY10016
| | - Samuel M Cohen
- Department of Pathology and Microbiology, University of Nebraska Medical Center, Omaha, NE 68198
| | - Rob DeSalle
- Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, NY 10024
| | - Antonio Garcia-España
- Research Unit, Hospital Joan XXIII, Institut de Investigacio Sanitaria Pere Virgili (IISPV), Universitat Rovira i Virgili, Tarragona 43007, Spain
| | - Mingxiao Ding
- College of Life Sciences, Peking University, Dachengfang, Haidian, Beijing 100871, China
| | - Xue-Ru Wu
- Department of Urology, New York University School of Medicine, New York, NY10016.,Department of Pathology, New York University School of Medicine, New York, NY10016.,Veterans Affairs Medical Center, New York, NY 10010
| | - Tung-Tien Sun
- Department of Cell Biology, New York University School of Medicine, New York, NY10016.,Department of Urology, New York University School of Medicine, New York, NY10016.,Department of Biochemistry and Molecular Pharmacology, New York University School of Medicine, New York, NY10016.,Department of Dermatology, New York University School of Medicine, New York, NY10016
| |
Collapse
|
30
|
Onodera W, Asahi T, Sawamura N. Data for positive selection test and co-evolutionary analysis on mammalian cereblon. Data Brief 2019; 26:104499. [PMID: 31667262 PMCID: PMC6811870 DOI: 10.1016/j.dib.2019.104499] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2019] [Revised: 09/03/2019] [Accepted: 09/04/2019] [Indexed: 12/04/2022] Open
Abstract
Cereblon (CRBN) is a substrate recognition subunit of the CRL4 E3 ubiquitin ligase complex, directly binding to specific substrates for poly-ubiquitination followed by proteasome-dependent degradation of proteins. Cellular CRBN is responsible for energy metabolism, ion-channel activation, and cellular stress response through binding to proteins related to the respective pathways. As CRBN binds to various proteins, the selective pressure at the interacting surface is expected to result in functional divergence. Here, we present two mammalian CRBN datasets of molecular evolutionary analyses. (1) The multiple sequence alignment data shows that positive selection occurred, determined with a dN/dS calculation. (2) Data on co-evolutionary analysis between vertebrate CRBN and related proteins are represented by calculating the correlation coefficient based on the comparison of phylogenetic trees. Co-evolutionary analysis shows the similarity of evolutionary traits of two proteins. Further molecular, functional interpretation of these analyses is explained in ‘Positive selection of Cereblon modified function including its E3 Ubiquitin Ligase activity and binding efficiency with AMPK’ (W. Onodera, T. Asahi, N. Sawamura, Positive selection of cereblon modified function including its E3 ubiquitin ligase activity and binding efficiency with AMPK. Mol Phylogenet Evol. (2019) 135:78-85. [1]).
Collapse
Affiliation(s)
- Wataru Onodera
- Faculty of Science and Engineering, Waseda University, TWIns, 2-2 Wakamatsu, Shinjuku, Tokyo, 162-8480, Japan
| | - Toru Asahi
- Faculty of Science and Engineering, Waseda University, TWIns, 2-2 Wakamatsu, Shinjuku, Tokyo, 162-8480, Japan.,Research Organization for Nano & Life Innovation, Waseda University, Japan
| | - Naoya Sawamura
- Faculty of Science and Engineering, Waseda University, TWIns, 2-2 Wakamatsu, Shinjuku, Tokyo, 162-8480, Japan.,Research Organization for Nano & Life Innovation, Waseda University, Japan
| |
Collapse
|
31
|
Oteri F, Nadalin F, Champeimont R, Carbone A. BIS2Analyzer: a server for co-evolution analysis of conserved protein families. Nucleic Acids Res 2019; 45:W307-W314. [PMID: 28472458 PMCID: PMC5570204 DOI: 10.1093/nar/gkx336] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2017] [Accepted: 04/18/2017] [Indexed: 12/13/2022] Open
Abstract
Along protein sequences, co-evolution analysis identifies residue pairs demonstrating either a specific co-adaptation, where changes in one of the residues are compensated by changes in the other during evolution or a less specific external force that affects the evolutionary rates of both residues in a similar magnitude. In both cases, independently of the underlying cause, co-evolutionary signatures within or between proteins serve as markers of physical interactions and/or functional relationships. Depending on the type of protein under study, the set of available homologous sequences may greatly differ in size and amino acid variability. BIS2Analyzer, openly accessible at http://www.lcqb.upmc.fr/BIS2Analyzer/, is a web server providing the online analysis of co-evolving amino-acid pairs in protein alignments, especially designed for vertebrate and viral protein families, which typically display a small number of highly similar sequences. It is based on BIS2, a re-implemented fast version of the co-evolution analysis tool Blocks in Sequences (BIS). BIS2Analyzer provides a rich and interactive graphical interface to ease biological interpretation of the results.
Collapse
Affiliation(s)
- Francesco Oteri
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, IBPS, UMR 7238, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 75005 Paris, France
| | - Francesca Nadalin
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, IBPS, UMR 7238, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 75005 Paris, France
| | - Raphaël Champeimont
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, IBPS, UMR 7238, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 75005 Paris, France
| | - Alessandra Carbone
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, IBPS, UMR 7238, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 75005 Paris, France.,Institut Universitaire de France, 75005 Paris, France
| |
Collapse
|
32
|
Detecting coevolution of positively selected in turtles sperm-egg fusion proteins. Mech Dev 2019; 156:1-7. [DOI: 10.1016/j.mod.2019.02.001] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2018] [Revised: 02/15/2019] [Accepted: 02/15/2019] [Indexed: 12/12/2022]
|
33
|
Onodera W, Asahi T, Sawamura N. Positive selection of cereblon modified function including its E3 ubiquitin ligase activity and binding efficiency with AMPK. Mol Phylogenet Evol 2019; 135:78-85. [PMID: 30836149 DOI: 10.1016/j.ympev.2019.03.001] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2018] [Revised: 02/06/2019] [Accepted: 03/01/2019] [Indexed: 12/20/2022]
Abstract
Cereblon (CRBN) is a substrate receptor for an E3 ubiquitin ligase that directly binds to target proteins resulting in cellular activities, such as energy metabolism, membrane potential regulation, and transcription factor degradation. Genetic mutations in human CRBN lead to intellectual disabilities. In addition, it draws pathological attention because direct binding with immunomodulatory drugs can cure multiple myeloma (MM) and lymphocytic leukemia. To further explore the function of CRBN, we focused on its molecular evolution. Since CRBN interacts directly with its substrates and is widely conserved in vertebrates, evolutionary study to identify the selective pressure on CRBN that occur during CRBN-substrate interaction is an effective approach to search for a novel active site. Using mammalian CRBN sequences, dN/dS analysis was conducted to detect positive selection. By multiple sequence alignment we found that the residue at position 366 was under positive selection. This residue is present in the substrate-binding domain of CRBN. Most mammals harbor cysteine at position 366, whereas rodents and chiroptera have serine at this site. Subsequently, we constructed a C366S human CRBN to confirm the potential of positive selection. Auto-ubiquitination activity occurs in E3 ubiquitin ligases, including CRBN, and increased in C366S CRBN, which lead to the conclusion that E3 ubiquitin ligase activity may have changed over the course of mammalian evolution. Furthermore, binding with AMP-activated protein kinase was augmented when the substitution was present, which is supported by coevolution analysis. These results suggest that the molecular evolution of CRBN occurred through codon-based positive selection, providing a new approach to investigate CRBN function.
Collapse
Affiliation(s)
- Wataru Onodera
- Faculty of Science and Engineering, Waseda University, TWIns, 2-2 Wakamatsu, Shinjuku, Tokyo 162-8480, Japan
| | - Toru Asahi
- Faculty of Science and Engineering, Waseda University, TWIns, 2-2 Wakamatsu, Shinjuku, Tokyo 162-8480, Japan; Research Organization for Nano & Life Innovation, Waseda University, Japan
| | - Naoya Sawamura
- Faculty of Science and Engineering, Waseda University, TWIns, 2-2 Wakamatsu, Shinjuku, Tokyo 162-8480, Japan; Research Organization for Nano & Life Innovation, Waseda University, Japan.
| |
Collapse
|
34
|
Chen YL, Chen LJ, Chu CC, Huang PK, Wen JR, Li HM. TIC236 links the outer and inner membrane translocons of the chloroplast. Nature 2018; 564:125-129. [PMID: 30464337 DOI: 10.1038/s41586-018-0713-y] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2018] [Accepted: 10/18/2018] [Indexed: 12/28/2022]
Abstract
The two-membrane envelope is a defining feature of chloroplasts. Chloroplasts evolved from a Gram-negative cyanobacterial endosymbiont. During evolution, genes of the endosymbiont have been transferred to the host nuclear genome. Most chloroplast proteins are synthesized in the cytosol as higher-molecular-mass preproteins with an N-terminal transit peptide. Preproteins are transported into chloroplasts by the TOC and TIC (translocons at the outer- and inner-envelope membranes of chloroplasts, respectively) machineries1,2, but how TOC and TIC are assembled together is unknown. Here we report the identification of the TIC component TIC236; TIC236 is an integral inner-membrane protein that projects a 230-kDa domain into the intermembrane space, which binds directly to the outer-membrane channel TOC75. The knockout mutation of TIC236 is embryonically lethal. In TIC236-knockdown mutants, a smaller amount of the inner-membrane channel TIC20 was associated with TOC75; the amount of TOC-TIC supercomplexes was also reduced. This resulted in a reduced import rate into the stroma, though outer-membrane protein insertion was unaffected. The size and the essential nature of TIC236 indicate that-unlike in mitochondria, in which the outer- and inner-membrane translocons exist as separate complexes and a supercomplex is only transiently assembled during preprotein translocation3,4-a long and stable protein bridge in the intermembrane space is required for protein translocation into chloroplasts. Furthermore, TIC236 and TOC75 are homologues of bacterial inner-membrane TamB5 and outer-membrane BamA, respectively. Our evolutionary analyses show that, similar to TOC75, TIC236 is preserved only in plants and has co-evolved with TOC75 throughout the plant lineage. This suggests that the backbone of the chloroplast protein-import machinery evolved from the bacterial TamB-BamA protein-secretion system.
Collapse
Affiliation(s)
- Yih-Lin Chen
- Institute of Molecular Biology, Academia Sinica, Taipei, Taiwan
| | - Lih-Jen Chen
- Institute of Molecular Biology, Academia Sinica, Taipei, Taiwan
| | - Chiung-Chih Chu
- Institute of Molecular Biology, Academia Sinica, Taipei, Taiwan
| | - Po-Kai Huang
- Institute of Molecular Biology, Academia Sinica, Taipei, Taiwan
- Department of Plant Sciences, University of California, Davis, CA, USA
| | - Jie-Ru Wen
- Institute of Molecular Biology, Academia Sinica, Taipei, Taiwan
| | - Hsou-Min Li
- Institute of Molecular Biology, Academia Sinica, Taipei, Taiwan.
| |
Collapse
|
35
|
Brenzinger S, Pecina A, Mrusek D, Mann P, Völse K, Wimmi S, Ruppert U, Becker A, Ringgaard S, Bange G, Thormann KM. ZomB is essential for flagellar motor reversals in Shewanella putrefaciens and Vibrio parahaemolyticus. Mol Microbiol 2018; 109:694-709. [PMID: 29995998 DOI: 10.1111/mmi.14070] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/10/2018] [Indexed: 01/05/2023]
Abstract
The ability of most bacterial flagellar motors to reverse the direction of rotation is crucial for efficient chemotaxis. In Escherichia coli, motor reversals are mediated by binding of phosphorylated chemotaxis protein CheY to components of the flagellar rotor, FliM and FliN, which induces a conformational switch of the flagellar C-ring. Here, we show that for Shewanella putrefaciens, Vibrio parahaemolyticus and likely a number of other species an additional transmembrane protein, ZomB, is critically required for motor reversals as mutants lacking ZomB exclusively exhibit straightforward swimming also upon full phosphorylation or overproduction of CheY. ZomB is recruited to the cell poles by and is destabilized in the absence of the polar landmark protein HubP. ZomB also co-localizes to and may thus interact with the flagellar motor. The ΔzomB phenotype was suppressed by mutations in the very C-terminal region of FliM. We propose that the flagellar motors of Shewanella, Vibrio and numerous other species harboring orthologs to ZomB are locked in counterclockwise rotation and may require interaction with ZomB to enable the conformational switch required for motor reversals. Regulation of ZomB activity or abundance may provide these species with an additional means to modulate chemotaxis efficiency.
Collapse
Affiliation(s)
- Susanne Brenzinger
- Justus-Liebig Universität, Department of Microbiology and Molecular Biology, 35392, Giessen, Germany
| | - Anna Pecina
- Justus-Liebig Universität, Department of Microbiology and Molecular Biology, 35392, Giessen, Germany
| | - Devid Mrusek
- LOEWE Center for Synthetic Microbiology (Synmikro) & Department of Chemistry, Philipps-Universität Marburg, 35043, Marburg, Germany
| | - Petra Mann
- Department of Ecophysiology, Max-Planck-Institut für terrestrische Mikrobiologie, 35043, Marburg, Germany
| | - Kerstin Völse
- Justus-Liebig Universität, Department of Microbiology and Molecular Biology, 35392, Giessen, Germany
| | - Stephan Wimmi
- Department of Ecophysiology, Max-Planck-Institut für terrestrische Mikrobiologie, 35043, Marburg, Germany
| | - Ulrike Ruppert
- Justus-Liebig Universität, Department of Microbiology and Molecular Biology, 35392, Giessen, Germany
| | - Anke Becker
- LOEWE Center for Synthetic Microbiology (Synmikro) & Department of Biology, Philipps-Universität Marburg, 35043, Marburg, Germany
| | - Simon Ringgaard
- Department of Ecophysiology, Max-Planck-Institut für terrestrische Mikrobiologie, 35043, Marburg, Germany
| | - Gert Bange
- LOEWE Center for Synthetic Microbiology (Synmikro) & Department of Chemistry, Philipps-Universität Marburg, 35043, Marburg, Germany
| | - Kai M Thormann
- Justus-Liebig Universität, Department of Microbiology and Molecular Biology, 35392, Giessen, Germany
| |
Collapse
|
36
|
Taylor WR. Algorithms for matching partially labelled sequence graphs. Algorithms Mol Biol 2017; 12:24. [PMID: 29021818 PMCID: PMC5613400 DOI: 10.1186/s13015-017-0115-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2017] [Accepted: 09/06/2017] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND In order to find correlated pairs of positions between proteins, which are useful in predicting interactions, it is necessary to concatenate two large multiple sequence alignments such that the sequences that are joined together belong to those that interact in their species of origin. When each protein is unique then the species name is sufficient to guide this match, however, when there are multiple related sequences (paralogs) in each species then the pairing is more difficult. In bacteria a good guide can be gained from genome co-location as interacting proteins tend to be in a common operon but in eukaryotes this simple principle is not sufficient. RESULTS The methods developed in this paper take sets of paralogs for different proteins found in the same species and make a pairing based on their evolutionary distance relative to a set of other proteins that are unique and so have a known relationship (singletons). The former constitute a set of unlabelled nodes in a graph while the latter are labelled. Two variants were tested, one based on a phylogenetic tree of the sequences (the topology-based method) and a simpler, faster variant based only on the inter-sequence distances (the distance-based method). Over a set of test proteins, both gave good results, with the topology method performing slightly better. CONCLUSIONS The methods develop here still need refinement and augmentation from constraints other than the sequence data alone, such as known interactions from annotation and databases, or non-trivial relationships in genome location. With the ever growing numbers of eukaryotic genomes, it is hoped that the methods described here will open a route to the use of these data equal to the current success attained with bacterial sequences.
Collapse
|
37
|
Simkovic F, Ovchinnikov S, Baker D, Rigden DJ. Applications of contact predictions to structural biology. IUCRJ 2017; 4:291-300. [PMID: 28512576 PMCID: PMC5414403 DOI: 10.1107/s2052252517005115] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/12/2016] [Accepted: 04/03/2017] [Indexed: 06/07/2023]
Abstract
Evolutionary pressure on residue interactions, intramolecular or intermolecular, that are important for protein structure or function can lead to covariance between the two positions. Recent methodological advances allow much more accurate contact predictions to be derived from this evolutionary covariance signal. The practical application of contact predictions has largely been confined to structural bioinformatics, yet, as this work seeks to demonstrate, the data can be of enormous value to the structural biologist working in X-ray crystallo-graphy, cryo-EM or NMR. Integrative structural bioinformatics packages such as Rosetta can already exploit contact predictions in a variety of ways. The contribution of contact predictions begins at construct design, where structural domains may need to be expressed separately and contact predictions can help to predict domain limits. Structure solution by molecular replacement (MR) benefits from contact predictions in diverse ways: in difficult cases, more accurate search models can be constructed using ab initio modelling when predictions are available, while intermolecular contact predictions can allow the construction of larger, oligomeric search models. Furthermore, MR using supersecondary motifs or large-scale screens against the PDB can exploit information, such as the parallel or antiparallel nature of any β-strand pairing in the target, that can be inferred from contact predictions. Contact information will be particularly valuable in the determination of lower resolution structures by helping to assign sequence register. In large complexes, contact information may allow the identity of a protein responsible for a certain region of density to be determined and then assist in the orientation of an available model within that density. In NMR, predicted contacts can provide long-range information to extend the upper size limit of the technique in a manner analogous but complementary to experimental methods. Finally, predicted contacts can distinguish between biologically relevant interfaces and mere lattice contacts in a final crystal structure, and have potential in the identification of functionally important regions and in foreseeing the consequences of mutations.
Collapse
Affiliation(s)
- Felix Simkovic
- Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England
| | - Sergey Ovchinnikov
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98195, USA
- Howard Hughes Medical Institute, University of Washington, Box 357370, Seattle, WA 98195, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98195, USA
- Howard Hughes Medical Institute, University of Washington, Box 357370, Seattle, WA 98195, USA
| | - Daniel J. Rigden
- Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England
| |
Collapse
|
38
|
Yin C, Yau SST. A coevolution analysis for identifying protein-protein interactions by Fourier transform. PLoS One 2017; 12:e0174862. [PMID: 28430779 PMCID: PMC5400233 DOI: 10.1371/journal.pone.0174862] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2016] [Accepted: 03/16/2017] [Indexed: 12/29/2022] Open
Abstract
Protein-protein interactions (PPIs) play key roles in life processes, such as signal transduction, transcription regulations, and immune response, etc. Identification of PPIs enables better understanding of the functional networks within a cell. Common experimental methods for identifying PPIs are time consuming and expensive. However, recent developments in computational approaches for inferring PPIs from protein sequences based on coevolution theory avoid these problems. In the coevolution theory model, interacted proteins may show coevolutionary mutations and have similar phylogenetic trees. The existing coevolution methods depend on multiple sequence alignments (MSA); however, the MSA-based coevolution methods often produce high false positive interactions. In this paper, we present a computational method using an alignment-free approach to accurately detect PPIs and reduce false positives. In the method, protein sequences are numerically represented by biochemical properties of amino acids, which reflect the structural and functional differences of proteins. Fourier transform is applied to the numerical representation of protein sequences to capture the dissimilarities of protein sequences in biophysical context. The method is assessed for predicting PPIs in Ebola virus. The results indicate strong coevolution between the protein pairs (NP-VP24, NP-VP30, NP-VP40, VP24-VP30, VP24-VP40, and VP30-VP40). The method is also validated for PPIs in influenza and E.coli genomes. Since our method can reduce false positive and increase the specificity of PPI prediction, it offers an effective tool to understand mechanisms of disease pathogens and find potential targets for drug design. The Python programs in this study are available to public at URL (https://github.com/cyinbox/PPI).
Collapse
Affiliation(s)
- Changchuan Yin
- Department of Mathematics, Statistics and Computer Science, The University of Illinois at Chicago, Chicago, IL 60607-7045, United States of America
| | - Stephen S. -T. Yau
- Department of Mathematical Sciences, Tsinghua University, Beijing 100084, China
| |
Collapse
|
39
|
Leung MCK, Procter AC, Goldstone JV, Foox J, DeSalle R, Mattingly CJ, Siddall ME, Timme-Laragy AR. Applying evolutionary genetics to developmental toxicology and risk assessment. Reprod Toxicol 2017; 69:174-186. [PMID: 28267574 PMCID: PMC5829367 DOI: 10.1016/j.reprotox.2017.03.003] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2016] [Revised: 02/27/2017] [Accepted: 03/02/2017] [Indexed: 12/26/2022]
Abstract
Evolutionary thinking continues to challenge our views on health and disease. Yet, there is a communication gap between evolutionary biologists and toxicologists in recognizing the connections among developmental pathways, high-throughput screening, and birth defects in humans. To increase our capability in identifying potential developmental toxicants in humans, we propose to apply evolutionary genetics to improve the experimental design and data interpretation with various in vitro and whole-organism models. We review five molecular systems of stress response and update 18 consensual cell-cell signaling pathways that are the hallmark for early development, organogenesis, and differentiation; and revisit the principles of teratology in light of recent advances in high-throughput screening, big data techniques, and systems toxicology. Multiscale systems modeling plays an integral role in the evolutionary approach to cross-species extrapolation. Phylogenetic analysis and comparative bioinformatics are both valuable tools in identifying and validating the molecular initiating events that account for adverse developmental outcomes in humans. The discordance of susceptibility between test species and humans (ontogeny) reflects their differences in evolutionary history (phylogeny). This synthesis not only can lead to novel applications in developmental toxicity and risk assessment, but also can pave the way for applying an evo-devo perspective to the study of developmental origins of health and disease.
Collapse
Affiliation(s)
- Maxwell C K Leung
- Nicholas School of the Environment, Duke University, Durham, NC, United States.
| | - Andrew C Procter
- Institute for Advanced Analytics, North Carolina State University, Raleigh, NC, United States
| | - Jared V Goldstone
- Department of Biology, Woods Hole Oceanographic Institution, Woods Hole, MA, United States
| | - Jonathan Foox
- Department of Invertebrate Zoology, American Museum of Natural History, New York, New York, United States
| | - Robert DeSalle
- Department of Invertebrate Zoology, American Museum of Natural History, New York, New York, United States
| | - Carolyn J Mattingly
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina, United States
| | - Mark E Siddall
- Department of Invertebrate Zoology, American Museum of Natural History, New York, New York, United States
| | - Alicia R Timme-Laragy
- Department of Environmental Health Sciences, University of Massachusetts, Amherst, MA, United States
| |
Collapse
|
40
|
Chapa TJ, Du Y, Sun R, Yu D, French AR. Proteomic and phylogenetic coevolution analyses of pM79 and pM92 identify interactions with RNA polymerase II and delineate the murine cytomegalovirus late transcription complex. J Gen Virol 2017; 98:242-250. [PMID: 27926822 DOI: 10.1099/jgv.0.000676] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The regulation of the late viral gene expression in betaherpesviruses is largely undefined. We have previously shown that the murine cytomegalovirus proteins pM79 and pM92 are required for late gene transcription. Here, we provide insight into the mechanism of pM79 and pM92 activity by determining their interaction partners during infection. Co-immunoprecipitation-coupled MS studies demonstrate that pM79 and pM92 interact with an array of cellular and viral proteins involved in transcription. Specifically, we identify RNA polymerase II as a cellular target for both pM79 and pM92. We use inter-protein coevolution analysis to show how pM79 and pM92 likely assemble into a late transcription complex composed of late transcription regulators pM49, pM87 and pM95. Combining proteomic methods with coevolution computational analysis provides novel insights into the relationship between pM79, pM92 and RNA polymerase II and allows the generation of a model of the multi-component viral complex that regulates late gene transcription.
Collapse
Affiliation(s)
- Travis J Chapa
- Department of Molecular and Medical Pharmacology, University of California, Los Angeles, Los Angeles, CA 90095, USA.,Division of Pediatric Rheumatology, Department of Pediatrics, Washington University School of Medicine, Saint Louis, MO 63110, USA.,Department of Molecular Microbiology, Washington University School of Medicine, Saint Louis, MO 63110, USA
| | - Yushen Du
- Department of Molecular and Medical Pharmacology, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Ren Sun
- Department of Molecular and Medical Pharmacology, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Dong Yu
- Department of Molecular Microbiology, Washington University School of Medicine, Saint Louis, MO 63110, USA
| | - Anthony R French
- Division of Pediatric Rheumatology, Department of Pediatrics, Washington University School of Medicine, Saint Louis, MO 63110, USA
| |
Collapse
|
41
|
Rutledge GG, Böhme U, Sanders M, Reid AJ, Cotton JA, Maiga-Ascofare O, Djimdé AA, Apinjoh TO, Amenga-Etego L, Manske M, Barnwell JW, Renaud F, Ollomo B, Prugnolle F, Anstey NM, Auburn S, Price RN, McCarthy JS, Kwiatkowski DP, Newbold CI, Berriman M, Otto TD. Plasmodium malariae and P. ovale genomes provide insights into malaria parasite evolution. Nature 2017; 542:101-104. [PMID: 28117441 PMCID: PMC5326575 DOI: 10.1038/nature21038] [Citation(s) in RCA: 132] [Impact Index Per Article: 18.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2016] [Accepted: 12/04/2016] [Indexed: 01/12/2023]
Abstract
Elucidation of the evolutionary history and interrelatedness of Plasmodium species that infect humans has been hampered by a lack of genetic information for three human-infective species: P. malariae and two P. ovale species (P. o. curtisi and P. o. wallikeri). These species are prevalent across most regions in which malaria is endemic and are often undetectable by light microscopy, rendering their study in human populations difficult. The exact evolutionary relationship of these species to the other human-infective species has been contested. Using a new reference genome for P. malariae and a manually curated draft P. o. curtisi genome, we are now able to accurately place these species within the Plasmodium phylogeny. Sequencing of a P. malariae relative that infects chimpanzees reveals similar signatures of selection in the P. malariae lineage to another Plasmodium lineage shown to be capable of colonization of both human and chimpanzee hosts. Molecular dating suggests that these host adaptations occurred over similar evolutionary timescales. In addition to the core genome that is conserved between species, differences in gene content can be linked to their specific biology. The genome suggests that P. malariae expresses a family of heterodimeric proteins on its surface that have structural similarities to a protein crucial for invasion of red blood cells. The data presented here provide insight into the evolution of the Plasmodium genus as a whole.
Collapse
Affiliation(s)
- Gavin G Rutledge
- Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
| | - Ulrike Böhme
- Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
| | - Mandy Sanders
- Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
| | - Adam J Reid
- Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
| | - James A Cotton
- Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
| | - Oumou Maiga-Ascofare
- Malaria Research and Training Center, University of Science, Techniques, and Technologies of Bamako, Bamako BP E.2528, Mali
- German Center for Infection Research, 20359 Hamburg, Germany
| | - Abdoulaye A Djimdé
- Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
- Malaria Research and Training Center, University of Science, Techniques, and Technologies of Bamako, Bamako BP E.2528, Mali
| | - Tobias O Apinjoh
- University of Buea, Post Office Box 63, Buea, South West Region, Republic of Cameroon
| | - Lucas Amenga-Etego
- Navrongo Health Research Centre, Post Office Box 114, Navrongo, Upper East Region, Ghana
| | - Magnus Manske
- Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
| | - John W Barnwell
- Centers for Disease Control and Prevention, Atlanta, Georgia 30333, USA
| | - François Renaud
- Laboratoire MIVEGEC (UM1-CNRS-IRD), 34394 Montpellier, France
| | - Benjamin Ollomo
- Centre International de Recherches Médicales de Franceville, BP 709 Franceville, Gabon
| | - Franck Prugnolle
- Laboratoire MIVEGEC (UM1-CNRS-IRD), 34394 Montpellier, France
- Centre International de Recherches Médicales de Franceville, BP 709 Franceville, Gabon
| | - Nicholas M Anstey
- Global and Tropical Health Division, Menzies School of Health Research and Charles Darwin University, Darwin, Northern Territory 0810, Australia
| | - Sarah Auburn
- Global and Tropical Health Division, Menzies School of Health Research and Charles Darwin University, Darwin, Northern Territory 0810, Australia
| | - Ric N Price
- Global and Tropical Health Division, Menzies School of Health Research and Charles Darwin University, Darwin, Northern Territory 0810, Australia
- Centre for Tropical Medicine and Global Health, Nuffield Department of Clinical Medicine, University of Oxford, Oxford OX3 7LJ, UK
| | - James S McCarthy
- Clinical Tropical Medicine Laboratory, QIMR Berghofer Medical Research Institute, University of Queensland, Brisbane, Queensland 4006, Australia
| | - Dominic P Kwiatkowski
- Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK
| | - Chris I Newbold
- Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
- Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DS, UK
| | - Matthew Berriman
- Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
| | - Thomas D Otto
- Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
| |
Collapse
|
42
|
Yang Z, Liu L, Fang H, Li P, Xu S, Cao W, Xu C, Huang J, Zhou Y. Origin of the plant Tm-1-like gene via two independent horizontal transfer events and one gene fusion event. Sci Rep 2016; 6:33691. [PMID: 27647002 PMCID: PMC5028733 DOI: 10.1038/srep33691] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2016] [Accepted: 08/31/2016] [Indexed: 01/07/2023] Open
Abstract
The Tomato mosaic virus (ToMV) resistance gene Tm-1 encodes a direct inhibitor of ToMV RNA replication to protect tomato from infection. The plant Tm-1-like (Tm-1L) protein is predicted to contain an uncharacterized N-terminal UPF0261 domain and a C-terminal TIM-barrel signal transduction (TBST) domain. Homologous searches revealed that proteins containing both of these two domains are mainly present in charophyte green algae and land plants but absent from glaucophytes, red algae and chlorophyte green algae. Although Tm-1 homologs are widely present in bacteria, archaea and fungi, UPF0261- and TBST-domain-containing proteins are generally encoded by different genes in these linages. A co-evolution analysis also suggested a putative interaction between UPF0261- and TBST-domain-containing proteins. Phylogenetic analyses based on homologs of these two domains revealed that plants have acquired UPF0261- and TBST-domain-encoding genes through two independent horizontal gene transfer (HGT) events before the origin of land plants from charophytes. Subsequently, gene fusion occurred between these two horizontally acquired genes and resulted in the origin of the Tm-1L gene in streptophytes. Our results demonstrate a novel evolutionary mechanism through which the recipient organism may acquire genes with functional interaction through two different HGT events and further fuse them into one functional gene.
Collapse
Affiliation(s)
- Zefeng Yang
- Jiangsu Key Laboratory of Crop Genetics and Physiology/Co-Innovation Center for Modern Production Technology of Grain Crops, Key Laboratory of Plant Functional Genomics of the Ministry of Education, Yangzhou University, Yangzhou, 225009, China
| | - Li Liu
- Jiangsu Key Laboratory of Crop Genetics and Physiology/Co-Innovation Center for Modern Production Technology of Grain Crops, Key Laboratory of Plant Functional Genomics of the Ministry of Education, Yangzhou University, Yangzhou, 225009, China
| | - Huimin Fang
- Jiangsu Key Laboratory of Crop Genetics and Physiology/Co-Innovation Center for Modern Production Technology of Grain Crops, Key Laboratory of Plant Functional Genomics of the Ministry of Education, Yangzhou University, Yangzhou, 225009, China
| | - Pengcheng Li
- Jiangsu Key Laboratory of Crop Genetics and Physiology/Co-Innovation Center for Modern Production Technology of Grain Crops, Key Laboratory of Plant Functional Genomics of the Ministry of Education, Yangzhou University, Yangzhou, 225009, China
| | - Shuhui Xu
- Jiangsu Key Laboratory of Crop Genetics and Physiology/Co-Innovation Center for Modern Production Technology of Grain Crops, Key Laboratory of Plant Functional Genomics of the Ministry of Education, Yangzhou University, Yangzhou, 225009, China
| | - Wei Cao
- Jiangsu Key Laboratory of Crop Genetics and Physiology/Co-Innovation Center for Modern Production Technology of Grain Crops, Key Laboratory of Plant Functional Genomics of the Ministry of Education, Yangzhou University, Yangzhou, 225009, China
| | - Chenwu Xu
- Jiangsu Key Laboratory of Crop Genetics and Physiology/Co-Innovation Center for Modern Production Technology of Grain Crops, Key Laboratory of Plant Functional Genomics of the Ministry of Education, Yangzhou University, Yangzhou, 225009, China
| | - Jinling Huang
- Department of Biology, East Carolina University, Greenville, NC, 27858, USA
| | - Yong Zhou
- Jiangsu Key Laboratory of Crop Genetics and Physiology/Co-Innovation Center for Modern Production Technology of Grain Crops, Key Laboratory of Plant Functional Genomics of the Ministry of Education, Yangzhou University, Yangzhou, 225009, China
| |
Collapse
|
43
|
Jiménez-Sánchez A. Coevolution of RAC Small GTPases and their Regulators GEF Proteins. Evol Bioinform Online 2016; 12:121-31. [PMID: 27226705 PMCID: PMC4872645 DOI: 10.4137/ebo.s38031] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2015] [Revised: 03/31/2016] [Accepted: 04/03/2016] [Indexed: 01/16/2023] Open
Abstract
RAC proteins are small GTPases involved in important cellular processes in eukaryotes, and their deregulation may contribute to cancer. Activation of RAC proteins is regulated by DOCK and DBL protein families of guanine nucleotide exchange factors (GEFs). Although DOCK and DBL proteins act as GEFs on RAC proteins, DOCK and DBL family members are evolutionarily unrelated. To understand how DBL and DOCK families perform the same function on RAC proteins despite their unrelated primary structure, phylogenetic analyses of the RAC, DBL, and DOCK families were implemented, and interaction patterns that may suggest a coevolutionary process were searched. Interestingly, while RAC and DOCK proteins are very well conserved in humans and among eukaryotes, DBL proteins are highly divergent. Moreover, correlation analyses of the phylogenetic distances of RAC and GEF proteins and covariation analyses between residues in the interacting domains showed significant coevolution rates for both RAC–DOCK and RAC–DBL interactions.
Collapse
Affiliation(s)
- Alejandro Jiménez-Sánchez
- Cancer Research UK Cambridge Institute, University of Cambridge, Li Ka Shing Centre, Cambridge, UK.; Previously at Department of Biology, University of York, York, UK
| |
Collapse
|
44
|
Yoshida S, Hiraga K, Takehana T, Taniguchi I, Yamaji H, Maeda Y, Toyohara K, Miyamoto K, Kimura Y, Oda K. A bacterium that degrades and assimilates poly(ethylene terephthalate). Science 2016; 351:1196-9. [DOI: 10.1126/science.aad6359] [Citation(s) in RCA: 1080] [Impact Index Per Article: 135.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
45
|
Stimulation of Na(+),K(+)-ATPase Activity as a Possible Driving Force in Cholesterol Evolution. J Membr Biol 2015; 249:251-9. [PMID: 26715509 DOI: 10.1007/s00232-015-9864-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2015] [Accepted: 12/09/2015] [Indexed: 12/19/2022]
Abstract
Cholesterol is exclusively produced by animals and is present in the plasma membrane of all animal cells. In contrast, the membranes of fungi and plants contain other sterols. To explain the exclusive preference of animal cells for cholesterol, we propose that cholesterol may have evolved to optimize the activity of a crucial protein found in the plasma membrane of all multicellular animals, namely the Na(+),K(+)-ATPase. To test this hypothesis, mirror tree and phylogenetic distribution analyses have been conducted of the Na(+),K(+)-ATPase and 3β-hydroxysterol Δ(24)-reductase (DHCR24), the last enzyme in the Bloch cholesterol biosynthetic pathway. The results obtained support the hypothesis of a co-evolution of the Na(+),K(+)-ATPase and DHCR24. The evolutionary correlation between DHCR24 and the Na(+),K(+)-ATPase was found to be stronger than between DHCR24 and any other membrane protein investigated. The results obtained, thus, also support the hypothesis that cholesterol evolved together with the Na(+),K(+)-ATPase in multicellular animals to support Na(+),K(+)-ATPase activity.
Collapse
|
46
|
Grayson P. Izumo1 and Juno: the evolutionary origins and coevolution of essential sperm-egg binding partners. ROYAL SOCIETY OPEN SCIENCE 2015; 2:150296. [PMID: 27019721 PMCID: PMC4807442 DOI: 10.1098/rsos.150296] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/26/2015] [Accepted: 11/17/2015] [Indexed: 05/29/2023]
Abstract
Reproductive proteins are among the most rapidly evolving classes of proteins. For a subset of these, rapid evolution is driven by positive Darwinian selection despite vital, well-conserved, reproductive functions. Izumo1 is the only essential sperm-egg fusion protein currently known on mammalian sperm, and its egg receptor (Juno; formerly Folr4) was recently discovered. Male knockout mice for Izumo1 and female knockout mice for Juno are both healthy but sterile. Here, both sperm-egg binding proteins are shown to be evolving under positive selection. Within mammals, coevolution of Izumo1 and Juno is also uncovered, suggesting that similar forces have shaped the evolutionary histories of these binding partners within Mammalia. Additionally, genomic analyses reveal an ancient origin for the Izumo gene family, initially reported as conserved exclusively in mammals. Newly identified Izumo1 orthologues could serve reproductive functions in birds, fish and reptiles. Surprisingly, these same analyses support Juno's presence in mammals alone, suggesting a recent mammalian-specific duplication and neofunctionalization of the ancestral folate receptor. Despite the indispensability of their reproductive interaction, and their apparent coevolution within Mammalia, this binding pair arose through strikingly different evolutionary forces.
Collapse
|
47
|
Bianchetti L, Tarabay Y, Lecompte O, Stote R, Poch O, Dejaegere A, Viville S. Tex19 and Sectm1 concordant molecular phylogenies support co-evolution of both eutherian-specific genes. BMC Evol Biol 2015; 15:222. [PMID: 26459560 PMCID: PMC4603632 DOI: 10.1186/s12862-015-0506-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2015] [Accepted: 10/01/2015] [Indexed: 01/04/2023] Open
Abstract
BACKGROUND Transposable elements (TE) have attracted much attention since they shape the genome and contribute to species evolution. Organisms have evolved mechanisms to control TE activity. Testis expressed 19 (Tex19) represses TE expression in mouse testis and placenta. In the human and mouse genomes, Tex19 and Secreted and transmembrane 1 (Sectm1) are neighbors but are not homologs. Sectm1 is involved in immunity and its molecular phylogeny is unknown. METHODS Using multiple alignments of complete protein sequences (MACS), we inferred Tex19 and Sectm1 molecular phylogenies. Protein conserved regions were identified and folds were predicted. Finally, expression patterns were studied across tissues and species using RNA-seq public data and RT-PCR. RESULTS We present 2 high quality alignments of 58 Tex19 and 58 Sectm1 protein sequences from 48 organisms. First, both genes are eutherian-specific, i.e., exclusively present in mammals except monotremes (platypus) and marsupials. Second, Tex19 and Sectm1 have both duplicated in Sciurognathi and Bovidae while they have remained as single copy genes in all further placental mammals. Phylogenetic concordance between both genes was significant (p-value < 0.05) and supported co-evolution and functional relationship. At the protein level, Tex19 exhibits 3 conserved regions and 4 invariant cysteines. In particular, a CXXC motif is present in the N-terminal conserved region. Sectm1 exhibits 2 invariant cysteines and an Ig-like domain. Strikingly, Tex19 C-terminal conserved region was lost in Haplorrhini primates while a Sectm1 C-terminal extra domain was acquired. Finally, we have determined that Tex19 and Sectm1 expression levels anti-correlate across the testis of several primates (ρ = -0.72) which supports anti-regulation. CONCLUSIONS Tex19 and Sectm1 co-evolution and anti-regulated expressions support a strong functional relationship between both genes. Since Tex19 operates a control on TE and Sectm1 plays a role in immunity, Tex19 might suppress an immune response directed against cells that show TE activity in eutherian reproductive tissues.
Collapse
Affiliation(s)
- Laurent Bianchetti
- Biocomputing and Molecular Modelling Laboratory, Integrated Structural Biology Department, Genetics institute of Molecular and Cellular Biology (IGBMC), INSERM U964/CNRS UMR 1704/Strasbourg University, 1 rue Laurent Fries, 67404, Illkirch, France.
| | - Yara Tarabay
- Primordial Germ Cells' Ontogeny and Pluripotency Laboratory, Functional Genomics and Cancer Department, Genetics Institute of Molecular and Cellular Biology (IGBMC), INSERM U964/CNRS UMR 1704/Université de Strasbourg, 1 rue Laurent Fries, 67404, Illkirch, France. .,Present address: Institut de génétique humaine (IGH), 141 rue de la Cardonille, 34396, Montpellier, France.
| | - Odile Lecompte
- Bioinformatics and Integrated Genomics Laboratory (LBGI), ICube, CNRS UMR 7357/Université de Strasbourg, 11 rue Humann, 67085, Strasbourg, France.
| | - Roland Stote
- Biocomputing and Molecular Modelling Laboratory, Integrated Structural Biology Department, Genetics institute of Molecular and Cellular Biology (IGBMC), INSERM U964/CNRS UMR 1704/Strasbourg University, 1 rue Laurent Fries, 67404, Illkirch, France.
| | - Olivier Poch
- Bioinformatics and Integrated Genomics Laboratory (LBGI), ICube, CNRS UMR 7357/Université de Strasbourg, 11 rue Humann, 67085, Strasbourg, France.
| | - Annick Dejaegere
- Biocomputing and Molecular Modelling Laboratory, Integrated Structural Biology Department, Genetics institute of Molecular and Cellular Biology (IGBMC), INSERM U964/CNRS UMR 1704/Strasbourg University, 1 rue Laurent Fries, 67404, Illkirch, France.
| | - Stéphane Viville
- Primordial Germ Cells' Ontogeny and Pluripotency Laboratory, Functional Genomics and Cancer Department, Genetics Institute of Molecular and Cellular Biology (IGBMC), INSERM U964/CNRS UMR 1704/Université de Strasbourg, 1 rue Laurent Fries, 67404, Illkirch, France. .,Centre Hospitalier Universitaire, 67000, Strasbourg, France.
| |
Collapse
|
48
|
Identification of Protein–Protein Interactions by Detecting Correlated Mutation at the Interface. J Chem Inf Model 2015; 55:2042-9. [DOI: 10.1021/acs.jcim.5b00320] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
|
49
|
Iserte J, Simonetti FL, Zea DJ, Teppa E, Marino-Buslje C. I-COMS: Interprotein-COrrelated Mutations Server. Nucleic Acids Res 2015; 43:W320-5. [PMID: 26032772 PMCID: PMC4489276 DOI: 10.1093/nar/gkv572] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2015] [Accepted: 05/21/2015] [Indexed: 12/31/2022] Open
Abstract
Interprotein contact prediction using multiple sequence alignments (MSAs) is a useful approach to help detect protein–protein interfaces. Different computational methods have been developed in recent years as an approximation to solve this problem. However, as there are discrepancies in the results provided by them, there is still no consensus on which is the best performing methodology. To address this problem, I-COMS (interprotein COrrelated Mutations Server) is presented. I-COMS allows to estimate covariation between residues of different proteins by four different covariation methods. It provides a graphical and interactive output that helps compare results obtained using different methods. I-COMS automatically builds the required MSA for the calculation and produces a rich visualization of either intraprotein and/or interprotein covariating positions in a circos representation. Furthermore, comparison between any two methods is available as well as the overlap between any or all four methodologies. In addition, as a complementary source of information, a matrix visualization of the corresponding scores is made available and the density plot distribution of the inter, intra and inter+intra scores are calculated. Finally, all the results can be downloaded (including MSAs, scores and graphics) for comparison and visualization and/or for further analysis.
Collapse
Affiliation(s)
- Javier Iserte
- Fundación Instituto Leloir. Av. Patricias Argentinas 435, C1405BWE, Buenos Aires, Argentina
| | - Franco L Simonetti
- Fundación Instituto Leloir. Av. Patricias Argentinas 435, C1405BWE, Buenos Aires, Argentina
| | - Diego J Zea
- Fundación Instituto Leloir. Av. Patricias Argentinas 435, C1405BWE, Buenos Aires, Argentina
| | - Elin Teppa
- Fundación Instituto Leloir. Av. Patricias Argentinas 435, C1405BWE, Buenos Aires, Argentina
| | - Cristina Marino-Buslje
- Fundación Instituto Leloir. Av. Patricias Argentinas 435, C1405BWE, Buenos Aires, Argentina
| |
Collapse
|
50
|
Bagaya BS, Vega JF, Tian M, Nickel GC, Li Y, Krebs KC, Arts EJ, Gao Y. Functional bottlenecks for generation of HIV-1 intersubtype Env recombinants. Retrovirology 2015; 12:44. [PMID: 25997955 PMCID: PMC4445978 DOI: 10.1186/s12977-015-0170-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2015] [Accepted: 04/02/2015] [Indexed: 11/13/2022] Open
Abstract
Background Intersubtype recombination is a powerful driving force for HIV evolution, impacting both HIV-1 diversity within an infected individual and within the global epidemic. This study examines if viral protein function/fitness is the major constraint shaping selection of recombination hotspots in replication-competent HIV-1 progeny. A better understanding of the interplay between viral protein structure-function and recombination may provide insights into both vaccine design and drug development. Results In vitro HIV-1 dual infections were used to recombine subtypes A and D isolates and examine breakpoints in the Env glycoproteins. The entire env genes of 21 A/D recombinants with breakpoints in gp120 were non-functional when cloned into the laboratory strain, NL4-3. Likewise, cloning of A/D gp120 coding regions also produced dead viruses with non-functional Envs. 4/9 replication-competent viruses with functional Env’s were obtained when just the V1-V5 regions of these same A/D recombinants (i.e. same A/D breakpoints as above) were cloned into NL4-3. Conclusion These findings on functional A/D Env recombinants combined with structural models of Env suggest a conserved interplay between the C1 domain with C5 domain of gp120 and extracellular domain of gp41. Models also reveal a co-evolution within C1, C5, and ecto-gp41 domains which might explain the paucity of intersubtype recombination in the gp120 V1-V5 regions, despite their hypervariability. At least HIV-1 A/D intersubtype recombination in gp120 may result in a C1 from one subtype incompatible with a C5/gp41 from another subtype.
Collapse
Affiliation(s)
- Bernard S Bagaya
- Department of Molecular Biology and Microbiology, School of Medicine, Case Western Reserve University, 10900 Euclid Ave, Cleveland, OH, 44106, USA.
| | - José F Vega
- Division of Infectious Diseases, Department of Medicine, Case Western Reserve University, 10900 Euclid Ave, Cleveland, OH, 44106, USA.
| | - Meijuan Tian
- Department of Microbiology and Immunology, Schulich School of Medicine and Dentistry, Western University, London, ON, N6A 5C1, Canada.
| | - Gabrielle C Nickel
- Division of Infectious Diseases, Department of Medicine, Case Western Reserve University, 10900 Euclid Ave, Cleveland, OH, 44106, USA.
| | - Yuejin Li
- Division of Infectious Diseases, Department of Medicine, Case Western Reserve University, 10900 Euclid Ave, Cleveland, OH, 44106, USA.
| | - Kendall C Krebs
- Division of Infectious Diseases, Department of Medicine, Case Western Reserve University, 10900 Euclid Ave, Cleveland, OH, 44106, USA.
| | - Eric J Arts
- Department of Microbiology and Immunology, Schulich School of Medicine and Dentistry, Western University, London, ON, N6A 5C1, Canada.
| | - Yong Gao
- Department of Molecular Biology and Microbiology, School of Medicine, Case Western Reserve University, 10900 Euclid Ave, Cleveland, OH, 44106, USA. .,Division of Infectious Diseases, Department of Medicine, Case Western Reserve University, 10900 Euclid Ave, Cleveland, OH, 44106, USA.
| |
Collapse
|