Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Eisen JA, Wu M. Phylogenetic analysis and gene functional predictions: phylogenomics in action. Theor Popul Biol 2002;61:481-7. [PMID: 12167367 DOI: 10.1006/tpbi.2002.1594] [Citation(s) in RCA: 60] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

For:	Eisen JA, Wu M. Phylogenetic analysis and gene functional predictions: phylogenomics in action. Theor Popul Biol 2002;61:481-7. [PMID: 12167367 DOI: 10.1006/tpbi.2002.1594] [Citation(s) in RCA: 60] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Number

Cited by Other Article(s)

Xing Y, Liu C, Zheng C, Li H, Yin H. Evolution and function analysis of auxin response factors reveal the molecular basis of the developed root system of Zygophyllum xanthoxylum. BMC PLANT BIOLOGY 2024;24:81. [PMID: 38302884 PMCID: PMC10835889 DOI: 10.1186/s12870-023-04717-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Accepted: 12/29/2023] [Indexed: 02/03/2024]

Canavati C, Sherill-Rofe D, Kamal L, Bloch I, Zahdeh F, Sharon E, Terespolsky B, Allan IA, Rabie G, Kawas M, Kassem H, Avraham KB, Renbaum P, Levy-Lahad E, Kanaan M, Tabach Y. Using multi-scale genomics to associate poorly annotated genes with rare diseases. Genome Med 2024;16:4. [PMID: 38178268 PMCID: PMC10765705 DOI: 10.1186/s13073-023-01276-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2023] [Accepted: 12/15/2023] [Indexed: 01/06/2024] Open

Abstract

BACKGROUND

Next-generation sequencing (NGS) has significantly transformed the landscape of identifying disease-causing genes associated with genetic disorders. However, a substantial portion of sequenced patients remains undiagnosed. This may be attributed not only to the challenges posed by harder-to-detect variants, such as non-coding and structural variations but also to the existence of variants in genes not previously associated with the patient's clinical phenotype. This study introduces EvORanker, an algorithm that integrates unbiased data from 1,028 eukaryotic genomes to link mutated genes to clinical phenotypes.

METHODS

EvORanker utilizes clinical data, multi-scale phylogenetic profiling, and other omics data to prioritize disease-associated genes. It was evaluated on solved exomes and simulated genomes, compared with existing methods, and applied to 6260 knockout genes with mouse phenotypes lacking human associations. Additionally, EvORanker was made accessible as a user-friendly web tool.

RESULTS

In the analyzed exomic cohort, EvORanker accurately identified the "true" disease gene as the top candidate in 69% of cases and within the top 5 candidates in 95% of cases, consistent with results from the simulated dataset. Notably, EvORanker outperformed existing methods, particularly for poorly annotated genes. In the case of the 6260 knockout genes with mouse phenotypes, EvORanker linked 41% of these genes to observed human disease phenotypes. Furthermore, in two unsolved cases, EvORanker successfully identified DLGAP2 and LPCAT3 as disease candidates for previously uncharacterized genetic syndromes.

CONCLUSIONS

We highlight clade-based phylogenetic profiling as a powerful systematic approach for prioritizing potential disease genes. Our study showcases the efficacy of EvORanker in associating poorly annotated genes to disease phenotypes observed in patients. The EvORanker server is freely available at https://ccanavati.shinyapps.io/EvORanker/ .

Collapse

Affiliation(s)

Christina Canavati Department of Developmental Biology and Cancer Research, Institute of Medical Research - Israel-Canada, The Hebrew University of Jerusalem, Jerusalem, 9112102, Israel Molecular Genetics Lab, Istishari Arab Hospital, Ramallah, Palestine
Dana Sherill-Rofe Department of Developmental Biology and Cancer Research, Institute of Medical Research - Israel-Canada, The Hebrew University of Jerusalem, Jerusalem, 9112102, Israel
Lara Kamal Molecular Genetics Lab, Istishari Arab Hospital, Ramallah, Palestine Department of Human Molecular Genetics and Biochemistry, Faculty of Medicine and Sagol School of Neuroscience, Tel Aviv University, Tel Aviv, 6997801, Israel
Idit Bloch Department of Developmental Biology and Cancer Research, Institute of Medical Research - Israel-Canada, The Hebrew University of Jerusalem, Jerusalem, 9112102, Israel
Fouad Zahdeh Medical Genetics Institute, Shaare Zedek Medical Center, Jerusalem, 91031, Israel
Elad Sharon Department of Developmental Biology and Cancer Research, Institute of Medical Research - Israel-Canada, The Hebrew University of Jerusalem, Jerusalem, 9112102, Israel
Batel Terespolsky Department of Developmental Biology and Cancer Research, Institute of Medical Research - Israel-Canada, The Hebrew University of Jerusalem, Jerusalem, 9112102, Israel Medical Genetics Institute, Shaare Zedek Medical Center, Jerusalem, 91031, Israel
Islam Abu Allan Molecular Genetics Lab, Istishari Arab Hospital, Ramallah, Palestine
Grace Rabie Hereditary Research Laboratory and Department of Life Sciences, Bethlehem University, Bethlehem, 72372, Palestine
Mariana Kawas Hereditary Research Laboratory and Department of Life Sciences, Bethlehem University, Bethlehem, 72372, Palestine
Hanin Kassem Molecular Genetics Lab, Istishari Arab Hospital, Ramallah, Palestine
Karen B Avraham Department of Human Molecular Genetics and Biochemistry, Faculty of Medicine and Sagol School of Neuroscience, Tel Aviv University, Tel Aviv, 6997801, Israel
Paul Renbaum Medical Genetics Institute, Shaare Zedek Medical Center, Jerusalem, 91031, Israel
Ephrat Levy-Lahad Medical Genetics Institute, Shaare Zedek Medical Center, Jerusalem, 91031, Israel Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem, 9112102, Israel
Moien Kanaan Molecular Genetics Lab, Istishari Arab Hospital, Ramallah, Palestine Hereditary Research Laboratory and Department of Life Sciences, Bethlehem University, Bethlehem, 72372, Palestine
Yuval Tabach Department of Developmental Biology and Cancer Research, Institute of Medical Research - Israel-Canada, The Hebrew University of Jerusalem, Jerusalem, 9112102, Israel.

Collapse

Xu L, Li J, Gonzalez Ramos VM, Lyra C, Wiebenga A, Grigoriev IV, de Vries RP, Mäkelä MR, Peng M. Genome-wide prediction and transcriptome analysis of sugar transporters in four ascomycete fungi. BIORESOURCE TECHNOLOGY 2024;391:130006. [PMID: 37952592 DOI: 10.1016/j.biortech.2023.130006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/14/2023] [Revised: 11/09/2023] [Accepted: 11/09/2023] [Indexed: 11/14/2023]

Thoben C, Pucker B. Automatic annotation of the bHLH gene family in plants. BMC Genomics 2023;24:780. [PMID: 38102570 PMCID: PMC10722790 DOI: 10.1186/s12864-023-09877-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Accepted: 12/06/2023] [Indexed: 12/17/2023] Open

Abstract

BACKGROUND

The bHLH transcription factor family is named after the basic helix-loop-helix (bHLH) domain that is a characteristic element of their members. Understanding the function and characteristics of this family is important for the examination of a wide range of functions. As the availability of genome sequences and transcriptome assemblies has increased significantly, the need for automated solutions that provide reliable functional annotations is emphasised.

RESULTS

A phylogenetic approach was adapted for the automatic identification and functional annotation of the bHLH transcription factor family. The bHLH_annotator, designed for the automated functional annotation of bHLHs, was implemented in Python3. Sequences of bHLHs described in literature were collected to represent the full diversity of bHLH sequences. Previously described orthologs form the basis for the functional annotation assignment to candidates which are also screened for bHLH-specific motifs. The pipeline was successfully deployed on the two Arabidopsis thaliana accessions Col-0 and Nd-1, the monocot species Dioscorea dumetorum, and a transcriptome assembly of Croton tiglium. Depending on the applied search parameters for the initial candidates in the pipeline, species-specific candidates or members of the bHLH family which experienced domain loss can be identified.

CONCLUSIONS

The bHLH_annotator allows a detailed and systematic investigation of the bHLH family in land plant species and classifies candidates based on bHLH-specific characteristics, which distinguishes the pipeline from other established functional annotation tools. This provides the basis for the functional annotation of the bHLH family in land plants and the systematic examination of a wide range of functions regulated by this transcription factor family.

Collapse

Spiers AJ, Dorfmueller HC, Jerdan R, McGregor J, Nicoll A, Steel K, Cameron S. Bioinformatics characterization of BcsA-like orphan proteins suggest they form a novel family of pseudomonad cyclic-β-glucan synthases. PLoS One 2023;18:e0286540. [PMID: 37267309 PMCID: PMC10237404 DOI: 10.1371/journal.pone.0286540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Accepted: 05/18/2023] [Indexed: 06/04/2023] Open

Abstract

Bacteria produce a variety of polysaccharides with functional roles in cell surface coating, surface and host interactions, and biofilms. We have identified an 'Orphan' bacterial cellulose synthase catalytic subunit (BcsA)-like protein found in four model pseudomonads, P. aeruginosa PA01, P. fluorescens SBW25, P. putida KT2440 and P. syringae pv. tomato DC3000. Pairwise alignments indicated that the Orphan and BcsA proteins shared less than 41% sequence identity suggesting they may not have the same structural folds or function. We identified 112 Orphans among soil and plant-associated pseudomonads as well as in phytopathogenic and human opportunistic pathogenic strains. The wide distribution of these highly conserved proteins suggest they form a novel family of synthases producing a different polysaccharide. In silico analysis, including sequence comparisons, secondary structure and topology predictions, and protein structural modelling, revealed a two-domain transmembrane ovoid-like structure for the Orphan protein with a periplasmic glycosyl hydrolase family GH17 domain linked via a transmembrane region to a cytoplasmic glycosyltransferase family GT2 domain. We suggest the GT2 domain synthesises β-(1,3)-glucan that is transferred to the GH17 domain where it is cleaved and cyclised to produce cyclic-β-(1,3)-glucan (CβG). Our structural models are consistent with enzymatic characterisation and recent molecular simulations of the PaPA01 and PpKT2440 GH17 domains. It also provides a functional explanation linking PaPAK and PaPA14 Orphan (also known as NdvB) transposon mutants with CβG production and biofilm-associated antibiotic resistance. Importantly, cyclic glucans are also involved in osmoregulation, plant infection and induced systemic suppression, and our findings suggest this novel family of CβG synthases may provide similar range of adaptive responses for pseudomonads.

Collapse

The Structure of Evolutionary Model Space for Proteins across the Tree of Life. BIOLOGY 2023;12:biology12020282. [PMID: 36829559 PMCID: PMC9952988 DOI: 10.3390/biology12020282] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Revised: 02/04/2023] [Accepted: 02/08/2023] [Indexed: 02/12/2023]

Abstract

The factors that determine the relative rates of amino acid substitution during protein evolution are complex and known to vary among taxa. We estimated relative exchangeabilities for pairs of amino acids from clades spread across the tree of life and assessed the historical signal in the distances among these clade-specific models. We separately trained these models on collections of arbitrarily selected protein alignments and on ribosomal protein alignments. In both cases, we found a clear separation between the models trained using multiple sequence alignments from bacterial clades and the models trained on archaeal and eukaryotic data. We assessed the predictive power of our novel clade-specific models of sequence evolution by asking whether fit to the models could be used to identify the source of multiple sequence alignments. Model fit was generally able to correctly classify protein alignments at the level of domain (bacterial versus archaeal), but the accuracy of classification at finer scales was much lower. The only exceptions to this were the relatively high classification accuracy for two archaeal lineages: Halobacteriaceae and Thermoprotei. Genomic GC content had a modest impact on relative exchangeabilities despite having a large impact on amino acid frequencies. Relative exchangeabilities involving aromatic residues exhibited the largest differences among models. There were a small number of exchangeabilities that exhibited large differences in comparisons among major clades and between generalized models and ribosomal protein models. Taken as a whole, these results reveal that a small number of relative exchangeabilities are responsible for much of the structure of the "model space" for protein sequence evolution. The clade-specific models we generated may be useful tools for protein phylogenetics, and the structure of evolutionary model space that they revealed has implications for phylogenomic inference across the tree of life.

Collapse

Fang Y, Yang Y, Liu C. New feature extraction from phylogenetic profiles improved the performance of pathogen-host interactions. Front Cell Infect Microbiol 2022;12:931072. [PMID: 35982784 PMCID: PMC9378789 DOI: 10.3389/fcimb.2022.931072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Accepted: 07/11/2022] [Indexed: 11/13/2022] Open

Fang Y, Li M, Li X, Yang Y. GFICLEE: ultrafast tree-based phylogenetic profile method inferring gene function at the genomic-wide level. BMC Genomics 2021;22:774. [PMID: 34715785 PMCID: PMC8557005 DOI: 10.1186/s12864-021-08070-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Accepted: 10/10/2021] [Indexed: 11/25/2022] Open

Unterman I, Bloch I, Cazacu S, Kazimirsky G, Ben-Zeev B, Berman BP, Brodie C, Tabach Y. Expanding the MECP2 network using comparative genomics reveals potential therapeutic targets for Rett syndrome. eLife 2021;10:e67085. [PMID: 34355696 PMCID: PMC8346285 DOI: 10.7554/elife.67085] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Accepted: 07/23/2021] [Indexed: 12/12/2022] Open

Linard B, Ebersberger I, McGlynn SE, Glover N, Mochizuki T, Patricio M, Lecompte O, Nevers Y, Thomas PD, Gabaldón T, Sonnhammer E, Dessimoz C, Uchiyama I. Ten Years of Collaborative Progress in the Quest for Orthologs. Mol Biol Evol 2021;38:3033-3045. [PMID: 33822172 PMCID: PMC8321534 DOI: 10.1093/molbev/msab098] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 02/07/2021] [Accepted: 04/01/2021] [Indexed: 12/19/2022] Open

Affiliation(s)

Benjamin Linard LIRMM, University of Montpellier, CNRS, Montpellier, France.,SPYGEN, Le Bourget-du-Lac, France
Ingo Ebersberger Institute of Cell Biology and Neuroscience, Goethe University Frankfurt, Frankfurt, Germany.,Senckenberg Biodiversity and Climate Research Centre (S-BIKF), Frankfurt, Germany.,LOEWE Center for Translational Biodiversity Genomics (TBG), Frankfurt, Germany
Shawn E McGlynn Earth-Life Science Institute, Tokyo Institute of Technology, Meguro, Tokyo, Japan.,Blue Marble Space Institute of Science, Seattle, WA, USA
Natasha Glover Swiss Institute of Bioinformatics, Lausanne, Switzerland.,Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland.,Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
Tomohiro Mochizuki Earth-Life Science Institute, Tokyo Institute of Technology, Meguro, Tokyo, Japan
Mateus Patricio European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom
Odile Lecompte Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Fédération de Médecine Translationnelle de Strasbourg, Strasbourg, France
Yannis Nevers Swiss Institute of Bioinformatics, Lausanne, Switzerland.,Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland.,Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
Paul D Thomas Division of Bioinformatics, Department of Preventive Medicine, University of Southern California, Los Angeles, CA, USA
Toni Gabaldón Barcelona Supercomputing Centre (BCS-CNS), Jordi Girona, Barcelona, Spain.,Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology (BIST), Barcelona, Spain.,Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
Erik Sonnhammer Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Solna, Sweden
Christophe Dessimoz Swiss Institute of Bioinformatics, Lausanne, Switzerland.,Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland.,Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.,Department of Computer Science, University College London, London, United Kingdom.,Department of Genetics, Evolution and Environment, University College London, London, United Kingdom
Ikuo Uchiyama Department of Theoretical Biology, National Institute for Basic Biology, National Institutes of Natural Sciences, Okazaki, Aichi, Japan

Collapse

Bloch I, Sherill-Rofe D, Stupp D, Unterman I, Beer H, Sharon E, Tabach Y. Optimization of co-evolution analysis through phylogenetic profiling reveals pathway-specific signals. Bioinformatics 2021;36:4116-4125. [PMID: 32353123 DOI: 10.1093/bioinformatics/btaa281] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2019] [Revised: 04/17/2020] [Accepted: 04/23/2020] [Indexed: 12/11/2022] Open

Chen Y, Klinkhamer PGL, Memelink J, Vrieling K. Diversity and evolution of cytochrome P450s of Jacobaea vulgaris and Jacobaea aquatica. BMC PLANT BIOLOGY 2020;20:342. [PMID: 32689941 PMCID: PMC7372880 DOI: 10.1186/s12870-020-02532-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/27/2019] [Accepted: 06/28/2020] [Indexed: 06/11/2023]

Abstract

BACKGROUND

Collectively, plants produce a huge variety of secondary metabolites (SMs) which are involved in the adaptation of plants to biotic and abiotic stresses. The most characteristic feature of SMs is their striking inter- and intraspecific chemical diversity. Cytochrome P450 monooxygenases (CYPs) often play an important role in the biosynthesis of SMs and thus in the evolution of chemical diversity. Here we studied the diversity and evolution of CYPs of two Jacobaea species which contain a characteristic group of SMs namely the pyrrolizidine alkaloids (PAs).

RESULTS

We retrieved CYPs from RNA-seq data of J. vulgaris and J. aquatica, resulting in 221 and 157 full-length CYP genes, respectively. The analyses of conserved motifs confirmed that Jacobaea CYP proteins share conserved motifs including the heme-binding signature, the PERF motif, the K-helix and the I-helix. KEGG annotation revealed that the CYPs assigned as being SM metabolic pathway genes were all from the CYP71 clan but no CYPs were assigned as being involved in alkaloid pathways. Phylogenetic analyses of full-length CYPs were conducted for the six largest CYP families of Jacobaea (CYP71, CYP76, CYP706, CYP82, CYP93 and CYP72) and were compared with CYPs of two other members of the Asteraceae, Helianthus annuus and Lactuca sativa, and with Arabidopsis thaliana. The phylogenetic trees showed strong lineage specific diversification of CYPs, implying that the evolution of CYPs has been very fast even within the Asteraceae family. Only in the closely related species J. vulgaris and J. aquatica, CYPs were found often in pairs, confirming a close relationship in the evolutionary history.

CONCLUSIONS

This study discovered 378 full-length CYPs in Jacobaea species, which can be used for future exploration of their functions, including possible involvement in PA biosynthesis and PA diversity.

Collapse

Aminoglycoside antibiotic resistance conferred by Hpa2 of MDR Acinetobacter baumannii: an unusual adaptation of a common histone acetyltransferase. Biochem J 2019;476:795-808. [PMID: 30573651 DOI: 10.1042/bcj20180791] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2018] [Revised: 12/18/2018] [Accepted: 12/20/2018] [Indexed: 12/20/2022]

Sherill-Rofe D, Rahat D, Findlay S, Mellul A, Guberman I, Braun M, Bloch I, Lalezari A, Samiei A, Sadreyev R, Goldberg M, Orthwein A, Zick A, Tabach Y. Mapping global and local coevolution across 600 species to identify novel homologous recombination repair genes. Genome Res 2019;29:439-448. [PMID: 30718334 PMCID: PMC6396423 DOI: 10.1101/gr.241414.118] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2018] [Accepted: 01/22/2019] [Indexed: 12/02/2022]

Affiliation(s)

Dana Sherill-Rofe Department of Developmental Biology and Cancer Research, Institute for Medical Research-Israel-Canada, Hebrew University of Jerusalem, Jerusalem 91120, Israel
Dolev Rahat Department of Developmental Biology and Cancer Research, Institute for Medical Research-Israel-Canada, Hebrew University of Jerusalem, Jerusalem 91120, Israel.,Sharett Institute of Oncology, Hadassah Medical Center, Ein-Kerem, Jerusalem 91120, Israel
Steven Findlay Lady Davis Institute for Medical Research, Segal Cancer Centre, Jewish General Hospital, Montreal, Quebec H3T 1E2, Canada.,Division of Experimental Medicine, McGill University, Montreal, Quebec H4A 3J1, Canada
Anna Mellul Department of Developmental Biology and Cancer Research, Institute for Medical Research-Israel-Canada, Hebrew University of Jerusalem, Jerusalem 91120, Israel
Irene Guberman Department of Developmental Biology and Cancer Research, Institute for Medical Research-Israel-Canada, Hebrew University of Jerusalem, Jerusalem 91120, Israel
Maya Braun Department of Developmental Biology and Cancer Research, Institute for Medical Research-Israel-Canada, Hebrew University of Jerusalem, Jerusalem 91120, Israel
Idit Bloch Department of Developmental Biology and Cancer Research, Institute for Medical Research-Israel-Canada, Hebrew University of Jerusalem, Jerusalem 91120, Israel
Alon Lalezari Department of Developmental Biology and Cancer Research, Institute for Medical Research-Israel-Canada, Hebrew University of Jerusalem, Jerusalem 91120, Israel
Arash Samiei Lady Davis Institute for Medical Research, Segal Cancer Centre, Jewish General Hospital, Montreal, Quebec H3T 1E2, Canada.,Division of Experimental Medicine, McGill University, Montreal, Quebec H4A 3J1, Canada
Ruslan Sadreyev Department of Molecular Biology, Massachusetts General Hospital, Boston, Massachusetts 02114, USA.,Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA.,Department of Pathology, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts 02114, USA
Michal Goldberg Department of Genetics, Alexander Silberman Institute of Life Sciences, Hebrew University of Jerusalem, Jerusalem 91904, Israel
Alexandre Orthwein Lady Davis Institute for Medical Research, Segal Cancer Centre, Jewish General Hospital, Montreal, Quebec H3T 1E2, Canada.,Division of Experimental Medicine, McGill University, Montreal, Quebec H4A 3J1, Canada.,Department of Microbiology and Immunology, McGill University, Montreal, Quebec H3A 2B4, Canada.,Gerald Bronfman Department of Oncology, McGill University, Montreal, Quebec H4A 3T2, Canada
Aviad Zick Sharett Institute of Oncology, Hadassah Medical Center, Ein-Kerem, Jerusalem 91120, Israel
Yuval Tabach Department of Developmental Biology and Cancer Research, Institute for Medical Research-Israel-Canada, Hebrew University of Jerusalem, Jerusalem 91120, Israel

Collapse

Ziemert N, Alanjary M, Weber T. The evolution of genome mining in microbes - a review. Nat Prod Rep 2016;33:988-1005. [PMID: 27272205 DOI: 10.1039/c6np00025h] [Citation(s) in RCA: 404] [Impact Index Per Article: 50.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Palma-Silva C, Ferro M, Bacci M, Turchetto-Zolet AC. De novo assembly and characterization of leaf and floral transcriptomes of the hybridizing bromeliad species (Pitcairnia spp.) adapted to Neotropical Inselbergs. Mol Ecol Resour 2016;16:1012-22. [PMID: 26849180 DOI: 10.1111/1755-0998.12504] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2015] [Revised: 12/17/2015] [Accepted: 12/22/2015] [Indexed: 02/06/2023]

Valdivia HO, Scholte LLS, Oliveira G, Gabaldón T, Bartholomeu DC. The Leishmania metaphylome: a comprehensive survey of Leishmania protein phylogenetic relationships. BMC Genomics 2015;16:887. [PMID: 26518129 PMCID: PMC4628237 DOI: 10.1186/s12864-015-2091-2] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2015] [Accepted: 10/15/2015] [Indexed: 11/22/2022] Open

Abstract

Background

Leishmaniasis is a neglected parasitic disease with diverse clinical manifestations and a complex epidemiology. It has been shown that its parasite-related traits vary between species and that they modulate infectivity, pathogenicity, and virulence. However, understanding of the species-specific adaptations responsible for these features and their evolutionary background is limited. To improve our knowledge regarding the parasite biology and adaptation mechanisms of different Leishmania species, we conducted a proteome-wide phylogenomic analysis to gain insights into Leishmania evolution.

Results

The analysis of the reconstructed phylomes (totaling 45,918 phylogenies) allowed us to detect genes that are shared in pathogenic Leishmania species, such as calpain-like cysteine peptidases and 3'a2rel-related proteins, or genes that could be associated with visceral or cutaneous development. This analysis also established the phylogenetic relationship of several hypothetical proteins whose roles remain to be characterized. Our findings demonstrated that gene duplication constitutes an important evolutionary force in Leishmania, acting on protein families that mediate host-parasite interactions, such as amastins, GP63 metallopeptidases, cathepsin L-like proteases, and our methods permitted a deeper analysis of their phylogenetic relationships.

Conclusions

Our results highlight the importance of proteome wide phylogenetic analyses to detect adaptation and evolutionary processes in different organisms and underscore the need to characterize the role of expanded and species-specific proteins in the context of Leishmania evolution by providing a framework for the phylogenetic relationships of Leishmania proteins.

Phylogenomic data are publicly available for use through PhylomeDB (http://www.phylomedb.org).

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-2091-2) contains supplementary material, which is available to authorized users.

Collapse

Shin JH, Han JH, Kim KS. Genome-wide analyses of DNA-binding proteins harboring AT-hook motifs and their functional roles in the rice blast pathogen, Magnaporthe oryzae. Genes Genomics 2014. [DOI: 10.1007/s13258-014-0233-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]

Kleist TJ, Spencley AL, Luan S. Comparative phylogenomics of the CBL-CIPK calcium-decoding network in the moss Physcomitrella, Arabidopsis, and other green lineages. FRONTIERS IN PLANT SCIENCE 2014;5:187. [PMID: 24860579 DOI: 10.3389/fpls.2014.0018] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 01/31/2014] [Accepted: 04/21/2014] [Indexed: 05/24/2023]

Kleist TJ, Spencley AL, Luan S. Comparative phylogenomics of the CBL-CIPK calcium-decoding network in the moss Physcomitrella, Arabidopsis, and other green lineages. FRONTIERS IN PLANT SCIENCE 2014;5:187. [PMID: 24860579 PMCID: PMC4030171 DOI: 10.3389/fpls.2014.00187] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2014] [Accepted: 04/21/2014] [Indexed: 05/22/2023]

Lohse M, Nagel A, Herter T, May P, Schroda M, Zrenner R, Tohge T, Fernie AR, Stitt M, Usadel B. Mercator: a fast and simple web server for genome scale functional annotation of plant sequence data. PLANT, CELL & ENVIRONMENT 2014;37:1250-8. [PMID: 24237261 DOI: 10.1111/pce.12231] [Citation(s) in RCA: 379] [Impact Index Per Article: 37.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/01/2013] [Revised: 10/23/2013] [Accepted: 10/28/2013] [Indexed: 05/18/2023]

Human disease locus discovery and mapping to molecular pathways through phylogenetic profiling. Mol Syst Biol 2013;9:692. [PMID: 24084807 PMCID: PMC3817400 DOI: 10.1038/msb.2013.50] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2013] [Accepted: 08/29/2013] [Indexed: 12/16/2022] Open

Abstract

By analyzing the conservation of human proteins across 87 species, we sorted proteins into clusters of coevolution. Some clusters are enriched for genes assigned to particular human diseases or molecular pathways; the other genes in the same cluster may function in related pathways and diseases.

Many genes that were thought to map to different diseases are actually coevolved together and mapped into the same phylogenetic clusters.

Many molecular pathways map to the same phylogenetic clusters as genes associated with specific human diseases.

Focusing on proteins coevolved with the microphthalmia-associated transcription factor (MITF), we identified the Notch pathway suppressor of hairless (RBP-Jk/SuH) transcription factor, and showed that RBP-Jk functions as an MITF cofactor.

Our analysis thus establishes a connectivity between different diseases and pathways, linking diseases phenotypes and functional gene groups.

Genes with common profiles of the presence and absence in disparate genomes tend to function in the same pathway. By mapping all human genes into about 1000 clusters of genes with similar patterns of conservation across eukaryotic phylogeny, we determined that sets of genes associated with particular diseases have similar phylogenetic profiles. By focusing on those human phylogenetic gene clusters that significantly overlap some of the thousands of human gene sets defined by their coexpression or annotation to pathways or other molecular attributes, we reveal the evolutionary map that connects molecular pathways and human diseases. The other genes in the phylogenetic clusters enriched for particular known disease genes or molecular pathways identify candidate genes for roles in those same disorders and pathways. Focusing on proteins coevolved with the microphthalmia-associated transcription factor (MITF), we identified the Notch pathway suppressor of hairless (RBP-Jk/SuH) transcription factor, and showed that RBP-Jk functions as an MITF cofactor.

Collapse

Bouzat JL, Hoostal MJ. Evolutionary Analysis and Lateral Gene Transfer of Two-Component Regulatory Systems Associated with Heavy-Metal Tolerance in Bacteria. J Mol Evol 2013;76:267-79. [DOI: 10.1007/s00239-013-9558-z] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2012] [Accepted: 03/23/2013] [Indexed: 11/28/2022]

Silva LL, Marcet-Houben M, Nahum LA, Zerlotini A, Gabaldón T, Oliveira G. The Schistosoma mansoni phylome: using evolutionary genomics to gain insight into a parasite's biology. BMC Genomics 2012;13:617. [PMID: 23148687 PMCID: PMC3534613 DOI: 10.1186/1471-2164-13-617] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2012] [Accepted: 10/22/2012] [Indexed: 01/10/2023] Open

Abstract

BACKGROUND

Schistosoma mansoni is one of the causative agents of schistosomiasis, a neglected tropical disease that affects about 237 million people worldwide. Despite recent efforts, we still lack a general understanding of the relevant host-parasite interactions, and the possible treatments are limited by the emergence of resistant strains and the absence of a vaccine. The S. mansoni genome was completely sequenced and still under continuous annotation. Nevertheless, more than 45% of the encoded proteins remain without experimental characterization or even functional prediction. To improve our knowledge regarding the biology of this parasite, we conducted a proteome-wide evolutionary analysis to provide a broad view of the S. mansoni's proteome evolution and to improve its functional annotation.

RESULTS

Using a phylogenomic approach, we reconstructed the S. mansoni phylome, which comprises the evolutionary histories of all parasite proteins and their homologs across 12 other organisms. The analysis of a total of 7,964 phylogenies allowed a deeper understanding of genomic complexity and evolutionary adaptations to a parasitic lifestyle. In particular, the identification of lineage-specific gene duplications pointed to the diversification of several protein families that are relevant for host-parasite interaction, including proteases, tetraspanins, fucosyltransferases, venom allergen-like proteins, and tegumental-allergen-like proteins. In addition to the evolutionary knowledge, the phylome data enabled us to automatically re-annotate 3,451 proteins through a phylogenetic-based approach rather than solely sequence similarity searches. To allow further exploitation of this valuable data, all information has been made available at PhylomeDB (http://www.phylomedb.org).

CONCLUSIONS

In this study, we used an evolutionary approach to assess S. mansoni parasite biology, improve genome/proteome functional annotation, and provide insights into host-parasite interactions. Taking advantage of a proteome-wide perspective rather than focusing on individual proteins, we identified that this parasite has experienced specific gene duplication events, particularly affecting genes that are potentially related to the parasitic lifestyle. These innovations may be related to the mechanisms that protect S. mansoni against host immune responses being important adaptations for the parasite survival in a potentially hostile environment. Continuing this work, a comparative analysis involving genomic, transcriptomic, and proteomic data from other helminth parasites, other parasites, and vectors will supply more information regarding parasite's biology as well as host-parasite interactions.

Collapse

Orthopoxvirus genome evolution: the role of gene loss. Viruses 2010;2:1933-1967. [PMID: 21994715 PMCID: PMC3185746 DOI: 10.3390/v2091933] [Citation(s) in RCA: 125] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2010] [Revised: 08/25/2010] [Accepted: 09/01/2010] [Indexed: 12/26/2022] Open

Cibrián-Jaramillo A, De la Torre-Bárcena JE, Lee EK, Katari MS, Little DP, Stevenson DW, Martienssen R, Coruzzi GM, DeSalle R. Using phylogenomic patterns and gene ontology to identify proteins of importance in plant evolution. Genome Biol Evol 2010;2:225-39. [PMID: 20624728 PMCID: PMC2997538 DOI: 10.1093/gbe/evq012] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/14/2010] [Indexed: 01/01/2023] Open

Towfic F, VanderPIas S, OIiver CA, Couture OI, TuggIe CK, West GreenIee MH, Honavar V. Detection of gene orthology from gene co-expression and protein interaction networks. BMC Bioinformatics 2010;11 Suppl 3:S7. [PMID: 20438654 PMCID: PMC2863066 DOI: 10.1186/1471-2105-11-s3-s7] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Timmins J, Gordon E, Caria S, Leonard G, Acajjaoui S, Kuo MS, Monchois V, McSweeney S. Structural and mutational analyses of Deinococcus radiodurans UvrA2 provide insight into DNA binding and damage recognition by UvrAs. Structure 2009;17:547-58. [PMID: 19368888 DOI: 10.1016/j.str.2009.02.008] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2008] [Revised: 02/03/2009] [Accepted: 02/04/2009] [Indexed: 10/20/2022]

Zhou JM, Seo YW, Ibrahim RK. Biochemical characterization of a putative wheat caffeic acid O-methyltransferase. PLANT PHYSIOLOGY AND BIOCHEMISTRY : PPB 2009;47:322-326. [PMID: 19211254 DOI: 10.1016/j.plaphy.2008.11.011] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2008] [Accepted: 11/26/2008] [Indexed: 05/27/2023]

Jiang Z. Protein Function Predictions Based on the Phylogenetic Profile Method. Crit Rev Biotechnol 2008;28:233-8. [PMID: 19051102 DOI: 10.1080/07388550802512633] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Singh S, Stavrinides J, Christendat D, Guttman DS. A phylogenomic analysis of the shikimate dehydrogenases reveals broadscale functional diversification and identifies one functionally distinct subclass. Mol Biol Evol 2008;25:2221-32. [PMID: 18669580 DOI: 10.1093/molbev/msn170] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Abstract

The shikimate dehydrogenases (SDH) represent a widely distributed enzyme family with an essential role in secondary metabolism. This superfamily had been previously subdivided into 4 enzyme groups (AroE, YdiB, SdhL, and RifI), which show clear biochemical and functional differences ranging from amino acid biosynthesis to antibiotic production. Despite the importance of this group, little is known about how such essential enzymatic functions can evolve and diversify. We dissected the enzyme superfamily with a phylogenomic analysis of approximately 250 fully sequenced genomes, making use of previously characterized representatives from each enzyme class, and the key substrate-binding residues known to distinguish substrate specificity. We identified 5 major evolutionary and functional SDH subgroups and several other potentially unique functional classes within this complex enzyme family and then validated the functional distinctiveness of each group by characterizing the 5 SDH homologs found in Pseudomonas putida KT2440 biochemically. We identified an entirely novel functionally distinct subgroup, which we designated Ael1 (AroE-like1) and also delineated a new group of shikimate/quinate dehydrogenases (YdiB2), which is phylogenetically distinct from the previously described Escherichia coli YdiB. The combination of biochemical, phylogenetic, and genomic approaches has revealed the broad extent to which the SDH enzyme superfamily has diversified. Five functional groups were validated with the potential for at least 5 additional subgroups. Our analysis also identified a new SDH functional group, which appears to have evolved recently from an ancestral AroE, illustrating a very prominent role of horizontal transmission and neofunctionalizaton in the evolutionary and functional diversification of this enzyme family.

Collapse

Levasseur A, Pontarotti P, Poch O, Thompson JD. Strategies for reliable exploitation of evolutionary concepts in high throughput biology. Evol Bioinform Online 2008;4:121-37. [PMID: 19204813 PMCID: PMC2614184 DOI: 10.4137/ebo.s597] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open

Woody OZ, Doxey AC, McConkey BJ. Assessing the evolution of gene expression using microarray data. Evol Bioinform Online 2008;4:139-52. [PMID: 19204814 PMCID: PMC2614203 DOI: 10.4137/ebo.s628] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open

Fuellen G. Homology and phylogeny and their automated inference. Naturwissenschaften 2008;95:469-81. [PMID: 18288471 DOI: 10.1007/s00114-008-0348-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2007] [Revised: 12/20/2007] [Accepted: 01/12/2008] [Indexed: 11/25/2022]

Phylogenomics, Protein Family Evolution, and the Tree of Life: An Integrated Approach between Molecular Evolution and Computational Intelligence. APPLICATIONS OF COMPUTATIONAL INTELLIGENCE IN BIOLOGY 2008. [DOI: 10.1007/978-3-540-78534-7_11] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Martins-Pinheiro M, Marques RCP, Menck CFM. Genome analysis of DNA repair genes in the alpha proteobacterium Caulobacter crescentus. BMC Microbiol 2007;7:17. [PMID: 17352799 PMCID: PMC1839093 DOI: 10.1186/1471-2180-7-17] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2006] [Accepted: 03/12/2007] [Indexed: 11/10/2022] Open

Abstract

Background

The integrity of DNA molecules is fundamental for maintaining life. The DNA repair proteins protect organisms against genetic damage, by removal of DNA lesions or helping to tolerate them. DNA repair genes are best known from the gamma-proteobacterium Escherichia coli, which is the most understood bacterial model. However, genome sequencing raises questions regarding uniformity and ubiquity of these DNA repair genes and pathways, reinforcing the need for identifying genes and proteins, which may respond to DNA damage in other bacteria.

Results

In this study, we employed a bioinformatic approach, to analyse and describe the open reading frames potentially related to DNA repair from the genome of the alpha-proteobacterium Caulobacter crescentus. This was performed by comparison with known DNA repair related genes found in public databases. As expected, although C. crescentus and E. coli bacteria belong to separate phylogenetic groups, many of their DNA repair genes are very similar. However, some important DNA repair genes are absent in the C. crescentus genome and other interesting functionally related gene duplications are present, which do not occur in E. coli. These include DNA ligases, exonuclease III (xthA), endonuclease III (nth), O₆-methylguanine-DNA methyltransferase (ada gene), photolyase-like genes, and uracil-DNA-glycosylases. On the other hand, the genes imuA and imuB, which are involved in DNA damage induced mutagenesis, have recently been described in C. crescentus, but are absent in E. coli. Particularly interesting are the potential atypical phylogeny of one of the photolyase genes in alpha-proteobacteria, indicating an origin by horizontal transfer, and the duplication of the Ada orthologs, which have diverse structural configurations, including one that is still unique for C. crescentus.

Conclusion

The absence and the presence of certain genes are discussed and predictions are made considering the particular aspects of the C. crescentus among other known DNA repair pathways. The observed differences enlarge what is known for DNA repair in the Bacterial world, and provide a useful framework for further experimental studies in this organism.

Collapse

Lee I, Narayanaswamy R, Marcotte EM. 24 Bioinformatic Prediction of Yeast Gene Function. J Microbiol Methods 2007. [DOI: 10.1016/s0580-9517(06)36024-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]

Bandyopadhyay S, Sharan R, Ideker T. Systematic identification of functional orthologs based on protein network comparison. Genome Res 2006;16:428-35. [PMID: 16510899 PMCID: PMC1415213 DOI: 10.1101/gr.4526006] [Citation(s) in RCA: 148] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]

Zhang W, Culley DE, Gritsenko MA, Moore RJ, Nie L, Scholten JCM, Petritis K, Strittmatter EF, Camp DG, Smith RD, Brockman FJ. LC-MS/MS based proteomic analysis and functional inference of hypothetical proteins in Desulfovibrio vulgaris. Biochem Biophys Res Commun 2006;349:1412-9. [PMID: 16982031 DOI: 10.1016/j.bbrc.2006.09.019] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2006] [Accepted: 09/07/2006] [Indexed: 11/26/2022]

Alako BTF, Rainey D, Nijveen H, Leunissen JAM. TreeDomViewer: a tool for the visualization of phylogeny and protein domain structure. Nucleic Acids Res 2006;34:W104-9. [PMID: 16844970 PMCID: PMC1538806 DOI: 10.1093/nar/gkl171] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open

Jothi R, Zotenko E, Tasneem A, Przytycka TM. COCO-CL: hierarchical clustering of homology relations based on evolutionary correlations. Bioinformatics 2006;22:779-88. [PMID: 16434444 PMCID: PMC1620014 DOI: 10.1093/bioinformatics/btl009] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Fan J, Lefebvre J, Manjunath P. Bovine seminal plasma proteins and their relatives: A new expanding superfamily in mammals. Gene 2006;375:63-74. [PMID: 16678981 DOI: 10.1016/j.gene.2006.02.025] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2005] [Revised: 02/10/2006] [Accepted: 02/11/2006] [Indexed: 11/17/2022]

Wu M, Ren Q, Durkin AS, Daugherty SC, Brinkac LM, Dodson RJ, Madupu R, Sullivan SA, Kolonay JF, Nelson WC, Tallon LJ, Jones KM, Ulrich LE, Gonzalez JM, Zhulin IB, Robb FT, Eisen JA. Life in hot carbon monoxide: the complete genome sequence of Carboxydothermus hydrogenoformans Z-2901. PLoS Genet 2005;1:e65. [PMID: 16311624 PMCID: PMC1287953 DOI: 10.1371/journal.pgen.0010065] [Citation(s) in RCA: 177] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2005] [Accepted: 10/19/2005] [Indexed: 11/20/2022] Open

Abstract

We report here the sequencing and analysis of the genome of the thermophilic bacterium Carboxydothermus hydrogenoformans Z-2901. This species is a model for studies of hydrogenogens, which are diverse bacteria and archaea that grow anaerobically utilizing carbon monoxide (CO) as their sole carbon source and water as an electron acceptor, producing carbon dioxide and hydrogen as waste products. Organisms that make use of CO do so through carbon monoxide dehydrogenase complexes. Remarkably, analysis of the genome of C. hydrogenoformans reveals the presence of at least five highly differentiated anaerobic carbon monoxide dehydrogenase complexes, which may in part explain how this species is able to grow so much more rapidly on CO than many other species. Analysis of the genome also has provided many general insights into the metabolism of this organism which should make it easier to use it as a source of biologically produced hydrogen gas. One surprising finding is the presence of many genes previously found only in sporulating species in the Firmicutes Phylum. Although this species is also a Firmicutes, it was not known to sporulate previously. Here we show that it does sporulate and because it is missing many of the genes involved in sporulation in other species, this organism may serve as a “minimal” model for sporulation studies. In addition, using phylogenetic profile analysis, we have identified many uncharacterized gene families found in all known sporulating Firmicutes, but not in any non-sporulating bacteria, including a sigma factor not known to be involved in sporulation previously.

Carboxydothermus hydrogenoformans, a bacterium isolated from a Russian hotspring, is studied for three major reasons: it grows at very high temperature, it lives almost entirely on a diet of carbon monoxide (CO), and it converts water to hydrogen gas as part of its metabolism. Understanding this organism's unique biology gets a boost from the decoding of its genome, reported in this issue of PLoS Genetics. For example, genome analysis reveals that it encodes five different forms of the protein machine carbon monoxide dehydrogenase (CODH). Most species have no CODH and even species that utilize CO usually have only one or two. The five CODH in C. hydrogenoformans likely allow it to both use CO for diverse cellular processes and out-compete for it when it is limiting. The genome sequence also led the researchers to experimentally document new aspects of this species' biology including the ability to form spores. The researchers then used comparative genomic analysis to identify conserved genes found in all spore-forming species, including Bacillus anthracis, and not in any other species. Finally, the genome sequence and analysis reported here will aid in those trying to develop this and other species into systems to biologically produce hydrogen gas from water.

Collapse

Affiliation(s)

Martin Wu The Institute for Genomic Research, Rockville, Maryland, United States of America
Qinghu Ren The Institute for Genomic Research, Rockville, Maryland, United States of America
A. Scott Durkin The Institute for Genomic Research, Rockville, Maryland, United States of America
Sean C Daugherty The Institute for Genomic Research, Rockville, Maryland, United States of America
Lauren M Brinkac The Institute for Genomic Research, Rockville, Maryland, United States of America
Robert J Dodson The Institute for Genomic Research, Rockville, Maryland, United States of America
Ramana Madupu The Institute for Genomic Research, Rockville, Maryland, United States of America
Steven A Sullivan The Institute for Genomic Research, Rockville, Maryland, United States of America
James F Kolonay The Institute for Genomic Research, Rockville, Maryland, United States of America
William C Nelson The Institute for Genomic Research, Rockville, Maryland, United States of America
Luke J Tallon The Institute for Genomic Research, Rockville, Maryland, United States of America
Kristine M Jones The Institute for Genomic Research, Rockville, Maryland, United States of America
Luke E Ulrich Center for Bioinformatics and Computational Biology, School of Biology, Georgia Institute of Technology, Atlanta, Georgia, United States of America
Juan M Gonzalez Center of Marine Biotechnology, University of Maryland Biotechnology Institute, Baltimore, Maryland, United States of America
Igor B Zhulin Center for Bioinformatics and Computational Biology, School of Biology, Georgia Institute of Technology, Atlanta, Georgia, United States of America
Frank T Robb Center of Marine Biotechnology, University of Maryland Biotechnology Institute, Baltimore, Maryland, United States of America
Jonathan A Eisen The Institute for Genomic Research, Rockville, Maryland, United States of America Johns Hopkins University, Baltimore, Maryland, United States of America * To whom correspondence should be addressed. E-mail:

Collapse

Fuellen G, Spitzer M, Cullen P, Lorkowski S. Correspondence of function and phylogeny of ABC proteins based on an automated analysis of 20 model protein data sets. Proteins 2005;61:888-99. [PMID: 16254912 DOI: 10.1002/prot.20616] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Krishnamurthy N, Sjölander K. Phylogenomic Inference of Protein Molecular Function. ACTA ACUST UNITED AC 2005;Chapter 6:Unit 6.9. [DOI: 10.1002/0471250953.bi0609s11] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]

Francke C, Siezen RJ, Teusink B. Reconstructing the metabolic network of a bacterium from its genome. Trends Microbiol 2005;13:550-8. [PMID: 16169729 DOI: 10.1016/j.tim.2005.09.001] [Citation(s) in RCA: 112] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2005] [Revised: 08/25/2005] [Accepted: 09/08/2005] [Indexed: 10/25/2022]

Ibrahim RK. A forty-year journey in plant research: original contributions to flavonoid biochemistry. ACTA ACUST UNITED AC 2005. [DOI: 10.1139/b05-030] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Ward N, Larsen Ø, Sakwa J, Bruseth L, Khouri H, Durkin AS, Dimitrov G, Jiang L, Scanlan D, Kang KH, Lewis M, Nelson KE, Methé B, Wu M, Heidelberg JF, Paulsen IT, Fouts D, Ravel J, Tettelin H, Ren Q, Read T, DeBoy RT, Seshadri R, Salzberg SL, Jensen HB, Birkeland NK, Nelson WC, Dodson RJ, Grindhaug SH, Holt I, Eidhammer I, Jonasen I, Vanaken S, Utterback T, Feldblyum TV, Fraser CM, Lillehaug JR, Eisen JA. Genomic insights into methanotrophy: the complete genome sequence of Methylococcus capsulatus (Bath). PLoS Biol 2004;2:e303. [PMID: 15383840 PMCID: PMC517821 DOI: 10.1371/journal.pbio.0020303] [Citation(s) in RCA: 204] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2004] [Accepted: 07/14/2004] [Indexed: 11/23/2022] Open

Abstract

Methanotrophs are ubiquitous bacteria that can use the greenhouse gas methane as a sole carbon and energy source for growth, thus playing major roles in global carbon cycles, and in particular, substantially reducing emissions of biologically generated methane to the atmosphere. Despite their importance, and in contrast to organisms that play roles in other major parts of the carbon cycle such as photosynthesis, no genome-level studies have been published on the biology of methanotrophs. We report the first complete genome sequence to our knowledge from an obligate methanotroph, Methylococcus capsulatus (Bath), obtained by the shotgun sequencing approach. Analysis revealed a 3.3-Mb genome highly specialized for a methanotrophic lifestyle, including redundant pathways predicted to be involved in methanotrophy and duplicated genes for essential enzymes such as the methane monooxygenases. We used phylogenomic analysis, gene order information, and comparative analysis with the partially sequenced methylotroph Methylobacterium extorquens to detect genes of unknown function likely to be involved in methanotrophy and methylotrophy. Genome analysis suggests the ability of M. capsulatus to scavenge copper (including a previously unreported nonribosomal peptide synthetase) and to use copper in regulation of methanotrophy, but the exact regulatory mechanisms remain unclear. One of the most surprising outcomes of the project is evidence suggesting the existence of previously unsuspected metabolic flexibility in M. capsulatus, including an ability to grow on sugars, oxidize chemolithotrophic hydrogen and sulfur, and live under reduced oxygen tension, all of which have implications for methanotroph ecology. The availability of the complete genome of M. capsulatus (Bath) deepens our understanding of methanotroph biology and its relationship to global carbon cycles. We have gained evidence for greater metabolic flexibility than was previously known, and for genetic components that may have biotechnological potential.

Collapse

Premzl M, Gready JE, Jermiin LS, Simonic T, Marshall Graves JA. Evolution of vertebrate genes related to prion and Shadoo proteins--clues from comparative genomic analysis. Mol Biol Evol 2004;21:2210-31. [PMID: 15342797 DOI: 10.1093/molbev/msh245] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open

Abstract

Recent findings of new genes in fish related to the prion protein (PrP) gene PRNP, including our recent report of SPRN coding for Shadoo (Sho) protein found also in mammals, raise issues of their function and evolution. Here we report additional novel fish genes found in public databases, including a duplicated SPRN gene, SPRNB, in Fugu, Tetraodon, carp, and zebrafish encoding the Sho2 protein, and we use comparative genomic analysis to analyze the evolutionary relationships and to infer evolutionary trajectories of the complete data set. Phylogenetic footprinting performed on aligned human, mouse, and Fugu SPRN genes to define candidate regulatory promoter regions, detected 16 conserved motifs, three of which are known transcription factor-binding sites for a receptor and transcription factors specific to or associated with expression in brain. This result and other homology-based (VISTA global genomic alignment; protein sequence alignment and phylogenetics) and context-dependent (genomic context; relative gene order and orientation) criteria indicate fish and mammalian SPRN genes are orthologous and suggest a strongly conserved basic function in brain. Whereas tetrapod PRNPs share context with the analogous stPrP-2-coding gene in fish, their sequences are diverged, suggesting that the tetrapod and fish genes are likely to have significantly different functions. Phylogenetic analysis predicts the SPRN/SPRNB duplication occurred before divergence of fish from tetrapods, whereas that of stPrP-1 and stPrP-2 occurred in fish. Whereas Sho appears to have a conserved function in vertebrate brain, PrP seems to have an adaptive role fine-tuned in a lineage-specific fashion. An evolutionary model consistent with our findings and literature knowledge is proposed that has an ancestral prevertebrate SPRN-like gene leading to all vertebrate PrP-related and Sho-related genes. This provides a new framework for exploring the evolution of this unusual family of proteins and for searching for members in other fish branches and intermediate vertebrate groups.

Collapse

Wu M, Sun LV, Vamathevan J, Riegler M, Deboy R, Brownlie JC, McGraw EA, Martin W, Esser C, Ahmadinejad N, Wiegand C, Madupu R, Beanan MJ, Brinkac LM, Daugherty SC, Durkin AS, Kolonay JF, Nelson WC, Mohamoud Y, Lee P, Berry K, Young MB, Utterback T, Weidman J, Nierman WC, Paulsen IT, Nelson KE, Tettelin H, O'Neill SL, Eisen JA. Phylogenomics of the reproductive parasite Wolbachia pipientis wMel: a streamlined genome overrun by mobile genetic elements. PLoS Biol 2004;2:E69. [PMID: 15024419 PMCID: PMC368164 DOI: 10.1371/journal.pbio.0020069] [Citation(s) in RCA: 587] [Impact Index Per Article: 29.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2003] [Accepted: 01/06/2004] [Indexed: 12/17/2022] Open

Abstract

The complete sequence of the 1,267,782 bp genome of Wolbachia pipientis wMel, an obligate intracellular bacteria of Drosophila melanogaster, has been determined. Wolbachia, which are found in a variety of invertebrate species, are of great interest due to their diverse interactions with different hosts, which range from many forms of reproductive parasitism to mutualistic symbioses. Analysis of the wMel genome, in particular phylogenomic comparisons with other intracellular bacteria, has revealed many insights into the biology and evolution of wMel and Wolbachia in general. For example, the wMel genome is unique among sequenced obligate intracellular species in both being highly streamlined and containing very high levels of repetitive DNA and mobile DNA elements. This observation, coupled with multiple evolutionary reconstructions, suggests that natural selection is somewhat inefficient in wMel, most likely owing to the occurrence of repeated population bottlenecks. Genome analysis predicts many metabolic differences with the closely related Rickettsia species, including the presence of intact glycolysis and purine synthesis, which may compensate for an inability to obtain ATP directly from its host, as Rickettsia can. Other discoveries include the apparent inability of wMel to synthesize lipopolysaccharide and the presence of the most genes encoding proteins with ankyrin repeat domains of any prokaryotic genome yet sequenced. Despite the ability of wMel to infect the germline of its host, we find no evidence for either recent lateral gene transfer between wMel and D. melanogaster or older transfers between Wolbachia and any host. Evolutionary analysis further supports the hypothesis that mitochondria share a common ancestor with the α-Proteobacteria, but shows little support for the grouping of mitochondria with species in the order Rickettsiales. With the availability of the complete genomes of both species and excellent genetic tools for the host, the wMel–D. melanogaster symbiosis is now an ideal system for studying the biology and evolution of Wolbachia infections.

The genome sequence of Wolbachia provides insights into the origins of mitochondria, as well as the ecology and evolution of endosymbiosis

Collapse

Affiliation(s)

Martin Wu 1The Institute for Genomic Research, RockvilleMarylandUnited States of America
Ling V Sun 2Department of Epidemiology and Public Health, Yale University School of MedicineNew Haven, ConnecticutUnited States of America
Jessica Vamathevan 1The Institute for Genomic Research, RockvilleMarylandUnited States of America
Markus Riegler 3Department of Zoology and Entomology, School of Life SciencesThe University of Queensland, St Lucia, QueenslandAustralia
Robert Deboy 1The Institute for Genomic Research, RockvilleMarylandUnited States of America
Jeremy C Brownlie 3Department of Zoology and Entomology, School of Life SciencesThe University of Queensland, St Lucia, QueenslandAustralia
Elizabeth A McGraw 3Department of Zoology and Entomology, School of Life SciencesThe University of Queensland, St Lucia, QueenslandAustralia
William Martin 4Institut für Botanik III, Heinrich-Heine UniversitätDüsseldorfGermany
Christian Esser 4Institut für Botanik III, Heinrich-Heine UniversitätDüsseldorfGermany
Nahal Ahmadinejad 4Institut für Botanik III, Heinrich-Heine UniversitätDüsseldorfGermany
Christian Wiegand 4Institut für Botanik III, Heinrich-Heine UniversitätDüsseldorfGermany
Ramana Madupu 1The Institute for Genomic Research, RockvilleMarylandUnited States of America
Maureen J Beanan 1The Institute for Genomic Research, RockvilleMarylandUnited States of America
Lauren M Brinkac 1The Institute for Genomic Research, RockvilleMarylandUnited States of America
Sean C Daugherty 1The Institute for Genomic Research, RockvilleMarylandUnited States of America
A. Scott Durkin 1The Institute for Genomic Research, RockvilleMarylandUnited States of America
James F Kolonay 1The Institute for Genomic Research, RockvilleMarylandUnited States of America
William C Nelson 1The Institute for Genomic Research, RockvilleMarylandUnited States of America
Yasmin Mohamoud 1The Institute for Genomic Research, RockvilleMarylandUnited States of America
Perris Lee 1The Institute for Genomic Research, RockvilleMarylandUnited States of America
Kristi Berry 1The Institute for Genomic Research, RockvilleMarylandUnited States of America
M. Brook Young 1The Institute for Genomic Research, RockvilleMarylandUnited States of America
Teresa Utterback 1The Institute for Genomic Research, RockvilleMarylandUnited States of America
Janice Weidman 1The Institute for Genomic Research, RockvilleMarylandUnited States of America
William C Nierman 1The Institute for Genomic Research, RockvilleMarylandUnited States of America
Ian T Paulsen 1The Institute for Genomic Research, RockvilleMarylandUnited States of America
Karen E Nelson 1The Institute for Genomic Research, RockvilleMarylandUnited States of America
Hervé Tettelin 1The Institute for Genomic Research, RockvilleMarylandUnited States of America
Scott L O'Neill 2Department of Epidemiology and Public Health, Yale University School of MedicineNew Haven, ConnecticutUnited States of America 3Department of Zoology and Entomology, School of Life SciencesThe University of Queensland, St Lucia, QueenslandAustralia
Jonathan A Eisen 1The Institute for Genomic Research, RockvilleMarylandUnited States of America

Collapse