1
|
Nicastro GG, Burroughs AM, Iyer L, Aravind L. Functionally comparable but evolutionarily distinct nucleotide-targeting effectors help identify conserved paradigms across diverse immune systems. Nucleic Acids Res 2023; 51:11479-11503. [PMID: 37889040 PMCID: PMC10681802 DOI: 10.1093/nar/gkad879] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Revised: 09/21/2023] [Accepted: 09/28/2023] [Indexed: 10/28/2023] Open
Abstract
While nucleic acid-targeting effectors are known to be central to biological conflicts and anti-selfish element immunity, recent findings have revealed immune effectors that target their building blocks and the cellular energy currency-free nucleotides. Through comparative genomics and sequence-structure analysis, we identified several distinct effector domains, which we named Calcineurin-CE, HD-CE, and PRTase-CE. These domains, along with specific versions of the ParB and MazG domains, are widely present in diverse prokaryotic immune systems and are predicted to degrade nucleotides by targeting phosphate or glycosidic linkages. Our findings unveil multiple potential immune systems associated with at least 17 different functional themes featuring these effectors. Some of these systems sense modified DNA/nucleotides from phages or operate downstream of novel enzymes generating signaling nucleotides. We also uncovered a class of systems utilizing HSP90- and HSP70-related modules as analogs of STAND and GTPase domains that are coupled to these nucleotide-targeting- or proteolysis-induced complex-forming effectors. While widespread in bacteria, only a limited subset of nucleotide-targeting effectors was integrated into eukaryotic immune systems, suggesting barriers to interoperability across subcellular contexts. This work establishes nucleotide-degrading effectors as an emerging immune paradigm and traces their origins back to homologous domains in housekeeping systems.
Collapse
Affiliation(s)
- Gianlucca G Nicastro
- Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, USA
| | - A Maxwell Burroughs
- Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, USA
| | - Lakshminarayan M Iyer
- Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, USA
| | - L Aravind
- Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, USA
| |
Collapse
|
2
|
Feng Y, Neri U, Gosselin S, Louyakis AS, Papke RT, Gophna U, Gogarten JP. The Evolutionary Origins of Extreme Halophilic Archaeal Lineages. Genome Biol Evol 2021; 13:6320066. [PMID: 34255041 PMCID: PMC8350355 DOI: 10.1093/gbe/evab166] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/10/2021] [Indexed: 12/12/2022] Open
Abstract
Interest and controversy surrounding the evolutionary origins of extremely halophilic Archaea has increased in recent years, due to the discovery and characterization of the Nanohaloarchaea and the Methanonatronarchaeia. Initial attempts in explaining the evolutionary placement of the two new lineages in relation to the classical Halobacteria (also referred to as Haloarchaea) resulted in hypotheses that imply the new groups share a common ancestor with the Haloarchaea. However, more recent analyses have led to a shift: the Nanohaloarchaea have been largely accepted as being a member of the DPANN superphylum, outside of the euryarchaeota; whereas the Methanonatronarchaeia have been placed near the base of the Methanotecta (composed of the class II methanogens, the Halobacteriales, and Archaeoglobales). These opposing hypotheses have far-reaching implications on the concepts of convergent evolution (distantly related groups evolve similar strategies for survival), genome reduction, and gene transfer. In this work, we attempt to resolve these conflicts with phylogenetic and phylogenomic data. We provide a robust taxonomic sampling of Archaeal genomes that spans the Asgardarchaea, TACK Group, euryarchaeota, and the DPANN superphylum. In addition, we assembled draft genomes from seven new representatives of the Nanohaloarchaea from distinct geographic locations. Phylogenies derived from these data imply that the highly conserved ATP synthase catalytic/noncatalytic subunits of Nanohaloarchaea share a sisterhood relationship with the Haloarchaea. We also employ a novel gene family distance clustering strategy which shows this sisterhood relationship is not likely the result of a recent gene transfer. In addition, we present and evaluate data that argue for and against the monophyly of the DPANN superphylum, in particular, the inclusion of the Nanohaloarchaea in DPANN.
Collapse
Affiliation(s)
- Yutian Feng
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
| | - Uri Neri
- Shmunis School of Biomedicine and Cancer Research, Tel Aviv University, Israel
| | - Sean Gosselin
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
| | - Artemis S Louyakis
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
| | - R Thane Papke
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
| | - Uri Gophna
- Shmunis School of Biomedicine and Cancer Research, Tel Aviv University, Israel
| | - Johann Peter Gogarten
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA.,Institute for Systems Genomics, University of Connecticut, Storrs, Connecticut, USA
| |
Collapse
|
3
|
Multi fragment melting analysis system (MFMAS) for one-step identification of lactobacilli. J Microbiol Methods 2020; 177:106045. [PMID: 32890569 DOI: 10.1016/j.mimet.2020.106045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2019] [Revised: 08/18/2020] [Accepted: 08/19/2020] [Indexed: 11/23/2022]
Abstract
The accurate identification of lactobacilli is essential for the effective management of industrial practices associated with lactobacilli strains, such as the production of fermented foods or probiotic supplements. For this reason, in this study, we proposed the Multi Fragment Melting Analysis System (MFMAS)-lactobacilli based on high resolution melting (HRM) analysis of multiple DNA regions that have high interspecies heterogeneity for fast and reliable identification and characterization of lactobacilli. The MFMAS-lactobacilli is a new and customized version of the MFMAS, which was developed by our research group. MFMAS-lactobacilli is a combined system that consists of i) a ready-to-use plate, which is designed for multiple HRM analysis, and ii) a data analysis software, which is used to characterize lactobacilli species via incorporating machine learning techniques. Simultaneous HRM analysis of multiple DNA fragments yields a fingerprint for each tested strain and the identification is performed by comparing the fingerprints of unknown strains with those of known lactobacilli species registered in the MFMAS. In this study, a total of 254 isolates, which were recovered from fermented foods and probiotic supplements, were subjected to MFMAS analysis, and the results were confirmed by a combination of different molecular techniques. All of the analyzed isolates were exactly differentiated and accurately identified by applying the single-step procedure of MFMAS, and it was determined that all of the tested isolates belonged to 18 different lactobacilli species. The individual analysis of each target DNA region provided identification with an accuracy range from 59% to 90% for all tested isolates. However, when each target DNA region was analyzed simultaneously, perfect discrimination and 100% accurate identification were obtained even in closely related species. As a result, it was concluded that MFMAS-lactobacilli is a multi-purpose method that can be used to differentiate, classify, and identify lactobacilli species. Hence, our proposed system could be a potential alternative to overcome the inconsistencies and difficulties of the current methods.
Collapse
|
4
|
Banerjee S, Feyertag F, Alvarez-Ponce D. Intrinsic protein disorder reduces small-scale gene duplicability. DNA Res 2017; 24:435-444. [PMID: 28430886 PMCID: PMC5737077 DOI: 10.1093/dnares/dsx015] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2016] [Accepted: 03/28/2017] [Indexed: 01/23/2023] Open
Abstract
Whereas the rate of gene duplication is relatively high, only certain duplications survive the filter of natural selection and can contribute to genome evolution. However, the reasons why certain genes can be retained after duplication whereas others cannot remain largely unknown. Many proteins contain intrinsically disordered regions (IDRs), whose structures fluctuate between alternative conformational states. Due to their high flexibility, IDRs often enable protein–protein interactions and are the target of post-translational modifications. Intrinsically disordered proteins (IDPs) have characteristics that might either stimulate or hamper the retention of their encoding genes after duplication. On the one hand, IDRs may enable functional diversification, thus promoting duplicate retention. On the other hand, increased IDP availability is expected to result in deleterious unspecific interactions. Here, we interrogate the proteomes of human, Drosophila melanogaster, Caenorhabditis elegans, Saccharomyces cerevisiae, Arabidopsis thaliana and Escherichia coli, in order to ascertain the impact of protein intrinsic disorder on gene duplicability. We show that, in general, proteins encoded by duplicated genes tend to be less disordered than those encoded by singletons. The only exception is proteins encoded by ohnologs, which tend to be more disordered than those encoded by singletons or genes resulting from small-scale duplications. Our results indicate that duplication of genes encoding IDPs outside the context of whole-genome duplication (WGD) is often deleterious, but that IDRs facilitate retention of duplicates in the context of WGD. We discuss the potential evolutionary implications of our results.
Collapse
Affiliation(s)
- Sanghita Banerjee
- Department of Biology, University of Nevada, Reno, NV 89557, USA.,Machine Intelligence Unit, Indian Statistical Institute, Kolkata 700108, India
| | - Felix Feyertag
- Department of Biology, University of Nevada, Reno, NV 89557, USA
| | | |
Collapse
|
5
|
Mærk M, Johansen J, Ertesvåg H, Drabløs F, Valla S. Safety in numbers: multiple occurrences of highly similar homologs among Azotobacter vinelandii carbohydrate metabolism proteins probably confer adaptive benefits. BMC Genomics 2014; 15:192. [PMID: 24625193 PMCID: PMC4022178 DOI: 10.1186/1471-2164-15-192] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2013] [Accepted: 03/05/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Gene duplication and horizontal gene transfer are common processes in bacterial and archaeal genomes, and are generally assumed to result in either diversification or loss of the redundant gene copies. However, a recent analysis of the genome of the soil bacterium Azotobacter vinelandii DJ revealed an abundance of highly similar homologs among carbohydrate metabolism genes. In many cases these multiple genes did not appear to be the result of recent duplications, or to function only as a means of stimulating expression by increasing gene dosage, as the homologs were located in varying functional genetic contexts. Based on these initial findings we here report in-depth bioinformatic analyses focusing specifically on highly similar intra-genome homologs, or synologs, among carbohydrate metabolism genes, as well as an analysis of the general occurrence of very similar synologs in prokaryotes. RESULTS Approximately 900 bacterial and archaeal genomes were analysed for the occurrence of synologs, both in general and among carbohydrate metabolism genes specifically. This showed that large numbers of highly similar synologs among carbohydrate metabolism genes are very rare in bacterial and archaeal genomes, and that the A. vinelandii DJ genome contains an unusually large amount of such synologs. The majority of these synologs were found to be non-tandemly organized and localized in varying but metabolically relevant genomic contexts. The same observation was made for other genomes harbouring high levels of such synologs. It was also shown that highly similar synologs generally constitute a very small fraction of the protein-coding genes in prokaryotic genomes. The overall synolog fraction of the A. vinelandii DJ genome was well above the data set average, but not nearly as remarkable as the levels observed when only carbohydrate metabolism synologs were considered. CONCLUSIONS Large numbers of highly similar synologs are rare in bacterial and archaeal genomes, both in general and among carbohydrate metabolism genes. However, A. vinelandii and several other soil bacteria harbour large numbers of highly similar carbohydrate metabolism synologs which seem not to result from recent duplication or transfer events. These genes may confer adaptive benefits with respect to certain lifestyles and environmental factors, most likely due to increased regulatory flexibility and/or increased gene dosage.
Collapse
Affiliation(s)
| | | | | | - Finn Drabløs
- Department of Cancer Research and Molecular Medicine, Norwegian University of Science and Technology, NO-7491, Trondheim, Norway.
| | | |
Collapse
|
6
|
Swithers KS, Soucy SM, Lasek-Nesselquist E, Lapierre P, Gogarten JP. Distribution and Evolution of the Mobile vma-1b Intein. Mol Biol Evol 2013; 30:2676-87. [DOI: 10.1093/molbev/mst164] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
7
|
Affiliation(s)
- David P. Mindell
- Department of Biochemistry & Biophysics, University of California, San Francisco, CA 94158, USA
| |
Collapse
|
8
|
Liu W, Li L, Khan MA, Zhu F. Popular molecular markers in bacteria. MOLECULAR GENETICS MICROBIOLOGY AND VIROLOGY 2012. [DOI: 10.3103/s0891416812030056] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
9
|
Huang CH, Chang MT, Huang MC, Lee FL. Rapid identification of Lactobacillus plantarum group using the SNaPshot minisequencing assay. Syst Appl Microbiol 2012; 34:586-9. [PMID: 21641139 DOI: 10.1016/j.syapm.2011.02.006] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2010] [Revised: 02/15/2011] [Accepted: 02/16/2011] [Indexed: 10/18/2022]
Abstract
This study used SNaPshot minisequencing for species identification within the Lactobacillus plantarum group. A SNaPshot minisequencing assay using dnaK as a target gene was developed, and five SNP primers were designed by analysing the conserved regions of the dnaK sequences. The specificity of the minisequencing assay was evaluated using 35 strains of L. plantarum group species. The results showed that the SNaPshot minisequencing assay was able to unambiguously and simultaneously discriminate strains belonging to the species L. plantarum subsp. plantarum, L. plantarum subsp. argentoratensis, Lactobacillus paraplantarum, Lactobacillus pentosus and Lactobacillus fabifermentans. In conclusion, a rapid, accurate and cost-effective assay was successfully developed for species identification of the members of the L. plantarum group.
Collapse
Affiliation(s)
- Chien-Hsun Huang
- Bioresource Collection and Research Center, Food Industry Research and Development Institute, P.O. Box 246, Hsinchu 30099, Taiwan, ROC
| | | | | | | |
Collapse
|
10
|
Pavlinov IY. The contemporary concepts of homology in biology: A theoretical review. ACTA ACUST UNITED AC 2012. [DOI: 10.1134/s2079086412010057] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
11
|
Swithers KS, Fournier GP, Green AG, Gogarten JP, Lapierre P. Reassessment of the lineage fusion hypothesis for the origin of double membrane bacteria. PLoS One 2011; 6:e23774. [PMID: 21876769 PMCID: PMC3158100 DOI: 10.1371/journal.pone.0023774] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2011] [Accepted: 07/25/2011] [Indexed: 11/21/2022] Open
Abstract
In 2009, James Lake introduced a new hypothesis in which reticulate phylogeny reconstruction is used to elucidate the origin of Gram-negative bacteria (Nature 460: 967–971). The presented data supported the Gram-negative bacteria originating from an ancient endosymbiosis between the Actinobacteria and Clostridia. His conclusion was based on a presence-absence analysis of protein families that divided all prokaryotes into five groups: Actinobacteria, Double Membrane bacteria (DM), Clostridia, Archaea and Bacilli. Of these five groups, the DM are by far the largest and most diverse group compared to the other groupings. While the fusion hypothesis for the origin of double membrane bacteria is enticing, we show that the signal supporting an ancient symbiosis is lost when the DM group is broken down into smaller subgroups. We conclude that the signal detected in James Lake's analysis in part results from a systematic artifact due to group size and diversity combined with low levels of horizontal gene transfer.
Collapse
Affiliation(s)
- Kristen S. Swithers
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, United States of America
| | - Gregory P. Fournier
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Anna G. Green
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, United States of America
| | - J. Peter Gogarten
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, United States of America
| | - Pascal Lapierre
- University of Connecticut Biotechnology Center, University of Connecticut, Storrs, Connecticut, United States of America
- * E-mail:
| |
Collapse
|
12
|
Huang CH, Chang MT, Huang MC, Lee FL. Application of the SNaPshot minisequencing assay to species identification in the Lactobacillus casei group. Mol Cell Probes 2011; 25:153-7. [PMID: 21440058 DOI: 10.1016/j.mcp.2011.03.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2011] [Revised: 03/14/2011] [Accepted: 03/14/2011] [Indexed: 10/18/2022]
Abstract
This study used group-specific PCR combined with SNaPshot minisequencing for species identification within the Lactobacillus casei group. The L. casei group-specific PCR primer pair was designed using the rpoA gene sequence. A SNaPshot minisequencing assay using dnaK as a target gene was developed, and five SNP primers were designed by analysing the conserved regions of the dnaK sequences. The specificity of the minisequencing assay was evaluated using 63 strains of L. casei group species. The results showed that the group-specific PCR could assign Lactobacillus strains into the L. casei group, and the SNaPshot minisequencing assay was able to unambiguously and simultaneously discriminate strains belonging to the species L. casei, Lactobacillus paracasei, and Lactobacillus rhamnosus. In conclusion, we have successfully developed a rapid, accurate and cost-effective assay for species identification of members of the L. casei group.
Collapse
Affiliation(s)
- Chien-Hsun Huang
- Bioresource Collection and Research Center, Food Industry Research and Development Institute, Hsinchu, Taiwan, ROC
| | | | | | | |
Collapse
|
13
|
Cerdà-Costa N, Guevara T, Karim AY, Ksiazek M, Nguyen KA, Arolas JL, Potempa J, Gomis-Rüth FX. The structure of the catalytic domain of Tannerella forsythia karilysin reveals it is a bacterial xenologue of animal matrix metalloproteinases. Mol Microbiol 2011; 79:119-32. [PMID: 21166898 PMCID: PMC3077575 DOI: 10.1111/j.1365-2958.2010.07434.x] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Metallopeptidases (MPs) are among virulence factors secreted by pathogenic bacteria at the site of infection. One such pathogen is Tannerella forsythia, a member of the microbial consortium that causes peridontitis, arguably the most prevalent infective chronic inflammatory disease known to mankind. The only reported MP secreted by T. forsythia is karilysin, a 52 kDa multidomain protein comprising a central 18 kDa catalytic domain (CD), termed Kly18, flanked by domains unrelated to any known protein. We analysed the 3D structure of Kly18 in the absence and presence of Mg(2+) or Ca(2+) , which are required for function and stability, and found that it evidences most of the structural features characteristic of the CDs of mammalian matrix metalloproteinases (MMPs). Unexpectedly, a peptide was bound to the active-site cleft of Kly18 mimicking a left-behind cleavage product, which revealed that the specificity pocket accommodates bulky hydrophobic side-chains of substrates as in mammalian MMPs. In addition, Kly18 displayed a unique Mg(2+) or Ca(2+) binding site and two flexible segments that could play a role in substrate binding. Phylogenetic and sequence similarity studies revealed that Kly18 is evolutionarily much closer to winged-insect and mammalian MMPs than to potential bacterial counterparts found by genomic sequencing projects. Therefore, we conclude that this first structurally characterized non-mammalian MMP is a xenologue co-opted through horizontal gene transfer during the intimate coexistence between T. forsythia and humans or other animals, in a very rare case of gene shuffling from eukaryotes to prokaryotes. Subsequently, this protein would have evolved in a bacterial environment to give rise to full-length karilysin that is furnished with unique flanking domains that do not conform to the general multidomain architecture of animal MMPs.
Collapse
Affiliation(s)
- Núria Cerdà-Costa
- Proteolysis Lab; Department of Structural Biology; Molecular Biology Institute of Barcelona, CSIC; Barcelona Science Park; Helix Building; c/ Baldiri Reixac, 15-21; E-08028 Barcelona (Catalunya)
| | - Tibisay Guevara
- Proteolysis Lab; Department of Structural Biology; Molecular Biology Institute of Barcelona, CSIC; Barcelona Science Park; Helix Building; c/ Baldiri Reixac, 15-21; E-08028 Barcelona (Catalunya)
| | - Abdulkarim Y. Karim
- Department of Microbiology; Faculty of Biochemistry, Biophysics and Biotechnology; Jagiellonian University; PL-Krakow 30-387 (Poland)
| | - Miroslaw Ksiazek
- Department of Microbiology; Faculty of Biochemistry, Biophysics and Biotechnology; Jagiellonian University; PL-Krakow 30-387 (Poland)
| | - Ky-Anh Nguyen
- Institute of Dental Research, Westmead Centre for Oral Health, Sydney NSW 2145 (Australia)
- Faculty of Dentistry, University of Sydney, Sydney NSW 2006 (Australia)
| | - Joan L. Arolas
- Proteolysis Lab; Department of Structural Biology; Molecular Biology Institute of Barcelona, CSIC; Barcelona Science Park; Helix Building; c/ Baldiri Reixac, 15-21; E-08028 Barcelona (Catalunya)
| | - Jan Potempa
- Department of Microbiology; Faculty of Biochemistry, Biophysics and Biotechnology; Jagiellonian University; PL-Krakow 30-387 (Poland)
- University of Louisville; School of Dentistry; Oral Health and Systemic Disease; Louisville, KY 40202 (USA)
| | - F. Xavier Gomis-Rüth
- Proteolysis Lab; Department of Structural Biology; Molecular Biology Institute of Barcelona, CSIC; Barcelona Science Park; Helix Building; c/ Baldiri Reixac, 15-21; E-08028 Barcelona (Catalunya)
| |
Collapse
|
14
|
The dnaK gene as a molecular marker for the classification and discrimination of the Lactobacillus casei group. Antonie van Leeuwenhoek 2010; 99:319-27. [DOI: 10.1007/s10482-010-9493-6] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/24/2010] [Accepted: 08/02/2010] [Indexed: 10/19/2022]
|
15
|
Delaye L, Deluna A, Lazcano A, Becerra A. The origin of a novel gene through overprinting in Escherichia coli. BMC Evol Biol 2008; 8:31. [PMID: 18226237 PMCID: PMC2268670 DOI: 10.1186/1471-2148-8-31] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2007] [Accepted: 01/28/2008] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Overlapped genes originate by a) loss of a stop codon among contiguous genes coded in different frames; b) shift to an upstream initiation codon of one of the contiguous genes; or c) by overprinting, whereby a novel open reading frame originates through point mutation inside an existing gene. Although overlapped genes are common in viruses, it is not clear whether overprinting has led to new genes in prokaryotes. RESULTS Here we report the origin of a new gene through overprinting in Escherichia coli K12. The htgA gene coding for a positive regulator of the sigma 32 heat shock promoter arose by point mutation in a 123/213 phase within an open reading frame (yaaW) of unknown function, most likely in the lineage leading to E. coli and Shigella sp. Further, we show that yaaW sequences coding for htgA genes have a slower evolutionary rate than those lacking an overlapped htgA gene. CONCLUSION While overprinting has been shown to be rather frequent in the evolution of new genes in viruses, our results suggest that this mechanism has also contributed to the origin of a novel gene in a prokaryote. We propose the term janolog (from Jano, the two-faced Roman god) to describe the homology relationship that holds between two genes when one originated through overprinting of the other. One cannot dismiss the possibility that at least a small fraction of the large number of novel ORPhan genes detected in pan-genome and metagenomic studies arose by overprinting.
Collapse
Affiliation(s)
- Luis Delaye
- Facultad de Ciencias, Universidad Nacional Autónoma de México, Apdo. Postal 70-407, Cd. Universitaria, 04510 México DF, México.
| | | | | | | |
Collapse
|
16
|
Kleisner K. The formation of the theory of homology in biological sciences. Acta Biotheor 2007; 55:317-40. [PMID: 17929173 DOI: 10.1007/s10441-007-9023-8] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2006] [Accepted: 09/07/2007] [Indexed: 11/26/2022]
Abstract
Homology is among the most important comparative concepts in biology. Today, the evolutionary reinterpretation of homology is usually conceived of as the most important event in the development of the concept. This paradigmatic turning point, however important for the historical explanation of life, is not of crucial importance for the development of the concept of homology itself. In the broadest sense, homology can be understood as sameness in reference to the universal guarantor so that in this sense the different concepts of homology show a certain kind of "metahomology". This holds in the old morphological conception, as well as in the evolutionary usage of homology. Depending on what is (or was) taken as a guarantor, different types of homology may be distinguished (as idealistic, historical, developmental etc.). This study represents a historical overview of the development of the homology concept followed by some clues on how to navigate the pluralistic terminology of modern approaches to homology.
Collapse
Affiliation(s)
- Karel Kleisner
- Department of History and Philosophy of Science, Charles University, Vinicná 7, Prague, 128 44, Czech Republic.
| |
Collapse
|
17
|
Abstract
Orthologs and paralogs are two fundamentally different types of homologous genes that evolved, respectively, by vertical descent from a single ancestral gene and by duplication. Orthology and paralogy are key concepts of evolutionary genomics. A clear distinction between orthologs and paralogs is critical for the construction of a robust evolutionary classification of genes and reliable functional annotation of newly sequenced genomes. Genome comparisons show that orthologous relationships with genes from taxonomically distant species can be established for the majority of the genes from each sequenced genome. This review examines in depth the definitions and subtypes of orthologs and paralogs, outlines the principal methodological approaches employed for identification of orthology and paralogy, and considers evolutionary and functional implications of these concepts.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA.
| |
Collapse
|
18
|
Zhaxybayeva O, Lapierre P, Gogarten JP. Ancient gene duplications and the root(s) of the tree of life. PROTOPLASMA 2005; 227:53-64. [PMID: 16389494 DOI: 10.1007/s00709-005-0135-1] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/18/2005] [Accepted: 05/31/2005] [Indexed: 05/06/2023]
Abstract
Tracing organismal histories on the timescale of the tree of life remains one of the challenging tasks in evolutionary biology. The hotly debated questions include the evolutionary relationship between the three domains of life (e.g., which of the three domains are sister domains, are the domains para-, poly-, or monophyletic) and the location of the root within the universal tree of life. For the latter, many different points of view have been considered but so far no consensus has been reached. The only widely accepted rationale to root the universal tree of life is to use anciently duplicated paralogous genes that are present in all three domains of life. To date only few anciently duplicated gene families useful for phylogenetic reconstruction have been identified. Here we present results from a systematic search for ancient gene duplications using twelve representative, completely sequenced, archaeal and bacterial genomes. Phylogenetic analyses of identified cases show that the majority of datasets support a root between Archaea and Bacteria; however, some datasets support alternative hypotheses, and all of them suffer from a lack of strong phylogenetic signal. The results are discussed with respect to the impact of horizontal gene transfer on the ability to reconstruct organismal evolution. The exchange of genetic information between divergent organisms gives rise to mosaic genomes, where different genes in a genome have different histories. Simulations show that even low rates of horizontal gene transfer dramatically complicate the reconstruction of organismal evolution, and that the different most recent common molecular ancestors likely existed at different times and in different lineages.
Collapse
Affiliation(s)
- Olga Zhaxybayeva
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut 06269-31258, USA
| | | | | |
Collapse
|
19
|
Abstract
To what extent is the tree of life the best representation of the evolutionary history of microorganisms? Recent work has shown that, among sets of prokaryotic genomes in which most homologous genes show extremely low sequence divergence, gene content can vary enormously, implying that those genes that are variably present or absent are frequently horizontally transferred. Traditionally, successful horizontal gene transfer was assumed to provide a selective advantage to either the host or the gene itself, but could horizontally transferred genes be neutral or nearly neutral? We suggest that for many prokaryotes, the boundaries between species are fuzzy, and therefore the principles of population genetics must be broadened so that they can be applied to higher taxonomic categories.
Collapse
Affiliation(s)
- J Peter Gogarten
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut 06269-3125, USA.
| | | |
Collapse
|
20
|
Hamel L, Zhaxybayeva O, Gogarten JP. PentaPlot: a software tool for the illustration of genome mosaicism. BMC Bioinformatics 2005; 6:139. [PMID: 15938752 PMCID: PMC1177926 DOI: 10.1186/1471-2105-6-139] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2005] [Accepted: 06/06/2005] [Indexed: 12/02/2022] Open
Abstract
Background Dekapentagonal maps depict the phylogenetic relationships of five genomes in a visually appealing diagram and can be viewed as an alternative to a single evolutionary consensus tree. In particular, the generated maps focus attention on those gene families that significantly deviate from the consensus or plurality phylogeny. PentaPlot is a software tool that computes such dekapentagonal maps given an appropriate probability support matrix. Results The visualization with dekapentagonal maps critically depends on the optimal layout of unrooted tree topologies representing different evolutionary relationships among five organisms along the vertices of the dekapentagon. This is a difficult optimization problem given the large number of possible layouts. At its core our tool utilizes a genetic algorithm with demes and a local search strategy to search for the optimal layout. The hybrid genetic algorithm performs satisfactorily even in those cases where the chosen genomes are so divergent that little phylogenetic information has survived in the individual gene families. Conclusion PentaPlot is being made publicly available as an open source project at .
Collapse
Affiliation(s)
- Lutz Hamel
- Department of Computer Science and Statistics, University of Rhode Island, Kingston, RI 02881, USA
| | - Olga Zhaxybayeva
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, 06269-3125, USA
- Department of Biochemistry and Molecular Biology, Dalhousie University, 5850 College Street, Halifax, NS B3H 1X5, Canada
| | - J Peter Gogarten
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, 06269-3125, USA
| |
Collapse
|
21
|
Rampias TN, Sideris DC, Fragoulis EG. Cc RNase: the Ceratitis capitata ortholog of a novel highly conserved protein family in metazoans. Nucleic Acids Res 2003; 31:3092-100. [PMID: 12799437 PMCID: PMC162248 DOI: 10.1093/nar/gkg414] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Complementary DNA encoding a protein, designated Cc RNase, was isolated from the insect Ceratitis capitata. Deduced amino acid sequence analysis demonstrates that the Cc RNase has strong sequence homology with other uncharacterized proteins predicted from EST sequences belonging to different animal species, therefore defining a new protein family, which is conserved from Caenorhabditis elegans to humans. Phylogenetic analysis data in addition to extensive homolog searches in all available complete genomes suggested that all family members are true orthologs. Proteins belonging to this family are composed of 95-101 amino acids. The C.capitata orthologous protein was expressed in Escherichia coli. Despite the fact that the amino acid sequence of Cc RNase does not share any significant similarities with other known ribonucleases, our data give strong evidence in support of the assignment of enzymatic activity to the recombinant protein. The expressed molecule exhibits ribonucleolytic activity against poly(C) and poly(U) synthetic substrates, as well as rRNA. It is also demonstrated that expression of Cc RNase in E.coli inhibits growth of the host cells.
Collapse
Affiliation(s)
- Theodoros N Rampias
- University of Athens, Faculty of Biology, Department of Biochemistry and Molecular Biology, Panepistimioupolis, 15701 Athens, Greece
| | | | | |
Collapse
|
22
|
Liang P, Riley M. A comparative genomics approach for studying ancestral proteins and evolution. ADVANCES IN APPLIED MICROBIOLOGY 2002; 50:39-72. [PMID: 11677689 DOI: 10.1016/s0065-2164(01)50003-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Affiliation(s)
- P Liang
- Josephine Bay Paul Center for Molecular Evolution and Comparative Biology, Marine Biological Laboratory, Woods Hole, Massachusetts 02543, USA
| | | |
Collapse
|
23
|
|
24
|
Koonin EV, Makarova KS, Aravind L. Horizontal gene transfer in prokaryotes: quantification and classification. Annu Rev Microbiol 2001; 55:709-42. [PMID: 11544372 PMCID: PMC4781227 DOI: 10.1146/annurev.micro.55.1.709] [Citation(s) in RCA: 758] [Impact Index Per Article: 33.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Comparative analysis of bacterial, archaeal, and eukaryotic genomes indicates that a significant fraction of the genes in the prokaryotic genomes have been subject to horizontal transfer. In some cases, the amount and source of horizontal gene transfer can be linked to an organism's lifestyle. For example, bacterial hyperthermophiles seem to have exchanged genes with archaea to a greater extent than other bacteria, whereas transfer of certain classes of eukaryotic genes is most common in parasitic and symbiotic bacteria. Horizontal transfer events can be classified into distinct categories of acquisition of new genes, acquisition of paralogs of existing genes, and xenologous gene displacement whereby a gene is displaced by a horizontally transferred ortholog from another lineage (xenolog). Each of these types of horizontal gene transfer is common among prokaryotes, but their relative contributions differ in different lineages. The fixation and long-term persistence of horizontally transferred genes suggests that they confer a selective advantage on the recipient organism. In most cases, the nature of this advantage remains unclear, but detailed examination of several cases of acquisition of eukaryotic genes by bacteria seems to reveal the evolutionary forces involved. Examples include isoleucyl-tRNA synthetases whose acquisition from eukaryotes by several bacteria is linked to antibiotic resistance, ATP/ADP translocases acquired by intracellular parasitic bacteria, Chlamydia and Rickettsia, apparently from plants, and proteases that may be implicated in chlamydial pathogenesis.
Collapse
Affiliation(s)
- E V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA.
| | | | | |
Collapse
|
25
|
Abstract
There are many problems relating to defining the terminology used to describe various biological relationships and getting agreement on which definitions are best. Here, I examine 15 terminological problems, all of which are current, and all of which relate to the usage of homology and its associated terms. I suggest a set of definitions that are intended to be totally consistent among themselves and also as consistent as possible with most current usage.
Collapse
Affiliation(s)
- W M Fitch
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA 92697, USA
| |
Collapse
|
26
|
Abstract
During the past decade, ancient gene duplications were recognized as one of the main forces in the generation of diverse gene families and the creation of new functional capabilities. New tools developed to search data banks for homologous sequences, and an increased availability of reliable three-dimensional structural information led to the recognition that proteins with diverse functions can belong to the same superfamily. Analyses of the evolution of these superfamilies promises to provide insights into early evolution but are complicated by several important evolutionary processes. Horizontal transfer of genes can lead to a vertical spread of innovations among organisms, therefore finding a certain property in some descendants of an ancestor does not guarantee that it was present in that ancestor. Complete or partial gene conversion between duplicated genes can yield phylogenetic trees with several, apparently independent gene duplications, suggesting an often surprising parallelism in the evolution of independent lineages. Additionally, the breakup of domains within a protein and the fusion of domains into multifunctional proteins makes the delineation of superfamilies a task that remains difficult to automate.
Collapse
Affiliation(s)
- J P Gogarten
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, 06269, USA.
| | | |
Collapse
|
27
|
Dahl C, Rákhely G, Pott-Sperling AS, Fodor B, Takács M, Tóth A, Kraeling M, Gy"orfi K, Kovács A, Tusz J, Kovács KL. Genes involved in hydrogen and sulfur metabolism in phototrophic sulfur bacteria. FEMS Microbiol Lett 1999; 180:317-24. [PMID: 10556728 DOI: 10.1111/j.1574-6968.1999.tb08812.x] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022] Open
Abstract
The dsr genes and the hydSL operon are present as separate entities in phototrophic sulfur oxidizers of the genera Allochromatium, Marichromatium, Thiocapsa and Thiocystis and are organized similarly as in Allochromatium vinosum and Thiocapsa roseopersicina, respectively. The dsrA gene, encoding the alpha subunit of 'reverse' siroheme sulfite reductase, is also present in two species of green sulfur bacteria pointing to an important and universal role of this enzyme and probably other proteins encoded in the dsr locus in the oxidation of stored sulfur by phototrophic bacteria. The hupSL genes are uniformly present in the members of the Chromatiaceae family tested. The two genes between hydS and hydL encode a membrane-bound b-type cytochrome and a soluble iron-sulfur protein, respectively, resembling subunits of heterodisulfide reductase from methanogenic archaea. These genes are similar but not identical to dsrM and dsrK, indicating that the derived proteins have distinct functions, the former in hydrogen metabolism and the latter in oxidative sulfur metabolism.
Collapse
Affiliation(s)
- C Dahl
- Institut für Mikrobiologie und Biotechnologie, Rheinische Friedrich-Wilhelms-Universität Bonn, Meckenheimer Allee 168, D-53115, Bonn, Germany
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Molitor M, Dahl C, Molitor I, Schäfer U, Speich N, Huber R, Deutzmann R, Trüper HG. A dissimilatory sirohaem-sulfite-reductase-type protein from the hyperthermophilic archaeon Pyrobaculum islandicum. MICROBIOLOGY (READING, ENGLAND) 1998; 144 ( Pt 2):529-541. [PMID: 9493389 DOI: 10.1099/00221287-144-2-529] [Citation(s) in RCA: 60] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
A sulfite-reductase-type protein was purified from the hyperthermophilic crenarchaeote Pyrobaculum islandicum grown chemoorganoheterotrophically with thiosulfate as terminal electron acceptor. In common with dissimilatory sulfite reductases the protein has an alpha 2 beta 2 structure and contains high-spin sirohaem, non-haem iron and acid-labile sulfide. The oxidized protein exhibits absorption maxima at 280, 392, 578 and 710 nm with shoulders at 430 and 610 nm. The isoelectric point of pH 8.4 sets the protein apart from all dissimilatory sulfite reductases characterized thus far. The genes for the alpha- and beta-subunits (dsrA and dsrB) are contiguous in the order dsrAdsrB and most probably comprise an operon with the directly following dsrG and dsrC genes. dsrG and dsrC encode products which are homologous to eukaryotic glutathione S-transferases and the proposed gamma-subunit of Desulfovibrio vulgaris sulfite reductase, respectively. dsrA and dsrB encode 44.2 kDa and 41.2 kDa peptides which show significant similarity to the two homologous subunits DsrA and DsrB of dissimilatory sulfite reductases. Phylogenetic analyses indicate a common protogenotic origin of the P. islandicum protein and the dissimilatory sulfite reductases from sulfate-reducing and sulfide-oxidizing prokaryotes. However, the protein from P. islandicum and the sulfite reductases from sulfate-reducers and from sulfur-oxidizers most probably evolved into three independent lineages prior to divergence of archaea and bacteria.
Collapse
Affiliation(s)
- Michael Molitor
- Institut für Mikrobiologie & Biotechnologie, Rheinische Friedrich-Wilhelms-Universität Bonn, 53115 Bonn, Germany
| | - Christiane Dahl
- Institut für Mikrobiologie & Biotechnologie, Rheinische Friedrich-Wilhelms-Universität Bonn, 53115 Bonn, Germany
| | - Ilka Molitor
- Institut für Mikrobiologie & Biotechnologie, Rheinische Friedrich-Wilhelms-Universität Bonn, 53115 Bonn, Germany
| | - Ulrike Schäfer
- Institut für Mikrobiologie & Biotechnologie, Rheinische Friedrich-Wilhelms-Universität Bonn, 53115 Bonn, Germany
| | - Norbert Speich
- Institut für Mikrobiologie & Biotechnologie, Rheinische Friedrich-Wilhelms-Universität Bonn, 53115 Bonn, Germany
| | - Robert Huber
- Lehrstuhl für Mikrobiologie Universitätsstr. 31, 93053 Regensburg and Institut für Biochemie
| | | | - Hans G Trüper
- Institut für Mikrobiologie & Biotechnologie, Rheinische Friedrich-Wilhelms-Universität Bonn, 53115 Bonn, Germany
| |
Collapse
|
29
|
Nelson MA, Kang S, Braun EL, Crawford ME, Dolan PL, Leonard PM, Mitchell J, Armijo AM, Bean L, Blueyes E, Cushing T, Errett A, Fleharty M, Gorman M, Judson K, Miller R, Ortega J, Pavlova I, Perea J, Todisco S, Trujillo R, Valentine J, Wells A, Werner-Washburne M, Natvig DO. Expressed sequences from conidial, mycelial, and sexual stages of Neurospora crassa. Fungal Genet Biol 1997; 21:348-63. [PMID: 9290248 DOI: 10.1006/fgbi.1997.0986] [Citation(s) in RCA: 118] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
In the Neurospora Genome Project at the University of New Mexico, expressed sequence tags (ESTs) corresponding to three stages of the life cycle of the filamentous fungus Neurospora crassa are being analyzed. The results of a pilot project to identify expressed genes and determine their patterns of expression are presented. 1,865 partial complementary DNA (cDNA) sequences for 1,409 clones were determined using single-pass sequencing. Contig analysis allowed the identification of 838 unique ESTs and 156 ESTs present in multiple cDNA clones. For about 34% of the sequences, highly or moderately significant matches to sequences (of known and unknown function) in the NCBI database were detected. Approximately 56% of the ESTs showed no similarity to previously identified genes. Among genes with assigned function, about 43.3% were involved in metabolism, 32.9% in protein synthesis and 8.4% in RNA synthesis. Fewer were involved in defense (6%), cell signalling (3.4%), cell structure (3.4%) and cell division (2.6%).
Collapse
Affiliation(s)
- M A Nelson
- Department of Biology, University of New Mexico, Albuquerque 87131, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Braun EL, Fuge EK, Padilla PA, Werner-Washburne M. A stationary-phase gene in Saccharomyces cerevisiae is a member of a novel, highly conserved gene family. J Bacteriol 1996; 178:6865-72. [PMID: 8955308 PMCID: PMC178587 DOI: 10.1128/jb.178.23.6865-6872.1996] [Citation(s) in RCA: 65] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
The regulation of cellular growth and proliferation in response to environmental cues is critical for development and the maintenance of viability in all organisms. In unicellular organisms, such as the budding yeast Saccharomyces cerevisiae, growth and proliferation are regulated by nutrient availability. We have described changes in the pattern of protein synthesis during the growth of S. cerevisiae cells to stationary phase (E. K. Fuge, E. L. Braun, and M. Werner-Washburne, J. Bacteriol. 176:5802-5813, 1994) and noted a protein, which we designated Snz1p (p35), that shows increased synthesis after entry into stationary phase. We report here the identification of the SNZ1 gene, which encodes this protein. We detected increased SNZ1 mRNA accumulation almost 2 days after glucose exhaustion, significantly later than that of mRNAs encoded by other postexponential genes. SNZ1-related sequences were detected in phylogenetically diverse organisms by sequence comparisons and low-stringency hybridization. Multiple SNZ1-related sequences were detected in some organisms, including S. cerevisiae. Snz1p was found to be among the most evolutionarily conserved proteins currently identified, indicating that we have identified a novel, highly conserved protein involved in growth arrest in S. cerevisiae. The broad phylogenetic distribution, the regulation of the SNZ1 mRNA and protein in S. cerevisiae, and identification of a Snz protein modified during sporulation in the gram-positive bacterium Bacillus subtilis support the hypothesis that Snz proteins are part of an ancient response that occurs during nutrient limitation and growth arrest.
Collapse
Affiliation(s)
- E L Braun
- Department of Biology, University of New Mexico, Albuquerque 87131, USA
| | | | | | | |
Collapse
|
31
|
Abstract
▪ Abstract With the discovery of the eukaryote nucleus, all living organisms were neatly divided into prokaryotes, which lacked a nucleus, and eukaryotes, which possessed it. As data derived directly from the genome became available, it was clear that prokaryotes were comprised of two groups, Eubacteria and Archaebacteria. These were subsequently renamed at the new taxonomic level of Domain as Bacteria and Archaea, with the eukaryotes named as the Eucarya Domain. The interrelationships of the three Domains are still subject to discussion and evaluation, as is their monophyly. Further data, drawn from various protein sequences, suggest conflicting schemes, and resolution may not be straightforward. Additionally, Bacteria and Archaea as well as Eucarya are largely based on organisms already in culture. Investigation of the potentially enormous quantity of uncultured organisms in nature is likely to have as broad-ranging implications as the exploration of new protein sequences.
Collapse
Affiliation(s)
- David M. Williams
- The Natural History Museum, Cromwell Road, London SW7 5BD, United Kingdom
| | - T. Martin Embley
- The Natural History Museum, Cromwell Road, London SW7 5BD, United Kingdom
| |
Collapse
|
32
|
Hwang DM, Dempsey A, Tan KT, Liew CC. A modular domain of NifU, a nitrogen fixation cluster protein, is highly conserved in evolution. J Mol Evol 1996; 43:536-40. [PMID: 8875867 DOI: 10.1007/bf02337525] [Citation(s) in RCA: 47] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
hnifU, a gene exhibiting similarity to nifU genes of nitrogen fixation gene clusters, was identified in the course of expressed sequence tag (EST) generation from a human fetal heart cDNA library. Northern blot of human tissues and polymerase chain reaction (PCR) using human genomic DNA verified that the hnifU gene represented a human gene rather than a microbial contaminant of the cDNA library. Conceptual translation of the hnifU cDNA yielded a protein product bearing 77% and 70% amino acid identity to NifU-like hypothetical proteins from Haemophilus influenzae and Saccharomyces cerevisiae, respectively, and 40-44% identity to the N-terminal regions of NifU proteins from several diazatrophs (i.e., nitrogen-fixing organisms). Pairwise determination of amino acid identities between the NifU-like proteins of nondiazatrophs showed that these NifU-like proteins exhibited higher sequence identity to each other (63-77%) than to the diazatrophic NifU proteins (40-48%). Further, the NifU-like proteins of non-nitrogen-fixing organisms were similar only to the N-terminal region of diazatrophic NifU proteins and therefore identified a novel modular domain in these NifU proteins. These findings support the hypothesis that NifU is indeed a modular protein. The high degree of sequence similarity between NifU-like proteins from species as divergent as humans and H. influenzae suggests that these proteins perform some basic cellular function and may be among the most highly conserved proteins.
Collapse
Affiliation(s)
- D M Hwang
- Department of Clinical Biochemistry, The Centre for Cardiovascular Research, The Toronto Hospital, University of Toronto, Canada
| | | | | | | |
Collapse
|
33
|
Theissen G, Kim JT, Saedler H. Classification and phylogeny of the MADS-box multigene family suggest defined roles of MADS-box gene subfamilies in the morphological evolution of eukaryotes. J Mol Evol 1996; 43:484-516. [PMID: 8875863 DOI: 10.1007/bf02337521] [Citation(s) in RCA: 281] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
The MADS-box encodes a novel type of DNA-binding domain found so far in a diverse group of transcription factors from yeast, animals, and seed plants. Here, our first aim was to evaluate the primary structure of the MADS-box. Compilation of the 107 currently available MADS-domain sequences resulted in a signature which can strictly discriminate between genes possessing or lacking a MADS-domain and allowed a classification of MADS-domain proteins into several distinct subfamilies. A comprehensive phylogenetic analysis of known eukaryotic MADS-box genes, which is the first comprising animal as well as fungal and plant homologs, showed that the vast majority of subfamily members appear on distinct subtrees of phylogenetic trees, suggesting that subfamilies represent monophyletic gene clades and providing the proposed classification scheme with a sound evolutionary basis. A reconstruction of the history of the MADS-box gene subfamilies based on the taxonomic distribution of contemporary subfamily members revealed that each subfamily comprises highly conserved putative orthologs and recent paralogs. Some subfamilies must be very old (1,000 MY or more), while others are more recent. In general, subfamily members tend to share highly similar sequences, expression patterns, and related functions. The defined species distribution, specific function, and strong evolutionary conservation of the members of most subfamilies suggest that the establishment of different subfamilies was followed by rapid fixation and was thus highly advantageous during eukaryotic evolution. These gene subfamilies may have been essential prerequisites for the establishment of several complex eukaryotic body structures, such as muscles in animals and certain reproductive structures in higher plants, and of some signal transduction pathways. Phylogenetic trees indicate that after establishment of different subfamilies, additional gene duplications led to a further increase in the number of MADS-box genes. However, several molecular mechanisms of MADS-box gene diversification were used to a quite different extent during animal and plant evolution. Known plant MADS-domain sequences diverged much faster than those of animals, and gene duplication and sequence diversification were extensively used for the creation of new genes during plant evolution, resulting in a relatively large number of interacting genes. In contrast, the available data on animal genes suggest that increase in gene number was only moderate in the lineage leading to mammals, but in the case of MEF2-like gene products, heterodimerization between different splice variants may have increased the combinatorial possibilities of interactions considerably. These observations demonstrate that in metazoan and plant evolution, increased combinatorial possibilities of MADS-box gene product interactions correlated with the evolution of increasingly complex body plans.
Collapse
Affiliation(s)
- G Theissen
- Max-Planck-Institut für Züchtungsforschung, Abteilung Molekulare Pflanzengenetik, Carl-von-Linné-Weg 10, D-50829 Köln, Germany
| | | | | |
Collapse
|
34
|
Alifano P, Fani R, Liò P, Lazcano A, Bazzicalupo M, Carlomagno MS, Bruni CB. Histidine biosynthetic pathway and genes: structure, regulation, and evolution. Microbiol Rev 1996; 60:44-69. [PMID: 8852895 PMCID: PMC239417 DOI: 10.1128/mr.60.1.44-69.1996] [Citation(s) in RCA: 155] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Affiliation(s)
- P Alifano
- Dipartimento di Biologia e Patologia Cellulare e Molecolare L. Califano, Università degli Studi di Napoli Federico II, Italy
| | | | | | | | | | | | | |
Collapse
|