Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Poptsova MS, Gogarten JP. BranchClust: a phylogenetic algorithm for selecting gene families. BMC Bioinformatics 2007;8:120. [PMID: 17425803 PMCID: PMC1853112 DOI: 10.1186/1471-2105-8-120] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2006] [Accepted: 04/10/2007] [Indexed: 11/10/2022] Open

For:	Poptsova MS, Gogarten JP. BranchClust: a phylogenetic algorithm for selecting gene families. BMC Bioinformatics 2007;8:120. [PMID: 17425803 PMCID: PMC1853112 DOI: 10.1186/1471-2105-8-120] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2006] [Accepted: 04/10/2007] [Indexed: 11/10/2022] Open

Number

Cited by Other Article(s)

Grau-Bové X, Sebé-Pedrós A. Orthology clusters from gene trees with Possvm. Mol Biol Evol 2021;38:5204-5208. [PMID: 34352080 PMCID: PMC8557443 DOI: 10.1093/molbev/msab234] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open

Lallemand T, Leduc M, Landès C, Rizzon C, Lerat E. An Overview of Duplicated Gene Detection Methods: Why the Duplication Mechanism Has to Be Accounted for in Their Choice. Genes (Basel) 2020;11:E1046. [PMID: 32899740 PMCID: PMC7565063 DOI: 10.3390/genes11091046] [Citation(s) in RCA: 51] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Revised: 09/01/2020] [Accepted: 09/02/2020] [Indexed: 12/11/2022] Open

Rangel LT, Marden J, Colston S, Setubal JC, Graf J, Gogarten JP. Identification and characterization of putative Aeromonas spp. T3SS effectors. PLoS One 2019;14:e0214035. [PMID: 31163020 PMCID: PMC6548356 DOI: 10.1371/journal.pone.0214035] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2019] [Accepted: 05/21/2019] [Indexed: 11/23/2022] Open

Inferring Orthology and Paralogy. Methods Mol Biol 2019;1910:149-175. [PMID: 31278664 DOI: 10.1007/978-1-4939-9074-0_5] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Trail F, Wang Z, Stefanko K, Cubba C, Townsend JP. The ancestral levels of transcription and the evolution of sexual phenotypes in filamentous fungi. PLoS Genet 2017;13:e1006867. [PMID: 28704372 PMCID: PMC5509106 DOI: 10.1371/journal.pgen.1006867] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2017] [Accepted: 06/13/2017] [Indexed: 12/29/2022] Open

Marcelletti S, Scortichini M. Xylella fastidiosa CoDiRO strain associated with the olive quick decline syndrome in southern Italy belongs to a clonal complex of the subspecies pauca that evolved in Central America. Microbiology (Reading) 2016;162:2087-2098. [DOI: 10.1099/mic.0.000388] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Marcelletti S, Scortichini M. Genome-wide comparison and taxonomic relatedness of multiple Xylella fastidiosa strains reveal the occurrence of three subspecies and a new Xylella species. Arch Microbiol 2016;198:803-12. [PMID: 27209415 DOI: 10.1007/s00203-016-1245-1] [Citation(s) in RCA: 53] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2016] [Revised: 04/13/2016] [Accepted: 05/16/2016] [Indexed: 11/30/2022]

Tekaia F. Inferring Orthologs: Open Questions and Perspectives. GENOMICS INSIGHTS 2016;9:17-28. [PMID: 26966373 PMCID: PMC4778853 DOI: 10.4137/gei.s37925] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/18/2015] [Revised: 12/30/2015] [Accepted: 01/02/2016] [Indexed: 01/25/2023]

Marcelletti S, Scortichini M. Comparative Genomic Analyses of Multiple Pseudomonas Strains Infecting Corylus avellana Trees Reveal the Occurrence of Two Genetic Clusters with Both Common and Distinctive Virulence and Fitness Traits. PLoS One 2015;10:e0131112. [PMID: 26147218 PMCID: PMC4492584 DOI: 10.1371/journal.pone.0131112] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2015] [Accepted: 05/28/2015] [Indexed: 01/26/2023] Open

Lehr NA, Wang Z, Li N, Hewitt DA, López-Giráldez F, Trail F, Townsend JP. Gene expression differences among three Neurospora species reveal genes required for sexual reproduction in Neurospora crassa. PLoS One 2014;9:e110398. [PMID: 25329823 PMCID: PMC4203796 DOI: 10.1371/journal.pone.0110398] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2014] [Accepted: 09/16/2014] [Indexed: 12/23/2022] Open

Abstract

Many fungi form complex three-dimensional fruiting bodies, within which the meiotic machinery for sexual spore production has been considered to be largely conserved over evolutionary time. Indeed, much of what we know about meiosis in plant and animal taxa has been deeply informed by studies of meiosis in Saccharomyces and Neurospora. Nevertheless, the genetic basis of fruiting body development and its regulation in relation to meiosis in fungi is barely known, even within the best studied multicellular fungal model Neurospora crassa. We characterized morphological development and genome-wide transcriptomics in the closely related species Neurospora crassa, Neurospora tetrasperma, and Neurospora discreta, across eight stages of sexual development. Despite diverse life histories within the genus, all three species produce vase-shaped perithecia. Transcriptome sequencing provided gene expression levels of orthologous genes among all three species. Expression of key meiosis genes and sporulation genes corresponded to known phenotypic and developmental differences among these Neurospora species during sexual development. We assembled a list of genes putatively relevant to the recent evolution of fruiting body development by sorting genes whose relative expression across developmental stages increased more in N. crassa relative to the other species. Then, in N. crassa, we characterized the phenotypes of fruiting bodies arising from crosses of homozygous knockout strains of the top genes. Eight N. crassa genes were found to be critical for the successful formation of perithecia. The absence of these genes in these crosses resulted in either no perithecium formation or in arrested development at an early stage. Our results provide insight into the genetic basis of Neurospora sexual reproduction, which is also of great importance with regard to other multicellular ascomycetes, including perithecium-forming pathogens, such as Claviceps purpurea, Ophiostoma ulmi, and Glomerella graminicola.

Collapse

Rusin LY, Lyubetskaya EV, Gorbunov KY, Lyubetsky VA. Reconciliation of gene and species trees. BIOMED RESEARCH INTERNATIONAL 2014;2014:642089. [PMID: 24800245 PMCID: PMC3985182 DOI: 10.1155/2014/642089] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 08/11/2013] [Accepted: 11/27/2013] [Indexed: 11/18/2022]

Reconstructed ancestral Myo-inositol-3-phosphate synthases indicate that ancestors of the Thermococcales and Thermotoga species were more thermophilic than their descendants. PLoS One 2013;8:e84300. [PMID: 24391933 PMCID: PMC3877268 DOI: 10.1371/journal.pone.0084300] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2013] [Accepted: 11/19/2013] [Indexed: 01/06/2023] Open

Metabolic analysis of Chlorobium chlorochromatii CaD3 reveals clues of the symbiosis in 'Chlorochromatium aggregatum'. ISME JOURNAL 2013;8:991-8. [PMID: 24285361 DOI: 10.1038/ismej.2013.207] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2013] [Revised: 09/25/2013] [Accepted: 10/07/2013] [Indexed: 11/08/2022]

Abstract

A symbiotic association occurs in 'Chlorochromatium aggregatum', a phototrophic consortium integrated by two species of phylogenetically distant bacteria composed by the green-sulfur Chlorobium chlorochromatii CaD3 epibiont that surrounds a central β-proteobacterium. The non-motile chlorobia can perform nitrogen and carbon fixation, using sulfide as electron donors for anoxygenic photosynthesis. The consortium can move due to the flagella present in the central β-protobacterium. Although Chl. chlorochromatii CaD3 is never found as free-living bacteria in nature, previous transcriptomic and proteomic studies have revealed that there are differential transcription patterns between the symbiotic and free-living status of Chl. chlorocromatii CaD3 when grown in laboratory conditions. The differences occur mainly in genes encoding the enzymatic reactions involved in nitrogen and amino acid metabolism. We performed a metabolic reconstruction of Chl. chlorochromatii CaD3 and an in silico analysis of its amino acid metabolism using an elementary flux modes approach (EFM). Our study suggests that in symbiosis, Chl. chlorochromatii CaD3 is under limited nitrogen conditions where the GS/GOGAT (glutamine synthetase/glutamate synthetase) pathway is actively assimilating ammonia obtained via N2 fixation. In contrast, when free-living, Chl. chlorochromatii CaD3 is in a condition of nitrogen excess and ammonia is assimilated by the alanine dehydrogenase (AlaDH) pathway. We postulate that 'Chlorochromatium aggregatum' originated from a parasitic interaction where the N2 fixation capacity of the chlorobia would be enhanced by injection of 2-oxoglutarate from the β-proteobacterium via the periplasm. This consortium would have the advantage of motility, which is fundamental to a phototrophic bacterium, and the syntrophy of nitrogen and carbon sources.

Collapse

Scortichini M, Marcelletti S, Ferrante P, Firrao G. A Genomic redefinition of Pseudomonas avellanae species. PLoS One 2013;8:e75794. [PMID: 24086635 PMCID: PMC3783423 DOI: 10.1371/journal.pone.0075794] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2013] [Accepted: 08/20/2013] [Indexed: 11/18/2022] Open

Abstract

The circumscription of bacterial species is a complex task. So far, DNA-DNA hybridization (DDH), 16S rRNA gene sequencing, and multiocus sequence typing analysis (MLSA) are currently the preferred techniques for their genetic determination. However, the average nucleotide identity (ANI) analysis of conserved and shared genes between two bacterial strains based on the pair-wise genome comparisons, with support of the tetranucleotide frequency correlation coefficients (TETRA) value, has recently been proposed as a reliable substitute for DDH. The species demarcation boundary has been set to a value of 95-96% of the ANI identity, with further confirmation through the assessment of the corresponding TETRA value. In this study, we performed a genome-wide MLSA of 14 phytopathogenic pseudomonads genomes, and assessed the ANI and TETRA values of 27 genomes, representing seven out of the nine genomospecies of Pseudomonas spp. sensu Gardan et alii, and their phylogenetic relationships using maximum likelihood and Bayesian approaches. The results demonstrate the existence of a well demarcated genomic cluster that includes strains classified as P. avellanae, P. syringae pv. theae, P. s. pv. actinidiae and one P. s. pv. morsprunorum strain all belonging to the single species P. avellanae. In addition, when compared with P. avellanae, five strains of P. s. pv. tomato, including the model strain DC3000, and one P. s. pv. lachrymans strain, appear as very closely related to P. avellanae, with ANI values of nearly 96% as confirmed by the TETRA analysis. Conversely, one representative strain, previously classified as P. avellanae and isolated in central Italy, is a genuine member of the P. syringae species complex and can be defined as P. s. pv. avellanae. Currently. The core and pan genomes of P. avellanae species consist of 3,995 and 5,410 putative protein-coding genes, respectively.

Collapse

Firrao G, Martini M, Ermacora P, Loi N, Torelli E, Foissac X, Carle P, Kirkpatrick BC, Liefting L, Schneider B, Marzachì C, Palmano S. Genome wide sequence analysis grants unbiased definition of species boundaries in "Candidatus Phytoplasma". Syst Appl Microbiol 2013;36:539-48. [PMID: 24034865 DOI: 10.1016/j.syapm.2013.07.003] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2013] [Revised: 07/08/2013] [Accepted: 07/18/2013] [Indexed: 10/26/2022]

Abstract

The phytoplasmas are currently named using the Candidatus category, as the inability to grow them in vitro prevented (i) the performance of tests, such as DNA-DNA hybridization, that are regarded as necessary to establish species boundaries, and (ii) the deposition of type strains in culture collections. The recent accession to complete or nearly complete genome sequence information disclosed the opportunity to apply to the uncultivable phytoplasmas the same taxonomic approaches used for other bacteria. In this work, the genomes of 14 strains, belonging to the 16SrI, 16SrIII, 16SrV and 16SrX groups, including the species "Ca. P. asteris", "Ca. P. mali", "Ca. P. pyri", "Ca. P. pruni", and "Ca. P. australiense" were analyzed along with Acholeplasma laidlawi, to determine their taxonomic relatedness. Average nucleotide index (ANIm), tetranucleotide signature frequency correlation index (Tetra), and multilocus sequence analysis of 107 shared genes using both phylogenetic inference of concatenated (DNA and amino acid) sequences and consensus networks, were carried out. The results were in large agreement with the previously established 16S rDNA based classification schemes. Moreover, the taxonomic relationships within the 16SrI, 16SrIII and 16SrX groups, that represent clusters of strains whose relatedness could not be determined by 16SrDNA analysis, could be comparatively evaluated with non-subjective criteria. "Ca. P. mali" and "Ca. P. pyri" were found to meet the genome characteristics for the retention into two different, yet strictly related species; representatives of subgroups 16SrI-A and 16SrI-B were also found to meet the standards used in other bacteria to distinguish separate species; the genomes of the strains belonging to 16SrIII were found more closely related, suggesting that their subdivision into Candidatus species should be approached with caution.

Collapse

Williams D, Gogarten JP, Papke RT. Quantifying homologous replacement of loci between haloarchaeal species. Genome Biol Evol 2013;4:1223-44. [PMID: 23160063 PMCID: PMC3542582 DOI: 10.1093/gbe/evs098] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open

Ding Y, Cai Y, Han Y, Zhao B, Zhu L. Application of principal component analysis to determine the key structural features contributing to iron superoxide dismutase thermostability. Biopolymers 2012;97:864-72. [PMID: 22899361 DOI: 10.1002/bip.22093] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Saccardo F, Martini M, Palmano S, Ermacora P, Scortichini M, Loi N, Firrao G. Genome drafts of four phytoplasma strains of the ribosomal group 16SrIII. MICROBIOLOGY-SGM 2012;158:2805-2814. [PMID: 22936033 DOI: 10.1099/mic.0.061432-0] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]

Abstract

By applying a coverage-based read selection and filtration through a healthy plant dataset, and a post-assembly contig selection based on homology and linkage, genome sequence drafts were obtained for four phytoplasma strains belonging to the 16SrIII group (X disease clade), namely Vaccinium Witches' Broom phytoplasma (647 754 nt in 272 contigs), Italian Clover Phyllody phytoplasma strain MA (597 245 nt in 197 contigs), Poinsettia branch-inducing phytoplasma strain JR1 (631 440 nt in 185 contigs) and Milkweed Yellows phytoplasma (583 806 nt in 158 contigs). Despite assignment to different 16SrIII subgroups, the genomes of the four strains were similar, comprising a highly conserved core (92-98 % similar in their nucleotide sequence among each other over alignments about 500 kb in length) and a minor strain-specific component. As far as their protein complement was concerned, they did not differ significantly in their basic metabolism potential from the genomes of other wide-host-range phytoplasmas sequenced previously, but were distinct from strains of other species, as well as among each other, in genes encoding functions conceivably related to interactions with the host, such as membrane trafficking components, proteases, DNA methylases, effectors and several hypothetical proteins of unknown function, some of which are likely secreted through the Sec-dependent secretion system. The four genomes displayed a group of genes encoding hypothetical proteins with high similarity to a central domain of IcmE/DotG, a core component of the type IVB secretion system of Gram-negative Legionella spp. Conversely, genes encoding functional GroES/GroEL chaperones were not detected in any of the four drafts. The results also indicated the significant role of horizontal gene transfer among different 'Candidatus Phytoplasma' species in shaping phytoplasma genomes and promoting their diversity.

Collapse

Azad RK, Lawrence JG. Detecting laterally transferred genes. Methods Mol Biol 2012;855:281-308. [PMID: 22407713 DOI: 10.1007/978-1-61779-582-4_10] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]

Altenhoff AM, Dessimoz C. Inferring orthology and paralogy. Methods Mol Biol 2012;855:259-79. [PMID: 22407712 DOI: 10.1007/978-1-61779-582-4_9] [Citation(s) in RCA: 78] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]

Tekaia F, Yeramian E. SuperPartitions: detection and classification of orthologs. Gene 2011;492:199-211. [PMID: 22056699 DOI: 10.1016/j.gene.2011.10.027] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2011] [Revised: 10/08/2011] [Accepted: 10/11/2011] [Indexed: 10/16/2022]

Abstract

The proper detection of orthologs is crucial for evolutionary studies of genes and species. Despite large efforts to solve this problem the methodological situation appears unsettled to a large extent and the "quest for orthologs" is still an ongoing task in large-scale genome comparisons. Here, we introduce a simple operational framework for the detection of orthologs and their classification. The operational framework relies on well-established principles, optimizing their implementation for the considered purposes, and chaining components in coherent procedures: 1) We take advantage of the efficiency and simplicity of the Reciprocal Best Hit (RBH) detections, remedying (by design) the drawback concerning the limitations in terms of 1:1 detections. The procedure is based on the partitioning of Reciprocal Best Hits, with the further merging of partitions including members of the same paralogous classes ("SuperPartition of Orthologs" (SPOs)). 2) We then resort to the conservation profiles of the obtained clusters, allowing simple detection of SPOs containing duplicated members. Based on accepted evolutionary principles, such members can be further tagged as in-paralogs (co-orthologs) or out-paralogs. The method is illustrated and validated by extensive genomic analyses. The performances of the overall approach are characterized in global terms for three sets of species (Chlamydiae, Mycobacteria, Aspergilli), showing that at least 75% of the sets of orthologs contain at most one protein from a given species. The sets including more than one protein from a given species are shown to contain in-paralogs in proportions varying from 28% to 58%. The characterizations also show that the large majority of SPOs are associated with ancestral motifs, and accordingly not prone to chaining effects that might be triggered by multi-domain proteins. Further the SPO formulation is compared to other similarity based ortholog detection methods. Beyond core common results, significant differences are observed between various methods, which can be accounted for to a large extent on conceptual grounds, relative to the different merging schemes involved. Such comparisons highlight a major advantage of the SPO approach concerning the proper clustering of associated paralogs, which appear to be often dispatched spuriously into distinct orthologous classes. Finally the perspectives for future applications and elaborations of SPO-based compositional analyses are discussed.

Collapse

Beiko RG. Telling the whole story in a 10,000-genome world. Biol Direct 2011;6:34. [PMID: 21714939 PMCID: PMC3158115 DOI: 10.1186/1745-6150-6-34] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2011] [Accepted: 06/30/2011] [Indexed: 01/07/2023] Open

Abstract

BACKGROUND

Genome sequencing has revolutionized our view of the relationships among genomes, particularly in revealing the confounding effects of lateral genetic transfer (LGT). Phylogenomic techniques have been used to construct purported trees of microbial life. Although such trees are easily interpreted and allow the use of a subset of genomes as "proxies" for the full set, LGT and other phenomena impact the positioning of different groups in genome trees, confounding and potentially invalidating attempts to construct a phylogeny-based taxonomy of microorganisms. Network and graph approaches can reveal complex sets of relationships, but applying these techniques to large data sets is a significant challenge. Notwithstanding the question of what exactly it might represent, generating and interpreting a Tree or Network of All Genomes will only be feasible if current algorithms can be improved upon.

RESULTS

Complex relationships among even the most-similar genomes demonstrate that proxy-based approaches to simplifying large sets of genomes are not alone sufficient to solve the analysis problem. A phylogenomic analysis of 1173 sequenced bacterial and archaeal genomes generated phylogenetic trees for 159,905 distinct homologous gene sets. The relationships inferred from this set can be heavily dependent on the inclusion of other taxa: for example, phyla such as Spirochaetes, Proteobacteria and Firmicutes are recovered as cohesive groups or split depending on the presence of other specific lineages. Furthermore, named groups such as Acidithiobacillus, Coprothermobacter and Brachyspira show a multitude of affiliations that are more consistent with their ecology than with small subunit ribosomal DNA-based taxonomy. Network and graph representations can illustrate the multitude of conflicting affinities, but all methods impose constraints on the input data and create challenges of construction and interpretation.

CONCLUSIONS

These complex relationships highlight the need for an inclusive approach to genomic data, and current methods with minor alterations will likely scale to allow the analysis of data sets with 10,000 or more genomes. The main challenges lie in the visualization and interpretation of genomic relationships, and the redefinition of microbial taxonomy when subsets of genomic data are so evidently in conflict with one another, and with the "canonical" molecular taxonomy.

Collapse

Plett D, Toubia J, Garnett T, Tester M, Kaiser BN, Baumann U. Dichotomy in the NRT gene families of dicots and grass species. PLoS One 2010;5:e15289. [PMID: 21151904 PMCID: PMC2997785 DOI: 10.1371/journal.pone.0015289] [Citation(s) in RCA: 106] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2010] [Accepted: 11/04/2010] [Indexed: 11/19/2022] Open

Poptsova MS, Gogarten JP. Using comparative genome analysis to identify problems in annotated microbial genomes. Microbiology (Reading) 2010;156:1909-1917. [DOI: 10.1099/mic.0.033811-0] [Citation(s) in RCA: 80] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Jun J, Ryvkin P, Hemphill E, Nelson C. Duplication mechanism and disruptions in flanking regions determine the fate of Mammalian gene duplicates. J Comput Biol 2010;16:1253-66. [PMID: 19772436 DOI: 10.1089/cmb.2009.0074] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Jun J, Mandoiu II, Nelson CE. Identification of mammalian orthologs using local synteny. BMC Genomics 2009;10:630. [PMID: 20030836 PMCID: PMC2807883 DOI: 10.1186/1471-2164-10-630] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2009] [Accepted: 12/23/2009] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Accurate determination of orthology is central to comparative genomics. For vertebrates in particular, very large gene families, high rates of gene duplication and loss, multiple mechanisms of gene duplication, and high rates of retrotransposition all combine to make inference of orthology between genes difficult. Many methods have been developed to identify orthologous genes, mostly based upon analysis of the inferred protein sequence of the genes. More recently, methods have been proposed that use genomic context in addition to protein sequence to improve orthology assignment in vertebrates. Such methods have been most successfully implemented in fungal genomes and have long been used in prokaryotic genomes, where gene order is far less variable than in vertebrates. However, to our knowledge, no explicit comparison of synteny and sequence based definitions of orthology has been reported in vertebrates, or, more specifically, in mammals.

RESULTS

We test a simple method for the measurement and utilization of gene order (local synteny) in the identification of mammalian orthologs by investigating the agreement between coding sequence based orthology (Inparanoid) and local synteny based orthology. In the 5 mammalian genomes studied, 93% of the sampled inter-species pairs were found to be concordant between the two orthology methods, illustrating that local synteny is a robust substitute to coding sequence for identifying orthologs. However, 7% of pairs were found to be discordant between local synteny and Inparanoid. These cases of discordance result from evolutionary events including retrotransposition and genome rearrangements.

CONCLUSIONS

By analyzing cases of discordance between local synteny and Inparanoid we show that local synteny can distinguish between true orthologs and recent retrogenes, can resolve ambiguous many-to-many orthology relationships into one-to-one ortholog pairs, and might be used to identify cases of non-orthologous gene displacement by retroduplicated paralogs.

Collapse

Zhaxybayeva O, Doolittle WF, Papke RT, Gogarten JP. Intertwined evolutionary histories of marine Synechococcus and Prochlorococcus marinus. Genome Biol Evol 2009;1:325-39. [PMID: 20333202 PMCID: PMC2817427 DOI: 10.1093/gbe/evp032] [Citation(s) in RCA: 69] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/28/2009] [Indexed: 02/04/2023] Open

Similarity clustering of proteins using substantive knowledge and reconstruction of evolutionary gene histories in herpesvirus. Theor Chem Acc 2009. [DOI: 10.1007/s00214-009-0614-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]

Poptsova MS, Larionov SA, Ryadchenko EV, Rybalko SD, Zakharov IA, Loskutov A. Hidden chromosome symmetry: in silico transformation reveals symmetry in 2D DNA walk trajectories of 671 chromosomes. PLoS One 2009;4:e6396. [PMID: 19636424 PMCID: PMC2712679 DOI: 10.1371/journal.pone.0006396] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2009] [Accepted: 06/23/2009] [Indexed: 11/18/2022] Open

Detection and quantitative assessment of horizontal gene transfer. Methods Mol Biol 2009;532:195-213. [PMID: 19271186 DOI: 10.1007/978-1-60327-853-9_11] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

Commins J, Toft C, Fares MA. Computational biology methods and their application to the comparative genomics of endocellular symbiotic bacteria of insects. Biol Proced Online 2009;11:52-78. [PMID: 19495914 PMCID: PMC3055744 DOI: 10.1007/s12575-009-9004-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2009] [Accepted: 02/17/2009] [Indexed: 12/02/2022] Open

Ramsay H, Rieseberg LH, Ritland K. The correlation of evolutionary rate with pathway position in plant terpenoid biosynthesis. Mol Biol Evol 2009;26:1045-53. [PMID: 19188263 DOI: 10.1093/molbev/msp021] [Citation(s) in RCA: 87] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open

Sato N. Gclust: trans-kingdom classification of proteins using automatic individual threshold setting. Bioinformatics 2009;25:599-605. [DOI: 10.1093/bioinformatics/btp047] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open

Beiko RG, Ragan MA. Untangling hybrid phylogenetic signals: horizontal gene transfer and artifacts of phylogenetic reconstruction. Methods Mol Biol 2009;532:241-256. [PMID: 19271189 DOI: 10.1007/978-1-60327-853-9_14] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]

Poptsova M. Testing phylogenetic methods to identify horizontal gene transfer. Methods Mol Biol 2009;532:227-240. [PMID: 19271188 DOI: 10.1007/978-1-60327-853-9_13] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]

Wu H, Mao F, Olman V, Xu Y. On application of directons to functional classification of genes in prokaryotes. Comput Biol Chem 2008;32:176-84. [PMID: 18440870 DOI: 10.1016/j.compbiolchem.2008.02.007] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2007] [Accepted: 02/15/2008] [Indexed: 11/30/2022]

Poptsova MS, Gogarten JP. BranchClust: a phylogenetic algorithm for selecting gene families. BMC Bioinformatics 2007;8:120. [PMID: 17425803 PMCID: PMC1853112 DOI: 10.1186/1471-2105-8-120] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2006] [Accepted: 04/10/2007] [Indexed: 11/10/2022] Open