Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Glover N, Dessimoz C, Ebersberger I, Forslund SK, Gabaldón T, Huerta-Cepas J, Martin MJ, Muffato M, Patricio M, Pereira C, da Silva AS, Wang Y, Sonnhammer E, Thomas PD. Advances and Applications in the Quest for Orthologs. Mol Biol Evol 2020;36:2157-2164. [PMID: 31241141 PMCID: PMC6759064 DOI: 10.1093/molbev/msz150] [Citation(s) in RCA: 50] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open

For:	Glover N, Dessimoz C, Ebersberger I, Forslund SK, Gabaldón T, Huerta-Cepas J, Martin MJ, Muffato M, Patricio M, Pereira C, da Silva AS, Wang Y, Sonnhammer E, Thomas PD. Advances and Applications in the Quest for Orthologs. Mol Biol Evol 2020;36:2157-2164. [PMID: 31241141 PMCID: PMC6759064 DOI: 10.1093/molbev/msz150] [Citation(s) in RCA: 50] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open

Number

Cited by Other Article(s)

Fu X. How deep can we decipher protein evolution with deep learning models. PATTERNS (NEW YORK, N.Y.) 2024;5:101043. [PMID: 39233697 PMCID: PMC11368669 DOI: 10.1016/j.patter.2024.101043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 09/06/2024]

Rahiminejad S, De Sanctis B, Pevzner P, Mushegian A. Synthetic lethality and the minimal genome size problem. mSphere 2024;9:e0013924. [PMID: 38904396 PMCID: PMC11288024 DOI: 10.1128/msphere.00139-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2024] [Accepted: 05/13/2024] [Indexed: 06/22/2024] Open

Abstract

Gene knockout studies suggest that ~300 genes in a bacterial genome and ~1,100 genes in a yeast genome cannot be deleted without loss of viability. These single-gene knockout experiments do not account for negative genetic interactions, when two or more genes can each be deleted without effect, but their joint deletion is lethal. Thus, large-scale single-gene deletion studies underestimate the size of a minimal gene set compatible with cell survival. In yeast Saccharomyces cerevisiae, the viability of all possible deletions of gene pairs (2-tuples), and of some deletions of gene triplets (3-tuples), has been experimentally tested. To estimate the size of a yeast minimal genome from that data, we first established that finding the size of a minimal gene set is equivalent to finding the minimum vertex cover in the lethality (hyper)graph, where the vertices are genes and (hyper)edges connect k-tuples of genes whose joint deletion is lethal. Using the Lovász-Johnson-Chvatal greedy approximation algorithm, we computed the minimum vertex cover of the synthetic-lethal 2-tuples graph to be 1,723 genes. We next simulated the genetic interactions in 3-tuples, extrapolating from the existing triplet sample, and again estimated minimum vertex covers. The size of a minimal gene set in yeast rapidly approaches the size of the entire genome even when considering only synthetic lethalities in k-tuples with small k. In contrast, several studies reported successful experimental reductions of yeast and bacterial genomes by simultaneous deletions of hundreds of genes, without eliciting synthetic lethality. We discuss possible reasons for this apparent contradiction.IMPORTANCEHow can we estimate the smallest number of genes sufficient for a unicellular organism to survive on a rich medium? One approach is to remove genes one at a time and count how many of such deletion strains are unable to grow. However, the single-gene knockout data are insufficient, because joint gene deletions may result in negative genetic interactions, also known as synthetic lethality. We used a technique from graph theory to estimate the size of minimal yeast genome from partial data on synthetic lethality. The number of potential synthetic lethal interactions grows very fast when multiple genes are deleted, revealing a paradoxical contrast with the experimental reductions of yeast genome by ~100 genes, and of bacterial genomes by several hundreds of genes.

Collapse

Campos LRS, Trefflich S, Morais DAA, Imparato DO, Chagas VS, Albanus RD, Dalmolin RJS, Castro MAA. Bridge: A New Algorithm for Rooting Orthologous Genes in Large-Scale Evolutionary Analyses. Mol Biol Evol 2024;41:msae019. [PMID: 38306290 PMCID: PMC10873778 DOI: 10.1093/molbev/msae019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Revised: 01/21/2024] [Accepted: 01/29/2024] [Indexed: 02/04/2024] Open

Altenhoff AM, Warwick Vesztrocy A, Bernard C, Train CM, Nicheperovich A, Prieto Baños S, Julca I, Moi D, Nevers Y, Majidian S, Dessimoz C, Glover NM. OMA orthology in 2024: improved prokaryote coverage, ancestral and extant GO enrichment, a revamped synteny viewer and more in the OMA Ecosystem. Nucleic Acids Res 2024;52:D513-D521. [PMID: 37962356 PMCID: PMC10767875 DOI: 10.1093/nar/gkad1020] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 10/17/2023] [Accepted: 10/23/2023] [Indexed: 11/15/2023] Open

Affiliation(s)

Adrian M Altenhoff SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland ETH Zurich, Computer Science, Universitätstr. 6, 8092 Zurich, Switzerland
Alex Warwick Vesztrocy SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
Charles Bernard SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
Clement-Marie Train Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
Alina Nicheperovich Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
Silvia Prieto Baños SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
Irene Julca SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
David Moi SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
Yannis Nevers SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
Sina Majidian SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
Christophe Dessimoz SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
Natasha M Glover SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland

Collapse

Liu Q, Ye L, Li M, Wang Z, Xiong G, Ye Y, Tu T, Schwarzacher T, Heslop-Harrison JSP. Genome-wide expansion and reorganization during grass evolution: from 30 Mb chromosomes in rice and Brachypodium to 550 Mb in Avena. BMC PLANT BIOLOGY 2023;23:627. [PMID: 38062402 PMCID: PMC10704644 DOI: 10.1186/s12870-023-04644-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Accepted: 11/29/2023] [Indexed: 12/18/2023]

Affiliation(s)

Qing Liu Key Laboratory of Plant Resources Conservation and Sustainable Utilization, Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China. South China National Botanical Garden, Guangzhou, 510650, China. Center for Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Guangzhou, 510650, China.
Lyuhan Ye Key Laboratory of Plant Resources Conservation and Sustainable Utilization, Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China University of Chinese Academy of Sciences, Beijing, 100049, China
Mingzhi Li Bio&Data Biotechnologies Co. Ltd, Guangzhou, 510663, China
Ziwei Wang Henry Fok School of Biology and Agriculture, Shaoguan University, Shaoguan, 512005, China
Gui Xiong Key Laboratory of Plant Resources Conservation and Sustainable Utilization, Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China University of Chinese Academy of Sciences, Beijing, 100049, China
Yushi Ye Key Laboratory of Plant Resources Conservation and Sustainable Utilization, Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China South China National Botanical Garden, Guangzhou, 510650, China
Tieyao Tu Key Laboratory of Plant Resources Conservation and Sustainable Utilization, Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China South China National Botanical Garden, Guangzhou, 510650, China Center for Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Guangzhou, 510650, China
Trude Schwarzacher Key Laboratory of Plant Resources Conservation and Sustainable Utilization, Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China Department of Genetics and Genome Biology, Institute for Environmental Futures, University of Leicester, Leicester, LE1 7RH, UK
John Seymour Pat Heslop-Harrison Key Laboratory of Plant Resources Conservation and Sustainable Utilization, Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China. Department of Genetics and Genome Biology, Institute for Environmental Futures, University of Leicester, Leicester, LE1 7RH, UK.

Collapse

Nestor BJ, Bayer PE, Fernandez CGT, Edwards D, Finnegan PM. Approaches to increase the validity of gene family identification using manual homology search tools. Genetica 2023;151:325-338. [PMID: 37817002 PMCID: PMC10692271 DOI: 10.1007/s10709-023-00196-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Accepted: 10/01/2023] [Indexed: 10/12/2023]

Ceron-Noriega A, Schoonenberg VAC, Butter F, Levin M. AlexandrusPS: A User-Friendly Pipeline for the Automated Detection of Orthologous Gene Clusters and Subsequent Positive Selection Analysis. Genome Biol Evol 2023;15:evad187. [PMID: 37831426 PMCID: PMC10612477 DOI: 10.1093/gbe/evad187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Revised: 09/26/2023] [Accepted: 10/06/2023] [Indexed: 10/14/2023] Open

Rocha JJ, Jayaram SA, Stevens TJ, Muschalik N, Shah RD, Emran S, Robles C, Freeman M, Munro S. Functional unknomics: Systematic screening of conserved genes of unknown function. PLoS Biol 2023;21:e3002222. [PMID: 37552676 PMCID: PMC10409296 DOI: 10.1371/journal.pbio.3002222] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 06/27/2023] [Indexed: 08/10/2023] Open

Dosch J, Bergmann H, Tran V, Ebersberger I. FAS: assessing the similarity between proteins using multi-layered feature architectures. Bioinformatics 2023;39:btad226. [PMID: 37084276 PMCID: PMC10185405 DOI: 10.1093/bioinformatics/btad226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Revised: 02/23/2023] [Accepted: 04/13/2023] [Indexed: 04/23/2023] Open

Mize TJ, Funkhouser SA, Buck JM, Stitzel JA, Ehringer MA, Evans LM. Testing Association of Previously Implicated Gene Sets and Gene-Networks in Nicotine Exposed Mouse Models with Human Smoking Phenotypes. Nicotine Tob Res 2023;25:1030-1038. [PMID: 36444815 PMCID: PMC10077928 DOI: 10.1093/ntr/ntac269] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 08/15/2022] [Accepted: 11/23/2022] [Indexed: 11/30/2022]

Abstract

INTRODUCTION

Smoking behaviors are partly heritable, yet the genetic and environmental mechanisms underlying smoking phenotypes are not fully understood. Developmental nicotine exposure (DNE) is a significant risk factor for smoking and leads to gene expression changes in mouse models; however, it is unknown whether the same genes whose expression is impacted by DNE are also those underlying smoking genetic liability. We examined whether genes whose expression in D1-type striatal medium spiny neurons due to DNE in the mouse are also associated with human smoking behaviors.

METHODS

Specifically, we assessed whether human orthologs of mouse-identified genes, either individually or as a set, were genetically associated with five human smoking traits using MAGMA and S-LDSC while implementing a novel expression-based gene-SNP annotation methodology.

RESULTS

We found no strong evidence that these genes sets were more strongly associated with smoking behaviors than the rest of the genome, but ten of these individual genes were significantly associated with three of the five human smoking traits examined (p < 2.5e-6). Three of these genes have not been reported previously and were discovered only when implementing the expression-based annotation.

CONCLUSIONS

These results suggest the genes whose expression is impacted by DNE in mice are largely distinct from those contributing to smoking genetic liability in humans. However, examining a single mouse neuronal cell type may be too fine a resolution for comparison, suggesting that experimental manipulation of nicotine consumption, reward, or withdrawal in mice may better capture genes related to the complex genetics of human tobacco use.

IMPLICATIONS

Genes whose expression is impacted by DNE in mouse D1-type striatal medium spiny neurons were not found to be, as a whole, more strongly associated with human smoking behaviors than the rest of the genome, though ten individual mouse-identified genes were associated with human smoking traits. This suggests little overlap between the genetic mechanisms impacted by DNE and those influencing heritable liability to smoking phenotypes in humans. Further research is warranted to characterize how developmental nicotine exposure paradigms in mice can be translated to understand nicotine use in humans and their heritable effects on smoking.

Collapse

One genome, multiple phenotypes: decoding the evolution and mechanisms of environmentally induced developmental plasticity in insects. Biochem Soc Trans 2023;51:675-689. [PMID: 36929376 DOI: 10.1042/bst20210995] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Revised: 02/16/2023] [Accepted: 02/21/2023] [Indexed: 03/18/2023]

Opazo JC, Vandewege MW, Hoffmann FG, Zavala K, Meléndez C, Luchsinger C, Cavieres VA, Vargas-Chacoff L, Morera FJ, Burgos PV, Tapia-Rojas C, Mardones GA. How Many Sirtuin Genes Are Out There? Evolution of Sirtuin Genes in Vertebrates With a Description of a New Family Member. Mol Biol Evol 2023;40:6993039. [PMID: 36656997 PMCID: PMC9897032 DOI: 10.1093/molbev/msad014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Revised: 12/21/2022] [Accepted: 01/10/2023] [Indexed: 01/20/2023] Open

Affiliation(s)

Juan C Opazo Corresponding authors: E-mails: ;
Michael W Vandewege College of Veterinary Medicine, North Carolina State University, Raleigh, NC
Federico G Hoffmann Department of Biochemistry, Molecular Biology, Entomology, and Plant Pathology, Mississippi State University, Starkville, MS,Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University, Starkville, MS
Kattina Zavala Instituto de Ciencias Ambientales y Evolutivas, Facultad de Ciencias, Universidad Austral de Chile, Valdivia, Chile
Catalina Meléndez Centro de Biología Celular y Biomedicina (CEBICEM), Facultad de Medicina y Ciencia, Universidad San Sebastián, Santiago, Chile
Charlotte Luchsinger Department of Physiology, School of Medicine, Universidad Austral de Chile, Valdivia, Chile
Viviana A Cavieres Centro de Biología Celular y Biomedicina (CEBICEM), Facultad de Medicina y Ciencia, Universidad San Sebastián, Santiago, Chile
Luis Vargas-Chacoff Integrative Biology Group, Universidad Austral de Chile, Valdivia, Chile,Instituto de Ciencias Marinas y Limnológicas, Universidad Austral de Chile, Valdivia, Chile,Centro Fondap de Investigación de Altas Latitudes (IDEAL), Universidad Austral de Chile, Valdivia, Chile,Millennium Institute Biodiversity of Antarctic and Subantarctic Ecosystems, BASE, Universidad Austral de Chile, Valdivia, Chile
Francisco J Morera Integrative Biology Group, Universidad Austral de Chile, Valdivia, Chile,Applied Biochemistry Laboratory, Facultad de Ciencias Veterinarias, Instituto de Farmacología y Morfofisiología, Universidad Austral de Chile, Valdivia, Chile
Patricia V Burgos Centro de Biología Celular y Biomedicina (CEBICEM), Facultad de Medicina y Ciencia, Universidad San Sebastián, Santiago, Chile,Centro Ciencia & Vida, Fundación Ciencia & Vida, Santiago, Chile,Centro de Envejecimiento y Regeneración (CARE-UC), Facultad de Ciencias Biológicas, Pontificia Universidad Católica, Santiago, Chile
Cheril Tapia-Rojas Centro de Biología Celular y Biomedicina (CEBICEM), Facultad de Medicina y Ciencia, Universidad San Sebastián, Santiago, Chile,Centro Ciencia & Vida, Fundación Ciencia & Vida, Santiago, Chile
Gonzalo A Mardones Corresponding authors: E-mails: ;

Collapse

McCarthy CGP, Mulhair PO, Siu-Ting K, Creevey CJ, O’Connell MJ. Improving Orthologous Signal and Model Fit in Datasets Addressing the Root of the Animal Phylogeny. Mol Biol Evol 2023;40:6989790. [PMID: 36649189 PMCID: PMC9848061 DOI: 10.1093/molbev/msac276] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 12/19/2022] [Accepted: 12/23/2022] [Indexed: 01/18/2023] Open

Zaharias P, Warnow T. Recent progress on methods for estimating and updating large phylogenies. Philos Trans R Soc Lond B Biol Sci 2022;377:20210244. [PMID: 35989607 PMCID: PMC9393559 DOI: 10.1098/rstb.2021.0244] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2021] [Accepted: 01/07/2022] [Indexed: 12/20/2022] Open

Doyle JJ. Cell types as species: Exploring a metaphor. FRONTIERS IN PLANT SCIENCE 2022;13:868565. [PMID: 36072310 PMCID: PMC9444152 DOI: 10.3389/fpls.2022.868565] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2022] [Accepted: 07/29/2022] [Indexed: 06/05/2023]

Abstract

The concept of "cell type," though fundamental to cell biology, is controversial. Cells have historically been classified into types based on morphology, physiology, or location. More recently, single cell transcriptomic studies have revealed fine-scale differences among cells with similar gross phenotypes. Transcriptomic snapshots of cells at various stages of differentiation, and of cells under different physiological conditions, have shown that in many cases variation is more continuous than discrete, raising questions about the relationship between cell type and cell state. Some researchers have rejected the notion of fixed types altogether. Throughout the history of discussions on cell type, cell biologists have compared the problem of defining cell type with the interminable and often contentious debate over the definition of arguably the most important concept in systematics and evolutionary biology, "species." In the last decades, systematics, like cell biology, has been transformed by the increasing availability of molecular data, and the fine-grained resolution of genetic relationships have generated new ideas about how that variation should be classified. There are numerous parallels between the two fields that make exploration of the "cell types as species" metaphor timely. These parallels begin with philosophy, with discussion of both cell types and species as being either individuals, groups, or something in between (e.g., homeostatic property clusters). In each field there are various different types of lineages that form trees or networks that can (and in some cases do) provide criteria for grouping. Developing and refining models for evolutionary divergence of species and for cell type differentiation are parallel goals of the two fields. The goal of this essay is to highlight such parallels with the hope of inspiring biologists in both fields to look for new solutions to similar problems outside of their own field.

Collapse

Cerón-Romero MA, Fonseca MM, de Oliveira Martins L, Posada D, Katz LA. Phylogenomic Analyses of 2,786 Genes in 158 Lineages Support a Root of the Eukaryotic Tree of Life between Opisthokonts and All Other Lineages. Genome Biol Evol 2022;14:evac119. [PMID: 35880421 PMCID: PMC9366629 DOI: 10.1093/gbe/evac119] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/11/2022] [Indexed: 12/02/2022] Open

Hatami E, Jones KE, Kilian N. New Insights Into the Relationships Within Subtribe Scorzonerinae (Cichorieae, Asteraceae) Using Hybrid Capture Phylogenomics (Hyb-Seq). FRONTIERS IN PLANT SCIENCE 2022;13:851716. [PMID: 35873957 PMCID: PMC9298463 DOI: 10.3389/fpls.2022.851716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Accepted: 05/18/2022] [Indexed: 06/15/2023]

Abstract

Subtribe Scorzonerinae (Cichorieae, Asteraceae) contains 12 main lineages and approximately 300 species. Relationships within the subtribe, either at inter- or intrageneric levels, were largely unresolved in phylogenetic studies to date, due to the lack of phylogenetic signal provided by traditional Sanger sequencing markers. In this study, we employed a phylogenomics approach (Hyb-Seq) that targets 1,061 nuclear-conserved ortholog loci designed for Asteraceae and obtained chloroplast coding regions as a by-product of off-target reads. Our objectives were to evaluate the potential of the Hyb-Seq approach in resolving the phylogenetic relationships across the subtribe at deep and shallow nodes, investigate the relationships of major lineages at inter- and intrageneric levels, and examine the impact of the different datasets and approaches on the robustness of phylogenetic inferences. We analyzed three nuclear datasets: exon only, excluding all potentially paralogous loci; exon only, including loci that were only potentially paralogous in 1-3 samples; exon plus intron regions (supercontigs); and the plastome CDS region. Phylogenetic relationships were reconstructed using both multispecies coalescent and concatenation (Maximum Likelihood and Bayesian analyses) approaches. Overall, our phylogenetic reconstructions recovered the same monophyletic major lineages found in previous studies and were successful in fully resolving the backbone phylogeny of the subtribe, while the internal resolution of the lineages was comparatively poor. The backbone topologies were largely congruent among all inferences, but some incongruent relationships were recovered between nuclear and plastome datasets, which are discussed and assumed to represent cases of cytonuclear discordance. Considering the newly resolved phylogenies, a new infrageneric classification of Scorzonera in its revised circumscription is proposed.

Collapse

Foley S, Vlasova A, Marcet-Houben M, Gabaldón T, Hinman VF. Evolutionary analyses of genes in Echinodermata offer insights towards the origin of metazoan phyla. Genomics 2022;114:110431. [PMID: 35835427 PMCID: PMC9552553 DOI: 10.1016/j.ygeno.2022.110431] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2021] [Revised: 05/10/2022] [Accepted: 07/06/2022] [Indexed: 11/24/2022]

Mining of Cloned Disease Resistance Gene Homologs (CDRHs) in Brassica Species and Arabidopsis thaliana. BIOLOGY 2022;11:biology11060821. [PMID: 35741342 PMCID: PMC9220128 DOI: 10.3390/biology11060821] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/16/2022] [Revised: 05/15/2022] [Accepted: 05/24/2022] [Indexed: 01/23/2023]

Abstract

Simple Summary

Developing cultivars with resistance genes (R genes) is an effective strategy to support high yield and quality in Brassica crops. The availability of clone R gene and genomic sequences in Brassica species and Arabidopsis thaliana provide the opportunity to compare genomic regions and survey R genes across genomic databases. In this paper, we aim to identify genes related to cloned genes through sequence identity, providing a repertoire of species-wide related R genes in Brassica crops. The comprehensive list of candidate R genes can be used as a reference for functional analysis.

Abstract

Various diseases severely affect Brassica crops, leading to significant global yield losses and a reduction in crop quality. In this study, we used the complete protein sequences of 49 cloned resistance genes (R genes) that confer resistance to fungal and bacterial diseases known to impact species in the Brassicaceae family. Homology searches were carried out across Brassica napus, B. rapa, B. oleracea, B. nigra, B. juncea, B. carinata and Arabidopsis thaliana genomes. In total, 660 cloned disease R gene homologs (CDRHs) were identified across the seven species, including 431 resistance gene analogs (RGAs) (248 nucleotide binding site-leucine rich repeats (NLRs), 150 receptor-like protein kinases (RLKs) and 33 receptor-like proteins (RLPs)) and 229 non-RGAs. Based on the position and distribution of specific homologs in each of the species, we observed a total of 87 CDRH clusters composed of 36 NLR, 16 RLK and 3 RLP homogeneous clusters and 32 heterogeneous clusters. The CDRHs detected consistently across the seven species are candidates that can be investigated for broad-spectrum resistance, potentially providing resistance to multiple pathogens. The R genes identified in this study provide a novel resource for the future functional analysis and gene cloning of Brassicaceae R genes towards crop improvement.

Collapse

Nevers Y, Jones TEM, Jyothi D, Yates B, Ferret M, Portell-Silva L, Codo L, Cosentino S, Marcet-Houben M, Vlasova A, Poidevin L, Kress A, Hickman M, Persson E, Piližota I, Guijarro-Clarke C, Iwasaki W, Lecompte O, Sonnhammer E, Roos DS, Gabaldón T, Thybert D, Thomas PD, Hu Y, Emms DM, Bruford E, Capella-Gutierrez S, Martin MJ, Dessimoz C, Altenhoff A. The Quest for Orthologs orthology benchmark service in 2022. Nucleic Acids Res 2022;50:W623-W632. [PMID: 35552456 PMCID: PMC9252809 DOI: 10.1093/nar/gkac330] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Revised: 04/07/2022] [Accepted: 04/30/2022] [Indexed: 11/15/2022] Open

Affiliation(s)

Yannis Nevers To whom correspondence should be addressed. Tel: +41 21 692 5449;
Tamsin E M Jones HUGO Gene Nomenclature Committee (HGNC), European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
Dushyanth Jyothi Protein Function development, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
Bethan Yates HUGO Gene Nomenclature Committee (HGNC), European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
Meritxell Ferret Barcelona Supercomputing Centre (BSC-CNS). Plaça Eusebi Güell, 1-3 08034 Barcelona, Spain
Laura Portell-Silva Barcelona Supercomputing Centre (BSC-CNS). Plaça Eusebi Güell, 1-3 08034 Barcelona, Spain
Laia Codo Barcelona Supercomputing Centre (BSC-CNS). Plaça Eusebi Güell, 1-3 08034 Barcelona, Spain
Salvatore Cosentino Department of Biological Sciences, Graduate School of Science, the University of Tokyo, Tokyo, Japan
Marina Marcet-Houben Barcelona Supercomputing Centre (BSC-CNS). Plaça Eusebi Güell, 1-3 08034 Barcelona, Spain,Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac, 10, 08028 Barcelona, Spain
Anna Vlasova Barcelona Supercomputing Centre (BSC-CNS). Plaça Eusebi Güell, 1-3 08034 Barcelona, Spain,Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac, 10, 08028 Barcelona, Spain
Laetitia Poidevin Department of Computer Science, ICube, UMR 7357, Centre de Recherche en Biomédecine de Strasbourg, University of Strasbourg, CNRS, Strasbourg, France,BiGEst-ICube Platform, ICube, UMR 7357, Centre de Recherche en Biomédecine de Strasbourg, University of Strasbourg, CNRS, Strasbourg, France
Arnaud Kress Department of Computer Science, ICube, UMR 7357, Centre de Recherche en Biomédecine de Strasbourg, University of Strasbourg, CNRS, Strasbourg, France,BiGEst-ICube Platform, ICube, UMR 7357, Centre de Recherche en Biomédecine de Strasbourg, University of Strasbourg, CNRS, Strasbourg, France
Mark Hickman Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
Emma Persson Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Solna, Sweden
Ivana Piližota European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
Cristina Guijarro-Clarke European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
the OpenEBench team the Quest for Orthologs Consortium AltenhoffAdrianBrufordElspeth ACosentinoSalvatoreDessimozChristopheEbersbergerIngoEmmsDavid MGabaldónToniGloverNatashaGuijarro-ClarkeCristinaHickmanMarkHuYanhuiIwasakiWataruJonesTamsin E MJyothiDushyanthKressArnaudLecompteOdileLinardBenjaminMarcet-HoubenMarinaMartinMaria JNeversYannisPerssonEmmaPiližotaIvanaPoidevinLaetitiaRoosDavid SSonhammerErikThomasPaul DThybertDavidVandepoeleKlaasVlasovaAnnaYatesBethanCapella-GutierrezSalvadorCodóLaiaFerretMeritxellGonzalez-UriarteAsierGarrayo-VentasJavierPortell-SilvaLauraRepchevskyDmitrySundeshaVicky
Wataru Iwasaki Department of Biological Sciences, Graduate School of Science, the University of Tokyo, Tokyo, Japan,Department of Integrated Biosciences, Graduate School of Frontier Sciences, the University of Tokyo, Kashiwa, Japan
Odile Lecompte Department of Computer Science, ICube, UMR 7357, Centre de Recherche en Biomédecine de Strasbourg, University of Strasbourg, CNRS, Strasbourg, France
Erik Sonnhammer Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Solna, Sweden
David S Roos Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
Toni Gabaldón Barcelona Supercomputing Centre (BSC-CNS). Plaça Eusebi Güell, 1-3 08034 Barcelona, Spain,Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac, 10, 08028 Barcelona, Spain,Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, Spain,Centro de Investigaciones Biomédicas en Red de Enfermedades Infecciosas, Barcelona, Spain
David Thybert European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
Paul D Thomas Department of Population and Public Health Sciences, University of Southern California, Los Angeles, CA 90032, USA
Yanhui Hu Department of Genetics, Blavatnik Institute, Harvard Medical School, Harvard University, Boston, MA 02115, USA
David M Emms Department of Plant Sciences, University of Oxford, Oxford OX1 3RB, UK
Elspeth Bruford HUGO Gene Nomenclature Committee (HGNC), European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK,Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge, UK
Salvador Capella-Gutierrez Barcelona Supercomputing Centre (BSC-CNS). Plaça Eusebi Güell, 1-3 08034 Barcelona, Spain
Maria J Martin Protein Function development, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
Christophe Dessimoz Department of Computational Biology, University of Lausanne, Lausanne, Switzerland,Swiss Institute for Bioinformatics, University of Lausanne, Lausanne, Switzerland,Department of Computer Science, University College London, London, UK,Centre for Life's Origins and Evolution, Department of Genetics, Evolution and Environment, University College London, London, UK
Adrian Altenhoff Swiss Institute for Bioinformatics, University of Lausanne, Lausanne, Switzerland,Computer Science Department, ETH Zurich, Zurich, Switzerland

Collapse

Crow M, Suresh H, Lee J, Gillis J. Coexpression reveals conserved gene programs that co-vary with cell type across kingdoms. Nucleic Acids Res 2022;50:4302-4314. [PMID: 35451481 PMCID: PMC9071420 DOI: 10.1093/nar/gkac276] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2021] [Revised: 03/30/2022] [Accepted: 04/08/2022] [Indexed: 12/24/2022] Open

Thomas PD, Ebert D, Muruganujan A, Mushayahama T, Albou L, Mi H. PANTHER: Making genome-scale phylogenetics accessible to all. Protein Sci 2022;31:8-22. [PMID: 34717010 PMCID: PMC8740835 DOI: 10.1002/pro.4218] [Citation(s) in RCA: 528] [Impact Index Per Article: 264.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Revised: 10/24/2021] [Accepted: 10/26/2021] [Indexed: 02/03/2023]

Morales-Briones DF, Gehrke B, Huang CH, Liston A, Ma H, Marx HE, Tank DC, Yang Y. Analysis of Paralogs in Target Enrichment Data Pinpoints Multiple Ancient Polyploidy Events in Alchemilla s.l. (Rosaceae). Syst Biol 2021;71:190-207. [PMID: 33978764 PMCID: PMC8677558 DOI: 10.1093/sysbio/syab032] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Revised: 04/28/2021] [Accepted: 05/03/2021] [Indexed: 12/16/2022] Open

Abstract

Target enrichment is becoming increasingly popular for phylogenomic studies. Although baits for enrichment are typically designed to target single-copy genes, paralogs are often recovered with increased sequencing depth, sometimes from a significant proportion of loci, especially in groups experiencing whole-genome duplication (WGD) events. Common approaches for processing paralogs in target enrichment data sets include random selection, manual pruning, and mainly, the removal of entire genes that show any evidence of paralogy. These approaches are prone to errors in orthology inference or removing large numbers of genes. By removing entire genes, valuable information that could be used to detect and place WGD events is discarded. Here, we used an automated approach for orthology inference in a target enrichment data set of 68 species of Alchemilla s.l. (Rosaceae), a widely distributed clade of plants primarily from temperate climate regions. Previous molecular phylogenetic studies and chromosome numbers both suggested ancient WGDs in the group. However, both the phylogenetic location and putative parental lineages of these WGD events remain unknown. By taking paralogs into consideration and inferring orthologs from target enrichment data, we identified four nodes in the backbone of Alchemilla s.l. with an elevated proportion of gene duplication. Furthermore, using a gene-tree reconciliation approach, we established the autopolyploid origin of the entire Alchemilla s.l. and the nested allopolyploid origin of four major clades within the group. Here, we showed the utility of automated tree-based orthology inference methods, previously designed for genomic or transcriptomic data sets, to study complex scenarios of polyploidy and reticulate evolution from target enrichment data sets.[Alchemilla; allopolyploidy; autopolyploidy; gene tree discordance; orthology inference; paralogs; Rosaceae; target enrichment; whole genome duplication.].

Collapse

Cantalapiedra CP, Hernández-Plaza A, Letunic I, Bork P, Huerta-Cepas J. eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale. Mol Biol Evol 2021;38:5825-5829. [PMID: 34597405 PMCID: PMC8662613 DOI: 10.1093/molbev/msab293] [Citation(s) in RCA: 1189] [Impact Index Per Article: 396.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open

Cantalapiedra CP, Hernández-Plaza A, Letunic I, Bork P, Huerta-Cepas J. eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale. Mol Biol Evol 2021;38:5825-5829. [PMID: 34597405 DOI: 10.1101/2021.06.03.446934] [Citation(s) in RCA: 48] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/23/2023] Open

Stein WD, Hoshen MB. During evolution from the earliest tetrapoda, newly-recruited genes are increasingly paralogues of existing genes and distribute non-randomly among the chromosomes. BMC Genomics 2021;22:794. [PMID: 34736418 PMCID: PMC8570013 DOI: 10.1186/s12864-021-08066-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2020] [Accepted: 09/28/2021] [Indexed: 11/10/2022] Open

Prabh N, Tautz D. Frequent lineage-specific substitution rate changes support an episodic model for protein evolution. G3-GENES GENOMES GENETICS 2021;11:6372692. [PMID: 34542594 PMCID: PMC8664490 DOI: 10.1093/g3journal/jkab333] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Accepted: 09/13/2021] [Indexed: 12/04/2022]

Abstract

Since the inception of the molecular clock model for sequence evolution, the investigation of protein divergence has revolved around the question of a more or less constant change of amino acid sequences, with specific overall rates for each family. Although anomalies in clock-like divergence are well known, the assumption of a constant decay rate for a given protein family is usually taken as the null model for protein evolution. However, systematic tests of this null model at a genome-wide scale have lagged behind, despite the databases’ enormous growth. We focus here on divergence rate comparisons between very closely related lineages since this allows clear orthology assignments by synteny and reliable alignments, which are crucial for determining substitution rate changes. We generated a high-confidence dataset of syntenic orthologs from four ape species, including humans. We find that despite the appearance of an overall clock-like substitution pattern, several hundred protein families show lineage-specific acceleration and deceleration in divergence rates, or combinations of both in different lineages. Hence, our analysis uncovers a rather dynamic history of substitution rate changes, even between these closely related lineages, implying that one should expect that a large fraction of proteins will have had a history of episodic rate changes in deeper phylogenies. Furthermore, each of the lineages has a separate set of particularly fast diverging proteins. The genes with the highest percentage of branch-specific substitutions are ADCYAP1 in the human lineage (9.7%), CALU in chimpanzees (7.1%), SLC39A14 in the internal branch leading to humans and chimpanzees (4.1%), RNF128 in gorillas (9%), and S100Z in gibbons (15.2%). The mutational pattern in ADCYAP1 suggests a biased mutation process, possibly through asymmetric gene conversion effects. We conclude that a null model of constant change can be problematic for predicting the evolutionary trajectories of individual proteins.

Collapse

Grau-Bové X, Sebé-Pedrós A. Orthology clusters from gene trees with Possvm. Mol Biol Evol 2021;38:5204-5208. [PMID: 34352080 PMCID: PMC8557443 DOI: 10.1093/molbev/msab234] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open

Mo N, Zhang X, Shi W, Yu G, Chen X, Yang JR. Bidirectional Genetic Control of Phenotypic Heterogeneity and Its Implication for Cancer Drug Resistance. Mol Biol Evol 2021;38:1874-1887. [PMID: 33355660 PMCID: PMC8097262 DOI: 10.1093/molbev/msaa332] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open

Abstract

Negative genetic regulators of phenotypic heterogeneity, or phenotypic capacitors/stabilizers, elevate population average fitness by limiting deviation from the optimal phenotype and increase the efficacy of natural selection by enhancing the phenotypic differences among genotypes. Stabilizers can presumably be switched off to release phenotypic heterogeneity in the face of extreme or fluctuating environments to ensure population survival. This task could, however, also be achieved by positive genetic regulators of phenotypic heterogeneity, or "phenotypic diversifiers," as shown by recently reported evidence that a bacterial divisome factor enhances antibiotic resistance. We hypothesized that such active creation of phenotypic heterogeneity by diversifiers, which is functionally independent of stabilizers, is more common than previously recognized. Using morphological phenotypic data from 4,718 single-gene knockout strains of Saccharomyces cerevisiae, we systematically identified 324 stabilizers and 160 diversifiers and constructed a bipartite network between these genes and the morphological traits they control. Further analyses showed that, compared with stabilizers, diversifiers tended to be weaker and more promiscuous (regulating more traits) regulators targeting traits unrelated to fitness. Moreover, there is a general division of labor between stabilizers and diversifiers. Finally, by incorporating NCI-60 human cancer cell line anticancer drug screening data, we found that human one-to-one orthologs of yeast diversifiers/stabilizers likely regulate the anticancer drug resistance of human cancer cell lines, suggesting that these orthologs are potential targets for auxiliary treatments. Our study therefore highlights stabilizers and diversifiers as the genetic regulators for the bidirectional control of phenotypic heterogeneity as well as their distinct evolutionary roles and functional independence.

Collapse

Independent duplications of the Golgi phosphoprotein 3 oncogene in birds. Sci Rep 2021;11:12483. [PMID: 34127736 PMCID: PMC8203631 DOI: 10.1038/s41598-021-91909-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Accepted: 06/02/2021] [Indexed: 02/05/2023] Open

Yates B, Gray KA, Jones TEM, Bruford EA. Updates to HCOP: the HGNC comparison of orthology predictions tool. Brief Bioinform 2021;22:6265175. [PMID: 33959747 PMCID: PMC8574622 DOI: 10.1093/bib/bbab155] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2020] [Revised: 03/19/2021] [Accepted: 04/02/2021] [Indexed: 11/15/2022] Open

Derelle R, Philippe H, Colbourne JK. Broccoli: Combining Phylogenetic and Network Analyses for Orthology Assignment. Mol Biol Evol 2021;37:3389-3396. [PMID: 32602888 DOI: 10.1093/molbev/msaa159] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open

Sensitive protein alignments at tree-of-life scale using DIAMOND. Nat Methods 2021;18:366-368. [PMID: 33828273 PMCID: PMC8026399 DOI: 10.1038/s41592-021-01101-x] [Citation(s) in RCA: 1051] [Impact Index Per Article: 350.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Accepted: 02/22/2021] [Indexed: 12/05/2022]

Linard B, Ebersberger I, McGlynn SE, Glover N, Mochizuki T, Patricio M, Lecompte O, Nevers Y, Thomas PD, Gabaldón T, Sonnhammer E, Dessimoz C, Uchiyama I. Ten Years of Collaborative Progress in the Quest for Orthologs. Mol Biol Evol 2021;38:3033-3045. [PMID: 33822172 PMCID: PMC8321534 DOI: 10.1093/molbev/msab098] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 02/07/2021] [Accepted: 04/01/2021] [Indexed: 12/19/2022] Open

Affiliation(s)

Benjamin Linard LIRMM, University of Montpellier, CNRS, Montpellier, France.,SPYGEN, Le Bourget-du-Lac, France
Ingo Ebersberger Institute of Cell Biology and Neuroscience, Goethe University Frankfurt, Frankfurt, Germany.,Senckenberg Biodiversity and Climate Research Centre (S-BIKF), Frankfurt, Germany.,LOEWE Center for Translational Biodiversity Genomics (TBG), Frankfurt, Germany
Shawn E McGlynn Earth-Life Science Institute, Tokyo Institute of Technology, Meguro, Tokyo, Japan.,Blue Marble Space Institute of Science, Seattle, WA, USA
Natasha Glover Swiss Institute of Bioinformatics, Lausanne, Switzerland.,Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland.,Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
Tomohiro Mochizuki Earth-Life Science Institute, Tokyo Institute of Technology, Meguro, Tokyo, Japan
Mateus Patricio European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom
Odile Lecompte Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Fédération de Médecine Translationnelle de Strasbourg, Strasbourg, France
Yannis Nevers Swiss Institute of Bioinformatics, Lausanne, Switzerland.,Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland.,Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
Paul D Thomas Division of Bioinformatics, Department of Preventive Medicine, University of Southern California, Los Angeles, CA, USA
Toni Gabaldón Barcelona Supercomputing Centre (BCS-CNS), Jordi Girona, Barcelona, Spain.,Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology (BIST), Barcelona, Spain.,Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
Erik Sonnhammer Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Solna, Sweden
Christophe Dessimoz Swiss Institute of Bioinformatics, Lausanne, Switzerland.,Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland.,Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.,Department of Computer Science, University College London, London, United Kingdom.,Department of Genetics, Evolution and Environment, University College London, London, United Kingdom
Ikuo Uchiyama Department of Theoretical Biology, National Institute for Basic Biology, National Institutes of Natural Sciences, Okazaki, Aichi, Japan

Collapse

Rossier V, Warwick Vesztrocy A, Robinson-Rechavi M, Dessimoz C. OMAmer: tree-driven and alignment-free protein assignment to subfamilies outperforms closest sequence approaches. Bioinformatics 2021;37:2866-2873. [PMID: 33787851 PMCID: PMC8479680 DOI: 10.1093/bioinformatics/btab219] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2020] [Revised: 02/18/2021] [Accepted: 03/30/2021] [Indexed: 02/02/2023] Open

Natsidis P, Kapli P, Schiffer PH, Telford MJ. Systematic errors in orthology inference and their effects on evolutionary analyses. iScience 2021;24:102110. [PMID: 33659875 PMCID: PMC7892920 DOI: 10.1016/j.isci.2021.102110] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2020] [Revised: 01/03/2021] [Accepted: 01/21/2021] [Indexed: 01/13/2023] Open

Altenhoff AM, Train CM, Gilbert KJ, Mediratta I, Mendes de Farias T, Moi D, Nevers Y, Radoykova HS, Rossier V, Warwick Vesztrocy A, Glover NM, Dessimoz C. OMA orthology in 2021: website overhaul, conserved isoforms, ancestral gene order and more. Nucleic Acids Res 2021;49:D373-D379. [PMID: 33174605 PMCID: PMC7779010 DOI: 10.1093/nar/gkaa1007] [Citation(s) in RCA: 101] [Impact Index Per Article: 33.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 10/10/2020] [Accepted: 10/14/2020] [Indexed: 01/11/2023] Open

Affiliation(s)

Adrian M Altenhoff SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland ETH Zurich, Computer Science, Universitätstr. 6, 8092 Zurich, Switzerland
Clément-Marie Train Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
Kimberly J Gilbert SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland
Ishita Mediratta Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland Department of Computer Science and Information Systems, BITS Pilani K.K. Birla Goa Campus, India
Tarcisio Mendes de Farias SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
David Moi SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland
Yannis Nevers SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland
Hale-Seda Radoykova Centre for Life's Origins and Evolution, Department of Genetics, Evolution and Environment, University College London, Gower St, London WC1E 6BT, United Kingdom Department of Computer Science, University College London, Gower St, London WC1E 6BT, United Kingdom
Victor Rossier SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland
Alex Warwick Vesztrocy SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland
Natasha M Glover SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland
Christophe Dessimoz SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland Centre for Life's Origins and Evolution, Department of Genetics, Evolution and Environment, University College London, Gower St, London WC1E 6BT, United Kingdom Department of Computer Science, University College London, Gower St, London WC1E 6BT, United Kingdom

Collapse

Chen Y, Song W, Xie X, Wang Z, Guan P, Peng H, Jiao Y, Ni Z, Sun Q, Guo W. A Collinearity-Incorporating Homology Inference Strategy for Connecting Emerging Assemblies in the Triticeae Tribe as a Pilot Practice in the Plant Pangenomic Era. MOLECULAR PLANT 2020;13:1694-1708. [PMID: 32979565 DOI: 10.1016/j.molp.2020.09.019] [Citation(s) in RCA: 111] [Impact Index Per Article: 27.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/06/2020] [Revised: 09/03/2020] [Accepted: 09/21/2020] [Indexed: 05/18/2023]

Affiliation(s)

Yongming Chen Key Laboratory of Crop Heterosis and Utilization, State Key Laboratory for Agrobiotechnology, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China
Wanjun Song Key Laboratory of Crop Heterosis and Utilization, State Key Laboratory for Agrobiotechnology, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China; Beijing Geek Gene Technology Co Ltd, Beijing 100193, China
Xiaoming Xie Key Laboratory of Crop Heterosis and Utilization, State Key Laboratory for Agrobiotechnology, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China
Zihao Wang Key Laboratory of Crop Heterosis and Utilization, State Key Laboratory for Agrobiotechnology, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China
Panfeng Guan Key Laboratory of Crop Heterosis and Utilization, State Key Laboratory for Agrobiotechnology, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China
Huiru Peng Key Laboratory of Crop Heterosis and Utilization, State Key Laboratory for Agrobiotechnology, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China
Yuannian Jiao State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China; University of Chinese Academy of Sciences, Beijing 100049, China
Zhongfu Ni Key Laboratory of Crop Heterosis and Utilization, State Key Laboratory for Agrobiotechnology, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China
Qixin Sun Key Laboratory of Crop Heterosis and Utilization, State Key Laboratory for Agrobiotechnology, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China
Weilong Guo Key Laboratory of Crop Heterosis and Utilization, State Key Laboratory for Agrobiotechnology, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China.

Collapse

Emms DM, Kelly S. Benchmarking Orthogroup Inference Accuracy: Revisiting Orthobench. Genome Biol Evol 2020;12:2258-2266. [PMID: 33022036 PMCID: PMC7738749 DOI: 10.1093/gbe/evaa211] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/29/2020] [Indexed: 01/24/2023] Open

Chorostecki U, Molina M, Pryszcz LP, Gabaldón T. MetaPhOrs 2.0: integrative, phylogeny-based inference of orthology and paralogy across the tree of life. Nucleic Acids Res 2020;48:W553-W557. [PMID: 32343307 PMCID: PMC7319458 DOI: 10.1093/nar/gkaa282] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Revised: 04/01/2020] [Accepted: 04/25/2020] [Indexed: 12/23/2022] Open

Altenhoff AM, Garrayo-Ventas J, Cosentino S, Emms D, Glover NM, Hernández-Plaza A, Nevers Y, Sundesha V, Szklarczyk D, Fernández JM, Codó L, For Orthologs Consortium TQ, Gelpi JL, Huerta-Cepas J, Iwasaki W, Kelly S, Lecompte O, Muffato M, Martin MJ, Capella-Gutierrez S, Thomas PD, Sonnhammer E, Dessimoz C. The Quest for Orthologs benchmark service and consensus calls in 2020. Nucleic Acids Res 2020;48:W538-W545. [PMID: 32374845 PMCID: PMC7319555 DOI: 10.1093/nar/gkaa308] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2020] [Revised: 04/16/2020] [Accepted: 04/20/2020] [Indexed: 12/18/2022] Open

Affiliation(s)

Adrian M Altenhoff SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland.,ETH Zurich, Department of Computer Science, Zurich, Switzerland
Javier Garrayo-Ventas Life Sciences Department, Barcelona Supercomputing Center (BSC), Barcelona, Spain
Salvatore Cosentino Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo, Japan
David Emms Department of Plant Sciences, University of Oxford, South Parks Road, Oxford, UK
Natasha M Glover SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland.,Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.,Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland
Ana Hernández-Plaza Centro de Biotecnologia y Genomica de Plantas, Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223, Pozuelo de Alarcón, Madrid, Spain
Yannis Nevers SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland.,Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.,Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland.,Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Fédération de Médecine Translationnelle de Strasbourg, Strasbourg, France
Vicky Sundesha Life Sciences Department, Barcelona Supercomputing Center (BSC), Barcelona, Spain
Damian Szklarczyk SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland.,Institute of Molecular Life Sciences, University of Zurich, Winterthurerstrasse 190, Zurich, 8057, Switzerland
José M Fernández Life Sciences Department, Barcelona Supercomputing Center (BSC), Barcelona, Spain
Laia Codó Life Sciences Department, Barcelona Supercomputing Center (BSC), Barcelona, Spain
The Quest For Orthologs Consortium
Josep Ll Gelpi Life Sciences Department, Barcelona Supercomputing Center (BSC), Barcelona, Spain.,Department of Biochemistry and Molecular Biomedicine. University of Barcelona. Barcelona, Spain
Jaime Huerta-Cepas Centro de Biotecnologia y Genomica de Plantas, Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223, Pozuelo de Alarcón, Madrid, Spain
Wataru Iwasaki Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo, Japan
Steven Kelly Department of Plant Sciences, University of Oxford, South Parks Road, Oxford, UK
Odile Lecompte Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Fédération de Médecine Translationnelle de Strasbourg, Strasbourg, France
Matthieu Muffato European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
Maria J Martin European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
Salvador Capella-Gutierrez Life Sciences Department, Barcelona Supercomputing Center (BSC), Barcelona, Spain
Paul D Thomas Division of Bioinformatics, Department of Preventive Medicine, University of Southern California, Los Angeles, USA
Erik Sonnhammer Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Solna, Sweden
Christophe Dessimoz SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland.,Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.,Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland.,Department of Genetics, Evolution & Environment, University College London, London, UK.,Department of Computer Science, University College London, London, UK

Collapse

Zeng X, Sheng J, Zhu F, Wei T, Zhao L, Hu X, Zheng X, Zhou F, Hu Z, Diao Y, Jin S. Genetic, transcriptional, and regulatory landscape of monolignol biosynthesis pathway in Miscanthus × giganteus. BIOTECHNOLOGY FOR BIOFUELS 2020;13:179. [PMID: 33117433 PMCID: PMC7590476 DOI: 10.1186/s13068-020-01819-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/18/2020] [Accepted: 10/16/2020] [Indexed: 06/11/2023]

Abstract

BACKGROUND

Miscanthus × giganteus is widely recognized as a promising lignocellulosic biomass crop due to its advantages of high biomass production, low environmental impacts, and the potential to be cultivated on marginal land. However, the high costs of bioethanol production still limit the current commercialization of lignocellulosic bioethanol. The lignin in the cell wall and its by-products released in the pretreatment step is the main component inhibiting the enzymatic reactions in the saccharification and fermentation processes. Hence, genetic modification of the genes involved in lignin biosynthesis could be a feasible strategy to overcome this barrier by manipulating the lignin content and composition of M. × giganteus. For this purpose, the essential knowledge of these genes and understanding the underlying regulatory mechanisms in M. × giganteus is required.

RESULTS

In this study, MgPAL1, MgPAL5, Mg4CL1, Mg4CL3, MgHCT1, MgHCT2, MgC3'H1, MgCCoAOMT1, MgCCoAOMT3, MgCCR1, MgCCR2, MgF5H, MgCOMT, and MgCAD were identified as the major monolignol biosynthetic genes in M. × giganteus based on genetic and transcriptional evidence. Among them, 12 genes were cloned and sequenced. By combining transcription factor binding site prediction and expression correlation analysis, MYB46, MYB61, MYB63, WRKY24, WRKY35, WRKY12, ERF021, ERF058, and ERF017 were inferred to regulate the expression of these genes directly. On the basis of these results, an integrated model was summarized to depict the monolignol biosynthesis pathway and the underlying regulatory mechanism in M. × giganteus.

CONCLUSIONS

This study provides a list of potential gene targets for genetic improvement of lignocellulosic biomass quality of M. × giganteus, and reveals the genetic, transcriptional, and regulatory landscape of the monolignol biosynthesis pathway in M. × giganteus.

Collapse

Affiliation(s)

Xiaofei Zeng School of Biology and Pharmaceutical Engineering, Wuhan Polytechnic University, Wuhan, 430023 People’s Republic of China School of Medicine, Southern University of Science and Technology, Shenzhen, 518055 People’s Republic of China
Jiajing Sheng School of Life Sciences, Nantong University, Nantong, 226019 People’s Republic of China State Key Laboratory of Hybrid Rice, College of Life Sciences, Hubei Lotus Engineering Center, Wuhan University, Wuhan, 430072 People’s Republic of China
Fenglin Zhu State Key Laboratory of Hybrid Rice, College of Life Sciences, Hubei Lotus Engineering Center, Wuhan University, Wuhan, 430072 People’s Republic of China
Tianzi Wei School of Medicine, Southern University of Science and Technology, Shenzhen, 518055 People’s Republic of China
Lingling Zhao State Key Laboratory of Hybrid Rice, College of Life Sciences, Hubei Lotus Engineering Center, Wuhan University, Wuhan, 430072 People’s Republic of China
Xiaohu Hu State Key Laboratory of Hybrid Rice, College of Life Sciences, Hubei Lotus Engineering Center, Wuhan University, Wuhan, 430072 People’s Republic of China
Xingfei Zheng State Key Laboratory of Hybrid Rice, College of Life Sciences, Hubei Lotus Engineering Center, Wuhan University, Wuhan, 430072 People’s Republic of China
Fasong Zhou State Key Laboratory of Hybrid Rice, College of Life Sciences, Hubei Lotus Engineering Center, Wuhan University, Wuhan, 430072 People’s Republic of China
Zhongli Hu State Key Laboratory of Hybrid Rice, College of Life Sciences, Hubei Lotus Engineering Center, Wuhan University, Wuhan, 430072 People’s Republic of China
Ying Diao School of Biology and Pharmaceutical Engineering, Wuhan Polytechnic University, Wuhan, 430023 People’s Republic of China
Surong Jin School of Chemistry, Chemical Engineering and Life Sciences, Wuhan University of Technology, Wuhan, 430070 People’s Republic of China

Collapse

Manolov A, Konanov D, Fedorov D, Osmolovsky I, Vereshchagin R, Ilina E. Genome Complexity Browser: Visualization and quantification of genome variability. PLoS Comput Biol 2020;16:e1008222. [PMID: 33035207 PMCID: PMC7577506 DOI: 10.1371/journal.pcbi.1008222] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2019] [Revised: 10/21/2020] [Accepted: 08/05/2020] [Indexed: 12/30/2022] Open

Abstract

Comparative genomics studies may be used to acquire new knowledge regarding genome architecture, which defines the rules for combining sets of genes in the genome of living organisms. Hundreds of thousands of prokaryotic genomes have been sequenced and assembled. However, computational tools capable of simultaneously comparing large numbers of genomes are lacking. We developed the Genome Complexity Browser, a tool that allows the visualization of gene contexts, in a graph-based format, and the quantification of variability for different segments of a genome. The graph-based visualization allows the inspection of changes in gene contents and neighborhoods across hundreds of genomes, simultaneously, which may facilitate the identification of conserved and variable segments of operons or the estimation of the overall variability associated with a particular genome locus. We introduced a measure called complexity, to quantify genome variability. Intraspecies and interspecies comparisons revealed that regions with high complexity values tended to be located in areas that are conserved across different strains and species.

The comparison of genomes among different bacteria and archaea species has revealed that many species frequently exchange genes. Occasionally, such horizontal gene transfer events result in the acquisition of pathogenic properties or antibiotic resistance in the recipient organism. Previously, the probabilities of gene insertions were found to vary, with unequal distributions along a chromosome. At some loci, referred to as hotspots, changes occur with much higher frequencies compared with other regions of the chromosome. We developed a computational method and a software tool, called Genome Complexity Browser, that allows the identification of genome variability hotspots and the visualization of changes. We compared the localization of various hotspots and revealed that some demonstrate conserved localizations, even across species, whereas others are transient. Our tool allows users to visually inspect the patterns of gene changes in graph-based format, which presents the visualization in a format that is both compact and informative.

Collapse

Deutekom ES, Snel B, van Dam TJP. Benchmarking orthology methods using phylogenetic patterns defined at the base of Eukaryotes. Brief Bioinform 2020;22:5906198. [PMID: 32935832 PMCID: PMC8138875 DOI: 10.1093/bib/bbaa206] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2020] [Revised: 08/10/2020] [Accepted: 08/11/2020] [Indexed: 12/26/2022] Open

Abstract

Insights into the evolution of ancestral complexes and pathways are generally achieved through careful and time-intensive manual analysis often using phylogenetic profiles of the constituent proteins. This manual analysis limits the possibility of including more protein-complex components, repeating the analyses for updated genome sets or expanding the analyses to larger scales. Automated orthology inference should allow such large-scale analyses, but substantial differences between orthologous groups generated by different approaches are observed.

We evaluate orthology methods for their ability to recapitulate a number of observations that have been made with regard to genome evolution in eukaryotes. Specifically, we investigate phylogenetic profile similarity (co-occurrence of complexes), the last eukaryotic common ancestor’s gene content, pervasiveness of gene loss and the overlap with manually determined orthologous groups. Moreover, we compare the inferred orthologies to each other.

We find that most orthology methods reconstruct a large last eukaryotic common ancestor, with substantial gene loss, and can predict interacting proteins reasonably well when applying phylogenetic co-occurrence. At the same time, derived orthologous groups show imperfect overlap with manually curated orthologous groups. There is no strong indication of which orthology method performs better than another on individual or all of these aspects. Counterintuitively, despite the orthology methods behaving similarly regarding large-scale evaluation, the obtained orthologous groups differ vastly from one another.

Availability and implementation The data and code underlying this article are available in github and/or upon reasonable request to the corresponding author: https://github.com/ESDeutekom/ComparingOrthologies.

Collapse

Alliance of Genome Resources Portal: unified model organism research platform. Nucleic Acids Res 2020;48:D650-D658. [PMID: 31552413 PMCID: PMC6943066 DOI: 10.1093/nar/gkz813] [Citation(s) in RCA: 118] [Impact Index Per Article: 29.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2019] [Revised: 09/03/2019] [Accepted: 09/19/2019] [Indexed: 01/13/2023] Open

Phylogenetic tree building in the genomic age. Nat Rev Genet 2020;21:428-444. [PMID: 32424311 DOI: 10.1038/s41576-020-0233-0] [Citation(s) in RCA: 165] [Impact Index Per Article: 41.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/20/2020] [Indexed: 01/22/2023]

Exposito-Alonso M, Drost HG, Burbano HA, Weigel D. The Earth BioGenome project: opportunities and challenges for plant genomics and conservation. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020;102:222-229. [PMID: 31788877 DOI: 10.1111/tpj.14631] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/12/2019] [Revised: 11/03/2019] [Accepted: 11/18/2019] [Indexed: 05/28/2023]

Zahn-Zabal M, Dessimoz C, Glover NM. Identifying orthologs with OMA: A primer. F1000Res 2020;9:27. [PMID: 32089838 PMCID: PMC7014581 DOI: 10.12688/f1000research.21508.1] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 12/05/2019] [Indexed: 12/22/2022] Open

Agüero-Chapin G, Galpert D, Molina-Ruiz R, Ancede-Gallardo E, Pérez-Machado G, De la Riva GA, Antunes A. Graph Theory-Based Sequence Descriptors as Remote Homology Predictors. Biomolecules 2019;10:E26. [PMID: 31878100 PMCID: PMC7022958 DOI: 10.3390/biom10010026] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2019] [Revised: 12/16/2019] [Accepted: 12/18/2019] [Indexed: 12/23/2022] Open

The Alliance of Genome Resources: Building a Modern Data Ecosystem for Model Organism Databases. Genetics 2019;213:1189-1196. [PMID: 31796553 PMCID: PMC6893393 DOI: 10.1534/genetics.119.302523] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2019] [Accepted: 10/11/2019] [Indexed: 12/17/2022] Open