1
|
Lapalu N, Simon A, Lu A, Plaumann PL, Amselem J, Pigné S, Auger A, Koch C, Dallery JF, O'Connell RJ. Complete genome of the Medicago anthracnose fungus, Colletotrichum destructivum, reveals a mini-chromosome-like region within a core chromosome. Microb Genom 2024; 10:001283. [PMID: 39166978 PMCID: PMC11338638 DOI: 10.1099/mgen.0.001283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Accepted: 07/22/2024] [Indexed: 08/23/2024] Open
Abstract
Colletotrichum destructivum (Cd) is a phytopathogenic fungus causing significant economic losses on forage legume crops (Medicago and Trifolium species) worldwide. To gain insights into the genetic basis of fungal virulence and host specificity, we sequenced the genome of an isolate from Medicago sativa using long-read (PacBio) technology. The resulting genome assembly has a total length of 51.7 Mb and comprises ten core chromosomes and two accessory chromosomes, all of which were sequenced from telomere to telomere. A total of 15, 631 gene models were predicted, including genes encoding potentially pathogenicity-related proteins such as candidate-secreted effectors (484), secondary metabolism key enzymes (110) and carbohydrate-active enzymes (619). Synteny analysis revealed extensive structural rearrangements in the genome of Cd relative to the closely related Brassicaceae pathogen, Colletotrichum higginsianum. In addition, a 1.2 Mb species-specific region was detected within the largest core chromosome of Cd that has all the characteristics of fungal accessory chromosomes (transposon-rich, gene-poor, distinct codon usage), providing evidence for exchange between these two genomic compartments. This region was also unique in having undergone extensive intra-chromosomal segmental duplications. Our findings provide insights into the evolution of accessory regions and possible mechanisms for generating genetic diversity in this asexual fungal pathogen.
Collapse
Affiliation(s)
- Nicolas Lapalu
- Université Paris-Saclay, INRAE, UR BIOGER, 91120 Palaiseau, France
| | - Adeline Simon
- Université Paris-Saclay, INRAE, UR BIOGER, 91120 Palaiseau, France
| | - Antoine Lu
- Université Paris-Saclay, INRAE, UR BIOGER, 91120 Palaiseau, France
| | - Peter-Louis Plaumann
- Division of Biochemistry, Department of Biology, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91058 Erlangen, Germany
| | - Joëlle Amselem
- Université Paris-Saclay, INRAE, URGI, 78000 Versailles, France
| | - Sandrine Pigné
- Université Paris-Saclay, INRAE, UR BIOGER, 91120 Palaiseau, France
| | - Annie Auger
- Université Paris-Saclay, INRAE, UR BIOGER, 91120 Palaiseau, France
| | - Christian Koch
- Division of Biochemistry, Department of Biology, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91058 Erlangen, Germany
| | | | | |
Collapse
|
2
|
Djari A, Madignier G, Di Valentin O, Gillet T, Frasse P, Djouhri A, Hu G, Julliard S, Liu M, Zhang Y, Regad F, Pirrello J, Maza E, Bouzayen M. Haplotype-resolved genome assembly and implementation of VitExpress, an open interactive transcriptomic platform for grapevine. Proc Natl Acad Sci U S A 2024; 121:e2403750121. [PMID: 38805269 PMCID: PMC11161759 DOI: 10.1073/pnas.2403750121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Accepted: 04/23/2024] [Indexed: 05/30/2024] Open
Abstract
Haplotype-resolved genome assemblies were produced for Chasselas and Ugni Blanc, two heterozygous Vitis vinifera cultivars by combining high-fidelity long-read sequencing and high-throughput chromosome conformation capture (Hi-C). The telomere-to-telomere full coverage of the chromosomes allowed us to assemble separately the two haplo-genomes of both cultivars and revealed structural variations between the two haplotypes of a given cultivar. The deletions/insertions, inversions, translocations, and duplications provide insight into the evolutionary history and parental relationship among grape varieties. Integration of de novo single long-read sequencing of full-length transcript isoforms (Iso-Seq) yielded a highly improved genome annotation. Given its higher contiguity, and the robustness of the IsoSeq-based annotation, the Chasselas assembly meets the standard to become the annotated reference genome for V. vinifera. Building on these resources, we developed VitExpress, an open interactive transcriptomic platform, that provides a genome browser and integrated web tools for expression profiling, and a set of statistical tools (StatTools) for the identification of highly correlated genes. Implementation of the correlation finder tool for MybA1, a major regulator of the anthocyanin pathway, identified candidate genes associated with anthocyanin metabolism, whose expression patterns were experimentally validated as discriminating between black and white grapes. These resources and innovative tools for mining genome-related data are anticipated to foster advances in several areas of grapevine research.
Collapse
Affiliation(s)
- Anis Djari
- Laboratoire de Recherche en Sciences Végétales–Génomique et Biotechnologie des Fruits-UMR5546, Université de Toulouse, CNRS, Université Paul Sabatier, Institut Polytechnique de Toulouse, Auzeville Tolosan31326, France
| | - Guillaume Madignier
- Laboratoire de Recherche en Sciences Végétales–Génomique et Biotechnologie des Fruits-UMR5546, Université de Toulouse, CNRS, Université Paul Sabatier, Institut Polytechnique de Toulouse, Auzeville Tolosan31326, France
- Fondation Jean Poupelain, Cognac, Javrezac16100, France
| | - Olivia Di Valentin
- Laboratoire de Recherche en Sciences Végétales–Génomique et Biotechnologie des Fruits-UMR5546, Université de Toulouse, CNRS, Université Paul Sabatier, Institut Polytechnique de Toulouse, Auzeville Tolosan31326, France
| | - Thibault Gillet
- Laboratoire de Recherche en Sciences Végétales–Génomique et Biotechnologie des Fruits-UMR5546, Université de Toulouse, CNRS, Université Paul Sabatier, Institut Polytechnique de Toulouse, Auzeville Tolosan31326, France
| | - Pierre Frasse
- Laboratoire de Recherche en Sciences Végétales–Génomique et Biotechnologie des Fruits-UMR5546, Université de Toulouse, CNRS, Université Paul Sabatier, Institut Polytechnique de Toulouse, Auzeville Tolosan31326, France
| | - Amel Djouhri
- Laboratoire de Recherche en Sciences Végétales–Génomique et Biotechnologie des Fruits-UMR5546, Université de Toulouse, CNRS, Université Paul Sabatier, Institut Polytechnique de Toulouse, Auzeville Tolosan31326, France
| | - Guojian Hu
- Laboratoire de Recherche en Sciences Végétales–Génomique et Biotechnologie des Fruits-UMR5546, Université de Toulouse, CNRS, Université Paul Sabatier, Institut Polytechnique de Toulouse, Auzeville Tolosan31326, France
- Fondation Jean Poupelain, Cognac, Javrezac16100, France
| | - Sebastien Julliard
- Conservatoire du vignoble charentais, Institut de Formation de Richemont, Cherves-Richemont16370, France
| | - Mingchun Liu
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu610065, China
| | - Yang Zhang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu610065, China
| | - Farid Regad
- Laboratoire de Recherche en Sciences Végétales–Génomique et Biotechnologie des Fruits-UMR5546, Université de Toulouse, CNRS, Université Paul Sabatier, Institut Polytechnique de Toulouse, Auzeville Tolosan31326, France
| | - Julien Pirrello
- Laboratoire de Recherche en Sciences Végétales–Génomique et Biotechnologie des Fruits-UMR5546, Université de Toulouse, CNRS, Université Paul Sabatier, Institut Polytechnique de Toulouse, Auzeville Tolosan31326, France
| | - Elie Maza
- Laboratoire de Recherche en Sciences Végétales–Génomique et Biotechnologie des Fruits-UMR5546, Université de Toulouse, CNRS, Université Paul Sabatier, Institut Polytechnique de Toulouse, Auzeville Tolosan31326, France
| | - Mondher Bouzayen
- Laboratoire de Recherche en Sciences Végétales–Génomique et Biotechnologie des Fruits-UMR5546, Université de Toulouse, CNRS, Université Paul Sabatier, Institut Polytechnique de Toulouse, Auzeville Tolosan31326, France
| |
Collapse
|
3
|
Mota APZ, Koutsovoulos GD, Perfus-Barbeoch L, Despot-Slade E, Labadie K, Aury JM, Robbe-Sermesant K, Bailly-Bechet M, Belser C, Péré A, Rancurel C, Kozlowski DK, Hassanaly-Goulamhoussen R, Da Rocha M, Noel B, Meštrović N, Wincker P, Danchin EGJ. Unzipped genome assemblies of polyploid root-knot nematodes reveal unusual and clade-specific telomeric repeats. Nat Commun 2024; 15:773. [PMID: 38316773 PMCID: PMC10844300 DOI: 10.1038/s41467-024-44914-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Accepted: 01/09/2024] [Indexed: 02/07/2024] Open
Abstract
Using long-read sequencing, we assembled and unzipped the polyploid genomes of Meloidogyne incognita, M. javanica and M. arenaria, three of the most devastating plant-parasitic nematodes. We found the canonical nematode telomeric repeat to be missing in these and other Meloidogyne genomes. In addition, we find no evidence for the enzyme telomerase or for orthologs of C. elegans telomere-associated proteins, suggesting alternative lengthening of telomeres. Instead, analyzing our assembled genomes, we identify species-specific composite repeats enriched mostly at one extremity of contigs. These repeats are G-rich, oriented, and transcribed, similarly to canonical telomeric repeats. We confirm them as telomeric using fluorescent in situ hybridization. These repeats are mostly found at one single end of chromosomes in these species. The discovery of unusual and specific complex telomeric repeats opens a plethora of perspectives and highlights the evolutionary diversity of telomeres despite their central roles in senescence, aging, and chromosome integrity.
Collapse
Affiliation(s)
- Ana Paula Zotta Mota
- Institut Sophia Agrobiotech, INRAE, Université Côte d'Azur, CNRS, 400 routes des Chappes, 06903, Sophia-Antipolis, France.
| | - Georgios D Koutsovoulos
- Institut Sophia Agrobiotech, INRAE, Université Côte d'Azur, CNRS, 400 routes des Chappes, 06903, Sophia-Antipolis, France
| | - Laetitia Perfus-Barbeoch
- Institut Sophia Agrobiotech, INRAE, Université Côte d'Azur, CNRS, 400 routes des Chappes, 06903, Sophia-Antipolis, France
| | - Evelin Despot-Slade
- Division of Molecular Biology, Ruđer Bošković Institute, Bijenička cesta 54, 10000, Zagreb, Croatia
| | - Karine Labadie
- Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
| | - Jean-Marc Aury
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
| | - Karine Robbe-Sermesant
- Institut Sophia Agrobiotech, INRAE, Université Côte d'Azur, CNRS, 400 routes des Chappes, 06903, Sophia-Antipolis, France
| | - Marc Bailly-Bechet
- Institut Sophia Agrobiotech, INRAE, Université Côte d'Azur, CNRS, 400 routes des Chappes, 06903, Sophia-Antipolis, France
| | - Caroline Belser
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
| | - Arthur Péré
- Institut Sophia Agrobiotech, INRAE, Université Côte d'Azur, CNRS, 400 routes des Chappes, 06903, Sophia-Antipolis, France
| | - Corinne Rancurel
- Institut Sophia Agrobiotech, INRAE, Université Côte d'Azur, CNRS, 400 routes des Chappes, 06903, Sophia-Antipolis, France
| | - Djampa K Kozlowski
- Institut Sophia Agrobiotech, INRAE, Université Côte d'Azur, CNRS, 400 routes des Chappes, 06903, Sophia-Antipolis, France
- Université Côte d'Azur, Center of Modeling, Simulation, and Interactions, 28 Avenue Valrose, 06000, Nice, France
| | - Rahim Hassanaly-Goulamhoussen
- Institut Sophia Agrobiotech, INRAE, Université Côte d'Azur, CNRS, 400 routes des Chappes, 06903, Sophia-Antipolis, France
| | - Martine Da Rocha
- Institut Sophia Agrobiotech, INRAE, Université Côte d'Azur, CNRS, 400 routes des Chappes, 06903, Sophia-Antipolis, France
| | - Benjamin Noel
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
| | - Nevenka Meštrović
- Division of Molecular Biology, Ruđer Bošković Institute, Bijenička cesta 54, 10000, Zagreb, Croatia
| | - Patrick Wincker
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
| | - Etienne G J Danchin
- Institut Sophia Agrobiotech, INRAE, Université Côte d'Azur, CNRS, 400 routes des Chappes, 06903, Sophia-Antipolis, France.
| |
Collapse
|
4
|
Samantray D, Tanwar AS, Murali TS, Brand A, Satyamoorthy K, Paul B. A Comprehensive Bioinformatics Resource Guide for Genome-Based Antimicrobial Resistance Studies. OMICS : A JOURNAL OF INTEGRATIVE BIOLOGY 2023; 27:445-460. [PMID: 37861712 DOI: 10.1089/omi.2023.0140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/21/2023]
Abstract
The use of high-throughput sequencing technologies and bioinformatic tools has greatly transformed microbial genome research. With the help of sophisticated computational tools, it has become easier to perform whole genome assembly, identify and compare different species based on their genomes, and predict the presence of genes responsible for proteins, antimicrobial resistance, and toxins. These bioinformatics resources are likely to continuously improve in quality, become more user-friendly to analyze the multiple genomic data, efficient in generating information and translating it into meaningful knowledge, and enhance our understanding of the genetic mechanism of AMR. In this manuscript, we provide an essential guide for selecting the popular resources for microbial research, such as genome assembly and annotation, antibiotic resistance gene profiling, identification of virulence factors, and drug interaction studies. In addition, we discuss the best practices in computer-oriented microbial genome research, emerging trends in microbial genomic data analysis, integration of multi-omics data, the appropriate use of machine-learning algorithms, and open-source bioinformatics resources for genome data analytics.
Collapse
Affiliation(s)
- Debyani Samantray
- Department of Bioinformatics, Manipal School of Life Sciences, Manipal Academy of Higher Education, Manipal, India
| | - Ankit Singh Tanwar
- United Nations University-Maastricht Economic and Social Research Institute on Innovation and Technology (UNU-MERIT), Maastricht, The Netherlands
- Faculty of Health, Medicine and Life Sciences (FHML), Maastricht University, Maastricht, The Netherlands
| | - Thokur Sreepathy Murali
- Department of Biotechnology, Manipal School of Life Sciences, Manipal Academy of Higher Education, Manipal, India
| | - Angela Brand
- United Nations University-Maastricht Economic and Social Research Institute on Innovation and Technology (UNU-MERIT), Maastricht, The Netherlands
- Faculty of Health, Medicine and Life Sciences (FHML), Maastricht University, Maastricht, The Netherlands
- Department of Health Information, Prasanna School of Public Health (PSPH), Manipal Academy of Higher Education, Manipal, India
| | - Kapaettu Satyamoorthy
- SDM College of Medical Sciences and Hospital, Shri Dharmasthala Manjunatheshwara (SDM) University, Dharwad, India
| | - Bobby Paul
- Department of Bioinformatics, Manipal School of Life Sciences, Manipal Academy of Higher Education, Manipal, India
| |
Collapse
|
5
|
Demirjian C, Razavi N, Yu G, Mayjonade B, Zhang L, Lonjon F, Chardon F, Carrere S, Gouzy J, Genin S, Macho AP, Roux F, Berthomé R, Vailleau F. An atypical NLR gene confers bacterial wilt susceptibility in Arabidopsis. PLANT COMMUNICATIONS 2023; 4:100607. [PMID: 37098653 PMCID: PMC10504594 DOI: 10.1016/j.xplc.2023.100607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Revised: 01/19/2023] [Accepted: 04/20/2023] [Indexed: 06/12/2023]
Abstract
Quantitative disease resistance (QDR) remains the most prevalent form of plant resistance in crop fields and wild habitats. Genome-wide association studies (GWAS) have proved to be successful in deciphering the quantitative genetic basis of complex traits such as QDR. To unravel the genetics of QDR to the devastating worldwide bacterial pathogen Ralstonia solanacearum, we performed a GWAS by challenging a highly polymorphic local mapping population of Arabidopsis thaliana with four R. solanacearum type III effector (T3E) mutants, identified as key pathogenicity determinants after a first screen on an A. thaliana core collection of 25 accessions. Although most quantitative trait loci (QTLs) were highly specific to the identity of the T3E mutant (ripAC, ripAG, ripAQ, and ripU), we finely mapped a common QTL located on a cluster of nucleotide-binding domain and leucine-rich repeat (NLR) genes that exhibited structural variation. We functionally validated one of these NLRs as a susceptibility factor in response to R. solanacearum, named it Bacterial Wilt Susceptibility 1 (BWS1), and cloned two alleles that conferred contrasting levels of QDR. Further characterization indicated that expression of BWS1 leads to suppression of immunity triggered by different R. solanacearum effectors. In addition, we showed a direct interaction between BWS1 and RipAC T3E, and BWS1 and SUPPRESSOR OF G2 ALLELE OF skp1 (SGT1b), the latter interaction being suppressed by RipAC. Together, our results highlight a putative role for BWS1 as a quantitative susceptibility factor directly targeted by the T3E RipAC, mediating negative regulation of the SGT1-dependent immune response.
Collapse
Affiliation(s)
- Choghag Demirjian
- LIPME, Université de Toulouse, INRAE, CNRS, Castanet-Tolosan, France
| | - Narjes Razavi
- LIPME, Université de Toulouse, INRAE, CNRS, Castanet-Tolosan, France
| | - Gang Yu
- Shanghai Center for Plant Stress Biology, CAS Center for Excellence in Molecular Plant Sciences, Chinese Academy of Sciences, Shanghai, China
| | | | - Lu Zhang
- Shanghai Center for Plant Stress Biology, CAS Center for Excellence in Molecular Plant Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Fabien Lonjon
- LIPME, Université de Toulouse, INRAE, CNRS, Castanet-Tolosan, France
| | - Fabien Chardon
- Université Paris-Saclay, INRAE, AgroParisTech, Institut Jean-Pierre Bourgin (IJPB), 78000 Versailles, France
| | - Sébastien Carrere
- LIPME, Université de Toulouse, INRAE, CNRS, Castanet-Tolosan, France
| | - Jérome Gouzy
- LIPME, Université de Toulouse, INRAE, CNRS, Castanet-Tolosan, France
| | - Stéphane Genin
- LIPME, Université de Toulouse, INRAE, CNRS, Castanet-Tolosan, France
| | - Alberto P Macho
- Shanghai Center for Plant Stress Biology, CAS Center for Excellence in Molecular Plant Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Fabrice Roux
- LIPME, Université de Toulouse, INRAE, CNRS, Castanet-Tolosan, France
| | - Richard Berthomé
- LIPME, Université de Toulouse, INRAE, CNRS, Castanet-Tolosan, France
| | - Fabienne Vailleau
- LIPME, Université de Toulouse, INRAE, CNRS, Castanet-Tolosan, France.
| |
Collapse
|
6
|
Bovio E, Rancurel C, Seassau A, Magliano M, Gislard M, Loisier A, Kuchly C, Ponchet M, Danchin EGJ, Van Ghelder C. Genome sequence and annotation of Periconia digitata a hopeful biocontrol agent of phytopathogenic oomycetes. Sci Data 2023; 10:583. [PMID: 37673954 PMCID: PMC10483032 DOI: 10.1038/s41597-023-02440-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 08/02/2023] [Indexed: 09/08/2023] Open
Abstract
The Periconia fungal genus belongs to the phylum Ascomycota, order Pleosporales, family Periconiaceae. Periconia are found in many habitats, but little is known about their ecology. Several species from this genus produce bioactive molecules. Periconia digitata extracts were shown to be deadly active against the pine wilt nematode. Furthermore, P. digitata was shown to inhibit the plant pathogenic oomycete Phytophthora parasitica. Because P. digitata has great potential as a biocontrol agent and high quality genomic resources are still lacking in the Periconiaceae family, we generated long-read genomic data for P. digitata. Using PacBio Hifi sequencing technology, we obtained a highly-contiguous genome assembled in 13 chromosomes and totaling ca. 39 Mb. In addition, we produced a reference transcriptome, based on 12 different culture conditions, and proteomic data to support the genome annotation. Besides representing a new reference genome within the Periconiaceae, this work will contribute to our better understanding of the Eukaryotic tree of life and opens new possibilities in terms of biotechnological applications.
Collapse
Affiliation(s)
- Elena Bovio
- Institut Sophia Agrobiotech, INRAE 1355, CNRS and Université Côte d'Azur, 400, Route des Chappes, BP 167, 06903, Sophia Antipolis Cedex, France.
| | - Corinne Rancurel
- Institut Sophia Agrobiotech, INRAE 1355, CNRS and Université Côte d'Azur, 400, Route des Chappes, BP 167, 06903, Sophia Antipolis Cedex, France.
| | - Aurélie Seassau
- Institut Sophia Agrobiotech, INRAE 1355, CNRS and Université Côte d'Azur, 400, Route des Chappes, BP 167, 06903, Sophia Antipolis Cedex, France
| | - Marc Magliano
- Institut Sophia Agrobiotech, INRAE 1355, CNRS and Université Côte d'Azur, 400, Route des Chappes, BP 167, 06903, Sophia Antipolis Cedex, France
| | - Marie Gislard
- GeT-PlaGe (genomic platform), Campus INRAE, 24 chemin de borde rouge, Auzeville CS 52627, 31326, CASTANET-TOLOSAN Cedex, France
| | - Anaïs Loisier
- GeT-PlaGe (genomic platform), Campus INRAE, 24 chemin de borde rouge, Auzeville CS 52627, 31326, CASTANET-TOLOSAN Cedex, France
| | - Claire Kuchly
- GeT-PlaGe (genomic platform), Campus INRAE, 24 chemin de borde rouge, Auzeville CS 52627, 31326, CASTANET-TOLOSAN Cedex, France
| | - Michel Ponchet
- Institut Sophia Agrobiotech, INRAE 1355, CNRS and Université Côte d'Azur, 400, Route des Chappes, BP 167, 06903, Sophia Antipolis Cedex, France
| | - Etienne G J Danchin
- Institut Sophia Agrobiotech, INRAE 1355, CNRS and Université Côte d'Azur, 400, Route des Chappes, BP 167, 06903, Sophia Antipolis Cedex, France
| | - Cyril Van Ghelder
- Institut Sophia Agrobiotech, INRAE 1355, CNRS and Université Côte d'Azur, 400, Route des Chappes, BP 167, 06903, Sophia Antipolis Cedex, France
| |
Collapse
|
7
|
Carrère S, Mayjonade B, Lalanne D, Gaillard S, Verdier J, Chen NW. First whole genome assembly and annotation of a European common bean cultivar using PacBio HiFi and Iso-Seq data. Data Brief 2023; 48:109182. [PMID: 37383758 PMCID: PMC10293967 DOI: 10.1016/j.dib.2023.109182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 04/19/2023] [Indexed: 06/30/2023] Open
Abstract
Common bean (Phaseolus vulgaris L.) is the most important grain legume for direct human consumption worldwide. Flageolet bean originates from France and presents typical organoleptic properties, including the remarkable feature of having small pale green colored seeds. Here, we report the whole-genome data, assembly and annotation of the flageolet bean accession 'Flavert'. High molecular weight DNA and RNA were extracted and subjected to long-read sequencing using PacBio Sequel II platform. The genome consisted of 566,238,753 bp assembled in 13 molecules, including 11 chromosomes plus the mitochondrial and chloroplastic genomes. Annotation predicted 29,549 protein coding genes and 6,958 non-coding RNA. This high-quality genome (99.2% BUSCO completeness) represents a valuable data set for further genomic and genetic studies on common bean and more generally on legumes. To our knowledge, this is the first whole-genome sequence of a common bean accession originating from Europe.
Collapse
Affiliation(s)
- Sébastien Carrère
- Université de Toulouse, INRAE, CNRS, Laboratoire des Interactions Plantes Micro-organismes Environnement (LIPME), 31326 Castanet-Tolosan, France
| | - Baptiste Mayjonade
- Université de Toulouse, INRAE, CNRS, Laboratoire des Interactions Plantes Micro-organismes Environnement (LIPME), 31326 Castanet-Tolosan, France
| | - David Lalanne
- Univ Angers, Institut Agro, INRAE, IRHS, SFR QUASAV, F-49000 Angers, France
| | - Sylvain Gaillard
- Univ Angers, Institut Agro, INRAE, IRHS, SFR QUASAV, F-49000 Angers, France
| | - Jérôme Verdier
- Univ Angers, Institut Agro, INRAE, IRHS, SFR QUASAV, F-49000 Angers, France
| | - Nicolas W.G. Chen
- Univ Angers, Institut Agro, INRAE, IRHS, SFR QUASAV, F-49000 Angers, France
| |
Collapse
|
8
|
Piet Q, Droc G, Marande W, Sarah G, Bocs S, Klopp C, Bourge M, Siljak-Yakovlev S, Bouchez O, Lopez-Roques C, Lepers-Andrzejewski S, Bourgois L, Zucca J, Dron M, Besse P, Grisoni M, Jourda C, Charron C. A chromosome-level, haplotype-phased Vanilla planifolia genome highlights the challenge of partial endoreplication for accurate whole-genome assembly. PLANT COMMUNICATIONS 2022; 3:100330. [PMID: 35617961 PMCID: PMC9482989 DOI: 10.1016/j.xplc.2022.100330] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/30/2021] [Revised: 04/10/2022] [Accepted: 04/27/2022] [Indexed: 06/02/2023]
Abstract
Vanilla planifolia, the species cultivated to produce one of the world's most popular flavors, is highly prone to partial genome endoreplication, which leads to highly unbalanced DNA content in cells. We report here the first molecular evidence of partial endoreplication at the chromosome scale by the assembly and annotation of an accurate haplotype-phased genome of V. planifolia. Cytogenetic data demonstrated that the diploid genome size is 4.09 Gb, with 16 chromosome pairs, although aneuploid cells are frequently observed. Using PacBio HiFi and optical mapping, we assembled and phased a diploid genome of 3.4 Gb with a scaffold N50 of 1.2 Mb and 59 128 predicted protein-coding genes. The atypical k-mer frequencies and the uneven sequencing depth observed agreed with our expectation of unbalanced genome representation. Sixty-seven percent of the genes were scattered over only 30% of the genome, putatively linking gene-rich regions and the endoreplication phenomenon. By contrast, low-coverage regions (non-endoreplicated) were rich in repeated elements but also contained 33% of the annotated genes. Furthermore, this assembly showed distinct haplotype-specific sequencing depth variation patterns, suggesting complex molecular regulation of endoreplication along the chromosomes. This high-quality, anchored assembly represents 83% of the estimated V. planifolia genome. It provides a significant step toward the elucidation of this complex genome. To support post-genomics efforts, we developed the Vanilla Genome Hub, a user-friendly integrated web portal that enables centralized access to high-throughput genomic and other omics data and interoperable use of bioinformatics tools.
Collapse
Affiliation(s)
- Quentin Piet
- CIRAD, UMR PVBMT, 97410 Saint-Pierre, La Réunion, France
| | - Gaetan Droc
- CIRAD, UMR AGAP Institut, 34398 Montpellier, France; UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, 34398 Montpellier, France; French Institute of Bioinformatics (IFB) - South Green Bioinformatics Platform, Bioversity, CIRAD, INRAE, IRD, 34398 Montpellier, France.
| | | | - Gautier Sarah
- French Institute of Bioinformatics (IFB) - South Green Bioinformatics Platform, Bioversity, CIRAD, INRAE, IRD, 34398 Montpellier, France; AGAP, Univ. Montpellier, CIRAD, INRAE, Montpellier SupAgro, Montpellier, France
| | - Stéphanie Bocs
- CIRAD, UMR AGAP Institut, 34398 Montpellier, France; UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, 34398 Montpellier, France; French Institute of Bioinformatics (IFB) - South Green Bioinformatics Platform, Bioversity, CIRAD, INRAE, IRD, 34398 Montpellier, France
| | - Christophe Klopp
- Plateforme Bioinformatique, Genotoul, BioinfoMics, UR875 Biométrie et Intelligence Artificielle, INRAE, Castanet-Tolosan, France
| | - Mickael Bourge
- Cytometry Facility, Imagerie-Gif, Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Sonja Siljak-Yakovlev
- Université Paris-Saclay, CNRS, AgroParisTech, Ecologie Systématique Evolution (ESE), 91190 Gif-sur-Yvette, France
| | | | | | | | | | - Joseph Zucca
- Département Biotechnologie, V. Mane Fils, 06620 Le Bar Sur Loup, France
| | - Michel Dron
- Université Paris-Saclay, CNRS, INRAE, Univ. Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91405 Orsay, France
| | - Pascale Besse
- Université de la Réunion, UMR PVBMT, Saint-Pierre, La Réunion, France
| | | | - Cyril Jourda
- CIRAD, UMR PVBMT, 97410 Saint-Pierre, La Réunion, France.
| | - Carine Charron
- CIRAD, UMR PVBMT, 97410 Saint-Pierre, La Réunion, France
| |
Collapse
|
9
|
Wang H, Jia C, Li H, Yin R, Chen J, Li Y, Yue M. Paving the way for precise diagnostics of antimicrobial resistant bacteria. Front Mol Biosci 2022; 9:976705. [PMID: 36032670 PMCID: PMC9413203 DOI: 10.3389/fmolb.2022.976705] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Accepted: 07/19/2022] [Indexed: 12/26/2022] Open
Abstract
The antimicrobial resistance (AMR) crisis from bacterial pathogens is frequently emerging and rapidly disseminated during the sustained antimicrobial exposure in human-dominated communities, posing a compelling threat as one of the biggest challenges in humans. The frequent incidences of some common but untreatable infections unfold the public health catastrophe that antimicrobial-resistant pathogens have outpaced the available countermeasures, now explicitly amplified during the COVID-19 pandemic. Nowadays, biotechnology and machine learning advancements help create more fundamental knowledge of distinct spatiotemporal dynamics in AMR bacterial adaptation and evolutionary processes. Integrated with reliable diagnostic tools and powerful analytic approaches, a collaborative and systematic surveillance platform with high accuracy and predictability should be established and implemented, which is not just for an effective controlling strategy on AMR but also for protecting the longevity of valuable antimicrobials currently and in the future.
Collapse
Affiliation(s)
- Hao Wang
- Institute of Preventive Veterinary Sciences & Department of Veterinary Medicine, Zhejiang University College of Animal Sciences, Hangzhou, China
| | - Chenhao Jia
- Institute of Preventive Veterinary Sciences & Department of Veterinary Medicine, Zhejiang University College of Animal Sciences, Hangzhou, China
- Hainan Institute of Zhejiang University, Sanya, China
| | - Hongzhao Li
- Institute of Preventive Veterinary Sciences & Department of Veterinary Medicine, Zhejiang University College of Animal Sciences, Hangzhou, China
- Hainan Institute of Zhejiang University, Sanya, China
| | - Rui Yin
- Institute of Preventive Veterinary Sciences & Department of Veterinary Medicine, Zhejiang University College of Animal Sciences, Hangzhou, China
| | - Jiang Chen
- Department of Microbiology, Zhejiang Provincial Center for Disease Control and Prevention, Hangzhou, China
- *Correspondence: Jiang Chen, ; Yan Li, ; Min Yue,
| | - Yan Li
- Institute of Preventive Veterinary Sciences & Department of Veterinary Medicine, Zhejiang University College of Animal Sciences, Hangzhou, China
- Hainan Institute of Zhejiang University, Sanya, China
- Zhejiang Provincial Key Laboratory of Preventive Veterinary Medicine, Hangzhou, China
- *Correspondence: Jiang Chen, ; Yan Li, ; Min Yue,
| | - Min Yue
- Institute of Preventive Veterinary Sciences & Department of Veterinary Medicine, Zhejiang University College of Animal Sciences, Hangzhou, China
- Hainan Institute of Zhejiang University, Sanya, China
- Zhejiang Provincial Key Laboratory of Preventive Veterinary Medicine, Hangzhou, China
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, National Medical Center for Infectious Diseases, The First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou, China
- *Correspondence: Jiang Chen, ; Yan Li, ; Min Yue,
| |
Collapse
|
10
|
Liao B, Shen X, Xiang L, Guo S, Chen S, Meng Y, Liang Y, Ding D, Bai J, Zhang D, Czechowski T, Li Y, Yao H, Ma T, Howard C, Sun C, Liu H, Liu J, Pei J, Gao J, Wang J, Qiu X, Huang Z, Li H, Yuan L, Wei J, Graham I, Xu J, Zhang B, Chen S. Allele-aware chromosome-level genome assembly of Artemisia annua reveals the correlation between ADS expansion and artemisinin yield. MOLECULAR PLANT 2022; 15:1310-1328. [PMID: 35655434 DOI: 10.1016/j.molp.2022.05.013] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/05/2021] [Revised: 03/25/2022] [Accepted: 05/27/2022] [Indexed: 06/15/2023]
Abstract
Artemisia annua is the major natural source of artemisinin, an anti-malarial medicine commonly used worldwide. Here, we present chromosome-level haploid maps for two A. annua strains with different artemisinin contents to explore the relationships between genomic organization and artemisinin production. High-fidelity sequencing, optical mapping, and chromatin conformation capture sequencing were used to assemble the heterogeneous and repetitive genome and resolve the haplotypes of A. annua. Approximately 50,000 genes were annotated for each haplotype genome, and a triplication event that occurred approximately 58.12 million years ago was examined for the first time in this species. A total of 3,903,467-5,193,414 variants (SNPs, indels, and structural variants) were identified in the 1.5-Gb genome during pairwise comparison between haplotypes, consistent with the high heterozygosity of this species. Genomic analyses revealed a correlation between artemisinin concents and the copy number of amorpha-4,11-diene synthase genes. This correlation was further confirmed by resequencing of 36 A. annua samples with varied artemisinin contents. Circular consensus sequencing of transcripts facilitated the detection of paralog expression. Collectively, our study provides chromosome-level allele-aware genome assemblies for two A. annua strains and new insights into the biosynthesis of artemisinin and its regulation, which will contribute to conquering malaria worldwide.
Collapse
Affiliation(s)
- Baosheng Liao
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China; Key Laboratory of Quality Evaluation of Chinese Medicine of the Guangdong Provincial Medical Products Administration, the Second Clinical College, Guangzhou University of Chinese Medicine, Guangzhou 510006, China
| | - Xiaofeng Shen
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China; Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences, Beijing 100193, China
| | - Li Xiang
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China; Artemisinin Research Center, China Academy of Chinese Medical Sciences, Beijing 100700, China
| | - Shuai Guo
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China; Pharmacy College, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China
| | - Shiyu Chen
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China; Pharmacy College, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China
| | - Ying Meng
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China
| | - Yu Liang
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China
| | - Dandan Ding
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China
| | - Junqi Bai
- Key Laboratory of Quality Evaluation of Chinese Medicine of the Guangdong Provincial Medical Products Administration, the Second Clinical College, Guangzhou University of Chinese Medicine, Guangzhou 510006, China
| | - Dong Zhang
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China; Centre for Novel Agricultural Products, Department of Biology, University of York, York, UK
| | - Tomasz Czechowski
- Centre for Novel Agricultural Products, Department of Biology, University of York, York, UK
| | - Yi Li
- Centre for Novel Agricultural Products, Department of Biology, University of York, York, UK
| | - Hui Yao
- Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences, Beijing 100193, China
| | - Tingyu Ma
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China
| | - Caroline Howard
- Wellcome Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1RQ, UK
| | - Chao Sun
- Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences, Beijing 100193, China
| | - Haitao Liu
- Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences, Beijing 100193, China
| | - Jiushi Liu
- Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences, Beijing 100193, China
| | - Jin Pei
- Pharmacy College, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China
| | - Jihai Gao
- Pharmacy College, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China
| | - Jigang Wang
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China; Artemisinin Research Center, China Academy of Chinese Medical Sciences, Beijing 100700, China
| | - Xiaohui Qiu
- Key Laboratory of Quality Evaluation of Chinese Medicine of the Guangdong Provincial Medical Products Administration, the Second Clinical College, Guangzhou University of Chinese Medicine, Guangzhou 510006, China
| | - Zhihai Huang
- Key Laboratory of Quality Evaluation of Chinese Medicine of the Guangdong Provincial Medical Products Administration, the Second Clinical College, Guangzhou University of Chinese Medicine, Guangzhou 510006, China
| | - Hongyi Li
- Key Laboratory of Quality Evaluation of Chinese Medicine of the Guangdong Provincial Medical Products Administration, the Second Clinical College, Guangzhou University of Chinese Medicine, Guangzhou 510006, China
| | - Ling Yuan
- Kentucky Tobacco Research and Development Center, University of Kentucky, Lexington, KY 40546, USA; Key Laboratory of South China Agricultural Plant Molecular Analysis and Genetic Improvement & Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510520, China
| | - Jianhe Wei
- Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences, Beijing 100193, China
| | - Ian Graham
- Centre for Novel Agricultural Products, Department of Biology, University of York, York, UK
| | - Jiang Xu
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China.
| | - Boli Zhang
- School of Chinese Materia Medica, Tianjin University of Traditional Chinese Medicine, Tianjin 300193, China.
| | - Shilin Chen
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China; Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China.
| |
Collapse
|
11
|
Simon M, Durand S, Ricou A, Vrielynck N, Mayjonade B, Gouzy J, Boyer R, Roux F, Camilleri C, Budar F. APOK3, a pollen killer antidote in Arabidopsis thaliana. Genetics 2022; 221:6603116. [PMID: 35666201 DOI: 10.1093/genetics/iyac089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Accepted: 05/27/2022] [Indexed: 11/14/2022] Open
Abstract
The principles of heredity state that the two alleles carried by a heterozygote are equally transmitted to the progeny. However, genomic regions that escape this rule have been reported in many organisms. It is notably the case of genetic loci referred to as gamete killers, where one allele enhances its transmission by causing the death of the gametes that do not carry it. Gamete killers are of great interest, particularly to understand mechanisms of evolution and speciation. Although being common in plants, only a few, all in rice, have so far been deciphered to the causal genes. Here, we studied a pollen killer found in hybrids between two accessions of Arabidopsis thaliana. Exploring natural variation, we observed this pollen killer in many crosses within the species. Genetic analyses revealed that three genetically linked elements are necessary for pollen killer activity. Using mutants, we showed that this pollen killer works according to a poison-antidote model, where the poison kills pollen grains not producing the antidote. We identified the gene encoding the antidote, a chimeric protein addressed to mitochondria. De novo genomic sequencing in twelve natural variants with different behaviors regarding the pollen killer revealed a hyper variable locus, with important structural variations particularly in killer genotypes, where the antidote gene recently underwent duplications. Our results strongly suggest that the gene has newly evolved within A. thaliana. Finally, we identified in the protein sequence polymorphisms related to its antidote activity.
Collapse
Affiliation(s)
- Matthieu Simon
- Université Paris-Saclay,INRAE, AgroParisTech, Institut Jean-Pierre Bourgin (IJPB), 78000, Versailles, France
| | - Stéphanie Durand
- Université Paris-Saclay,INRAE, AgroParisTech, Institut Jean-Pierre Bourgin (IJPB), 78000, Versailles, France
| | - Anthony Ricou
- Université Paris-Saclay,INRAE, AgroParisTech, Institut Jean-Pierre Bourgin (IJPB), 78000, Versailles, France
| | - Nathalie Vrielynck
- Université Paris-Saclay,INRAE, AgroParisTech, Institut Jean-Pierre Bourgin (IJPB), 78000, Versailles, France
| | | | - Jérôme Gouzy
- LIPME,Université de Toulouse,INRAE,CNRS, 31326 Castanet-Tolosan, France
| | - Roxane Boyer
- INRAE, GeT-PlaGe, Genotoul, 31326 Castanet-Tolosan, France(doi : 10.15454/1.5572370921303193E12)
| | - Fabrice Roux
- LIPME,Université de Toulouse,INRAE,CNRS, 31326 Castanet-Tolosan, France
| | - Christine Camilleri
- Université Paris-Saclay,INRAE, AgroParisTech, Institut Jean-Pierre Bourgin (IJPB), 78000, Versailles, France
| | - Françoise Budar
- Université Paris-Saclay,INRAE, AgroParisTech, Institut Jean-Pierre Bourgin (IJPB), 78000, Versailles, France
| |
Collapse
|
12
|
Bornemann TLV, Adam PS, Probst AJ. Reconstruction of Archaeal Genomes from Short-Read Metagenomes. Methods Mol Biol 2022; 2522:487-527. [PMID: 36125772 DOI: 10.1007/978-1-0716-2445-6_33] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
As the majority of biological diversity remains unexplored and uncultured, investigating it requires culture-independent approaches. Archaea in particular suffer from a multitude of issues that make their culturing problematic, from them being frequently members of the rare biosphere, to low growth rates, to them thriving under very specific and often extreme environmental and community conditions that are difficult to replicate. OMICs techniques are state of the art approaches that allow direct high-throughput investigations of environmental samples at all levels from nucleic acids to proteins, lipids, and secondary metabolites. Metagenomics, as the foundation for other OMICs techniques, facilitates the identification and functional characterization of the microbial community members and can be combined with other methods to provide insights into the microbial activities, both on the RNA and protein levels. In this chapter, we provide a step-by-step workflow for the recovery of archaeal genomes from metagenomes, starting from raw short-read sequences. This workflow can be applied to recover bacterial genomes as well.
Collapse
Affiliation(s)
- Till L V Bornemann
- Environmental Microbiology and Biotechnology, Faculty of Chemistry, University of Duisburg-Essen, Essen, Germany.
| | - Panagiotis S Adam
- Environmental Microbiology and Biotechnology, Faculty of Chemistry, University of Duisburg-Essen, Essen, Germany
| | - Alexander J Probst
- Environmental Microbiology and Biotechnology, Faculty of Chemistry, University of Duisburg-Essen, Essen, Germany.
- Centre of Water and Environmental Research (ZWU), University of Duisburg-Essen, Essen, Germany.
| |
Collapse
|
13
|
Li J, Singh U, Bhandary P, Campbell J, Arendsee Z, Seetharam AS, Wurtele ES. Foster thy young: enhanced prediction of orphan genes in assembled genomes. Nucleic Acids Res 2021; 50:e37. [PMID: 34928390 PMCID: PMC9023268 DOI: 10.1093/nar/gkab1238] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2021] [Revised: 10/22/2021] [Accepted: 12/02/2021] [Indexed: 02/06/2023] Open
Abstract
Proteins encoded by newly-emerged genes ('orphan genes') share no sequence similarity with proteins in any other species. They provide organisms with a reservoir of genetic elements to quickly respond to changing selection pressures. Here, we systematically assess the ability of five gene prediction pipelines to accurately predict genes in genomes according to phylostratal origin. BRAKER and MAKER are existing, popular ab initio tools that infer gene structures by machine learning. Direct Inference is an evidence-based pipeline we developed to predict gene structures from alignments of RNA-Seq data. The BIND pipeline integrates ab initio predictions of BRAKER and Direct inference; MIND combines Direct Inference and MAKER predictions. We use highly-curated Arabidopsis and yeast annotations as gold-standard benchmarks, and cross-validate in rice. Each pipeline under-predicts orphan genes (as few as 11 percent, under one prediction scenario). Increasing RNA-Seq diversity greatly improves prediction efficacy. The combined methods (BIND and MIND) yield best predictions overall, BIND identifying 68% of annotated orphan genes, 99% of ancient genes, and give the highest sensitivity score regardless dataset in Arabidopsis. We provide a light weight, flexible, reproducible, and well-documented solution to improve gene prediction.
Collapse
Affiliation(s)
- Jing Li
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA 50014, USA.,Center for Metabolic Biology, Iowa State University, Ames, IA 50014, USA.,Genetics and Genomics Graduate Program, Iowa State University, Ames, IA 50014, USA
| | - Urminder Singh
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA 50014, USA.,Center for Metabolic Biology, Iowa State University, Ames, IA 50014, USA.,Bioinformatics and Computational Biology Program, Iowa State University, Ames, IA 50014, USA
| | - Priyanka Bhandary
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA 50014, USA.,Center for Metabolic Biology, Iowa State University, Ames, IA 50014, USA.,Bioinformatics and Computational Biology Program, Iowa State University, Ames, IA 50014, USA
| | - Jacqueline Campbell
- Corn Insects and Crop Genetics Research Unit, US Department of Agriculture Agriculture Research Service, Ames, IA 50014, USA
| | - Zebulun Arendsee
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA 50014, USA.,Center for Metabolic Biology, Iowa State University, Ames, IA 50014, USA.,Bioinformatics and Computational Biology Program, Iowa State University, Ames, IA 50014, USA
| | - Arun S Seetharam
- Genome Informatics Facility, Iowa State University, Ames, IA 50014, USA
| | - Eve Syrkin Wurtele
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA 50014, USA.,Center for Metabolic Biology, Iowa State University, Ames, IA 50014, USA.,Genetics and Genomics Graduate Program, Iowa State University, Ames, IA 50014, USA.,Bioinformatics and Computational Biology Program, Iowa State University, Ames, IA 50014, USA
| |
Collapse
|
14
|
Nourbakhsh F, Lotfalizadeh M, Badpeyma M, Shakeri A, Soheili V. From plants to antimicrobials: Natural products against bacterial membranes. Phytother Res 2021; 36:33-52. [PMID: 34532918 DOI: 10.1002/ptr.7275] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2020] [Revised: 07/16/2021] [Accepted: 08/25/2021] [Indexed: 12/24/2022]
Abstract
Bacterial membrane barrier provides a cytoplasmic environment for organelles of bacteria. The membrane is composed of lipid compounds containing phosphatide protein and a minimal amount of sugars, and is responsible for intercellular transfers of chemicals. Several antimicrobials have been found that affect bacterial cytoplasmic membranes. These compounds generally disrupt the organization of the membrane or perforate it. By destroying the membrane, the drugs can permeate and replace the effective macromolecules necessary for cell life. Furthermore, they can disrupt electrical gradients of the cells through impairment of the membrane integrity. In recent years, considering the spread of microbial resistance and the side effects of antibiotics, natural antimicrobial compounds have been studied by researchers extensively. These molecules are the best alternative for controlling bacterial infections and reducing drug resistance due to the lack of severe side effects, low cost of production, and biocompatibility. Better understanding of the natural compounds' mechanisms against bacteria provides improved strategies for antimicrobial therapies. In this review, natural products with antibacterial activities focusing on membrane damaging mechanisms were described. However, further high-quality research studies are needed to confirm the clinical efficacy of these natural products.
Collapse
Affiliation(s)
- Fahimeh Nourbakhsh
- Medical Toxicology Research Centre, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Marzieh Lotfalizadeh
- Department of Obstetrics and Gynecology, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Mohaddeseh Badpeyma
- Student Research Committee, Department of Clinical Nutrition, Nutrition Research Center, School of Nutrition and Food Sciences, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Abolfazl Shakeri
- Department of Pharmacognosy, School of Pharmacy, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Vahid Soheili
- Department of Pharmaceutical Control, School of Pharmacy, Mashhad University of Medical Sciences, Mashhad, Iran
| |
Collapse
|
15
|
Wang L, Liao B, Gong L, Xiao S, Huang Z. Haploid Genome Analysis Reveals a Tandem Cluster of Four HSP20 Genes Involved in the High-Temperature Adaptation of Coriolopsis trogii. Microbiol Spectr 2021; 9:e0028721. [PMID: 34406871 PMCID: PMC8552761 DOI: 10.1128/spectrum.00287-21] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Accepted: 07/16/2021] [Indexed: 01/16/2023] Open
Abstract
Coriolopsis trogii is a typical thermotolerant basidiomycete fungus, but its thermotolerance mechanisms are currently unknown. In this study, two monokaryons of C. trogii strain Ct001 were assembled: Ct001_29 had a genome assembly size of 38.85 Mb and encoded 13,113 genes, while Ct001_31 was 40.19 Mb in length and encoded 13,309 genes. Comparative intra- and interstrain genomic analysis revealed the rich genetic diversity of C. trogii, which included more than 315,194 single-nucleotide polymorphisms (SNPs), 30,387 insertion/deletions (indels), and 1,460 structural variations. Gene family analysis showed that the expanded families of C. trogii were functionally enriched in lignocellulose degradation activities. Furthermore, a total of 14 allelic pairs of heat shock protein 20 (HSP20) genes were identified in the C. trogii genome. The expression profile obtained from RNA sequencing (RNA-Seq) showed that four tandem-duplicated allelic pairs, HSP20.5 to HSP20.8, had more than 5-fold higher expression at 35°C than at 25°C. In particular, HSP20.5 and HSP20.8 were the most highly expressed HSP20 genes. Allelic expression bias was found for HSP20.5 and HSP20.8; the expression of Ct29HSP20.8 was at least 1.34-fold higher than that of Ct31HSP20.8, and that of Ct31HSP20.5 was at least 1.5-fold higher than that of Ct29HSP20.5. The unique structural and expression profiles of the HSP20 genes revealed by these haplotype-resolved genomes provide insight into the molecular mechanisms of high-temperature adaptation in C. trogii. IMPORTANCE Heat stress is one of the most frequently encountered environmental stresses for most mushroom-forming fungi. Currently available fungal genomes are mostly haploid because high heterozygosity hinders diploid genome assembly. Here, two haplotype genomes of C. trogii, a thermotolerant basidiomycete, were assembled separately. A conserved tandem cluster of four HSP20 genes showing allele-specific expression was found to be closely related to high-temperature adaptation in C. trogii. The obtained haploid genomes and their comparison offer a more thorough understanding of the genetic background of C. trogii. In addition, the responses of HSP20 genes at 35°C, which may contribute to the growth and survival of C. trogii at high temperatures, could inform the selection and breeding of elite strains in the future.
Collapse
Affiliation(s)
- Lining Wang
- Institute of Bioengineering, Guangdong Academy of Sciences, Guangzhou, People’s Republic of China
| | - Baosheng Liao
- Key Laboratory of Quality Evaluation of Chinese Medicine of the Guangdong Provincial Medical Products Administration, the Second Clinical College, Guangzhou University of Chinese Medicine, Guangzhou, People’s Republic of China
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, People’s Republic of China
| | - Lu Gong
- Key Laboratory of Quality Evaluation of Chinese Medicine of the Guangdong Provincial Medical Products Administration, the Second Clinical College, Guangzhou University of Chinese Medicine, Guangzhou, People’s Republic of China
| | - Shuiming Xiao
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, People’s Republic of China
| | - Zhihai Huang
- Key Laboratory of Quality Evaluation of Chinese Medicine of the Guangdong Provincial Medical Products Administration, the Second Clinical College, Guangzhou University of Chinese Medicine, Guangzhou, People’s Republic of China
| |
Collapse
|
16
|
Brůna T, Hoff KJ, Lomsadze A, Stanke M, Borodovsky M. BRAKER2: automatic eukaryotic genome annotation with GeneMark-EP+ and AUGUSTUS supported by a protein database. NAR Genom Bioinform 2021; 3:lqaa108. [PMID: 33575650 PMCID: PMC7787252 DOI: 10.1093/nargab/lqaa108] [Citation(s) in RCA: 613] [Impact Index Per Article: 204.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2020] [Revised: 11/26/2020] [Accepted: 12/20/2020] [Indexed: 12/13/2022] Open
Abstract
The task of eukaryotic genome annotation remains challenging. Only a few genomes could serve as standards of annotation achieved through a tremendous investment of human curation efforts. Still, the correctness of all alternative isoforms, even in the best-annotated genomes, could be a good subject for further investigation. The new BRAKER2 pipeline generates and integrates external protein support into the iterative process of training and gene prediction by GeneMark-EP+ and AUGUSTUS. BRAKER2 continues the line started by BRAKER1 where self-training GeneMark-ET and AUGUSTUS made gene predictions supported by transcriptomic data. Among the challenges addressed by the new pipeline was a generation of reliable hints to protein-coding exon boundaries from likely homologous but evolutionarily distant proteins. In comparison with other pipelines for eukaryotic genome annotation, BRAKER2 is fully automatic. It is favorably compared under equal conditions with other pipelines, e.g. MAKER2, in terms of accuracy and performance. Development of BRAKER2 should facilitate solving the task of harmonization of annotation of protein-coding genes in genomes of different eukaryotic species. However, we fully understand that several more innovations are needed in transcriptomic and proteomic technologies as well as in algorithmic development to reach the goal of highly accurate annotation of eukaryotic genomes.
Collapse
Affiliation(s)
- Tomáš Brůna
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | - Katharina J Hoff
- Institute of Mathematics and Computer Science, University of Greifswald, 17489 Greifswald, Germany
- Center for Functional Genomics of Microbes, University of Greifswald, 17489 Greifswald, Germany
| | - Alexandre Lomsadze
- Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | - Mario Stanke
- Institute of Mathematics and Computer Science, University of Greifswald, 17489 Greifswald, Germany
- Center for Functional Genomics of Microbes, University of Greifswald, 17489 Greifswald, Germany
| | - Mark Borodovsky
- Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA
- School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA
| |
Collapse
|
17
|
Genome assembly and annotation of Meloidogyne enterolobii, an emerging parthenogenetic root-knot nematode. Sci Data 2020; 7:324. [PMID: 33020495 PMCID: PMC7536185 DOI: 10.1038/s41597-020-00666-0] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2020] [Accepted: 08/27/2020] [Indexed: 11/21/2022] Open
Abstract
Root-knot nematodes (genus Meloidogyne) are plant parasites causing huge economic loss in the agricultural industry and affecting severely numerous developing countries. Control methods against these plant pests are sparse, the preferred one being the deployment of plant cultivars bearing resistance genes against Meloidogyne species. However, M. enterolobii is not controlled by the resistance genes deployed in the crop plants cultivated in Europe. The recent identification of this species in Europe is thus a major concern. Here, we sequenced the genome of M. enterolobii using short and long-read technologies. The genome assembly spans 240 Mbp with contig N50 size of 143 kbp, enabling high-quality annotations of 59,773 coding genes, 4,068 non-coding genes, and 10,944 transposable elements (spanning 8.7% of the genome). We validated the genome size by flow cytometry and the structure, quality and completeness by bioinformatics metrics. This ensemble of resources will fuel future projects aiming at pinpointing the genome singularities, the origin, diversity, and adaptive potential of this emerging plant pest. Measurement(s) | genome • sequence_assembly • sequence feature annotation | Technology Type(s) | DNA sequencing assay • sequence assembly process • sequence annotation | Sample Characteristic - Organism | Meloidogyne enterolobii |
Machine-accessible metadata file describing the reported data: 10.6084/m9.figshare.12410363
Collapse
|
18
|
Badouin H, Velt A, Gindraud F, Flutre T, Dumas V, Vautrin S, Marande W, Corbi J, Sallet E, Ganofsky J, Santoni S, Guyot D, Ricciardelli E, Jepsen K, Käfer J, Berges H, Duchêne E, Picard F, Hugueney P, Tavares R, Bacilieri R, Rustenholz C, Marais GAB. The wild grape genome sequence provides insights into the transition from dioecy to hermaphroditism during grape domestication. Genome Biol 2020; 21:223. [PMID: 32892750 PMCID: PMC7487632 DOI: 10.1186/s13059-020-02131-y] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Accepted: 08/05/2020] [Indexed: 12/22/2022] Open
Abstract
Background A key step in domestication of the grapevine was the transition from separate sexes (dioecy) in wild Vitis vinifera ssp. sylvestris (V. sylvestris) to hermaphroditism in cultivated Vitis vinifera ssp. sativa (V. vinifera). It is known that V. sylvestris has an XY system and V. vinifera a modified Y haplotype (Yh) and that the sex locus is small, but it has not previously been precisely characterized. Results We generate a high-quality de novo reference genome for V. sylvestris, onto which we map whole-genome re-sequencing data of a cross to locate the sex locus. Assembly of the full X, Y, and Yh haplotypes of V. sylvestris and V. vinifera sex locus and examining their gene content and expression profiles during flower development in wild and cultivated accessions show that truncation and deletion of tapetum and pollen development genes on the X haplotype likely causes male sterility, while the upregulation of a Y allele of a cytokinin regulator (APRT3) may cause female sterility. The downregulation of this cytokinin regulator in the Yh haplotype may be sufficient to trigger reversal to hermaphroditism. Molecular dating of X and Y haplotypes is consistent with the sex locus being as old as the Vitis genus, but the mechanism by which recombination was suppressed remains undetermined. Conclusions We describe the genomic and evolutionary characterization of the sex locus of cultivated and wild grapevine, providing a coherent model of sex determination in the latter and for transition from dioecy to hermaphroditism during domestication.
Collapse
Affiliation(s)
- Hélène Badouin
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, F-69622, Villeurbanne, France.
| | - Amandine Velt
- Université de Strasbourg, INRAE, SVQV UMR-A 1131, F-68000, Colmar, France
| | - François Gindraud
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, F-69622, Villeurbanne, France
| | - Timothée Flutre
- GQE-Le Moulon, INRAE, Univ. Paris-Sud, CNRS, AgroParisTech, Univ. Paris-Saclay, 91190, Gif-sur-Yvette, France
| | - Vincent Dumas
- Université de Strasbourg, INRAE, SVQV UMR-A 1131, F-68000, Colmar, France
| | - Sonia Vautrin
- INRAE, Centre National de Ressources Génomiques Végétales, F-31326, Castanet-Tolosan, France
| | - William Marande
- INRAE, Centre National de Ressources Génomiques Végétales, F-31326, Castanet-Tolosan, France
| | - Jonathan Corbi
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, F-69622, Villeurbanne, France
| | - Erika Sallet
- LIPM, Université de Toulouse, INRAE, CNRS, Castanet-Tolosan, France
| | - Jérémy Ganofsky
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, F-69622, Villeurbanne, France
| | - Sylvain Santoni
- INRAE, UMR AGAP, Univ. Montpellier, CIRAD, INRAE, Institut Agro, Montpellier, France
| | | | | | - Kristen Jepsen
- IGM Genomics Center, University of California, San Diego, La Jolla, CA, USA
| | - Jos Käfer
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, F-69622, Villeurbanne, France
| | - Hélène Berges
- INRAE, Centre National de Ressources Génomiques Végétales, F-31326, Castanet-Tolosan, France
| | - Eric Duchêne
- Université de Strasbourg, INRAE, SVQV UMR-A 1131, F-68000, Colmar, France
| | - Franck Picard
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, F-69622, Villeurbanne, France
| | - Philippe Hugueney
- Université de Strasbourg, INRAE, SVQV UMR-A 1131, F-68000, Colmar, France
| | - Raquel Tavares
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, F-69622, Villeurbanne, France
| | - Roberto Bacilieri
- INRAE, UMR AGAP, Univ. Montpellier, CIRAD, INRAE, Institut Agro, Montpellier, France.
| | - Camille Rustenholz
- Université de Strasbourg, INRAE, SVQV UMR-A 1131, F-68000, Colmar, France.
| | - Gabriel A B Marais
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, F-69622, Villeurbanne, France.
| |
Collapse
|
19
|
Foucher J, Ruh M, Préveaux A, Carrère S, Pelletier S, Briand M, Serre RF, Jacques MA, Chen NWG. Common bean resistance to Xanthomonas is associated with upregulation of the salicylic acid pathway and downregulation of photosynthesis. BMC Genomics 2020; 21:566. [PMID: 32811445 PMCID: PMC7437933 DOI: 10.1186/s12864-020-06972-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2020] [Accepted: 08/05/2020] [Indexed: 02/08/2023] Open
Abstract
Background Common bacterial blight (CBB) caused by Xanthomonas phaseoli pv. phaseoli and Xanthomonas citri pv. fuscans is one of the major threats to common bean crops (Phaseolus vulgaris L.). Resistance to CBB is particularly complex as 26 quantitative resistance loci to CBB have been described so far. To date, transcriptomic studies after CBB infection have been very scarce and the molecular mechanisms underlying susceptibility or resistance are largely unknown. Results We sequenced and annotated the genomes of two common bean genotypes being either resistant (BAT93) or susceptible (JaloEEP558) to CBB. Reciprocal BLASTp analysis led to a list of 20,787 homologs between these genotypes and the common bean reference genome (G19833), which provides a solid dataset for further comparative analyses. RNA-Seq after inoculation with X. phaseoli pv. phaseoli showed that the susceptible genotype initiated a more intense and diverse biological response than the resistant genotype. Resistance was linked to upregulation of the salicylic acid pathway and downregulation of photosynthesis and sugar metabolism, while susceptibility was linked to downregulation of resistance genes and upregulation of the ethylene pathway and of genes involved in cell wall modification. Conclusions This study helps better understanding the mechanisms occurring during the early colonization phase of common bean by Xanthomonas and unveils new actors potentially important for resistance and susceptibility to CBB. We discuss the potential link between the pathways induced during bean colonization and genes induced by transcription activator-like effectors (TALEs), as illustrated in other Xanthomonas pathovars.
Collapse
Affiliation(s)
- Justine Foucher
- IRHS, INRAE, AGROCAMPUS OUEST, Université d'Angers, SFR4207 QUASAV, 42, rue Georges Morel, F-49071, Beaucouzé, France
| | - Mylène Ruh
- IRHS, INRAE, AGROCAMPUS OUEST, Université d'Angers, SFR4207 QUASAV, 42, rue Georges Morel, F-49071, Beaucouzé, France
| | - Anne Préveaux
- IRHS, INRAE, AGROCAMPUS OUEST, Université d'Angers, SFR4207 QUASAV, 42, rue Georges Morel, F-49071, Beaucouzé, France
| | - Sébastien Carrère
- CNRS, UMR 2594, Laboratoire des Interactions Plantes-Microorganismes (LIPM), F-31326, Castanet-Tolosan, France
| | - Sandra Pelletier
- IRHS, INRAE, AGROCAMPUS OUEST, Université d'Angers, SFR4207 QUASAV, 42, rue Georges Morel, F-49071, Beaucouzé, France
| | - Martial Briand
- IRHS, INRAE, AGROCAMPUS OUEST, Université d'Angers, SFR4207 QUASAV, 42, rue Georges Morel, F-49071, Beaucouzé, France
| | | | - Marie-Agnès Jacques
- IRHS, INRAE, AGROCAMPUS OUEST, Université d'Angers, SFR4207 QUASAV, 42, rue Georges Morel, F-49071, Beaucouzé, France
| | - Nicolas W G Chen
- IRHS, INRAE, AGROCAMPUS OUEST, Université d'Angers, SFR4207 QUASAV, 42, rue Georges Morel, F-49071, Beaucouzé, France.
| |
Collapse
|
20
|
Pérez-Cobas AE, Gomez-Valero L, Buchrieser C. Metagenomic approaches in microbial ecology: an update on whole-genome and marker gene sequencing analyses. Microb Genom 2020; 6:mgen000409. [PMID: 32706331 PMCID: PMC7641418 DOI: 10.1099/mgen.0.000409] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Accepted: 06/30/2020] [Indexed: 12/23/2022] Open
Abstract
Metagenomics and marker gene approaches, coupled with high-throughput sequencing technologies, have revolutionized the field of microbial ecology. Metagenomics is a culture-independent method that allows the identification and characterization of organisms from all kinds of samples. Whole-genome shotgun sequencing analyses the total DNA of a chosen sample to determine the presence of micro-organisms from all domains of life and their genomic content. Importantly, the whole-genome shotgun sequencing approach reveals the genomic diversity present, but can also give insights into the functional potential of the micro-organisms identified. The marker gene approach is based on the sequencing of a specific gene region. It allows one to describe the microbial composition based on the taxonomic groups present in the sample. It is frequently used to analyse the biodiversity of microbial ecosystems. Despite its importance, the analysis of metagenomic sequencing and marker gene data is quite a challenge. Here we review the primary workflows and software used for both approaches and discuss the current challenges in the field.
Collapse
Affiliation(s)
- Ana Elena Pérez-Cobas
- Institut Pasteur, Biologie des Bactéries Intracellulaires, Paris, France and CNRS UMR 3525, 675724, Paris, France
| | - Laura Gomez-Valero
- Institut Pasteur, Biologie des Bactéries Intracellulaires, Paris, France and CNRS UMR 3525, 675724, Paris, France
| | - Carmen Buchrieser
- Institut Pasteur, Biologie des Bactéries Intracellulaires, Paris, France and CNRS UMR 3525, 675724, Paris, France
| |
Collapse
|
21
|
Structural and Functional Annotation of Transposable Elements Revealed a Potential Regulation of Genes Involved in Rubber Biosynthesis by TE-Derived siRNA Interference in Hevea brasiliensis. Int J Mol Sci 2020; 21:ijms21124220. [PMID: 32545790 PMCID: PMC7353026 DOI: 10.3390/ijms21124220] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2020] [Revised: 06/09/2020] [Accepted: 06/11/2020] [Indexed: 12/14/2022] Open
Abstract
The natural rubber biosynthetic pathway is well described in Hevea, although the final stages of rubber elongation are still poorly understood. Small Rubber Particle Proteins and Rubber Elongation Factors (SRPPs and REFs) are proteins with major function in rubber particle formation and stabilization. Their corresponding genes are clustered on a scaffold1222 of the reference genomic sequence of the Hevea brasiliensis genome. Apart from gene expression by transcriptomic analyses, to date, no deep analyses have been carried out for the genomic environment of SRPPs and REFs loci. By integrative analyses on transposable element annotation, small RNAs production and gene expression, we analysed their role in the control of the transcription of rubber biosynthetic genes. The first in-depth annotation of TEs (Transposable Elements) and their capacity to produce TE-derived siRNAs (small interfering RNAs) is presented, only possible in the Hevea brasiliensis clone PB 260 for which all data are available. We observed that 11% of genes are located near TEs and their presence may interfere in their transcription at both genetic and epigenetic level. We hypothesized that the genomic environment of rubber biosynthesis genes has been shaped by TE and TE-derived siRNAs with possible transcriptional interference on their gene expression. We discussed possible functionalization of TEs as enhancers and as donors of alternative transcription start sites in promoter sequences, possibly through the modelling of genetic and epigenetic landscapes.
Collapse
|
22
|
Sargent L, Liu Y, Leung W, Mortimer NT, Lopatto D, Goecks J, Elgin SCR. G-OnRamp: Generating genome browsers to facilitate undergraduate-driven collaborative genome annotation. PLoS Comput Biol 2020; 16:e1007863. [PMID: 32497138 PMCID: PMC7272004 DOI: 10.1371/journal.pcbi.1007863] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
Abstract
Scientists are sequencing new genomes at an increasing rate with the goal of associating genome contents with phenotypic traits. After a new genome is sequenced and assembled, structural gene annotation is often the first step in analysis. Despite advances in computational gene prediction algorithms, most eukaryotic genomes still benefit from manual gene annotation. This requires access to good genome browsers to enable annotators to visualize and evaluate multiple lines of evidence (e.g., sequence similarity, RNA sequencing [RNA-Seq] results, gene predictions, repeats) and necessitates many volunteers to participate in the work. To address the technical barriers to creating genome browsers, the Genomics Education Partnership (GEP; https://gep.wustl.edu/) has partnered with the Galaxy Project (https://galaxyproject.org) to develop G-OnRamp (http://g-onramp.org), a web-based platform for creating UCSC Genome Browser Assembly Hubs and JBrowse genome browsers. G-OnRamp also converts a JBrowse instance into an Apollo instance for collaborative genome annotations in research and educational settings. The genome browsers produced can be transferred to the CyVerse Data Store for long-term access. G-OnRamp enables researchers to easily visualize their experimental results, educators to create Course-based Undergraduate Research Experiences (CUREs) centered on genome annotation, and students to participate in genomics research. In the process, students learn about genes/genomes and about how to utilize large datasets. Development of G-OnRamp was guided by extensive user feedback. Sixty-five researchers/educators from >40 institutions participated through in-person workshops, which produced >20 genome browsers now available for research and education. Genome browsers generated for four parasitoid wasp species have been used in a CURE engaging students at 15 colleges and universities. Our assessment results in the classroom demonstrate that the genome browsers produced by G-OnRamp are effective tools for engaging undergraduates in research and in enabling their contributions to the scientific literature in genomics. Expansion of such genomics research/education partnerships will be beneficial to researchers, faculty, and students alike.
Collapse
Affiliation(s)
- Luke Sargent
- Department of Biomedical Engineering, Oregon Health & Science University, Portland, Oregon, United States of America
| | - Yating Liu
- Department of Biology, Washington University in St. Louis, St. Louis, Missouri, United States of America
| | - Wilson Leung
- Department of Biology, Washington University in St. Louis, St. Louis, Missouri, United States of America
| | - Nathan T. Mortimer
- School of Biological Sciences, Illinois State University, Normal, Illinois, United States of America
| | - David Lopatto
- Department of Psychology, Grinnell College, Grinnell, Iowa, United States of America
| | - Jeremy Goecks
- Department of Biomedical Engineering, Oregon Health & Science University, Portland, Oregon, United States of America
| | - Sarah C. R. Elgin
- Department of Biology, Washington University in St. Louis, St. Louis, Missouri, United States of America
| |
Collapse
|
23
|
Brůna T, Lomsadze A, Borodovsky M. GeneMark-EP+: eukaryotic gene prediction with self-training in the space of genes and proteins. NAR Genom Bioinform 2020; 2:lqaa026. [PMID: 32440658 PMCID: PMC7222226 DOI: 10.1093/nargab/lqaa026] [Citation(s) in RCA: 198] [Impact Index Per Article: 49.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2019] [Revised: 03/10/2020] [Accepted: 05/12/2020] [Indexed: 01/29/2023] Open
Abstract
We have made several steps toward creating a fast and accurate algorithm for gene prediction in eukaryotic genomes. First, we introduced an automated method for efficient ab initio gene finding, GeneMark-ES, with parameters trained in iterative unsupervised mode. Next, in GeneMark-ET we proposed a method of integration of unsupervised training with information on intron positions revealed by mapping short RNA reads. Now we describe GeneMark-EP, a tool that utilizes another source of external information, a protein database, readily available prior to the start of a sequencing project. A new specialized pipeline, ProtHint, initiates massive protein mapping to genome and extracts hints to splice sites and translation start and stop sites of potential genes. GeneMark-EP uses the hints to improve estimation of model parameters as well as to adjust coordinates of predicted genes if they disagree with the most reliable hints (the -EP+ mode). Tests of GeneMark-EP and -EP+ demonstrated improvements in gene prediction accuracy in comparison with GeneMark-ES, while the GeneMark-EP+ showed higher accuracy than GeneMark-ET. We have observed that the most pronounced improvements in gene prediction accuracy happened in large eukaryotic genomes.
Collapse
Affiliation(s)
- Tomáš Brůna
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | - Alexandre Lomsadze
- Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | - Mark Borodovsky
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA 30332, USA.,Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA.,School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA
| |
Collapse
|
24
|
Hufnagel B, Marques A, Soriano A, Marquès L, Divol F, Doumas P, Sallet E, Mancinotti D, Carrere S, Marande W, Arribat S, Keller J, Huneau C, Blein T, Aimé D, Laguerre M, Taylor J, Schubert V, Nelson M, Geu-Flores F, Crespi M, Gallardo K, Delaux PM, Salse J, Bergès H, Guyot R, Gouzy J, Péret B. High-quality genome sequence of white lupin provides insight into soil exploration and seed quality. Nat Commun 2020; 11:492. [PMID: 31980615 PMCID: PMC6981116 DOI: 10.1038/s41467-019-14197-9] [Citation(s) in RCA: 68] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2019] [Accepted: 12/19/2019] [Indexed: 12/31/2022] Open
Abstract
White lupin (Lupinus albus L.) is an annual crop cultivated for its protein-rich seeds. It is adapted to poor soils due to the production of cluster roots, which are made of dozens of determinate lateral roots that drastically improve soil exploration and nutrient acquisition (mostly phosphate). Using long-read sequencing technologies, we provide a high-quality genome sequence of a cultivated accession of white lupin (2n = 50, 451 Mb), as well as de novo assemblies of a landrace and a wild relative. We describe a modern accession displaying increased soil exploration capacity through early establishment of lateral and cluster roots. We also show how seed quality may have been impacted by domestication in term of protein profiles and alkaloid content. The availability of a high-quality genome assembly together with companion genomic and transcriptomic resources will enable the development of modern breeding strategies to increase and stabilize white lupin yield. White lupin is an annual crop cultivated for protein rich seeds and can produce cluster roots for efficient phosphate acquisition. Here, the authors generate high quality genome assemblies of a cultivated accession, a landrace, and a wild relative and provides insight into soil exploration and seed quality.
Collapse
Affiliation(s)
- Bárbara Hufnagel
- BPMP, Univ Montpellier, CNRS, INRAE, SupAgro, Montpellier, France
| | - André Marques
- BPMP, Univ Montpellier, CNRS, INRAE, SupAgro, Montpellier, France.,MPIPZ, Cologne, Germany
| | | | - Laurence Marquès
- BPMP, Univ Montpellier, CNRS, INRAE, SupAgro, Montpellier, France
| | - Fanchon Divol
- BPMP, Univ Montpellier, CNRS, INRAE, SupAgro, Montpellier, France
| | - Patrick Doumas
- BPMP, Univ Montpellier, CNRS, INRAE, SupAgro, Montpellier, France
| | - Erika Sallet
- LIPM, Université de Toulouse, INRAE, CNRS, Castanet-Tolosan, France
| | | | | | | | | | - Jean Keller
- Laboratoire de Recherche en Sciences Végétales (LRSV), Université de Toulouse, CNRS, Castanet Tolosan, France
| | | | - Thomas Blein
- Institute of Plant Sciences Paris-Saclay, Gif-sur-Yvette, France
| | | | - Malika Laguerre
- BPMP, Univ Montpellier, CNRS, INRAE, SupAgro, Montpellier, France
| | | | | | | | | | - Martin Crespi
- Institute of Plant Sciences Paris-Saclay, Gif-sur-Yvette, France
| | | | - Pierre-Marc Delaux
- Laboratoire de Recherche en Sciences Végétales (LRSV), Université de Toulouse, CNRS, Castanet Tolosan, France
| | | | | | - Romain Guyot
- IRD, Montpellier, France INRAE / 13 Department of Electronics and Automatization, Universidad Autónoma de Manizales, Manizales, Colombia
| | - Jérôme Gouzy
- LIPM, Université de Toulouse, INRAE, CNRS, Castanet-Tolosan, France
| | - Benjamin Péret
- BPMP, Univ Montpellier, CNRS, INRAE, SupAgro, Montpellier, France.
| |
Collapse
|