1
Jiang Z, Peng Z, Wei Z, Sun J, Luo Y, Bie L, Zhang G, Wang Y. A deep learning-based method enables the automatic and accurate assembly of chromosome-level genomes. Nucleic Acids Res 2024:gkae789. [PMID: 39287126 DOI: 10.1093/nar/gkae789] [Received: 06/28/2024] [Revised: 08/25/2024] [Accepted: 08/30/2024] [Indexed: 09/19/2024]
Abstract
The application of high-throughput chromosome conformation capture (Hi-C) technology enables the construction of chromosome-level assemblies. However, the correction of errors and the anchoring of sequences to chromosomes in the assembly remain significant challenges. In this study, we developed a deep learning-based method, AutoHiC, to address the challenges in chromosome-level genome assembly by enhancing contiguity and accuracy. Conventional Hi-C-aided scaffolding often requires manual refinement, but AutoHiC instead utilizes Hi-C data for automated workflows and iterative error correction. When trained on data from 300+ species, AutoHiC demonstrated a robust average error detection accuracy exceeding 90%. The benchmarking results confirmed its significant impact on genome contiguity and error correction. The innovative approach and comprehensive results of AutoHiC constitute a breakthrough in automated error detection, promising more accurate genome assemblies for advancing genomics research.
Affiliation(s)
- Zijie Jiang
  - Integrative Science Center of Germplasm Creation in Western China (CHONGQING) Science City, Biological Science Research Center, Southwest University, Chongqing, China
- Zhixiang Peng
  - Integrative Science Center of Germplasm Creation in Western China (CHONGQING) Science City, Biological Science Research Center, Southwest University, Chongqing, China
- Zhaoyuan Wei
  - Integrative Science Center of Germplasm Creation in Western China (CHONGQING) Science City, Biological Science Research Center, Southwest University, Chongqing, China
- Jiahe Sun
  - Integrative Science Center of Germplasm Creation in Western China (CHONGQING) Science City, Biological Science Research Center, Southwest University, Chongqing, China
- Yongjiang Luo
  - Integrative Science Center of Germplasm Creation in Western China (CHONGQING) Science City, Biological Science Research Center, Southwest University, Chongqing, China
- Lingzi Bie
  - Integrative Science Center of Germplasm Creation in Western China (CHONGQING) Science City, Biological Science Research Center, Southwest University, Chongqing, China
- Guoqing Zhang
  - Integrative Science Center of Germplasm Creation in Western China (CHONGQING) Science City, Biological Science Research Center, Southwest University, Chongqing, China
- Yi Wang
  - Integrative Science Center of Germplasm Creation in Western China (CHONGQING) Science City, Biological Science Research Center, Southwest University, Chongqing, China
2
Benowitz KM, Allan CW, Jaworski CC, Sanderson MJ, Diaz F, Chen X, Matzkin LM. Fundamental Patterns of Structural Evolution Revealed by Chromosome-Length Genomes of Cactophilic Drosophila. Genome Biol Evol 2024; 16:evae191. [PMID: 39228294 PMCID: PMC11411373 DOI: 10.1093/gbe/evae191] [Received: 06/21/2023] [Revised: 08/22/2024] [Accepted: 08/26/2024] [Indexed: 09/05/2024]
Abstract
A thorough understanding of adaptation and speciation requires model organisms with both a history of ecological and phenotypic study as well as a complete set of genomic resources. In particular, high-quality genome assemblies of ecological model organisms are needed to assess the evolution of genome structure and its role in adaptation and speciation. Here, we generate new genomes of cactophilic Drosophila, a crucial model clade for understanding speciation and ecological adaptation in xeric environments. We generated chromosome-level genome assemblies and complete annotations for seven populations across Drosophila mojavensis, Drosophila arizonae, and Drosophila navojoa. We use these data first to establish the most robust phylogeny for this clade to date, and to assess patterns of molecular evolution across the phylogeny, showing concordance with a priori hypotheses regarding adaptive genes in this system. We then show that structural evolution occurs at constant rate across the phylogeny, varies by chromosome, and is correlated with molecular evolution. These results advance the understanding of the D. mojavensis clade by demonstrating core evolutionary genetic patterns and integrating those patterns to generate new gene-level hypotheses regarding adaptation. Our data are presented in a new public database (cactusflybase.arizona.edu), providing one of the most in-depth resources for the analysis of inter- and intraspecific evolutionary genomic data. Furthermore, we anticipate that the patterns of structural evolution identified here will serve as a baseline for future comparative studies to identify the factors that influence the evolution of genome structure across taxa.
Affiliation(s)
- Kyle M Benowitz
  - Department of Entomology, University of Arizona, Tucson, AZ, USA
- Carson W Allan
  - Department of Entomology, University of Arizona, Tucson, AZ, USA
- Michael J Sanderson
  - Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, USA
- Fernando Diaz
  - Department of Entomology, University of Arizona, Tucson, AZ, USA
- Xingsen Chen
  - Department of Entomology, University of Arizona, Tucson, AZ, USA
- Luciano M Matzkin
  - Department of Entomology, University of Arizona, Tucson, AZ, USA
  - Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, USA
  - BIO5 Institute, University of Arizona, Tucson, AZ, USA
3
Peña TA, Villarreal P, Agier N, De Chiara M, Barría T, Urbina K, Villarroel CA, Santos ARO, Rosa CA, Nespolo RF, Liti G, Fischer G, Cubillos FA. An integrative taxonomy approach reveals Saccharomyces chiloensis sp. nov. as a newly discovered species from Coastal Patagonia. PLoS Genet 2024; 20:e1011396. [PMID: 39241096 PMCID: PMC11410238 DOI: 10.1371/journal.pgen.1011396] [Received: 07/31/2024] [Revised: 09/18/2024] [Accepted: 08/14/2024] [Indexed: 09/08/2024]
Abstract
Species delineation in microorganisms is challenging due to the limited markers available for accurate species assignment. Here, we applied an integrative taxonomy approach, combining extensive sampling, whole-genome sequence-based classification, phenotypic profiling, and assessment of interspecific reproductive isolation. Our work reveals the presence of a distinct Saccharomyces lineage in Nothofagus forests of coastal Patagonia. This lineage, designated Saccharomyces chiloensis sp. nov., exhibits 7% genetic divergence from its sister species S. uvarum, as revealed by whole-genome sequencing and population analyses. The South America-C (SA-C) coastal Patagonia population forms a unique clade closely related to a previously described divergent S. uvarum population from Oceania (AUS, found in Australia and New Zealand). Our species reclassification is supported by a low Ortho Average Nucleotide Identity (OANI) of 93% in SA-C and AUS relative to S. uvarum, which falls below the suggested species delineation threshold of 95%, indicating an independent evolutionary lineage. Hybrid spore viability assessment provided compelling evidence that SA-C and AUS are reproductively isolated from S. uvarum. In addition, we found unique structural variants between S. chiloensis sp. nov. lineages, including large-scale chromosomal translocations and inversions, together with a distinct phenotypic profile, emphasizing their intraspecies genetic distinctiveness. We suggest that S. chiloensis sp. nov. diverged from S. uvarum in allopatry due to glaciation, followed by post-glacial dispersal, resulting in distinct lineages on opposite sides of the Pacific Ocean. The discovery of S. chiloensis sp. nov. illustrates the uniqueness of Patagonia's coastal biodiversity and underscores the importance of adopting an integrative taxonomic approach in species delineation to unveil cryptic microbial species. The holotype of S. chiloensis sp. nov. is CBS 18620T.
Affiliation(s)
- Tomas A Peña
  - Facultad de Química y Biología, Departamento de Biología, Universidad de Santiago de Chile, Santiago, Chile
  - Millennium Institute for Integrative Biology (iBio), Santiago, Chile
- Pablo Villarreal
  - Facultad de Química y Biología, Departamento de Biología, Universidad de Santiago de Chile, Santiago, Chile
  - Millennium Institute for Integrative Biology (iBio), Santiago, Chile
- Nicolas Agier
  - Laboratory of Computational and Quantitative Biology, CNRS, Institut de Biologie Paris-Seine, Sorbonne Université, Paris, France
- Tomas Barría
  - Facultad de Química y Biología, Departamento de Biología, Universidad de Santiago de Chile, Santiago, Chile
- Kamila Urbina
  - Facultad de Química y Biología, Departamento de Biología, Universidad de Santiago de Chile, Santiago, Chile
  - Millenium Nucleus of Patagonian Limit of Life (LiLi), Santiago, Chile
- Carlos A Villarroel
  - Centro de Biotecnología de los Recursos Naturales (CENBio), Facultad de Ciencias Agrarias y Forestales, Universidad Católica del Maule, Talca, Chile
- Ana R O Santos
  - Departamento de Microbiologia, ICB, C.P. 486, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
- Carlos A Rosa
  - Departamento de Microbiologia, ICB, C.P. 486, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
- Roberto F Nespolo
  - Millennium Institute for Integrative Biology (iBio), Santiago, Chile
  - Millenium Nucleus of Patagonian Limit of Life (LiLi), Santiago, Chile
  - Instituto de Ciencias Ambientales y Evolutivas, Universidad Austral de Chile, Valdivia, Chile
  - Center of Applied Ecology and Sustainability (CAPES), Facultad de Ciencias Biológicas, Universidad Católica de Chile, Santiago, Chile
- Gianni Liti
  - Université Côte d'Azur, CNRS, INSERM, IRCAN, Nice, France
- Gilles Fischer
  - Laboratory of Computational and Quantitative Biology, CNRS, Institut de Biologie Paris-Seine, Sorbonne Université, Paris, France
- Francisco A Cubillos
  - Facultad de Química y Biología, Departamento de Biología, Universidad de Santiago de Chile, Santiago, Chile
  - Millennium Institute for Integrative Biology (iBio), Santiago, Chile
  - Millenium Nucleus of Patagonian Limit of Life (LiLi), Santiago, Chile
4
Lai S, Wang H, Bork P, Chen WH, Zhao XM. Long-read sequencing reveals extensive gut phageome structural variations driven by genetic exchange with bacterial hosts. Sci Adv 2024; 10:eadn3316. [PMID: 39141729 PMCID: PMC11323893 DOI: 10.1126/sciadv.adn3316] [Received: 12/05/2023] [Accepted: 07/10/2024] [Indexed: 08/16/2024]
Abstract
Genetic variations are instrumental for unraveling phage evolution and deciphering their functional implications. Here, we explore the underlying fine-scale genetic variations in the gut phageome, especially structural variations (SVs). By using virome-enriched long-read metagenomic sequencing across 91 individuals, we identified a total of 14,438 nonredundant phage SVs and revealed their prevalence within the human gut phageome. These SVs are mainly enriched in genes involved in recombination, DNA methylation, and antibiotic resistance. Notably, a substantial fraction of phage SV sequences share close homology with bacterial fragments, with most SVs enriched for horizontal gene transfer (HGT) mechanisms. Further investigations showed that these SV sequences were genetically exchanged between specific phage-bacteria pairs, particularly between phages and their respective bacterial hosts. Temperate phages exhibit a higher frequency of genetic exchange with bacterial chromosomes than virulent phages. Collectively, our findings provide insights into the genetic landscape of the human gut phageome.
Affiliation(s)
- Senying Lai
  - Department of Neurology, Zhongshan Hospital and Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
  - State Key Laboratory of Medical Neurobiology, Institutes of Brain Science, Fudan University, Shanghai, China
  - MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China
- Huarui Wang
  - Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular Imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, China
- Peer Bork
  - European Molecular Biology Laboratory, Structural and Computational Biology Unit, Heidelberg, Germany
  - Max Delbrück Centre for Molecular Medicine, Berlin, Germany
  - Department of Bioinformatics, Biocenter, University of Würzburg, Würzburg, Germany
- Wei-Hua Chen
  - State Key Laboratory of Medical Neurobiology, Institutes of Brain Science, Fudan University, Shanghai, China
  - College of Life Science, Henan Normal University, Xinxiang, Henan, China
- Xing-Ming Zhao
  - Department of Neurology, Zhongshan Hospital and Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
  - State Key Laboratory of Medical Neurobiology, Institutes of Brain Science, Fudan University, Shanghai, China
  - MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China
5
Curry KD, Yu FB, Vance SE, Segarra S, Bhaya D, Chikhi R, Rocha EPC, Treangen TJ. Reference-free structural variant detection in microbiomes via long-read co-assembly graphs. Bioinformatics 2024; 40:i58-i67. [PMID: 38940156 PMCID: PMC11211843 DOI: 10.1093/bioinformatics/btae224] [Indexed: 06/29/2024]
Abstract
MOTIVATION: The study of bacterial genome dynamics is vital for understanding the mechanisms underlying microbial adaptation, growth, and their impact on host phenotype. Structural variants (SVs), genomic alterations of 50 base pairs or more, play a pivotal role in driving evolutionary processes and maintaining genomic heterogeneity within bacterial populations. While SV detection in isolate genomes is relatively straightforward, metagenomes present broader challenges due to the absence of clear reference genomes and the presence of mixed strains. In response, our proposed method, rhea, forgoes reference genomes and metagenome-assembled genomes (MAGs) by combining all metagenomic samples in a series (time or other metric) into a single co-assembly graph. The log fold change in graph coverage between successive samples is then calculated to call SVs that are thriving or declining.
RESULTS: We show rhea to outperform existing methods for SV and horizontal gene transfer (HGT) detection in two simulated mock metagenomes, particularly as the simulated reads diverge from reference genomes and an increase in strain diversity is incorporated. We additionally demonstrate use cases for rhea on series metagenomic data of environmental and fermented food microbiomes to detect specific sequence alterations between successive time and temperature samples, suggesting host advantage. Our approach leverages previous work in assembly graph structural and coverage patterns to provide versatility in studying SVs across diverse and poorly characterized microbial communities for more comprehensive insights into microbial gene flux.
AVAILABILITY AND IMPLEMENTATION: rhea is open source and available at: https://github.com/treangenlab/rhea.
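The log fold change step described in this abstract can be illustrated with a minimal sketch. This is not rhea's implementation: the function names, the pseudocount of 1.0, the threshold of 1.0, and the toy coverage values are all assumptions chosen for illustration. It only shows the general idea of comparing per-node coverage between successive samples of a series and labelling large increases or decreases.

```python
import math


def log_fold_changes(coverage, pseudocount=1.0):
    """Log2 fold change in coverage between successive samples.

    coverage maps a graph node id to its coverage in each sample,
    listed in series order. The pseudocount (an assumed value) avoids
    division by zero when a node is absent from a sample.
    """
    return {
        node: [
            math.log2((later + pseudocount) / (earlier + pseudocount))
            for earlier, later in zip(series, series[1:])
        ]
        for node, series in coverage.items()
    }


def call_changes(lfc, threshold=1.0):
    """Label each step as thriving, declining, or stable.

    The threshold is a hypothetical cutoff, not a value from the paper.
    """
    def label(value):
        if value >= threshold:
            return "thriving"
        if value <= -threshold:
            return "declining"
        return "stable"

    return {node: [label(v) for v in values] for node, values in lfc.items()}


# Toy coverage values for three hypothetical graph nodes over three samples.
coverage = {
    "node_a": [3.0, 7.0, 31.0],
    "node_b": [15.0, 7.0, 7.0],
    "node_c": [9.0, 9.0, 9.0],
}
lfc = log_fold_changes(coverage)
calls = call_changes(lfc)
for node in coverage:
    print(node, lfc[node], calls[node])
```

With these toy values, node_a doubles and then quadruples in pseudocount-adjusted coverage, node_b halves once and then holds steady, and node_c never changes. A real tool would additionally need to relate such coverage shifts to graph structure before calling an SV.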
Affiliation(s)
- Kristen D Curry
  - Department of Computer Science, Rice University, 6100 Main St., Houston, TX 77005, United States
  - Department of Genomes and Genetics, Microbial Evolutionary Genomics, Institut Pasteur, Université Paris Cité, CNRS, UMR3525, Paris 75015, France
- Summer E Vance
  - Department of Environmental Science, Policy, and Management, University of California, Berkeley, CA 94720, United States
- Santiago Segarra
  - Department of Electrical and Computer Engineering, Rice University, Houston, TX 77005, United States
- Devaki Bhaya
  - Carnegie Institution for Science, Department of Plant Biology, Stanford, CA 94305, United States
- Rayan Chikhi
  - Department of Computational Biology, Institut Pasteur, Université Paris Cité, Paris 75015, France
- Eduardo P C Rocha
  - Department of Genomes and Genetics, Microbial Evolutionary Genomics, Institut Pasteur, Université Paris Cité, CNRS, UMR3525, Paris 75015, France
- Todd J Treangen
  - Department of Computer Science, Rice University, 6100 Main St., Houston, TX 77005, United States
6
Huang J, Zhang Y, Li Y, Xing M, Lei C, Wang S, Nie Y, Wang Y, Zhao M, Han Z, Sun X, Zhou H, Wang Y, Zheng X, Xiao X, Fan W, Liu Z, Guo W, Zhang L, Cheng Y, Qian Q, He H, Yang Q, Qiao W. Haplotype-resolved gapless genome and chromosome segment substitution lines facilitate gene identification in wild rice. Nat Commun 2024; 15:4573. [PMID: 38811581 PMCID: PMC11137157 DOI: 10.1038/s41467-024-48845-6] [Received: 02/14/2023] [Accepted: 05/15/2024] [Indexed: 05/31/2024]
Abstract
The abundant genetic variation harbored by wild rice (Oryza rufipogon) has provided a reservoir of useful genes for rice breeding. However, the genome of wild rice has not yet been comprehensively assessed. Here, we report the haplotype-resolved gapless genome assembly and annotation of wild rice Y476. In addition, we develop two sets of chromosome segment substitution lines (CSSLs) using Y476 as the donor parent and cultivated rice as the recurrent parents. By analyzing the gapless reference genome and CSSL population, we identify 254 QTLs associated with agronomic traits, biotic and abiotic stresses. We clone a receptor-like kinase gene associated with rice blast resistance and confirm its wild rice allele improves rice blast resistance. Collectively, our study provides a haplotype-resolved gapless reference genome and demonstrates a highly efficient platform for gene identification from wild rice.
Affiliation(s)
- Jingfen Huang
  - State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- Yilin Zhang
  - School of Advanced Agriculture Sciences and School of Life Sciences, State Key Laboratory of Protein and Plant Gene Research, Peking University, Beijing, China
  - Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences at Weifang, Weifang, Shandong, China
- Yapeng Li
  - National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
  - Hainan Academy of Agricultural Sciences, Haikou, Hainan, China
- Meng Xing
  - State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
  - National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
- Cailin Lei
  - State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
  - National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
- Shizhuang Wang
  - State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
  - National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
- Yamin Nie
  - State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
  - National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
- Yanyan Wang
  - State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
  - National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
- Mingchao Zhao
  - National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
  - Hainan Academy of Agricultural Sciences, Haikou, Hainan, China
- Zhenyun Han
  - State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- Xianjun Sun
  - State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- Han Zhou
  - School of Advanced Agriculture Sciences and School of Life Sciences, State Key Laboratory of Protein and Plant Gene Research, Peking University, Beijing, China
  - Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences at Weifang, Weifang, Shandong, China
- Yan Wang
  - Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences at Weifang, Weifang, Shandong, China
- Xiaoming Zheng
  - State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
  - National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
- Xiaorong Xiao
  - National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
  - Hainan Academy of Agricultural Sciences, Haikou, Hainan, China
- Weiya Fan
  - State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- Ziran Liu
  - State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- Wenlong Guo
  - State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- Lifang Zhang
  - State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- Yunlian Cheng
  - State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- Qian Qian
  - State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
  - National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
- Hang He
  - School of Advanced Agriculture Sciences and School of Life Sciences, State Key Laboratory of Protein and Plant Gene Research, Peking University, Beijing, China
  - Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences at Weifang, Weifang, Shandong, China
- Qingwen Yang
  - State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
  - National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
- Weihua Qiao
  - State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
  - National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
7
Tsouris A, Brach G, Friedrich A, Hou J, Schacherer J. Diallel panel reveals a significant impact of low-frequency genetic variants on gene expression variation in yeast. Mol Syst Biol 2024; 20:362-373. [PMID: 38355920 PMCID: PMC10987670 DOI: 10.1038/s44320-024-00021-0] [Received: 09/15/2023] [Revised: 01/29/2024] [Accepted: 01/30/2024] [Indexed: 02/16/2024]
Abstract
Unraveling the genetic sources of gene expression variation is essential to better understand the origins of phenotypic diversity in natural populations. Genome-wide association studies have identified thousands of variants involved in gene expression variation; however, the variants detected explain only part of the heritability. In fact, variants such as low-frequency and structural variants (SVs) are poorly captured in association studies. To assess the impact of these variants on gene expression variation, we explored a half-diallel panel composed of 323 hybrids originating from pairwise crosses of 26 natural Saccharomyces cerevisiae isolates. Using short- and long-read sequencing strategies, we established an exhaustive catalog of single nucleotide polymorphisms (SNPs) and SVs for this panel. Combining this dataset with the transcriptomes of all hybrids, we comprehensively mapped SNPs and SVs associated with gene expression variation. While SVs impact gene expression variation, SNPs exhibit a higher effect size with an overrepresentation of low-frequency variants compared to common ones. These results reinforce the importance of dissecting the heritability of complex traits with a comprehensive catalog of genetic variants at the population level.
Affiliation(s)
- Andreas Tsouris
  - Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
- Gauthier Brach
  - Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
- Anne Friedrich
  - Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
- Jing Hou
  - Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
- Joseph Schacherer
  - Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
  - Institut Universitaire de France (IUF), Paris, France
8
Zhang Z, Gomes Viana JP, Zhang B, Walden KKO, Müller Paul H, Moose SP, Morris GP, Daum C, Barry KW, Shakoor N, Hudson ME. Major impacts of widespread structural variation on sorghum. Genome Res 2024; 34:286-299. [PMID: 38479835 PMCID: PMC10984582 DOI: 10.1101/gr.278396.123] [Received: 08/21/2023] [Accepted: 01/22/2024] [Indexed: 03/22/2024]
Abstract
Genetic diversity is critical to crop breeding and improvement, and dissection of the genomic variation underlying agronomic traits can both assist breeding and give insight into basic biological mechanisms. Although recent genome analyses in plants reveal many structural variants (SVs), most current studies of crop genetic variation are dominated by single-nucleotide polymorphisms (SNPs). The extent of the impact of SVs on global trait variation, as well as their utility in genome-wide selection, is not yet understood. In this study, we built an SV data set based on whole-genome resequencing of diverse sorghum lines (n = 363), validated the correlation of photoperiod sensitivity and variety type, and identified SV hotspots underlying the divergent evolution of cellulosic and sweet sorghum. In addition, we showed the complementary contribution of SVs for heritability of traits related to sorghum adaptation. Importantly, inclusion of SV polymorphisms in association studies revealed genotype-phenotype associations not observed with SNPs alone. Three-way genome-wide association studies (GWAS) based on whole-genome SNP, SV, and integrated SNP + SV data sets showed substantial associations between SVs and sorghum traits. The addition of SVs to GWAS substantially increased heritability estimates for some traits, indicating their important contribution to functional allelic variation at the genome level. Our discovery of the widespread impacts of SVs on heritable gene expression variation could render a plausible mechanism for their disproportionate impact on phenotypic variation. This study expands our knowledge of SVs and emphasizes the extensive impacts of SVs on sorghum.
Affiliation(s)
- Zhihai Zhang
  - DOE Center for Advanced Bioenergy and Bioproducts Innovation (CABBI), University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
- Joao Paulo Gomes Viana
  - Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
- Bosen Zhang
  - Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
- Kimberly K O Walden
  - High Performance Computing in Biology, Carver Biotechnology Center, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
- Hans Müller Paul
  - DOE Center for Advanced Bioenergy and Bioproducts Innovation (CABBI), University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
- Stephen P Moose
  - DOE Center for Advanced Bioenergy and Bioproducts Innovation (CABBI), University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
  - Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
- Geoffrey P Morris
  - Department of Soil and Crop Science, Colorado State University, Fort Collins, Colorado 80523, USA
- Chris Daum
  - United States Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
- Kerrie W Barry
  - United States Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
- Nadia Shakoor
  - Donald Danforth Plant Science Center, St. Louis, Missouri 63132, USA
- Matthew E Hudson
  - DOE Center for Advanced Bioenergy and Bioproducts Innovation (CABBI), University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
  - Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
9
Curry KD, Yu FB, Vance SE, Segarra S, Bhaya D, Chikhi R, Rocha EP, Treangen TJ. Reference-free Structural Variant Detection in Microbiomes via Long-read Coassembly Graphs. bioRxiv 2024:2024.01.25.577285. [PMID: 38352454 PMCID: PMC10862772 DOI: 10.1101/2024.01.25.577285] [Indexed: 02/20/2024]
Abstract
Bacterial genome dynamics are vital for understanding the mechanisms underlying microbial adaptation, growth, and their broader impact on host phenotype. Structural variants (SVs), genomic alterations of 10 base pairs or more, play a pivotal role in driving evolutionary processes and maintaining genomic heterogeneity within bacterial populations. While SV detection in isolate genomes is relatively straightforward, metagenomes present broader challenges due to the absence of clear reference genomes and the presence of mixed strains. In response, our proposed method, rhea, forgoes reference genomes and metagenome-assembled genomes (MAGs) by using a single metagenome coassembly graph constructed from all samples in a series. The log fold change in graph coverage between subsequent samples is then calculated to call SVs that are thriving or declining throughout the series. We show rhea to outperform existing methods for SV and horizontal gene transfer (HGT) detection in two simulated mock metagenomes, which is particularly noticeable as the simulated reads diverge from reference genomes and an increase in strain diversity is incorporated. We additionally demonstrate use cases for rhea on series metagenomic data of environmental and fermented food microbiomes to detect specific sequence alterations between subsequent time and temperature samples, suggesting host advantage. Our innovative approach leverages raw read patterns rather than references or MAGs to include all sequencing reads in analysis, and thus provides versatility in studying SVs across diverse and poorly characterized microbial communities for more comprehensive insights into microbial genome dynamics.
Affiliation(s)
- Kristen D. Curry
- Rice University, Department of Computer Science, Houston, TX 77005, United States
- Institut Pasteur, Université Paris Cité, CNRS, UMR3525, Microbial Evolutionary Genomics, 75015 Paris, France
| | - Summer E. Vance
- University of California, Berkeley, Department of Environmental Science, Policy, and Management, Berkeley, CA 94720, United States
| | - Santiago Segarra
- Rice University, Department of Electrical and Computer Engineering, Houston, TX 77005, United States
| | - Devaki Bhaya
- Carnegie Institution for Science, Department of Plant Biology, Stanford, CA 94305, United States
| | - Rayan Chikhi
- Institut Pasteur, Université Paris Cité, Sequence Bioinformatics unit, 75015 Paris, France
| | - Eduardo P.C. Rocha
- Institut Pasteur, Université Paris Cité, CNRS, UMR3525, Microbial Evolutionary Genomics, 75015 Paris, France
| | - Todd J. Treangen
- Rice University, Department of Computer Science, Houston, TX 77005, United States
| |
|
10
|
Villarreal P, O'Donnell S, Agier N, Muñoz-Guzman F, Benavides-Parra J, Urbina K, Peña TA, Solomon M, Nespolo RF, Fischer G, Varela C, Cubillos FA. Domestication signatures in the non-conventional yeast Lachancea cidri. mSystems 2024; 9:e0105823. [PMID: 38085042 PMCID: PMC10805023 DOI: 10.1128/msystems.01058-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/03/2023] [Accepted: 11/06/2023] [Indexed: 01/24/2024] Open
Abstract
Evaluating domestication signatures beyond model organisms is essential for a thorough understanding of the genotype-phenotype relationship in wild and human-related environments. Structural variations (SVs) can significantly impact phenotypes playing an important role in the physiological adaptation of species to different niches, including during domestication. A detailed characterization of the fitness consequences of these genomic rearrangements, however, is still limited in non-model systems, largely due to the paucity of direct comparisons between domesticated and wild isolates. Here, we used a combination of sequencing strategies to explore major genomic rearrangements in a Lachancea cidri yeast strain isolated from cider (CBS2950) and compared them to those in eight wild isolates from primary forests. Genomic analysis revealed dozens of SVs, including a large reciprocal translocation (~16 kb and 500 kb) present in the cider strain, but absent from all wild strains. Interestingly, the number of SVs was higher relative to single-nucleotide polymorphisms in the cider strain, suggesting a significant role in the strain's phenotypic variation. The set of SVs identified directly impacts dozens of genes and likely underpins the greater fermentation performance in the L. cidri CBS2950. In addition, the large reciprocal translocation affects a proline permease (PUT4) regulatory region, resulting in higher PUT4 transcript levels, which agrees with higher ethanol tolerance, improved cell growth when using proline, and higher amino acid consumption during fermentation. 
These results suggest that SVs are responsible for the rapid physiological adaptation of yeast to a human-related environment and demonstrate the key contribution of SVs in adaptive fermentative traits in non-model species.
IMPORTANCE: The exploration of domestication signatures associated with human-related environments has predominantly focused on studies conducted on model organisms, such as Saccharomyces cerevisiae, overlooking the potential for comparisons across other non-Saccharomyces species. In our research, employing a combination of long- and short-read data, we found domestication signatures in Lachancea cidri, a non-model species recently isolated from fermentative environments in cider in France. The significance of our study lies in the identification of a large array of major genomic rearrangements in a cider strain compared to wild isolates, which underlie several fermentative traits. These domestication signatures result from structural variants, which are likely responsible for the phenotypic differences between strains, providing a rapid path of adaptation to human-related environments.
Affiliation(s)
- Pablo Villarreal
- Departamento de Biología, Facultad de Química y Biología, Universidad de Santiago de Chile, Santiago, Chile
- Millennium Institute for Integrative Biology (iBio), Santiago, Chile
| | - Samuel O'Donnell
- Laboratory of Computational and Quantitative Biology, CNRS, Institut de Biologie Paris-Seine, Sorbonne Université, Paris, France
| | - Nicolas Agier
- Laboratory of Computational and Quantitative Biology, CNRS, Institut de Biologie Paris-Seine, Sorbonne Université, Paris, France
| | - Felipe Muñoz-Guzman
- Departamento de Biología, Facultad de Química y Biología, Universidad de Santiago de Chile, Santiago, Chile
- Millennium Institute for Integrative Biology (iBio), Santiago, Chile
| | - Jose Benavides-Parra
- Departamento de Biología, Facultad de Química y Biología, Universidad de Santiago de Chile, Santiago, Chile
| | - Kami Urbina
- Departamento de Biología, Facultad de Química y Biología, Universidad de Santiago de Chile, Santiago, Chile
- Millennium Institute for Integrative Biology (iBio), Santiago, Chile
- Millenium Nucleus of Patagonian Limit of Life (LiLi), Santiago, Chile
| | - Tomas A. Peña
- Departamento de Biología, Facultad de Química y Biología, Universidad de Santiago de Chile, Santiago, Chile
- Millennium Institute for Integrative Biology (iBio), Santiago, Chile
| | - Mark Solomon
- The Australian Wine Research Institute, Glen Osmond, Adelaide, SA, Australia
| | - Roberto F. Nespolo
- Millennium Institute for Integrative Biology (iBio), Santiago, Chile
- Millenium Nucleus of Patagonian Limit of Life (LiLi), Santiago, Chile
- Instituto de Ciencias Ambientales y Evolutivas, Universidad Austral de Chile, Valdivia, Chile
- Center of Applied Ecology and Sustainability (CAPES), Facultad de Ciencias Biológicas, Universidad Católica de Chile, Santiago, Chile
| | - Gilles Fischer
- Laboratory of Computational and Quantitative Biology, CNRS, Institut de Biologie Paris-Seine, Sorbonne Université, Paris, France
| | - Cristian Varela
- The Australian Wine Research Institute, Glen Osmond, Adelaide, SA, Australia
- School of Agriculture, Food and Wine, University of Adelaide, Glen Osmond, Adelaide, SA, Australia
| | - Francisco A. Cubillos
- Departamento de Biología, Facultad de Química y Biología, Universidad de Santiago de Chile, Santiago, Chile
- Millennium Institute for Integrative Biology (iBio), Santiago, Chile
- Millenium Nucleus of Patagonian Limit of Life (LiLi), Santiago, Chile
| |
|
11
|
Zhang DQ, Liu XY, Qiu LF, Liu ZR, Yang YP, Huang L, Wang SY, Zhang JQ. Two chromosome-level genome assemblies of Rhodiola shed new light on genome evolution in rapid radiation and evolution of the biosynthetic pathway of salidroside. Plant J 2024; 117:464-482. [PMID: 37872890 DOI: 10.1111/tpj.16501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/20/2023] [Revised: 09/29/2023] [Accepted: 10/04/2023] [Indexed: 10/25/2023]
Abstract
Rhodiola L. is a genus that underwent rapid radiation in the mid-Miocene and may represent a typical case of adaptive radiation. Many species of Rhodiola have also been widely used as important adaptogens in traditional medicines for centuries. However, a lack of high-quality chromosome-level genomes hinders in-depth study of its evolution and of the biosynthetic pathways of its secondary metabolites. Here, we assembled two chromosome-level genomes for two Rhodiola species with different chromosome numbers and sexual systems. The assembled genome sizes of R. chrysanthemifolia (2n = 14; hermaphrodite) and R. kirilowii (2n = 22; dioecious) were 402.67 and 653.62 Mb, respectively, comprising approximately 57.60% and 69.22% transposable elements (TEs). The size difference between the two genomes was mostly due to proliferation of long terminal repeat-retrotransposons (LTR-RTs) in the R. kirilowii genome. Comparative genomic analysis revealed possible gene families responsible for high-altitude adaptation of Rhodiola, including a homolog of the plant cysteine oxidase 2 gene of Arabidopsis thaliana (AtPCO2), which is part of the core molecular reaction to hypoxia and contributes to the stability of Group VII ethylene response factors (ERF-VII). We found extensive chromosome fusion/fission events and structural variations between the two genomes, which might have facilitated the initial rapid radiation of Rhodiola. We also identified candidate genes in the biosynthetic pathway of salidroside. Overall, our results provide important insights into genome evolution in plant rapid radiations and the possible roles that chromosome fusion/fission and structural variation played in rapid speciation.
Affiliation(s)
- Dan-Qing Zhang
- National Engineering Laboratory for Resource Development of Endangered Crude Drugs in Northwest China, College of Life Sciences, Shaanxi Normal University, Xi'an, 710119, China
- Key Laboratory of Medicinal Plant Resource and Natural Pharmaceutical Chemistry of Ministry of Education, Shaanxi Normal University, Xi'an, 710119, China
| | - Xiao-Ying Liu
- National Engineering Laboratory for Resource Development of Endangered Crude Drugs in Northwest China, College of Life Sciences, Shaanxi Normal University, Xi'an, 710119, China
- Key Laboratory of Medicinal Plant Resource and Natural Pharmaceutical Chemistry of Ministry of Education, Shaanxi Normal University, Xi'an, 710119, China
| | - Lin-Feng Qiu
- National Engineering Laboratory for Resource Development of Endangered Crude Drugs in Northwest China, College of Life Sciences, Shaanxi Normal University, Xi'an, 710119, China
- Key Laboratory of Medicinal Plant Resource and Natural Pharmaceutical Chemistry of Ministry of Education, Shaanxi Normal University, Xi'an, 710119, China
| | - Zhao-Rui Liu
- National Engineering Laboratory for Resource Development of Endangered Crude Drugs in Northwest China, College of Life Sciences, Shaanxi Normal University, Xi'an, 710119, China
- Key Laboratory of Medicinal Plant Resource and Natural Pharmaceutical Chemistry of Ministry of Education, Shaanxi Normal University, Xi'an, 710119, China
| | - Ya-Peng Yang
- National Engineering Laboratory for Resource Development of Endangered Crude Drugs in Northwest China, College of Life Sciences, Shaanxi Normal University, Xi'an, 710119, China
- Key Laboratory of Medicinal Plant Resource and Natural Pharmaceutical Chemistry of Ministry of Education, Shaanxi Normal University, Xi'an, 710119, China
| | - Long Huang
- National Engineering Laboratory for Resource Development of Endangered Crude Drugs in Northwest China, College of Life Sciences, Shaanxi Normal University, Xi'an, 710119, China
- Key Laboratory of Medicinal Plant Resource and Natural Pharmaceutical Chemistry of Ministry of Education, Shaanxi Normal University, Xi'an, 710119, China
| | - Shi-Yu Wang
- National Engineering Laboratory for Resource Development of Endangered Crude Drugs in Northwest China, College of Life Sciences, Shaanxi Normal University, Xi'an, 710119, China
- Key Laboratory of Medicinal Plant Resource and Natural Pharmaceutical Chemistry of Ministry of Education, Shaanxi Normal University, Xi'an, 710119, China
| | - Jian-Qiang Zhang
- National Engineering Laboratory for Resource Development of Endangered Crude Drugs in Northwest China, College of Life Sciences, Shaanxi Normal University, Xi'an, 710119, China
- Key Laboratory of Medicinal Plant Resource and Natural Pharmaceutical Chemistry of Ministry of Education, Shaanxi Normal University, Xi'an, 710119, China
| |
|
12
|
Zhou C, Zheng X, Peng K, Feng K, Yue B, Wu Y. Chromosome-level genome assembly of the kiang (Equus kiang) illuminates genomic basis for its high-altitude adaptation. Integr Zool 2023. [PMID: 38151756 DOI: 10.1111/1749-4877.12795] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/29/2023]
Abstract
The kiang (Equus kiang) can only be observed in the Qinghai-Tibet Plateau (QTP). The kiang displays excellent athletic performance in the high-altitude environment, which has attracted wide interest in the potential mechanisms of its adaptation to the extreme environment. Here, we assembled a chromosome-level genome of the kiang based on Hi-C sequencing technology. A total of 324.14 Gb of clean data were generated, and a chromosome-level genome with 26 chromosomes (25 + X) and a scaffold N50 of 101.77 Mb was obtained for the kiang. The genomic synteny analysis revealed large-scale chromosomal rearrangement during the evolution of Equus species. Phylogenetic and divergence analyses revealed that the kiang was the sister branch to the ass and diverged from a common ancestor at approximately 13.5 Mya. The expanded gene families were mainly related to the hypoxia response, metabolism, and immunity. The kiang suffered a significant loss of olfaction-related genes, which might indicate decreased olfactory sensitivity. Positively selected genes (PSGs) detected in the kiang were mainly associated with hypoxia response. In particular, there were two species-specific missense amino acid mutations in the PSG STAT3, annotated in the hypoxia-inducible factor 1 signaling pathway, which may play an important role in the high-altitude adaptation of the kiang. Moreover, structural variations in the kiang genome were also identified, which possibly contributed to the high-altitude adaptation of the kiang. Comparative analysis revealed many species-specific insertions and deletions in the kiang genome, such as PIK3CB and AKT with 3258 and 189 bp insertions in the intron region, respectively, possibly affecting the expression and regulation of hypoxia-related downstream pathways. This study provides valuable genomic resources, and our findings improve understanding of the underlying adaptive strategies to the high-altitude environment in the kiang.
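The abstract reports a scaffold N50, a standard contiguity statistic: the length of the shortest scaffold in the smallest set of longest scaffolds that together cover at least half of the assembly. A minimal sketch of that calculation follows; the scaffold lengths in the example are made up and unrelated to the kiang assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold (or contig) lengths."""
    total = sum(lengths)
    running = 0
    # Walk from the longest scaffold down until half of the total is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    raise ValueError("lengths must be non-empty")


# Hypothetical scaffold lengths in Mb: total 290, so half is 145.
example = [80, 70, 50, 40, 30, 20]
example_n50 = n50(example)  # 80 + 70 = 150 first reaches half the total
```

For this toy input the cumulative sum reaches half the assembly at the second-longest scaffold, so the N50 is 70.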
Affiliation(s)
- Chuang Zhou
- Key Laboratory of Bioresources and Ecoenvironment (Ministry of Education), College of Life Sciences, Sichuan University, Chengdu, China
| | - Xiaofeng Zheng
- Key Laboratory of Bioresources and Ecoenvironment (Ministry of Education), College of Life Sciences, Sichuan University, Chengdu, China
| | - Kexin Peng
- Key Laboratory of Bioresources and Ecoenvironment (Ministry of Education), College of Life Sciences, Sichuan University, Chengdu, China
| | - Kaize Feng
- Key Laboratory of Bioresources and Ecoenvironment (Ministry of Education), College of Life Sciences, Sichuan University, Chengdu, China
| | - Bisong Yue
- Key Laboratory of Bioresources and Ecoenvironment (Ministry of Education), College of Life Sciences, Sichuan University, Chengdu, China
| | - Yongjie Wu
- Key Laboratory of Bioresources and Ecoenvironment (Ministry of Education), College of Life Sciences, Sichuan University, Chengdu, China
| |
|
13
|
Charron P, Gao R, Chmara J, Hoover E, Nadin-Davis S, Chauvin D, Hazelwood J, Makondo K, Duceppe MO, Kang M. Influence of genomic variations on glanders serodiagnostic antigens using integrative genomic and transcriptomic approaches. Front Vet Sci 2023; 10:1217135. [PMID: 38125681 PMCID: PMC10730941 DOI: 10.3389/fvets.2023.1217135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/04/2023] [Accepted: 11/20/2023] [Indexed: 12/23/2023] Open
Abstract
Glanders is a highly contagious and life-threatening zoonotic disease caused by Burkholderia mallei (B. mallei). Without an effective vaccine or treatment, early diagnosis has been regarded as the most effective method to prevent glanders transmission. Currently, the diagnosis of glanders is heavily reliant on serological tests. However, given that markedly different host immune responses can be elicited by genetically different strains of the same bacterial species, infection by B. mallei, whose genome is unstable and plastic, may result in various immune responses. This variability can make the serodiagnosis of glanders challenging. Therefore, there is a need for a comprehensive understanding and assessment of how B. mallei genomic variations impact the appropriateness of specific target antigens for glanders serodiagnosis. In this study, we investigated how genomic variations in the B. mallei genome affect gene content (gene presence/absence) and expression, with a special focus on antigens used or potentially used in serodiagnosis. In all the genome sequences of B. mallei isolates available in NCBI's RefSeq database (accessed in July 2023) and in-house sequenced samples, extensive small and large variations were observed when compared to the type strain ATCC 23344. Further pan-genome analysis of those assemblies revealed variations of gene content among all available genomes of B. mallei. Specifically, differences in gene content ranging from 31 to 715 genes with an average of 334 gene presence-absence variations were found in strains with complete or chromosome-level genome assemblies, using the ATCC 23344 strain as a reference. The affected genes included some encoded proteins used as serodiagnostic antigens, which were lost due mainly to structural variations. Additionally, a transcriptomic analysis was performed using the type strain ATCC 23344 and strain Zagreb which has been widely utilized to produce glanders antigens. 
In total, 388 significant differentially expressed genes were identified between these two strains, including genes related to bacterial pathogenesis and virulence, some of which were associated with genomic variations, particularly structural variations. To our knowledge, this is the first comprehensive study to uncover the impacts of genetic variations of B. mallei on its gene content and expression. These differences would have significant impacts on host innate and adaptive immunity, including antibody production, during infection. This study provides novel insights into B. mallei genetic variants, knowledge which will help to improve glanders serodiagnosis.
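The gene presence-absence counts described in this abstract can be pictured as set differences between a reference gene set and each strain's gene set. The sketch below is one way to count them, assuming genes have already been clustered into shared identifiers; the gene IDs and strain contents are hypothetical, and this is not the pan-genome pipeline used in the study.

```python
def presence_absence_variations(reference_genes, strain_genes):
    """Count genes present in only one of the two gene sets.

    Returns the genes missing from the strain, the genes missing from the
    reference, and the total number of presence-absence differences.
    """
    reference = set(reference_genes)
    strain = set(strain_genes)
    absent_in_strain = reference - strain
    absent_in_reference = strain - reference
    return {
        "absent_in_strain": absent_in_strain,
        "absent_in_reference": absent_in_reference,
        "total": len(absent_in_strain) + len(absent_in_reference),
    }


# Hypothetical gene cluster IDs; not real B. mallei loci.
reference = ["g1", "g2", "g3", "g4"]
strain = ["g1", "g3", "g5"]
pav = presence_absence_variations(reference, strain)
```

Here the strain lacks `g2` and `g4` and carries `g5`, giving three presence-absence differences against the reference.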
Affiliation(s)
| | - Mingsong Kang
- Ottawa Laboratory-Fallowfield, Canadian Food Inspection Agency, Ottawa, ON, Canada
| |
|
14
|
Chen S, Wang P, Kong W, Chai K, Zhang S, Yu J, Wang Y, Jiang M, Lei W, Chen X, Wang W, Gao Y, Qu S, Wang F, Wang Y, Zhang Q, Gu M, Fang K, Ma C, Sun W, Ye N, Wu H, Zhang X. Gene mining and genomics-assisted breeding empowered by the pangenome of tea plant Camellia sinensis. Nat Plants 2023; 9:1986-1999. [PMID: 38012346 DOI: 10.1038/s41477-023-01565-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Received: 04/24/2023] [Accepted: 10/20/2023] [Indexed: 11/29/2023]
Abstract
Tea is one of the world's oldest crops and is cultivated to produce beverages with various flavours. Despite advances in sequencing technologies, the genetic mechanisms underlying key agronomic traits of tea remain unclear. In this study, we present a high-quality pangenome of 22 elite cultivars, representing broad genetic diversity in the species. Our analysis reveals that a recent long terminal repeat burst contributed nearly 20% of gene copies, introducing functional genetic variants that affect phenotypes such as leaf colour. Our graphical pangenome improves the efficiency of genome-wide association studies and allows the identification of key genes controlling bud flush timing. We also identified strong correlations between allelic variants and flavour-related chemistries. These findings deepen our understanding of the genetic basis of tea quality and provide valuable genomic resources to facilitate its genomics-assisted breeding.
Affiliation(s)
- Shuai Chen
- National Key Laboratory for Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Pengjie Wang
- National Key Laboratory for Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Weilong Kong
- National Key Laboratory for Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Kun Chai
- Center for Genomics and Biotechnology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Key Laboratory of Genetics, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Shengcheng Zhang
- National Key Laboratory for Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Jiaxin Yu
- National Key Laboratory for Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Yibin Wang
- National Key Laboratory for Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Mengwei Jiang
- Center for Genomics and Biotechnology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Key Laboratory of Genetics, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Wenlong Lei
- Center for Genomics and Biotechnology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Key Laboratory of Genetics, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Xiao Chen
- National Key Laboratory for Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Wenling Wang
- Center for Genomics and Biotechnology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Key Laboratory of Genetics, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Yingying Gao
- National Key Laboratory for Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Shenyang Qu
- National Key Laboratory for Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Fang Wang
- National Key Laboratory for Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Yinghao Wang
- Center for Genomics and Biotechnology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Key Laboratory of Genetics, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Qing Zhang
- National Key Laboratory for Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Mengya Gu
- College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Kaixing Fang
- Tea Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Provincial Key Laboratory of Tea Plant Resources Innovation and Utilization, Guangzhou, China
| | - Chunlei Ma
- Key Laboratory of Biology, Genetics and Breeding of Special Economic Animals and Plants, Ministry of Agriculture and Rural Affairs, Tea Research Institute of the Chinese Academy of Agricultural Sciences, Hangzhou, China
| | - Weijiang Sun
- College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Naixing Ye
- College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Hualing Wu
- Tea Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Provincial Key Laboratory of Tea Plant Resources Innovation and Utilization, Guangzhou, China
| | - Xingtan Zhang
- National Key Laboratory for Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| |
|
15
|
Liu J, Ye SY, Xu XD, Liu Q, Ma F, Yu X, Luo YH, Chen LL, Zeng X. Multiomics analysis reveals the genetic and metabolic characteristics associated with the low prevalence of dental caries. J Oral Microbiol 2023; 15:2277271. [PMID: 37928602 PMCID: PMC10623897 DOI: 10.1080/20002297.2023.2277271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/18/2023] [Accepted: 10/25/2023] [Indexed: 11/07/2023] Open
Abstract
Background: Despite poor oral hygiene, the Baiku Yao (BKY) ethnic group in China presents a low prevalence of dental caries, which may be related to genetic susceptibility. Because of a strict intra-ethnic marriage rule, this ethnic group is well suited to studying the interaction between genetic factors and other regulatory factors related to dental caries. Methods: Peripheral blood from a caries-free adult male was used for whole genome sequencing, and the BKY assembled genome was compared to the Han Chinese genome. Oral saliva samples were collected from 51 subjects for metabolomic and metagenomic analysis. Multiomics data were integrated for combined analysis using bioinformatics approaches. Results: Comparative genomic analysis revealed the presence of structural variations in several genes associated with dental caries. Metabolomic and metagenomic sequencing demonstrated that the caries-free group had significantly higher concentrations of antimicrobials and a higher abundance of core oral health-related microbiota. The functional analysis indicated that cationic antimicrobial peptide resistance and the lipopolysaccharide biosynthesis pathway were enriched in the caries-free group. Conclusions: Our study provides new insights into the specific regulatory mechanisms that contribute to the low prevalence of dental caries in this population and may provide new evidence for the genetic diagnosis and control of dental caries.
Affiliation(s)
- Jinshen Liu
- College of Stomatology, Hospital of Stomatology, Guangxi Medical University, Nanning, China
| | - Si-Ying Ye
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, College of Life Science and Technology, Guangxi University, Nanning, China
| | - Xin-Dong Xu
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, College of Life Science and Technology, Guangxi University, Nanning, China
| | - Qiulin Liu
- College of Stomatology, Hospital of Stomatology, Guangxi Medical University, Nanning, China
| | - Fei Ma
- College of Stomatology, Hospital of Stomatology, Guangxi Medical University, Nanning, China
| | - Xueting Yu
- College of Stomatology, Hospital of Stomatology, Guangxi Medical University, Nanning, China
| | - Yu-Hong Luo
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, College of Life Science and Technology, Guangxi University, Nanning, China
| | - Ling-Ling Chen
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, College of Life Science and Technology, Guangxi University, Nanning, China
| | - Xiaojuan Zeng
- College of Stomatology, Hospital of Stomatology, Guangxi Medical University, Nanning, China
| |
|
16
|
Li J, Stenberg S, Yue JX, Mikhalev E, Thompson D, Warringer J, Liti G. Genome instability footprint under rapamycin and hydroxyurea treatments. PLoS Genet 2023; 19:e1011012. [PMID: 37931001 PMCID: PMC10653606 DOI: 10.1371/journal.pgen.1011012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/04/2023] [Revised: 11/16/2023] [Accepted: 10/10/2023] [Indexed: 11/08/2023] Open
Abstract
The mutational processes dictating the accumulation of mutations in genomes are shaped by genetic background, environment and their interactions. Accurate quantification of mutation rates and spectra under drugs has important implications in disease treatment. Here, we used whole-genome sequencing and time-resolved growth phenotyping of yeast mutation accumulation lines to give a detailed view of the mutagenic effects of rapamycin and hydroxyurea on the genome and cell growth. Mutation rates depended on the genetic backgrounds but were only marginally affected by rapamycin. As a remarkable exception, rapamycin treatment was associated with frequent chromosome XII amplifications, which compensated for rapamycin-induced rDNA repeat contraction on this chromosome and served to maintain rDNA content homeostasis and fitness. In hydroxyurea, a wide range of mutation rates were elevated regardless of the genetic backgrounds, with a particularly high occurrence of aneuploidy that was associated with dramatic fitness loss. Hydroxyurea also induced a high T-to-G and low C-to-A transversion rate that reversed the common G/C-to-A/T bias in yeast and gave rise to a broad range of structural variants, including mtDNA deletions. The hydroxyurea mutation footprint was consistent with the activation of error-prone DNA polymerase activities and non-homologous end joining repair pathways. Taken together, our study provides an in-depth view of mutation rates and signatures in rapamycin and hydroxyurea and their impact on cell fitness, which brings insights for assessing their chronic effects on genome integrity.
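For readers unfamiliar with the substitution classes named in this abstract, the sketch below classifies single-base substitutions as transitions (purine to purine, or pyrimidine to pyrimidine) or transversions (purine to pyrimidine, or the reverse) and tallies a simple spectrum. The list of mutations is invented for illustration and does not come from the study.

```python
from collections import Counter

PURINES = {"A", "G"}
BASES = {"A", "C", "G", "T"}


def substitution_class(ref, alt):
    """Return 'transition' or 'transversion' for a single-base substitution."""
    if ref not in BASES or alt not in BASES or ref == alt:
        raise ValueError(f"not a valid substitution: {ref}>{alt}")
    # Same chemical class on both sides means a transition.
    same_class = (ref in PURINES) == (alt in PURINES)
    return "transition" if same_class else "transversion"


def mutation_spectrum(mutations):
    """Tally substitutions as 'ref>alt' strings."""
    return Counter(f"{ref}>{alt}" for ref, alt in mutations)


# Invented substitutions for illustration only.
observed = [("T", "G"), ("T", "G"), ("C", "A"), ("C", "T"), ("A", "G")]
spectrum = mutation_spectrum(observed)
classes = Counter(substitution_class(ref, alt) for ref, alt in observed)
```

Both T-to-G and C-to-A swap a pyrimidine for a purine, so they count as transversions, whereas C-to-T and A-to-G are transitions.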
Affiliation(s)
- Jing Li
- State Key Laboratory of Oncology in South China, Guangdong Provincial Clinical Research Center for Cancer, Sun Yat-sen University Cancer Center, Guangzhou, P. R. China
- Université Côte d’Azur, CNRS, INSERM, IRCAN, Nice, France
| | - Simon Stenberg
- Department of Chemistry and Molecular Biology, University of Gothenburg, Gothenburg, Sweden
| | - Jia-Xing Yue
- State Key Laboratory of Oncology in South China, Guangdong Provincial Clinical Research Center for Cancer, Sun Yat-sen University Cancer Center, Guangzhou, P. R. China
- Université Côte d’Azur, CNRS, INSERM, IRCAN, Nice, France
| | | | - Dawn Thompson
- Ginkgo Bioworks, Boston, Massachusetts, United States of America
| | - Jonas Warringer
- Department of Chemistry and Molecular Biology, University of Gothenburg, Gothenburg, Sweden
| | - Gianni Liti
- Université Côte d’Azur, CNRS, INSERM, IRCAN, Nice, France
|
17
|
Li Y, Cao J, Wang J. MetaSVs: A pipeline combining long and short reads for analysis and visualization of structural variants in metagenomes. IMETA 2023; 2:e139. [PMID: 38868213 PMCID: PMC10989790 DOI: 10.1002/imt2.139] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 08/30/2023] [Accepted: 09/25/2023] [Indexed: 06/14/2024]
Abstract
Structural variants (SVs, including large-scale insertions, deletions, inversions, and translocations) significantly impact the functions of genes in the microbial genome, and SVs in the microbiome are associated with diverse biological processes and human diseases. With advances in sequencing and bioinformatics technologies, sequencing data and analysis tools are increasingly used for microbiome SV analyses, leading to a higher demand for dedicated SV analysis workflows. Moreover, due to the unique detection biases of various sequencing technologies, including short-read sequencing (such as Illumina platforms) and long-read sequencing (e.g., Oxford Nanopore and PacBio), SV discovery based on multiple platforms is necessary to comprehensively identify the wide variety of SVs. Here, we establish an integrated pipeline, MetaSVs, combining Nanopore long reads and Illumina short reads to analyze SVs in microbial genomes from the gut microbiome and further identify differential SVs that can be reflective of metabolic differences. Our pipeline provides researchers easy access to SVs and relevant metabolites in the microbial genomes without the requirement of specific technical expertise, which is particularly useful to researchers interested in metagenomic SVs but lacking sophisticated bioinformatic knowledge.
Affiliation(s)
- Yuejuan Li
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jiabao Cao
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Jun Wang
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
|
18
|
Xue JY, Fan HY, Zeng Z, Zhou YH, Hu SY, Li SX, Cheng YJ, Meng XR, Chen F, Shao ZQ, Van de Peer Y. Comprehensive regulatory networks for tomato organ development based on the genome and RNAome of MicroTom tomato. HORTICULTURE RESEARCH 2023; 10:uhad147. [PMID: 37691964 PMCID: PMC10483172 DOI: 10.1093/hr/uhad147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/03/2023] [Accepted: 07/15/2023] [Indexed: 09/12/2023]
Abstract
MicroTom has a short growth cycle and high transformation efficiency, and is a prospective model plant for studying organ development, metabolism, and plant-microbe interactions. Here, with a newly assembled reference genome for this tomato cultivar and abundant RNA-seq data derived from tissues of different organs/developmental stages/treatments, we constructed multiple gene co-expression networks, which will provide valuable clues for the identification of important genes involved in diverse regulatory pathways during plant growth, e.g. arbuscular mycorrhizal symbiosis and fruit development. Additionally, non-coding RNAs, including miRNAs, lncRNAs, and circRNAs were also identified, together with their potential targets. Interacting networks between different types of non-coding RNAs (miRNA-lncRNA), and non-coding RNAs and genes (miRNA-mRNA and lncRNA-mRNA) were constructed as well. Our results and data will provide valuable information for the study of organ differentiation and development of this important fruit. Lastly, we established a database (http://eplant.njau.edu.cn/microTomBase/) with genomic and transcriptomic data, as well as details of gene co-expression and interacting networks on MicroTom, and this database should be of great value to those who want to adopt MicroTom as a model plant for research.
Affiliation(s)
- Jia-Yu Xue
- College of Horticulture, Academy for Advanced Interdisciplinary Studies, Nanjing Agricultural University, Nanjing 210095, China
| | - Hai-Yun Fan
- College of Horticulture, Academy for Advanced Interdisciplinary Studies, Nanjing Agricultural University, Nanjing 210095, China
| | - Zhen Zeng
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing 210023, China
| | - Yu-Han Zhou
- College of Horticulture, Academy for Advanced Interdisciplinary Studies, Nanjing Agricultural University, Nanjing 210095, China
| | - Shuai-Ya Hu
- College of Horticulture, Academy for Advanced Interdisciplinary Studies, Nanjing Agricultural University, Nanjing 210095, China
| | - Sai-Xi Li
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing 210023, China
| | - Ying-Juan Cheng
- College of Horticulture, Academy for Advanced Interdisciplinary Studies, Nanjing Agricultural University, Nanjing 210095, China
| | - Xiang-Ru Meng
- College of Horticulture, Academy for Advanced Interdisciplinary Studies, Nanjing Agricultural University, Nanjing 210095, China
| | - Fei Chen
- College of Tropical Crops, Sanya Nanfan Research Institute, Hainan University, Haikou 570228, China
- Hainan Yazhou Bay Seed Laboratory, Sanya 572025, China
| | - Zhu-Qing Shao
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing 210023, China
| | - Yves Van de Peer
- College of Horticulture, Academy for Advanced Interdisciplinary Studies, Nanjing Agricultural University, Nanjing 210095, China
- Department of Plant Biotechnology and Bioinformatics, VIB-UGent Center for Plant Systems Biology, Ghent University, B-9052 Ghent, Belgium
- Department of Biochemistry, Genetics and Microbiology, University of Pretoria, Pretoria 0028, South Africa
|
19
|
Chaux F, Agier N, Garrido C, Fischer G, Eberhard S, Xu Z. Telomerase-independent survival leads to a mosaic of complex subtelomere rearrangements in Chlamydomonas reinhardtii. Genome Res 2023; 33:1582-1598. [PMID: 37580131 PMCID: PMC10620057 DOI: 10.1101/gr.278043.123] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 04/27/2023] [Accepted: 08/09/2023] [Indexed: 08/16/2023]
Abstract
Telomeres and subtelomeres, the genomic regions located at chromosome extremities, are essential for genome stability in eukaryotes. In the absence of the canonical maintenance mechanism provided by telomerase, telomere shortening induces genome instability. The landscape of the ensuing genome rearrangements is not accessible by short-read sequencing. Here, we leverage Oxford Nanopore Technologies long-read sequencing to survey the extensive repertoire of genome rearrangements in telomerase mutants of the model green microalga Chlamydomonas reinhardtii. In telomerase-mutant strains grown for hundreds of generations, most chromosome extremities were capped by short telomere sequences that were either recruited de novo from other loci or maintained in a telomerase-independent manner. Other extremities did not end with telomeres but only with repeated subtelomeric sequences. The subtelomeric elements, including rDNA, were massively rearranged and involved in breakage-fusion-bridge cycles, translocations, recombinations, and chromosome circularization. These events were established progressively over time and displayed heterogeneity at the subpopulation level. New telomere-capped extremities composed of sequences originating from more internal genomic regions were associated with high DNA methylation, suggesting that de novo heterochromatin formation contributes to the restoration of chromosome end stability in C. reinhardtii. The diversity of alternative strategies present in the same organism to maintain chromosome integrity and the variety of rearrangements found in telomerase mutants are remarkable, and illustrate genome plasticity at short timescales.
Affiliation(s)
- Frédéric Chaux
- Sorbonne Université, CNRS, UMR7238, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, 75005 Paris, France
| | - Nicolas Agier
- Sorbonne Université, CNRS, UMR7238, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, 75005 Paris, France
| | - Clotilde Garrido
- Sorbonne Université, CNRS, UMR7238, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, 75005 Paris, France
| | - Gilles Fischer
- Sorbonne Université, CNRS, UMR7238, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, 75005 Paris, France
| | - Stephan Eberhard
- Sorbonne Université, CNRS, UMR7141, Institut de Biologie Physico-Chimique, Laboratory of Chloroplast Biology and Light-Sensing in Microalgae, 75005 Paris, France
| | - Zhou Xu
- Sorbonne Université, CNRS, UMR7238, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, 75005 Paris, France
|
20
|
Tsouris A, Brach G, Friedrich A, Hou J, Schacherer J. Diallel panel reveals a significant impact of low-frequency genetic variants on gene expression variation in yeast. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.21.550015. [PMID: 37503053 PMCID: PMC10370210 DOI: 10.1101/2023.07.21.550015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 07/29/2023]
Abstract
Unraveling the genetic sources of gene expression variation is essential to better understand the origins of phenotypic diversity in natural populations. Genome-wide association studies have identified thousands of variants involved in gene expression variation; however, the variants detected explain only part of the heritability. In fact, variants such as low-frequency and structural variants (SVs) are poorly captured in association studies. To assess the impact of these variants on gene expression variation, we explored a half-diallel panel composed of 323 hybrids originating from pairwise crosses of 26 natural Saccharomyces cerevisiae isolates. Using short- and long-read sequencing strategies, we established an exhaustive catalog of single nucleotide polymorphisms (SNPs) and SVs for this panel. Combining this dataset with the transcriptomes of all hybrids, we comprehensively mapped SNPs and SVs associated with gene expression variation. While SVs impact gene expression variation, SNPs exhibit a higher effect size with an overrepresentation of low-frequency variants compared to common ones. These results reinforce the importance of dissecting the heritability of complex traits with a comprehensive catalog of genetic variants at the population level.
Affiliation(s)
- Andreas Tsouris
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Gauthier Brach
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Anne Friedrich
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Jing Hou
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
| | - Joseph Schacherer
- Université de Strasbourg, CNRS, GMGM UMR 7156, Strasbourg, France
- Institut Universitaire de France (IUF), Paris, France
|
21
|
Lee D, Fox B, Palomino D, Panda O, Tenjo F, Koury E, Evans K, Stevens L, Rodrigues P, Kolodziej A, Schroeder F, Andersen E. Natural genetic variation in the pheromone production of C. elegans. Proc Natl Acad Sci U S A 2023; 120:e2221150120. [PMID: 37339205 PMCID: PMC10293855 DOI: 10.1073/pnas.2221150120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/15/2022] [Accepted: 05/10/2023] [Indexed: 06/22/2023] Open
Abstract
From bacterial quorum sensing to human language, communication is essential for social interactions. Nematodes produce and sense pheromones to communicate among individuals and respond to environmental changes. These signals are encoded by different types and mixtures of ascarosides, whose modular structures further enhance the diversity of this nematode pheromone language. Interspecific and intraspecific differences in this ascaroside pheromone language have been described previously, but the genetic basis and molecular mechanisms underlying the variation remain largely unknown. Here, we analyzed natural variation in the production of 44 ascarosides across 95 wild Caenorhabditis elegans strains using high-performance liquid chromatography coupled to high-resolution mass spectrometry. We discovered wild strains defective in the production of specific subsets of ascarosides (e.g., the aggregation pheromone icas#9) or short- and medium-chain ascarosides, as well as inversely correlated patterns between the production of two major classes of ascarosides. We investigated genetic variants that are significantly associated with the natural differences in the composition of the pheromone bouquet, including rare genetic variants in key enzymes participating in ascaroside biosynthesis, such as the peroxisomal 3-ketoacyl-CoA thiolase, daf-22, and the carboxylesterase cest-3. Genome-wide association mappings revealed genomic loci harboring common variants that affect ascaroside profiles. Our study yields a valuable dataset for investigating the genetic mechanisms underlying the evolution of chemical communication.
Affiliation(s)
- Daehan Lee
- Department of Molecular Biosciences, Northwestern University, Evanston, IL 60208
- Department of Biology, Kyung Hee University, Seoul 02447, Republic of Korea
- Department of Biological Sciences, Sungkyunkwan University, Suwon 16419, Republic of Korea
| | - Bennett W. Fox
- Boyce Thompson Institute, Cornell University, Ithaca, NY 14850
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14850
| | - Diana Fajardo Palomino
- Boyce Thompson Institute, Cornell University, Ithaca, NY 14850
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14850
| | - Oishika Panda
- Boyce Thompson Institute, Cornell University, Ithaca, NY 14850
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14850
| | - Francisco J. Tenjo
- Boyce Thompson Institute, Cornell University, Ithaca, NY 14850
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14850
| | - Emily J. Koury
- Department of Molecular Biosciences, Northwestern University, Evanston, IL 60208
| | - Kathryn S. Evans
- Department of Molecular Biosciences, Northwestern University, Evanston, IL 60208
| | - Lewis Stevens
- Department of Molecular Biosciences, Northwestern University, Evanston, IL 60208
- Tree of Life, Wellcome Sanger Institute, Cambridge CB10 1SA, United Kingdom
| | - Pedro R. Rodrigues
- Boyce Thompson Institute, Cornell University, Ithaca, NY 14850
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14850
| | - Aiden R. Kolodziej
- Boyce Thompson Institute, Cornell University, Ithaca, NY 14850
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14850
| | - Frank C. Schroeder
- Boyce Thompson Institute, Cornell University, Ithaca, NY 14850
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14850
| | - Erik C. Andersen
- Department of Molecular Biosciences, Northwestern University, Evanston, IL 60208
|
22
|
Weller CA, Andreev I, Chambers MJ, Park M, Bloom JS, Sadhu MJ. Highly complete long-read genomes reveal pangenomic variation underlying yeast phenotypic diversity. Genome Res 2023; 33:729-740. [PMID: 37127330 PMCID: PMC10317115 DOI: 10.1101/gr.277515.122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/16/2022] [Accepted: 04/26/2023] [Indexed: 05/03/2023]
Abstract
Understanding the genetic causes of trait variation is a primary goal of genetic research. One way that individuals can vary genetically is through variable pangenomic genes: genes that are only present in some individuals in a population. The presence or absence of entire genes could have large effects on trait variation. However, variable pangenomic genes can be missed in standard genotyping workflows, owing to reliance on aligning short-read sequencing to reference genomes. A popular method for studying the genetic basis of trait variation is linkage mapping, which identifies quantitative trait loci (QTLs), regions of the genome that harbor causative genetic variants. Large-scale linkage mapping in the budding yeast Saccharomyces cerevisiae has found thousands of QTLs affecting myriad yeast phenotypes. To enable the resolution of QTLs caused by variable pangenomic genes, we used long-read sequencing to generate highly complete de novo genome assemblies of 16 diverse yeast isolates. With these assemblies, we resolved QTLs for growth on maltose, sucrose, raffinose, and oxidative stress to specific genes that are absent from the reference genome but present in the broader yeast population at appreciable frequency. Copies of genes also duplicate onto chromosomes where they are absent in the reference genome, and we found that these copies generate additional QTLs whose resolution requires pangenome characterization. Our findings show the need for highly complete genome assemblies to identify the genetic basis of trait variation.
Affiliation(s)
- Cory A Weller
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Ilya Andreev
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Michael J Chambers
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Morgan Park
- NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Joshua S Bloom
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, California 90095, USA
- Department of Biological Chemistry, University of California, Los Angeles, Los Angeles, California 90095, USA
- Howard Hughes Medical Institute, University of California, Los Angeles, Los Angeles, California 90095, USA
- Institute for Quantitative and Computational Biology, University of California, Los Angeles, Los Angeles, California 90095, USA
- Department of Computational Medicine, University of California, Los Angeles, Los Angeles, California 90095, USA
| | - Meru J Sadhu
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
|
23
|
Wang J, Hu H, Liang X, Tahir ul Qamar M, Zhang Y, Zhao J, Ren H, Yan X, Ding B, Guo J. High-quality genome assembly and comparative genomic profiling of yellowhorn (Xanthoceras sorbifolia) revealed environmental adaptation footprints and seed oil contents variations. FRONTIERS IN PLANT SCIENCE 2023; 14:1147946. [PMID: 37025151 PMCID: PMC10070836 DOI: 10.3389/fpls.2023.1147946] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 01/19/2023] [Accepted: 03/06/2023] [Indexed: 05/31/2023]
Abstract
Yellowhorn (Xanthoceras sorbifolia) is a species of deciduous tree that is native to Northern and Central China, including the Loess Plateau. The yellowhorn tree is a hardy plant, tolerating a wide range of growing conditions, and is often grown for ornamental purposes in parks, gardens, and other landscaped areas. The seeds of yellowhorn are edible and contain rich oil and fatty acid contents, making it an ideal plant for oil production. However, the mechanism of its ability to adapt to extreme environments and the genetic basis of oil synthesis remain to be elucidated. In this study, we reported a high-quality and near gap-less yellowhorn genome assembly, containing the highest genome continuity with a contig N50 of 32.5 Mb. Comparative genomics analysis showed that 1,237 and 231 gene families under expansion and the yellowhorn-specific gene family NB-ARC were enriched in photosynthesis and root cap development, which may contribute to the environmental adaptation and abiotic stress resistance of yellowhorn. A 3-ketoacyl-CoA thiolase (KAT) gene (Xso_LG02_00600) was identified under positive selection, which may be associated with variations of seed oil content among different yellowhorn cultivars. This study provided insights into environmental adaptation and seed oil content variations of yellowhorn to accelerate its genetic improvement.
Affiliation(s)
- Juan Wang
- College of Forestry, Shanxi Agricultural University, Taigu, Shanxi, China
- Shanxi Key Laboratory of Functional Oil Tree Cultivation and Research, Shanxi Agricultural University, Taigu, Shanxi, China
| | - Haifei Hu
- Rice Research Institute, Guangdong Key Laboratory of New Technology in Rice Breeding, Guangzhou, China
- Guangdong Rice Engineering Laboratory, Guangdong Academy of Agricultural Sciences, Guangzhou, China
| | - Xizhen Liang
- College of Forestry, Shanxi Agricultural University, Taigu, Shanxi, China
- Shanxi Key Laboratory of Functional Oil Tree Cultivation and Research, Shanxi Agricultural University, Taigu, Shanxi, China
| | - Muhammad Tahir ul Qamar
- Integrative Omics and Molecular Modeling Laboratory, Department of Bioinformatics and Biotechnology, Government College University Faisalabad (GCUF), Faisalabad, Pakistan
| | - Yunxiang Zhang
- College of Forestry, Shanxi Agricultural University, Taigu, Shanxi, China
- Shanxi Key Laboratory of Functional Oil Tree Cultivation and Research, Shanxi Agricultural University, Taigu, Shanxi, China
| | - Jianguo Zhao
- Engineering Research Center of Coalbased Ecological Carbon Sequestration Technology of the Ministry of Education, Datong University, Taigu, Shanxi, China
| | - Hongqian Ren
- College of Forestry, Shanxi Agricultural University, Taigu, Shanxi, China
- Shanxi Key Laboratory of Functional Oil Tree Cultivation and Research, Shanxi Agricultural University, Taigu, Shanxi, China
| | - Xingrong Yan
- College of Forestry, Shanxi Agricultural University, Taigu, Shanxi, China
- Shanxi Key Laboratory of Functional Oil Tree Cultivation and Research, Shanxi Agricultural University, Taigu, Shanxi, China
| | - Baopeng Ding
- College of Forestry, Shanxi Agricultural University, Taigu, Shanxi, China
- Engineering Research Center of Coalbased Ecological Carbon Sequestration Technology of the Ministry of Education, Datong University, Taigu, Shanxi, China
| | - Jinping Guo
- College of Forestry, Shanxi Agricultural University, Taigu, Shanxi, China
- Shanxi Key Laboratory of Functional Oil Tree Cultivation and Research, Shanxi Agricultural University, Taigu, Shanxi, China
|
24
|
Craig RJ, Gallaher SD, Shu S, Salomé PA, Jenkins JW, Blaby-Haas CE, Purvine SO, O’Donnell S, Barry K, Grimwood J, Strenkert D, Kropat J, Daum C, Yoshinaga Y, Goodstein DM, Vallon O, Schmutz J, Merchant SS. The Chlamydomonas Genome Project, version 6: Reference assemblies for mating-type plus and minus strains reveal extensive structural mutation in the laboratory. THE PLANT CELL 2023; 35:644-672. [PMID: 36562730 PMCID: PMC9940879 DOI: 10.1093/plcell/koac347] [Citation(s) in RCA: 27] [Impact Index Per Article: 27.0] [Received: 12/02/2022] [Revised: 10/12/2022] [Accepted: 12/16/2022] [Indexed: 05/20/2023]
Abstract
Five versions of the Chlamydomonas reinhardtii reference genome have been produced over the last two decades. Here we present version 6, bringing significant advances in assembly quality and structural annotations. PacBio-based chromosome-level assemblies for two laboratory strains, CC-503 and CC-4532, provide resources for the plus and minus mating-type alleles. We corrected major misassemblies in previous versions and validated our assemblies via linkage analyses. Contiguity increased over ten-fold and >80% of filled gaps are within genes. We used Iso-Seq and deep RNA-seq datasets to improve structural annotations, and updated gene symbols and textual annotation of functionally characterized genes via extensive manual curation. We discovered that the cell wall-less classical reference strain CC-503 exhibits genomic instability potentially caused by deletion of the helicase RECQ3, with major structural mutations identified that affect >100 genes. We therefore present the CC-4532 assembly as the primary reference, although this strain also carries unique structural mutations and is experiencing rapid proliferation of a Gypsy retrotransposon. We expect all laboratory strains to harbor gene-disrupting mutations, which should be considered when interpreting and comparing experimental results. Collectively, the resources presented here herald a new era of Chlamydomonas genomics and will provide the foundation for continued research in this important reference organism.
Affiliation(s)
- Rory J Craig
- California Institute for Quantitative Biosciences, University of California, Berkeley, California 94720, USA
- Institute of Ecology and Evolution, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3FL, UK
| | - Sean D Gallaher
- California Institute for Quantitative Biosciences, University of California, Berkeley, California 94720, USA
| | - Shengqiang Shu
- United States Department of Energy, Joint Genome Institute, Berkeley, California 94720, USA
| | - Patrice A Salomé
- Department of Chemistry and Biochemistry, University of California, Los Angeles, California 90095, USA
- Institute for Genomics and Proteomics, University of California, Los Angeles, California 90095, USA
| | - Jerry W Jenkins
- HudsonAlpha Genome Sequencing Center, HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA
| | - Crysten E Blaby-Haas
- The Molecular Foundry, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Samuel O Purvine
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Samuel O’Donnell
- Laboratory of Computational and Quantitative Biology, UMR 7238, CNRS, Institut de Biologie Paris-Seine, Sorbonne Université, Paris 75005, France
| | - Kerrie Barry
- United States Department of Energy, Joint Genome Institute, Berkeley, California 94720, USA
| | - Jane Grimwood
- HudsonAlpha Genome Sequencing Center, HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA
| | - Daniela Strenkert
- California Institute for Quantitative Biosciences, University of California, Berkeley, California 94720, USA
| | - Janette Kropat
- Department of Chemistry and Biochemistry, University of California, Los Angeles, California 90095, USA
| | - Chris Daum
- United States Department of Energy, Joint Genome Institute, Berkeley, California 94720, USA
| | - Yuko Yoshinaga
- United States Department of Energy, Joint Genome Institute, Berkeley, California 94720, USA
| | - David M Goodstein
- United States Department of Energy, Joint Genome Institute, Berkeley, California 94720, USA
| | - Olivier Vallon
- Unité Mixte de Recherche 7141, CNRS, Institut de Biologie Physico-Chimique, Sorbonne Université, Paris 75005, France
| | - Jeremy Schmutz
- United States Department of Energy, Joint Genome Institute, Berkeley, California 94720, USA
- HudsonAlpha Genome Sequencing Center, HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA
| | - Sabeeha S Merchant
- California Institute for Quantitative Biosciences, University of California, Berkeley, California 94720, USA
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
- Department of Plant and Microbial Biology, University of California, Berkeley, California 94720, USA
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
|
25
|
Shi J, Tian Z, Lai J, Huang X. Plant pan-genomics and its applications. MOLECULAR PLANT 2023; 16:168-186. [PMID: 36523157 DOI: 10.1016/j.molp.2022.12.009] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Received: 09/22/2022] [Revised: 12/07/2022] [Accepted: 12/12/2022] [Indexed: 06/17/2023]
Abstract
Plant genomes are so highly diverse that a substantial proportion of genomic sequences are not shared among individuals. The variable DNA sequences, along with the conserved core sequences, compose the more sophisticated pan-genome that represents the collection of all non-redundant DNA in a species. With rapid progress in genome sequencing technologies, pan-genome research in plants is now accelerating. Here we review recent advances in plant pan-genomics, including major driving forces of structural variations that constitute the variable sequences, methodological innovations for representing the pan-genome, and major successes in constructing plant pan-genomes. We also summarize recent efforts toward decoding the remaining dark matter in telomere-to-telomere or gapless plant genomes. These new genome resources, which have remarkable advantages over numerous previously assembled less-than-perfect genomes, are expected to become new references for genetic studies and plant breeding.
Affiliation(s)
- Junpeng Shi
- State Key Laboratory of Biocontrol, School of Agriculture, Sun Yat-sen University, Shenzhen 518107, China
| | - Zhixi Tian
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China
| | - Jinsheng Lai
- State Key Laboratory of Plant Physiology and Biochemistry and National Maize Improvement Center, Department of Plant Genetics and Breeding, China Agricultural University, Beijing 100193, China
| | - Xuehui Huang
- Shanghai Key Laboratory of Plant Molecular Sciences, College of Life Sciences, Shanghai Normal University, Shanghai 200234, China
|
26
|
López-Cortegano E, Craig RJ, Chebib J, Balogun EJ, Keightley PD. Rates and spectra of de novo structural mutations in Chlamydomonas reinhardtii. Genome Res 2023; 33:45-60. [PMID: 36617667 PMCID: PMC9977147 DOI: 10.1101/gr.276957.122] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Received: 05/23/2022] [Accepted: 12/06/2022] [Indexed: 12/14/2022]
Abstract
Genetic variation originates from several types of spontaneous mutation, including single-nucleotide substitutions, short insertions and deletions (indels), and larger structural changes. Structural mutations (SMs) drive genome evolution and are thought to play major roles in evolutionary adaptation, speciation, and genetic disease, including cancers. Sequencing of mutation accumulation (MA) lines has provided estimates of rates and spectra of single-nucleotide and indel mutations in many species, yet the rate of new SMs is largely unknown. Here, we use long-read sequencing to determine the full mutation spectrum in MA lines derived from two strains (CC-1952 and CC-2931) of the green alga Chlamydomonas reinhardtii. The SM rate is highly variable between strains and between MA lines, and SMs represent a substantial proportion of all mutations in both strains (CC-1952 6%; CC-2931 12%). The SM spectra differ considerably between the two strains, with almost all inversions and translocations occurring in CC-2931 MA lines. This variation is associated with heterogeneity in the number and type of active transposable elements (TEs), which comprise major proportions of SMs in both strains (CC-1952 22%; CC-2931 38%). In CC-2931, a Crypton and a previously undescribed type of DNA element have caused 71% of chromosomal rearrangements, whereas in CC-1952, a Dualen LINE is associated with 87% of duplications. Other SMs, notably large duplications in CC-2931, are likely products of various double-strand break repair pathways. Our results show that diverse types of SMs occur at substantial rates, and support prominent roles for SMs and TEs in evolution.
Affiliation(s)
- Eugenio López-Cortegano
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
| | - Rory J Craig
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
- California Institute for Quantitative Biosciences, UC Berkeley, Berkeley, California 94720, USA
| | - Jobran Chebib
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
| | - Eniolaye J Balogun
- Department of Ecology and Evolutionary Biology, University of Toronto, Ontario ON M5S 3B2, Canada
- Department of Biology, University of Toronto Mississauga, Mississauga ON L5L 1C6, Canada
| | - Peter D Keightley
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
|
27
|
Lesack K, Mariene GM, Andersen EC, Wasmuth JD. Different structural variant prediction tools yield considerably different results in Caenorhabditis elegans. PLoS One 2022; 17:e0278424. [PMID: 36584177 PMCID: PMC9803319 DOI: 10.1371/journal.pone.0278424] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/15/2022] [Accepted: 11/15/2022] [Indexed: 01/01/2023] Open
Abstract
The accurate characterization of structural variation is crucial for our understanding of how large chromosomal alterations affect phenotypic differences and contribute to genome evolution. Whole-genome sequencing is a popular approach for identifying structural variants, but the accuracy of popular tools remains unclear due to the limitations of existing benchmarks. Moreover, the performance of these tools for predicting variants in non-human genomes is less certain, as most tools were developed and benchmarked using data from the human genome. To evaluate the use of long-read data for the validation of short-read structural variant calls, the agreement between predictions from a short-read ensemble learning method and long-read tools was compared using real and simulated data from Caenorhabditis elegans. The results obtained from simulated data indicate that the best performing tool is contingent on the type and size of the variant, as well as the sequencing depth of coverage. These results also highlight the need for reference datasets generated from real data that can be used as 'ground truth' in benchmarks.
Affiliation(s)
- Kyle Lesack
- Faculty of Veterinary Medicine, University of Calgary, Alberta, Canada
- Host-Parasite Interactions Research Training Network, University of Calgary, Alberta, Canada
| | - Grace M. Mariene
- Faculty of Veterinary Medicine, University of Calgary, Alberta, Canada
- Host-Parasite Interactions Research Training Network, University of Calgary, Alberta, Canada
| | - Erik C. Andersen
- Department of Molecular Biosciences, Northwestern University, Evanston, IL, United States of America
| | - James D. Wasmuth
- Faculty of Veterinary Medicine, University of Calgary, Alberta, Canada
- Host-Parasite Interactions Research Training Network, University of Calgary, Alberta, Canada
|
28
|
Chen L, Zhao N, Cao J, Liu X, Xu J, Ma Y, Yu Y, Zhang X, Zhang W, Guan X, Yu X, Liu Z, Fan Y, Wang Y, Liang F, Wang D, Zhao L, Song M, Wang J. Short- and long-read metagenomics expand individualized structural variations in gut microbiomes. Nat Commun 2022; 13:3175. [PMID: 35676264 PMCID: PMC9177567 DOI: 10.1038/s41467-022-30857-9] [Citation(s) in RCA: 34] [Impact Index Per Article: 17.0] [Received: 01/11/2022] [Accepted: 05/18/2022] [Indexed: 01/04/2023] Open
Abstract
In-depth profiling of genetic variations in the gut microbiome is highly desired for understanding its functionality and impacts on host health and disease. Here, by harnessing the long-read advantage provided by Oxford Nanopore Technology (ONT), we characterize fine-scale structural variations (SVs) in hundreds of gut microbiomes from healthy humans. ONT long reads dramatically improve the quality of metagenomic assemblies and enable reliable detection of a large, expanded set of structural variation types (notably including large insertions and inversions). We find SVs are highly distinct between individuals and stable within an individual, representing gut microbiome fingerprints that shape strain-level differentiations in function within species, complicating the associations to metabolites and host phenotypes such as blood glucose. In summary, our study strongly emphasizes that incorporating ONT reads into metagenomic analyses expands the detection scope of genetic variations and enables profiling of strain-level variations in the gut microbiome and their intricate correlations with the metabolome.
Affiliation(s)
- Liang Chen
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Na Zhao
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Jiabao Cao
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Xiaolin Liu
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jiayue Xu
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Yue Ma
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Ying Yu
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Xuan Zhang
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Wenhui Zhang
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Xiangyu Guan
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Xiaotong Yu
- Guang'anmen Hospital, China Academy of Chinese Medical Sciences, Beijing, China
| | | | | | - Yang Wang
- GrandOmics Biosciences, Beijing, China
| | - Fan Liang
- GrandOmics Biosciences, Beijing, China
| | | | - Linhua Zhao
- Guang'anmen Hospital, China Academy of Chinese Medical Sciences, Beijing, China
| | - Moshi Song
- University of Chinese Academy of Sciences, Beijing, China.
- State Key Laboratory of Membrane Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.
- Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing, China.
- Beijing Institute for Stem Cell and Regenerative Medicine, 100101, Beijing, China.
| | - Jun Wang
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China.
- University of Chinese Academy of Sciences, Beijing, China.
|
29
|
Li M, Xu X, Liu S, Fan G, Zhou Q, Chen S. The chromosome-level genome assembly of the Japanese yellowtail jack Seriola aureovittata provides insights into genome evolution and efficient oxygen transport. Mol Ecol Resour 2022; 22:2701-2712. [PMID: 35593537 DOI: 10.1111/1755-0998.13648] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 09/24/2021] [Revised: 04/16/2022] [Accepted: 05/11/2022] [Indexed: 11/27/2022]
Abstract
Fishes of the genus Seriola are widely farmed and highly valued in global aquaculture production. To further understand their economically important traits and help improve aquaculture product quality and sustainability, we performed a chromosome-level genome construction for Seriola aureovittata. Combining two technologies, PacBio and BGISEQ-500, we assembled 649.86 Mb of S. aureovittata genome sequence with a contig N50 of 22.21 Mb, and 98% of BUSCO genes were detected in total. The initial assembly was then further scaffolded into 24 pseudochromosomes using Hi-C data, indicating the high quality of the genome. Genome evolution analysis showed that many genes related to fatty acid metabolism and oxygen binding or transport were expanded, which provided insights into the metabolic characteristics of fatty acids and efficient oxygen transport. Based on the genome data, we confirmed the evolutionary relationship of S. aureovittata, S. dorsalis and S. lalandi and identified chr12 as the putative sex chromosome of S. aureovittata. Our chromosome-level genome assembly provides a genetic foundation for the phylogenetic and taxonomic investigation of different Seriola species. Moreover, the genome will provide an important genomic resource for further biological and aquaculture studies of S. aureovittata.
Affiliation(s)
- Ming Li
- Yellow Sea Fisheries Research Institute, CAFS, Laboratory for Marine Fisheries Science and Food Production Processes, Pilot National Laboratory for Marine Science and Technology (Qingdao), Qingdao, China; Shandong Provincial Key Laboratory of Marine Fishery Biotechnology and Genetic Breeding, Qingdao, China
| | - Xiwen Xu
- Yellow Sea Fisheries Research Institute, CAFS, Laboratory for Marine Fisheries Science and Food Production Processes, Pilot National Laboratory for Marine Science and Technology (Qingdao), Qingdao, China; Shandong Provincial Key Laboratory of Marine Fishery Biotechnology and Genetic Breeding, Qingdao, China
| | | | | | - Qian Zhou
- Yellow Sea Fisheries Research Institute, CAFS, Laboratory for Marine Fisheries Science and Food Production Processes, Pilot National Laboratory for Marine Science and Technology (Qingdao), Qingdao, China; Shandong Provincial Key Laboratory of Marine Fishery Biotechnology and Genetic Breeding, Qingdao, China
| | - Songlin Chen
- Yellow Sea Fisheries Research Institute, CAFS, Laboratory for Marine Fisheries Science and Food Production Processes, Pilot National Laboratory for Marine Science and Technology (Qingdao), Qingdao, China; Shandong Provincial Key Laboratory of Marine Fishery Biotechnology and Genetic Breeding, Qingdao, China
|
30
|
Goel M, Schneeberger K. plotsr: visualizing structural similarities and rearrangements between multiple genomes. Bioinformatics 2022; 38:2922-2926. [PMID: 35561173 PMCID: PMC9113368 DOI: 10.1093/bioinformatics/btac196] [Citation(s) in RCA: 61] [Impact Index Per Article: 30.5] [Received: 01/21/2022] [Revised: 03/15/2022] [Accepted: 04/11/2022] [Indexed: 02/03/2023] Open
Abstract
SUMMARY: Third-generation genome sequencing technologies have led to a sharp increase in the number of high-quality genome assemblies. This allows the comparison of multiple assembled genomes of individual species and demands new tools for visualizing their structural properties. Here, we present plotsr, an efficient tool to visualize structural similarities and rearrangements between genomes. It can be used to compare genomes on chromosome level or to zoom in on any selected region. In addition, plotsr can augment the visualization with regional identifiers (e.g. genes or genomic markers) or histogram tracks for continuous features (e.g. GC content or polymorphism density). AVAILABILITY AND IMPLEMENTATION: plotsr is implemented as a Python package and uses the standard matplotlib library for plotting. It is freely available under the MIT license at GitHub (https://github.com/schneebergerlab/plotsr) and bioconda (https://anaconda.org/bioconda/plotsr). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Affiliation(s)
- Manish Goel
- Faculty of Biology, LMU Munich, Planegg-Martinsried 82152, Germany
- Department of Genetics, Faculty of Biology, LMU Munich, Germany
| | - Korbinian Schneeberger
- Faculty of Biology, LMU Munich, Planegg-Martinsried 82152, Germany
- Department of Genetics, Faculty of Biology, LMU Munich, Germany
|
31
|
A Saccharomyces eubayanus haploid resource for research studies. Sci Rep 2022; 12:5976. [PMID: 35396494 PMCID: PMC8993842 DOI: 10.1038/s41598-022-10048-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 02/18/2022] [Accepted: 04/01/2022] [Indexed: 12/16/2022] Open
Abstract
Since its identification, Saccharomyces eubayanus has been recognized as the missing parent of the lager hybrid, S. pastorianus. This wild yeast has never been isolated from fermentation environments, thus representing an interesting candidate for evolutionary, ecological and genetic studies. However, it is imperative to develop additional molecular genetics tools to ease manipulation and thus facilitate future studies. With this in mind, we generated a collection of stable haploid strains representative of three main lineages described in S. eubayanus (PB-1, PB-2 and PB-3), by deleting the HO gene using CRISPR-Cas9 and tetrad micromanipulation. Phenotypic characterization under different conditions demonstrated that the haploid derivatives were extremely similar to their parental strains. Genomic analysis in three strains highlighted a likely low frequency of off-targets, and sequencing of a single tetrad revealed no structural variants in any of the haploid spores. Finally, we demonstrate the utilization of the haploid set by challenging the strains under mass-mating conditions. In this way, we found that S. eubayanus under liquid conditions has a preference to remain in a haploid state, unlike S. cerevisiae, which mates rapidly. This haploid resource is a novel set of strains for future yeast molecular genetics studies.
|
32
|
Nie S, Wang B, Ding H, Lin H, Zhang L, Li Q, Wang Y, Zhang B, Liang A, Zheng Q, Wang H, Lv H, Zhu K, Jia M, Wang X, Du J, Zhao R, Jiang Z, Xia C, Qiao Z, Li X, Liu B, Zhu H, An R, Li Y, Jiang Q, Chen B, Zhang H, Wang D, Tang C, Yuan Y, Dai J, Zhan J, He W, Wang X, Shi J, Wang B, Gong M, He X, Li P, Huang L, Li H, Pan C, Huang H, Yuan G, Lan H, Nie Y, Li X, Zhao X, Zhang X, Pan G, Wu Q, Xu F, Zhang Z. Genome assembly of the Chinese maize elite inbred line RP125 and its EMS mutant collection provide new resources for maize genetics research and crop improvement. The Plant Journal: for Cell and Molecular Biology 2021; 108:40-54. [PMID: 34252236 DOI: 10.1111/tpj.15421] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Received: 12/15/2020] [Accepted: 07/06/2021] [Indexed: 06/13/2023]
Abstract
Maize is an important crop worldwide, as well as a valuable model with vast genetic diversity. Accurate genome and annotation information for a wide range of inbred lines would provide valuable resources for crop improvement and pan-genome characterization. In this study, we generated a high-quality de novo genome assembly (contig N50 of 15.43 Mb) of the Chinese elite inbred line RP125 using Nanopore long-read sequencing and Hi-C scaffolding, which yielded highly contiguous, chromosome-length scaffolds. Global comparison of the RP125 genome with those of B73, W22, and Mo17 revealed a large number of structural variations. To create new germplasm for maize research and crop improvement, we carried out an EMS mutagenesis screen on RP125. In total, we obtained 5818 independent M2 families, with 946 mutants showing heritable phenotypes. Taking advantage of the high-quality RP125 genome, we successfully cloned 10 mutants from the EMS library, including the novel kernel mutant qk1 (quekou: "missing a small part" in Chinese), which exhibited partial loss of endosperm and a starch accumulation defect. QK1 encodes a predicted metal tolerance protein, which is specifically required for Fe transport. Increased accumulation of Fe and reactive oxygen species as well as ferroptosis-like cell death were detected in qk1 endosperm. Our study provides the community with a high-quality genome sequence and a large collection of mutant germplasm.
Affiliation(s)
- Shujun Nie
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Taian, 271018, China
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Bo Wang
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 11724, USA
| | - Haiping Ding
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Taian, 271018, China
| | - Haijian Lin
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Li Zhang
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Taian, 271018, China
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Qigui Li
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Taian, 271018, China
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Yujiao Wang
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Bin Zhang
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Anping Liang
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Qi Zheng
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
- The Key Laboratory of Plant Development and Environmental Adaption Biology, Ministry of Education, School of Life Sciences, Shandong University, Qingdao, 266237, China
| | - Hui Wang
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Huayang Lv
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Kun Zhu
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Taian, 271018, China
| | - Minghui Jia
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Taian, 271018, China
| | - Xiaotong Wang
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Taian, 271018, China
| | - Jiyuan Du
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Taian, 271018, China
| | - Runtai Zhao
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Taian, 271018, China
| | - Zhenzhen Jiang
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Taian, 271018, China
| | - Caina Xia
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Taian, 271018, China
| | - Zhenghao Qiao
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Taian, 271018, China
| | - Xiaohu Li
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Taian, 271018, China
| | - Boyan Liu
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Taian, 271018, China
| | - Hongbo Zhu
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Taian, 271018, China
| | - Rong An
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Yucui Li
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Qian Jiang
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Benfang Chen
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Hongkai Zhang
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Dening Wang
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Changxiao Tang
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Yang Yuan
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Jie Dai
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Jing Zhan
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Weiqiang He
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Xuebo Wang
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Jian Shi
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Bin Wang
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Min Gong
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Xiujing He
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Peng Li
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Li Huang
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Hui Li
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Chao Pan
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Hong Huang
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Guangsheng Yuan
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Hai Lan
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Yongxin Nie
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Taian, 271018, China
| | - Xinzheng Li
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Taian, 271018, China
| | - Xiangyu Zhao
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Taian, 271018, China
| | - Xiansheng Zhang
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Taian, 271018, China
| | - Guangtang Pan
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
| | - Qingyu Wu
- Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Fang Xu
- The Key Laboratory of Plant Development and Environmental Adaption Biology, Ministry of Education, School of Life Sciences, Shandong University, Qingdao, 266237, China
| | - Zhiming Zhang
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Taian, 271018, China
- Maize Research Institute, Sichuan Agricultural University, ChengDu, 611130, China
|
33
|
Genomic structural variants constrain and facilitate adaptation in natural populations of Theobroma cacao, the chocolate tree. Proc Natl Acad Sci U S A 2021; 118:2102914118. [PMID: 34408075 DOI: 10.1073/pnas.2102914118] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Indexed: 11/18/2022] Open
Abstract
Genomic structural variants (SVs) can play important roles in adaptation and speciation. Yet the overall fitness effects of SVs are poorly understood, partly because accurate population-level identification of SVs requires multiple high-quality genome assemblies. Here, we use 31 chromosome-scale, haplotype-resolved genome assemblies of Theobroma cacao (an outcrossing, long-lived tree species that is the source of chocolate) to investigate the fitness consequences of SVs in natural populations. Among the 31 accessions, we find over 160,000 SVs, which together cover eight times more of the genome than single-nucleotide polymorphisms and short indels (125 versus 15 Mb). Our results indicate that a vast majority of these SVs are deleterious: they segregate at low frequencies and are depleted from functional regions of the genome. We show that SVs influence gene expression, which likely impairs gene function and contributes to the detrimental effects of SVs. We also provide empirical support for a theoretical prediction that SVs, particularly inversions, increase genetic load through the accumulation of deleterious nucleotide variants as a result of suppressed recombination. Despite the overall detrimental effects, we identify individual SVs bearing signatures of local adaptation, several of which are associated with genes differentially expressed between populations. Genes involved in pathogen resistance are strongly enriched among these candidates, highlighting the contribution of SVs to this important local adaptation trait. Beyond revealing empirical evidence for the evolutionary importance of SVs, these 31 de novo assemblies provide a valuable resource for genetic and breeding studies in T. cacao.
|
34
|
Mardones W, Villarroel CA, Abarca V, Urbina K, Peña TA, Molinet J, Nespolo RF, Cubillos FA. Rapid selection response to ethanol in Saccharomyces eubayanus emulates the domestication process under brewing conditions. Microb Biotechnol 2021; 15:967-984. [PMID: 33755311 PMCID: PMC8913853 DOI: 10.1111/1751-7915.13803] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 12/15/2020] [Revised: 03/01/2021] [Accepted: 03/07/2021] [Indexed: 01/02/2023] Open
Abstract
Although the typical genomic and phenotypic changes that characterize the evolution of organisms under the human domestication syndrome represent textbook examples of rapid evolution, the molecular processes that underpin such changes are still poorly understood. Domesticated yeasts for brewing, where short generation times and large phenotypic and genomic plasticity were attained in a few generations under selection, are prime examples. To experimentally emulate the lager yeast domestication process, we created a genetically complex (panmictic) artificial population of multiple Saccharomyces eubayanus genotypes, one of the parents of lager yeast. Then, we imposed a constant selection regime under a high ethanol concentration in 10 replicated populations during 260 generations (6 months) and compared them with propagated controls exposed solely to glucose. Propagated populations exhibited a selection differential of 60% in growth rate in ethanol, mostly explained by the proliferation of a single lineage (CL248.1) that competitively displaced all other clones. Interestingly, the outcome does not require the entire time‐course of adaptation, as four lineages monopolized the culture at generation 120. Sequencing demonstrated that de novo genetic variants were produced in all propagated lines, including SNPs, aneuploidies, INDELs and translocations. In addition, the different propagated populations showed correlated responses resembling the domestication syndrome: genomic rearrangements, faster fermentation rates, lower production of phenolic off‐flavours and lower volatile compound complexity. Expression profiling in beer wort revealed altered expression levels of genes related to methionine metabolism, flocculation, stress tolerance and diauxic shift, likely contributing to higher ethanol and fermentation stress tolerance in the evolved populations. 
Our study shows that experimental evolution can rebuild the brewing domestication process in ‘fast motion’ in wild yeast, and also provides a powerful tool for studying the genetics of the adaptation process in complex populations.
Affiliation(s)
- Wladimir Mardones
- Facultad de Química y Biología, Departamento de Biología, Universidad de Santiago de Chile, Santiago, 9170022, Chile; Millennium Institute for Integrative Biology (iBio), ANID - Millennium Science Initiative Program, Santiago, 7500574, Chile
| | - Carlos A Villarroel
- Facultad de Química y Biología, Departamento de Biología, Universidad de Santiago de Chile, Santiago, 9170022, Chile; Millennium Institute for Integrative Biology (iBio), ANID - Millennium Science Initiative Program, Santiago, 7500574, Chile
| | - Valentina Abarca
- Facultad de Química y Biología, Departamento de Biología, Universidad de Santiago de Chile, Santiago, 9170022, Chile; Millennium Institute for Integrative Biology (iBio), ANID - Millennium Science Initiative Program, Santiago, 7500574, Chile
| | - Kamila Urbina
- Facultad de Química y Biología, Departamento de Biología, Universidad de Santiago de Chile, Santiago, 9170022, Chile; Millennium Institute for Integrative Biology (iBio), ANID - Millennium Science Initiative Program, Santiago, 7500574, Chile
| | - Tomás A Peña
- Facultad de Química y Biología, Departamento de Biología, Universidad de Santiago de Chile, Santiago, 9170022, Chile; Millennium Institute for Integrative Biology (iBio), ANID - Millennium Science Initiative Program, Santiago, 7500574, Chile
| | - Jennifer Molinet
- Facultad de Química y Biología, Departamento de Biología, Universidad de Santiago de Chile, Santiago, 9170022, Chile; Millennium Institute for Integrative Biology (iBio), ANID - Millennium Science Initiative Program, Santiago, 7500574, Chile
| | - Roberto F Nespolo
- Millennium Institute for Integrative Biology (iBio), ANID - Millennium Science Initiative Program, Santiago, 7500574, Chile; Institute of Environmental and Evolutionary Science, Universidad Austral de Chile, Valdivia, 5110566, Chile; Center of Applied Ecology and Sustainability (CAPES), Pontificia Universidad Católica de Chile, Santiago, Chile
| | - Francisco A Cubillos
- Facultad de Química y Biología, Departamento de Biología, Universidad de Santiago de Chile, Santiago, 9170022, Chile; Millennium Institute for Integrative Biology (iBio), ANID - Millennium Science Initiative Program, Santiago, 7500574, Chile
| |
|
35
|
Bendixsen DP, Gettle N, Gilchrist C, Zhang Z, Stelkens R. Genomic Evidence of an Ancient East Asian Divergence Event in Wild Saccharomyces cerevisiae. Genome Biol Evol 2021; 13:6081032. [PMID: 33432360 PMCID: PMC7874999 DOI: 10.1093/gbe/evab001] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Received: 10/23/2020] [Revised: 12/21/2020] [Accepted: 01/05/2021] [Indexed: 12/12/2022] Open
Abstract
Comparative genome analyses have suggested East Asia to be the cradle of the domesticated microbe Brewer's yeast (Saccharomyces cerevisiae), used in the food and biotechnology industry worldwide. Here, we provide seven new, high-quality long-read genomes of nondomesticated yeast strains isolated from primeval forests and other natural environments in China and Taiwan. In a comprehensive analysis of our new genome assemblies, along with other long-read Saccharomycetes genomes available, we show that the newly sequenced East Asian strains are among the closest living relatives of the ancestors of the global diversity of Brewer's yeast, confirming predictions made from short-read genomic data. Three of these strains (termed the East Asian Clade IX Complex here) share a recent ancestry and evolutionary history suggesting an early divergence from other S. cerevisiae strains before the larger radiation of the species, and prior to its domestication. Our genomic analyses reveal that the wild East Asian strains contain elevated levels of structural variations. The new genomic resources provided here contribute to our understanding of the natural diversity of S. cerevisiae, expand the intraspecific genetic variation found in this heavily domesticated microbe, and provide a foundation for understanding its origin and global colonization history.
|