1
|
Chuong JN, Nun NB, Suresh I, Matthews JC, De T, Avecilla G, Abdul-Rahman F, Brandt N, Ram Y, Gresham D. Template switching during DNA replication is a prevalent source of adaptive gene amplification. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.03.589936. [PMID: 39464144 PMCID: PMC11507740 DOI: 10.1101/2024.05.03.589936] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 10/29/2024]
Abstract
Copy number variants (CNVs)-gains and losses of genomic sequences-are an important source of genetic variation underlying rapid adaptation and genome evolution. However, despite their central role in evolution little is known about the factors that contribute to the structure, size, formation rate, and fitness effects of adaptive CNVs. Local genomic sequences are likely to be an important determinant of these properties. Whereas it is known that point mutation rates vary with genomic location and local DNA sequence features, the role of genome architecture in the formation, selection, and the resulting evolutionary dynamics of CNVs is poorly understood. Previously, we have found that the GAP1 gene in Saccharomyces cerevisiae undergoes frequent and repeated amplification and selection under long-term experimental evolution in glutamine-limiting conditions. The GAP1 gene has a unique genomic architecture consisting of two flanking long terminal repeats (LTRs) and a proximate origin of DNA replication (autonomously replicating sequence, ARS), which are likely to promote rapid GAP1 CNV formation. To test the role of these genomic elements on CNV-mediated adaptive evolution, we performed experimental evolution in glutamine-limited chemostats using engineered strains lacking either the adjacent LTRs, ARS, or all elements. Using a CNV reporter system and neural network simulation-based inference (nnSBI) we quantified the formation rate and fitness effect of CNVs for each strain. We find that although GAP1 CNVs repeatedly form and sweep to high frequency in strains with modified genome architecture, removal of local DNA elements significantly impacts the rate and fitness effect of CNVs and the rate of adaptation. We performed genome sequence analysis to define the molecular mechanisms of CNV formation for 177 CNV lineages. We find that across all four strain backgrounds, between 26% and 80% of all GAP1 CNVs are mediated by Origin Dependent Inverted Repeat Amplification (ODIRA) which results from template switching between the leading and lagging strand during DNA synthesis. In the absence of the local ARS, a distal ARS can mediate CNV formation via ODIRA. In the absence of local LTRs, homologous recombination mechanisms still mediate gene amplification following de novo insertion of retrotransposon elements at the locus. Our study demonstrates the remarkable plasticity of the genome and reveals that template switching during DNA replication is a frequent source of adaptive CNVs.
Collapse
Affiliation(s)
- Julie N Chuong
- Department of Biology, Center for Genomics and Systems Biology, New York University
| | - Nadav Ben Nun
- School of Zoology, Faculty of Life Sciences, Tel Aviv University
- Edmond J. Safra Center for Bioinformatics, Tel Aviv University
| | - Ina Suresh
- Department of Biology, Center for Genomics and Systems Biology, New York University
| | - Julia Cano Matthews
- Department of Biology, Center for Genomics and Systems Biology, New York University
| | - Titir De
- Department of Biology, Center for Genomics and Systems Biology, New York University
| | | | - Farah Abdul-Rahman
- Department of Ecology and Evolutionary Biology, Yale University
- Microbial Sciences Institute, Yale University
| | - Nathan Brandt
- Department of Biological Sciences, North Carolina State University
| | - Yoav Ram
- School of Zoology, Faculty of Life Sciences, Tel Aviv University
- Edmond J. Safra Center for Bioinformatics, Tel Aviv University
| | - David Gresham
- Department of Biology, Center for Genomics and Systems Biology, New York University
- Correspondence:
| |
Collapse
|
2
|
Kohanovski I, Pontz M, Vande Zande P, Selmecki A, Dahan O, Pilpel Y, Yona AH, Ram Y. Aneuploidy Can Be an Evolutionary Diversion on the Path to Adaptation. Mol Biol Evol 2024; 41:msae052. [PMID: 38427813 PMCID: PMC10951435 DOI: 10.1093/molbev/msae052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Revised: 01/29/2024] [Accepted: 02/27/2024] [Indexed: 03/03/2024] Open
Abstract
Aneuploidy is common in eukaryotes, often leading to decreased fitness. However, evidence from fungi and human tumur cells suggests that specific aneuploidies can be beneficial under stressful conditions and facilitate adaptation. In a previous evolutionary experiment with yeast, populations evolving under heat stress became aneuploid, only to later revert to euploidy after beneficial mutations accumulated. It was therefore suggested that aneuploidy is a "stepping stone" on the path to adaptation. Here, we test this hypothesis. We use Bayesian inference to fit an evolutionary model with both aneuploidy and mutation to the experimental results. We then predict the genotype frequency dynamics during the experiment, demonstrating that most of the evolved euploid population likely did not descend from aneuploid cells, but rather from the euploid wild-type population. Our model shows how the beneficial mutation supply-the product of population size and beneficial mutation rate-determines the evolutionary dynamics: with low supply, much of the evolved population descends from aneuploid cells; but with high supply, beneficial mutations are generated fast enough to outcompete aneuploidy due to its inherent fitness cost. Our results suggest that despite its potential fitness benefits under stress, aneuploidy can be an evolutionary "diversion" rather than a "stepping stone": it can delay, rather than facilitate, the adaptation of the population, and cells that become aneuploid may leave less descendants compared to cells that remain diploid.
Collapse
Affiliation(s)
- Ilia Kohanovski
- School of Zoology, Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
- School of Computer Science, Reichman University, Herzliya, Israel
| | - Martin Pontz
- School of Zoology, Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Pétra Vande Zande
- Department of Microbiology and Immunology, University of Minnesota Medical School, Minneapolis, MN, USA
| | - Anna Selmecki
- Department of Microbiology and Immunology, University of Minnesota Medical School, Minneapolis, MN, USA
| | - Orna Dahan
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
| | - Yitzhak Pilpel
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
| | - Avihu H Yona
- Institute of Biochemistry, Food Science and Nutrition, Robert H. Smith Faculty of Agriculture, Food and Environment, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Yoav Ram
- School of Zoology, Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
3
|
Lambert S, Voznica J, Morlon H. Deep Learning from Phylogenies for Diversification Analyses. Syst Biol 2023; 72:1262-1279. [PMID: 37556735 DOI: 10.1093/sysbio/syad044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Revised: 06/20/2023] [Accepted: 08/08/2023] [Indexed: 08/11/2023] Open
Abstract
Birth-death (BD) models are widely used in combination with species phylogenies to study past diversification dynamics. Current inference approaches typically rely on likelihood-based methods. These methods are not generalizable, as a new likelihood formula must be established each time a new model is proposed; for some models, such a formula is not even tractable. Deep learning can bring solutions in such situations, as deep neural networks can be trained to learn the relation between simulations and parameter values as a regression problem. In this paper, we adapt a recently developed deep learning method from pathogen phylodynamics to the case of diversification inference, and we extend its applicability to the case of the inference of state-dependent diversification models from phylogenies associated with trait data. We demonstrate the accuracy and time efficiency of the approach for the time-constant homogeneous BD model and the Binary-State Speciation and Extinction model. Finally, we illustrate the use of the proposed inference machinery by reanalyzing a phylogeny of primates and their associated ecological role as seed dispersers. Deep learning inference provides at least the same accuracy as likelihood-based inference while being faster by several orders of magnitude, offering a promising new inference approach for the deployment of future models in the field.
Collapse
Affiliation(s)
- Sophia Lambert
- Institut de Biologie de l'École Normale Supérieure, École Normale Supérieure, CNRS, INSERM, Université Paris Sciences et Lettres, 46 Rue d'Ulm, 75005 Paris, France
- Institute of Ecology and Evolution, Department of Biology, 5289 University of Oregon, Eugene, OR 97403, USA
| | - Jakub Voznica
- Institut Pasteur, Université Paris Cité, Unité Bioinformatique Evolutive, 25-28 Rue du Dr Roux, 75015 Paris, France
- Unité de Biologie Computationnelle, USR 3756 CNRS, 25-28 Rue du Dr Roux, 75015 Paris, France
| | - Hélène Morlon
- Institut de Biologie de l'École Normale Supérieure, École Normale Supérieure, CNRS, INSERM, Université Paris Sciences et Lettres, 46 Rue d'Ulm, 75005 Paris, France
| |
Collapse
|
4
|
Selmecki A. Recurrent copy number variations in the human fungal pathogen Candida parapsilosis. mBio 2023; 14:e0071323. [PMID: 37787545 PMCID: PMC10653803 DOI: 10.1128/mbio.00713-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/04/2023] Open
Abstract
Candida parapsilosis is an opportunistic fungal pathogen with increasing incidence in hospital settings worldwide; however, we lack a comprehensive understanding of the mechanisms promoting its virulence and drug resistance. Bergin et al. systematically quantify the frequency and effect of copy number variation (CNV) across 170 diverse clinical and environmental isolates of C. parapsilosis (Bergin SA, Zhao F, Ryan AP, Müller CA, Nieduszynski CA, Zhai B, Rolling T, Hohl TM, Morio F, Scully J, Wolfe KH, Butler G, 2022, mBio, https://doi.org/10.1128/mbio.01777-22). Using a combination of both short- and long-read whole genome sequencing techniques, they determine the structure and copy number of two CNVs that arose recurrently throughout the evolution of these isolates. Each CNV predominantly amplifies one coding sequence (ARR3 or RTA3); however, the amplitude and recombination breakpoints are variable across the isolates. Amplification of RTA3 correlates with drug resistance and deletion causes drug susceptibility. This study highlights the need for further research into the mechanisms and dynamics of CNV formation and the impact of these CNVs on virulence and drug resistance across diverse fungal pathogens.
Collapse
Affiliation(s)
- Anna Selmecki
- Department of Microbiology and Immunology, University of Minnesota Medical School, Minneapolis, Minnesota, USA
| |
Collapse
|
5
|
Chen V, Johnson MS, Hérissant L, Humphrey PT, Yuan DC, Li Y, Agarwala A, Hoelscher SB, Petrov DA, Desai MM, Sherlock G. Evolution of haploid and diploid populations reveals common, strong, and variable pleiotropic effects in non-home environments. eLife 2023; 12:e92899. [PMID: 37861305 PMCID: PMC10629826 DOI: 10.7554/elife.92899] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 09/27/2023] [Indexed: 10/21/2023] Open
Abstract
Adaptation is driven by the selection for beneficial mutations that provide a fitness advantage in the specific environment in which a population is evolving. However, environments are rarely constant or predictable. When an organism well adapted to one environment finds itself in another, pleiotropic effects of mutations that made it well adapted to its former environment will affect its success. To better understand such pleiotropic effects, we evolved both haploid and diploid barcoded budding yeast populations in multiple environments, isolated adaptive clones, and then determined the fitness effects of adaptive mutations in 'non-home' environments in which they were not selected. We find that pleiotropy is common, with most adaptive evolved lineages showing fitness effects in non-home environments. Consistent with other studies, we find that these pleiotropic effects are unpredictable: they are beneficial in some environments and deleterious in others. However, we do find that lineages with adaptive mutations in the same genes tend to show similar pleiotropic effects. We also find that ploidy influences the observed adaptive mutational spectra in a condition-specific fashion. In some conditions, haploids and diploids are selected with adaptive mutations in identical genes, while in others they accumulate mutations in almost completely disjoint sets of genes.
Collapse
Affiliation(s)
- Vivian Chen
- Department of Biology, Stanford UniversityStanfordUnited States
| | - Milo S Johnson
- Department of Organismic and Evolutionary Biology, Harvard UniversityCambridgeUnited States
- Quantitative Biology Initiative, Harvard UniversityCambridgeUnited States
- NSF-Simons Center for Mathematical and Statistical Analysis of Biology, Harvard UniversityBostonUnited States
| | - Lucas Hérissant
- Department of Genetics, Stanford UniversityStanfordUnited States
| | - Parris T Humphrey
- Department of Organismic and Evolutionary Biology, Harvard UniversityCambridgeUnited States
| | - David C Yuan
- Department of Biology, Stanford UniversityStanfordUnited States
| | - Yuping Li
- Department of Biology, Stanford UniversityStanfordUnited States
| | - Atish Agarwala
- Department of Physics, Stanford UniversityStanfordUnited States
| | | | - Dmitri A Petrov
- Department of Biology, Stanford UniversityStanfordUnited States
| | - Michael M Desai
- Department of Organismic and Evolutionary Biology, Harvard UniversityCambridgeUnited States
- Quantitative Biology Initiative, Harvard UniversityCambridgeUnited States
- NSF-Simons Center for Mathematical and Statistical Analysis of Biology, Harvard UniversityBostonUnited States
- Department of Physics, Harvard UniversityCambridgeUnited States
| | - Gavin Sherlock
- Department of Genetics, Stanford UniversityStanfordUnited States
| |
Collapse
|
6
|
Vande Zande P, Zhou X, Selmecki A. The Dynamic Fungal Genome: Polyploidy, Aneuploidy and Copy Number Variation in Response to Stress. Annu Rev Microbiol 2023; 77:341-361. [PMID: 37307856 PMCID: PMC10599402 DOI: 10.1146/annurev-micro-041320-112443] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Fungal species have dynamic genomes and often exhibit genomic plasticity in response to stress. This genome plasticity often comes with phenotypic consequences that affect fitness and resistance to stress. Fungal pathogens exhibit genome plasticity in both clinical and agricultural settings and often during adaptation to antifungal drugs, posing significant challenges to human health. Therefore, it is important to understand the rates, mechanisms, and impact of large genomic changes. This review addresses the prevalence of polyploidy, aneuploidy, and copy number variation across diverse fungal species, with special attention to prominent fungal pathogens and model species. We also explore the relationship between environmental stress and rates of genomic changes and highlight the mechanisms underlying genotypic and phenotypic changes. A comprehensive understanding of these dynamic fungal genomes is needed to identify novel solutions for the increase in antifungal drug resistance.
Collapse
Affiliation(s)
- Pétra Vande Zande
- Department of Microbiology and Immunology, University of Minnesota, Minneapolis, Minnesota, USA;
| | - Xin Zhou
- Department of Microbiology and Immunology, University of Minnesota, Minneapolis, Minnesota, USA;
| | - Anna Selmecki
- Department of Microbiology and Immunology, University of Minnesota, Minneapolis, Minnesota, USA;
| |
Collapse
|
7
|
Boelts J, Harth P, Gao R, Udvary D, Yáñez F, Baum D, Hege HC, Oberlaender M, Macke JH. Simulation-based inference for efficient identification of generative models in computational connectomics. PLoS Comput Biol 2023; 19:e1011406. [PMID: 37738260 PMCID: PMC10550169 DOI: 10.1371/journal.pcbi.1011406] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 10/04/2023] [Accepted: 08/01/2023] [Indexed: 09/24/2023] Open
Abstract
Recent advances in connectomics research enable the acquisition of increasing amounts of data about the connectivity patterns of neurons. How can we use this wealth of data to efficiently derive and test hypotheses about the principles underlying these patterns? A common approach is to simulate neuronal networks using a hypothesized wiring rule in a generative model and to compare the resulting synthetic data with empirical data. However, most wiring rules have at least some free parameters, and identifying parameters that reproduce empirical data can be challenging as it often requires manual parameter tuning. Here, we propose to use simulation-based Bayesian inference (SBI) to address this challenge. Rather than optimizing a fixed wiring rule to fit the empirical data, SBI considers many parametrizations of a rule and performs Bayesian inference to identify the parameters that are compatible with the data. It uses simulated data from multiple candidate wiring rule parameters and relies on machine learning methods to estimate a probability distribution (the 'posterior distribution over parameters conditioned on the data') that characterizes all data-compatible parameters. We demonstrate how to apply SBI in computational connectomics by inferring the parameters of wiring rules in an in silico model of the rat barrel cortex, given in vivo connectivity measurements. SBI identifies a wide range of wiring rule parameters that reproduce the measurements. We show how access to the posterior distribution over all data-compatible parameters allows us to analyze their relationship, revealing biologically plausible parameter interactions and enabling experimentally testable predictions. We further show how SBI can be applied to wiring rules at different spatial scales to quantitatively rule out invalid wiring hypotheses. Our approach is applicable to a wide range of generative models used in connectomics, providing a quantitative and efficient way to constrain model parameters with empirical connectivity data.
Collapse
Affiliation(s)
- Jan Boelts
- Machine Learning in Science, University of Tübingen, Tübingen, Germany
- Tübingen AI Center, University of Tübingen, Tübingen, Germany
| | - Philipp Harth
- Department of Visual and Data-centric Computing, Zuse Institute Berlin, Berlin, Germany
| | - Richard Gao
- Machine Learning in Science, University of Tübingen, Tübingen, Germany
- Tübingen AI Center, University of Tübingen, Tübingen, Germany
| | - Daniel Udvary
- In Silico Brain Sciences, Max Planck Institute for Neurobiology of Behavior – caesar, Bonn, Germany
| | - Felipe Yáñez
- In Silico Brain Sciences, Max Planck Institute for Neurobiology of Behavior – caesar, Bonn, Germany
| | - Daniel Baum
- Department of Visual and Data-centric Computing, Zuse Institute Berlin, Berlin, Germany
| | - Hans-Christian Hege
- Department of Visual and Data-centric Computing, Zuse Institute Berlin, Berlin, Germany
| | - Marcel Oberlaender
- In Silico Brain Sciences, Max Planck Institute for Neurobiology of Behavior – caesar, Bonn, Germany
- Department of Integrative Neurophysiology, Center for Neurogenomics and Cognitive Research, Free University Amsterdam, Amsterdam, Netherlands
| | - Jakob H. Macke
- Machine Learning in Science, University of Tübingen, Tübingen, Germany
- Tübingen AI Center, University of Tübingen, Tübingen, Germany
- Empirical Inference, Max Planck Institute for Intelligent Systems, Tübingen, Germany
| |
Collapse
|
8
|
Avecilla G, Spealman P, Matthews J, Caudal E, Schacherer J, Gresham D. Copy number variation alters local and global mutational tolerance. Genome Res 2023; 33:1340-1353. [PMID: 37652668 PMCID: PMC10547251 DOI: 10.1101/gr.277625.122] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Accepted: 07/07/2023] [Indexed: 09/02/2023]
Abstract
Copy number variants (CNVs), duplications and deletions of genomic sequences, contribute to evolutionary adaptation but can also confer deleterious effects and cause disease. Whereas the effects of amplifying individual genes or whole chromosomes (i.e., aneuploidy) have been studied extensively, much less is known about the genetic and functional effects of CNVs of differing sizes and structures. Here, we investigated Saccharomyces cerevisiae (yeast) strains that acquired adaptive CNVs of variable structures and copy numbers following experimental evolution in glutamine-limited chemostats. Although beneficial in the selective environment, CNVs result in decreased fitness compared with the euploid ancestor in rich media. We used transposon mutagenesis to investigate mutational tolerance and genome-wide genetic interactions in CNV strains. We find that CNVs increase mutational target size, confer increased mutational tolerance in amplified essential genes, and result in novel genetic interactions with unlinked genes. We validated a novel genetic interaction between different CNVs and BMH1 that was common to multiple strains. We also analyzed global gene expression and found that transcriptional dosage compensation does not affect most genes amplified by CNVs, although gene-specific transcriptional dosage compensation does occur for ∼12% of amplified genes. Furthermore, we find that CNV strains do not show previously described transcriptional signatures of aneuploidy. Our study reveals the extent to which local and global mutational tolerance is modified by CNVs with implications for genome evolution and CNV-associated diseases, such as cancer.
Collapse
Affiliation(s)
- Grace Avecilla
- Department of Biology, New York University, New York, New York 10003, USA
- Center for Genomics and Systems Biology, New York University, New York, New York 10003, USA
| | - Pieter Spealman
- Department of Biology, New York University, New York, New York 10003, USA
- Center for Genomics and Systems Biology, New York University, New York, New York 10003, USA
| | - Julia Matthews
- Department of Biology, New York University, New York, New York 10003, USA
- Center for Genomics and Systems Biology, New York University, New York, New York 10003, USA
| | - Elodie Caudal
- Université de Strasbourg, CNRS, GMGM UMR, 7156 Strasbourg, France
| | - Joseph Schacherer
- Université de Strasbourg, CNRS, GMGM UMR, 7156 Strasbourg, France
- Institut Universitaire de France (IUF), 75231 Paris Cedex 05, France
| | - David Gresham
- Department of Biology, New York University, New York, New York 10003, USA;
- Center for Genomics and Systems Biology, New York University, New York, New York 10003, USA
| |
Collapse
|
9
|
Theodosiou L, Farr AD, Rainey PB. Barcoding Populations of Pseudomonas fluorescens SBW25. J Mol Evol 2023; 91:254-262. [PMID: 37186220 PMCID: PMC10275814 DOI: 10.1007/s00239-023-10103-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 03/13/2023] [Indexed: 05/17/2023]
Abstract
In recent years, evolutionary biologists have developed an increasing interest in the use of barcoding strategies to study eco-evolutionary dynamics of lineages within evolving populations and communities. Although barcoded populations can deliver unprecedented insight into evolutionary change, barcoding microbes presents specific technical challenges. Here, strategies are described for barcoding populations of the model bacterium Pseudomonas fluorescens SBW25, including the design and cloning of barcoded regions, preparation of libraries for amplicon sequencing, and quantification of resulting barcoded lineages. In so doing, we hope to aid the design and implementation of barcoding methodologies in a broad range of model and non-model organisms.
Collapse
Affiliation(s)
- Loukas Theodosiou
- Department of Microbial Population Biology, Max Planck Institute for Evolutionary Biology, Plön, Germany.
- Department of Comparative Development and Genetics, Max Planck Institute for Plant Breeding, Cologne, Germany.
| | - Andrew D Farr
- Department of Microbial Population Biology, Max Planck Institute for Evolutionary Biology, Plön, Germany
| | - Paul B Rainey
- Department of Microbial Population Biology, Max Planck Institute for Evolutionary Biology, Plön, Germany
- Laboratory of Biophysics and Evolution, CBI, ESPCI Paris, Université PSL, CNRS, Paris, France
| |
Collapse
|
10
|
Spealman P, De T, Chuong JN, Gresham D. Best Practices in Microbial Experimental Evolution: Using Reporters and Long-Read Sequencing to Identify Copy Number Variation in Experimental Evolution. J Mol Evol 2023; 91:356-368. [PMID: 37012421 PMCID: PMC10275804 DOI: 10.1007/s00239-023-10102-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Accepted: 02/21/2023] [Indexed: 04/05/2023]
Abstract
Copy number variants (CNVs), comprising gene amplifications and deletions, are a pervasive class of heritable variation. CNVs play a key role in rapid adaptation in both natural, and experimental, evolution. However, despite the advent of new DNA sequencing technologies, detection and quantification of CNVs in heterogeneous populations has remained challenging. Here, we summarize recent advances in the use of CNV reporters that provide a facile means of quantifying de novo CNVs at a specific locus in the genome, and nanopore sequencing, for resolving the often complex structures of CNVs. We provide guidance for the engineering and analysis of CNV reporters and practical guidelines for single-cell analysis of CNVs using flow cytometry. We summarize recent advances in nanopore sequencing, discuss the utility of this technology, and provide guidance for the bioinformatic analysis of these data to define the molecular structure of CNVs. The combination of reporter systems for tracking and isolating CNV lineages and long-read DNA sequencing for characterizing CNV structures enables unprecedented resolution of the mechanisms by which CNVs are generated and their evolutionary dynamics.
Collapse
Affiliation(s)
- Pieter Spealman
- Department of Biology, New York University, New York, NY, 10003, USA
- Center for Genomics and Systems Biology, New York University, New York, NY, 10003, USA
| | - Titir De
- Department of Biology, New York University, New York, NY, 10003, USA
- Center for Genomics and Systems Biology, New York University, New York, NY, 10003, USA
| | - Julie N Chuong
- Department of Biology, New York University, New York, NY, 10003, USA
- Center for Genomics and Systems Biology, New York University, New York, NY, 10003, USA
| | - David Gresham
- Department of Biology, New York University, New York, NY, 10003, USA.
- Center for Genomics and Systems Biology, New York University, New York, NY, 10003, USA.
| |
Collapse
|
11
|
Caspi I, Meir M, Ben Nun N, Abu Rass R, Yakhini U, Stern A, Ram Y. Mutation rate, selection, and epistasis inferred from RNA virus haplotypes via neural posterior estimation. Virus Evol 2023; 9:vead033. [PMID: 37305706 PMCID: PMC10256221 DOI: 10.1093/ve/vead033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Revised: 04/30/2023] [Accepted: 05/16/2023] [Indexed: 06/13/2023] Open
Abstract
RNA viruses are particularly notorious for their high levels of genetic diversity, which is generated through the forces of mutation and natural selection. However, disentangling these two forces is a considerable challenge, and this may lead to widely divergent estimates of viral mutation rates, as well as difficulties in inferring the fitness effects of mutations. Here, we develop, test, and apply an approach aimed at inferring the mutation rate and key parameters that govern natural selection, from haplotype sequences covering full-length genomes of an evolving virus population. Our approach employs neural posterior estimation, a computational technique that applies simulation-based inference with neural networks to jointly infer multiple model parameters. We first tested our approach on synthetic data simulated using different mutation rates and selection parameters while accounting for sequencing errors. Reassuringly, the inferred parameter estimates were accurate and unbiased. We then applied our approach to haplotype sequencing data from a serial passaging experiment with the MS2 bacteriophage, a virus that parasites Escherichia coli. We estimated that the mutation rate of this phage is around 0.2 mutations per genome per replication cycle (95% highest density interval: 0.051-0.56). We validated this finding with two different approaches based on single-locus models that gave similar estimates but with much broader posterior distributions. Furthermore, we found evidence for reciprocal sign epistasis between four strongly beneficial mutations that all reside in an RNA stem loop that controls the expression of the viral lysis protein, responsible for lysing host cells and viral egress. We surmise that there is a fine balance between over- and underexpression of lysis that leads to this pattern of epistasis. To recap, we have developed an approach for joint inference of the mutation rate and selection parameters from full haplotype data with sequencing errors and used it to reveal features governing MS2 evolution.
Collapse
Affiliation(s)
- Itamar Caspi
- Shmunis School of Biomedicine and Cancer Research, Faculty of Life Sciences, Tel Aviv University, P.O. Box 39040, Tel Aviv 6997801, Israel
| | - Moran Meir
- Shmunis School of Biomedicine and Cancer Research, Faculty of Life Sciences, Tel Aviv University, P.O. Box 39040, Tel Aviv 6997801, Israel
| | - Nadav Ben Nun
- Edmond J. Safra Center for Bioinformatics, Tel Aviv University, P.O. Box 39040, Tel Aviv 6997801, Israel
- School of Zoology, Faculty of Life Sciences, Tel Aviv University, P.O. Box 39040, Tel Aviv 6997801, Israel
| | | | - Uri Yakhini
- Shmunis School of Biomedicine and Cancer Research, Faculty of Life Sciences, Tel Aviv University, P.O. Box 39040, Tel Aviv 6997801, Israel
- Edmond J. Safra Center for Bioinformatics, Tel Aviv University, P.O. Box 39040, Tel Aviv 6997801, Israel
| | | | - Yoav Ram
- *Corresponding author: E-mail: ;
| |
Collapse
|
12
|
Brettner L, Ho WC, Schmidlin K, Apodaca S, Eder R, Geiler-Samerotte K. Challenges and potential solutions for studying the genetic and phenotypic architecture of adaptation in microbes. Curr Opin Genet Dev 2022; 75:101951. [PMID: 35797741 DOI: 10.1016/j.gde.2022.101951] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Revised: 06/01/2022] [Accepted: 06/14/2022] [Indexed: 11/29/2022]
Abstract
All organisms are defined by the makeup of their DNA. Over billions of years, the structure and information contained in that DNA, often referred to as genetic architecture, have been honed by a multitude of evolutionary processes. Mutations that cause genetic elements to change in a way that results in beneficial phenotypic change are more likely to survive and propagate through the population in a process known as adaptation. Recent work reveals that the genetic targets of adaptation are varied and can change with genetic background. Further, seemingly similar adaptive mutations, even within the same gene, can have diverse and unpredictable effects on phenotype. These challenges represent major obstacles in predicting adaptation and evolution. In this review, we cover these concepts in detail and identify three emerging synergistic solutions: higher-throughput evolution experiments combined with updated genotype-phenotype mapping strategies and physiological models. Our review largely focuses on recent literature in yeast, and the field seems to be on the cusp of a new era with regard to studying the predictability of evolution.
Collapse
|