1
|
A pan-genome of 69 Arabidopsis thaliana accessions reveals a conserved genome structure throughout the global species range. Nat Genet 2024; 56:982-991. [PMID: 38605175 PMCID: PMC11096106 DOI: 10.1038/s41588-024-01715-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/24/2023] [Accepted: 03/11/2024] [Indexed: 04/13/2024]
Abstract
Although originally primarily a system for functional biology, Arabidopsis thaliana has, owing to its broad geographical distribution and adaptation to diverse environments, developed into a powerful model in population genomics. Here we present chromosome-level genome assemblies of 69 accessions from a global species range. We found that genomic colinearity is very conserved, even among geographically and genetically distant accessions. Along chromosome arms, megabase-scale rearrangements are rare and typically present only in a single accession. This indicates that the karyotype is quasi-fixed and that rearrangements in chromosome arms are counter-selected. Centromeric regions display higher structural dynamics, and divergences in core centromeres account for most of the genome size variations. Pan-genome analyses uncovered 32,986 distinct gene families, 60% being present in all accessions and 40% appearing to be dispensable, including 18% private to a single accession, indicating unexplored genic diversity. These 69 new Arabidopsis thaliana genome assemblies will empower future genetic research.
|
2
|
Phytophthora capsici genome assembly for two isolates using long-read Oxford Nanopore Technology sequencing. Microbiol Resour Announc 2023; 12:e0019623. [PMID: 37948305 PMCID: PMC10720514 DOI: 10.1128/mra.00196-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/05/2023] [Accepted: 10/10/2023] [Indexed: 11/12/2023] Open
Abstract
The oomycete Phytophthora capsici is a common pathogen of the Solanaceae and Cucurbitaceae families. An improved assembly for the reference isolate LT1534 was constructed using Oxford Nanopore Technologies and Illumina data. Additionally, an unpolished assembly was produced for the European isolate Pc285 collected on chili pepper using Oxford Nanopore reads.
|
3
|
Transcriptomic profiling reveals host-specific evolutionary pathways promoting enhanced fitness in the plant pathogen Ralstonia pseudosolanacearum. Microb Genom 2023; 9:001142. [PMID: 38063495 PMCID: PMC10763508 DOI: 10.1099/mgen.0.001142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/16/2023] [Accepted: 11/09/2023] [Indexed: 12/18/2023] Open
Abstract
The impact of host diversity on the genotypic and phenotypic evolution of broad-spectrum pathogens is an open issue. Here, we used populations of the plant pathogen Ralstonia pseudosolanacearum that were experimentally evolved on five types of host plants, either belonging to different botanical families or differing in their susceptibility or resistance to the pathogen. We investigated whether changes in transcriptomic profiles, associated with or independent of genetic changes, could occur during the process of host adaptation, and whether transcriptomic reprogramming was dependent on host type. Genomic and transcriptomic variations were established for 31 evolved clones that showed better fitness in their experimental host than the ancestral clone. Few genomic polymorphisms were detected in these clones, but significant transcriptomic variations were observed, with a large number of differentially expressed genes (DEGs). In a very clear way, a group of genes belonging to the network of regulation of the bacterial virulence such as efpR, efpH or hrpB, among others, were deregulated in several independent evolutionary lineages and appeared to play a key role in the transcriptomic rewiring observed in evolved clones. A double hierarchical clustering based on the 400 top DEGs for each clone revealed 2 major patterns of gene deregulation that depend on host genotype, but not on host susceptibility or resistance to the pathogen. This work therefore highlights the existence of two major evolutionary paths that result in a significant reorganization of gene expression during adaptive evolution and underscores clusters of co-regulated genes associated with bacterial adaptation on different host lines.
|
4
|
Suspicions of two bridgehead invasions of Xylella fastidiosa subsp. multiplex in France. Commun Biol 2023; 6:103. [PMID: 36707697 PMCID: PMC9883466 DOI: 10.1038/s42003-023-04499-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/24/2022] [Accepted: 01/18/2023] [Indexed: 01/29/2023] Open
Abstract
Of American origin, a wide diversity of Xylella fastidiosa strains belonging to different subspecies have been reported in Europe since 2013 and its discovery in Italian olive groves. Strains from the subspecies multiplex (ST6 and ST7) were first identified in France in 2015 in urban and natural areas. To trace back the most probable scenario of introduction in France, the molecular evolution rate of this subspecies was estimated at 3.2165 × 10⁻⁷ substitutions per site per year, based on heterochronous genome sequences collected worldwide. This rate allowed the dating of the divergence between French and American strains in 1987 for ST6 and in 1971 for ST7. The development of a new VNTR-13 scheme allowed tracing the spread of the bacterium in France, hypothesizing an American origin. Our results suggest that both sequence types were initially introduced and spread in Provence-Alpes-Côte d'Azur (PACA); then they were introduced in Corsica in two waves from the PACA bridgehead populations.
|
5
|
A chromosome-level, haplotype-phased Vanilla planifolia genome highlights the challenge of partial endoreplication for accurate whole-genome assembly. Plant Commun 2022; 3:100330. [PMID: 35617961 PMCID: PMC9482989 DOI: 10.1016/j.xplc.2022.100330] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 10/30/2021] [Revised: 04/10/2022] [Accepted: 04/27/2022] [Indexed: 06/02/2023]
Abstract
Vanilla planifolia, the species cultivated to produce one of the world's most popular flavors, is highly prone to partial genome endoreplication, which leads to highly unbalanced DNA content in cells. We report here the first molecular evidence of partial endoreplication at the chromosome scale by the assembly and annotation of an accurate haplotype-phased genome of V. planifolia. Cytogenetic data demonstrated that the diploid genome size is 4.09 Gb, with 16 chromosome pairs, although aneuploid cells are frequently observed. Using PacBio HiFi and optical mapping, we assembled and phased a diploid genome of 3.4 Gb with a scaffold N50 of 1.2 Mb and 59 128 predicted protein-coding genes. The atypical k-mer frequencies and the uneven sequencing depth observed agreed with our expectation of unbalanced genome representation. Sixty-seven percent of the genes were scattered over only 30% of the genome, putatively linking gene-rich regions and the endoreplication phenomenon. By contrast, low-coverage regions (non-endoreplicated) were rich in repeated elements but also contained 33% of the annotated genes. Furthermore, this assembly showed distinct haplotype-specific sequencing depth variation patterns, suggesting complex molecular regulation of endoreplication along the chromosomes. This high-quality, anchored assembly represents 83% of the estimated V. planifolia genome. It provides a significant step toward the elucidation of this complex genome. To support post-genomics efforts, we developed the Vanilla Genome Hub, a user-friendly integrated web portal that enables centralized access to high-throughput genomic and other omics data and interoperable use of bioinformatics tools.
|
6
|
A duplicated copy of id2b is an unusual sex-determining candidate gene on the Y chromosome of arapaima (Arapaima gigas). Sci Rep 2021; 11:21544. [PMID: 34732792 PMCID: PMC8566520 DOI: 10.1038/s41598-021-01066-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 04/06/2021] [Accepted: 10/21/2021] [Indexed: 12/19/2022] Open
Abstract
Arapaima gigas is one of the largest freshwater fish species of high ecological and economic importance. Overfishing and habitat destruction are severe threats to the remaining wild populations. By incorporating a chromosomal Hi-C contact map, we improved the arapaima genome assembly to chromosome-level, revealing an unexpectedly high degree of chromosome rearrangements during evolution of the bonytongues (Osteoglossiformes). Combining this new assembly with pool-sequencing of male and female genomes, we identified id2bbY, a duplicated copy of the inhibitor of DNA binding 2b (id2b) gene on the Y chromosome as candidate male sex-determining gene. A PCR-test for id2bbY was developed, demonstrating that this gene is a reliable male-specific marker for genotyping. Expression analyses showed that this gene is expressed in juvenile male gonads. Its paralog, id2ba, exhibits a male-biased expression in immature gonads. Transcriptome analyses and protein structure predictions confirm id2bbY as a prime candidate for the master sex-determiner. Acting through the TGFβ signaling pathway, id2bbY from arapaima would provide the first evidence for a link of this family of transcriptional regulators to sex determination. Our study broadens our current understanding about the evolution of sex determination genetic networks and provides a tool for improving arapaima aquaculture for commercial and conservation purposes.
|
7
|
Detection and Typing of a Fowl Adenovirus Type 1 Agent of Pancreatitis in Guinea Fowl. Avian Dis 2021; 65:429-437. [PMID: 34699140 DOI: 10.1637/0005-2086-65.3.429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/10/2020] [Accepted: 04/29/2020] [Indexed: 11/05/2022]
Abstract
Adenoviral pancreatitis has been amply described for decades in guinea fowl. Although its pathologic picture has been characterized fairly well, its etiology still remains only partially clarified. Based on several outbreaks diagnosed on commercial guinea flocks raised in France since 2017, we performed direct whole-genome sequencing from pancreatic lesional tissue by using the Oxford Nanopore Technologies (ONT) sequencing method. We generated 4781 viral reads and assembled a whole genome of 43,509 bp, clustering within fowl adenovirus type 1 (FAdV-1). A phylogenetic analysis based on a partial sequence of the hexon and short fiber genes on viruses collected in France showed 98.7% and 99.8% nucleotide identity, respectively. Altogether, these results confirm that an FAdV-1 closely related to chicken and other avian strains is the agent of pancreatitis in guinea fowl. This study illustrates the potential of ONT sequencing method to achieve rapid whole-genome sequencing directly from pathologic material.
|
8
|
Convergent Rewiring of the Virulence Regulatory Network Promotes Adaptation of Ralstonia solanacearum on Resistant Tomato. Mol Biol Evol 2021; 38:1792-1808. [PMID: 33306125 PMCID: PMC8097285 DOI: 10.1093/molbev/msaa320] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Indexed: 12/14/2022] Open
Abstract
The evolutionary and adaptive potential of a pathogen is a key determinant for successful host colonization and proliferation but remains poorly known for most of the pathogens. Here, we used experimental evolution combined with phenotyping, genomics, and transcriptomics to estimate the adaptive potential of the bacterial plant pathogen Ralstonia solanacearum to overcome the quantitative resistance of the tomato cultivar Hawaii 7996. After serial passaging over 300 generations, we observed pathogen adaptation to within-plant environment of the resistant cultivar but no plant resistance breakdown. Genomic sequence analysis of the adapted clones revealed few genetic alterations, but we provide evidence that all but one were gain of function mutations. Transcriptomic analyses revealed that even if different adaptive events occurred in independently evolved clones, there is convergence toward a global rewiring of the virulence regulatory network as evidenced by largely overlapping gene expression profiles. A subset of four transcription regulators, including HrpB, the activator of the type 3 secretion system regulon and EfpR, a global regulator of virulence and metabolic functions, emerged as key nodes of this regulatory network that are frequently targeted to redirect the pathogen’s physiology and improve its fitness in adverse conditions. Significant transcriptomic variations were also detected in evolved clones showing no genomic polymorphism, suggesting that epigenetic modifications regulate expression of some of the virulence network components and play a major role in adaptation as well.
|
9
|
A 180 Myr-old female-specific genome region in sturgeon reveals the oldest known vertebrate sex determining system with undifferentiated sex chromosomes. Philos Trans R Soc Lond B Biol Sci 2021; 376:20200089. [PMID: 34247507 PMCID: PMC8273502 DOI: 10.1098/rstb.2020.0089] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Indexed: 12/11/2022] Open
Abstract
Several hypotheses explain the prevalence of undifferentiated sex chromosomes in poikilothermic vertebrates. Turnovers change the master sex determination gene, the sex chromosome or the sex determination system (e.g. XY to WZ). Jumping master genes stay main triggers but translocate to other chromosomes. Occasional recombination (e.g. in sex-reversed females) prevents sex chromosome degeneration. Recent research has uncovered conserved heteromorphic or even homomorphic sex chromosomes in several clades of non-avian and non-mammalian vertebrates. Sex determination in sturgeons (Acipenseridae) has been a long-standing basic biological question, linked to economical demands by the caviar-producing aquaculture. Here, we report the discovery of a sex-specific sequence from sterlet (Acipenser ruthenus). Using chromosome-scale assemblies and pool-sequencing, we first identified an approximately 16 kb female-specific region. We developed a PCR-genotyping test, yielding female-specific products in six species, spanning the entire phylogeny with the most divergent extant lineages (A. sturio, A. oxyrinchus versus A. ruthenus, Huso huso), stemming from an ancient tetraploidization. Similar results were obtained in two octoploid species (A. gueldenstaedtii, A. baerii). Conservation of a female-specific sequence for a long period, representing 180 Myr of sturgeon evolution, and across at least one polyploidization event, raises many interesting biological questions. We discuss a conserved undifferentiated sex chromosome system with a ZZ/ZW-mode of sex determination and potential alternatives. This article is part of the theme issue ‘Challenging the paradigm in sex chromosome evolution: empirical and theoretical insights with a focus on vertebrates (Part I)’.
|
10
|
Population genomics of apricots unravels domestication history and adaptive events. Nat Commun 2021; 12:3956. [PMID: 34172741 PMCID: PMC8233370 DOI: 10.1038/s41467-021-24283-6] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Received: 02/22/2021] [Accepted: 06/08/2021] [Indexed: 01/27/2023] Open
Abstract
Among crop fruit trees, the apricot (Prunus armeniaca) provides an excellent model to study divergence and adaptation processes. Here, we obtain nearly 600 Armeniaca apricot genomes and four high-quality assemblies anchored on genetic maps. Chinese and European apricots form two differentiated gene pools with high genetic diversity, resulting from independent domestication events from distinct wild Central Asian populations, and with subsequent gene flow. A relatively low proportion of the genome is affected by selection. Different genomic regions show footprints of selection in European and Chinese cultivated apricots, despite convergent phenotypic traits, with predicted functions in both groups involved in the perennial life cycle, fruit quality and disease resistance. Selection footprints appear more abundant in European apricots, with a hotspot on chromosome 4, while admixture is more pervasive in Chinese cultivated apricots. Our study provides clues to the biology of selected traits and targets for fruit tree research and breeding.
|
11
|
DNAModAnnot: a R toolbox for DNA modification filtering and annotation. Bioinformatics 2021; 37:2738-2740. [PMID: 33471071 PMCID: PMC8428616 DOI: 10.1093/bioinformatics/btab032] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 09/24/2020] [Revised: 12/17/2020] [Accepted: 01/13/2021] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Long-read sequencing technologies can be employed to detect and map DNA modifications at the nucleotide resolution on a genome-wide scale. However, published software packages neglect the integration of genomic annotation and comprehensive filtering when analyzing patterns of modified bases detected using Pacific Biosciences (PacBio) or Oxford Nanopore Technologies (ONT) data. Here, we present DNAModAnnot, an R package designed for the global analysis of DNA modification patterns using adapted filtering and visualization tools. RESULTS We tested our package using PacBio sequencing data to analyze patterns of the 6-methyladenine (6mA) in the ciliate Paramecium tetraurelia, in which high 6mA amounts were previously reported. We found Paramecium tetraurelia 6mA genome-wide distribution to be similar to other ciliates. We also performed 5-methylcytosine (5mC) analysis in human lymphoblastoid cells using ONT data and confirmed previously known patterns of 5mC. DNAModAnnot provides a toolbox for the genome-wide analysis of different DNA modifications using PacBio and ONT long-read sequencing data. AVAILABILITY DNAModAnnot is distributed as an R package available via GitHub (https://github.com/AlexisHardy/DNAModAnnot). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
|
12
|
Independent Origin of XY and ZW Sex Determination Mechanisms in Mosquitofish Sister Species. Genetics 2020; 214:193-209. [PMID: 31704715 PMCID: PMC6944411 DOI: 10.1534/genetics.119.302698] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Received: 09/04/2019] [Accepted: 11/05/2019] [Indexed: 12/12/2022] Open
Abstract
Fish are known for the outstanding variety of their sex determination mechanisms and sex chromosome systems. The western (Gambusia affinis) and eastern mosquitofish (G. holbrooki) are sister species for which different sex determination mechanisms have been described: ZZ/ZW for G. affinis and XX/XY for G. holbrooki. Here, we carried out restriction-site associated DNA (RAD-) and pool sequencing (Pool-seq) to characterize the sex chromosomes of both species. We found that the ZW chromosomes of G. affinis females and the XY chromosomes of G. holbrooki males correspond to different linkage groups, and thus evolved independently from separate autosomes. In interspecific hybrids, the Y chromosome is dominant over the W chromosome, and X is dominant over Z. In G. holbrooki, we identified a candidate region for the Y-linked melanic pigmentation locus, a rare male phenotype that constitutes a potentially sexually antagonistic trait and is associated with other such characteristics, e.g., large body size and aggressive behavior. We developed a SNP-based marker in the Y-linked allele of GIPC PDZ domain containing family member 1 (gipc1), which was linked to melanism in all tested G. holbrooki populations. This locus represents an example for a color locus that is located in close proximity to a putative sex determiner, and most likely substantially contributed to the evolution of the Y.
|
14
|
Whole-genome landscape of Medicago truncatula symbiotic genes. Nat Plants 2018; 4:1017-1025. [PMID: 30397259 DOI: 10.1038/s41477-018-0286-7] [Citation(s) in RCA: 131] [Impact Index Per Article: 21.8] [Received: 03/07/2018] [Accepted: 09/21/2018] [Indexed: 05/07/2023]
Abstract
Advances in deciphering the functional architecture of eukaryotic genomes have been facilitated by recent breakthroughs in sequencing technologies, enabling a more comprehensive representation of genes and repeat elements in genome sequence assemblies, as well as more sensitive and tissue-specific analyses of gene expression. Here we show that PacBio sequencing has led to a substantially improved genome assembly of Medicago truncatula A17, a legume model species notable for endosymbiosis studies, and has enabled the identification of genome rearrangements between genotypes at a near-base-pair resolution. Annotation of the new M. truncatula genome sequence has allowed for a thorough analysis of transposable elements and their dynamics, as well as the identification of new players involved in symbiotic nodule development, in particular 1,037 upregulated long non-coding RNAs (lncRNAs). We have also discovered that a substantial proportion (~35% and 38%, respectively) of the genes upregulated in nodules or expressed in the nodule differentiation zone colocalize in genomic clusters (270 and 211, respectively), here termed symbiotic islands. These islands contain numerous expressed lncRNA genes and display differentially both DNA methylation and histone marks. Epigenetic regulations and lncRNAs are therefore attractive candidate elements for the orchestration of symbiotic gene expression in the M. truncatula genome.
|
15
|
Accurate Identification and Quantification of DNA Species by Next-Generation Sequencing in Adeno-Associated Viral Vectors Produced in Insect Cells. Hum Gene Ther Methods 2017; 28:148-162. [PMID: 28463571 DOI: 10.1089/hgtb.2016.185] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Indexed: 02/06/2023] Open
Abstract
Recombinant adeno-associated viral (rAAV) vectors have proven excellent tools for the treatment of many genetic diseases and other complex diseases. However, the illegitimate encapsidation of DNA contaminants within viral particles constitutes a major safety concern for rAAV-based therapies. Moreover, the development of rAAV vectors for early-phase clinical trials has revealed the limited accuracy of the analytical tools used to characterize these new and complex drugs. Although most published data concerning residual DNA in rAAV preparations have been generated by quantitative PCR, we have developed a novel single-strand virus sequencing (SSV-Seq) method for quantification of DNA contaminants in AAV vectors produced in mammalian cells by next-generation sequencing (NGS). Here, we describe the adaptation of SSV-Seq for the accurate identification and quantification of DNA species in rAAV stocks produced in insect cells. We found that baculoviral DNA was the most abundant contaminant, representing less than 2.1% of NGS reads regardless of serotype (2, 8, or rh10). Sf9 producer cell DNA was detected at low frequency (≤0.03%) in rAAV lots. Advanced computational analyses revealed that (1) baculoviral sequences close to the inverted terminal repeats preferentially underwent illegitimate encapsidation, and (2) single-nucleotide variants were absent from the rAAV genome. The high-throughput sequencing protocol described here enables effective DNA quality control of rAAV vectors produced in insect cells, and is adapted to conform with regulatory agency safety requirements.
|