1
Gene duplication as a major force driving the genome expansion in some giant viruses. J Virol 2023; 97:e0130923. [PMID: 38092658 PMCID: PMC10734413 DOI: 10.1128/jvi.01309-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/23/2023] [Accepted: 10/26/2023] [Indexed: 12/22/2023] Open
Abstract
IMPORTANCE Giant viruses are noteworthy not only due to their enormous particles but also because of their gigantic genomes. In this context, a fundamental question has persisted: how did these genomes evolve? Here we present the discovery of cedratvirus pambiensis, featuring the largest genome ever described for a cedratvirus. Our data suggest that the larger size of the genome can be attributed to an unprecedented number of duplicated genes. Further investigation of this phenomenon in other viruses has illuminated gene duplication as a key evolutionary mechanism driving genome expansion in diverse giant viruses. Although gene duplication has been described as a recurrent event in cellular organisms, our data highlight its potential as a pivotal event in the evolution of gigantic viral genomes.
2
High Nucleotide Substitution Rates Associated with Retrotransposon Proliferation Drive Dynamic Secretome Evolution in Smut Pathogens. Microbiol Spectr 2022; 10:e0034922. [PMID: 35972267 PMCID: PMC9603552 DOI: 10.1128/spectrum.00349-22] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 02/09/2022] [Accepted: 07/22/2022] [Indexed: 11/20/2022] Open
Abstract
Transposable elements (TEs) play a pivotal role in shaping diversity in eukaryotic genomes. The covered smut pathogen on barley, Ustilago hordei, encountered a recent genome expansion. Using long reads, we assembled genomes of 6 U. hordei strains and 3 sister species, to study this genome expansion. We found that larger genome sizes can mainly be attributed to a higher genome fraction of long terminal repeat retrotransposons (LTR-RTs). In the studied smut genomes, LTR-RT fractions are the largest in U. hordei and are positively correlated with the mating-type locus sizes, which reach up to ~560 kb in U. hordei. Furthermore, LTR-RTs were found to be associated with higher nucleotide substitution levels, as these occur in specific genome regions of smut species with a recent LTR-RT proliferation. Moreover, genes in genome regions with higher nucleotide substitution levels generally reside closer to LTR-RTs than other genome regions. Genome regions with many nucleotide substitutions encountered an especially high fraction of CG substitutions, which is not observed for LTR-RT sequences. The high nucleotide substitution levels particularly accelerate the evolution of secretome genes, as their more accessory nature results in substitutions that often lead to amino acid alterations. IMPORTANCE Genomic alteration can be generated through various means, in which transposable elements (TEs) can play a pivotal role. Their mobility causes mutagenesis in itself and can disrupt the function of the sequences they insert into. They also impact genome evolution as their repetitive nature facilitates nonhomologous recombination. Furthermore, TEs have been linked to specific epigenetic genome organizations. We report a recent TE proliferation in the genome of the barley covered smut fungus, Ustilago hordei. This proliferation is associated with a distinct nucleotide substitution regime that has a higher rate and a higher fraction of CG substitutions. This different regime shapes the evolution of genes in subjected genome regions. We hypothesize that TEs may influence the error-rate of DNA polymerase in a hitherto unknown fashion.
3
Full-Length Genome of an Ogataea polymorpha Strain CBS4732 ura3Δ Reveals Large Duplicated Segments in Subtelomeric Regions. Front Microbiol 2022; 13:855666. [PMID: 35464988 PMCID: PMC9019687 DOI: 10.3389/fmicb.2022.855666] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 01/15/2022] [Accepted: 02/25/2022] [Indexed: 11/18/2022] Open
Abstract
Background Currently, methylotrophic yeasts (e.g., Pichia pastoris, Ogataea polymorpha, and Candida boidinii) are subjects of intense genomics studies in basic research and industrial applications. In the genus Ogataea, most research is focused on three basic O. polymorpha strains: CBS4732, NCYC495, and DL-1. However, the relationship between CBS4732, NCYC495, and DL-1 remains unclear, as the genomic differences between them have not been exactly determined without their high-quality complete genomes. As a nutritionally deficient mutant derived from CBS4732, the O. polymorpha strain CBS4732 ura3Δ (named HU-11) is being used for high-yield production of several important proteins or peptides. HU-11 has the same reference genome as CBS4732 (noted as HU-11/CBS4732), because the only genomic difference between them is a 5-bp insertion. Results In the present study, we have assembled the full-length genome of O. polymorpha HU-11/CBS4732 using high-depth PacBio and Illumina data. Long terminal repeat retrotransposons (LTR-rts), rDNA, 5′ and 3′ telomeric, subtelomeric, low complexity and other repeat regions were exactly determined to improve the genome quality. In brief, the main findings include complete rDNAs, complete LTR-rts, three large duplicated segments in subtelomeric regions and three structural variations between the HU-11/CBS4732 and NCYC495 genomes. These findings are very important for the assembly of full-length genomes of yeast and the correction of assembly errors in the published genomes of Ogataea spp. HU-11/CBS4732 is so phylogenetically close to NCYC495 that the syntenic regions cover nearly 100% of their genomes. Moreover, HU-11/CBS4732 and NCYC495 share a nucleotide identity of 99.5% through their whole genomes. CBS4732 and NCYC495 can be regarded as the same strain in basic research and industrial applications. Conclusion The present study preliminarily revealed the relationship between CBS4732, NCYC495, and DL-1. Our findings provide new opportunities for in-depth understanding of genome evolution in methylotrophic yeasts and lay the foundations for the industrial applications of O. polymorpha CBS4732, NCYC495, DL-1, and their derivative strains. The full-length genome of O. polymorpha HU-11/CBS4732 should be included in the NCBI RefSeq database for future studies of Ogataea spp.
4
The Chinese pine genome and methylome unveil key features of conifer evolution. Cell 2021; 185:204-217.e14. [PMID: 34965378 DOI: 10.1016/j.cell.2021.12.006] [Citation(s) in RCA: 94] [Impact Index Per Article: 31.3] [Received: 07/27/2021] [Revised: 10/23/2021] [Accepted: 12/03/2021] [Indexed: 12/30/2022]
Abstract
Conifers dominate the world's forest ecosystems and are the most widely planted tree species. Their giant and complex genomes present great challenges for assembling a complete reference genome for evolutionary and genomic studies. We present a 25.4-Gb chromosome-level assembly of Chinese pine (Pinus tabuliformis) and reveal that its genome size is mostly attributable to huge intergenic regions and long introns with high transposable element (TE) content. Large genes with long introns exhibited higher expression levels. Despite a lack of recent whole-genome duplication, 91.2% of genes were duplicated through dispersed duplication, and expanded gene families are mainly related to stress responses, which may underpin conifers' adaptation, particularly in cold and/or arid conditions. The reproductive regulation network is distinct compared with angiosperms. Slow removal of TEs with high-level methylation may have contributed to genomic expansion. This study provides insights into conifer evolution and resources for advancing research on conifer adaptation and development.
5
DNA Transposon Expansion is Associated with Genome Size Increase in Mudminnows. Genome Biol Evol 2021; 13:6380143. [PMID: 34599322 PMCID: PMC8557787 DOI: 10.1093/gbe/evab228] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Accepted: 09/27/2021] [Indexed: 12/20/2022] Open
Abstract
Genome sizes of eukaryotic organisms vary substantially, with whole-genome duplications (WGD) and transposable element expansion acting as main drivers for rapid genome size increase. The two North American mudminnows, Umbra limi and Umbra pygmaea, feature genomes about twice the size of their sister lineage Esocidae (e.g., pikes and pickerels). However, it is unknown whether all Umbra species share this genome expansion and which causal mechanisms drive this expansion. Using flow cytometry, we find that the genome of the European mudminnow is expanded similarly to both North American species, ranging between 4.5 and 5.4 pg per diploid nucleus. Observed blocks of interstitially located telomeric repeats in U. limi suggest frequent Robertsonian rearrangements in its history. Comparative analyses of transcriptome and genome assemblies show that the genome expansion in Umbra is driven by the expansion of DNA transposon and unclassified repeat sequences without WGD. Furthermore, we find a substantial ongoing expansion of repeat sequences in the Alaska blackfish Dallia pectoralis, the closest relative to the family Umbridae, which might mark the beginning of a similar genome expansion. Our study suggests that the genome expansion in mudminnows, driven mainly by transposon expansion, but not WGD, occurred before the separation into the American and European lineage.
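
The flow-cytometry values above are DNA masses per diploid nucleus, whereas most other entries in this list report genome sizes in base pairs. The following is a minimal sketch of the unit conversion, assuming the commonly used factor of 1 pg of DNA ≈ 978 Mb; that factor and the function name `diploid_pg_to_haploid_gb` are assumptions of this sketch and are not taken from the abstract.

```python
# Assumed conversion factor (not stated in the abstract): 1 pg DNA ≈ 0.978e9 bp.
BP_PER_PG = 0.978e9


def diploid_pg_to_haploid_gb(pg_per_diploid_nucleus):
    """Convert a 2C DNA mass in picograms to a haploid (1C) genome size in Gb."""
    haploid_pg = pg_per_diploid_nucleus / 2  # 2C -> 1C
    return haploid_pg * BP_PER_PG / 1e9      # bp -> Gb


# The range reported in the abstract: 4.5 to 5.4 pg per diploid nucleus.
for pg in (4.5, 5.4):
    print(f"{pg} pg (2C) is roughly {diploid_pg_to_haploid_gb(pg):.2f} Gb (1C)")
```

Under that assumed factor, the reported range corresponds to haploid genomes of roughly 2.2 to 2.6 Gb.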
6
Comparative Genome Analyses Highlight Transposon-Mediated Genome Expansion and the Evolutionary Architecture of 3D Genomic Folding in Cotton. Mol Biol Evol 2021. [PMID: 33973633 DOI: 10.21203/rs.3.rs-93594/v1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 05/08/2023] Open
Abstract
Transposable element (TE) amplification has been recognized as a driving force mediating genome size expansion and evolution, but the consequences for shaping 3D genomic architecture remain largely unknown in plants. Here, we report reference-grade genome assemblies for three species of cotton ranging 3-fold in genome size, namely Gossypium rotundifolium (K2), G. arboreum (A2), and G. raimondii (D5), using Oxford Nanopore Technologies. Comparative genome analyses document the details of lineage-specific TE amplification contributing to the large genome size differences (K2, 2.44 Gb; A2, 1.62 Gb; D5, 750.19 Mb) and indicate relatively conserved gene content and synteny relationships among genomes. We found that approximately 17% of syntenic genes exhibit chromatin status change between active ("A") and inactive ("B") compartments, and TE amplification was associated with the increase of the proportion of A compartment in gene regions (∼7,000 genes) in K2 and A2 relative to D5. Only 42% of topologically associating domain (TAD) boundaries were conserved among the three genomes. Our data implicate recent amplification of TEs following the formation of lineage-specific TAD boundaries. This study sheds light on the role of transposon-mediated genome expansion in the evolution of higher-order chromatin structure in plants.
7
The genome assembly and annotation of the Apollo butterfly Parnassius apollo, a flagship species for conservation biology. Genome Biol Evol 2021; 13:6296838. [PMID: 34115121 PMCID: PMC8536933 DOI: 10.1093/gbe/evab122] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Accepted: 05/25/2021] [Indexed: 11/13/2022] Open
Abstract
Conservation genomics has made dramatic improvements over the past decade, leveraging the power of genomes to infer diverse parameters central to conservation management questions. However, much of this effort has focused upon vertebrate species, despite insects providing similar flagship status with the added benefit of smaller genomes, shorter generation times and extensive historical collections in museums. Here we present the genome of the Apollo butterfly (Parnassius apollo, Papilionidae), an iconic endangered butterfly, which, like many species in this genus, needs conservation genomic attention yet lacks a genome. Using 68.7 Gb of long-read data (N50 = 15.2 kb) we assembled a 1.4 Gb genome for the Apollo butterfly, making this the largest sequenced Lepidopteran genome to date. The assembly was highly contiguous (N50 = 7.1 Mb) and complete (97% of Lepidopteran BUSCOs were single-copy and complete) and consisted of 1,707 contigs. Using RNAseq data and Arthropoda proteins, we annotated 28.3K genes. Alignment with the closest-related chromosome-level assembly, Papilio bianor, reveals a highly conserved chromosomal organization, although genome size is highly expanded in the Apollo butterfly, due primarily to a dramatic increase in repetitive element content. Using this alignment for superscaffolding places the P. apollo genome into 31 chromosomal scaffolds, and together with our functional annotation, provides an essential resource for advancing conservation genomics in a flagship species for insect conservation.
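
The abstract above quotes N50 for both the reads and the assembly. As a reminder of what that statistic measures, here is a minimal sketch of the standard N50 definition: the length L such that sequences of length L or longer contain at least half of the total bases. The function name `n50` and the toy contig lengths are hypothetical; this is not the authors' pipeline.

```python
def n50(lengths):
    """Return the N50 of a non-empty collection of positive sequence lengths."""
    if not lengths:
        raise ValueError("need at least one length")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down, accumulating bases until
    # at least half of the total assembly length is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Hypothetical contig lengths (arbitrary units), total = 300.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))  # 80 + 70 = 150 reaches half of 300, so this prints 70
```

A higher N50 for the same total length means the assembly is concentrated in fewer, longer contigs, which is why the abstract cites it as evidence of contiguity.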
8
Comparative genome analyses highlight transposon-mediated genome expansion and the evolutionary architecture of 3D genomic folding in cotton. Mol Biol Evol 2021; 38:3621-3636. [PMID: 33973633 PMCID: PMC8382922 DOI: 10.1093/molbev/msab128] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Received: 12/31/2020] [Revised: 04/19/2021] [Accepted: 04/28/2021] [Indexed: 12/12/2022] Open
Abstract
Transposable element (TE) amplification has been recognized as a driving force mediating genome size expansion and evolution, but the consequences for shaping 3D genomic architecture remain largely unknown in plants. Here, we report reference-grade genome assemblies for three species of cotton ranging 3-fold in genome size, namely Gossypium rotundifolium (K2), G. arboreum (A2), and G. raimondii (D5), using Oxford Nanopore Technologies. Comparative genome analyses document the details of lineage-specific TE amplification contributing to the large genome size differences (K2, 2.44 Gb; A2, 1.62 Gb; D5, 750.19 Mb) and indicate relatively conserved gene content and synteny relationships among genomes. We found that approximately 17% of syntenic genes exhibit chromatin status change between active (“A”) and inactive (“B”) compartments, and TE amplification was associated with the increase of the proportion of A compartment in gene regions (∼7,000 genes) in K2 and A2 relative to D5. Only 42% of topologically associating domain (TAD) boundaries were conserved among the three genomes. Our data implicate recent amplification of TEs following the formation of lineage-specific TAD boundaries. This study sheds light on the role of transposon-mediated genome expansion in the evolution of higher-order chromatin structure in plants.
9
Plasmids Related to the Symbiotic Nitrogen Fixation Are Not Only Cooperated Functionally but Also May Have Evolved over a Time Span in Family Rhizobiaceae. Genome Biol Evol 2020; 12:2002-2014. [PMID: 32687170 PMCID: PMC7719263 DOI: 10.1093/gbe/evaa152] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Accepted: 07/15/2020] [Indexed: 12/17/2022] Open
Abstract
Rhizobia are soil bacteria capable of forming symbiotic nitrogen-fixing nodules associated with leguminous plants. In fast-growing legume-nodulating rhizobia, such as the species in the family Rhizobiaceae, the symbiotic plasmid is the main genetic basis for nitrogen-fixing symbiosis, and is susceptible to horizontal gene transfer. To further understand the evolution of symbiosis in Rhizobiaceae, we analyzed the pan-genome of this family based on 92 genomes of type/reference strains and reconstructed its phylogeny using a phylogenomics approach. Intriguingly, although the genetic expansion that occurred in chromosomal regions was the main reason for the high proportion of low-frequency flexible gene families in the pan-genome, gene gain events associated with accessory plasmids introduced more genes into the genomes of nitrogen-fixing species. For symbiotic plasmids, although horizontal gene transfer frequently occurred, transfer may be impeded by factors such as the host’s physical isolation and soil conditions, even among phylogenetically close species. During coevolution with leguminous hosts, the plasmid system, including accessory and symbiotic plasmids, may have evolved over a time span, and provided rhizobial species with the ability to adapt to various environmental conditions and helped them achieve nitrogen fixation. These findings provide new insights into the phylogeny of Rhizobiaceae and advance our understanding of the evolution of symbiotic nitrogen fixation.
10
DNA methylome analysis provides evidence that the expansion of the tea genome is linked to TE bursts. Plant Biotechnol J 2019; 17:826-835. [PMID: 30256509 PMCID: PMC6419580 DOI: 10.1111/pbi.13018] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Received: 06/29/2018] [Revised: 09/14/2018] [Accepted: 09/23/2018] [Indexed: 05/12/2023]
Abstract
DNA methylation is essential for gene regulation, imprinting and silencing of transposable elements (TEs). Although bursts of transposable elements are common in many plant lineages, how plant DNA methylation is related to transposon bursts remains unclear. Here we explore the landscape of DNA methylation of tea, a species thought to have experienced a recent transposon burst event. This species possesses more transposable elements than any other sequenced asterid (potato, tomato, coffee, pepper and tobacco). The overall average DNA methylation levels were found to differ among the tea, potato and tomato genomes, and methylation at CHG sequence sites was found to be significantly higher in tea than in potato or tomato. Moreover, the abundant TEs resulting from burst events not only resulted in tea developing a very large genome size, but also affected many genes involved in important biological processes, including caffeine, theanine and flavonoid metabolic pathway genes. In addition, recently transposed TEs were more heavily methylated than ancient ones, implying that DNA methylation is proportionate to the degree of TE silencing, especially on recently active ones. Taken together, our results show that DNA methylation regulates transposon silencing and may play a role in genome size expansion.
11
Exploring the Limits and Causes of Plastid Genome Expansion in Volvocine Green Algae. Genome Biol Evol 2018; 10:2248-2254. [PMID: 30102347 PMCID: PMC6128376 DOI: 10.1093/gbe/evy175] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Accepted: 08/07/2018] [Indexed: 12/25/2022] Open
Abstract
Plastid genomes are not normally celebrated for being large. But researchers are steadily uncovering algal lineages with big and, in rare cases, enormous plastid DNAs (ptDNAs), such as volvocine green algae. Plastome sequencing of five different volvocine species has revealed some of the largest, most repeat-dense plastomes on record, including that of Volvox carteri (∼525 kb). Volvocine algae have also been used as models for testing leading hypotheses on organelle genome evolution (e.g., the mutational hazard hypothesis), and it has been suggested that ptDNA inflation within this group might be a consequence of low mutation rates and/or the transition from a unicellular to multicellular existence. Here, we further our understanding of plastome size variation in the volvocine line by examining the ptDNA sequences of the colonial species Yamagishiella unicocca and Eudorina sp. NIES-3984 and the multicellular Volvox africanus, which are phylogenetically situated between species with known ptDNA sizes. Although V. africanus is closely related and similar in multicellular organization to V. carteri, its ptDNA was much less inflated than that of V. carteri. Synonymous- and noncoding-site nucleotide substitution rate analyses of these two Volvox ptDNAs suggest that there are drastically different plastid mutation rates operating in the coding versus intergenic regions, supporting the idea that error-prone DNA repair in repeat-rich intergenic spacers is contributing to genome expansion. Our results reinforce the idea that the volvocine line harbors extremes in plastome size but ultimately shed doubt on some of the previously proposed hypotheses for ptDNA inflation within the lineage.
12
A Novel Betabaculovirus Isolated from the Monocot Pest Mocis latipes (Lepidoptera: Noctuidae) and the Evolution of Multiple-Copy Genes. Viruses 2018; 10:v10030134. [PMID: 29547534 PMCID: PMC5869527 DOI: 10.3390/v10030134] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Received: 12/18/2017] [Revised: 03/12/2018] [Accepted: 03/14/2018] [Indexed: 01/29/2023] Open
Abstract
In this report, we describe the genome of a novel baculovirus isolated from the monocot insect pest Mocis latipes, the striped grass looper. The genome is 134,272 bp in length with a G + C content of 38.3%. Based on the concatenated sequence of the 38 baculovirus core genes, we found that the virus is a betabaculovirus closely related to the noctuid-infecting betabaculoviruses including Pseudaletia unipuncta granulovirus (PsunGV), Trichoplusia ni granulovirus (TnGV), Helicoverpa armigera granulovirus (HearGV), and Xestia c-nigrum granulovirus (XecnGV). The virus may constitute a new Betabaculovirus species tentatively named Mocis latipes granulovirus (MolaGV). After gene content analysis, five open reading frames (ORFs) were found to be unique to MolaGV and several auxiliary genes were found including iap-3, iap-5, bro-a, bro-b, and three enhancins. The virus genome lacked both chitinase and cathepsin. We then looked at the evolutionary history of the enhancin gene and found that betabaculovirus acquired this gene from an alphabaculovirus followed by several duplication events. Gene duplication also happened to an endonuclease-like gene. Genomic and gene content analyses revealed both a strict collinearity and gene expansion in the genomes of the MolaGV-related species. We also characterized the granulin gene using a recombinant Autographa californica multiple nucleopolyhedrovirus (AcMNPV) and found that occlusion bodies were produced in the nucleus of infected cells and presented a polyhedral shape and no occluded virions within. Overall, betabaculovirus genome sequencing is of importance to the field as few genomes are publicly accessible. Mocis latipes is a secondary pest of maize, rice, and wheat crops in Brazil. Certainly, both the discovery and description of novel baculoviruses may lead to development of greener and safer pesticides in order to counteract and effectively control crop damage-causing insect populations.
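
The G + C content quoted above is the percentage of guanine and cytosine among the called bases of the genome. A minimal sketch of that calculation follows; the function name `gc_content` and the toy sequences are hypothetical, and ignoring ambiguous bases such as N is a choice made for this sketch rather than something the abstract specifies.

```python
def gc_content(seq):
    """Return the G + C percentage of a DNA sequence, ignoring non-ACGT symbols."""
    seq = seq.upper()
    called = sum(seq.count(base) for base in "ACGT")
    if called == 0:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / called


# Toy example: 2 of the 4 called bases are G or C.
print(gc_content("ATGC"))  # prints 50.0
```

Applied to a full genome sequence, the same ratio would yield the kind of single summary figure reported in the abstract.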
13
The number of genes encoding repeat domain-containing proteins positively correlates with genome size in amoebal giant viruses. Virus Evol 2018; 4:vex039. [PMID: 29308275 PMCID: PMC5753266 DOI: 10.1093/ve/vex039] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Indexed: 12/29/2022] Open
Abstract
Curiously, in viruses, the virion volume appears to be predominantly driven by genome length rather than the number of proteins it encodes or geometric constraints. With their large genome and giant particle size, amoebal viruses (AVs) are ideally suited to study the relationship between genome and virion size and explore the role of genome plasticity in their evolutionary success. Different genomic regions of AVs exhibit distinct genealogies. Although the vertically transferred core genes and their functions are universally conserved across the nucleocytoplasmic large DNA virus (NCLDV) families and are essential for their replication, the horizontally acquired genes are variable across families and are lineage-specific. When compared with other giant virus families, we observed a near-linear increase in the number of genes encoding repeat domain-containing proteins (RDCPs) with the increase in the genome size of AVs. From what is known about the functions of RDCPs in bacteria and eukaryotes and their prevalence in the AV genomes, we envisage important roles for RDCPs in the life cycle of AVs, their genome expansion, and plasticity. This observation also supports the evolution of AVs from a smaller viral ancestor by the acquisition of diverse gene families from the environment, including RDCPs that might have helped in host adaptation.
14
Rapid Increase in Genome Size as a Consequence of Transposable Element Hyperactivity in Wood-White (Leptidea) Butterflies. Genome Biol Evol 2017; 9:2491-2505. [PMID: 28981642 PMCID: PMC5737376 DOI: 10.1093/gbe/evx163] [Citation(s) in RCA: 69] [Impact Index Per Article: 9.9] [Accepted: 08/22/2016] [Indexed: 12/14/2022] Open
Abstract
Characterizing and quantifying genome size variation among organisms and understanding if genome size evolves as a consequence of adaptive or stochastic processes have been long-standing goals in evolutionary biology. Here, we investigate genome size variation and association with transposable elements (TEs) across lepidopteran lineages using a novel genome assembly of the common wood-white (Leptidea sinapis) and population re-sequencing data from both L. sinapis and the closely related L. reali and L. juvernica together with 12 previously available lepidopteran genome assemblies. A phylogenetic analysis confirms established relationships among species, but identifies previously unknown intraspecific structure within Leptidea lineages. The genome assembly of L. sinapis is one of the largest of any lepidopteran taxon so far (643 Mb) and genome size is correlated with abundance of TEs, both in Lepidoptera in general and within Leptidea where L. juvernica from Kazakhstan has considerably larger genome size than any other Leptidea population. Specific TE subclasses have been active in different Lepidoptera lineages with a pronounced expansion of predominantly LINEs, DNA elements, and unclassified TEs in the Leptidea lineage after the split from other Pieridae. The rate of genome expansion in Leptidea in general has been in the range of 4 Mb per million years (My), with an increase in a particular L. juvernica population to 72 Mb/My. The considerable differences in accumulation rates of specific TE classes in different lineages indicate that TE activity plays a major role in genome size evolution in butterflies and moths.
15
The plastid genomes of nonphotosynthetic algae are not so small after all. Commun Integr Biol 2017; 10:e1283080. [PMID: 28377793 PMCID: PMC5363391 DOI: 10.1080/19420889.2017.1283080] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 12/20/2016] [Revised: 01/10/2017] [Accepted: 01/11/2017] [Indexed: 12/27/2022] Open
Abstract
The thing about plastid genomes in nonphotosynthetic plants and algae is that they are usually very small and highly compact. This is not surprising: a heterotrophic existence means that genes for photosynthesis can be easily discarded. But the loss of photosynthesis cannot explain why the plastomes of heterotrophs are so often depauperate in noncoding DNA. If plastid genomes from photosynthetic taxa can span the gamut of compactness, why can't those of nonphotosynthetic species? Well, recently we showed that they can. The free-living, heterotrophic green alga Polytoma uvella has a plastid genome boasting more than 165 kilobases of noncoding DNA, making it the most bloated plastome yet found in a heterotroph. In this addendum to the primary study, we elaborate on why the P. uvella plastome is so inflated, discussing the potential impact of a free-living vs. parasitic lifestyle on plastid genome expansion in nonphotosynthetic lineages.
16
Small homologous blocks in Phytophthora genomes do not point to an ancient whole-genome duplication. Genome Biol Evol 2016; 6:1079-85. [PMID: 24760277 PMCID: PMC4040989 DOI: 10.1093/gbe/evu081] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Indexed: 11/13/2022] Open
Abstract
Genomes of the plant-pathogenic genus Phytophthora are characterized by small duplicated blocks consisting of two consecutive genes (2HOM blocks) and by an elevated abundance of similarly aged gene duplicates. Both properties, in particular the presence of 2HOM blocks, have been attributed to a whole-genome duplication (WGD) at the last common ancestor of Phytophthora. However, large intraspecies synteny—compelling evidence for a WGD—has not been detected. Here, we revisited the WGD hypothesis by deducing the age of 2HOM blocks. Two independent timing methods reveal that the majority of 2HOM blocks arose after divergence of the Phytophthora lineages. In addition, a large proportion of the 2HOM block copies colocalize on the same scaffold. Therefore, the presence of 2HOM blocks does not support a WGD at the last common ancestor of Phytophthora. Thus, genome evolution of Phytophthora is likely driven by alternative mechanisms, such as bursts of transposon activity.
17
Whole-genome sequencing of cultivated and wild peppers provides insights into Capsicum domestication and specialization. Proc Natl Acad Sci U S A 2014; 111:5135-5140. [PMID: 24591624 DOI: 10.4172/2168-9881.s1.013] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Indexed: 05/24/2023] Open
Abstract
As an economic crop, pepper satisfies people's spicy taste and has medicinal uses worldwide. To gain a better understanding of Capsicum evolution, domestication, and specialization, we present here the genome sequence of the cultivated pepper Zunla-1 (C. annuum L.) and its wild progenitor Chiltepin (C. annuum var. glabriusculum). We estimate that the pepper genome expanded ∼0.3 Mya (with respect to the genome of other Solanaceae) by a rapid amplification of retrotransposon elements, resulting in a genome comprised of ∼81% repetitive sequences. Approximately 79% of 3.48-Gb scaffolds containing 34,476 protein-coding genes were anchored to chromosomes by a high-density genetic map. Comparison of cultivated and wild pepper genomes with 20 resequencing accessions revealed molecular footprints of artificial selection, providing us with a list of candidate domestication genes. We also found that dosage compensation effect of tandem duplication genes probably contributed to the pungent diversification in pepper. The Capsicum reference genome provides crucial information for the study of not only the evolution of the pepper genome but also the Solanaceae family, and it will facilitate the establishment of more effective pepper breeding programs.
|
18
|
Abstract
Mitochondria are intracellular organelles where oxidative phosphorylation is carried out to complete ATP synthesis. Mitochondria have their own genome; in metazoans, this is a small, circular molecule encoding 13 electron transport proteins, 22 tRNAs, and 2 rRNAs. In invertebrates, mitochondrial gene rearrangement is common, and it is correlated with increased substitution rates. In vertebrates, mitochondrial gene rearrangement is rare, and its relationship to substitution rate remains unexplored. Mitochondrial genes can also show spatial variation in substitution rates around the genome due to the mechanism of mtDNA replication, which produces a mutation gradient. To date, however, the strength of the mutation gradient and whether movement along the gradient in rearranged (or otherwise modified) genomes impacts genic substitution rates remain unexplored in the majority of vertebrates. Salamanders include both normal mitochondrial genomes and independently derived rearrangements and expansions, providing a rare opportunity to test the effects of large-scale changes to genome architecture on vertebrate mitochondrial gene sequence evolution. We show that: 1) rearranged/expanded genomes have higher substitution rates; 2) most genes in rearranged/expanded genomes maintain their position along the mutation gradient, substitution rates of the genes that do move are unaffected by their new position, and the gradient in salamanders is weak; and 3) genomic rearrangements/expansions occur independent of levels of selective constraint on genes. Together, our results demonstrate that large-scale changes to genome architecture impact mitochondrial gene evolution in predictable ways; however, despite these impacts, the same functional constraints act on mitochondrial protein-coding genes in both modified and normal genomes.
|
19
|
Whole-genome sequencing of cultivated and wild peppers provides insights into Capsicum domestication and specialization. Proc Natl Acad Sci U S A 2014; 111:5135-40. [PMID: 24591624 DOI: 10.1073/pnas.1400975111] [Citation(s) in RCA: 419] [Impact Index Per Article: 41.9] [Indexed: 11/18/2022] Open
Abstract
As an economic crop, pepper satisfies people's spicy taste and has medicinal uses worldwide. To gain a better understanding of Capsicum evolution, domestication, and specialization, we present here the genome sequence of the cultivated pepper Zunla-1 (C. annuum L.) and its wild progenitor Chiltepin (C. annuum var. glabriusculum). We estimate that the pepper genome expanded ∼0.3 Mya (with respect to the genome of other Solanaceae) by a rapid amplification of retrotransposon elements, resulting in a genome composed of ∼81% repetitive sequences. Approximately 79% of 3.48-Gb scaffolds containing 34,476 protein-coding genes were anchored to chromosomes by a high-density genetic map. Comparison of cultivated and wild pepper genomes with 20 resequenced accessions revealed molecular footprints of artificial selection, providing us with a list of candidate domestication genes. We also found that a dosage compensation effect of tandemly duplicated genes probably contributed to the diversification of pungency in pepper. The Capsicum reference genome provides crucial information for the study of not only the evolution of the pepper genome but also of the Solanaceae family, and it will facilitate the establishment of more effective pepper breeding programs.
|
20
|
Abstract
Fungi display a large diversity in genome size and complexity, variation that is often considered to be adaptive. But because nonadaptive processes can also have important consequences on the features of genomes, we investigated the relationship of genetic drift and genome size in the phylum Ascomycota using multiple indicators of genetic drift. We detected a complex relationship between genetic drift and genome size in fungi: genetic drift is associated with genome expansion on broad evolutionary timescales, as hypothesized for other eukaryotes; but within subphyla over smaller timescales, the opposite trend is observed. Moreover, fungi and bacteria display similar patterns of genome degradation that are associated with initial effects of genetic drift. We conclude that changes in genome size within Ascomycota have occurred using two different routes: large-scale genome expansions are catalyzed by increasing drift as predicted by the mutation-hazard model of genome evolution and small-scale modifications in genome size are independent of drift.
|