1
|
Eberhard JR, Wright TF. Rearrangement and evolution of mitochondrial genomes in parrots. Mol Phylogenet Evol 2015; 94:34-46. [PMID: 26291569 DOI: 10.1016/j.ympev.2015.08.011] [Citation(s) in RCA: 63] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2014] [Revised: 07/15/2015] [Accepted: 08/11/2015] [Indexed: 12/27/2022]
Abstract
Mitochondrial genome rearrangements that result in control region duplication have been described for a variety of birds, but the mechanisms leading to their appearance and maintenance remain unclear, and their effect on sequence evolution has not been explored. A recent survey of mitochondrial genomes in the Psittaciformes (parrots) found that control region duplications have arisen independently at least six times across the order. We analyzed complete mitochondrial genome sequences from 20 parrot species, including representatives of each lineage with control region duplications, to document the gene order changes and to examine effects of genome rearrangements on patterns of sequence evolution. The gene order previously reported for Amazona parrots was found for four of the six independently derived genome rearrangements, and a previously undescribed gene order was found in Prioniturus luconensis, representing a fifth clade with rearranged genomes; the gene order resulting from the remaining rearrangement event could not be confirmed. In all rearranged genomes, two copies of the control region are present and are very similar at the sequence level, while duplicates of the other genes involved in the rearrangement show signs of degeneration or have been lost altogether. We compared rates of sequence evolution in genomes with and without control region duplications and did not find a consistent acceleration or deceleration associated with the duplications. This could be due to the fact that most of the genome rearrangement events in parrots are ancient, and additionally, to an effect of body size on evolutionary rate that we found for mitochondrial but not nuclear sequences. Base composition analyses found that relative to other birds, parrots have unusually strong compositional asymmetry (AT- and GC-skew) in their coding sequences, especially at fourfold degenerate sites. Furthermore, we found higher AT skew in species with control region duplications. One potential cause for this compositional asymmetry is that parrots have unusually slow mtDNA replication. If this is the case, then any replicative advantage provided by having a second control region could result in selection for maintenance of both control regions once duplicated.
Collapse
|
Research Support, N.I.H., Extramural |
10 |
63 |
2
|
Costantini M, Musto H. The Isochores as a Fundamental Level of Genome Structure and Organization: A General Overview. J Mol Evol 2017; 84:93-103. [PMID: 28243687 DOI: 10.1007/s00239-017-9785-9] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2016] [Accepted: 02/15/2017] [Indexed: 11/30/2022]
Abstract
The recent availability of a number of fully sequenced genomes (including marine organisms) allowed to map very precisely the isochores, based on DNA sequences, confirming the results obtained before genome sequencing by the ultracentrifugation in CsCl. In fact, the analytical profile of human DNA showed that the vertebrate genome is a mosaic of isochores, typically megabase-size DNA segments that belong to a small number of families characterized by different GC levels. In this review, we will concentrate on some general genome features regarding the compositional organization from different organisms and their evolution, ranging from vertebrates to invertebrates until unicellular organisms. Since isochores are tightly linked to biological properties such as gene density, replication timing, and recombination, the new level of detail provided by the isochore map helped the understanding of genome structure, function, and evolution. All the findings reported here confirm the idea that the isochores can be considered as a "fundamental level of genome structure and organization." We stress that we do not discuss in this review the origin of isochores, which is still a matter of controversy, but we focus on well established structural and physiological aspects.
Collapse
|
Review |
8 |
25 |
3
|
Bolívar P, Guéguen L, Duret L, Ellegren H, Mugal CF. GC-biased gene conversion conceals the prediction of the nearly neutral theory in avian genomes. Genome Biol 2019; 20:5. [PMID: 30616647 PMCID: PMC6322265 DOI: 10.1186/s13059-018-1613-z] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2018] [Accepted: 12/17/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The nearly neutral theory of molecular evolution predicts that the efficacy of natural selection increases with the effective population size. This prediction has been verified by independent observations in diverse taxa, which show that life-history traits are strongly correlated with measures of the efficacy of selection, such as the dN/dS ratio. Surprisingly, avian taxa are an exception to this theory because correlations between life-history traits and dN/dS are apparently absent. Here we explore the role of GC-biased gene conversion on estimates of substitution rates as a potential driver of these unexpected observations. RESULTS We analyze the relationship between dN/dS estimated from alignments of 47 avian genomes and several proxies for effective population size. To distinguish the impact of GC-biased gene conversion from selection, we use an approach that accounts for non-stationary base composition and estimate dN/dS separately for changes affected or unaffected by GC-biased gene conversion. This analysis shows that the impact of GC-biased gene conversion on substitution rates can explain the lack of correlations between life-history traits and dN/dS. Strong correlations between life-history traits and dN/dS are recovered after accounting for GC-biased gene conversion. The correlations are robust to variation in base composition and genomic location. CONCLUSIONS Our study shows that gene sequence evolution across a wide range of avian lineages meets the prediction of the nearly neutral theory, the efficacy of selection increases with effective population size. Moreover, our study illustrates that accounting for GC-biased gene conversion is important to correctly estimate the strength of selection.
Collapse
|
research-article |
6 |
20 |
4
|
Abstract
Flaviviruses present substantial differences in their host range and transmissibility. We studied the evolution of base composition, dinucleotide biases, codon usage and amino acid frequencies in the genus Flavivirus within a phylogenetic framework by principal components analysis. There is a mutual interplay between the evolutionary history of flaviviruses and their respective vectors and/or hosts. Hosts associated to distinct phylogenetic groups may be driving flaviviruses at different pace and through various sequence landscapes, as can be seen for viruses associated with Aedes or Culex spp., although phylogenetic inertia cannot be ruled out. In some cases, viruses face even opposite forces. For instance, in tick-borne flaviviruses, while vertebrate hosts exert pressure to deplete their CpG, tick vectors drive them to exhibit GC-rich codons. Within a vertebrate environment, natural selection appears to be acting on the viral genome to overcome the immune system. On the other side, within an arthropod environment, mutational biases seem to be the dominant forces.
Collapse
|
Research Support, Non-U.S. Gov't |
8 |
16 |
5
|
Forsdyke DR. Implications of HIV RNA structure for recombination, speciation, and the neutralism-selectionism controversy. Microbes Infect 2013; 16:96-103. [PMID: 24211872 DOI: 10.1016/j.micinf.2013.10.017] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2013] [Revised: 10/24/2013] [Accepted: 10/24/2013] [Indexed: 11/29/2022]
Abstract
The conflict between the needs to encode both a protein (impaired by non-synonymous mutation), and nucleic acid structure (impaired by synonymous or non-synonymous mutation), can sometimes be resolved in favour of the nucleic acid because its structure is critical for a selectively advantageous genome-wide activity--recombination. However, above a sequence difference threshold, recombination is impaired. It may then be advantageous for new species to arise. Building on the work of Grantham and others critical of the neutralist viewpoint, heuristic support for this hypothesis emerged from studies of the base composition and structure of retroviral genomes. The extreme enrichment in the purine A of the RNA of human immunodeficiency virus (HIV-1), parallels the mild purine-loading of the RNAs of most organisms, for which there is an adaptive explanation--immune evasion. However, human T cell leukaemia virus (HTLV-1), with the potential to invade the same host cell, shows extreme enrichment in the pyrimidine C. Assuming the low GC% HIV and the high GC% HTLV-1 to share a common ancestor, it was postulated that differences in GC% had arisen to prevent homologous recombination between these emerging lentiviral species. Sympatrically isolated by this intracellular reproductive barrier, prototypic HIV-1 seized the AU-rich (low GC%) high ground (thus committing to purine A rather than purine G). Prototypic HTLV-1 forwent this advantage and evolved an independent evolutionary strategy--similar to that of the GC%-rich Epstein-Barr virus--profound latency maintained by transcription of one purine-rich mRNA. The evidence supporting these interpretations is reviewed.
Collapse
|
Review |
12 |
12 |
6
|
Neutralism versus selectionism: Chargaff's second parity rule, revisited. Genetica 2021; 149:81-88. [PMID: 33880685 PMCID: PMC8057000 DOI: 10.1007/s10709-021-00119-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2021] [Accepted: 04/09/2021] [Indexed: 11/03/2022]
Abstract
Of Chargaff's four "rules" on DNA base frequencies, the functional interpretation of his second parity rule (PR2) is the most contentious. Thermophile base compositions (GC%) were taken by Galtier and Lobry (1997) as favoring Sueoka's neutral PR2 hypothesis over Forsdyke's selective PR2 hypothesis, namely that mutations improving local within-species recombination efficiency had generated a genome-wide potential for the strands of duplex DNA to separate and initiate recombination through the "kissing" of the tips of stem-loops. However, following Chargaff's GC rule, base composition mainly reflects a species-specific, genome-wide, evolutionary pressure. GC% could not have consistently followed the dictates of temperature, since it plays fundamental roles in both sustaining species integrity and, through primarily neutral genome-wide mutation, fostering speciation. Evidence for a local within-species recombination-initiating role of base order was obtained with a novel technology that masked the contribution of base composition to nucleic acid folding energy. Forsdyke's results were consistent with his PR2 hypothesis, appeared to resolve some root problems in biology and provided a theoretical underpinning for alignment-free taxonomic analyses using relative oligonucleotide frequencies (k-mer analysis). Moreover, consistent with Chargaff's cluster rule, discovery of the thermoadaptive role of the "purine-loading" of open reading frames made less tenable the Galtier-Lobry anti-selectionist arguments.
Collapse
|
Review |
4 |
7 |
7
|
Wang J, Li X, Do Kim K, Scanlon MJ, Jackson SA, Springer NM, Yu J. Genome-wide nucleotide patterns and potential mechanisms of genome divergence following domestication in maize and soybean. Genome Biol 2019; 20:74. [PMID: 31018867 PMCID: PMC6482504 DOI: 10.1186/s13059-019-1683-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2018] [Accepted: 03/28/2019] [Indexed: 01/21/2023] Open
Abstract
BACKGROUND Plant domestication provides a unique model to study genome evolution. Many studies have been conducted to examine genes, genetic diversity, genome structure, and epigenome changes associated with domestication. Interestingly, domesticated accessions have significantly higher [A] and [T] values across genome-wide polymorphic sites than accessions sampled from the corresponding progenitor species. However, the relative contributions of different genomic regions to this genome divergence pattern and underlying mechanisms have not been well characterized. RESULTS Here, we investigate the genome-wide base-composition patterns by analyzing millions of SNPs segregating among 100 accessions from a teosinte-maize comparison set and among 302 accessions from a wild-domesticated soybean comparison set. We show that non-genic part of the genome has a greater contribution than genic SNPs to the [AT]-increase observed between wild and domesticated accessions in maize and soybean. The separation between wild and domesticated accessions in [AT] values is significantly enlarged in non-genic and pericentromeric regions. Motif frequency and sequence context analyses show the motifs (PyCG) related to solar-UV signature are enriched in these regions, particularly when they are methylated. Additional analysis using population-private SNPs also implicates the role of these motifs in relatively recent mutations. With base-composition across polymorphic sites as a genome phenotype, genome scans identify a set of putative candidate genes involved in UV damage repair pathways. CONCLUSIONS The [AT]-increase is more pronounced in genomic regions that are non-genic, pericentromeric, transposable elements; methylated; and with low recombination. Our findings establish important links among UV radiation, mutation, DNA repair, methylation, and genome evolution.
Collapse
|
research-article |
6 |
6 |
8
|
Quandt EM, Traverse CC, Ochman H. Local genic base composition impacts protein production and cellular fitness. PeerJ 2018; 6:e4286. [PMID: 29362699 PMCID: PMC5774297 DOI: 10.7717/peerj.4286] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2017] [Accepted: 01/01/2018] [Indexed: 01/25/2023] Open
Abstract
The maintenance of a G + C content that is higher than the mutational input to a genome provides support for the view that selection serves to increase G + C contents in bacteria. Recent experimental evidence from Escherichia coli demonstrated that selection for increasing G + C content operates at the level of translation, but the precise mechanism by which this occurs is unknown. To determine the substrate of selection, we asked whether selection on G + C content acts across all sites within a gene or is confined to particular genic regions or nucleotide positions. We systematically altered the G + C contents of the GFP gene and assayed its effects on the fitness of strains harboring each variant. Fitness differences were attributable to the base compositional variation in the terminal portion of the gene, suggesting a connection to the folding of a specific protein feature. Variants containing sequence features that are thought to result in rapid translation, such as low G + C content and high levels of codon adaptation, displayed highly reduced growth rates. Taken together, our results show that purifying selection acting against A and T mutations most likely results from their tendency to increase the rate of translation, which can perturb the dynamics of protein folding.
Collapse
|
Journal Article |
7 |
4 |
9
|
Caputi L, Osca D, Ceruso M, Venuti I, Sepe RM, Anastasio A, D'Aniello S, Crocetta F, Pepe T, Sordino P. The complete mitochondrial genome of the white seabream Diplodus sargus (Perciformes: Sparidae) from the Tyrrhenian sea. MITOCHONDRIAL DNA PART B-RESOURCES 2021; 6:2581-2583. [PMID: 34409152 PMCID: PMC8366610 DOI: 10.1080/23802359.2021.1915209] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
The white seabream Diplodus sargus (Linnaeus, 1758) is a species of interest for commercial fisheries throughout its range of distribution and it is also reared using aquaculture techniques. Herein, we present the first complete sequence and annotation of the mitochondrial genome of this species. The D. sargus mitogenome is 16,515 base pairs in length and contains 13 protein-coding genes, 2 rRNA, 22 tRNA, and 2 non-coding regions (D-loop and L-origin). The overall nucleotide composition is: 27.3% A, 28.9% C, 26.8% T, and 17.0% G. Maximum likelihood analyses placed D. sargus as a sister species of Diplodus puntazzo. This study provides valuable information for further studying identification methods and evolutionary relationships of Sparidae species.
Collapse
|
|
4 |
4 |
10
|
Ruggiero RP, Boissinot S. Variation in base composition underlies functional and evolutionary divergence in non-LTR retrotransposons. Mob DNA 2020; 11:14. [PMID: 32280379 PMCID: PMC7140322 DOI: 10.1186/s13100-020-00209-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2019] [Accepted: 03/24/2020] [Indexed: 12/03/2022] Open
Abstract
Background Non-LTR retrotransposons often exhibit base composition that is markedly different from the nucleotide content of their host’s gene. For instance, the mammalian L1 element is AT-rich with a strong A bias on the positive strand, which results in a reduced transcription. It is plausible that the A-richness of mammalian L1 is a self-regulatory mechanism reflecting a trade-off between transposition efficiency and the deleterious effect of L1 on its host. We examined if the A-richness of L1 is a general feature of non-LTR retrotransposons or if different clades of elements have evolved different nucleotide content. We also investigated if elements belonging to the same clade evolved towards different base composition in different genomes or if elements from different clades evolved towards similar base composition in the same genome. Results We found that non-LTR retrotransposons differ in base composition among clades within the same host but also that elements belonging to the same clade differ in base composition among hosts. We showed that nucleotide content remains constant within the same host over extended period of evolutionary time, despite mutational patterns that should drive nucleotide content away from the observed base composition. Conclusions Our results suggest that base composition is evolving under selection and may be reflective of the long-term co-evolution between non-LTR retrotransposons and their host. Finally, the coexistence of elements with drastically different base composition suggests that these elements may be using different strategies to persist and multiply in the genome of their host.
Collapse
|
Journal Article |
5 |
4 |
11
|
Zhao Y, Dong L, Jiang C, Wang X, Xie J, Rashid MAR, Liu Y, Li M, Bu Z, Wang H, Ma X, Sun S, Wang X, Bo C, Zhou T, Kong L. Distinct nucleotide patterns among three subgenomes of bread wheat and their potential origins during domestication after allopolyploidization. BMC Biol 2020; 18:188. [PMID: 33267868 PMCID: PMC7713161 DOI: 10.1186/s12915-020-00917-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Accepted: 11/05/2020] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND The speciation and fast global domestication of bread wheat have made a great impact on three subgenomes of bread wheat. DNA base composition is an essential genome feature, which follows the individual-strand base equality rule and [AT]-increase pattern at the genome, chromosome, and polymorphic site levels among thousands of species. Systematic analyses on base compositions of bread wheat and its wild progenitors could facilitate further understanding of the evolutionary pattern of genome/subgenome-wide base composition of allopolyploid species and its potential causes. RESULTS Genome/subgenome-wide base-composition patterns were investigated by using the data of polymorphic site in 93 accessions from worldwide populations of bread wheat, its diploid and tetraploid progenitors, and their corresponding reference genome sequences. Individual-strand base equality rule and [AT]-increase pattern remain in recently formed hexaploid species bread wheat at the genome, subgenome, chromosome, and polymorphic site levels. However, D subgenome showed the fastest [AT]-increase across polymorphic site from Aegilops tauschii to bread wheat than that on A and B subgenomes from wild emmer to bread wheat. The fastest [AT]-increase could be detected almost all chromosome windows on D subgenome, suggesting different mechanisms between D and other two subgenomes. Interestingly, the [AT]-increase is mainly contributed by intergenic regions at non-selective sweeps, especially the fastest [AT]-increase of D subgenome. Further transition frequency and sequence context analysis indicated that three subgenomes shared same mutation type, but D subgenome owns the highest mutation rate on high-frequency mutation type. The highest mutation rate on D subgenome was further confirmed by using a bread-wheat-private SNP set. The exploration of loci/genes related to the [AT] value of D subgenome suggests the fastest [AT]-increase of D subgenome could be involved in DNA repair systems distributed on three subgenomes of bread wheat. CONCLUSIONS The highest mutation rate is detected on D subgenome of bread wheat during domestication after allopolyploidization, leading to the fastest [AT]-increase pattern of D subgenome. The phenomenon may come from the joint action of multiple repair systems inherited from its wild progenitors.
Collapse
|
research-article |
5 |
3 |
12
|
Hao CL, Arken K, Kadir M, Zhang WR, Rong MJ, Wei NW, Liu YJ, Yue C. The complete mitochondrial genomes of Paradiplozoon yarkandense and Paradiplozoon homoion confirm that Diplozoidae evolve at an elevated rate. Parasit Vectors 2022; 15:149. [PMID: 35477556 PMCID: PMC9044634 DOI: 10.1186/s13071-022-05275-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Accepted: 04/04/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Diplozoidae are monogenean (Monogenea: Polyopisthocotylea) fish parasites characterised by a unique life history: two larvae permanently fuse into an X-shaped "Siamese" organism. Taxonomy and phylogeny of Diplozoidae and Polyopisthocotylea remain unresolved due to the unavailability of molecular markers with sufficiently high resolution. Mitogenomes may be a suitable candidate, but there are currently only 12 available for the Polyopisthocotylea (three for Diplozoidae). The only available study of diplozoid mitogenomes found unique base composition patterns and elevated evolution rates in comparison with other Monogenean mitogenomes. METHODS To further explore their evolution and generate molecular data for evolutionary studies, we sequenced the complete mitogenomes of two Diplozoidae species, Paradiplozoon homoion and Paradiplozoon yarkandense, and conducted a number of comparative mitogenomic analyses with other polyopisthocotyleans. RESULTS We found further evidence that mitogenomes of Diplozoidae evolve at a unique, elevated rate, which was reflected in their exceptionally long branches, large sizes, unique base composition, skews, and very low gene sequence similarity levels between the two newly sequenced species. They also exhibited remarkably large overlaps between some genes. Phylogenetic analysis of Polyopisthocotylea resolved all major taxa as monophyletic, and Mazocraeidea was split into two major clades: (Diplozoidae) + (all four remaining families: Diclidophoridae, Chauhaneidae, Mazocraeidae and Microcotylidae). It also provided further confirmation that the genus Paradiplozoon is paraphyletic and requires a taxonomic revision, so the two species may have to be renamed Indodiplozoon homoion and Diplozoon yarkandense comb. nov. CONCLUSIONS Although our findings indicate that mitogenomes may be a promising tool for resolving the phylogeny of Polyopisthocotylea, elevated evolutionary rates of Diplozoidae may cause phylogenetic artefacts, so future studies should pay caution to this problem. Furthermore, as the reason for their elevated evolution remains unknown, Diplozoidae are a remarkably interesting lineage for other types of evolutionary mitogenomic studies.
Collapse
|
|
3 |
3 |
13
|
Bohlin J, Rose B, Brynildsrud O, Birgitte Freiesleben De Blasio. A simple stochastic model describing genomic evolution over time of GC content in microbial symbionts. J Theor Biol 2020; 503:110389. [PMID: 32634385 DOI: 10.1016/j.jtbi.2020.110389] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2019] [Revised: 04/21/2020] [Accepted: 06/24/2020] [Indexed: 11/29/2022]
Abstract
An organism's genomic base composition is usually summarized by its AT or GC content due to Chargaff's parity laws. Variation in prokaryotic GC content can be substantial between taxa but is generally small within microbial genomes. This variation has been found to correlate with both phylogeny and environmental factors. Since novel single-nucleotide polymorphisms (SNPs) within genomes are at least partially linked to the environment through natural selection, SNP GC content can be considered a compound measure of an organism's environmental influences, lifestyle, phylogeny as well as other more or less random processes. While there are several models describing genomic GC content few, if any, consider AT/GC mutation rates subjected to random perturbations. We present a mathematical model that describes how GC content in microbial genomes evolves over time as a function of the AT → GC and GC → AT mutation rates with Gaussian white noise disturbances. The model, which is suited specifically to non-recombining vertically transmitted prokaryotic symbionts, suggests that small differences in the AT/GC mutation rates can lead to profound differences in outcome due to the ensuing stochastic process. In other words, the model indicates that time to extinction could be a consequence of the mutation rate trajectory on which the symbiont embarked early on in its evolutionary history.
Collapse
|
Journal Article |
5 |
3 |
14
|
Liu L, Chen L, Wu Y, Wang X, Liao F, Wang D, Huang X, Deng S, Liu L. Base composition of the Danio chrysotaenitus complete mitochondrial genome. Mitochondrial DNA A DNA Mapp Seq Anal 2015; 27:3370-1. [PMID: 25738218 DOI: 10.3109/19401736.2015.1018225] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
In this study, 16 sets of primers were used to amplify contiguous, overlapping segments of the complete mitochondrial DNA (mtDNA) of Danio chrysotaenitus in order to characterize and compare their mitochondrial genomes. The total length of the mitochondrial genome is 16,608 bp and deposited in the GenBank with accession numbers KP407138. The organization of the mitochondrial genomes was similar to those reported from other Mountain carp fishes mitochondrial genomes containing 37 genes (13 protein-coding genes, 2 ribosomal RNA, and 22 transfer RNAs) and a major non-coding control region (D-loop region). Most genes were encoded on the H-strand, except for the ND6 and 8 tRNA genes, encoding on the L-strand. The nucleotide skewness for the coding strands of Danio chrysotaenitus (AT-skew = 0.10, GC-skew = -0.25) is biased toward T and G. The complete mitogenome may provide important date set for the study of genetic mechanism of Danio chrysotaenitus.
Collapse
|
|
10 |
2 |
15
|
Wang X, Chen D, Zhou X, Zhao F, Yao J, An M. Complete mitochondrial genome of javeline goby (Synechogobius hasta). Mitochondrial DNA A DNA Mapp Seq Anal 2016; 27:4586-4587. [PMID: 27607305 DOI: 10.3109/19401736.2014.987254] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
In this study, we cloned and sequenced the complete mitochondrial DNA (mtDNA) of Synechogobius hasta to characterize and compare their mitochondrial genomes. The total length of the mitochondrial genome is 16,655 bp with an accession number KM891736. The organization of the mitochondrial genomes was similar to those reported from other fish mitochondrial genomes containing 37 genes (13 protein-coding genes, 2 ribosomal RNA and 22 transfer RNAs) and a major non-coding control region. Except for ND6 and 8 tRNAs, all other genes are encoded on the heavy strand. The base composition of the 13 mitochondrial protein-coding genes in the third position was relatively low (9.7%). The complete mitogenome may provide important date set for the study of genetic mechanism of S. hasta.
Collapse
|
|
9 |
2 |
16
|
Wang F, He E, Li Y, Cai X, Ma W. Inheritance of the complete mitochondrial genomes rainbow trout (Oncorhynchus mykiss) triploid. Mitochondrial DNA A DNA Mapp Seq Anal 2015; 27:4093-4094. [PMID: 25629478 DOI: 10.3109/19401736.2014.1003859] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
In this study, we cloned and sequenced the complete mitochondrial DNA (mtDNA) of Oncorhynchus mykiss triploid to characterize and compare their mitochondrial genomes. The total length of the mitochondrial genome is 16,656 bp with an accession number KP085590. The organization of the mitochondrial genomes was similar to those reported from other fish mitochondrial genomes containing 37 genes (13 protein-coding genes, 2 ribosomal RNA, and 22 transfer RNAs) and a major non-coding control region. Except for ND6 and 8 tRNAs, all other genes are encoded on the heavy strand. The nucleotide skewness for the coding strands of O. mykiss triploid (AT-skew = -0.41, GC-skew = 0.34) is biased toward T and G. The complete mitogenome may provide important date set for the study of genetic mechanism of O. mykiss.
Collapse
|
|
10 |
1 |
17
|
Variation and constraints in species-specific promoter sequences. J Theor Biol 2014; 363:357-66. [PMID: 25149367 DOI: 10.1016/j.jtbi.2014.08.006] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2014] [Revised: 07/30/2014] [Accepted: 08/04/2014] [Indexed: 11/24/2022]
Abstract
A vast literature is nowadays devoted to the search of correlations between transcription related functions and the composition of sequences upstream the Transcription Start Site. Little is known about the possible functional effects of nucleotide distributions on the conformational landscape of DNA in such regions. We have used suitable statistical indicators for identifying sequences that may play an important role in regulating transcription processes. In particular, we have analyzed base composition, periodicity and information content in sets of aligned promoters clustered according to functional information in order to obtain an insight on the main structural differences between promoters regulating genes with different functions. Our results show that when we select promoters according to some biological information, in a single species, at least in vertebrates, we observe structurally different classes of sequences. The highly variable and differentiated gene expression patterns may explain the great extent of structural differentiation observed in complex organisms. In fact, despite our analysis is focused on Homo sapiens, we provide also a comparison with other species, selected at different positions in the phylogenetic tree.
Collapse
|
Journal Article |
11 |
1 |
18
|
Hao C, Liu Y, Wei N, Arken K, Shi C, Yue C. The complete mitochondrial genomes of the Leuciscus baicalensis and Rutilus rutilus: a detailed genomic comparison among closely related species of the Leuciscinae subfamily. Gene 2023:147535. [PMID: 37328078 DOI: 10.1016/j.gene.2023.147535] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Revised: 05/04/2023] [Accepted: 05/31/2023] [Indexed: 06/18/2023]
Abstract
Cyprinidae is the largest family in the order of freshwater fish Cypriniformes. Increased subfamily members of Cyprinidae have been suggested to be re-classified for decades. In this study, we sequenced the mitochondrial genomes (mitogenomes) of Leuciscus baicalensis and Rutilus rutilus collected from northwest China and compared with other closely related species to determine their associated family or subfamily. We used Illumina NovaSeq to sequence the entire mitochondrial genomes of Leuciscus baicalensis and Rutilus rutilus and characterized the mitogenomes by the gene structure, gene order, and the secondary structures of the 22 tRNA genes. We compared mitogenome features of Leuciscinae with other subfamilies in Cyprinidae. We used the analytic Bayesian Information and Maximum Likelihood methods to determine phylogenetic trees of 13 PCGs. The mitogenomes of Leuciscus baicalensis and Rutilus rutilus were 16,607 bp and 16,606 bp, respectively. Organization and location of these genes were consistent with already studied Leuciscinae fishes. Synonymous codon usage was conservative in Leuciscinae as compared with other subfamilies in Cyprinidae. Phylogenetic analysis indicated that Leuciscinae was a monophyletic group, and genus Leuciscus was a paraphyletic group. Our approach, for the first time, of studying comparative mitochondrial genomics and phylogenetics together provided a supportive platform to the analysis of population genetics and phylogeny for Leuciscinae. Our results indicated a promising potential of comparative mitochondrial genomics in the manifestation of phylogenetic relationships between fishes, leading us to a suggestion that mitogenomes should be routinely considered in clarifying phylogenetics of family and subfamily members of fish.
Collapse
|
|
2 |
|
19
|
Peng H, Liu Q, Xiao T. Inheritance of the complete mitochondrial genomes Cyprinus capio furong(♀) × Cyprinus carpio var.singguonensis(♂). Mitochondrial DNA A DNA Mapp Seq Anal 2015; 27:3105-6. [PMID: 25648921 DOI: 10.3109/19401736.2015.1007286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
In this study, 15 sets of primers were used to amplify contiguous, overlapping segments of the complete mitochondrial DNA (mtDNA) of C. capio furong(♀) × C. carpio var.singguonensis(♂) in order to characterize and compare their mitochondrial genomes. The total length of the mitochondrial genome was 16,581 bp and deposited in the GenBank with the accession number KP210473. The organization of the mitochondrial genomes contained 37 genes (13 protein-coding genes, 2 ribosomal RNA and 22 transfer RNAs) and a major non-coding control region which was similar to those reported mitochondrial genomes. Most genes were encoded on the H-strand, except for the ND6 and 8 tRNA genes, encoding on the L-strand. The nucleotide skewness for the coding strands of C. capio furong(♀) × C. carpio var.singguonensis(♂) (AT-skew = 0.12, GC-skew = -0.27) were biased toward T and G. The complete mitogenome may provide important date for the study of genetic mechanism of C. capio furong(♀) × C. carpio var.singguonensis(♂).
Collapse
|
|
10 |
|
20
|
Bohlin J, Pettersson JHO. Compression rates of microbial genomes are associated with genome size and base composition. Genomics Inform 2024; 22:16. [PMID: 39390533 PMCID: PMC11468749 DOI: 10.1186/s44342-024-00018-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2024] [Accepted: 09/10/2024] [Indexed: 10/12/2024] Open
Abstract
BACKGROUND To what degree a string of symbols can be compressed reveals important details about its complexity. For instance, strings that are not compressible are random and carry a low information potential while the opposite is true for highly compressible strings. We explore to what extent microbial genomes are amenable to compression as they vary considerably both with respect to size and base composition. For instance, microbial genome sizes vary from less than 100,000 base pairs in symbionts to more than 10 million in soil-dwellers. Genomic base composition, often summarized as genomic AT or GC content due to the similar frequencies of adenine and thymine on one hand and cytosine and guanine on the other, also vary substantially; the most extreme microbes can have genomes with AT content below 25% or above 85% AT. Base composition determines the frequency of DNA words, consisting of multiple nucleotides or oligonucleotides, and may therefore also influence compressibility. Using 4,713 RefSeq genomes, we examined the association between compressibility, using both a DNA based- (MBGC) and a general purpose (ZPAQ) compression algorithm, and genome size, AT content as well as genomic oligonucleotide usage variance (OUV) using generalized additive models. RESULTS We find that genome size (p < 0.001) and OUV (p < 0.001) are both strongly associated with genome redundancy for both type of file compressors. The DNA-based MBGC compressor managed to improve compression with approximately 3% on average with respect to ZPAQ. Moreover, MBGC detected a significant (p < 0.001) compression ratio difference between AT poor and AT rich genomes which was not detected with ZPAQ. CONCLUSION As lack of compressibility is equivalent to randomness, our findings suggest that smaller and AT rich genomes may have accumulated more random mutations on average than larger and AT poor genomes which, in turn, were significantly more redundant. Moreover, we find that OUV is a strong proxy for genome compressibility in microbial genomes. The ZPAQ compressor was found to agree with the MBGC compressor, albeit with a poorer performance, except for the compressibility of AT-rich and AT-poor/GC-rich genomes.
Collapse
|
research-article |
1 |
|
21
|
Niu Y, Wu X, Zhang C, Wei Y. Complete mitochondrial genome sequence of Hemiculter bleekeri. MITOCHONDRIAL DNA PART B-RESOURCES 2016; 1:909-910. [PMID: 33473674 PMCID: PMC7800471 DOI: 10.1080/23802359.2016.1186506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
In our research, the complete mitochondrial DNA (mtDNA) of Hemiculter bleekeri was obtained from the dorsal myotome of the fish. The total length of the mitochondrial genome is 16,617 bp and deposited in GenBank with accession number KU198332. The gene arrangement was similar to other bony fishes, which contained 37 genes (13 protein-coding genes, 2 ribosomal RNA and 22 transfer RNAs) and a major non-coding control region. The mtDNA may provide some important genetic background information of this valuable fish. The G contents was relatively low in the total mtDNA with the GCskewthinsp;= -0.22. The negative GC-skew ranging from -0.51(ATP8) to -0.19(CO1), and the more positive AT-skew varying from -0.26(ATP8) to -0.03(CO1). The phylogenetic analysis demonstrated that H. Bleekeri was clustered together with Hemiculter leucisculus, which could be a sister species.
Collapse
|
|
9 |
|
22
|
Hao CL, Yang YY, Wei NW, Liu YJ, Shi CX, Wang JP, Zhang L, Xia SZ, Yue C. Complete mitochondrial genomes of Dactylogyrus crucifer and Dactylogyrus zandti reveal distinct patterns of codon usage within Dactylogyrus. Gene 2025; 933:148935. [PMID: 39255859 DOI: 10.1016/j.gene.2024.148935] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2024] [Revised: 08/24/2024] [Accepted: 09/06/2024] [Indexed: 09/12/2024]
Abstract
Monogeneans of the genus Dactylogyrus Diesing, 1850, the largest genus in the family Dactylogyridae, mostly parasitize the gills of cyprinoid hosts; however, only 3 Dactylogyrus' mitochondrial genomes (mitogenomes) are studied so far. The aim of this research is to extend our understanding of the mitogenomes of Dactylogyrus. We sequenced the mitogenomes of D. crucifer and D. zandti isolated from Rutilus rutilus and Abramis brama orientalis in northwest China, and then we compared these mitogenomes with other monogeneans. We used Illumina NovaSeq to sequence the entire mitochondrial genomes of D. crucifer and D. zandti and characterized the mitogenomes to understand the gene structure, gene identity, the secondary structures of the 22 tRNA genes, and relative synonymous codon usage. We used the analytic Bayesian Information and Maximum Likelihood methods to determine their associated phylogenetic trees. The mitogenomes of D. crucifer and D. zandti were 14,403 and 18,584 bp, respectively. Organization and positioning of these genes were in accordance with Dactylogyrus lamellatus and Dactylogyrus tuba. The nucleotide composition of Dactylogyridae was different from other families of Monogenea, and the A+T count of genus Dactylogyrus (54 - 58.4 %) was lower than other genus species of the family Dactylogyridea (63.9 - 78.4 %) in protein-coding genes. Dactylogyrus members displayed a codon usage bias. The relative synonymous codon used by Dactylogyrus was not conserved and was lower than other monogeneans. The codon use patterns of closely-related species isolated from closely-related hosts were identical. Phylogenetic analyses using mitogenomic dataset produced Dactylogyrus isolated from host subfamily Leuciscinae formed a sister-group. Our results contributed significantly to an increased database of mitogenomes, more than 50 %, for Dactylogyrus that may help future studies of mitochondrial genes and codon uses for the analysis of monogenean phylogenetics.
Collapse
|
|
1 |
|