51
|
Abstract
Models of codon evolution have attracted particular interest because of their unique capabilities to detect selection forces and their high fit when applied to sequence evolution. We described here a novel approach for modeling codon evolution, which is based on Kronecker product of matrices. The 61 × 61 codon substitution rate matrix is created using Kronecker product of three 4 × 4 nucleotide substitution matrices, the equilibrium frequency of codons, and the selection rate parameter. The entities of the nucleotide substitution matrices and selection rate are considered as parameters of the model, which are optimized by maximum likelihood. Our fully mechanistic model allows the instantaneous substitution matrix between codons to be fully estimated with only 19 parameters instead of 3,721, by using the biological interdependence existing between positions within codons. We illustrate the properties of our models using computer simulations and assessed its relevance by comparing the AICc measures of our model and other models of codon evolution on simulations and a large range of empirical data sets. We show that our model fits most biological data better compared with the current codon models. Furthermore, the parameters in our model can be interpreted in a similar way as the exchangeability rates found in empirical codon models.
Collapse
Affiliation(s)
- Maryam Zaheri
- Department of Ecology and Evolution, Biophore, University of Lausanne, 1015 Lausanne, SwitzerlandSwiss Institute of Bioinformatics, Genopode, Quartier Sorge, 1015 Lausanne, Switzerland
| | - Linda Dib
- Department of Ecology and Evolution, Biophore, University of Lausanne, 1015 Lausanne, SwitzerlandSwiss Institute of Bioinformatics, Genopode, Quartier Sorge, 1015 Lausanne, Switzerland
| | - Nicolas Salamin
- Department of Ecology and Evolution, Biophore, University of Lausanne, 1015 Lausanne, SwitzerlandSwiss Institute of Bioinformatics, Genopode, Quartier Sorge, 1015 Lausanne, Switzerland
| |
Collapse
|
52
|
Jeffery NW, Gregory TR. Genome size estimates for crustaceans using Feulgen image analysis densitometry of ethanol-preserved tissues. Cytometry A 2014; 85:862-8. [PMID: 25139836 DOI: 10.1002/cyto.a.22516] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2014] [Accepted: 07/18/2014] [Indexed: 11/08/2022]
Abstract
Crustaceans are enormously diverse both phylogenetically and ecologically, but they remain substantially underrepresented in the existing genome size database. An expansion of this dataset could be facilitated if it were possible to obtain genome size estimates from ethanol-preserved specimens. In this study, two tests were performed in order to assess the reliability of genome size data generated using preserved material. First, the results of estimates based on flash-frozen versus ethanol-preserved material were compared across 37 species of crustaceans that differ widely in genome size. Second, a comparison was made of specimens from a single species that had been stored in ethanol for 1-14 years. In both cases, the use of gill tissue in Feulgen image analysis densitometry proved to be a very viable approach. This finding is of direct relevance to both new studies of field-collected crustaceans as well as potential studies based on existing collections.
Collapse
Affiliation(s)
- Nicholas W Jeffery
- Department of Integrative Biology, University of Guelph, Guelph, Ontario, N1G 2W1, Canada
| | | |
Collapse
|
53
|
Bainard JD, Gregory TR. Genome size evolution: patterns, mechanisms, and methodological advances. Genome 2014; 56:vii-viii. [PMID: 24168634 DOI: 10.1139/gen-2013-0170] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Affiliation(s)
- J D Bainard
- Department of Plant Sciences, University of Saskatchewan, Saskatoon SK, S7N 5A8, Canada
| | | |
Collapse
|
54
|
Zieliński P, Stuglik MT, Dudek K, Konczal M, Babik W. Development, validation and high-throughput analysis of sequence markers in nonmodel species. Mol Ecol Resour 2013; 14:352-60. [PMID: 24103386 DOI: 10.1111/1755-0998.12171] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2013] [Revised: 08/23/2013] [Accepted: 09/13/2013] [Indexed: 11/28/2022]
Abstract
DNA sequences derived from multiple regions of the nuclear genome are essential for historical inferences in the fields of phylogeography and phylogenetics. The appropriate markers should be single-copy, variable, easy to amplify from multiple samples and easy to sequence using high-throughput technologies. This may be difficult to achieve for species lacking sequenced genomes and particularly challenging for species possessing large genomes, which consist mostly of repetitive sequences. Here, we present a cost-effective, broadly applicable framework for designing, validating and high-throughput sequencing of multiple markers in nonmodel species without sequenced genomes. We demonstrate its utility in two closely related species of newts, representatives of urodeles, a vertebrate group characterized by large genomes. We show that over 80 markers, c. 600 bp each, developed mainly from 3' untranslated transcript regions (3'UTR) may be effectively multiplexed and sequenced. Data are further processed using standard, freely available bioinformatic tools, producing phase-resolved sequences. The approach does not require barcoded PCR primers, and the cost of library preparation is independent of the number of markers investigated. We hope that this approach will be of broad interest for researchers working at the interface of population genetics and phylogenetics, exploring deep intraspecific genetic structure, species boundaries and phylogeographies of closely related species.
Collapse
Affiliation(s)
- P Zieliński
- Institute of Environmental Sciences, Jagiellonian University, Gronostajowa 7, 30-387, Kraków, Poland
| | | | | | | | | |
Collapse
|
55
|
Yang YF, Zhu T, Niu DK. Association of intron loss with high mutation rate in Arabidopsis: implications for genome size evolution. Genome Biol Evol 2013; 5:723-33. [PMID: 23516254 PMCID: PMC4104619 DOI: 10.1093/gbe/evt043] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Despite the prevalence of intron losses during eukaryotic evolution, the selective forces acting on them have not been extensively explored. Arabidopsis thaliana lost half of its genome and experienced an elevated rate of intron loss after diverging from A. lyrata. The selective force for genome reduction was suggested to have driven the intron loss. However, the evolutionary mechanism of genome reduction is still a matter of debate. In this study, we found that intron-lost genes have high synonymous substitution rates. Assuming that differences in mutability among different introns are conserved among closely related species, we used the nucleotide substitution rate between orthologous introns in other species as the proxy of the mutation rate of Arabidopsis introns, either lost or extant. The lost introns were found to have higher mutation rates than extant introns. At the genome-wide level, A. thaliana has a higher mutation rate than A. lyrata, which correlates with the higher rate of intron loss and rapid genome reduction of A. thaliana. Our results indicate that selection to minimize mutational hazards might be the selective force for intron loss, and possibly also for genome reduction, in the evolution of A. thaliana. Small genome size and lower genome-wide intron density were widely reported to be correlated with phenotypic features, such as high metabolic rates and rapid growth. We argue that the mutational-hazard hypothesis is compatible with these correlations, by suggesting that selection for rapid growth might indirectly increase mutational hazards.
Collapse
Affiliation(s)
- Yu-Fei Yang
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering, and Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, China
| | | | | |
Collapse
|
56
|
Bainard J, Gregory T. Évolution de la taille des génomes : les modèles, les mécanismes et les avancées méthodologiques. Genome 2013. [DOI: 10.1139/gen-2013-0197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Affiliation(s)
- J.D. Bainard
- Department of Plant Sciences, University of Saskatchewan, Saskatoon SK, S7N 5A8, Canada
| | - T.R. Gregory
- Department of Integrative Biology, University of Guelph, Guelph ON, N1G 2W1, Canada
| |
Collapse
|
57
|
Jeffery NW, Oliveira IS, Gregory TR, Rowell DM, Mayer G. Genome size and chromosome number in velvet worms (Onychophora). Genetica 2013; 140:497-504. [PMID: 23307271 DOI: 10.1007/s10709-013-9698-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2012] [Accepted: 01/04/2013] [Indexed: 12/31/2022]
Abstract
The Onychophora (velvet worms) represents a small group of invertebrates (~180 valid species), which is commonly united with Tardigrada and Arthropoda in a clade called Panarthropoda. As with the majority of invertebrate taxa, genome size data are very limited for the Onychophora, with only one previously published estimate. Here we use both flow cytometry and Feulgen image analysis densitometry to provide genome size estimates for seven species of velvet worms from both major subgroups, Peripatidae and Peripatopsidae, along with karyotype data for each species. Genome sizes in these species range from roughly 5-19 pg, with densitometric estimates being slightly larger than those obtained by flow cytometry for all species. Chromosome numbers range from 2n = 8 to 2n = 54. No relationship is evident between genome size, chromosome number, or reproductive mode. Various avenues for future genomic research are presented based on these results.
Collapse
Affiliation(s)
- Nicholas W Jeffery
- Department of Integrative Biology, University of Guelph, Guelph, ON, Canada.
| | | | | | | | | |
Collapse
|
58
|
Large-scale transcriptome analysis of retroelements in the migratory locust, Locusta migratoria. PLoS One 2012; 7:e40532. [PMID: 22792363 PMCID: PMC3391268 DOI: 10.1371/journal.pone.0040532] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2011] [Accepted: 06/10/2012] [Indexed: 11/23/2022] Open
Abstract
Background Retroelements can successfully colonize eukaryotic genome through RNA-mediated transposition, and are considered to be some of the major mediators of genome size. The migratory locust Locusta migratoria is an insect with a large genome size, and its genome is probably subject to the proliferation of retroelements. An analysis of deep-sequencing transcriptome data will elucidate the structure, diversity and expression characteristics of retroelements. Results We performed a de novo assembly from deep sequencing RNA-seq data and identified 105 retroelements in the locust transcriptome. Phylogenetic analysis of reverse transcriptase sequences revealed 1 copia, 1 BEL, 8 gypsy and 23 non-long terminal repeat (LTR) retroelements in the locust transcriptome. A novel approach was developed to identify full-length LTR retroelements. A total of 5 full-length LTR retroelements and 2 full-length non-LTR retroelements that contained complete structures for retrotransposition were identified. Structural analysis indicated that all these retroelements may have been activated or deprived of retrotransposition activities very recently. Expression profiling analysis revealed that the retroelements exhibited a unique expression pattern at the egg stage and showed differential expression profiles between the solitarious and gregarious phases at the fifth instar and adult stage. Conclusion We hereby present the first de novo transcriptome analysis of retroelements in a species whose genome is not available. This work contributes to a comprehensive understanding of the landscape of retroelements in the locust transcriptome. More importantly, the results reveal that non-LTR retroelements are abundant and diverse in the locust transcriptome.
Collapse
|
59
|
Redi CA, Capanna E. Genome size evolution: sizing mammalian genomes. Cytogenet Genome Res 2012; 137:97-112. [PMID: 22627028 DOI: 10.1159/000338820] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
The study of genome size (GS) and its variation is so fascinating to the scientific community because it constitutes the link between the present-day analytical and molecular studies of the genome and the old trunk of the holistic and synthetic view of the genome. The GS of several taxa vary over a broad range and do not correlate with the complexity of the organisms (the C-value paradox). However, the biology of transposable elements has let us reach a satisfactory view of the molecular mechanisms that give rise to GS variation and novelties, providing a less perplexing view of the significance of the GS (C-enigma). The knowledge of the composition and structure of a genome is a pre-requisite for trying to understand the evolution of the main genome signature: its size. The radiation of mammals provides an approximately 180-million-year test case for theories of how GS evolves. It has been found from data-mining GS databases that GS is a useful cyto-taxonomical instrument at the level of orders/superorders, providing genomic signatures characterizing Monotremata, Marsupialia, Afrotheria, Xenarthra, Laurasiatheria, and Euarchontoglires. A hypothetical ancestral mammalian-like GS of 2.9-3.7 pg has been suggested. This value appears compatible with the average values calculated for the high systematic levels of the extant Monotremata (∼2.97 pg) and Marsupialia (∼4.07 pg), suggesting invasion of mobile DNA elements concurrently with the separation of the older clades of Afrotheria (∼5.5 pg) and Xenarthra (∼4.5 pg) with larger GS, leaving the Euarchontoglires (∼3.4 pg) and Laurasiatheria (∼2.8 pg) genomes with fewer transposable elements. However, the paucity of GS data (546 mammalian species sized from 5,488 living species) for species, genera, and families calls for caution. Considering that mammalian species may be vanished even before they are known, GS data are sorely needed to phenotype the effects brought about by their variation and to validate any hypotheses on GS evolution in mammals.
Collapse
Affiliation(s)
- C A Redi
- Fondazione IRCCS Policlinico San Matteo, Dipartimento di Biologia e Biotecnologie Lazzaro Spallanzani, Pavia, Italia.
| | | |
Collapse
|