1
|
Doronina L, Ogoniak L, Schmitz J. Homoplasy of Retrotransposon Insertions in Toothed Whales. Genes (Basel) 2023; 14:1830. [PMID: 37761970 PMCID: PMC10531181 DOI: 10.3390/genes14091830] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 09/19/2023] [Accepted: 09/20/2023] [Indexed: 09/29/2023] Open
Abstract
Retrotransposon insertion patterns facilitate a virtually homoplasy-free picture of phylogenetic history. Still, a few most likely random parallel insertions or deletions result in rare cases of homoplasy in primates. The following question arises: how frequent is retrotransposon homoplasy in other phylogenetic clades? Here, we derived genome insertion data of toothed whales to evaluate the extension of homoplasy in a representative laurasiatherian group. Among more than a thousand extracted and aligned retrotransposon loci, we detected 37 cases of precise parallel insertions in species that are separated by over more than 10 million years, a time frame which minimizes the effects of incomplete lineage sorting. We compared the phylogenetic signal of insertions with the flanking sequences of these loci to further exclude potential polymorphic loci derived by incomplete lineage sorting. We found that the phylogenetic signals of retrotransposon insertion patterns exhibiting true homoplasy differ from the signals of their flanking sequences. In toothed whales, precise parallel insertions account for around 0.18-0.29% of insertion cases, which is about 12.5 times the frequency of such insertions among Alus in primates. We also detected five specific deletions of retrotransposons on various lineages of toothed whale evolution, a frequency of 0.003%, which is slightly higher than such occurrences in primates. Overall, the level of retrotransposon homoplasy in toothed whales is still marginal compared to the phylogenetic diagnostic retrotransposon presence/absence signal.
Collapse
Affiliation(s)
- Liliya Doronina
- Institute of Experimental Pathology, ZMBE, University of Münster, 48149 Münster, Germany;
- Institute for Evolution and Biodiversity, University of Münster, 48149 Münster, Germany
| | - Lynn Ogoniak
- Institute of Experimental Pathology, ZMBE, University of Münster, 48149 Münster, Germany;
| | - Jürgen Schmitz
- Institute of Experimental Pathology, ZMBE, University of Münster, 48149 Münster, Germany;
| |
Collapse
|
2
|
Pauciullo A, Versace C, Gaspa G, Letaief N, Bedhiaf-Romdhani S, Fulgione A, Cosenza G. Sequencing and Characterization of αs2-Casein Gene ( CSN1S2) in the Old-World Camels Have Proven Genetic Variations Useful for the Understanding of Species Diversification. Animals (Basel) 2023; 13:2805. [PMID: 37685069 PMCID: PMC10487017 DOI: 10.3390/ani13172805] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Revised: 09/01/2023] [Accepted: 09/01/2023] [Indexed: 09/10/2023] Open
Abstract
The CSN1S2 gene encodes αs2-casein, the third most abundant protein in camel milk. Despite its importance in foals, human nutrition, and dairy processing, the CSN1S2 gene in camels has received little attention. This study presents the first complete characterization of the CSN1S2 gene sequence in Old-World camels (Camelus bactrianus and Camelus dromedarius). Additionally, the gene promoter, consisting of 752 bp upstream of exon 1, was analyzed. The entire gene comprises 17 exons, ranging in length from 24 bp (exons 4, 8, 11, and 13) to 280 bp (exon 17). Interesting was the identification of the exon 12 in both species. The promoter analysis revealed 24 putative binding sites in the Bactrian camel and 22 in dromedary camel. Most of these sites were typical elements associated with milk protein, such as C/EBP-α, C/EBP-β, Oct-1, and AP1. The SNP discovery showed relatively high genetic diversity compared to other camel casein genes (CSN1S1, CSN2, and CSN3), with a total of 34 polymorphic sites across the two species. Particularly noteworthy is the transition g.311G>A in the CSN1S2 promoter, creating a new putative consensus binding site for a C/EBP-β in the Bactrian camel. At the exon level, two novel variants were found. One was detected in exon 6 of the Bactrian camel (g.3639C>G), resulting in an amino acid replacement, p.36Ile>Met. The second variant was found in noncoding exon 17 of dromedary CSN1S2 (g.1511G>T). Although this mutation occurs in the 3'-UnTranslated Region, it represents the first example of exonic polymorphism in the CSN1S2 for this species. This SNP also affects the binding sites of different microRNAs, including the seed sequence of the miRNA 4662a-3p, highlighting its role as a regulatory factor for CSN1S2 gene. A PCR-RFLP was set up for genotyping a dromedary Tunisian population (n = 157), and the minor allele frequency was found to be 0.27 for the G allele, indicating a potential yield improvement margin. The interspersed elements (INEs) analysis revealed 10 INEs covering 7.34% and 8.14% of the CSN1S2 sequence in the Bactrian and dromedary camels, respectively. Furthermore, six elements (A, B, F, H, I, and L) are shared among cattle and camels and are partially found in other ruminants, suggesting a common ancestral origin of these retrotransposons. Conversely, elements C, D, E, and G are specific to camels.
Collapse
Affiliation(s)
- Alfredo Pauciullo
- Department of Agriculture, Forest and Food Sciences, University of Torino, 10095 Grugliasco, Italy
| | - Carmine Versace
- Department of Agriculture, Forest and Food Sciences, University of Torino, 10095 Grugliasco, Italy
| | - Giustino Gaspa
- Department of Agriculture, Forest and Food Sciences, University of Torino, 10095 Grugliasco, Italy
| | - Neyrouz Letaief
- Department of Agriculture, Forest and Food Sciences, University of Torino, 10095 Grugliasco, Italy
- Laboratory of Animal and Forage Production, National Agricultural Research Institute of Tunisia, University of Carthage, Ariana 1004, Tunisia
| | - Sonia Bedhiaf-Romdhani
- Laboratory of Animal and Forage Production, National Agricultural Research Institute of Tunisia, University of Carthage, Ariana 1004, Tunisia
| | - Andrea Fulgione
- Department of Agriculture, University of Napoli Federico II, 80055 Portici, Italy
| | - Gianfranco Cosenza
- Department of Agriculture, University of Napoli Federico II, 80055 Portici, Italy
| |
Collapse
|
3
|
Vollger MR, Dishuck PC, Harvey WT, DeWitt WS, Guitart X, Goldberg ME, Rozanski AN, Lucas J, Asri M, Munson KM, Lewis AP, Hoekzema K, Logsdon GA, Porubsky D, Paten B, Harris K, Hsieh P, Eichler EE. Increased mutation and gene conversion within human segmental duplications. Nature 2023; 617:325-334. [PMID: 37165237 PMCID: PMC10172114 DOI: 10.1038/s41586-023-05895-y] [Citation(s) in RCA: 24] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Accepted: 02/28/2023] [Indexed: 05/12/2023]
Abstract
Single-nucleotide variants (SNVs) in segmental duplications (SDs) have not been systematically assessed because of the limitations of mapping short-read sequencing data1,2. Here we constructed 1:1 unambiguous alignments spanning high-identity SDs across 102 human haplotypes and compared the pattern of SNVs between unique and duplicated regions3,4. We find that human SNVs are elevated 60% in SDs compared to unique regions and estimate that at least 23% of this increase is due to interlocus gene conversion (IGC) with up to 4.3 megabase pairs of SD sequence converted on average per human haplotype. We develop a genome-wide map of IGC donors and acceptors, including 498 acceptor and 454 donor hotspots affecting the exons of about 800 protein-coding genes. These include 171 genes that have 'relocated' on average 1.61 megabase pairs in a subset of human haplotypes. Using a coalescent framework, we show that SD regions are slightly evolutionarily older when compared to unique sequences, probably owing to IGC. SNVs in SDs, however, show a distinct mutational spectrum: a 27.1% increase in transversions that convert cytosine to guanine or the reverse across all triplet contexts and a 7.6% reduction in the frequency of CpG-associated mutations when compared to unique DNA. We reason that these distinct mutational properties help to maintain an overall higher GC content of SD DNA compared to that of unique DNA, probably driven by GC-biased conversion between paralogous sequences5,6.
Collapse
Affiliation(s)
- Mitchell R Vollger
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Division of Medical Genetics, University of Washington School of Medicine, Seattle, WA, USA
| | - Philip C Dishuck
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - William T Harvey
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - William S DeWitt
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley, CA, USA
| | - Xavi Guitart
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Michael E Goldberg
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Allison N Rozanski
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Julian Lucas
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Mobin Asri
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Katherine M Munson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Alexandra P Lewis
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Glennis A Logsdon
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Kelley Harris
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - PingHsun Hsieh
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.
- Howard Hughes Medical Institute, Chevy Chase, MD, USA.
| |
Collapse
|
4
|
Deng S. The origin of genetic and metabolic systems: Evolutionary structuralinsights. Heliyon 2023; 9:e14466. [PMID: 36967965 PMCID: PMC10036676 DOI: 10.1016/j.heliyon.2023.e14466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Revised: 02/27/2023] [Accepted: 03/06/2023] [Indexed: 03/16/2023] Open
Abstract
DNA is derived from reverse transcription and its origin is related to reverse transcriptase, DNA polymerase and integrase. The gene structure originated from the evolution of the first RNA polymerase. Thus, an explanation of the origin of the genetic system must also explain the evolution of these enzymes. This paper proposes a polymer structure model, termed the stable complex evolution model, which explains the evolution of enzymes and functional molecules. Enzymes evolved their functions by forming locally tightly packed complexes with specific substrates. A metabolic reaction can therefore be considered to be the result of adaptive evolution in this way when a certain essential molecule is lacking in a cell. The evolution of the primitive genetic and metabolic systems was thus coordinated and synchronized. According to the stable complex model, almost all functional molecules establish binding affinity and specific recognition through complementary interactions, and functional molecules therefore have the nature of being auto-reactive. This is thermodynamically favorable and leads to functional duplication and self-organization. Therefore, it can be speculated that biological systems have a certain tendency to maintain functional stability or are influenced by an inherent selective power. The evolution of dormant bacteria may support this hypothesis, and inherent selectivity can be unified with natural selection at the molecular level.
Collapse
|
5
|
Gualtieri CT. Genomic Variation, Evolvability, and the Paradox of Mental Illness. Front Psychiatry 2021; 11:593233. [PMID: 33551865 PMCID: PMC7859268 DOI: 10.3389/fpsyt.2020.593233] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/10/2020] [Accepted: 11/27/2020] [Indexed: 12/30/2022] Open
Abstract
Twentieth-century genetics was hard put to explain the irregular behavior of neuropsychiatric disorders. Autism and schizophrenia defy a principle of natural selection; they are highly heritable but associated with low reproductive success. Nevertheless, they persist. The genetic origins of such conditions are confounded by the problem of variable expression, that is, when a given genetic aberration can lead to any one of several distinct disorders. Also, autism and schizophrenia occur on a spectrum of severity, from mild and subclinical cases to the overt and disabling. Such irregularities reflect the problem of missing heritability; although hundreds of genes may be associated with autism or schizophrenia, together they account for only a small proportion of cases. Techniques for higher resolution, genomewide analysis have begun to illuminate the irregular and unpredictable behavior of the human genome. Thus, the origins of neuropsychiatric disorders in particular and complex disease in general have been illuminated. The human genome is characterized by a high degree of structural and behavioral variability: DNA content variation, epistasis, stochasticity in gene expression, and epigenetic changes. These elements have grown more complex as evolution scaled the phylogenetic tree. They are especially pertinent to brain development and function. Genomic variability is a window on the origins of complex disease, neuropsychiatric disorders, and neurodevelopmental disorders in particular. Genomic variability, as it happens, is also the fuel of evolvability. The genomic events that presided over the evolution of the primate and hominid lineages are over-represented in patients with autism and schizophrenia, as well as intellectual disability and epilepsy. That the special qualities of the human genome that drove evolution might, in some way, contribute to neuropsychiatric disorders is a matter of no little interest.
Collapse
|
6
|
Ahmad SF, Singchat W, Jehangir M, Suntronpong A, Panthum T, Malaivijitnond S, Srikulnath K. Dark Matter of Primate Genomes: Satellite DNA Repeats and Their Evolutionary Dynamics. Cells 2020; 9:E2714. [PMID: 33352976 PMCID: PMC7767330 DOI: 10.3390/cells9122714] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 12/15/2020] [Accepted: 12/16/2020] [Indexed: 12/12/2022] Open
Abstract
A substantial portion of the primate genome is composed of non-coding regions, so-called "dark matter", which includes an abundance of tandemly repeated sequences called satellite DNA. Collectively known as the satellitome, this genomic component offers exciting evolutionary insights into aspects of primate genome biology that raise new questions and challenge existing paradigms. A complete human reference genome was recently reported with telomere-to-telomere human X chromosome assembly that resolved hundreds of dark regions, encompassing a 3.1 Mb centromeric satellite array that had not been identified previously. With the recent exponential increase in the availability of primate genomes, and the development of modern genomic and bioinformatics tools, extensive growth in our knowledge concerning the structure, function, and evolution of satellite elements is expected. The current state of knowledge on this topic is summarized, highlighting various types of primate-specific satellite repeats to compare their proportions across diverse lineages. Inter- and intraspecific variation of satellite repeats in the primate genome are reviewed. The functional significance of these sequences is discussed by describing how the transcriptional activity of satellite repeats can affect gene expression during different cellular processes. Sex-linked satellites are outlined, together with their respective genomic organization. Mechanisms are proposed whereby satellite repeats might have emerged as novel sequences during different evolutionary phases. Finally, the main challenges that hinder the detection of satellite DNA are outlined and an overview of the latest methodologies to address technological limitations is presented.
Collapse
Affiliation(s)
- Syed Farhan Ahmad
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Worapong Singchat
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Maryam Jehangir
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Department of Structural and Functional Biology, Institute of Bioscience at Botucatu, São Paulo State University (UNESP), Botucatu, São Paulo 18618-689, Brazil
| | - Aorarat Suntronpong
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Thitipong Panthum
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Suchinda Malaivijitnond
- National Primate Research Center of Thailand, Chulalongkorn University, Saraburi 18110, Thailand;
- Department of Biology, Faculty of Science, Chulalongkorn University, Bangkok 10330, Thailand
| | - Kornsorn Srikulnath
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
- National Primate Research Center of Thailand, Chulalongkorn University, Saraburi 18110, Thailand;
- Center of Excellence on Agricultural Biotechnology (AG-BIO/PERDO-CHE), Bangkok 10900, Thailand
- Omics Center for Agriculture, Bioresources, Food and Health, Kasetsart University (OmiKU), Bangkok 10900, Thailand
| |
Collapse
|
7
|
Paladin L, Necci M, Piovesan D, Mier P, Andrade-Navarro MA, Tosatto SCE. A novel approach to investigate the evolution of structured tandem repeat protein families by exon duplication. J Struct Biol 2020; 212:107608. [PMID: 32896658 DOI: 10.1016/j.jsb.2020.107608] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Revised: 08/19/2020] [Accepted: 08/21/2020] [Indexed: 11/30/2022]
Abstract
Tandem Repeat Proteins (TRPs) are ubiquitous in cells and are enriched in eukaryotes. They contributed to the evolution of organism complexity, specializing for functions that require quick adaptability such as immunity-related functions. To investigate the hypothesis of repeat protein evolution through exon duplication and rearrangement, we designed a tool to analyze the relationships between exon/intron patterns and structural symmetries. The tool allows comparison of the structure fragments as defined by exon/intron boundaries from Ensembl against the structural element repetitions from RepeatsDB. The all-against-all pairwise structural alignment between fragments and comparison of the two definitions (structural units and exons) are visualized in a single matrix, the "repeat/exon plot". An analysis of different repeat protein families, including the solenoids Leucine-Rich, Ankyrin, Pumilio, HEAT repeats and the β propellers Kelch-like, WD40 and RCC1, shows different behaviors, illustrated here through examples. For each example, the analysis of the exon mapping in homologous proteins supports the conservation of their exon patterns. We propose that when a clear-cut relationship between exon and structural boundaries can be identified, it is possible to infer a specific "evolutionary pattern" which may improve TRPs detection and classification.
Collapse
Affiliation(s)
| | - Marco Necci
- Dept. of Biomedical Sciences, University of Padova, Italy
| | | | - Pablo Mier
- Faculty of Biology, Johannes Gutenberg University of Mainz, Germany
| | | | | |
Collapse
|
8
|
Pauciullo A, Shuiep ET, Ogah MD, Cosenza G, Di Stasio L, Erhardt G. Casein Gene Cluster in Camelids: Comparative Genome Analysis and New Findings on Haplotype Variability and Physical Mapping. Front Genet 2019; 10:748. [PMID: 31555318 PMCID: PMC6726744 DOI: 10.3389/fgene.2019.00748] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Accepted: 07/17/2019] [Indexed: 12/22/2022] Open
Abstract
The structure of casein genes has been fully understood in llamas, whereas in other camelids, this information is still incomplete. In fact, structure and polymorphisms have been identified in three (CSN1S1, αs1-CN; CSN2, β-CN; CSN3, κ-CN) out of four casein genes, whereas controversial information is available for the CSN1S2 (αs2-CN) in terms of structure and genetic diversity. Data from the genome analysis, whose assembly is available for feral camel, Bactrian, dromedary, and alpaca, can contribute to a better knowledge. However, a majority of the scaffolds available in GenBank are still unplaced, and the comparative annotation is often inaccurate or lacking.Therefore, the aims of this study are 1) to perform a comparative genome analysis and synthesize the literature data on camelids casein cluster; 2) to analyze the casein variability in two dromedary populations (Sudanese and Nigerian) using polymorphisms at CSN1S1 (c.150G > T), CSN2 (g.2126A > G), and CSN3 (g.1029T > C); and 3) to physically map the casein cluster in alpaca. Exon structures, gene and intergenic distances, large insertion/deletion events, SNPs, and microsatellites were annotated. In all camelids, the CSN1S2 consists of 17 exons, confirming the structure of llama CSN1S2 gene. The comparative analysis of the complete casein cluster (∼190kb) shows 12,818 polymorphisms. The most polymorphic gene is the CSN1S1 (99 SNPs in Bactrian vs. 248 in dromedary vs. 626 in alpaca). The less polymorphic is the CSN3 in the Bactrian (22 SNPs) and alpaca (301 SNPs), whereas it is the CSN1S2 in dromedary (79 SNPs). In the two investigated dromedary populations, the allele frequencies for the three markers are slightly different: the allele C at CSN1S1 is very rare in Nigerian (0.054) and Sudanese dromedaries (0.094), whereas the frequency of the allele G at CSN2 is almost inverted. Haplotype analysis evidenced GAC as the most frequent (0.288) and TGC as the rarest (0.005). The analysis of R-banding metaphases hybridized with specific probes mapped the casein genes on chromosome 2q21 in alpaca. These data deepen the information on the structure of the casein cluster in camelids and add knowledge on the cytogenetic map and haplotype variability.
Collapse
Affiliation(s)
- Alfredo Pauciullo
- Department of Agricultural, Forest and Food Sciences, University of Torino, Grugliasco, Italy
| | - El Tahir Shuiep
- Institute of Molecular Biology, University of Nyala, Nyala, Sudan
| | - Moses Danlami Ogah
- Department of Animal Science, Nasarawa State University, Keffi, Shabu-Lafia, Nigeria
| | - Gianfranco Cosenza
- Department of Agriculture, University of Napoli Federico II, Portici Italy
| | - Liliana Di Stasio
- Department of Agricultural, Forest and Food Sciences, University of Torino, Grugliasco, Italy
| | - Georg Erhardt
- Department for Animal Breeding and Genetics, Justus Liebig University, Gießen, Germany
| |
Collapse
|
9
|
Chen C, Wang W, Wang X, Shen D, Wang S, Wang Y, Gao B, Wimmers K, Mao J, Li K, Song C. Retrotransposons evolution and impact on lncRNA and protein coding genes in pigs. Mob DNA 2019; 10:19. [PMID: 31080521 PMCID: PMC6501411 DOI: 10.1186/s13100-019-0161-8] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2019] [Accepted: 04/11/2019] [Indexed: 12/26/2022] Open
Abstract
BACKGROUND Retrotransposons are the major determinants of genome sizes and they have shaped both genes and genomes in mammalian organisms, but their overall activity, diversity, and evolution dynamics, particularly their impact on protein coding and lncRNA genes in pigs remain largely unknown. RESULTS In the present study, we performed de novo detection of retrotransposons in pigs by using multiple pipelines, four distinct families of pig-specific L1 s classified into 51 distinct subfamilies and representing four evolution models and three expansion waves of pig-specific SINEs represented by three distinct families were identified. ERVs were classified into 18 families and found two most "modern" subfamilies in the pig genome. The transposition activity of pig L1 was verified by experiment, the sense and antisense promoter activities of young L1 5'UTRs and ERV LTRs and expression profiles of young retrotransposons in multiple tissues and cell lines were also validated. Furthermore, retrotransposons had an extensive impact on lncRNA and protein coding genes at both the genomic and transcriptomic levels. Most protein coding and lncRNA (> 80%) genes contained retrotransposon insertions, and about half of protein coding genes (44.30%) and one-fourth (24.13%) of lncRNA genes contained the youngest retrotransposon insertions. Nearly half of protein coding genes (43.78%) could generate chimeric transcripts with retrotransposons. Significant distribution bias of retrotransposon composition, location, and orientation in lncRNA and protein coding genes, and their transcripts, were observed. CONCLUSIONS In the current study, we characterized the classification and evolution profile of retrotransposons in pigs, experimentally proved the transposition activity of the young pig L1 subfamily, characterized the sense and antisense expression profiles and promoter activities of young retrotransposons, and investigated their impact on lncRNA and protein coding genes by defining the mobilome landscapes at the genomic and transcriptomic levels. These findings help provide a better understanding of retrotransposon evolution in mammal and their impact on the genome and transcriptome.
Collapse
Affiliation(s)
- Cai Chen
- Institute of Animal Mobilome and Genome, College of Animal Science & Technology, Yangzhou University, Yangzhou, 225009 Jiangsu China
| | - Wei Wang
- Institute of Animal Mobilome and Genome, College of Animal Science & Technology, Yangzhou University, Yangzhou, 225009 Jiangsu China
| | - Xiaoyan Wang
- Institute of Animal Mobilome and Genome, College of Animal Science & Technology, Yangzhou University, Yangzhou, 225009 Jiangsu China
| | - Dan Shen
- Institute of Animal Mobilome and Genome, College of Animal Science & Technology, Yangzhou University, Yangzhou, 225009 Jiangsu China
| | - Saisai Wang
- Institute of Animal Mobilome and Genome, College of Animal Science & Technology, Yangzhou University, Yangzhou, 225009 Jiangsu China
| | - Yali Wang
- Institute of Animal Mobilome and Genome, College of Animal Science & Technology, Yangzhou University, Yangzhou, 225009 Jiangsu China
| | - Bo Gao
- Institute of Animal Mobilome and Genome, College of Animal Science & Technology, Yangzhou University, Yangzhou, 225009 Jiangsu China
| | - Klaus Wimmers
- Leibniz Institute for Farm Animal Biology (FBN), 18196 Dummerstorf, Germany
| | - Jiude Mao
- Life Science Center, University of Missouri, Columbia, MO 65211 USA
| | - Kui Li
- Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Chengyi Song
- Institute of Animal Mobilome and Genome, College of Animal Science & Technology, Yangzhou University, Yangzhou, 225009 Jiangsu China
| |
Collapse
|
10
|
Petersen M, Armisén D, Gibbs RA, Hering L, Khila A, Mayer G, Richards S, Niehuis O, Misof B. Diversity and evolution of the transposable element repertoire in arthropods with particular reference to insects. BMC Evol Biol 2019; 19:11. [PMID: 30626321 PMCID: PMC6327564 DOI: 10.1186/s12862-018-1324-9] [Citation(s) in RCA: 91] [Impact Index Per Article: 18.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2018] [Accepted: 12/11/2018] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Transposable elements (TEs) are a major component of metazoan genomes and are associated with a variety of mechanisms that shape genome architecture and evolution. Despite the ever-growing number of insect genomes sequenced to date, our understanding of the diversity and evolution of insect TEs remains poor. RESULTS Here, we present a standardized characterization and an order-level comparison of arthropod TE repertoires, encompassing 62 insect and 11 outgroup species. The insect TE repertoire contains TEs of almost every class previously described, and in some cases even TEs previously reported only from vertebrates and plants. Additionally, we identified a large fraction of unclassifiable TEs. We found high variation in TE content, ranging from less than 6% in the antarctic midge (Diptera), the honey bee and the turnip sawfly (Hymenoptera) to more than 58% in the malaria mosquito (Diptera) and the migratory locust (Orthoptera), and a possible relationship between the content and diversity of TEs and the genome size. CONCLUSION While most insect orders exhibit a characteristic TE composition, we also observed intraordinal differences, e.g., in Diptera, Hymenoptera, and Hemiptera. Our findings shed light on common patterns and reveal lineage-specific differences in content and evolution of TEs in insects. We anticipate our study to provide the basis for future comparative research on the insect TE repertoire.
Collapse
Affiliation(s)
- Malte Petersen
- University of Bonn, Bonn, Germany
- Zoological Research Museum Alexander Koenig, Center for Molecular Biodiversity Research, Adenauerallee 160, Bonn, 53113 Germany
- Senckenberg Gesellschaft für Naturforschung, Senckenberganlage 25, Frankfurt, 60325 Germany
| | - David Armisén
- Université de Lyon, Institut de Génomique Fonctionnelle de Lyon, CNRS UMR 5242, Ecole Normale Supérieure de Lyon, Université Claude Bernard Lyon 1, 46 allée d’Italie, Lyon, 69364 France
| | - Richard A. Gibbs
- Human Genome Sequencing Center, Department of Human and Molecular Genetics, Baylor College of Medicine, Houston, 77030 TX USA
| | - Lars Hering
- Department of Zoology, Institute of Biology, University of Kassel, Heinrich-Plett-Str. 40, Kassel, 34132 Germany
| | - Abderrahman Khila
- Université de Lyon, Institut de Génomique Fonctionnelle de Lyon, CNRS UMR 5242, Ecole Normale Supérieure de Lyon, Université Claude Bernard Lyon 1, 46 allée d’Italie, Lyon, 69364 France
| | - Georg Mayer
- Department of Zoology, Institute of Biology, University of Kassel, Heinrich-Plett-Str. 40, Kassel, 34132 Germany
| | - Stephen Richards
- Human Genome Sequencing Center, Department of Human and Molecular Genetics, Baylor College of Medicine, Houston, 77030 TX USA
| | - Oliver Niehuis
- Department of Evolutionary Biology and Ecology, Institute for Biology I (Zoology), University of Freiburg, Freiburg (Brsg.), 79104 Germany
| | - Bernhard Misof
- Zoological Research Museum Alexander Koenig, Center for Molecular Biodiversity Research, Adenauerallee 160, Bonn, 53113 Germany
| |
Collapse
|
11
|
Banuelos M, Sindi S. Modeling transposable element dynamics with fragmentation equations. Math Biosci 2018; 302:46-66. [DOI: 10.1016/j.mbs.2018.05.009] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2017] [Revised: 04/02/2018] [Accepted: 05/11/2018] [Indexed: 12/16/2022]
|
12
|
Sookdeo A, Ruiz-García M, Schneider H, Boissinot S. Contrasting Rates of LINE-1 Amplification among New World Primates of the Atelidae Family. Cytogenet Genome Res 2018; 154:217-228. [PMID: 29991050 DOI: 10.1159/000490481] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/13/2018] [Indexed: 11/19/2022] Open
Abstract
LINE-1 (L1) retrotransposons constitute the dominant category of transposons in mammalian genomes. L1 elements are active in the vast majority of mammals, and only a few cases of L1 extinction have been documented. The only possible case of extinction in primates was suggested for South American spider monkeys. However, these previous studies were based on a single species. We revisited this question with a larger phylogenetic sample, covering all 4 genera of Atelidae and 3 species of spider monkeys. We used an enrichment method to clone recently inserted L1 elements and performed an evolutionary analysis of the sequences. We were able to identify young L1 elements in all taxa, suggesting that L1 is probably still active in all Atelidae examined. However, we also detected considerable variations in the proportion of recent elements indicating that the rate of L1 amplification varies among Atelidae by a 3-fold factor. The extent of L1 amplification in Atelidae remains overall lower than in other New World monkeys. Multiple factors can affect the amplification of L1, such as the demography of the host and the control of transposition. These factors are discussed in the context of host life history.
Collapse
|
13
|
Vandewege MW, Platt RN, Ray DA, Hoffmann FG. Transposable Element Targeting by piRNAs in Laurasiatherians with Distinct Transposable Element Histories. Genome Biol Evol 2016; 8:1327-37. [PMID: 27060702 PMCID: PMC4898795 DOI: 10.1093/gbe/evw078] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
PIWI proteins and PIWI-interacting RNAs (piRNAs) are part of a cellular pathway that has evolved to protect genomes against the proliferation of transposable elements (TEs). PIWIs and piRNAs assemble into complexes that are involved in epigenetic and post-transcriptional repression of TEs. Most of our understanding of the mechanisms of piRNA-mediated TE silencing comes from fruit fly and mouse models. However, even in these well-studied animals it is unclear how piRNA responses relate to variable TE expression and whether the strength of the piRNA response affects TE content over time. Here, we assessed the evolutionary interactions between TE and piRNAs in a statistical framework using three nonmodel laurasiatherian mammals as a study system: dog, horse, and a vesper bat. These three species diverged ∼80 million years ago and have distinct genomic TE contents. By comparing species with distinct TE landscapes, we aimed to identify clear relationships among TE content, expression, and piRNAs. We found that the TE subfamilies that are the most transcribed appear to elicit the strongest “ping-pong” response. This was most evident among long interspersed elements, but the relationships between expression and ping-pong pilRNA (piRNA-like) expression were more complex among SINEs. SINE transcripts were equally abundant in the dog and horse yet new SINE insertions were relatively rare in the horse genome, where we identified a stronger piRNA response. Our analyses suggest that the piRNA response can have a strong impact on the TE composition of a genome. However, our results also suggest that the presence of a robust piRNA response is apparently not sufficient to stop TE mobilization and accumulation.
Collapse
Affiliation(s)
- Michael W Vandewege
- Department of Biochemistry, Molecular Biology, Entomology and Plant Pathology, Mississippi State University
| | - Roy N Platt
- Department of Biochemistry, Molecular Biology, Entomology and Plant Pathology, Mississippi State University Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University
| | - David A Ray
- Department of Biochemistry, Molecular Biology, Entomology and Plant Pathology, Mississippi State University Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University
| | - Federico G Hoffmann
- Department of Biochemistry, Molecular Biology, Entomology and Plant Pathology, Mississippi State University Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University
| |
Collapse
|
14
|
Ockendon NF, O'Connell LA, Bush SJ, Monzón-Sandoval J, Barnes H, Székely T, Hofmann HA, Dorus S, Urrutia AO. Optimization of next-generation sequencing transcriptome annotation for species lacking sequenced genomes. Mol Ecol Resour 2015; 16:446-58. [PMID: 26358618 PMCID: PMC4982090 DOI: 10.1111/1755-0998.12465] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2014] [Revised: 08/01/2015] [Accepted: 08/14/2015] [Indexed: 01/10/2023]
Abstract
Next‐generation sequencing methods, such as RNA‐seq, have permitted the exploration of gene expression in a range of organisms which have been studied in ecological contexts but lack a sequenced genome. However, the efficacy and accuracy of RNA‐seq annotation methods using reference genomes from related species have yet to be robustly characterized. Here we conduct a comprehensive power analysis employing RNA‐seq data from Drosophila melanogaster in conjunction with 11 additional genomes from related Drosophila species to compare annotation methods and quantify the impact of evolutionary divergence between transcriptome and the reference genome. Our analyses demonstrate that, regardless of the level of sequence divergence, direct genome mapping (DGM), where transcript short reads are aligned directly to the reference genome, significantly outperforms the widely used de novo and guided assembly‐based methods in both the quantity and accuracy of gene detection. Our analysis also reveals that DGM recovers a more representative profile of Gene Ontology functional categories, which are often used to interpret emergent patterns in genomewide expression analyses. Lastly, analysis of available primate RNA‐seq data demonstrates the applicability of our observations across diverse taxa. Our quantification of annotation accuracy and reduced gene detection associated with sequence divergence thus provides empirically derived guidelines for the design of future gene expression studies in species without sequenced genomes.
Collapse
Affiliation(s)
- Nina F Ockendon
- Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK.,Milner Centre, University of Bath, Bath, BA2 7AY, UK
| | - Lauren A O'Connell
- FAS Centre for Systems Biology, Harvard University, Cambridge, MA, 02138, USA
| | - Stephen J Bush
- Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK.,Milner Centre, University of Bath, Bath, BA2 7AY, UK
| | - Jimena Monzón-Sandoval
- Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK.,Milner Centre, University of Bath, Bath, BA2 7AY, UK
| | - Holly Barnes
- Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK.,Milner Centre, University of Bath, Bath, BA2 7AY, UK
| | - Tamás Székely
- Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK.,Milner Centre, University of Bath, Bath, BA2 7AY, UK
| | - Hans A Hofmann
- Center for Computational Biology and Bioinformatics, Department of Integrative Biology, The University of Texas, Austin, TX, 78712, USA
| | - Steve Dorus
- Department of Biology, Syracuse University, Syracuse, NY, 13244, USA
| | - Araxi O Urrutia
- Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK.,Milner Centre, University of Bath, Bath, BA2 7AY, UK
| |
Collapse
|
15
|
Dowall SD, Matthews DA, Garcia-Dorival I, Taylor I, Kenny J, Hertz-Fowler C, Hall N, Corbin-Lickfett K, Empig C, Schlunegger K, Barr JN, Carroll MW, Hewson R, Hiscox JA. Elucidating variations in the nucleotide sequence of Ebola virus associated with increasing pathogenicity. Genome Biol 2015; 15:540. [PMID: 25416632 DOI: 10.1186/preaccept-1724277741482641] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Ebolaviruses causes a severe and often fatal hemorrhagic fever in humans, with some species such as Ebola virus having case fatality rates approaching 90%. Currently the worst Ebola virus outbreak since the disease was discovered is occurring in West Africa. Although thought to be a zoonotic infection, a concern is that with increasing numbers of humans being infected, Ebola virus variants could be selected which are better adapted for human-to-human transmission. RESULTS To investigate whether genetic changes in Ebola virus become established in response to adaptation in a different host, a guinea pig model of infection was used. In this experimental system, guinea pigs were infected with Ebola virus (EBOV), which initially did not cause disease. To simulate transmission to uninfected individuals, the virus was serially passaged five times in naive animals. As the virus was passaged, virulence increased and clinical effects were observed in the guinea pig. An RNAseq and consensus mapping approach was then used to evaluate potential nucleotide changes in the Ebola virus genome at each passage. CONCLUSIONS Upon passage in the guinea pig model, EBOV become more virulent, RNA editing and also coding changes in key proteins become established. The data suggest that the initial evolutionary trajectory of EBOV in a new host can lead to a gain in virulence. Given the circumstances of the sustained transmission of EBOV in the current outbreak in West Africa, increases in virulence may be associated with prolonged and uncontrolled epidemics of EBOV.
Collapse
|
16
|
Dowall SD, Matthews DA, Garcia-Dorival I, Taylor I, Kenny J, Hertz-Fowler C, Hall N, Corbin-Lickfett K, Empig C, Schlunegger K, Barr JN, Carroll MW, Hewson R, Hiscox JA. Elucidating variations in the nucleotide sequence of Ebola virus associated with increasing pathogenicity. Genome Biol 2015. [PMID: 25416632 PMCID: PMC4289381 DOI: 10.1186/s13059-014-0540-x] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Background Ebolaviruses cause a severe and often fatal haemorrhagic fever in humans, with some species such as Ebola virus having case fatality rates approaching 90%. Currently, the worst Ebola virus outbreak since the disease was discovered is occurring in West Africa. Although thought to be a zoonotic infection, a concern is that with increasing numbers of humans being infected, Ebola virus variants could be selected which are better adapted for human-to-human transmission. Results To investigate whether genetic changes in Ebola virus become established in response to adaptation in a different host, a guinea pig model of infection was used. In this experimental system, guinea pigs were infected with Ebola virus (EBOV), which initially did not cause disease. To simulate transmission to uninfected individuals, the virus was serially passaged five times in naïve animals. As the virus was passaged, virulence increased and clinical effects were observed in the guinea pig. An RNAseq and consensus mapping approach was then used to evaluate potential nucleotide changes in the Ebola virus genome at each passage. Conclusions Upon passage in the guinea pig model, EBOV become more virulent, RNA editing and also coding changes in key proteins become established. The data suggest that the initial evolutionary trajectory of EBOV in a new host can lead to a gain in virulence. Given the circumstances of the sustained transmission of EBOV in the current outbreak in West Africa, increases in virulence may be associated with prolonged and uncontrolled epidemics of EBOV.
Collapse
|
17
|
Boissinot S, Alvarez L, Giraldo-Ramirez J, Tollis M. Neutral nuclear variation in Baboons (genus Papio) provides insights into their evolutionary and demographic histories. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY 2014; 155:621-34. [PMID: 25234435 PMCID: PMC4339869 DOI: 10.1002/ajpa.22618] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/11/2014] [Revised: 08/20/2014] [Accepted: 09/08/2014] [Indexed: 12/31/2022]
Abstract
Baboons (genus Papio) are distributed over most of sub-Saharan Africa and in the southern portion of the Arabian Peninsula. Six distinct morphotypes, with clearly defined geographic distributions, are recognized (the olive, chacma, yellow, Guinea, Kinda, and hamadryas baboons). The evolutionary relationships among baboon forms have long been a controversial issue. Phylogenetic analyses based on mitochondrial DNA sequences revealed that the modern baboon morphotypes are mitochondrially paraphyletic or polyphyletic. The discordance between mitochondrial lineages and morphology is indicative of extensive introgressive hybridization between ancestral baboon populations. To gain insights into the evolutionary relationships among morphotypes and their demographic history, we performed an analysis of nuclear variation in baboons. We sequenced 13 noncoding, putatively neutral, nuclear regions, and scored the presence/absence of 18 polymorphic transposable elements in a sample of 45 baboons belonging to five of the six recognized baboon forms. We found that the chacma baboon is the sister-taxon to all other baboons and the yellow baboon is the sister-taxon to an unresolved northern clade containing the olive, Guinea, and hamadryas baboons. We estimated that the diversification of baboons occurred entirely in the Pleistocene, the earliest split dating ∼1.5 million years ago, and that baboons have experienced relatively large and constant effective population sizes for most of their evolutionary history (∼30,000 to 95,000 individuals).
Collapse
Affiliation(s)
- Stéphane Boissinot
- Department of Biology, Queens College, the City University of New York, Queens, NY, USA
- Ecology, Evolutionary Biology and Behavior, Graduate Center, the City University of New York, New York, NY, USA
| | - Lauren Alvarez
- Department of Biology, Queens College, the City University of New York, Queens, NY, USA
| | | | - Marc Tollis
- Department of Biology, Queens College, the City University of New York, Queens, NY, USA
- Ecology, Evolutionary Biology and Behavior, Graduate Center, the City University of New York, New York, NY, USA
- School of Life Sciences, Arizona State University, 427 E Tyler Mall, Tempe, AZ, USA
| |
Collapse
|
18
|
Blackwell BJ, Lopez MF, Wang J, Krastins B, Sarracino D, Tollervey JR, Dobke M, Jordan IK, Lunyak VV. Protein interactions with piALU RNA indicates putative participation of retroRNA in the cell cycle, DNA repair and chromatin assembly. Mob Genet Elements 2014; 2:26-35. [PMID: 22754750 PMCID: PMC3383447 DOI: 10.4161/mge.19032] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
Recent analyses suggest that transposable element-derived transcripts are processed to yield a variety of small RNA species that play critical functional roles in gene regulation and chromatin organization as well as genome stability and maintenance. Here we report a mass spectrometry analysis of an RNA-affinity complex isolation using a piRNA homologous sequence derived from Alu retrotransposal RNA. Our data point to potential roles for piALU RNAs in DNA repair, cell cycle and chromatin regulations.
Collapse
|
19
|
Zhang LM, Luo H, Liu ZQ, Zhao Y, Luo JC, Hao DY, Jing HC. Genome-wide patterns of large-size presence/absence variants in sorghum. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2014; 56:24-37. [PMID: 24428208 DOI: 10.1111/jipb.12121] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/26/2013] [Accepted: 10/10/2013] [Indexed: 06/03/2023]
Abstract
The presence/absence variants (PAVs) are a major source of genome structural variation and have profound effects on phenotypic and genomic variation in animals and humans. However, little is understood about PAVs in plant genomes. Our previous resequencing effort on three sorghum (Sorghum bicolour L.) genomes, each 12× coverage, uncovered 5 364 PAVs. Here, we report a detailed characterization of 51 large-size (>30 kb) PAVs. These PAVs spanned a total size of 2.92 Mb of the sorghum genome containing 202 known and predicted genes, including 38 genes annotated to encode cell death and stress response genes. The PAVs varied considerably for repeat sequences and mobile elements with DNA transposons as the major components. The frequency and distribution of these PAVs differed substantially across 96 sorghum inbred lines, and the low- and high frequency PAVs differed in their gene categories. This report shed new light on the occurrence and diversity of PAVs in sorghum genomes. Our research exemplifies a new perspective to explore genome structural variation for genetic improvement in plant breeding.
Collapse
Affiliation(s)
- Li-Min Zhang
- The Key Laboratory of Plant Resources, Institute of Botany, The Chinese Academy of Sciences, Beijing, 100093, China; Key Laboratory for Molecular Enzymology and Engineering of the Ministry of Education, Jilin University, Changchun, 130012, China
| | | | | | | | | | | | | |
Collapse
|
20
|
Abstract
We analyzed 83 fully sequenced great ape genomes for mobile element insertions, predicting a total of 49,452 fixed and polymorphic Alu and long interspersed element 1 (L1) insertions not present in the human reference assembly and assigning each retrotransposition event to a different time point during great ape evolution. We used these homoplasy-free markers to construct a mobile element insertions-based phylogeny of humans and great apes and demonstrate their differential power to discern ape subspecies and populations. Within this context, we find a good correlation between L1 diversity and single-nucleotide polymorphism heterozygosity (r(2) = 0.65) in contrast to Alu repeats, which show little correlation (r(2) = 0.07). We estimate that the "rate" of Alu retrotransposition has differed by a factor of 15-fold in these lineages. Humans, chimpanzees, and bonobos show the highest rates of Alu accumulation--the latter two since divergence 1.5 Mya. The L1 insertion rate, in contrast, has remained relatively constant, with rates differing by less than a factor of three. We conclude that Alu retrotransposition has been the most variable form of genetic variation during recent human-great ape evolution, with increases and decreases occurring over very short periods of evolutionary time.
Collapse
|
21
|
Pfendner EG, Uitto J, Gerard GF, Terry SF. Pseudoxanthoma elasticum: genetic diagnostic markers. ACTA ACUST UNITED AC 2013; 2:63-79. [PMID: 23485117 DOI: 10.1517/17530059.2.1.63] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Pseudoxanthoma elasticum (PXE), an autosomal recessive disorder with considerable phenotypic variability, mainly affects the eyes, skin and cardiovascular system, and is characterized by ectopic mineralization of elastic fibers of connective tissues. Since the identification of the ABCC6 gene (ATP-binding cassette family C member 6), which encodes a putative transmembrane transporter (ABCC6), as the site of mutations responsible for PXE, a number of researchers have disclosed mutations spanning the entire gene. An important advance in the ability to identify mutations has been the identification of two closely related pseudogenes and identifying sequence differences between the coding gene and the pseudogenes allowing accurate sequencing. In this review, the mutation spectrum in PXE is summarized and a strategy to optimize mutation detection in this difficult disorder is outlined.
Collapse
|
22
|
Abstract
Long interspersed element-1 (LINE-1 or L1) is a repetitive DNA retrotransposon capable of duplication by a copy-and-paste genetic mechanism. Scattered throughout mammalian genomes, L1 is typically quiescent in most somatic cell types. In developing neurons, however, L1 can express and retrotranspose at high frequency. The L1 element can insert into various genomic locations including intragenic regions. These insertions can alter the dynamic of the neuronal transcriptome by changing the expression pattern of several nearby genes. The consequences of L1 genomic alterations in somatic cells are still under investigation, but the high level of mutagenesis within neurons suggests that each neuron is genetically unique. Furthermore, some neurological diseases, such as Rett syndrome and ataxia telangiectasia, misregulate L1 retrotransposition, which could contribute to some pathological aspects. In this review, we survey the literature related to neurodevelopmental retrotransposition and discuss possible relevance to neuronal function, evolution, and neurological disease.
Collapse
Affiliation(s)
- Charles A Thomas
- Department of Pediatrics/Rady Children's Hospital San Diego, University of California San Diego, La Jolla, California 92093, USA
| | | | | |
Collapse
|
23
|
Research proceedings on primate comparative genomics. Zool Res 2013; 33:108-18. [DOI: 10.3724/sp.j.1141.2012.01108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
|
24
|
Abstract
It is well known that remnants of partial or whole copies of mitochondrial DNA, known as Nuclear MiTochondrial sequences (NUMTs), are found in nuclear genomes. Since whole genome sequences have become available, many bioinformatics studies have identified putative NUMTs and from those attempted to infer the factors involved in NUMT creation. These studies conclude that NUMTs represent randomly chosen regions of the mitochondrial genome. There is less consensus regarding the nuclear insertion sites of NUMTs - previous studies have discussed the possible role of retrotransposons, but some recent ones have reported no correlation or even anti-correlation between NUMT sites and retrotransposons. These studies have generally defined NUMT sites using BLAST with default parameters. We analyze a redefined set of human NUMTs, computed with a carefully considered protocol. We discover that the inferred insertion points of NUMTs have a strong tendency to have high-predicted DNA curvature, occur in experimentally defined open chromatin regions and often occur immediately adjacent to A + T oligomers. We also show clear evidence that their flanking regions are indeed rich in retrotransposons. Finally we show that parts of the mitochondrial genome D-loop are under-represented as a source of NUMTs in primate evolution.
Collapse
Affiliation(s)
- Junko Tsuji
- Department of Computational Biology, Graduate School of Frontier Science, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8561, Japan
| | | | | | | |
Collapse
|
25
|
Redi CA, Capanna E. Genome size evolution: sizing mammalian genomes. Cytogenet Genome Res 2012; 137:97-112. [PMID: 22627028 DOI: 10.1159/000338820] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
The study of genome size (GS) and its variation is so fascinating to the scientific community because it constitutes the link between the present-day analytical and molecular studies of the genome and the old trunk of the holistic and synthetic view of the genome. The GS of several taxa vary over a broad range and do not correlate with the complexity of the organisms (the C-value paradox). However, the biology of transposable elements has let us reach a satisfactory view of the molecular mechanisms that give rise to GS variation and novelties, providing a less perplexing view of the significance of the GS (C-enigma). The knowledge of the composition and structure of a genome is a pre-requisite for trying to understand the evolution of the main genome signature: its size. The radiation of mammals provides an approximately 180-million-year test case for theories of how GS evolves. It has been found from data-mining GS databases that GS is a useful cyto-taxonomical instrument at the level of orders/superorders, providing genomic signatures characterizing Monotremata, Marsupialia, Afrotheria, Xenarthra, Laurasiatheria, and Euarchontoglires. A hypothetical ancestral mammalian-like GS of 2.9-3.7 pg has been suggested. This value appears compatible with the average values calculated for the high systematic levels of the extant Monotremata (∼2.97 pg) and Marsupialia (∼4.07 pg), suggesting invasion of mobile DNA elements concurrently with the separation of the older clades of Afrotheria (∼5.5 pg) and Xenarthra (∼4.5 pg) with larger GS, leaving the Euarchontoglires (∼3.4 pg) and Laurasiatheria (∼2.8 pg) genomes with fewer transposable elements. However, the paucity of GS data (546 mammalian species sized from 5,488 living species) for species, genera, and families calls for caution. Considering that mammalian species may be vanished even before they are known, GS data are sorely needed to phenotype the effects brought about by their variation and to validate any hypotheses on GS evolution in mammals.
Collapse
Affiliation(s)
- C A Redi
- Fondazione IRCCS Policlinico San Matteo, Dipartimento di Biologia e Biotecnologie Lazzaro Spallanzani, Pavia, Italia.
| | | |
Collapse
|
26
|
On parameters of the human genome. J Theor Biol 2011; 288:92-104. [DOI: 10.1016/j.jtbi.2011.07.021] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2011] [Revised: 06/28/2011] [Accepted: 07/21/2011] [Indexed: 02/06/2023]
|
27
|
Oliver KR, Greene WK. Mobile DNA and the TE-Thrust hypothesis: supporting evidence from the primates. Mob DNA 2011; 2:8. [PMID: 21627776 PMCID: PMC3123540 DOI: 10.1186/1759-8753-2-8] [Citation(s) in RCA: 77] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2011] [Accepted: 05/31/2011] [Indexed: 02/07/2023] Open
Abstract
Transposable elements (TEs) are increasingly being recognized as powerful facilitators of evolution. We propose the TE-Thrust hypothesis to encompass TE-facilitated processes by which genomes self-engineer coding, regulatory, karyotypic or other genetic changes. Although TEs are occasionally harmful to some individuals, genomic dynamism caused by TEs can be very beneficial to lineages. This can result in differential survival and differential fecundity of lineages. Lineages with an abundant and suitable repertoire of TEs have enhanced evolutionary potential and, if all else is equal, tend to be fecund, resulting in species-rich adaptive radiations, and/or they tend to undergo major evolutionary transitions. Many other mechanisms of genomic change are also important in evolution, and whether the evolutionary potential of TE-Thrust is realized is heavily dependent on environmental and ecological factors. The large contribution of TEs to evolutionary innovation is particularly well documented in the primate lineage. In this paper, we review numerous cases of beneficial TE-caused modifications to the genomes of higher primates, which strongly support our TE-Thrust hypothesis.
Collapse
Affiliation(s)
- Keith R Oliver
- School of Biological Sciences and Biotechnology, Faculty of Science and Engineering, Murdoch University, Perth W. A. 6150, Australia
| | - Wayne K Greene
- School of Veterinary and Biomedical Sciences, Faculty of Health Sciences, Murdoch University, Perth W. A. 6150, Australia
| |
Collapse
|
28
|
Horvath JE, Sheedy CB, Merrett SL, Diallo AB, Swofford DL, NISC Comparative Sequencing Program, Green ED, Willard HF. Comparative analysis of the primate X-inactivation center region and reconstruction of the ancestral primate XIST locus. Genome Res 2011; 21:850-62. [PMID: 21518738 DOI: 10.1101/gr.111849.110] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
Here we provide a detailed comparative analysis across the candidate X-Inactivation Center (XIC) region and the XIST locus in the genomes of six primates and three mammalian outgroup species. Since lemurs and other strepsirrhine primates represent the sister lineage to all other primates, this analysis focuses on lemurs to reconstruct the ancestral primate sequences and to gain insight into the evolution of this region and the genes within it. This comparative evolutionary genomics approach reveals significant expansion in genomic size across the XIC region in higher primates, with minimal size alterations across the XIST locus itself. Reconstructed primate ancestral XIC sequences show that the most dramatic changes during the past 80 million years occurred between the ancestral primate and the lineage leading to Old World monkeys. In contrast, the XIST locus compared between human and the primate ancestor does not indicate any dramatic changes to exons or XIST-specific repeats; rather, evolution of this locus reflects small incremental changes in overall sequence identity and short repeat insertions. While this comparative analysis reinforces that the region around XIST has been subject to significant genomic change, even among primates, our data suggest that evolution of the XIST sequences themselves represents only small lineage-specific changes across the past 80 million years.
Collapse
Affiliation(s)
- Julie E Horvath
- Duke Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina 27708, USA.
| | | | | | | | | | | | | | | |
Collapse
|
29
|
Comparative and demographic analysis of orang-utan genomes. Nature 2011; 469:529-33. [PMID: 21270892 PMCID: PMC3060778 DOI: 10.1038/nature09687] [Citation(s) in RCA: 372] [Impact Index Per Article: 28.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2010] [Accepted: 11/19/2010] [Indexed: 11/10/2022]
Abstract
“Orangutan” is derived from the Malay term “man of the forest” and aptly describes the Southeast Asian great apes native to Sumatra and Borneo. The orangutan species, Pongo abelii (Sumatran) and Pongo pygmaeus (Bornean), are the most phylogenetically distant great apes from humans, thereby providing an informative perspective on hominid evolution. Here we present a Sumatran orangutan draft genome assembly and short read sequence data from five Sumatran and five Bornean orangutan genomes. Our analyses reveal that, compared to other primates, the orangutan genome has many unique features. Structural evolution of the orangutan genome has proceeded much more slowly than other great apes, evidenced by fewer rearrangements, less segmental duplication, a lower rate of gene family turnover and surprisingly quiescent Alu repeats, which have played a major role in restructuring other primate genomes. We also describe the first primate polymorphic neocentromere, found in both Pongo species, emphasizing the gradual evolution of orangutan genome structure. Orangutans have extremely low energy usage for a eutherian mammal1, far lower than their hominid relatives. Adding their genome to the repertoire of sequenced primates illuminates new signals of positive selection in several pathways including glycolipid metabolism. From the population perspective, both Pongo species are deeply diverse; however, Sumatran individuals possess greater diversity than their Bornean counterparts, and more species-specific variation. Our estimate of Bornean/Sumatran speciation time, 400k years ago (ya), is more recent than most previous studies and underscores the complexity of the orangutan speciation process. Despite a smaller modern census population size, the Sumatran effective population size (Ne) expanded exponentially relative to the ancestral Ne after the split, while Bornean Ne declined over the same period. Overall, the resources and analyses presented here offer new opportunities in evolutionary genomics, insights into hominid biology, and an extensive database of variation for conservation efforts.
Collapse
|
30
|
Harris EE. Nonadaptive processes in primate and human evolution. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY 2011; 143 Suppl 51:13-45. [PMID: 21086525 DOI: 10.1002/ajpa.21439] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
Evolutionary biology has tended to focus on adaptive evolution by positive selection as the primum mobile of evolutionary trajectories in species while underestimating the importance of nonadaptive evolutionary processes. In this review, I describe evidence that suggests that primate and human evolution has been strongly influenced by nonadaptive processes, particularly random genetic drift and mutation. This is evidenced by three fundamental effects: a relative relaxation of selective constraints (i.e., purifying selection), a relative increase in the fixation of slightly deleterious mutations, and a general reduction in the efficacy of positive selection. These effects are observed in protein-coding, regulatory regions, and in gene expression data, as well as in an augmentation of fixation of large-scale mutations, including duplicated genes, mobile genetic elements, and nuclear mitochondrial DNA. The evidence suggests a general population-level explanation such as a reduction in effective population size (N(e)). This would have tipped the balance between the evolutionary forces of natural selection and random genetic drift toward genetic drift for variants having small selective effects. After describing these proximate effects, I describe the potential consequences of these effects for primate and human evolution. For example, an increase in the fixation of slightly deleterious mutations could potentially have led to an increase in the fixation rate of compensatory mutations that act to suppress the effects of slightly deleterious substitutions. The potential consequences of compensatory evolution for the evolution of novel gene functions and in potentially confounding the detection of positively selected genes are explored. The consequences of the passive accumulation of large-scale genomic mutations by genetic drift are unclear, though evidence suggests that new gene copies as well as insertions of transposable elements into genes can potentially lead to adaptive phenotypes. Finally, because a decrease in selective constraint at the genetic level is expected to have effects at the morphological level, I review studies that compare rates of morphological change in various mammalian and island populations where N(e) is reduced. Furthermore, I discuss evidence that suggests that craniofacial morphology in the Homo lineage has shifted from an evolutionary rate constrained by purifying selection toward a neutral evolutionary rate.
Collapse
Affiliation(s)
- Eugene E Harris
- Department of Biological Sciences and Geology, Queensborough Community College, City University of New York, Bayside, NY 10364, USA.
| |
Collapse
|
31
|
Affiliation(s)
- Miriam K Konkel
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Bldg., Baton Rouge, LA 70803, USA
| | - Jerilyn A Walker
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Bldg., Baton Rouge, LA 70803, USA
| | - Mark A Batzer
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Bldg., Baton Rouge, LA 70803, USA
| |
Collapse
|
32
|
Chen HD, Fan WL, Kong SG, Lee HC. Universal global imprints of genome growth and evolution--equivalent length and cumulative mutation density. PLoS One 2010; 5:e9844. [PMID: 20418954 PMCID: PMC2854691 DOI: 10.1371/journal.pone.0009844] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2009] [Accepted: 02/08/2010] [Indexed: 11/19/2022] Open
Abstract
Background Segmental duplication is widely held to be an important mode of genome growth and evolution. Yet how this would affect the global structure of genomes has been little discussed. Methods/Principal Findings Here, we show that equivalent length, or , a quantity determined by the variance of fluctuating part of the distribution of the -mer frequencies in a genome, characterizes the latter's global structure. We computed the s of 865 complete chromosomes and found that they have nearly universal but (-dependent) values. The differences among the of a chromosome and those of its coding and non-coding parts were found to be slight. Conclusions We verified that these non-trivial results are natural consequences of a genome growth model characterized by random segmental duplication and random point mutation, but not of any model whose dominant growth mechanism is not segmental duplication. Our study also indicates that genomes have a nearly universal cumulative “point” mutation density of about 0.73 mutations per site that is compatible with the relatively low mutation rates of (15)10/site/Mya previously determined by sequence comparison for the human and E. coli genomes.
Collapse
Affiliation(s)
- Hong-Da Chen
- Graduate Institute of Systems Biology and Bioinformatics, National Central University, Chungli, Taiwan
- Department of Physics, National Central University, Chungli, Taiwan
| | - Wen-Lang Fan
- Department of Physics, National Central University, Chungli, Taiwan
- Genomic Research Center, Academia Sinaca, Taipei, Taiwan
| | - Sing-Guan Kong
- Graduate Institute of Systems Biology and Bioinformatics, National Central University, Chungli, Taiwan
- Department of Physics, National Central University, Chungli, Taiwan
| | - Hoong-Chien Lee
- Graduate Institute of Systems Biology and Bioinformatics, National Central University, Chungli, Taiwan
- Department of Physics, National Central University, Chungli, Taiwan
- Cathay Medical Research Institute, Cathay General Hospital, Taipei, Taiwan
- National Center for Theoretical Science, Shinchu, Taiwan
- * E-mail:
| |
Collapse
|
33
|
Sgaramella V, Astolfi PA. Somatic genome variations interact with environment, genome and epigenome in the determination of the phenotype: A paradigm shift in genomics? DNA Repair (Amst) 2010; 9:470-3. [PMID: 20153268 DOI: 10.1016/j.dnarep.2009.11.011] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2009] [Accepted: 11/03/2009] [Indexed: 01/18/2023]
|
34
|
Abstract
Advocates of chimpanzee research claim the genetic similarity of humans and chimpanzees make them an indispensable research tool to combat human diseases. Given that cancer is a leading cause of human death worldwide, one might expect that if chimpanzees were needed for, or were productive in, cancer research, then they would have been widely used. This comprehensive literature analysis reveals that chimpanzees have scarcely been used in any form of cancer research, and that chimpanzee tumours are extremely rare and biologically different from human cancers. Often, chimpanzee citations described peripheral use of chimpanzee cells and genetic material in predominantly human genomic studies. Papers describing potential new cancer therapies noted significant concerns regarding the chimpanzee model. Other studies described interventions that have not been pursued clinically. Finally, available evidence indicates that chimpanzees are not essential in the development of therapeutic monoclonal antibodies. It would therefore be unscientific to claim that chimpanzees are vital to cancer research. On the contrary, it is reasonable to conclude that cancer research would not suffer, if the use of chimpanzees for this purpose were prohibited in the US. Genetic differences between humans and chimpanzees, make them an unsuitable model for cancer, as well as other human diseases.
Collapse
Affiliation(s)
- Jarrod Bailey
- New England Anti-Vivisection Society, Boston, MA 02108-5100, USA.
| |
Collapse
|
35
|
Marques-Bonet T, Ryder OA, Eichler EE. Sequencing primate genomes: what have we learned? Annu Rev Genomics Hum Genet 2009; 10:355-86. [PMID: 19630567 DOI: 10.1146/annurev.genom.9.081307.164420] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
We summarize the progress in whole-genome sequencing and analyses of primate genomes. These emerging genome datasets have broadened our understanding of primate genome evolution revealing unexpected and complex patterns of evolutionary change. This includes the characterization of genome structural variation, episodic changes in the repeat landscape, differences in gene expression, new models regarding speciation, and the ephemeral nature of the recombination landscape. The functional characterization of genomic differences important in primate speciation and adaptation remains a significant challenge. Limited access to biological materials, the lack of detailed phenotypic data and the endangered status of many critical primate species have significantly attenuated research into the genetic basis of primate evolution. Next-generation sequencing technologies promise to greatly expand the number of available primate genome sequences; however, such draft genome sequences will likely miss critical genetic differences within complex genomic regions unless dedicated efforts are put forward to understand the full spectrum of genetic variation.
Collapse
Affiliation(s)
- Tomas Marques-Bonet
- Department of Genome Sciences, University of Washington and the Howard Hughes Medical Institute, Seattle, Washington 98105, USA.
| | | | | |
Collapse
|
36
|
Cordaux R, Batzer MA. The impact of retrotransposons on human genome evolution. Nat Rev Genet 2009; 10:691-703. [PMID: 19763152 DOI: 10.1038/nrg2640] [Citation(s) in RCA: 1115] [Impact Index Per Article: 74.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Their ability to move within genomes gives transposable elements an intrinsic propensity to affect genome evolution. Non-long terminal repeat (LTR) retrotransposons--including LINE-1, Alu and SVA elements--have proliferated over the past 80 million years of primate evolution and now account for approximately one-third of the human genome. In this Review, we focus on this major class of elements and discuss the many ways that they affect the human genome: from generating insertion mutations and genomic instability to altering gene expression and contributing to genetic innovation. Increasingly detailed analyses of human and other primate genomes are revealing the scale and complexity of the past and current contributions of non-LTR retrotransposons to genomic change in the human lineage.
Collapse
Affiliation(s)
- Richard Cordaux
- CNRS UMR 6556 Ecologie, Evolution, Symbiose, Université de Poitiers, 40 Avenue du Recteur Pineau, Poitiers, France
| | | |
Collapse
|
37
|
Liu GE, Alkan C, Jiang L, Zhao S, Eichler EE. Comparative analysis of Alu repeats in primate genomes. Genome Res 2009; 19:876-85. [PMID: 19411604 DOI: 10.1101/gr.083972.108] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Using bacteria artificial chromosome (BAC) end sequences (16.9 Mb) and high-quality alignments of genomic sequences (17.4 Mb), we performed a global assessment of the divergence distributions, phylogenies, and consensus sequences for Alu elements in primates including lemur, marmoset, macaque, baboon, and chimpanzee as compared to human. We found that in lemurs, Alu elements show a broader and more symmetric sequence divergence distribution, suggesting a steady rate of Alu retrotransposition activity among prosimians. In contrast, Alu elements in anthropoids show a skewed distribution shifted toward more ancient elements with continual declining rates in recent Alu activity along the hominoid lineage of evolution. Using an integrated approach combining mutation profile and insertion/deletion analyses, we identified nine novel lineage-specific Alu subfamilies in lemur (seven), marmoset (one), and baboon/macaque (one) containing multiple diagnostic mutations distinct from their human counterparts-Alu J, S, and Y subfamilies, respectively. Among these primates, we show that that the lemur has the lowest density of Alu repeats (55 repeats/Mb), while marmoset has the greatest abundance (188 repeats/Mb). We estimate that approximately 70% of lemur and 16% of marmoset Alu elements belong to lineage-specific subfamilies. Our analysis has provided an evolutionary framework for further classification and refinement of the Alu repeat phylogeny. The differences in the distribution and rates of Alu activity have played an important role in subtly reshaping the structure of primate genomes. The functional consequences of these changes among the diverse primate lineages over such short periods of evolutionary time are an important area of future investigation.
Collapse
Affiliation(s)
- George E Liu
- USDA, ARS, ANRI, Bovine Functional Genomics Laboratory, Beltsville, MD 20705, USA.
| | | | | | | | | |
Collapse
|
38
|
Novick P, Smith J, Ray D, Boissinot S. Independent and parallel lateral transfer of DNA transposons in tetrapod genomes. Gene 2009; 449:85-94. [PMID: 19747963 DOI: 10.1016/j.gene.2009.08.017] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2009] [Revised: 07/08/2009] [Accepted: 08/28/2009] [Indexed: 01/31/2023]
Abstract
In animals, the mode of transmission of transposable elements is generally vertical. However, recent studies have suggested that lateral transfer has occurred repeatedly in several distantly related tetrapod lineages, including mammals. Using transposons extracted from the genome of the lizard Anolis carolinensis as probes, we identified four novel families of hAT transposons that share extremely high similarity with elements in other genomes including several mammalian lineages (primates, chiropters, marsupials), one amphibian and one flatworm, the planarian Schmidtea mediterranea. The discontinuous phylogenetic distribution of these hAT families, coupled with very low synonymous divergence between species, strongly suggests that these elements were laterally transferred to these different species. This indicates that the horizontal transfer of DNA transposons in vertebrates might be more common than previously thought. Yet, it appears that the transfer of DNA transposons did not occur randomly as the same genomes have been invaded independently by different, unrelated transposon families whereas others seem to be immune to lateral transfer. This suggests that some organisms might be intrinsically more vulnerable to DNA transposon lateral transfer, possibly because of a weakened defense against transposons or because they have developed mechanisms to tolerate their impact.
Collapse
Affiliation(s)
- Peter Novick
- Department of Biology, Queens College, the City University of New York, Flushing, NY 11367, USA
| | | | | | | |
Collapse
|
39
|
Chapus C, Edwards SV. Genome evolution in Reptilia: in silico chicken mapping of 12,000 BAC-end sequences from two reptiles and a basal bird. BMC Genomics 2009; 10 Suppl 2:S8. [PMID: 19607659 PMCID: PMC2966332 DOI: 10.1186/1471-2164-10-s2-s8] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
BACKGROUND With the publication of the draft chicken genome and the recent production of several BAC clone libraries from non-avian reptiles and birds, it is now possible to undertake more detailed comparative genomic studies in Reptilia. Of interest in particular are the genomic events that transformed the large, repeat-rich genomes of mammals and non-avian reptiles into the minimalist chicken genome. We have used paired BAC end sequences (BESs) from the American alligator (Alligator mississippiensis), painted turtle (Chrysemys picta) and emu (Dromaius novaehollandiae) to investigate patterns of sequence divergence, gene and retroelement content, and microsynteny between these species and chicken. RESULTS From a total of 11,967 curated BESs, we successfully mapped 725, 773 and 2597 sequences in alligator, turtle, and emu, respectively, to sites in the draft chicken genome using a stringent BLAST protocol. Most commonly, sequences mapped to a single site in the chicken genome. Of 1675, 1828 and 2936 paired BESs obtained for alligator, turtle, and emu, respectively, a total of 34 (alligator, 2%), 24 (turtle, 1.3%) and 479 (emu, 16.3%) pairs were found to map with high confidence and in the correct orientation and with BAC-sized intermarker distances to single chicken chromosomes, including 25 such paired hits in emu mapping to the chicken Z chromosome. By determining the insert sizes of a subset of BAC clones from these three species, we also found a significant correlation between the intermarker distance in alligator and turtle and in chicken, with slopes as expected on the basis of the ratio of the genome sizes. CONCLUSION Our results suggest that a large number of small-scale chromosomal rearrangements and deletions in the lineage leading to chicken have drastically reduced the number of detected syntenies observed between the chicken and alligator, turtle, and emu genomes and imply that small deletions occurring widely throughout the genomes of reptilian and avian ancestors led to the ~50% reduction in genome size observed in birds compared to reptiles. We have also mapped and identified likely gene regions in hundreds of new BAC clones from these species.
Collapse
Affiliation(s)
- Charles Chapus
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
| | - Scott V Edwards
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
| |
Collapse
|
40
|
Thomson SJP, Goh FG, Banks H, Krausgruber T, Kotenko SV, Foxwell BMJ, Udalova IA. The role of transposable elements in the regulation of IFN-lambda1 gene expression. Proc Natl Acad Sci U S A 2009; 106:11564-9. [PMID: 19570999 PMCID: PMC2710658 DOI: 10.1073/pnas.0904477106] [Citation(s) in RCA: 94] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2009] [Indexed: 01/21/2023] Open
Abstract
IFNs lambda1, lambda2, and lambda3, or type III IFNs, are recently identified cytokines distantly related to type I IFNs. Despite an early evolutionary divergence, the 2 types of IFNs display similar antiviral activities, and both are produced primarily in dendritic cells. Although virus induction of the type I IFN-beta gene had served as a paradigm of gene regulation, relatively little is known about the regulation of IFN-lambda gene expression. Studies of virus induction of IFN-lambda1 identified an essential role of IFN regulatory factors (IRF) 3 and 7, which bind to a regulatory DNA sequence near the start site of transcription. Here, we report that the proximal promoter region of the IFN-lambda1 regulatory region is not sufficient for maximal gene induction in response to bacterial LPS, and we identify an essential cluster of homotypic NF-kappaB binding sites. Remarkably, these sites, which bind efficiently to NF-kappaB and function independently of the IRF3/7 binding sites, originate as transposable elements of the Alu and LTR families. We also show that depletion of the NF-kappaB RelA protein significantly reduces the level of the IFN-lambda1 gene expression. We conclude that IFN-lambda1 gene expression requires NF-kappaB, and we propose a model for IFN-lambda1 gene regulation, in which IRF and NF-kappaB activate gene expression independently via spatially separated promoter elements. These observations provide insights into the independent evolution of the IFN-lambda1 and IFN-beta promoters and directly implicate transposable elements in the regulation of the IFN-lambda1 gene by NF-kappaB.
Collapse
Affiliation(s)
- Scott J. P. Thomson
- Kennedy Institute of Rheumatology Division, Faculty of Medicine, Imperial College of Science, Technology, and Medicine, 65 Aspenlea Road, London W6 8LH, United Kingdom; and
| | - Fui G. Goh
- Kennedy Institute of Rheumatology Division, Faculty of Medicine, Imperial College of Science, Technology, and Medicine, 65 Aspenlea Road, London W6 8LH, United Kingdom; and
| | - Helen Banks
- Kennedy Institute of Rheumatology Division, Faculty of Medicine, Imperial College of Science, Technology, and Medicine, 65 Aspenlea Road, London W6 8LH, United Kingdom; and
| | - Thomas Krausgruber
- Kennedy Institute of Rheumatology Division, Faculty of Medicine, Imperial College of Science, Technology, and Medicine, 65 Aspenlea Road, London W6 8LH, United Kingdom; and
| | - Sergei V. Kotenko
- Department of Biochemistry and Molecular Biology, University of Medicine and Dentistry, New Jersey Medical School, Newark, NJ 07103
| | - Brian M. J. Foxwell
- Kennedy Institute of Rheumatology Division, Faculty of Medicine, Imperial College of Science, Technology, and Medicine, 65 Aspenlea Road, London W6 8LH, United Kingdom; and
| | - Irina A. Udalova
- Kennedy Institute of Rheumatology Division, Faculty of Medicine, Imperial College of Science, Technology, and Medicine, 65 Aspenlea Road, London W6 8LH, United Kingdom; and
| |
Collapse
|
41
|
Peng Z, Elango N, Wildman DE, Yi SV. Primate phylogenomics: developing numerous nuclear non-coding, non-repetitive markers for ecological and phylogenetic applications and analysis of evolutionary rate variation. BMC Genomics 2009; 10:247. [PMID: 19470178 PMCID: PMC2693144 DOI: 10.1186/1471-2164-10-247] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2008] [Accepted: 05/26/2009] [Indexed: 12/24/2022] Open
Abstract
Background Genetic analyses are often limited by the availability of appropriate molecular markers. Markers from neutrally evolving genomic regions may be particularly useful for inferring evolutionary histories because they escape the constraints of natural selection. For the majority of taxa however, obtaining such markers is challenging. Advances in genomics have the potential to alleviate the shortage of neutral markers. Here we present a method to develop numerous markers from putatively neutral regions of primate genomes. Results We began with the available whole genome sequences of human, chimpanzee and macaque. Using computational methods, we identified a total of 280 potential amplicons from putatively neutral, non-coding, non-repetitive regions of these genomes. Subsequently we amplified, using experimental methods, many of these amplicons from diverse primate taxa, including a ring-tailed lemur, which is distantly related to the genomic resources. Using a subset of 10 markers, we demonstrate the utility of the developed markers in phylogenetic and evolutionary rate analyses. Particularly, we uncovered substantial evolutionary rate variation among lineages, some of which are previously not reported. Conclusion We successfully developed numerous markers from putatively neutral regions of primate genomes using a strategy combining computational and experimental methods. Applying these markers to phylogenetic and evolutionary rate variation analyses exemplifies the utility of these markers. Diverse ecological and evolutionary analyses will benefit from these markers. Importantly, methods similar to those presented here can be applied to other taxa in the near future.
Collapse
Affiliation(s)
- Zuogang Peng
- School of Biology, Institute of Bioscience and Bioengineering, Institute of Biosystems, Georgia Institute of Technology, Atlanta, GA 30332, USA.
| | | | | | | |
Collapse
|
42
|
Xing J, Zhang Y, Han K, Salem AH, Sen SK, Huff CD, Zhou Q, Kirkness EF, Levy S, Batzer MA, Jorde LB. Mobile elements create structural variation: analysis of a complete human genome. Genome Res 2009; 19:1516-26. [PMID: 19439515 DOI: 10.1101/gr.091827.109] [Citation(s) in RCA: 220] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Structural variants (SVs) are common in the human genome. Because approximately half of the human genome consists of repetitive, transposable DNA sequences, it is plausible that these elements play an important role in generating SVs in humans. Sequencing of the diploid genome of one individual human (HuRef) affords us the opportunity to assess, for the first time, the impact of mobile elements on SVs in an individual in a thorough and unbiased fashion. In this study, we systematically evaluated more than 8000 SVs to identify mobile element-associated SVs as small as 100 bp and specific to the HuRef genome. Combining computational and experimental analyses, we identified and validated 706 mobile element insertion events (including Alu, L1, SVA elements, and nonclassical insertions), which added more than 305 kb of new DNA sequence to the HuRef genome compared with the Human Genome Project (HGP) reference sequence (hg18). We also identified 140 mobile element-associated deletions, which removed approximately 126 kb of sequence from the HuRef genome. Overall, approximately 10% of the HuRef-specific indels larger than 100 bp are caused by mobile element-associated events. More than one-third of the insertion/deletion events occurred in genic regions, and new Alu insertions occurred in exons of three human genes. Based on the number of insertions and the estimated time to the most recent common ancestor of HuRef and the HGP reference genome, we estimated the Alu, L1, and SVA retrotransposition rates to be one in 21 births, 212 births, and 916 births, respectively. This study presents the first comprehensive analysis of mobile element-related structural variants in the complete DNA sequence of an individual and demonstrates that mobile elements play an important role in generating inter-individual structural variation.
Collapse
Affiliation(s)
- Jinchuan Xing
- Department of Human Genetics, Eccles Institute of Human Genetics, University of Utah, Salt Lake City, Utah 84109, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
43
|
|
44
|
Gherman A, Chen PE, Teslovich TM, Stankiewicz P, Withers M, Kashuk CS, Chakravarti A, Lupski JR, Cutler DJ, Katsanis N. Population bottlenecks as a potential major shaping force of human genome architecture. PLoS Genet 2008; 3:e119. [PMID: 17658953 PMCID: PMC1925129 DOI: 10.1371/journal.pgen.0030119] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2007] [Accepted: 06/04/2007] [Indexed: 11/24/2022] Open
Abstract
The modern synthetic view of human evolution proposes that the fixation of novel mutations is driven by the balance among selective advantage, selective disadvantage, and genetic drift. When considering the global architecture of the human genome, the same model can be applied to understanding the rapid acquisition and proliferation of exogenous DNA. To explore the evolutionary forces that might have morphed human genome architecture, we investigated the origin, composition, and functional potential of numts (nuclear mitochondrial pseudogenes), partial copies of the mitochondrial genome found abundantly in chromosomal DNA. Our data indicate that these elements are unlikely to be advantageous, since they possess no gross positional, transcriptional, or translational features that might indicate beneficial functionality subsequent to integration. Using sequence analysis and fossil dating, we also show a probable burst of integration of numts in the primate lineage that centers on the prosimian–anthropoid split, mimics closely the temporal distribution of Alu and processed pseudogene acquisition, and coincides with the major climatic change at the Paleocene–Eocene boundary. We therefore propose a model according to which the gross architecture and repeat distribution of the human genome can be largely accounted for by a population bottleneck early in the anthropoid lineage and subsequent effectively neutral fixation of repetitive DNA, rather than positive selection or unusual insertion pressures. Throughout evolutionary history, fragments of the mitochondrial genome, known as numts (for nuclear mitochondrial sequences), have been inserted into the nuclear genome. These fragments are distinct from all other classes of repetitive DNA found in nuclear genomes, not least because they are incapable of mediating their own proliferation. Taking advantage of their unique evolutionary properties, we have used numts to improve our understanding of the architecture of the human genome with special emphasis on the mechanism of acquisition and retention of repeat sequences, which comprise the bulk of nuclear DNA. We find that numts are unlikely to have any evolutionary benefit driving their retention. Moreover, numts are not acquired randomly during evolutionary time. Instead, their rate of acquisition spikes dramatically around pronounced population bottlenecks, in a manner reminiscent of other repeat classes. Therefore, we propose that the primary driving force of repeat acquisition in the genome is not selection, but random genetic drift, whose force becomes pronounced during profound reductions of population size. Our findings support the theory of neutral evolution, according to which random genetic drift exerts an influence on the acquisition of DNA changes that far outweighs the power of positive selection.
Collapse
MESH Headings
- Alu Elements
- Animals
- Cell Line
- Chromosome Mapping
- DNA/genetics
- DNA, Mitochondrial/genetics
- Evolution, Molecular
- Genetic Drift
- Genetics, Population
- Genome, Human
- Genome, Mitochondrial
- Humans
- In Situ Hybridization, Fluorescence
- Models, Genetic
- Primates/genetics
- Protein Biosynthesis
- Pseudogenes
- Repetitive Sequences, Nucleic Acid
- Selection, Genetic
- Time Factors
- Transcription, Genetic
Collapse
Affiliation(s)
- Adrian Gherman
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Peter E Chen
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Tanya M Teslovich
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Pawel Stankiewicz
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
| | - Marjorie Withers
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
| | - Carl S Kashuk
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Aravinda Chakravarti
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - James R Lupski
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
- Department of Pediatrics, Baylor College of Medicine, Houston, Texas, United States of America
- Texas Children's Hospital, Houston, Texas, United States of America
| | - David J Cutler
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, Maryland, United States of America
- * To whom correspondence should be addressed. E-mail: (DJC); (NK)
| | - Nicholas Katsanis
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, Maryland, United States of America
- Wilmer Eye Institute, Johns Hopkins University, Baltimore, Maryland, United States of America
- * To whom correspondence should be addressed. E-mail: (DJC); (NK)
| |
Collapse
|
45
|
Abstract
Transposable elements are mobile genetic units that exhibit broad diversity in their structure and transposition mechanisms. Transposable elements occupy a large fraction of many eukaryotic genomes and their movement and accumulation represent a major force shaping the genes and genomes of almost all organisms. This review focuses on DNA-mediated or class 2 transposons and emphasizes how this class of elements is distinguished from other types of mobile elements in terms of their structure, amplification dynamics, and genomic effect. We provide an up-to-date outlook on the diversity and taxonomic distribution of all major types of DNA transposons in eukaryotes, including Helitrons and Mavericks. We discuss some of the evolutionary forces that influence their maintenance and diversification in various genomic environments. Finally, we highlight how the distinctive biological features of DNA transposons have contributed to shape genome architecture and led to the emergence of genetic innovations in different eukaryotic lineages.
Collapse
Affiliation(s)
- Cédric Feschotte
- Department of Biology, University of Texas, Arlington, TX 76019, USA.
| | | |
Collapse
|
46
|
Evolutionary rates and patterns for human transcription factor binding sites derived from repetitive DNA. BMC Genomics 2008; 9:226. [PMID: 18485226 PMCID: PMC2397414 DOI: 10.1186/1471-2164-9-226] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2008] [Accepted: 05/17/2008] [Indexed: 12/14/2022] Open
Abstract
Background The majority of human non-protein-coding DNA is made up of repetitive sequences, mainly transposable elements (TEs). It is becoming increasingly apparent that many of these repetitive DNA sequence elements encode gene regulatory functions. This fact has important evolutionary implications, since repetitive DNA is the most dynamic part of the genome. We set out to assess the evolutionary rate and pattern of experimentally characterized human transcription factor binding sites (TFBS) that are derived from repetitive versus non-repetitive DNA to test whether repeat-derived TFBS are in fact rapidly evolving. We also evaluated the position-specific patterns of variation among TFBS to look for signs of functional constraint on TFBS derived from repetitive and non-repetitive DNA. Results We found numerous experimentally characterized TFBS in the human genome, 7–10% of all mapped sites, which are derived from repetitive DNA sequences including simple sequence repeats (SSRs) and TEs. TE-derived TFBS sequences are far less conserved between species than TFBS derived from SSRs and non-repetitive DNA. Despite their rapid evolution, several lines of evidence indicate that TE-derived TFBS are functionally constrained. First of all, ancient TE families, such as MIR and L2, are enriched for TFBS relative to younger families like Alu and L1. Secondly, functionally important positions in TE-derived TFBS, specifically those residues thought to physically interact with their cognate protein binding factors (TF), are more evolutionarily conserved than adjacent TFBS positions. Finally, TE-derived TFBS show position-specific patterns of sequence variation that are highly distinct from random patterns and similar to the variation seen for non-repeat derived sequences of the same TFBS. Conclusion The abundance of experimentally characterized human TFBS that are derived from repetitive DNA speaks to the substantial regulatory effects that this class of sequence has on the human genome. The unique evolutionary properties of repeat-derived TFBS are perhaps even more intriguing. TE-derived TFBS in particular, while clearly functionally constrained, evolve extremely rapidly relative to non-repeat derived sites. Such rapidly evolving TFBS are likely to confer species-specific regulatory phenotypes, i.e. divergent expression patterns, on the human evolutionary lineage. This result has practical implications with respect to the widespread use of evolutionary conservation as a surrogate for functionally relevant non-coding DNA. Most TE-derived TFBS would be missed using the kinds of sequence conservation-based screens, such as phylogenetic footprinting, that are used to help characterize non-coding DNA. Thus, the very TFBS that are most likely to yield human-specific characteristics will be neglected by the comparative genomic techniques that are currently de rigeur for the identification of novel regulatory sites.
Collapse
|
47
|
Xing J, Witherspoon DJ, Ray DA, Batzer MA, Jorde LB. Mobile DNA elements in primate and human evolution. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY 2008; Suppl 45:2-19. [PMID: 18046749 DOI: 10.1002/ajpa.20722] [Citation(s) in RCA: 106] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
Roughly 50% of the primate genome consists of mobile, repetitive DNA sequences such as Alu and LINE1 elements. The causes and evolutionary consequences of mobile element insertion, which have received considerable attention during the past decade, are reviewed in this article. Because of their unique mutational mechanisms, these elements are highly useful for answering phylogenetic questions. We demonstrate how they have been used to help resolve a number of questions in primate phylogeny, including the human-chimpanzee-gorilla trichotomy and New World primate phylogeny. Alu and LINE1 element insertion polymorphisms have also been analyzed in human populations to test hypotheses about human evolution and population affinities and to address forensic issues. Finally, these elements have had impacts on the genome itself. We review how they have influenced fundamental ongoing processes like nonhomologous recombination, genomic deletion, and X chromosome inactivation.
Collapse
Affiliation(s)
- Jinchuan Xing
- Department of Human Genetics, University of Utah Health Sciences Center, Salt Lake City, UT 84112, USA
| | | | | | | | | |
Collapse
|
48
|
Feder ME. Evolvability of physiological and biochemical traits: evolutionary mechanisms including and beyond single-nucleotide mutation. ACTA ACUST UNITED AC 2008; 210:1653-60. [PMID: 17449831 DOI: 10.1242/jeb.02725] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
A longstanding challenge for biologists has been to explain not just how organisms are adapted to diverse environments, but how these adaptations arise. Although natural selection is clearly sufficient to act on heritable variation, is this heritable variation sufficient to yield complex adaptations and how does this variation itself arise? Much prior focus has been on mutation of single nucleotides in genes. This process is common and can have dramatic phenotypes, but could be limited in its ability to culminate in complex adaptations for two kinds of reasons: (i) because natural selection is powerful, it can purge genetic variation, and (ii) evolutionary transition from the absence to the presence of a complex adaptation seemingly requires multiple mutations at the right place and time and in the right sequence, with each intermediate stage having increased overall fitness; this seems highly improbable. Because the networks that organisms comprise are hierarchical and redundant and have modular structure, however, single-nucleotide mutations can have large and tolerable impacts. Diverse mechanisms, collectively evolutionary capacitors, can shield genetic variation from the purgative of selection. These features can enable evolution to proceed via single-nucleotide mutation. Importantly, single-nucleotide mutation usually only modifies existing genes rather than creating new ones, and numerous other mechanisms eclipse single-nucleotide mutation in creating genetic variation. These include gene duplication (both segmental and whole-genome), lateral gene transfer, hybridization, mobile genetic elements and symbiosis. Other processes can scramble and reassemble nucleotide sequence. The mechanisms beyond single-gene mutation offer considerable promise in detailing the evolution of complex physiological and biochemical traits, and have already done so for several morphological traits.
Collapse
Affiliation(s)
- Martin E Feder
- Department of Organismal Biology and Anatomy and The Committees on Evolutionary Biology, Genetics, and Molecular Medicine, The University of Chicago, 1027 E. 57th Street, Chicago, IL 60637, USA.
| |
Collapse
|
49
|
Gallach M, Arnau V, Marín I. Global patterns of sequence evolution in Drosophila. BMC Genomics 2007; 8:408. [PMID: 17996078 PMCID: PMC2180185 DOI: 10.1186/1471-2164-8-408] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2007] [Accepted: 11/09/2007] [Indexed: 01/30/2023] Open
Abstract
Background Sequencing of the genomes of several Drosophila allows for the first precise analyses of how global sequence patterns change among multiple, closely related animal species. A basic question is whether there are characteristic features that differentiate chromosomes within a species or between different species. Results We explored the euchromatin of the chromosomes of seven Drosophila species to establish their global patterns of DNA sequence diversity. Between species, differences in the types and amounts of simple sequence repeats were found. Within each species, the autosomes have almost identical oligonucleotide profiles. However, X chromosomes and autosomes have, in all species, a qualitatively different composition. The X chromosomes are less complex than the autosomes, containing both a higher amount of simple DNA sequences and, in several cases, chromosome-specific repetitive sequences. Moreover, we show that the right arm of the X chromosome of Drosophila pseudoobscura, which evolved from an autosome 10 – 18 millions of years ago, has a composition which is identical to that of the original, left arm of the X chromosome. Conclusion The consistent differences among species, differences among X chromosomes and autosomes and the convergent evolution of X and neo-X chromosomes demonstrate that strong forces are acting on drosophilid genomes to generate peculiar chromosomal landscapes. We discuss the relationships of the patterns observed with differential recombination and mutation rates and with the process of dosage compensation.
Collapse
Affiliation(s)
- Miguel Gallach
- Instituto de Biomedicina de Valencia, Consejo Superior de Investigaciones Científicas (IBV-CSIC), Valencia, Spain.
| | | | | |
Collapse
|
50
|
Han K, Lee J, Meyer TJ, Wang J, Sen SK, Srikanta D, Liang P, Batzer MA. Alu recombination-mediated structural deletions in the chimpanzee genome. PLoS Genet 2007; 3:1939-49. [PMID: 17953488 PMCID: PMC2041999 DOI: 10.1371/journal.pgen.0030184] [Citation(s) in RCA: 71] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2007] [Accepted: 09/07/2007] [Indexed: 12/02/2022] Open
Abstract
With more than 1.2 million copies, Alu elements are one of the most important sources of structural variation in primate genomes. Here, we compare the chimpanzee and human genomes to determine the extent of Alu recombination-mediated deletion (ARMD) in the chimpanzee genome since the divergence of the chimpanzee and human lineages (∼6 million y ago). Combining computational data analysis and experimental verification, we have identified 663 chimpanzee lineage-specific deletions (involving a total of ∼771 kb of genomic sequence) attributable to this process. The ARMD events essentially counteract the genomic expansion caused by chimpanzee-specific Alu inserts. The RefSeq databases indicate that 13 exons in six genes, annotated as either demonstrably or putatively functional in the human genome, and 299 intronic regions have been deleted through ARMDs in the chimpanzee lineage. Therefore, our data suggest that this process may contribute to the genomic and phenotypic diversity between chimpanzees and humans. In addition, we found four independent ARMD events at orthologous loci in the gorilla or orangutan genomes. This suggests that human orthologs of loci at which ARMD events have already occurred in other nonhuman primate genomes may be “at-risk” motifs for future deletions, which may subsequently contribute to human lineage-specific genetic rearrangements and disorders. The recent sequencing of a number of primate genomes shows that small segments of DNA known as Alu elements are found repeatedly along all chromosomes, and indeed comprise ∼10% of the human genome. Although older Alu elements that have been in the genome for a long time accumulate some random mutations, overall these elements retain high levels of sequence identity among themselves. The presence of many near-identical Alu elements located close to each other makes primate genomes prone to DNA recombination events that generate genomic deletions of varying sizes. Here, by scanning the chimpanzee genome for such deletions, we determined the role of the Alu recombination-mediated deletion process in creating structural differences between the chimpanzee and human genomes. Using a combination of computational and experimental techniques, we identified 663 deletions, involving the removal of ∼771 kb of genomic sequence. Interestingly, about half of these deletions were located within known or predicted genes, and in several cases, the deletions removed coding exons from chimpanzee genes as compared to their human counterparts. Alu recombination-mediated deletion shows signs of being a major sculptor of primate genomes and may be responsible for generating some of the genetic differences between humans and chimpanzees.
Collapse
Affiliation(s)
- Kyudong Han
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Biological Computation and Visualization Center, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Center for BioModular Multi-Scale Systems, Louisiana State University, Baton Rouge, Louisiana, United States of America
| | - Jungnam Lee
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Biological Computation and Visualization Center, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Center for BioModular Multi-Scale Systems, Louisiana State University, Baton Rouge, Louisiana, United States of America
| | - Thomas J Meyer
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Biological Computation and Visualization Center, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Center for BioModular Multi-Scale Systems, Louisiana State University, Baton Rouge, Louisiana, United States of America
| | - Jianxin Wang
- Department of Cancer Genetics, Roswell Park Cancer Institute, New York, United States of America
| | - Shurjo K Sen
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Biological Computation and Visualization Center, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Center for BioModular Multi-Scale Systems, Louisiana State University, Baton Rouge, Louisiana, United States of America
| | - Deepa Srikanta
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Biological Computation and Visualization Center, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Center for BioModular Multi-Scale Systems, Louisiana State University, Baton Rouge, Louisiana, United States of America
| | - Ping Liang
- Department of Cancer Genetics, Roswell Park Cancer Institute, New York, United States of America
| | - Mark A Batzer
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Biological Computation and Visualization Center, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Center for BioModular Multi-Scale Systems, Louisiana State University, Baton Rouge, Louisiana, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|