51
|
Cattle genomics and its implications for future nutritional strategies for dairy cattle. Animal 2011; 7 Suppl 1:172-83. [PMID: 23031138 DOI: 10.1017/s1751731111002588] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
The recently sequenced cattle (Bos taurus) genome unraveled the unique genomic features of the species and provided the molecular basis for applying a systemic approach to systematically link genomic information to metabolic traits. Comparative analysis has identified a variety of evolutionary adaptive features in the cattle genome, such as an expansion of the gene families related to the rumen function, large number of chromosomal rearrangements affecting regulation of genes for lactation, and chromosomal rearrangements that are associated with segmental duplications and copy number variations. Metabolic reconstruction of the cattle genome has revealed that core metabolic pathways are highly conserved among mammals although five metabolic genes are deleted or highly diverged and seven metabolic genes are present in duplicate in the cattle genome compared to their human counter parts. The evolutionary loss and gain of metabolic genes in the cattle genome may reflect metabolic adaptations of cattle. Metabolic reconstruction also provides a platform for better understanding of metabolic regulation in cattle and ruminants. A substantial body of transcriptomics data from dairy and beef cattle under different nutritional management and across different stages of growth and lactation are already available and will aid in linking the genome with metabolism and nutritional physiology of cattle. Application of cattle genomics has great potential for future development of nutritional strategies to improve efficiency and sustainability of beef and milk production. One of the biggest challenges is to integrate genomic and phenotypic data and interpret them in a biological and practical platform. Systems biology, a holistic and systemic approach, will be very useful in overcoming this challenge.
Collapse
|
52
|
MONTI VALENTINA, MANDRIOLI MAURO, RIVI MARCO, MANICARDI GIANCARLO. The vanishing clone: karyotypic evidence for extensive intraclonal genetic variation in the peach potato aphid, Myzus persicae (Hemiptera: Aphididae). Biol J Linn Soc Lond 2011. [DOI: 10.1111/j.1095-8312.2011.01812.x] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
53
|
Abstract
BACKGROUND Segmental duplications in genomes have been studied for many years. Recently, several studies have highlighted a biological phenomenon called breakpoint-duplication that apparently associates a significant proportion of segmental duplications in Mammals, and the Drosophila species group, to breakpoints in rearrangement events. RESULTS In this paper, we introduce and study a combinatorial problem, inspired from the breakpoint-duplication phenomenon, called the Genome Dedoubling Problem. It consists of finding a minimum length rearrangement scenario required to transform a genome with duplicated segments into a non-duplicated genome such that duplications are caused by rearrangement breakpoints. We show that the problem, in the Double-Cut-and-Join (DCJ) and the reversal rearrangement models, can be reduced to an APX-complete problem, and we provide algorithms for the Genome Dedoubling Problem with 2-approximable parts. We apply the methods for the reconstruction of a non-duplicated ancestor of Drosophila yakuba. CONCLUSIONS We present the Genome Dedoubling Problem, and describe two algorithms solving the problem in the DCJ model, and the reversal model. The usefulness of the problems and the methods are showed through an application to real Drosophila data.
Collapse
|
54
|
Anchoring the dog to its relatives reveals new evolutionary breakpoints across 11 species of the Canidae and provides new clues for the role of B chromosomes. Chromosome Res 2011; 19:685-708. [DOI: 10.1007/s10577-011-9233-4] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2011] [Revised: 08/15/2011] [Accepted: 08/16/2011] [Indexed: 12/27/2022]
|
55
|
Testing chromosomal phylogenies and inversion breakpoint reuse in Drosophila. The martensis cluster revisited. Chromosome Res 2011; 19:251-65. [PMID: 21394512 DOI: 10.1007/s10577-011-9195-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2010] [Revised: 02/09/2011] [Accepted: 02/09/2011] [Indexed: 01/06/2023]
Abstract
The chromosomal relationships of the four martensis cluster species are among the most complex and intricate within the entire Drosophila repleta group, due to the so-called sharing of inversions. Here, we have revised these relationships using comparative mapping of bacterial artificial chromosome (BAC) clones on the salivary gland chromosomes. A physical map of chromosome 2 of Drosophila uniseta (one of the cluster members) was generated by in situ hybridization of 82 BAC clones from the physical map of the Drosophila buzzatii genome (an outgroup that represents the ancestral arrangement). By comparing the marker positions, we determined the number, order, and orientation of conserved chromosomal segments between chromosome 2 of D. buzzatii and D. uniseta. GRIMM software was used to infer that a minimum of five chromosomal inversions are necessary to transform the chromosome 2 of D. buzzatii into that of D. uniseta. Two of these inversions have been overlooked in previous cytological analyses. The five fixed inversions entail two breakpoint reuses because only nine syntenic segments and eight interruptions were observed. We tested for the presence of the five inversions fixed in D. uniseta in the other three species of the martensis cluster by in situ hybridization of eight breakpoint-bearing BAC clones. The results shed light on the chromosomal phylogeny of the martensis cluster, yet leave a number of questions open.
Collapse
|
56
|
Alekseyev MA, Pevzner PA. Comparative genomics reveals birth and death of fragile regions in mammalian evolution. Genome Biol 2010; 11:R117. [PMID: 21118492 PMCID: PMC3156956 DOI: 10.1186/gb-2010-11-11-r117] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2010] [Revised: 10/05/2010] [Accepted: 11/30/2010] [Indexed: 12/15/2022] Open
Abstract
Background An important question in genome evolution is whether there exist fragile regions (rearrangement hotspots) where chromosomal rearrangements are happening over and over again. Although nearly all recent studies supported the existence of fragile regions in mammalian genomes, the most comprehensive phylogenomic study of mammals raised some doubts about their existence. Results Here we demonstrate that fragile regions are subject to a birth and death process, implying that fragility has a limited evolutionary lifespan. Conclusions This finding implies that fragile regions migrate to different locations in different mammals, explaining why there exist only a few chromosomal breakpoints shared between different lineages. The birth and death of fragile regions as a phenomenon reinforces the hypothesis that rearrangements are promoted by matching segmental duplications and suggests putative locations of the currently active fragile regions in the human genome.
Collapse
Affiliation(s)
- Max A Alekseyev
- Department of Computer Science & Engineering, University of South Carolina, 301 Main St, Columbia, SC 29208, USA.
| | | |
Collapse
|
57
|
Brinkmeyer-Langford CL, Murphy WJ, Childers CP, Skow LC. A conserved segmental duplication within ELA. Anim Genet 2010; 41 Suppl 2:186-95. [DOI: 10.1111/j.1365-2052.2010.02137.x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
58
|
Trifonov VA, Dementyeva PV, Beklemisheva VR, Yudkin DV, Vorobieva NV, Graphodatsky AS. Supernumerary chromosomes, segmental duplications, and evolution. RUSS J GENET+ 2010. [DOI: 10.1134/s1022795410090206] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
59
|
Appels R, Barrerro R, Keeble G, Bellgard M. Advances in genome studies: The PAG 2010 conference. Funct Integr Genomics 2010; 10:1-9. [PMID: 20182762 DOI: 10.1007/s10142-010-0164-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
An overview is provided of the advances in plant, animal and human genome studies by summarizing the contents of seven plenary lectures presented at the Plant and Animal Genome (PAG) meeting in January 2010. The area of biology covered was wide and reflected the nature of this fast moving science.
Collapse
Affiliation(s)
- R Appels
- Centre for Comparative genomics, Murdoch University, Perth, WA 6510, USA.
| | | | | | | |
Collapse
|
60
|
Genome landscape and evolutionary plasticity of chromosomes in malaria mosquitoes. PLoS One 2010; 5:e10592. [PMID: 20485676 PMCID: PMC2868863 DOI: 10.1371/journal.pone.0010592] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2010] [Accepted: 04/14/2010] [Indexed: 12/19/2022] Open
Abstract
BACKGROUND Nonrandom distribution of rearrangements is a common feature of eukaryotic chromosomes that is not well understood in terms of genome organization and evolution. In the major African malaria vector Anopheles gambiae, polymorphic inversions are highly nonuniformly distributed among five chromosomal arms and are associated with epidemiologically important adaptations. However, it is not clear whether the genomic content of the chromosomal arms is associated with inversion polymorphism and fixation rates. METHODOLOGY/PRINCIPAL FINDINGS To better understand the evolutionary dynamics of chromosomal inversions, we created a physical map for an Asian malaria mosquito, Anopheles stephensi, and compared it with the genome of An. gambiae. We also developed and deployed novel Bayesian statistical models to analyze genome landscapes in individual chromosomal arms An. gambiae. Here, we demonstrate that, despite the paucity of inversion polymorphisms on the X chromosome, this chromosome has the fastest rate of inversion fixation and the highest density of transposable elements, simple DNA repeats, and GC content. The highly polymorphic and rapidly evolving autosomal 2R arm had overrepresentation of genes involved in cellular response to stress supporting the role of natural selection in maintaining adaptive polymorphic inversions. In addition, the 2R arm had the highest density of regions involved in segmental duplications that clustered in the breakpoint-rich zone of the arm. In contrast, the slower evolving 2L, 3R, and 3L, arms were enriched with matrix-attachment regions that potentially contribute to chromosome stability in the cell nucleus. CONCLUSIONS/SIGNIFICANCE These results highlight fundamental differences in evolutionary dynamics of the sex chromosome and autosomes and revealed the strong association between characteristics of the genome landscape and rates of chromosomal evolution. We conclude that a unique combination of various classes of genes and repetitive DNA in each arm, rather than a single type of repetitive element, is likely responsible for arm-specific rates of rearrangements.
Collapse
|
61
|
Larkin DM. Role of chromosomal rearrangements and conserved chromosome regions in amniote evolution. MOLECULAR GENETICS MICROBIOLOGY AND VIROLOGY 2010. [DOI: 10.3103/s0891416810010015] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
62
|
Quinlan AR, Clark RA, Sokolova S, Leibowitz ML, Zhang Y, Hurles ME, Mell JC, Hall IM. Genome-wide mapping and assembly of structural variant breakpoints in the mouse genome. Genome Res 2010; 20:623-35. [PMID: 20308636 DOI: 10.1101/gr.102970.109] [Citation(s) in RCA: 200] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Structural variation (SV) is a rich source of genetic diversity in mammals, but due to the challenges associated with mapping SV in complex genomes, basic questions regarding their genomic distribution and mechanistic origins remain unanswered. We have developed an algorithm (HYDRA) to localize SV breakpoints by paired-end mapping, and a general approach for the genome-wide assembly and interpretation of breakpoint sequences. We applied these methods to two inbred mouse strains: C57BL/6J and DBA/2J. We demonstrate that HYDRA accurately maps diverse classes of SV, including those involving repetitive elements such as transposons and segmental duplications; however, our analysis of the C57BL/6J reference strain shows that incomplete reference genome assemblies are a major source of noise. We report 7196 SVs between the two strains, more than two-thirds of which are due to transposon insertions. Of the remainder, 59% are deletions (relative to the reference), 26% are insertions of unlinked DNA, 9% are tandem duplications, and 6% are inversions. To investigate the origins of SV, we characterized 3316 breakpoint sequences at single-nucleotide resolution. We find that approximately 16% of non-transposon SVs have complex breakpoint patterns consistent with template switching during DNA replication or repair, and that this process appears to preferentially generate certain classes of complex variants. Moreover, we find that SVs are significantly enriched in regions of segmental duplication, but that this effect is largely independent of DNA sequence homology and thus cannot be explained by non-allelic homologous recombination (NAHR) alone. This result suggests that the genetic instability of such regions is often the cause rather than the consequence of duplicated genomic architecture.
Collapse
Affiliation(s)
- Aaron R Quinlan
- Department of Biochemistry and Molecular Genetics, University of Virginia School of Medicine, Charlottesville, Virginia 22908, USA
| | | | | | | | | | | | | | | |
Collapse
|
63
|
Abstract
BACKGROUND Over the past decades, extensive comparative mapping research has been performed in the plant family Solanaceae. The recent identification of a large set of single-copy conserved orthologous (COSII) markers has greatly accelerated comparative mapping studies among major solanaceous species including tomato, potato, eggplant, pepper and diploid Nicotiana species (as well as tetraploid tobacco). The large amount of comparative data now available for these species provides the opportunity to describe the overall patterns of chromosomal evolution in this important plant family. The results of this investigation are described herein. RESULTS We combined data from multiple COSII studies, and other comparative mapping studies performed in tomato, potato, eggplant, pepper and diploid Nicotiana species, to deduce the features and outcomes of chromosomal evolution in the Solanaceae over the past 30 million years. This includes estimating the rates and timing of chromosomal changes (inversions and translocations) as well as deducing the age of ancestral progenitor species and predicting their genome configurations. CONCLUSIONS The Solanaceae has experienced chromosomal changes at a modest rate compared with other families and the rates are likely conserved across different lineages of the family. Chromosomal inversions occur at a consistently higher rate than do translocations. Further, we find evidences for non-random positioning of the chromosomal rearrangement breakpoints. This finding is consistent with the similar finding in mammals, where hot spots for chromosomal breakages have apparently played a significant role in shaping genome evolution. Finally, by utilizing multiple genome comparisons we were able to reconstruct the most likely genome configuration for a number of now-extinct progenitor species that gave rise to the extant solanaceous species used in this research. The results from this study provide the first broad overview of chromosomal evolution in the family Solanaceae, and one of the most detailed thus far for any family of plants.
Collapse
Affiliation(s)
- Feinan Wu
- Department of Plant Breeding and Genetics, Cornell University, Ithaca, NY 14853, USA
| | | |
Collapse
|
64
|
Wu F, Tanksley SD. Chromosomal evolution in the plant family Solanaceae. BMC Genomics 2010; 11:182. [PMID: 20236516 PMCID: PMC2847972 DOI: 10.1186/1471-2164-11-182] [Citation(s) in RCA: 88] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2009] [Accepted: 03/17/2010] [Indexed: 11/12/2022] Open
Abstract
Background Over the past decades, extensive comparative mapping research has been performed in the plant family Solanaceae. The recent identification of a large set of single-copy conserved orthologous (COSII) markers has greatly accelerated comparative mapping studies among major solanaceous species including tomato, potato, eggplant, pepper and diploid Nicotiana species (as well as tetraploid tobacco). The large amount of comparative data now available for these species provides the opportunity to describe the overall patterns of chromosomal evolution in this important plant family. The results of this investigation are described herein. Results We combined data from multiple COSII studies, and other comparative mapping studies performed in tomato, potato, eggplant, pepper and diploid Nicotiana species, to deduce the features and outcomes of chromosomal evolution in the Solanaceae over the past 30 million years. This includes estimating the rates and timing of chromosomal changes (inversions and translocations) as well as deducing the age of ancestral progenitor species and predicting their genome configurations. Conclusions The Solanaceae has experienced chromosomal changes at a modest rate compared with other families and the rates are likely conserved across different lineages of the family. Chromosomal inversions occur at a consistently higher rate than do translocations. Further, we find evidences for non-random positioning of the chromosomal rearrangement breakpoints. This finding is consistent with the similar finding in mammals, where hot spots for chromosomal breakages have apparently played a significant role in shaping genome evolution. Finally, by utilizing multiple genome comparisons we were able to reconstruct the most likely genome configuration for a number of now-extinct progenitor species that gave rise to the extant solanaceous species used in this research. The results from this study provide the first broad overview of chromosomal evolution in the family Solanaceae, and one of the most detailed thus far for any family of plants.
Collapse
Affiliation(s)
- Feinan Wu
- Department of Plant Breeding and Genetics, Cornell University, Ithaca, NY 14853, USA
| | | |
Collapse
|
65
|
Tang J, Le S, Sun L, Yan X, Zhang M, Macleod J, Leroy B, Northrup N, Ellis A, Yeatman TJ, Liang Y, Zwick ME, Zhao S. Copy number abnormalities in sporadic canine colorectal cancers. Genome Res 2010; 20:341-50. [PMID: 20086242 DOI: 10.1101/gr.092726.109] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Human colorectal cancer (CRC) is one of the better-understood systems for studying the genetics of cancer initiation and progression. To develop a cross-species comparison strategy for identifying CRC causative gene or genomic alterations, we performed array comparative genomic hybridization (aCGH) to investigate copy number abnormalities (CNAs), one of the most prominent lesion types reported for human CRCs, in 10 spontaneously occurring canine CRCs. The results revealed for the first time a strong degree of genetic homology between sporadic canine and human CRCs. First, we saw that between 5% and 22% of the canine genome was amplified/deleted in these tumors, and that, reminiscent of human CRCs, the total altered sequences directly correlated to the tumor's progression stage, origin, and likely microsatellite instability status. Second, when mapping the identified CNAs onto syntenic regions of the human genome, we noted that the canine orthologs of genes participating in known human CRC pathways were recurrently disrupted, indicating that these pathways might be altered in the canine CRCs as well. Last, we observed a significant overlapping of CNAs between human and canine tumors, and tumors from the two species were clustered according to the tumor subtypes but not the species. Significantly, compared with the shared CNAs, we found that species-specific (especially human-specific) CNAs localize to evolutionarily unstable regions that harbor more segmental duplications and interspecies genomic rearrangement breakpoints. These findings indicate that CNAs recurrent between human and dog CRCs may have a higher probability of being cancer-causative, compared with CNAs found in one species only.
Collapse
Affiliation(s)
- Jie Tang
- Department of Biochemistry and Molecular Biology, Institute of Bioinformatics, University of Georgia, Athens, Georgia 30602, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
66
|
Liu GE, Ventura M, Cellamare A, Chen L, Cheng Z, Zhu B, Li C, Song J, Eichler EE. Analysis of recent segmental duplications in the bovine genome. BMC Genomics 2009; 10:571. [PMID: 19951423 PMCID: PMC2796684 DOI: 10.1186/1471-2164-10-571] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2009] [Accepted: 12/01/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Duplicated sequences are an important source of gene innovation and structural variation within mammalian genomes. We performed the first systematic and genome-wide analysis of segmental duplications in the modern domesticated cattle (Bos taurus). Using two distinct computational analyses, we estimated that 3.1% (94.4 Mb) of the bovine genome consists of recently duplicated sequences (>or= 1 kb in length, >or= 90% sequence identity). Similar to other mammalian draft assemblies, almost half (47% of 94.4 Mb) of these sequences have not been assigned to cattle chromosomes. RESULTS In this study, we provide the first experimental validation large duplications and briefly compared their distribution on two independent bovine genome assemblies using fluorescent in situ hybridization (FISH). Our analyses suggest that the (75-90%) of segmental duplications are organized into local tandem duplication clusters. Along with rodents and carnivores, these results now confidently establish tandem duplications as the most likely mammalian archetypical organization, in contrast to humans and great ape species which show a preponderance of interspersed duplications. A cross-species survey of duplicated genes and gene families indicated that duplication, positive selection and gene conversion have shaped primates, rodents, carnivores and ruminants to different degrees for their speciation and adaptation. We identified that bovine segmental duplications corresponding to genes are significantly enriched for specific biological functions such as immunity, digestion, lactation and reproduction. CONCLUSION Our results suggest that in most mammalian lineages segmental duplications are organized in a tandem configuration. Segmental duplications remain problematic for genome and assembly and we highlight genic regions that require higher quality sequence characterization. This study provides insights into mammalian genome evolution and generates a valuable resource for cattle genomics research.
Collapse
Affiliation(s)
- George E Liu
- USDA, ARS, ANRI, Bovine Functional Genomics Laboratory, Beltsville, Maryland 20705, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
67
|
Delprat A, Negre B, Puig M, Ruiz A. The transposon Galileo generates natural chromosomal inversions in Drosophila by ectopic recombination. PLoS One 2009; 4:e7883. [PMID: 19936241 PMCID: PMC2775673 DOI: 10.1371/journal.pone.0007883] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2009] [Accepted: 10/01/2009] [Indexed: 11/25/2022] Open
Abstract
Background Transposable elements (TEs) are responsible for the generation of chromosomal inversions in several groups of organisms. However, in Drosophila and other Dipterans, where inversions are abundant both as intraspecific polymorphisms and interspecific fixed differences, the evidence for a role of TEs is scarce. Previous work revealed that the transposon Galileo was involved in the generation of two polymorphic inversions of Drosophila buzzatii. Methodology/Principal Findings To assess the impact of TEs in Drosophila chromosomal evolution and shed light on the mechanism involved, we isolated and sequenced the two breakpoints of another widespread polymorphic inversion from D. buzzatii, 2z3. In the non inverted chromosome, the 2z3 distal breakpoint was located between genes CG2046 and CG10326 whereas the proximal breakpoint lies between two novel genes that we have named Dlh and Mdp. In the inverted chromosome, the analysis of the breakpoint sequences revealed relatively large insertions (2,870-bp and 4,786-bp long) including two copies of the transposon Galileo (subfamily Newton), one at each breakpoint, plus several other TEs. The two Galileo copies: (i) are inserted in opposite orientation; (ii) present exchanged target site duplications; and (iii) are both chimeric. Conclusions/Significance Our observations provide the best evidence gathered so far for the role of TEs in the generation of Drosophila inversions. In addition, they show unequivocally that ectopic recombination is the causative mechanism. The fact that the three polymorphic D. buzzatii inversions investigated so far were generated by the same transposon family is remarkable and is conceivably due to Galileo's unusual structure and current (or recent) transpositional activity.
Collapse
Affiliation(s)
- Alejandra Delprat
- Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Bellaterra (Barcelona), Spain
| | | | | | | |
Collapse
|
68
|
Marques-Bonet T, Ryder OA, Eichler EE. Sequencing primate genomes: what have we learned? Annu Rev Genomics Hum Genet 2009; 10:355-86. [PMID: 19630567 DOI: 10.1146/annurev.genom.9.081307.164420] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
We summarize the progress in whole-genome sequencing and analyses of primate genomes. These emerging genome datasets have broadened our understanding of primate genome evolution revealing unexpected and complex patterns of evolutionary change. This includes the characterization of genome structural variation, episodic changes in the repeat landscape, differences in gene expression, new models regarding speciation, and the ephemeral nature of the recombination landscape. The functional characterization of genomic differences important in primate speciation and adaptation remains a significant challenge. Limited access to biological materials, the lack of detailed phenotypic data and the endangered status of many critical primate species have significantly attenuated research into the genetic basis of primate evolution. Next-generation sequencing technologies promise to greatly expand the number of available primate genome sequences; however, such draft genome sequences will likely miss critical genetic differences within complex genomic regions unless dedicated efforts are put forward to understand the full spectrum of genetic variation.
Collapse
Affiliation(s)
- Tomas Marques-Bonet
- Department of Genome Sciences, University of Washington and the Howard Hughes Medical Institute, Seattle, Washington 98105, USA.
| | | | | |
Collapse
|
69
|
Marques-Bonet T, Girirajan S, Eichler EE. The origins and impact of primate segmental duplications. Trends Genet 2009; 25:443-54. [PMID: 19796838 PMCID: PMC2847396 DOI: 10.1016/j.tig.2009.08.002] [Citation(s) in RCA: 120] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2009] [Revised: 08/07/2009] [Accepted: 08/10/2009] [Indexed: 12/25/2022]
Abstract
Duplicated sequences are substrates for the emergence of new genes and are an important source of genetic instability associated with rare and common diseases. Analyses of primate genomes have shown an increase in the proportion of interspersed segmental duplications (SDs) within the genomes of humans and great apes. This contrasts with other mammalian genomes that seem to have their recently duplicated sequences organized in a tandem configuration. In this review, we focus on the mechanistic origin and impact of this difference with respect to evolution, genetic diversity and primate phenotype. Although many genomes will be sequenced in the future, resolution of this aspect of genomic architecture still requires high quality sequences and detailed analyses.
Collapse
Affiliation(s)
- Tomas Marques-Bonet
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | | | | |
Collapse
|
70
|
Abstract
The analysis of genome rearrangements provides a global view on the evolution of a set of related species. We present a new algorithm called EMRAE (efficient method to recover ancestral events) to reliably predict a wide-range of rearrangement events in the ancestry of a group of species. Using simulated data sets, we show that EMRAE achieves comparable sensitivity but significantly higher specificity when predicting evolutionary events relative to other tools to study genome rearrangements. We apply our approach to the synteny blocks of six mammalian genomes (human, chimpanzee, rhesus macaque, mouse, rat, and dog) and predict 1109 rearrangement events, including 831 inversions, 15 translocations, 237 transpositions, and 26 fusions/fissions. Studying the sequence features at the breakpoints of the primate rearrangement events, we demonstrate that they are not only enriched in segmental duplications (SDs), but that the enrichment of matching pairs of SDs is even stronger within the pairs of breakpoints associated with recovered events. We also show that pairs of L1 repeats are frequently associated with ancestral inversions across all studied lineages. Together, this substantiates the model that regions of high sequence identity have been associated with rearrangement events throughout the mammalian phylogeny.
Collapse
Affiliation(s)
- Hao Zhao
- Computational and Mathematical Biology, Genome Institute of Singapore, Singapore
| | | |
Collapse
|
71
|
Wildman DE, Jameson NM, Opazo JC, Yi SV. A fully resolved genus level phylogeny of neotropical primates (Platyrrhini). Mol Phylogenet Evol 2009; 53:694-702. [PMID: 19632342 DOI: 10.1016/j.ympev.2009.07.019] [Citation(s) in RCA: 90] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2009] [Revised: 07/16/2009] [Accepted: 07/20/2009] [Indexed: 12/01/2022]
Abstract
There are more than 125 species of extant New World monkeys (Primates: Platyrrhini) found in approximately 15 genera. The phylogenetic relationships of these neotropical primates have been extensively studied from a molecular perspective. While these studies have been successful at inferring many of the relationships within the platyrrhines, key questions remain. The current study provides a framework for using non-genic, non-coding markers in comparative primate phylogenomic studies in species whose genomes are not yet scheduled for complete sequencing. A random genomic shotgun library was generated from the nocturnal Owl monkey Aotus lemurinus. Eleven unlinked, non-coding, non-genic, non-repetitive, nuclear DNA markers derived from this library were sequenced in at least one representative species of every platyrrhine genus. The combined sequence from these markers yielded a 7.7 kb multiple sequence alignment of 22 taxa. We analyzed these markers independently and combined with a 10 kb dataset consisting of "traditional," previously published markers located within or directly adjacent to genes. Parsimony, maximum likelihood, and Bayesian analysis converged on a single topology for the platyrrhine generic relationships. Notably, we confidently inferred that Pitheciidae is the sister taxon to the other two platyrrhine families (Cebidae, Atelidae). This relationship is supported by high values of branch support as well as topology tests. Additionally, Aotus formed a sister taxon to a clade comprising Cebus and Saimiri. With a fully resolved platyrrhine phylogeny in place it is now possible to design and test hypotheses regarding the evolution and diversification of platyrrhine phenotypes and life histories.
Collapse
Affiliation(s)
- Derek E Wildman
- Center for Molecular Medicine and Genetics, Wayne State University, School of Medicine, 540 E. Canfield Ave., Detroit, MI 48201, USA.
| | | | | | | |
Collapse
|
72
|
Lemaitre C, Zaghloul L, Sagot MF, Gautier C, Arneodo A, Tannier E, Audit B. Analysis of fine-scale mammalian evolutionary breakpoints provides new insight into their relation to genome organisation. BMC Genomics 2009; 10:335. [PMID: 19630943 PMCID: PMC2722678 DOI: 10.1186/1471-2164-10-335] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2009] [Accepted: 07/24/2009] [Indexed: 11/21/2022] Open
Abstract
Background The Intergenic Breakage Model, which is the current model of structural genome evolution, considers that evolutionary rearrangement breakages happen with a uniform propensity along the genome but are selected against in genes, their regulatory regions and in-between. However, a growing body of evidence shows that there exists regions along mammalian genomes that present a high susceptibility to breakage. We reconsidered this question taking advantage of a recently published methodology for the precise detection of rearrangement breakpoints based on pairwise genome comparisons. Results We applied this methodology between the genome of human and those of five sequenced eutherian mammals which allowed us to delineate evolutionary breakpoint regions along the human genome with a finer resolution (median size 26.6 kb) than obtained before. We investigated the distribution of these breakpoints with respect to genome organisation into domains of different activity. In agreement with the Intergenic Breakage Model, we observed that breakpoints are under-represented in genes. Surprisingly however, the density of breakpoints in small intergenes (1 per Mb) appears significantly higher than in gene deserts (0.1 per Mb). More generally, we found a heterogeneous distribution of breakpoints that follows the organisation of the genome into isochores (breakpoints are more frequent in GC-rich regions). We then discuss the hypothesis that regions with an enhanced susceptibility to breakage correspond to regions of high transcriptional activity and replication initiation. Conclusion We propose a model to describe the heterogeneous distribution of evolutionary breakpoints along human chromosomes that combines natural selection and a mutational bias linked to local open chromatin state.
Collapse
Affiliation(s)
- Claire Lemaitre
- Université de Bordeaux, Centre de Bioinformatique - Génomique Fonctionnelle Bordeaux, F-33000 Bordeaux, France.
| | | | | | | | | | | | | |
Collapse
|
73
|
Lewin HA, Larkin DM, Pontius J, O'Brien SJ. Every genome sequence needs a good map. Genome Res 2009; 19:1925-8. [PMID: 19596977 DOI: 10.1101/gr.094557.109] [Citation(s) in RCA: 115] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Affiliation(s)
- Harris A Lewin
- Department of Animal Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA.
| | | | | | | |
Collapse
|
74
|
Carbone L, Harris RA, Vessere GM, Mootnick AR, Humphray S, Rogers J, Kim SK, Wall JD, Martin D, Jurka J, Milosavljevic A, de Jong PJ. Evolutionary breakpoints in the gibbon suggest association between cytosine methylation and karyotype evolution. PLoS Genet 2009; 5:e1000538. [PMID: 19557196 PMCID: PMC2695003 DOI: 10.1371/journal.pgen.1000538] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2009] [Accepted: 05/26/2009] [Indexed: 01/30/2023] Open
Abstract
Gibbon species have accumulated an unusually high number of chromosomal changes since diverging from the common hominoid ancestor 15-18 million years ago. The cause of this increased rate of chromosomal rearrangements is not known, nor is it known if genome architecture has a role. To address this question, we analyzed sequences spanning 57 breaks of synteny between northern white-cheeked gibbons (Nomascus l. leucogenys) and humans. We find that the breakpoint regions are enriched in segmental duplications and repeats, with Alu elements being the most abundant. Alus located near the gibbon breakpoints (<150 bp) have a higher CpG content than other Alus. Bisulphite allelic sequencing reveals that these gibbon Alus have a lower average density of methylated cytosine that their human orthologues. The finding of higher CpG content and lower average CpG methylation suggests that the gibbon Alu elements are epigenetically distinct from their human orthologues. The association between undermethylation and chromosomal rearrangement in gibbons suggests a correlation between epigenetic state and structural genome variation in evolution.
Collapse
Affiliation(s)
- Lucia Carbone
- Children's Hospital and Research Center Oakland, Oakland, California, United States of America.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
75
|
Adamovic T, McAllister D, Guryev V, Wang X, Andrae JW, Cuppen E, Jacob HJ, Sugg SL. Microalterations of inherently unstable genomic regions in rat mammary carcinomas as revealed by long oligonucleotide array-based comparative genomic hybridization. Cancer Res 2009; 69:5159-67. [PMID: 19509235 DOI: 10.1158/0008-5472.can-08-4038] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The presence of copy number variants in normal genomes poses a challenge to identify small genuine somatic copy number changes in high-resolution cancer genome profiling studies due to the use of unpaired reference DNA. Another problem is the well-known rearrangements of immunoglobulin and T-cell receptor genes in lymphocytes (a commonly used reference), which may misdirect the researcher to a locus with no relevance in tumorigenesis. We here show real gains of the IgG heavy chain V gene region in carcinogen-induced rat mammary tumor samples after normalization to paired mammary gland, a tissue without lymphocyte infiltration. We further show that the segmental duplication region encompassing the IgG heavy chain V genes is a copy number variant between the susceptible (SS) and the resistant (BN) to mammary tumor development inbred rat strains. Our data suggest that the already inherently unstable genomic region is a convenient target for additional structural rearrangements (gains) at the somatic level when exposed to a carcinogen (7,12-dimethylbenz[a]anthracene), which subsequently seem to benefit tumor development in the mammary gland of the susceptible strain. Thus, the selection of an appropriate reference DNA enabled us to identify immunoglobulin genes as novel cancer targets playing a role in mammary tumor development. We conclude that control DNA in array-based comparative genomic hybridization experiments should be selected with care, and DNA from pooled spleen (contains immature lymphocytes and is used as reference in animal studies) or blood may not be the ideal control in the study of primary tumors.
Collapse
Affiliation(s)
- Tatjana Adamovic
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, Wisconsin, USA.
| | | | | | | | | | | | | | | |
Collapse
|
76
|
Alekseyev MA, Pevzner PA. Breakpoint graphs and ancestral genome reconstructions. Genes Dev 2009; 19:943-57. [PMID: 19218533 PMCID: PMC2675983 DOI: 10.1101/gr.082784.108] [Citation(s) in RCA: 87] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2008] [Accepted: 01/22/2009] [Indexed: 11/24/2022]
Abstract
Recently completed whole-genome sequencing projects marked the transition from gene-based phylogenetic studies to phylogenomics analysis of entire genomes. We developed an algorithm MGRA for reconstructing ancestral genomes and used it to study the rearrangement history of seven mammalian genomes: human, chimpanzee, macaque, mouse, rat, dog, and opossum. MGRA relies on the notion of the multiple breakpoint graphs to overcome some limitations of the existing approaches to ancestral genome reconstructions. MGRA also generates the rearrangement-based characters guiding the phylogenetic tree reconstruction when the phylogeny is unknown.
Collapse
Affiliation(s)
- Max A. Alekseyev
- Department of Computer Science and Engineering, University of California at San Diego, La Jolla, California 92093-0404, USA
| | - Pavel A. Pevzner
- Department of Computer Science and Engineering, University of California at San Diego, La Jolla, California 92093-0404, USA
| |
Collapse
|
77
|
Kulemzina AI, Trifonov VA, Perelman PL, Rubtsova NV, Volobuev V, Ferguson-Smith MA, Stanyon R, Yang F, Graphodatsky AS. Cross-species chromosome painting in Cetartiodactyla: reconstructing the karyotype evolution in key phylogenetic lineages. Chromosome Res 2009; 17:419-36. [PMID: 19350402 DOI: 10.1007/s10577-009-9032-3] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2008] [Revised: 01/28/2009] [Accepted: 01/28/2009] [Indexed: 01/21/2023]
Abstract
Recent molecular and morphological studies place Artiodactyla and Cetacea into the order Cetartiodactyla. Within the Cetartiodactyla such families as Bovidae, Cervidae, and Suidae are well studied by comparative chromosome painting, but many taxa that are crucial for understanding cetartiodactyl phylogeny remain poorly studied. Here we present the genome-wide comparative maps of five cetartiodactyl species obtained by chromosome painting with human and dromedary paint probes from four taxa: Cetacea, Hippopotamidae, Giraffidae, and Moschidae. This is the first molecular cytogenetic report on pilot whale, hippopotamus, okapi, and Siberian musk deer. Our results, when integrated with previously published comparative chromosome maps allow us to reconstruct the evolutionary pathway and rates of chromosomal rearrangements in Cetartiodactyla. We hypothesize that the putative cetartiodactyl ancestral karyotype (CAK) contained 25-26 pairs of autosomes, 2n = 52-54, and that the association of human chromosomes 8/9 could be a cytogenetic signature that unites non-camelid cetartiodactyls. There are no unambiguous cytogenetic landmarks that unite Hippopotamidae and Cetacea. If we superimpose chromosome rearrangements on the supertree generated by Price and colleagues, several homoplasy events are needed to explain cetartiodactyl karyotype evolution. Our results apparently favour a model of non-random breakpoints in chromosome evolution. Cetariodactyl karyotype evolution is characterized by alternating periods of low and fast rates in various lineages. The highest rates are found in Suina (Suidae+Tayasuidae) lineage (1.76 rearrangements per million years (R/My)) and the lowest in Cetaceans (0.07 R/My). Our study demonstrates that the combined use of human and camel paints is highly informative for revealing evolutionary karyotypic rearrangements among cetartiodactyl species.
Collapse
Affiliation(s)
- Anastasia I Kulemzina
- Institute of Cytology and Genetics, Russian Academy of Sciences, Novosibirsk, 630090, Russia
| | | | | | | | | | | | | | | | | |
Collapse
|
78
|
Larkin DM, Pape G, Donthu R, Auvil L, Welge M, Lewin HA. Breakpoint regions and homologous synteny blocks in chromosomes have different evolutionary histories. Genome Res 2009; 19:770-7. [PMID: 19342477 DOI: 10.1101/gr.086546.108] [Citation(s) in RCA: 135] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
The persistence of large blocks of homologous synteny and a high frequency of breakpoint reuse are distinctive features of mammalian chromosomes that are not well understood in evolutionary terms. To gain a better understanding of the evolutionary forces that affect genome architecture, synteny relationships among 10 amniotes (human, chimp, macaque, rat, mouse, pig, cattle, dog, opossum, and chicken) were compared at <1 human-Mbp resolution. Homologous synteny blocks (HSBs; N = 2233) and chromosome evolutionary breakpoint regions (EBRs; N = 1064) were identified from pairwise comparisons of all genomes. Analysis of the size distribution of HSBs shared in all 10 species' chromosomes (msHSBs) identified three (>20 Mbp) that are larger than expected by chance. Gene network analysis of msHSBs >3 human-Mbp and EBRs <1 Mbp demonstrated that msHSBs are significantly enriched for genes involved in development of the central nervous and other organ systems, whereas EBRs are enriched for genes associated with adaptive functions. In addition, we found EBRs are significantly enriched for structural variations (segmental duplications, copy number variants, and indels), retrotransposed and zinc finger genes, and single nucleotide polymorphisms. These results demonstrate that chromosome breakage in evolution is nonrandom and that HSBs and EBRs are evolving in distinctly different ways. We suggest that natural selection acts on the genome to maintain combinations of genes and their regulatory elements that are essential to fundamental processes of amniote development and biological organization. Furthermore, EBRs may be used extensively to generate new genetic variation and novel combinations of genes and regulatory elements that contribute to adaptive phenotypes.
Collapse
Affiliation(s)
- Denis M Larkin
- Department of Animal Sciences, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | | | | | | | | | | |
Collapse
|
79
|
Koszul R, Fischer G. A prominent role for segmental duplications in modeling Eukaryotic genomes. C R Biol 2009; 332:254-66. [DOI: 10.1016/j.crvi.2008.07.005] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2008] [Accepted: 07/12/2008] [Indexed: 01/22/2023]
|
80
|
Peng Q, Alekseyev MA, Tesler G, Pevzner PA. Decoding Synteny Blocks and Large-Scale Duplications in Mammalian and Plant Genomes. LECTURE NOTES IN COMPUTER SCIENCE 2009. [DOI: 10.1007/978-3-642-04241-6_19] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
|
81
|
Rocchi M, Stanyon R, Archidiacono N. Evolutionary new centromeres in primates. PROGRESS IN MOLECULAR AND SUBCELLULAR BIOLOGY 2009; 48:103-52. [PMID: 19521814 DOI: 10.1007/978-3-642-00182-6_5] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
The centromere has a pivotal role in structuring chromosomal architecture, but remains a poorly understood and seemingly paradoxical "black hole." Centromeres are a very rapidly evolving segment of the genome and it is now known that centromere shifts in evolution are not rare and must be considered on a par with other chromosome rearrangements. Recently, unprecedented findings on neocentromeres and evolutionary new centromeres (ENC) have helped clarify the relationship of the centromere within the genome and shown that these two phenomena are two faces of the same coin. No prominent sequence features are known that promote centromere formation and both types of new centromeres are formed epigenetically, both clinical neocentromeres and ENC cluster at chromosomal "hotspots." The clustering of neocentromeres in 8p is probably the result of the relatively high frequency of noncanonical pairing. Studies on the evolution of the chromosomes 3, 13, and 15 help explain why there are clusters of neocentromeres. These domains often correspond to ancestral inactivated centromeres and some regions can preserve features that trigger neocentromere emergence over tens of millions of years. Neocentromeres may be correlated with the distribution of segmental duplications (SDs) in regions of extreme plasticity that often can be characterized as gene deserts. Further, because centromeres and associated pericentric regions are dynamically complex, centromere shifts may turbocharge genome reorganization by influencing the distribution of heterochromatin. The "reuse" of regions as centromere seeding-points in evolution and in human clinical cases further extends the concept of "reuse" of specific domains for "chromosomal events."
Collapse
Affiliation(s)
- Mariano Rocchi
- Dipartimento di Genetica e Microbiologia, Via Amendola, 165/A, 70126 Bari, Italy.
| | | | | |
Collapse
|
82
|
Alekseyev MA. Multi-break rearrangements and breakpoint re-uses: from circular to linear genomes. J Comput Biol 2008; 15:1117-31. [PMID: 18788907 DOI: 10.1089/cmb.2008.0080] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Multi-break rearrangements break a genome into multiple fragments and further glue them together in a new order. While 2-break rearrangements represent standard reversals, fusions, fissions, and translocations, 3-break rearrangements represent a natural generalization of transpositions. Alekseyev and Pevzner (2007a, 2008a) studied multi-break rearrangements in circular genomes and further applied them to the analysis of chromosomal evolution in mammalian genomes. In this paper, we extend these results to the more difficult case of linear genomes. In particular, we give lower bounds for the rearrangement distance between linear genomes and for the breakpoint re-use rate as functions of the number and proportion of transpositions. We further use these results to analyze comparative genomic architecture of mammalian genomes.
Collapse
Affiliation(s)
- Max A Alekseyev
- Department of Computer Science and Engineering, University of California at San Diego, La Jolla, California 92093, USA.
| |
Collapse
|
83
|
Girirajan S, Chen L, Graves T, Marques-Bonet T, Ventura M, Fronick C, Fulton L, Rocchi M, Fulton RS, Wilson RK, Mardis ER, Eichler EE. Sequencing human-gibbon breakpoints of synteny reveals mosaic new insertions at rearrangement sites. Genome Res 2008; 19:178-90. [PMID: 19029537 DOI: 10.1101/gr.086041.108] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The gibbon genome exhibits extensive karyotypic diversity with an increased rate of chromosomal rearrangements during evolution. In an effort to understand the mechanistic origin and implications of these rearrangement events, we sequenced 24 synteny breakpoint regions in the white-cheeked gibbon (Nomascus leucogenys, NLE) in the form of high-quality BAC insert sequences (4.2 Mbp). While there is a significant deficit of breakpoints in genes, we identified seven human gene structures involved in signaling pathways (DEPDC4, GNG10), phospholipid metabolism (ENPP5, PLSCR2), beta-oxidation (ECH1), cellular structure and transport (HEATR4), and transcription (ZNF461), that have been disrupted in the NLE gibbon lineage. Notably, only three of these genes show the expected evolutionary signatures of pseudogenization. Sequence analysis of the breakpoints suggested both nonclassical nonhomologous end-joining (NHEJ) and replication-based mechanisms of rearrangement. A substantial number (11/24) of human-NLE gibbon breakpoints showed new insertions of gibbon-specific repeats and mosaic structures formed from disparate sequences including segmental duplications, LINE, SINE, and LTR elements. Analysis of these sites provides a model for a replication-dependent repair mechanism for double-strand breaks (DSBs) at rearrangement sites and insights into the structure and formation of primate segmental duplications at sites of genomic rearrangements during evolution.
Collapse
Affiliation(s)
- Santhosh Girirajan
- Department of Genome Sciences, Howard Hughes Medical Institute, University of Washington School of Medicine, Seattle, Washington 98195, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
84
|
Baertsch R, Diekhans M, Kent WJ, Haussler D, Brosius J. Retrocopy contributions to the evolution of the human genome. BMC Genomics 2008; 9:466. [PMID: 18842134 PMCID: PMC2584115 DOI: 10.1186/1471-2164-9-466] [Citation(s) in RCA: 92] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2008] [Accepted: 10/08/2008] [Indexed: 02/06/2023] Open
Abstract
Background Evolution via point mutations is a relatively slow process and is unlikely to completely explain the differences between primates and other mammals. By contrast, 45% of the human genome is composed of retroposed elements, many of which were inserted in the primate lineage. A subset of retroposed mRNAs (retrocopies) shows strong evidence of expression in primates, often yielding functional retrogenes. Results To identify and analyze the relatively recently evolved retrogenes, we carried out BLASTZ alignments of all human mRNAs against the human genome and scored a set of features indicative of retroposition. Of over 12,000 putative retrocopy-derived genes that arose mainly in the primate lineage, 726 with strong evidence of transcript expression were examined in detail. These mRNA retroposition events fall into three categories: I) 34 retrocopies and antisense retrocopies that added potential protein coding space and UTRs to existing genes; II) 682 complete retrocopy duplications inserted into new loci; and III) an unexpected set of 13 retrocopies that contributed out-of-frame, or antisense sequences in combination with other types of transposed elements (SINEs, LINEs, LTRs), even unannotated sequence to form potentially novel genes with no homologs outside primates. In addition to their presence in human, several of the gene candidates also had potentially viable ORFs in chimpanzee, orangutan, and rhesus macaque, underscoring their potential of function. Conclusion mRNA-derived retrocopies provide raw material for the evolution of genes in a wide variety of ways, duplicating and amending the protein coding region of existing genes as well as generating the potential for new protein coding space, or non-protein coding RNAs, by unexpected contributions out of frame, in reverse orientation, or from previously non-protein coding sequence.
Collapse
Affiliation(s)
- Robert Baertsch
- Center for Biomolecular Science and Engineering, University of California Santa Cruz, Santa Cruz, California 95064, USA.
| | | | | | | | | |
Collapse
|
85
|
Varki A, Geschwind DH, Eichler EE. Explaining human uniqueness: genome interactions with environment, behaviour and culture. Nat Rev Genet 2008; 9:749-63. [PMID: 18802414 DOI: 10.1038/nrg2428] [Citation(s) in RCA: 107] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
What makes us human? Specialists in each discipline respond through the lens of their own expertise. In fact, 'anthropogeny' (explaining the origin of humans) requires a transdisciplinary approach that eschews such barriers. Here we take a genomic and genetic perspective towards molecular variation, explore systems analysis of gene expression and discuss an organ-systems approach. Rejecting any 'genes versus environment' dichotomy, we then consider genome interactions with environment, behaviour and culture, finally speculating that aspects of human uniqueness arose because of a primate evolutionary trend towards increasing and irreversible dependence on learned behaviours and culture - perhaps relaxing allowable thresholds for large-scale genomic diversity.
Collapse
Affiliation(s)
- Ajit Varki
- Center for Academic Research and Training in Anthropogeny, University of California, San Diego, La Jolla, California 92093, USA.
| | | | | |
Collapse
|
86
|
Marques-Bonet T, Cheng Z, She X, Eichler EE, Navarro A. The genomic distribution of intraspecific and interspecific sequence divergence of human segmental duplications relative to human/chimpanzee chromosomal rearrangements. BMC Genomics 2008; 9:384. [PMID: 18699995 PMCID: PMC2542386 DOI: 10.1186/1471-2164-9-384] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2007] [Accepted: 08/12/2008] [Indexed: 11/20/2022] Open
Abstract
BACKGROUND It has been suggested that chromosomal rearrangements harbor the molecular footprint of the biological phenomena which they induce, in the form, for instance, of changes in the sequence divergence rates of linked genes. So far, all the studies of these potential associations have focused on the relationship between structural changes and the rates of evolution of single-copy DNA and have tried to exclude segmental duplications (SDs). This is paradoxical, since SDs are one of the primary forces driving the evolution of structure and function in our genomes and have been linked not only with novel genes acquiring new functions, but also with overall higher DNA sequence divergence and major chromosomal rearrangements. RESULTS Here we take the opposite view and focus on SDs. We analyze several of the features of SDs, including the rates of intraspecific divergence between paralogous copies of human SDs and of interspecific divergence between human SDs and chimpanzee DNA. We study how divergence measures relate to chromosomal rearrangements, while considering other factors that affect evolutionary rates in single copy DNA. CONCLUSION We find that interspecific SD divergence behaves similarly to divergence of single-copy DNA. In contrast, old and recent paralogous copies of SDs do present different patterns of intraspecific divergence. Also, we show that some relatively recent SDs accumulate in regions that carry inversions in sister lineages.
Collapse
Affiliation(s)
- Tomàs Marques-Bonet
- Unitat de Biologia Evolutiva Departament de Ciències Experimentals i de la Salut, Universitat Pompeu Fabra, Barcelona, Catalonia, Spain
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
| | - Ze Cheng
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
| | - Xinwei She
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
| | - Arcadi Navarro
- Unitat de Biologia Evolutiva Departament de Ciències Experimentals i de la Salut, Universitat Pompeu Fabra, Barcelona, Catalonia, Spain
- Institucio Catalana de Recerca i Estudis Avancats (ICREA) and Universitat Pompeu Fabra, Barcelona, Catalonia, Spain
- Population Genomics Node (GNV8), National Institute for Bioinformatics (INB) Universitat Pompeu Fabra, Spain
| |
Collapse
|
87
|
Nguyen DQ, Webber C, Hehir-Kwa J, Pfundt R, Veltman J, Ponting CP. Reduced purifying selection prevails over positive selection in human copy number variant evolution. Genome Res 2008; 18:1711-23. [PMID: 18687881 DOI: 10.1101/gr.077289.108] [Citation(s) in RCA: 67] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Copy number variation is a dominant contributor to genomic variation and may frequently underlie an individual's variable susceptibilities to disease. Here we question our previous proposition that copy number variants (CNVs) are often retained in the human population because of their adaptive benefit. We show that genic biases of CNVs are best explained, not by positive selection, but by reduced efficiency of selection in eliminating deleterious changes from the human population. Of four CNV data sets examined, three exhibit significant increases in protein evolutionary rates. These increases appear to be attributable to the frequent coincidence of CNVs with segmental duplications (SDs) that recombine infrequently. Furthermore, human orthologs of mouse genes, which, when disrupted, result in pre- or postnatal lethality, are unusually depleted in CNVs. Together, these findings support a model of reduced purifying selection (Hill-Robertson interference) within copy number variable regions that are enriched in nonessential genes, allowing both the fixation of slightly deleterious substitutions and increased drift of CNV alleles. Additionally, all four CNV sets exhibited increased rates of interspecies chromosomal rearrangement and nucleotide substitution and an increased gene density. We observe that sequences with high G+C contents are most prone to copy number variation. In particular, frequently duplicated human SD sequence, or CNVs that are large and/or observed frequently, tend to be elevated in G+C content. In contrast, SD sequences that appear fixed in the human population lie more frequently within low G+C sequence. These findings provide an overarching view of how CNVs arise and segregate in the human population.
Collapse
Affiliation(s)
- Duc-Quang Nguyen
- MRC Functional Genomics Unit, University of Oxford, Department of Physiology, Anatomy and Genetics, Oxford OX1 3QX, United Kingdom
| | | | | | | | | | | |
Collapse
|
88
|
Liao BY, Zhang J. Coexpression of linked genes in Mammalian genomes is generally disadvantageous. Mol Biol Evol 2008; 25:1555-65. [PMID: 18440951 PMCID: PMC2734128 DOI: 10.1093/molbev/msn101] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/19/2008] [Indexed: 01/06/2023] Open
Abstract
Similarity in gene expression pattern between closely linked genes is known in several eukaryotes. Two models have been proposed to explain the presence of such coexpression patterns. The adaptive model assumes that coexpression is advantageous and is established by relocation of initially unlinked but coexpressed genes, whereas the neutral model asserts that coexpression is a type of leaky expression due to similar expressional environments of linked genes, but is neither advantageous nor detrimental. However, these models are incompatible with several empirical observations. Here, we propose that coexpression of linked genes is a form of transcriptional interference that is disadvantageous to the organism. We show that even distantly linked genes that are tens of megabases away exhibit significant coexpression in the human genome. However, the linkage is more likely to be broken during evolution between genes of high coexpression than those of low coexpression and the breakage of linkage reduces gene coexpression. These results support our hypothesis that coexpression of linked genes in mammalian genomes is generally disadvantageous, implying that many mammalian genes may never reach their optimal expression pattern due to the interference of their genomic environment and that such transcriptional interference may be a force promoting recurrent relocation of genes in the genome.
Collapse
Affiliation(s)
- Ben-Yang Liao
- Department of Ecology and Evolutionary Biology, University of Michigan, USA
| | | |
Collapse
|
89
|
A high-resolution radiation hybrid map of rhesus macaque chromosome 5 identifies rearrangements in the genome assembly. Genomics 2008; 92:210-8. [PMID: 18601997 DOI: 10.1016/j.ygeno.2008.05.013] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2008] [Revised: 05/16/2008] [Accepted: 05/21/2008] [Indexed: 11/21/2022]
Abstract
A 10,000-rad radiation hybrid (RH) cell panel of the rhesus macaque was generated to construct a comprehensive RH map of chromosome 5. The map represents 218 markers typed in 185 RH clones. The 4846-cR map has an average marker spacing of 798 kb. Alignments of the RH map to macaque and human genome sequences confirm a large inversion and reveal a previously unreported telomeric inversion. The macaque genome sequence indicates small translocations from the ancestral homolog of macaque chromosome 5 to macaque chromosomes 1 and 6. The RH map suggests that these are probably assembly artifacts. Unlike the genome sequence, the RH mapping data indicate the conservation of synteny between macaque chromosome 5 and human chromosome 4. This study shows that the 10,000-rad panel is appropriate for the generation of a high-resolution whole-genome RH map suitable for the verification of the rhesus genome assembly.
Collapse
|
90
|
Sequence signatures of a recent chromosomal rearrangement in Drosophila mojavensis. Genetica 2008; 136:5-11. [PMID: 18661244 DOI: 10.1007/s10709-008-9296-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2008] [Accepted: 07/12/2008] [Indexed: 12/22/2022]
Abstract
The X-chromosome inversion, Xe, distinguishes Drosophila mojavensis and D. arizonae. Earlier work mapped the breakpoints of this inversion to large intervals and provided hypotheses for the locations of the breakpoints within 3000-bp intergenic regions on the D. mojavensis genome sequence assembly. Here, we sequenced these regions directly in the putatively ancestral D. arizonae X-chromosome. We find that the two inversion breakpoints are near an inverted gene duplication and a common repetitive element, respectively, and these features were likely present in the non-inverted ancestral chromosome on the D. mojavensis lineage. Contrary to an earlier hypothesis, the inverted gene duplication appears to predate the inversion. We find no sequence similarity between the breakpoint regions in the D. mojavensis ancestor, excluding an ectopic-exchange model of chromosome rearrangements. We also found no evidence that staggered single-strand breaks caused the inversion. We suggest these features may have contributed to the chromosomal breakages resulting in this inversion.
Collapse
|
91
|
She X, Cheng Z, Zöllner S, Church DM, Eichler EE. Mouse segmental duplication and copy number variation. Nat Genet 2008; 40:909-14. [PMID: 18500340 PMCID: PMC2574762 DOI: 10.1038/ng.172] [Citation(s) in RCA: 179] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2008] [Accepted: 05/14/2008] [Indexed: 11/08/2022]
Abstract
Detailed analyses of the clone-based genome assembly reveal that the recent duplication content of mouse (4.94%) is now comparable to that of human (5.5%), in contrast to previous estimates from the whole-genome shotgun sequence assembly. However, the architecture of mouse and human genomes differs markedly: most mouse duplications are organized into discrete clusters of tandem duplications that show depletion of genes and transcripts and enrichment of long interspersed nuclear element (LINE) and long terminal repeat (LTR) retroposons. We assessed copy number variation of the C57BL/6J duplicated regions within 15 mouse strains previously used for genetic association studies, sequencing and the Mouse Phenome Project. We determined that over 60% of these base pairs are polymorphic among the strains (on average, there was 20 Mb of copy-number-variable DNA between different mouse strains). Our data suggest that different mouse strains show comparable, if not greater, copy number polymorphism when compared to human; however, such variation is more locally restricted. We show large and complex patterns of interstrain copy number variation restricted to large gene families associated with spermatogenesis, pregnancy, viviparity, pheromone signaling and immune response.
Collapse
Affiliation(s)
- Xinwei She
- Department of Genome Sciences, University of Washington, 1705 NE Pacific Street, Seattle, Washington 98195, USA
| | | | | | | | | |
Collapse
|
92
|
A gene-based genetic linkage map of the collared flycatcher (Ficedula albicollis) reveals extensive synteny and gene-order conservation during 100 million years of avian evolution. Genetics 2008; 179:1479-95. [PMID: 18562642 DOI: 10.1534/genetics.108.088195] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
By taking advantage of a recently developed reference marker set for avian genome analysis we have constructed a gene-based genetic map of the collared flycatcher, an important "ecological model" for studies of life-history evolution, sexual selection, speciation, and quantitative genetics. A pedigree of 322 birds from a natural population was genotyped for 384 single nucleotide polymorphisms (SNPs) from 170 protein-coding genes and 71 microsatellites. Altogether, 147 gene markers and 64 microsatellites form 33 linkage groups with a total genetic distance of 1787 cM. Male recombination rates are, on average, 22% higher than female rates (total distance 1982 vs. 1627 cM). The ability to anchor the collared flycatcher map with the chicken genome via the gene-based SNPs revealed an extraordinary degree of both synteny and gene-order conservation during avian evolution. The great majority of chicken chromosomes correspond to a single linkage group in collared flycatchers, with only a few cases of inter- and intrachromosomal rearrangements. The rate of chromosomal diversification, fissions/fusions, and inversions combined is thus considerably lower in birds (0.05/MY) than in mammals (0.6-2.0/MY). A dearth of repeat elements, known to promote chromosomal breakage, in avian genomes may contribute to their stability. The degree of genome stability is likely to have important consequences for general evolutionary patterns and may explain, for example, the comparatively slow rate by which genetic incompatibility among lineages of birds evolves.
Collapse
|
93
|
Lemaitre C, Tannier E, Gautier C, Sagot MF. Precise detection of rearrangement breakpoints in mammalian chromosomes. BMC Bioinformatics 2008; 9:286. [PMID: 18564416 PMCID: PMC2443379 DOI: 10.1186/1471-2105-9-286] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2008] [Accepted: 06/18/2008] [Indexed: 11/14/2022] Open
Abstract
Background Genomes undergo large structural changes that alter their organisation. The chromosomal regions affected by these rearrangements are called breakpoints, while those which have not been rearranged are called synteny blocks. We developed a method to precisely delimit rearrangement breakpoints on a genome by comparison with the genome of a related species. Contrary to current methods which search for synteny blocks and simply return what remains in the genome as breakpoints, we propose to go further and to investigate the breakpoints themselves in order to refine them. Results Given some reliable and non overlapping synteny blocks, the core of the method consists in refining the regions that are not contained in them. By aligning each breakpoint sequence against its specific orthologous sequences in the other species, we can look for weak similarities inside the breakpoint, thus extending the synteny blocks and narrowing the breakpoints. The identification of the narrowed breakpoints relies on a segmentation algorithm and is statistically assessed. Since this method requires as input synteny blocks with some properties which, though they appear natural, are not verified by current methods for detecting such blocks, we further give a formal definition and provide an algorithm to compute them. The whole method is applied to delimit breakpoints on the human genome when compared to the mouse and dog genomes. Among the 355 human-mouse and 240 human-dog breakpoints, 168 and 146 respectively span less than 50 Kb. We compared the resulting breakpoints with some publicly available ones and show that we achieve a better resolution. Furthermore, we suggest that breakpoints are rarely reduced to a point, and instead consist in often large regions that can be distinguished from the sequences around in terms of segmental duplications, similarity with related species, and transposable elements. Conclusion Our method leads to smaller breakpoints than already published ones and allows for a better description of their internal structure. In the majority of cases, our refined regions of breakpoint exhibit specific biological properties (no similarity, presence of segmental duplications and of transposable elements). We hope that this new result may provide some insight into the mechanism and evolutionary properties of chromosomal rearrangements.
Collapse
|
94
|
Misceo D, Capozzi O, Roberto R, Dell'oglio MP, Rocchi M, Stanyon R, Archidiacono N. Tracking the complex flow of chromosome rearrangements from the Hominoidea Ancestor to extant Hylobates and Nomascus Gibbons by high-resolution synteny mapping. Genome Res 2008; 18:1530-7. [PMID: 18552313 DOI: 10.1101/gr.078295.108] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
In this study we characterized the extension, reciprocal arrangement, and orientation of syntenic chromosomal segments in the lar gibbon (Hylobates lar, HLA) by hybridization of a panel of approximately 1000 human BAC clones. Each lar gibbon rearrangement was defined by a splitting BAC clone or by two overlapping clones flanking the breakpoint. A reconstruction of the synteny arrangement of the last common ancestor of all living lesser apes was made by combining these data with previous results in Nomascus leucogenys, Hoolock hoolock, and Symphalangus syndactylus. The definition of the ancestral synteny organization facilitated tracking the cascade of chromosomal changes from the Hominoidea ancestor to the present day karyotype of Hylobates and Nomascus. Each chromosomal rearrangement could be placed within an approximate phylogenetic and temporal framework. We identified 12 lar-specific rearrangements and five previously undescribed rearrangements that occurred in the Hylobatidae ancestor. The majority of the chromosomal differences between lar gibbons and humans are due to rearrangements that occurred in the Hylobatidae ancestor (38 events), consistent with the hypothesis that the genus Hylobates is the most recently evolved lesser ape genus. The rates of rearrangements in gibbons are 10 to 20 times higher than the mammalian default rate. Segmental duplication may be a driving force in gibbon chromosome evolution, because a consistent number of rearrangements involves pericentromeric regions (10 events) and centromere inactivation (seven events). Both phenomena can be reasonably supposed to have strongly contributed to the euchromatic dispersal of segmental duplications typical of pericentromeric regions. This hypothesis can be more fully tested when the sequence of this gibbon species becomes available. The detailed synteny map provided here will, in turn, substantially facilitate sequence assembly efforts.
Collapse
Affiliation(s)
- Doriana Misceo
- Department of Genetics and Microbiology, University of Bari, 70126 Bari, Italy
| | | | | | | | | | | | | |
Collapse
|
95
|
Jiang Z, Hubley R, Smit A, Eichler EE. DupMasker: a tool for annotating primate segmental duplications. Genome Res 2008; 18:1362-8. [PMID: 18502942 DOI: 10.1101/gr.078477.108] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Segmental duplications (SDs) play an important role in genome rearrangement, evolution, and the copy-number variation (CNV) of primate genomes. Such sequences are difficult to detect, a priori, because they share no defining sequence features that distinguish them from unique portions of the genome. Current sequence annotation of segmental duplications requires computationally intensive, genome-wide self-comparisons that cannot be easily implemented on new data sets. Based on the successful implementation of RepeatMasker, we developed a new genome annotation tool, DupMasker. The program uses a library of nonredundant consensus sequences of human segmental duplications, wherein a majority of the ancestral origins have been determined based on comparisons to mammalian outgroup genomes. Using DupMasker, new human and nonhuman primate (NHP) sequences may be readily queried to provide details on the origin and degree of sequence identity of each duplicon. This program can be applied to delineate the order and orientation of duplicons within complex duplication blocks and used to characterize structural variation differences between sequenced human haplotypes. We predict this tool will be valuable in the annotation of large-insert sequence clones, allowing putative unique and duplicated regions of the genomes to be annotated prior to whole genome assembly comparisons.
Collapse
Affiliation(s)
- Zhaoshi Jiang
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | | | | | | |
Collapse
|
96
|
Kehrer-Sawatzki H, Cooper DN. Molecular mechanisms of chromosomal rearrangement during primate evolution. Chromosome Res 2008; 16:41-56. [PMID: 18293104 DOI: 10.1007/s10577-007-1207-1] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
Breakpoint analysis of the large chromosomal rearrangements which have occurred during primate evolution promises to yield new insights into the underlying mechanisms of mutagenesis. Comparison of these evolutionary breakpoints with those that are disease-associated in humans, and which occur during either meiotic or mitotic cell division, should help to identify basic mechanistic similarities as well as differences. It has recently become clear that segmental duplications (SDs) have had a very significant impact on genome plasticity during primate evolution. In comparisons of the human and chimpanzee genomes, SDs have been found in flanking regions of 70-80% of inversions and approximately 40% of deletions/duplications. A strong spatial association between primate-specific breakpoints and SDs has also become evident from comparisons of human with other mammalian genomes. The lineage-specific hyperexpansion of certain SDs observed in the genomes of human, chimpanzee, gorilla and gibbon is indicative of the intrinsic instability of some SDs in primates. However, since many primate-specific breakpoints map to regions lacking SDs, but containing interspersed high-copy repetitive sequence elements such as SINEs, LINEs, LTRs, alpha-satellites and (AT)( n ) repeats, we may infer that a range of different molecular mechanisms have probably been involved in promoting chromosomal breakage during the evolution of primate genomes.
Collapse
|
97
|
Abstract
The evolution of karyotypes has been the subject of intensive study since the middle of the 20th century. This was motivated by the observation that the karyotypes of related species showed remarkable conservation. The recent emergence of whole-genome sequencing projects gives the opportunity to complement the cytogenetic approaches by addressing the conservation of karyotypes using chromosome sequence comparison. In this short review we present a description of recent advances in computational biology methods dedicated to the study of chromosome evolution and more specifically ancestral karyotype reconstruction in an attempt to provide an integrated overview of both cytogenetic and computational approaches.
Collapse
|
98
|
Abstract
In 1992 the Japanese macaque was the first species for which the homology of the entire karyotype was established by cross-species chromosome painting. Today, there are chromosome painting data on more than 50 species of primates. Although chromosome painting is a rapid and economical method for tracking translocations, it has limited utility for revealing intrachromosomal rearrangements. Fortunately, the use of BAC-FISH in the last few years has allowed remarkable progress in determining marker order along primate chromosomes and there are now marker order data on an array of primate species for a good number of chromosomes. These data reveal inversions, but also show that centromeres of many orthologous chromosomes are embedded in different genomic contexts. Even if the mechanisms of neocentromere formation and progression are just beginning to be understood, it is clear that these phenomena had a significant impact on shaping the primate genome and are fundamental to our understanding of genome evolution. In this report we complete and integrate the dataset of BAC-FISH marker order for human syntenies 1, 2, 4, 5, 8, 12, 17, 18, 19, 21, 22 and the X. These results allowed us to develop hypotheses about the content, marker order and centromere position in ancestral karyotypes at five major branching points on the primate evolutionary tree: ancestral primate, ancestral anthropoid, ancestral platyrrhine, ancestral catarrhine and ancestral hominoid. Current models suggest that between-species structural rearrangements are often intimately related to speciation. Comparative primate cytogenetics has become an important tool for elucidating the phylogeny and the taxonomy of primates. It has become increasingly apparent that molecular cytogenetic data in the future can be fruitfully combined with whole-genome assemblies to advance our understanding of primate genome evolution as well as the mechanisms and processes that have led to the origin of the human genome.
Collapse
|
99
|
Kirsch S, Münch C, Jiang Z, Cheng Z, Chen L, Batz C, Eichler EE, Schempp W. Evolutionary dynamics of segmental duplications from human Y-chromosomal euchromatin/heterochromatin transition regions. Genome Res 2008; 18:1030-42. [PMID: 18445620 DOI: 10.1101/gr.076711.108] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Human chromosomal regions enriched in segmental duplications are subject to extensive genomic reorganization. Such regions are particularly informative for illuminating the evolutionary history of a given chromosome. We have analyzed 866 kb of Y-chromosomal non-palindromic segmental duplications delineating four euchromatin/heterochromatin transition regions (Yp11.2/Yp11.1, Yq11.1/Yq11.21, Yq11.23/Yq12, and Yq12/PAR2). Several computational methods were applied to decipher the segmental duplication architecture and identify the ancestral origin of the 41 different duplicons. Combining computational and comparative FISH analysis, we reconstruct the evolutionary history of these regions. Our analysis indicates a continuous process of transposition of duplicated sequences onto the evolving higher primate Y chromosome, providing unique insights into the development of species-specific Y-chromosomal and autosomal duplicons. Phylogenetic sequence comparisons show that duplicons of the human Yp11.2/Yp11.1 region were already present in the macaque-human ancestor as multiple paralogs located predominantly in subtelomeric regions. In contrast, duplicons from the Yq11.1/Yq11.21, Yq11.23/Yq12, and Yq12/PAR2 regions show no evidence of duplication in rhesus macaque, but map to the pericentromeric regions in chimpanzee and human. This suggests an evolutionary shift in the direction of duplicative transposition events from subtelomeric in Old World monkeys to pericentromeric in the human/ape lineage. Extensive chromosomal relocation of autosomal-duplicated sequences from euchromatin/heterochromatin transition regions to interstitial regions as demonstrated on the pygmy chimpanzee Y chromosome support a model in which substantial reorganization and amplification of duplicated sequences may contribute to speciation.
Collapse
Affiliation(s)
- Stefan Kirsch
- Institute of Human Genetics, University of Freiburg, 79106 Freiburg, Germany
| | | | | | | | | | | | | | | |
Collapse
|
100
|
Abstract
There is growing appreciation that the human genome contains significant numbers of structural rearrangements, such as insertions, deletions, inversions, and large tandem repeats. Recent studies have defined approximately 5% of the human genome as structurally variant in the normal population, involving more than 800 independent genes. We present a detailed review of the various structural rearrangements identified to date in humans, with particular reference to their influence on human phenotypic variation. Our current knowledge of the extent of human structural variation shows that the human genome is a highly dynamic structure that shows significant large-scale variation from the currently published genome reference sequence.
Collapse
Affiliation(s)
- Andrew J Sharp
- Department of Genome Sciences, University of Washington, Howard Hughes Medical Institute, Seattle, Washington 98195, USA
| | | | | |
Collapse
|