101
|
Lappalainen T, Dermitzakis ET. Evolutionary history of regulatory variation in human populations. Hum Mol Genet 2010; 19:R197-203. [DOI: 10.1093/hmg/ddq406] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
|
102
|
Ju YS, Hong D, Kim S, Park SS, Kim S, Lee S, Park H, Kim JI, Seo JS. Reference-unbiased copy number variant analysis using CGH microarrays. Nucleic Acids Res 2010; 38:e190. [PMID: 20802225 PMCID: PMC2978381 DOI: 10.1093/nar/gkq730] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Comparative genomic hybridization (CGH) microarrays have been used to determine copy number variations (CNVs) and their effects on complex diseases. Detection of absolute CNVs independent of genomic variants of an arbitrary reference sample has been a critical issue in CGH array experiments. Whole genome analysis using massively parallel sequencing with multiple ultra-high resolution CGH arrays provides an opportunity to catalog highly accurate genomic variants of the reference DNA (NA10851). Using information on variants, we developed a new method, the CGH array reference-free algorithm (CARA), which can determine reference-unbiased absolute CNVs from any CGH array platform. The algorithm enables the removal and rescue of false positive and false negative CNVs, respectively, which appear due to the effects of genomic variants of the reference sample in raw CGH array experiments. We found that the CARA remarkably enhanced the accuracy of CGH array in determining absolute CNVs. Our method thus provides a new approach to interpret CGH array data for personalized medicine.
Collapse
Affiliation(s)
- Young Seok Ju
- Genomic Medicine Institute, Medical Research Center, Seoul National University, Department of Biochemistry and Molecular Biology, Seoul National University College of Medicine, Seoul 110-799, Korea
| | | | | | | | | | | | | | | | | |
Collapse
|
103
|
Abstract
Uncovering general principles of genome evolution that are time-invariant and that operate in germ and somatic cells has implications for genome-wide association studies (GWAS), gene therapy, and disease genomics. Here we investigate the relationship between structural alterations (e.g., insertions and deletions) and single-nucleotide substitutions by comparing the following genomes that diverged at different times across germ- and somatic-cell lineages: (i) the reference human and chimpanzee genome (in million years), (ii) the reference human and personal genomes (in tens of thousands of years), and (iii) structurally altered regions in cancer and genetically engineered cells (in days). At the species level, genes with structural alteration in nearby regions show increased single-nucleotide changes and tend to evolve faster. In personal genomes, the single-nucleotide substitution rate is higher near sites of structural alteration and decreases with increasing distance. In human cancer cell populations and in cells genetically engineered using zinc-finger nucleases, single-nucleotide changes occur frequently near sites of structural alterations. We present evidence that structural alteration induces single-nucleotide changes in nearby regions and discuss possible molecular mechanisms that contribute to this phenomenon. We propose that the low fidelity of nonreplicative error-prone repair polymerases, which are used during insertion or deletion, result in break-repair-induced single-nucleotide mutations in the vicinity of structural alteration. Thus, in the mutational landscape, structural alterations are linked to single-nucleotide changes across different time scales in both somatic- and germ-cell lineages. We discuss implications for genome evolution, GWAS, disease genomics, and gene therapy and emphasize the need to investigate both types of mutations within a single framework.
Collapse
|
104
|
Ohnologs in the human genome are dosage balanced and frequently associated with disease. Proc Natl Acad Sci U S A 2010; 107:9270-4. [PMID: 20439718 DOI: 10.1073/pnas.0914697107] [Citation(s) in RCA: 197] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
About 30% of protein-coding genes in the human genome are related through two whole genome duplication (WGD) events. Although WGD is often credited with great evolutionary importance, the processes governing the retention of these genes and their biological significance remain unclear. One increasingly popular hypothesis is that dosage balance constraints are a major determinant of duplicate gene retention. We test this hypothesis and show that WGD-duplicated genes (ohnologs) have rarely experienced subsequent small-scale duplication (SSD) and are also refractory to copy number variation (CNV) in human populations and are thus likely to be sensitive to relative quantities (i.e., they are dosage-balanced). By contrast, genes that have experienced SSD in the vertebrate lineage are more likely to also display CNV. This supports the hypothesis of biased retention of dosage-balanced genes after WGD. We also show that ohnologs have a strong association with human disease. In particular, Down Syndrome (DS) caused by trisomy 21 is widely assumed to be caused by dosage effects, and 75% of previously reported candidate genes for this syndrome are ohnologs that experienced no other copy number changes. We propose the remaining dosage-balanced ohnologs on chromosome 21 as candidate DS genes. These observations clearly show a persistent resistance to dose changes in genes duplicated by WGD. Dosage balance constraints simultaneously explain duplicate gene retention and essentiality after WGD.
Collapse
|
105
|
Hehir-Kwa JY, Wieskamp N, Webber C, Pfundt R, Brunner HG, Gilissen C, de Vries BBA, Ponting CP, Veltman JA. Accurate distinction of pathogenic from benign CNVs in mental retardation. PLoS Comput Biol 2010; 6:e1000752. [PMID: 20421931 PMCID: PMC2858682 DOI: 10.1371/journal.pcbi.1000752] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2009] [Accepted: 03/19/2010] [Indexed: 11/18/2022] Open
Abstract
Copy number variants (CNVs) have recently been recognized as a common form of genomic variation in humans. Hundreds of CNVs can be detected in any individual genome using genomic microarrays or whole genome sequencing technology, but their phenotypic consequences are still poorly understood. Rare CNVs have been reported as a frequent cause of neurological disorders such as mental retardation (MR), schizophrenia and autism, prompting widespread implementation of CNV screening in diagnostics. In previous studies we have shown that, in contrast to benign CNVs, MR-associated CNVs are significantly enriched in genes whose mouse orthologues, when disrupted, result in a nervous system phenotype. In this study we developed and validated a novel computational method for differentiating between benign and MR-associated CNVs using structural and functional genomic features to annotate each CNV. In total 13 genomic features were included in the final version of a Naïve Bayesian Tree classifier, with LINE density and mouse knock-out phenotypes contributing most to the classifier's accuracy. After demonstrating that our method (called GECCO) perfectly classifies CNVs causing known MR-associated syndromes, we show that it achieves high accuracy (94%) and negative predictive value (99%) on a blinded test set of more than 1,200 CNVs from a large cohort of individuals with MR. These results indicate that this classification method will be of value for objectively prioritizing CNVs in clinical research and diagnostics. Rare copy number variants (CNVs) are a frequent cause of neurological disorders such as mental retardation (MR). However CNVs are also commonly identified in healthy individuals. It is therefore crucial for both diagnostic and research applications to be able to distinguish between disease-causing CNVs and “benign” CNVs occurring as normal genomic variation. Separating these two types can take advantage of significant differences in their genomic contents. For example, benign CNVs are enriched in repetitive sequences. By contrast, CNVs associated with MR tend to have high densities of functional elements, including genes whose mouse orthologues, when knocked-out, lead to specific nervous system abnormalities. We have developed a novel objective approach that is effective in distinguishing MR-associated CNVs from benign CNVs based on the presence of 13 genomic attributes. This method is able to achieve high accuracies in a cohort of CNVs known to cause MR and in a cohort of individuals with unexplained MR. The development of this technique promises to substantially improve the methodology for determining the pathogenicity of CNVs.
Collapse
Affiliation(s)
- Jayne Y. Hehir-Kwa
- Radboud University Nijmegen Medical Centre, Department of Human Genetics, Nijmegen, The Netherlands
| | - Nienke Wieskamp
- Radboud University Nijmegen Medical Centre, Department of Human Genetics, Nijmegen, The Netherlands
| | - Caleb Webber
- MRC Functional Genomics Unit, University of Oxford, Department of Physiology, Anatomy and Genetics, Oxford, United Kingdom
| | - Rolph Pfundt
- Radboud University Nijmegen Medical Centre, Department of Human Genetics, Nijmegen, The Netherlands
| | - Han G. Brunner
- Radboud University Nijmegen Medical Centre, Department of Human Genetics, Nijmegen, The Netherlands
| | - Christian Gilissen
- Radboud University Nijmegen Medical Centre, Department of Human Genetics, Nijmegen, The Netherlands
| | - Bert B. A. de Vries
- Radboud University Nijmegen Medical Centre, Department of Human Genetics, Nijmegen, The Netherlands
| | - Chris P. Ponting
- MRC Functional Genomics Unit, University of Oxford, Department of Physiology, Anatomy and Genetics, Oxford, United Kingdom
| | - Joris A. Veltman
- Radboud University Nijmegen Medical Centre, Department of Human Genetics, Nijmegen, The Netherlands
- * E-mail:
| |
Collapse
|
106
|
Völker M, Backström N, Skinner BM, Langley EJ, Bunzey SK, Ellegren H, Griffin DK. Copy number variation, chromosome rearrangement, and their association with recombination during avian evolution. Genome Res 2010; 20:503-11. [PMID: 20357050 PMCID: PMC2847753 DOI: 10.1101/gr.103663.109] [Citation(s) in RCA: 111] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2009] [Accepted: 02/08/2010] [Indexed: 11/25/2022]
Abstract
Chromosomal rearrangements and copy number variants (CNVs) play key roles in genome evolution and genetic disease; however, the molecular mechanisms underlying these types of structural genomic variation are not fully understood. The availability of complete genome sequences for two bird species, the chicken and the zebra finch, provides, for the first time, an ideal opportunity to analyze the relationship between structural genomic variation (chromosomal and CNV) and recombination on a genome-wide level. The aims of this study were therefore threefold: (1) to combine bioinformatics, physical mapping to produce comprehensive comparative maps of the genomes of chicken and zebra finch. In so doing, this allowed the identification of evolutionary chromosomal rearrangements distinguishing them. The previously reported interchromosomal conservation of synteny was confirmed, but a larger than expected number of intrachromosomal rearrangements were reported; (2) to hybridize zebra finch genomic DNA to a chicken tiling path microarray and identify CNVs in the zebra finch genome relative to chicken; 32 interspecific CNVs were identified; and (3) to test the hypothesis that there is an association between CNV, chromosomal rearrangements, and recombination by correlating data from (1) and (2) with recombination rate data from a high-resolution genetic linkage map of the zebra finch. We found a highly significant association of both chromosomal rearrangements and CNVs with elevated recombination rates. The results thus provide support for the notion of recombination-based processes playing a major role in avian genome evolution.
Collapse
Affiliation(s)
- Martin Völker
- School of Biosciences, University of Kent, Canterbury, Kent CT2 7NJ, United Kingdom
| | - Niclas Backström
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18D, SE-752 36 Uppsala, Sweden
| | - Benjamin M. Skinner
- School of Biosciences, University of Kent, Canterbury, Kent CT2 7NJ, United Kingdom
| | - Elizabeth J. Langley
- School of Biosciences, University of Kent, Canterbury, Kent CT2 7NJ, United Kingdom
| | - Sydney K. Bunzey
- School of Biosciences, University of Kent, Canterbury, Kent CT2 7NJ, United Kingdom
| | - Hans Ellegren
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18D, SE-752 36 Uppsala, Sweden
| | - Darren K. Griffin
- School of Biosciences, University of Kent, Canterbury, Kent CT2 7NJ, United Kingdom
| |
Collapse
|
107
|
Johansen CT, Hegele RA. Predictive genetic testing for coronary artery disease. Crit Rev Clin Lab Sci 2010; 46:343-60. [PMID: 19958218 DOI: 10.3109/07388550903422075] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Coronary artery disease (CAD) is an inflammatory-metabolic disease in which atherosclerotic plaques cause stenosis of the coronary arteries, leading to acute clinical complications such as myocardial infarction. Since CAD is a multifactorial, polygenic disease with a substantial environmental component, individual risk prediction and stratification is often difficult. Recent technological advances have resulted in substantial progress elucidating the impact of common genetic variation on CAD progression. The discovery of common genetic variants, including the chromosome 9p21.3 locus as the strongest and most highly replicated independent genetic CAD risk factor, has stimulated interest in genetic testing for CAD-associated risk variants. The ultimate goal of genotype-based CAD risk prediction is to improve upon the discrimination and stratification offered by conventional risk factors alone. Genotype-based CAD risk prediction may eventually have clinical utility, but not without intrinsic complexities. Are genotype variables superior predictors of risk compared to a family history of CAD? Is a 10-year risk prediction window ambitious enough for the predictive power of genotype data? This review will outline the current state of genotype-based CAD risk prediction and highlight challenging issues integral to the successful implementation of genetic testing for CAD.
Collapse
|
108
|
Abstract
During the last quarter of the twentieth century, our knowledge about human genetic variation was limited mainly to the heterochromatin polymorphisms, large enough to be visible in the light microscope, and the single nucleotide polymorphisms (SNPs) identified by traditional PCR-based DNA sequencing. In the past five years, the rapid development and expanded use of microarray technologies, including oligonucleotide array comparative genomic hybridization and SNP genotyping arrays, as well as next-generation sequencing with “paired-end” methods, has enabled a whole-genome analysis with essentially unlimited resolution. The discovery of submicroscopic copy-number variations (CNVs) present in our genomes has changed dramatically our perspective on DNA structural variation and disease. It is now thought that CNVs encompass more total nucleotides and arise more frequently than SNPs. CNVs, to a larger extent than SNPs, have been shown to be responsible for human evolution, genetic diversity between individuals, and a rapidly increasing number of traits or susceptibility to traits; such conditions have been referred to as genomic disorders. In addition to well-known sporadic chromosomal microdeletion syndromes and Mendelian diseases, many common complex traits including autism and schizophrenia can result from CNVs. Both recombination- and replication-based mechanisms for CNV formation have been described.
Collapse
Affiliation(s)
| | - James R. Lupski
- Departments of Molecular and Human Genetics, Houston, Texas 77030
- Departments of Pediatrics, Baylor College of Medicine, Houston, Texas 77030
- Departments of Texas Children's Hospital, Houston, Texas 77030
| |
Collapse
|
109
|
Copy-number variants in neurodevelopmental disorders: promises and challenges. Trends Genet 2009; 25:536-44. [DOI: 10.1016/j.tig.2009.10.006] [Citation(s) in RCA: 102] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2009] [Revised: 10/14/2009] [Accepted: 10/15/2009] [Indexed: 02/01/2023]
|
110
|
Rich SS, Akolkar B, Concannon P, Erlich H, Hilner JE, Julier C, Morahan G, Nerup J, Nierras C, Pociot F, Todd JA. Current status and the future for the genetics of type I diabetes. Genes Immun 2009; 10 Suppl 1:S128-31. [PMID: 19956094 PMCID: PMC2805458 DOI: 10.1038/gene.2009.100] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The Type I Diabetes Genetics Consortium (T1DGC) is an international collaboration whose primary goal is to identify genes whose variants modify an individual's risk of type I diabetes (T1D). An integral part of the T1DGC's mission is the establishment of clinical and data resources that can be used by, and that are fully accessible to, the T1D research community (http://www.t1dgc.org). The T1DGC has organized the collection and analyses of study samples and conducted several major research projects focused on T1D gene discovery: a genome-wide linkage scan, an intensive evaluation of the human major histocompatibility complex, a detailed examination of published candidate genes, and a genome-wide association scan. These studies have provided important information to the scientific community regarding the function of specific genes or chromosomal regions on T1D risk. The results are continually being updated and displayed (http://www.t1dbase.org). The T1DGC welcomes all investigators interested in using these data for scientific endeavors on T1D. The T1DGC resources provide a framework for future research projects, including examination of structural variation, re-sequencing of candidate regions in a search for T1D-associated genes and causal variants, correlation of T1D risk genotypes with biomarkers obtained from T1DGC serum and plasma samples, and in-depth bioinformatics analyses.
Collapse
Affiliation(s)
- S S Rich
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA 22908, USA.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
111
|
Springer NM, Ying K, Fu Y, Ji T, Yeh CT, Jia Y, Wu W, Richmond T, Kitzman J, Rosenbaum H, Iniguez AL, Barbazuk WB, Jeddeloh JA, Nettleton D, Schnable PS. Maize inbreds exhibit high levels of copy number variation (CNV) and presence/absence variation (PAV) in genome content. PLoS Genet 2009; 5:e1000734. [PMID: 19956538 PMCID: PMC2780416 DOI: 10.1371/journal.pgen.1000734] [Citation(s) in RCA: 355] [Impact Index Per Article: 23.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2009] [Accepted: 10/19/2009] [Indexed: 12/25/2022] Open
Abstract
Following the domestication of maize over the past approximately 10,000 years, breeders have exploited the extensive genetic diversity of this species to mold its phenotype to meet human needs. The extent of structural variation, including copy number variation (CNV) and presence/absence variation (PAV), which are thought to contribute to the extraordinary phenotypic diversity and plasticity of this important crop, have not been elucidated. Whole-genome, array-based, comparative genomic hybridization (CGH) revealed a level of structural diversity between the inbred lines B73 and Mo17 that is unprecedented among higher eukaryotes. A detailed analysis of altered segments of DNA conservatively estimates that there are several hundred CNV sequences among the two genotypes, as well as several thousand PAV sequences that are present in B73 but not Mo17. Haplotype-specific PAVs contain hundreds of single-copy, expressed genes that may contribute to heterosis and to the extraordinary phenotypic diversity of this important crop.
Collapse
Affiliation(s)
- Nathan M. Springer
- Department of Plant Biology, University of Minnesota, Saint Paul, Minnesota, United States of America
| | - Kai Ying
- Interdepartmental Genetics Graduate Program, Iowa State University, Ames, Iowa, United States of America
- Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, Iowa, United States of America
| | - Yan Fu
- Department of Agronomy, Iowa State University, Ames, Iowa, United States of America
- Center for Carbon Capturing Crops, Iowa State University, Ames, Iowa, United States of America
| | - Tieming Ji
- Department of Statistics, Iowa State University, Ames, Iowa, United States of America
| | - Cheng-Ting Yeh
- Department of Agronomy, Iowa State University, Ames, Iowa, United States of America
- Center for Plant Genomics, Iowa State University, Ames, Iowa, United States of America
| | - Yi Jia
- Interdepartment Plant Biology, Iowa State University, Ames, Iowa, United States of America
| | - Wei Wu
- Department of Agronomy, Iowa State University, Ames, Iowa, United States of America
- Center for Plant Genomics, Iowa State University, Ames, Iowa, United States of America
| | - Todd Richmond
- Roche NimbleGen, Madison, Wisconsin, United States of America
| | - Jacob Kitzman
- Roche NimbleGen, Madison, Wisconsin, United States of America
| | - Heidi Rosenbaum
- Roche NimbleGen, Madison, Wisconsin, United States of America
| | | | - W. Brad Barbazuk
- University of Florida, Gainesville, Florida, United States of America
| | | | - Dan Nettleton
- Department of Statistics, Iowa State University, Ames, Iowa, United States of America
| | - Patrick S. Schnable
- Interdepartmental Genetics Graduate Program, Iowa State University, Ames, Iowa, United States of America
- Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, Iowa, United States of America
- Department of Agronomy, Iowa State University, Ames, Iowa, United States of America
- Center for Carbon Capturing Crops, Iowa State University, Ames, Iowa, United States of America
- Center for Plant Genomics, Iowa State University, Ames, Iowa, United States of America
- Interdepartment Plant Biology, Iowa State University, Ames, Iowa, United States of America
- * E-mail:
| |
Collapse
|
112
|
Delprat A, Negre B, Puig M, Ruiz A. The transposon Galileo generates natural chromosomal inversions in Drosophila by ectopic recombination. PLoS One 2009; 4:e7883. [PMID: 19936241 PMCID: PMC2775673 DOI: 10.1371/journal.pone.0007883] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2009] [Accepted: 10/01/2009] [Indexed: 11/25/2022] Open
Abstract
Background Transposable elements (TEs) are responsible for the generation of chromosomal inversions in several groups of organisms. However, in Drosophila and other Dipterans, where inversions are abundant both as intraspecific polymorphisms and interspecific fixed differences, the evidence for a role of TEs is scarce. Previous work revealed that the transposon Galileo was involved in the generation of two polymorphic inversions of Drosophila buzzatii. Methodology/Principal Findings To assess the impact of TEs in Drosophila chromosomal evolution and shed light on the mechanism involved, we isolated and sequenced the two breakpoints of another widespread polymorphic inversion from D. buzzatii, 2z3. In the non inverted chromosome, the 2z3 distal breakpoint was located between genes CG2046 and CG10326 whereas the proximal breakpoint lies between two novel genes that we have named Dlh and Mdp. In the inverted chromosome, the analysis of the breakpoint sequences revealed relatively large insertions (2,870-bp and 4,786-bp long) including two copies of the transposon Galileo (subfamily Newton), one at each breakpoint, plus several other TEs. The two Galileo copies: (i) are inserted in opposite orientation; (ii) present exchanged target site duplications; and (iii) are both chimeric. Conclusions/Significance Our observations provide the best evidence gathered so far for the role of TEs in the generation of Drosophila inversions. In addition, they show unequivocally that ectopic recombination is the causative mechanism. The fact that the three polymorphic D. buzzatii inversions investigated so far were generated by the same transposon family is remarkable and is conceivably due to Galileo's unusual structure and current (or recent) transpositional activity.
Collapse
Affiliation(s)
- Alejandra Delprat
- Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Bellaterra (Barcelona), Spain
| | | | | | | |
Collapse
|
113
|
Origins and functional impact of copy number variation in the human genome. Nature 2009; 464:704-12. [PMID: 19812545 DOI: 10.1038/nature08516] [Citation(s) in RCA: 1391] [Impact Index Per Article: 92.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2009] [Accepted: 09/21/2009] [Indexed: 02/07/2023]
Abstract
Structural variations of DNA greater than 1 kilobase in size account for most bases that vary among human genomes, but are still relatively under-ascertained. Here we use tiling oligonucleotide microarrays, comprising 42 million probes, to generate a comprehensive map of 11,700 copy number variations (CNVs) greater than 443 base pairs, of which most (8,599) have been validated independently. For 4,978 of these CNVs, we generated reference genotypes from 450 individuals of European, African or East Asian ancestry. The predominant mutational mechanisms differ among CNV size classes. Retrotransposition has duplicated and inserted some coding and non-coding DNA segments randomly around the genome. Furthermore, by correlation with known trait-associated single nucleotide polymorphisms (SNPs), we identified 30 loci with CNVs that are candidates for influencing disease susceptibility. Despite this, having assessed the completeness of our map and the patterns of linkage disequilibrium between CNVs and SNPs, we conclude that, for complex traits, the heritability void left by genome-wide association studies will not be accounted for by common CNVs.
Collapse
|
114
|
Armengol L, Villatoro S, González JR, Pantano L, García-Aragonés M, Rabionet R, Cáceres M, Estivill X. Identification of copy number variants defining genomic differences among major human groups. PLoS One 2009; 4:e7230. [PMID: 19789632 PMCID: PMC2747275 DOI: 10.1371/journal.pone.0007230] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2009] [Accepted: 08/20/2009] [Indexed: 12/14/2022] Open
Abstract
Background Understanding the genetic contribution to phenotype variation of human groups is necessary to elucidate differences in disease predisposition and response to pharmaceutical treatments in different human populations. Methodology/Principal Findings We have investigated the genome-wide profile of structural variation on pooled samples from the three populations studied in the HapMap project by comparative genome hybridization (CGH) in different array platforms. We have identified and experimentally validated 33 genomic loci that show significant copy number differences from one population to the other. Interestingly, we found an enrichment of genes related to environment adaptation (immune response, lipid metabolism and extracellular space) within these regions and the study of expression data revealed that more than half of the copy number variants (CNVs) translate into gene-expression differences among populations, suggesting that they could have functional consequences. In addition, the identification of single nucleotide polymorphisms (SNPs) that are in linkage disequilibrium with the copy number alleles allowed us to detect evidences of population differentiation and recent selection at the nucleotide variation level. Conclusions Overall, our results provide a comprehensive view of relevant copy number changes that might play a role in phenotypic differences among major human populations, and generate a list of interesting candidates for future studies.
Collapse
Affiliation(s)
- Lluís Armengol
- Genetic Causes of Disease Group, Genes and Disease Program, Center for Genomic Regulation (CRG-UPF) and CIBERESP, Barcelona, Catalonia, Spain
- Quantitative Genomic Medicine Laboratories (qGenomics), Barcelona, Catalonia, Spain
| | - Sergi Villatoro
- Genetic Causes of Disease Group, Genes and Disease Program, Center for Genomic Regulation (CRG-UPF) and CIBERESP, Barcelona, Catalonia, Spain
| | - Juan R. González
- Center for Research in Environmental Epidemiology (CREAL), Barcelona, Catalonia, Spain
| | - Lorena Pantano
- Genetic Causes of Disease Group, Genes and Disease Program, Center for Genomic Regulation (CRG-UPF) and CIBERESP, Barcelona, Catalonia, Spain
| | - Manel García-Aragonés
- Genetic Causes of Disease Group, Genes and Disease Program, Center for Genomic Regulation (CRG-UPF) and CIBERESP, Barcelona, Catalonia, Spain
| | - Raquel Rabionet
- Genetic Causes of Disease Group, Genes and Disease Program, Center for Genomic Regulation (CRG-UPF) and CIBERESP, Barcelona, Catalonia, Spain
| | - Mario Cáceres
- Genetic Causes of Disease Group, Genes and Disease Program, Center for Genomic Regulation (CRG-UPF) and CIBERESP, Barcelona, Catalonia, Spain
| | - Xavier Estivill
- Genetic Causes of Disease Group, Genes and Disease Program, Center for Genomic Regulation (CRG-UPF) and CIBERESP, Barcelona, Catalonia, Spain
- Genetics Unit, Department of Health and Experimental Life Sciences, Pompeu Fabra University (UPF), Barcelona, Catalonia, Spain
- National Genotyping Center (CeGen) Barcelona Genotyping Node, Center for Genomic Regulation (CRG-UPF), Barcelona, Catalonia, Spain
- * E-mail:
| |
Collapse
|
115
|
Abstract
Genomic structural variation is generally defined as deletions, insertions, duplications, inversions, translocations or copy number variation (CNV) in large DNA segments (>1 kb). The structural variation in an individual genome includes thousands of discrete regions, spans millions of base pairs, and encompasses numerous entire genes and their regulatory regions. This results in missing or change of gene functions, and subsequently leads to phenotypic changes, disease susceptibilities or induction of diseases. Research on genomic structural variation is useful in analyzing the integrated genotype with genomic variation and understanding the potential medical effects and the entire function of the organism. Here, we reviewed the latest research progresses of the types of human genomic structural variants and the methods for disclosing these variants, as well as the impact of the variants on individual phenotype, disease, and evolution.
Collapse
Affiliation(s)
- Yong-Shu He
- Department of Biology, Kunming Medical College, Kunming 650031, China.
| | | | | |
Collapse
|
116
|
Wu X, Xiao H. Progress in the detection of human genome structural variations. ACTA ACUST UNITED AC 2009; 52:560-7. [DOI: 10.1007/s11427-009-0078-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2008] [Accepted: 01/21/2009] [Indexed: 01/04/2023]
|
117
|
Zhang F, Carvalho CMB, Lupski JR. Complex human chromosomal and genomic rearrangements. Trends Genet 2009; 25:298-307. [PMID: 19560228 DOI: 10.1016/j.tig.2009.05.005] [Citation(s) in RCA: 199] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2009] [Revised: 05/14/2009] [Accepted: 05/19/2009] [Indexed: 01/08/2023]
Abstract
Copy number variation (CNV) is a major source of genetic variation among humans. In addition to existing as benign polymorphisms, CNVs can also convey clinical phenotypes, including genomic disorders, sporadic diseases and complex human traits. CNV results from genomic rearrangements that can represent simple deletion or duplication of a genomic segment, or be more complex. Complex chromosomal rearrangements (CCRs) have been known for some time but their mechanisms have remained elusive. Recent technology advances and high-resolution human genome analyses have revealed that complex genomic rearrangements can account for a large fraction of non-recurrent rearrangements at a given locus. Various mechanisms, most of which are DNA-replication-based, for example fork stalling and template switching (FoSTeS) and microhomology-mediated break-induced replication (MMBIR), have been proposed for generating such complex genomic rearrangements and are probably responsible for CCR.
Collapse
Affiliation(s)
- Feng Zhang
- Department of Molecular and Human Genetics, Baylor College of Medicine, and Texas Children's Hospital, Houston, TX 77030, USA
| | | | | |
Collapse
|
118
|
Faddah DA, Ganko EW, McCoach C, Pickrell JK, Hanlon SE, Mann FG, Mieczkowska JO, Jones CD, Lieb JD, Vision TJ. Systematic identification of balanced transposition polymorphisms in Saccharomyces cerevisiae. PLoS Genet 2009; 5:e1000502. [PMID: 19503594 PMCID: PMC2682701 DOI: 10.1371/journal.pgen.1000502] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2008] [Accepted: 05/04/2009] [Indexed: 01/22/2023] Open
Abstract
High-throughput techniques for detecting DNA polymorphisms generally do not identify changes in which the genomic position of a sequence, but not its copy number, varies among individuals. To explore such balanced structural polymorphisms, we used array-based Comparative Genomic Hybridization (aCGH) to conduct a genome-wide screen for single-copy genomic segments that occupy different genomic positions in the standard laboratory strain of Saccharomyces cerevisiae (S90) and a polymorphic wild isolate (Y101) through analysis of six tetrads from a cross of these two strains. Paired-end high-throughput sequencing of Y101 validated four of the predicted rearrangements. The transposed segments contained one to four annotated genes each, yet crosses between S90 and Y101 yielded mostly viable tetrads. The longest segment comprised 13.5 kb near the telomere of chromosome XV in the S288C reference strain and Southern blotting confirmed its predicted location on chromosome IX in Y101. Interestingly, inter-locus crossover events between copies of this segment occurred at a detectable rate. The presence of low-copy repetitive sequences at the junctions of this segment suggests that it may have arisen through ectopic recombination. Our methodology and findings provide a starting point for exploring the origins, phenotypic consequences, and evolutionary fate of this largely unexplored form of genomic polymorphism. Balanced structural polymorphisms are differences in the relative arrangement of genomic features within species that do not affect DNA copy number. Little is known about their prevalence or importance because they are difficult to observe. Here, we present a novel methodology for systematically identifying such polymorphisms based on the idea that single-copy DNA that occupies different genomic locations in two parents will segregate independently during meiosis and will therefore reveal itself as a copy number difference among a fraction of progeny. Comparative hybridization reveals multiple balanced structural polymorphisms that involve changes to gene order in two strains of yeast; the results are independently validated using paired-end whole genome shotgun sequencing. The longest transposed segment we identify comprises 13.5 kb near the telomere of chromosome XV in the S288C reference strain and contains several annotated genes. We map the location of this polymorphism in the non-reference strain using genome-wide genotypic data, which also reveals an appreciable frequency of ectopic recombination among transposed segment pairs. The breakpoints of the remaining polymorphisms are localized by the paired-end sequence data. Our work provides proof-of-principle for a very general approach to systematically identify all balanced genomic polymorphisms in two different genotypes and is a starting point for understanding the frequency, evolutionary origins, and functional consequences of this seldom-studied class of genomic structural variation in eukaryotes.
Collapse
Affiliation(s)
- Dina A. Faddah
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Eric W. Ganko
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Caroline McCoach
- Department of Biochemistry, Stanford University, Stanford, California, United States of America
| | - Joseph K. Pickrell
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Sean E. Hanlon
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Frederick G. Mann
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Joanna O. Mieczkowska
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Corbin D. Jones
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Jason D. Lieb
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- * E-mail: (JDL); (TJV)
| | - Todd J. Vision
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- * E-mail: (JDL); (TJV)
| |
Collapse
|
119
|
Yeshaya J, Amir I, Rimon A, Freedman J, Shohat M, Avivi L. Microdeletion syndromes disclose replication timing alterations of genes unrelated to the missing DNA. Mol Cytogenet 2009; 2:11. [PMID: 19284877 PMCID: PMC2660353 DOI: 10.1186/1755-8166-2-11] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2009] [Accepted: 03/14/2009] [Indexed: 11/26/2022] Open
Abstract
Background The temporal order of allelic replication is interrelated to the epigenomic profile. A significant epigenetic marker is the asynchronous replication of monoallelically-expressed genes versus the synchronous replication of biallelically-expressed genes. The present study sought to determine whether a microdeletion in the genome affects epigenetic profiles of genes unrelated to the missing segment. In order to test this hypothesis, we checked the replication patterns of two genes – SNRPN, a normally monoallelically expressed gene (assigned to 15q11.13), and the RB1, an archetypic biallelically expressed gene (assigned to 13.q14) in the genomes of patients carrying the 22q11.2 deletion (DiGeorge/Velocardiofacial syndrome) and those carrying the 7q11.23 deletion (Williams syndrome). Results The allelic replication timing was determined by fluorescence in situ hybridization (FISH) technology performed on peripheral blood cells. As expected, in the cells of normal subjects the frequency of cells showing asynchronous replication for SNRPN was significantly (P < 10-12) higher than the corresponding value for RB1. In contrast, cells of the deletion-carrying patients exhibited a reversal in this replication pattern: there was a significantly lower frequency of cells engaging in asynchronous replication for SNRPN than for RB1 (P < 10-4 and P < 10-3 for DiGeorge/Velocardiofacial and Williams syndromes, respectively). Accordingly, the significantly lower frequency of cells showing asynchronous replication for SNRPN than for RB1 is a new epigenetic marker distinguishing these deletion syndrome genotypes from normal ones. Conclusion In cell samples of each deletion-carrying individual, an aberrant, reversed pattern of replication is delineated, namely, where a monoallelic gene replicates more synchronously than a biallelic gene. This inverted pattern, which appears to be non-deletion-specific, clearly distinguishes cells of deletion-carriers from normal ones. As such, it offers a potential epigenetic marker for suspecting a hidden microdeletion that is too small to be detected by conventional karyotyping methods.
Collapse
Affiliation(s)
- Josepha Yeshaya
- Raphael Recanati Genetic Institute, Rabin Medical Center Beilinson Campus, Petah-Tikva, Israel
| | - Itay Amir
- Raphael Recanati Genetic Institute, Rabin Medical Center Beilinson Campus, Petah-Tikva, Israel.,Department of Human Molecular Genetics & Biochemistry, Sackler School of Medicine, Tel-Aviv University, Tel-Aviv, Israel
| | - Ayelet Rimon
- Raphael Recanati Genetic Institute, Rabin Medical Center Beilinson Campus, Petah-Tikva, Israel
| | - Jane Freedman
- Raphael Recanati Genetic Institute, Rabin Medical Center Beilinson Campus, Petah-Tikva, Israel
| | - Mordechai Shohat
- Raphael Recanati Genetic Institute, Rabin Medical Center Beilinson Campus, Petah-Tikva, Israel.,Department of Human Molecular Genetics & Biochemistry, Sackler School of Medicine, Tel-Aviv University, Tel-Aviv, Israel.,Department of Pediatrics C, Schneider Children's Medical Center of Israel, Petah Tikva, Israel
| | - Lydia Avivi
- Department of Human Molecular Genetics & Biochemistry, Sackler School of Medicine, Tel-Aviv University, Tel-Aviv, Israel
| |
Collapse
|
120
|
Perry GH. The evolutionary significance of copy number variation in the human genome. Cytogenet Genome Res 2009; 123:283-7. [PMID: 19287166 DOI: 10.1159/000184719] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/22/2008] [Indexed: 12/27/2022] Open
Abstract
Copy number variation provides the raw material for gene family expansion and diversification, which is an important evolutionary force. Moreover, copy number variants (CNVs) can influence gene transcriptional and translational levels and have been associated with complex disease susceptibility. Therefore, natural selection may have affected at least some of the greater than one thousand CNVs thus far discovered among the genomes of phenotypically normal humans. While identifying and understanding particular instances of natural selection may shed light on important aspects of human evolutionary history, our ability to analyze CNVs in traditional population genetic frameworks has been limited. However, progress has been made by adapting some of these frameworks for use with copy number data. Moving forward, these efforts will be aided by non-human organism studies of the population genetics of copy number variation, and by more direct comparisons of within-species copy number variation and between-species copy number fixation.
Collapse
Affiliation(s)
- G H Perry
- Department of Human Genetics, University of Chicago, Chicago, IL, USA.
| |
Collapse
|
121
|
De S, Teichmann SA, Babu MM. The impact of genomic neighborhood on the evolution of human and chimpanzee transcriptome. Genome Res 2009; 19:785-94. [PMID: 19233772 DOI: 10.1101/gr.086165.108] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Divergence of gene expression can result in phenotypic variation, which contributes to the evolution of new species. Although the influence of trans- and cis-regulatory mutations is well known, the genome-wide impact of changes in genomic neighborhood of genes on expression divergence between species remains largely unexplored. Here, we compare the neighborhood of orthologous genes (within a window of 2 MB) in human and chimpanzee with the expression levels of their transcripts from several equivalent tissues and demonstrate that genes with altered neighborhood are more likely to undergo expression divergence than genes with conserved neighborhood. We observe the same trend when expression divergence data were analyzed from six different brain parts that are equivalent between human and chimpanzee. Additionally, we find enrichment for genes with altered neighborhood to be expressed in a tissue-specific manner in the human brain. These results suggest that expression divergence induced by this mechanism could have contributed to the phenotypic differences between human and chimpanzee. We propose that, in addition to other molecular mechanisms, change in genomic neighborhood is an important factor that drives transcriptome evolution.
Collapse
|
122
|
Prazeres da Costa O, González J, Ruiz A. Cloning and sequencing of the breakpoint regions of inversion 5g fixed in Drosophila buzzatii. Chromosoma 2009; 118:349-60. [PMID: 19198866 DOI: 10.1007/s00412-008-0201-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2008] [Revised: 12/22/2008] [Accepted: 12/22/2008] [Indexed: 10/21/2022]
Abstract
Chromosomal inversions are ubiquitous in Drosophila both as intraspecific polymorphisms and interspecific differences. Many gaps still remain in our understanding of the mechanisms that generate them. Previous work has shown that in Drosophila buzzatii, three polymorphic inversions were generated by ectopic recombination between copies of the transposon Galileo. In this study, we have characterized the breakpoint regions of inversion 5g, fixed in D. buzzatii and absent in Drosophila koepferae and other closely related species. A novel approach comprising four experimental steps was used. First, D. buzzatii BAC clones encompassing the breakpoints were identified and their ends sequenced. Then, breakpoint regions were mapped at high resolution in the Drosophila mojavensis genome sequence. Finally, breakpoint regions were isolated by polymerase chain reaction in D. buzzatii and D. koepferae and sequenced. Our aim was to shed light on the mechanism that generated inversion 5g and specifically to test for an implication of the transposon Galileo. No evidence implicates Galileo or other transposable elements in the origin of inversion 5g that was generated most likely by two independent breaks and non-homologous end-joining repair. Our results show that different inversion-generating mechanisms may coexist within the same lineage and suggest a hypothesis for the evolutionary time and mode of their operation.
Collapse
Affiliation(s)
- Olivia Prazeres da Costa
- Departament de Genètica i de Microbiologia, Facultat de Biociències, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| | | | | |
Collapse
|
123
|
Barnett GC, West CML, Dunning AM, Elliott RM, Coles CE, Pharoah PDP, Burnet NG. Normal tissue reactions to radiotherapy: towards tailoring treatment dose by genotype. Nat Rev Cancer 2009; 9:134-42. [PMID: 19148183 PMCID: PMC2670578 DOI: 10.1038/nrc2587] [Citation(s) in RCA: 515] [Impact Index Per Article: 34.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
A key challenge in radiotherapy is to maximize radiation doses to cancer cells while minimizing damage to surrounding healthy tissue. As severe toxicity in a minority of patients limits the doses that can be safely given to the majority, there is interest in developing a test to measure an individual's radiosensitivity before treatment. Variation in sensitivity to radiation is an inherited genetic trait and recent progress in genotyping raises the possibility of genome-wide studies to characterize genetic profiles that predict patient response to radiotherapy.
Collapse
Affiliation(s)
- Gillian C Barnett
- Department of Oncology, University of Cambridge, Oncology Centre, Addenbrooke's Hospital, Hills Road, Cambridge CB2 0QQ, UK.
| | | | | | | | | | | | | |
Collapse
|
124
|
Abstract
Two developments have sparked new directions in the genetics-to-genomics transition for research and medical applications: the advance of whole-genome assays by array or DNA sequencing technologies, and the discovery among human genomes of extensive submicroscopic genomic structural variation, including copy number variation. For health care to benefit from interpretation of genomic data, we need to know how these variants contribute to the phenotype of the individual. Research is revealing the spectrum, both in size and complexity, of structural genotypic variation, and its association with a broad range of human phenotypes. Genomic disorders associated with relatively large, recurrent contiguous variants have been recognized for some time, as have certain Mendelian traits associated with functional disruption of single genes by structural variation. More recent examples from phenotype- and genotype-driven studies demonstrate a greater level of complexity, with evidence of incremental dosage effects, gene interaction networks, buffering and modifiers, and position effects. Mechanisms underlying such variation are emerging to provide a handle on the bulk of human variation, which is associated with complex traits and adaptive potential. Interpreting genotypes for personalized health care and communicating knowledge to the individual will be significant challenges for genomics professionals.
Collapse
|
125
|
Kim PM, Lam HYK, Urban AE, Korbel JO, Affourtit J, Grubert F, Chen X, Weissman S, Snyder M, Gerstein MB. Analysis of copy number variants and segmental duplications in the human genome: Evidence for a change in the process of formation in recent evolutionary history. Genome Res 2008; 18:1865-74. [PMID: 18842824 DOI: 10.1101/gr.081422.108] [Citation(s) in RCA: 118] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Segmental duplications (SDs) are operationally defined as >1 kb stretches of duplicated DNA with high sequence identity. They arise from copy number variants (CNVs) fixed in the population. To investigate the formation of SDs and CNVs, we examine their large-scale patterns of co-occurrence with different repeats. Alu elements, a major class of genomic repeats, had previously been identified as prime drivers of SD formation. We also observe this association; however, we find that it sharply decreases for younger SDs. Continuing this trend, we find only weak associations of CNVs with Alus. Similarly, we find an association of SDs with processed pseudogenes, which is decreasing for younger SDs and absent entirely for CNVs. Next, we find that SDs are significantly co-localized with each other, resulting in a highly skewed "power-law" distribution and chromosomal hotspots. We also observe a significant association of CNVs with SDs, but find that an SD-mediated mechanism only accounts for some CNVs (<28%). Overall, our results imply that a shift in predominant formation mechanism occurred in recent history: approximately 40 million years ago, during the "Alu burst" in retrotransposition activity, non-allelic homologous recombination, first mediated by Alus and then the by newly formed CNVs themselves, was the main driver of genome rearrangements; however, its relative importance has decreased markedly since then, with proportionally more events now stemming from other repeats and from non-homologous end-joining. In addition to a coarse-grained analysis, we performed targeted sequencing of 67 CNVs and then analyzed a combined set of 270 CNVs (540 breakpoints) to verify our conclusions.
Collapse
Affiliation(s)
- Philip M Kim
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
126
|
Abstract
Recent years have seen great advances in generating and analyzing data to identify the genetic architecture of biological traits. Human disease has understandably received intense research focus, and the genes responsible for most Mendelian diseases have successfully been identified. However, the same advances have shown a consistent if less satisfying pattern, in which complex traits are affected by variation in large numbers of genes, most of which have individually minor or statistically elusive effects, leaving the bulk of genetic etiology unaccounted for. This pattern applies to diverse and unrelated traits, not just disease, in basically all species, and is consistent with evolutionary expectations, raising challenging questions about the best way to approach and understand biological complexity.
Collapse
Affiliation(s)
- Kenneth M Weiss
- Department of Anthropology and Integrated Biosciences Genetics Program, Pennsylvania State University, University Park, Pennsylvania 16802, USA.
| |
Collapse
|