1
|
Fitzgerald T, Birney E. CNest: A novel copy number association discovery method uncovers 862 new associations from 200,629 whole-exome sequence datasets in the UK Biobank. CELL GENOMICS 2022; 2:100167. [PMID: 36779085 PMCID: PMC9903682 DOI: 10.1016/j.xgen.2022.100167] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/21/2021] [Revised: 04/11/2022] [Accepted: 07/13/2022] [Indexed: 10/15/2022]
Abstract
Copy number variation (CNV) is known to influence human traits, having a rich history of research into common and rare genetic disease, and although CNV is accepted as an important class of genomic variation, progress on copy-number-based genome-wide association studies (GWASs) from next-generation sequencing (NGS) data has been limited. Here we present a novel method for large-scale copy number analysis from NGS data generating robust copy number estimates and allowing copy number GWASs (CN-GWASs) to be performed genome-wide in discovery mode. We provide a detailed analysis in the UK Biobank resource and a specifically designed software package. We use these methods to perform CN-GWAS analysis across 78 human traits, discovering over 800 genetic associations that are likely to contribute strongly to trait distributions. Finally, we compare CNV and SNP association signals across the same traits and samples, defining specific CNV association classes.
Collapse
Affiliation(s)
- Tomas Fitzgerald
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridge CB10 1SD, UK
| | - Ewan Birney
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridge CB10 1SD, UK
| |
Collapse
|
2
|
Layton KKS, Bradbury IR. Harnessing the power of multi-omics data for predicting climate change response. J Anim Ecol 2021; 91:1064-1072. [PMID: 34679193 DOI: 10.1111/1365-2656.13619] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Accepted: 10/11/2021] [Indexed: 01/19/2023]
Abstract
Predicting how species will respond to future climate change is of central importance in the midst of the global biodiversity crisis, and recent work has demonstrated the utility of population genomics for improving these predictions. Here, we suggest a broadening of the approach to include other types of genomic variants that play an important role in adaptation, like structural (e.g. copy number variants) and epigenetic variants (e.g. DNA methylation). These data could provide additional power for forecasting response, especially in weakly structured or panmictic species. Incorporating structural and epigenetic variation into estimates of climate change vulnerability, or maladaptation, may not only improve prediction power but also provide insight into the molecular mechanisms underpinning species' response to climate change.
Collapse
Affiliation(s)
- Kara K S Layton
- School of Biological Sciences, University of Aberdeen, Aberdeen, UK
| | - Ian R Bradbury
- Northwest Atlantic Fisheries Centre, Fisheries and Oceans Canada, St. John's, Canada
| |
Collapse
|
3
|
Berdan EL, Blanckaert A, Slotte T, Suh A, Westram AM, Fragata I. Unboxing mutations: Connecting mutation types with evolutionary consequences. Mol Ecol 2021; 30:2710-2723. [PMID: 33955064 DOI: 10.1111/mec.15936] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Revised: 03/30/2021] [Accepted: 04/20/2021] [Indexed: 01/09/2023]
Abstract
A key step in understanding the genetic basis of different evolutionary outcomes (e.g., adaptation) is to determine the roles played by different mutation types (e.g., SNPs, translocations and inversions). To do this we must simultaneously consider different mutation types in an evolutionary framework. Here, we propose a research framework that directly utilizes the most important characteristics of mutations, their population genetic effects, to determine their relative evolutionary significance in a given scenario. We review known population genetic effects of different mutation types and show how these may be connected to different evolutionary outcomes. We provide examples of how to implement this framework and pinpoint areas where more data, theory and synthesis are needed. Linking experimental and theoretical approaches to examine different mutation types simultaneously is a critical step towards understanding their evolutionary significance.
Collapse
Affiliation(s)
- Emma L Berdan
- Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
| | | | - Tanja Slotte
- Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
| | - Alexander Suh
- School of Biological Sciences - Organisms and the Environment, University of East Anglia, Norwich, UK.,Department of Organismal Biology - Systematic Biology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Anja M Westram
- IST Austria, Klosterneuburg, Austria.,Faculty of Biosciences and Aquaculture, Nord University, Bodø, Norway
| | - Inês Fragata
- cE3c - Centre for Ecology, Evolution and Environmental Changes, Faculdade de Ciências, Universidade de Lisboa, Lisboa, Portugal
| |
Collapse
|
4
|
Finke K, Kourakos M, Brown G, Dang HT, Tan SJS, Simons YB, Ramdas S, Schäffer AA, Kember RL, Bućan M, Mathieson S. Ancestral haplotype reconstruction in endogamous populations using identity-by-descent. PLoS Comput Biol 2021; 17:e1008638. [PMID: 33635861 PMCID: PMC7946327 DOI: 10.1371/journal.pcbi.1008638] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2020] [Revised: 03/10/2021] [Accepted: 12/15/2020] [Indexed: 12/24/2022] Open
Abstract
In this work we develop a novel algorithm for reconstructing the genomes of ancestral individuals, given genotype or sequence data from contemporary individuals and an extended pedigree of family relationships. A pedigree with complete genomes for every individual enables the study of allele frequency dynamics and haplotype diversity across generations, including deviations from neutrality such as transmission distortion. When studying heritable diseases, ancestral haplotypes can be used to augment genome-wide association studies and track disease inheritance patterns. The building blocks of our reconstruction algorithm are segments of Identity-By-Descent (IBD) shared between two or more genotyped individuals. The method alternates between identifying a source for each IBD segment and assembling IBD segments placed within each ancestral individual. Unlike previous approaches, our method is able to accommodate complex pedigree structures with hundreds of individuals genotyped at millions of SNPs. We apply our method to an Old Order Amish pedigree from Lancaster, Pennsylvania, whose founders came to North America from Europe during the early 18th century. The pedigree includes 1338 individuals from the past 12 generations, 394 with genotype data. The motivation for reconstruction is to understand the genetic basis of diseases segregating in the family through tracking haplotype transmission over time. Using our algorithm thread, we are able to reconstruct an average of 224 ancestral individuals per chromosome. For these ancestral individuals, on average we reconstruct 79% of their haplotypes. We also identify a region on chromosome 16 that is difficult to reconstruct—we find that this region harbors a short Amish-specific copy number variation and the gene HYDIN. thread was developed for endogamous populations, but can be applied to any extensive pedigree with the recent generations genotyped. We anticipate that this type of practical ancestral reconstruction will become more common and necessary to understand rare and complex heritable diseases in extended families. When analyzing complex heritable traits, genomic data from many generations of an extended family increases the amount of information available for statistical inference. However, typically only genomic data from the recent generations of a pedigree are available, as ancestral individuals are deceased. In this work we present an algorithm, called thread, for reconstructing the genomes of ancestral individuals, given a complex pedigree and genomic data from the recent generations. Previous approaches have not been able to accommodate large datasets (both in terms of sites and individuals), made simplifying assumptions about pedigree structure, or did not tie reconstructed sequences back to specific individuals. We apply thread to a complex Old Order Amish pedigree of 1338 individuals, 394 with genotype data.
Collapse
Affiliation(s)
- Kelly Finke
- Department of Computer Science, Swarthmore College, Swarthmore, Pennsylvania, United States of America
- Department of Biology, Swarthmore College, Swarthmore, Pennsylvania, United States of America
| | - Michael Kourakos
- Department of Computer Science, Swarthmore College, Swarthmore, Pennsylvania, United States of America
| | - Gabriela Brown
- Department of Computer Science, Swarthmore College, Swarthmore, Pennsylvania, United States of America
| | - Huyen Trang Dang
- Department of Computer Science, Bryn Mawr College, Bryn Mawr, Pennsylvania, United States of America
| | - Shi Jie Samuel Tan
- Department of Computer Science, Haverford College, Haverford, Pennsylvania, United States of America
| | - Yuval B. Simons
- Department of Genetics, Stanford University, Stanford, California, United States of America
| | - Shweta Ramdas
- Department of Genetics, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Alejandro A. Schäffer
- Cancer Data Science Laboratory, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Rachel L. Kember
- Department of Psychiatry, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Maja Bućan
- Department of Genetics, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Sara Mathieson
- Department of Computer Science, Haverford College, Haverford, Pennsylvania, United States of America
- * E-mail:
| |
Collapse
|
5
|
An Evolutionary Perspective on the Impact of Genomic Copy Number Variation on Human Health. J Mol Evol 2019; 88:104-119. [PMID: 31522275 DOI: 10.1007/s00239-019-09911-6] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2019] [Accepted: 08/27/2019] [Indexed: 02/06/2023]
Abstract
Copy number variants (CNVs), deletions and duplications of segments of DNA, account for at least five times more variable base pairs in humans than single-nucleotide variants. Several common CNVs were shown to change coding and regulatory sequences and thus dramatically affect adaptive phenotypes involving immunity, perception, metabolism, skin structure, among others. Some of these CNVs were also associated with susceptibility to cancer, infection, and metabolic disorders. These observations raise the possibility that CNVs are a primary contributor to human phenotypic variation and consequently evolve under selective pressures. Indeed, locus-specific haplotype-level analyses revealed signatures of natural selection on several CNVs. However, more traditional tests of selection which are often applied to single-nucleotide variation often have diminished statistical power when applied to CNVs because they often do not show strong linkage disequilibrium with nearby variants. Recombination-based formation mechanisms of CNVs lead to frequent recurrence and gene conversion events, breaking the linkage disequilibrium involving CNVs. Similar methodological challenges also prevent routine genome-wide association studies to adequately investigate the impact of CNVs on heritable human disease. Thus, we argue that the full relevance of CNVs to human health and evolution is yet to be elucidated. We further argue that a holistic investigation of formation mechanisms within an evolutionary framework would provide a powerful framework to understand the functional and biomedical impact of CNVs. In this paper, we review several cases where studies reveal diverse evolutionary histories and unexpected functional consequences of CNVs. We hope that this review will encourage further work on CNVs by both evolutionary and medical geneticists.
Collapse
|
6
|
Kaseniit KE, Hogan GJ, D'Auria KM, Haverty C, Muzzey D. Strategies to minimize false positives and interpret novel microdeletions based on maternal copy-number variants in 87,000 noninvasive prenatal screens. BMC Med Genomics 2018; 11:90. [PMID: 30340588 PMCID: PMC6194617 DOI: 10.1186/s12920-018-0410-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2018] [Accepted: 10/01/2018] [Indexed: 12/29/2022] Open
Abstract
BACKGROUND Noninvasive prenatal screening (NIPS) of common aneuploidies using cell-free DNA from maternal plasma is part of routine prenatal care and is widely used in both high-risk and low-risk patient populations. High specificity is needed for clinically acceptable positive predictive values. Maternal copy-number variants (mCNVs) have been reported as a source of false-positive aneuploidy results that compromises specificity. METHODS We surveyed the mCNV landscape in 87,255 patients undergoing NIPS. We evaluated both previously reported and novel algorithmic strategies for mitigating the effects of mCNVs on the screen's specificity. Further, we analyzed the frequency, length, and positional distribution of CNVs in our large dataset to investigate the curation of novel fetal microdeletions, which can be identified by NIPS but are challenging to interpret clinically. RESULTS mCNVs are common, with 65% of expecting mothers harboring an autosomal CNV spanning more than 200 kb, underscoring the need for robust NIPS analysis strategies. By analyzing empirical and simulated data, we found that general, outlier-robust strategies reduce the rate of mCNV-caused false positives but not as appreciably as algorithms specifically designed to account for mCNVs. We demonstrate that large-scale tabulation of CNVs identified via routine NIPS could be clinically useful: together with the gene density of a putative microdeletion region, we show that the region's relative tolerance to duplications versus deletions may aid the interpretation of microdeletion pathogenicity. CONCLUSIONS Our study thoroughly investigates a common source of NIPS false positives and demonstrates how to bypass its corrupting effects. Our findings offer insight into the interpretation of NIPS results and inform the design of NIPS algorithms suitable for use in screening in the general obstetric population.
Collapse
Affiliation(s)
- Kristjan Eerik Kaseniit
- Myriad Women's Health (previously Counsyl), 180 Kimball Way, South San Francisco, CA, 94080, USA
| | - Gregory J Hogan
- Myriad Women's Health (previously Counsyl), 180 Kimball Way, South San Francisco, CA, 94080, USA
| | - Kevin M D'Auria
- Myriad Women's Health (previously Counsyl), 180 Kimball Way, South San Francisco, CA, 94080, USA
| | - Carrie Haverty
- Myriad Women's Health (previously Counsyl), 180 Kimball Way, South San Francisco, CA, 94080, USA
| | - Dale Muzzey
- Myriad Women's Health (previously Counsyl), 180 Kimball Way, South San Francisco, CA, 94080, USA.
| |
Collapse
|
7
|
Hu XS, Yeh FC, Hu Y, Deng LT, Ennos RA, Chen X. High mutation rates explain low population genetic divergence at copy-number-variable loci in Homo sapiens. Sci Rep 2017; 7:43178. [PMID: 28225073 PMCID: PMC5320550 DOI: 10.1038/srep43178] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2016] [Accepted: 01/19/2017] [Indexed: 11/09/2022] Open
Abstract
Copy-number-variable (CNV) loci differ from single nucleotide polymorphic (SNP) sites in size, mutation rate, and mechanisms of maintenance in natural populations. It is therefore hypothesized that population genetic divergence at CNV loci will differ from that found at SNP sites. Here, we test this hypothesis by analysing 856 CNV loci from the genomes of 1184 healthy individuals from 11 HapMap populations with a wide range of ancestry. The results show that population genetic divergence at the CNV loci is generally more than three times lower than at genome-wide SNP sites. Populations generally exhibit very small genetic divergence (Gst = 0.05 ± 0.049). The smallest divergence is among African populations (Gst = 0.0081 ± 0.0025), with increased divergence among non-African populations (Gst = 0.0217 ± 0.0109) and then among African and non-African populations (Gst = 0.0324 ± 0.0064). Genetic diversity is high in African populations (~0.13), low in Asian populations (~0.11), and intermediate in the remaining 11 populations. Few significant linkage disequilibria (LDs) occur between the genome-wide CNV loci. Patterns of gametic and zygotic LDs indicate the absence of epistasis among CNV loci. Mutation rate is about twice as large as the migration rate in the non-African populations, suggesting that the high mutation rates play dominant roles in producing the low population genetic divergence at CNV loci.
Collapse
Affiliation(s)
- Xin-Sheng Hu
- Guangdong Key Laboratory for Innovative Development and Utilization of Forest Plant Germplasm, South China Agricultural University, Guangdong 510642, China.,College of Forestry and Landscape Architecture, South China Agricultural University, Guangdong 510642, China
| | - Francis C Yeh
- Department of Renewable Resources, 751 General Service Building, University of Alberta, Edmonton, AB T6G 2H1, Canada
| | - Yang Hu
- Department of Computing Science, University of Alberta, Edmonton, AB T6G 2S4, Canada
| | - Li-Ting Deng
- Guangdong Key Laboratory for Innovative Development and Utilization of Forest Plant Germplasm, South China Agricultural University, Guangdong 510642, China.,College of Forestry and Landscape Architecture, South China Agricultural University, Guangdong 510642, China
| | - Richard A Ennos
- Institute of Evolutionary Biology, Ashworth Laboratories, School of Biological Sciences, University of Edinburgh, Edinburgh EH 9 3JT, United Kingdom
| | - Xiaoyang Chen
- Guangdong Key Laboratory for Innovative Development and Utilization of Forest Plant Germplasm, South China Agricultural University, Guangdong 510642, China.,College of Forestry and Landscape Architecture, South China Agricultural University, Guangdong 510642, China
| |
Collapse
|
8
|
Hu XS, Hu Y, Chen X. Testing neutrality at copy-number-variable loci under the finite-allele and finite-site models. Theor Popul Biol 2016; 112:1-13. [PMID: 27423854 DOI: 10.1016/j.tpb.2016.07.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2016] [Revised: 07/05/2016] [Accepted: 07/06/2016] [Indexed: 02/01/2023]
Abstract
Copy-number variation (CNV) is an important form of DNA structural variation because a certain proportion of genomes in many eukaryotic species can contribute to such variations. Owing to the differences between CNVs and single nucleotide polymorphisms (SNPs) in size, mutation rate and maintaining mechanism, it is more realistic to characterize CNV evolution under the finite-allele and finite-site models. Here, we propose a method to test multiple CNVs neutrality under the finite-allele and finite-site models and the assumption of mutation-drift process. The statistical property of the method is evaluated through Monte Carlo simulations under the effects of the sample size, the scaled mutation rates, the number of CNVs, the population demographic change, and selection. Different from Tajima's D test, a bootstrap or a permutation approach is suggested to conduct a neutrality test. Application of this method is illustrated using the diploid CNV genotypes measured in discrete copy numbers in 11 HapMap phase III populations. The results show that the mutation-drift process can explain the variation of genome-wide CNVs among 1184 individuals (856 CNVs, ∼0.02Mb on average in size), irrespective of the historical demographic changes. Patterns from allele-frequency-spectrum analysis also support the hypothesis of neutral CNVs. Our results suggest that most human chromosomal changes in healthy individuals via unbalanced rearrangements of the segments with certain sizes are neutral.
Collapse
Affiliation(s)
- Xin-Sheng Hu
- Guangdong Key Laboratory for Innovative Development and Utilization of Forest Plant Germplasm, South China Agricultural University, Guangdong 510642, China; State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, South China Agricultural University, Guangdong 510642, China; Department of Plant Sciences, University of Oxford, South Parks Road, Oxford, OX13RB, United Kingdom.
| | - Yang Hu
- Department of Computing Science, University of Alberta, Edmonton, AB T6G 2S4, Canada
| | - Xiaoyang Chen
- Guangdong Key Laboratory for Innovative Development and Utilization of Forest Plant Germplasm, South China Agricultural University, Guangdong 510642, China; State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, South China Agricultural University, Guangdong 510642, China.
| |
Collapse
|
9
|
Global patterns of large copy number variations in the human genome reveal complexity in chromosome organization. Genet Res (Camb) 2015; 97:e18. [PMID: 26390810 DOI: 10.1017/s0016672315000191] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
Abstract
Global patterns of copy number variations (CNVs) in chromosomes are required to understand the dynamics of genome organization and complexity. For this study, analysis was performed using the Affymetrix Genome-Wide Human SNP Array 6.0 chip and CytoScan High-Density arrays. We identified a total of 44 109 CNVs from 1715 genomes with a mean of 25 CNVs in an individual, which established the first drafts of population-specific CNV maps providing a rationale for prioritizing chromosomal regions. About 19 905 ancient CNVs were identified across all chromosomes and populations at varying frequencies. CNV count, and sometimes CNV size, contributed to the bulk CNV size of the chromosome. Population specific lengthening and shortening of chromosomal length was observed. Sex bias for CNV presence was largely dependent on ethnicity. Lower CNV inheritance rate was observed for India, compared to YRI and CEU. A total of 33 candidate CNV hotspots from 5382 copy number (CN) variable region (CNVR) clusters were identified. Population specific CNV distribution patterns in p and q arms disturbed the assumption that CNV counts in the p arm are less common compared to long arms, and the CNV occurrence and distribution in chromosomes is length independent. This study unraveled the force of independent evolutionary dynamics on genome organization and complexity across chromosomes and populations.
Collapse
|
10
|
Chen L, Zhou W, Zhang L, Zhang F. Genome architecture and its roles in human copy number variation. Genomics Inform 2014; 12:136-44. [PMID: 25705150 PMCID: PMC4330246 DOI: 10.5808/gi.2014.12.4.136] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2014] [Revised: 11/12/2014] [Accepted: 11/12/2014] [Indexed: 02/06/2023] Open
Abstract
Besides single-nucleotide variants in the human genome, large-scale genomic variants, such as copy number variations (CNVs), are being increasingly discovered as a genetic source of human diversity and the pathogenic factors of diseases. Recent experimental findings have shed light on the links between different genome architectures and CNV mutagenesis. In this review, we summarize various genomic features and discuss their contributions to CNV formation. Genomic repeats, including both low-copy and high-copy repeats, play important roles in CNV instability, which was initially known as DNA recombination events. Furthermore, it has been found that human genomic repeats can also induce DNA replication errors and consequently result in CNV mutations. Some recent studies showed that DNA replication timing, which reflects the high-order information of genomic organization, is involved in human CNV mutations. Our review highlights that genome architecture, from DNA sequence to high-order genomic organization, is an important molecular factor in CNV mutagenesis and human genomic instability.
Collapse
Affiliation(s)
- Lu Chen
- School of Life Sciences, Fudan University, Shanghai 200438, China
| | - Weichen Zhou
- School of Life Sciences, Fudan University, Shanghai 200438, China
| | - Ling Zhang
- School of Life Sciences, Fudan University, Shanghai 200438, China
| | - Feng Zhang
- School of Life Sciences, Fudan University, Shanghai 200438, China. ; Collaborative Innovation Center for Genetics and Development, Fudan University, Shanghai 200438, China
| |
Collapse
|
11
|
Association of Copy Number Variations in Autism Spectrum Disorders: A Systematic Review. ACTA ACUST UNITED AC 2014. [DOI: 10.1155/2014/713109] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Autism spectrum disorders (ASDs) are characterized by language impairments, social deficits, and repetitive behaviors. The onset of symptoms occurs by the age of 3 and shows a lifelong persistence. Genetics plays a major role in the etiology of ASD. Except genetics, several potential risk factors (environmental factors and epigenetics) may contribute to ASD. Copy number variations (CNVs) are the most widespread structural variations in the human genome. These variations can alter the genome structure either by deletion or by duplication. CNVs can be de novo or inherited. Chromosomal rearrangements have been detected in 5–10% of the patients with ASD and recently copy number changes ranging from a few kilobases (kb) to several megabases (Mb) in size have been reported. Recent data have also revealed that submicroscopic CNVs can have a role in ASD, and de novo CNVs seem to be a more common risk factor in sporadic compared with inherited forms of ASD. CNVs are being implicated as a contributor to the pathophysiology of complex neurodevelopmental disorders and they can affect a wide range of human phenotypes including mental retardation (MR), autism, neuropsychiatric disorders, and susceptibility to other complex traits such as HIV, Crohn’s disease, and psoriasis. This review emphasizes the major CNVs reported to date in ASD.
Collapse
|
12
|
Chen L, Zhou W, Zhang C, Lupski JR, Jin L, Zhang F. CNV instability associated with DNA replication dynamics: evidence for replicative mechanisms in CNV mutagenesis. Hum Mol Genet 2014; 24:1574-83. [PMID: 25398944 DOI: 10.1093/hmg/ddu572] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Copy number variation (CNV) in the human genome is of vital importance to human health and evolution of our species. However, much of the molecular basis of CNV mutagenesis remains to be elucidated. Considering the DNA replication model of 'fork stalling and template switching' for CNV formation, we hypothesized that replication fork progression could be important for CNV mutagenesis. However, molecular assays of replication fork progression at the genome level are technically challenging. Instead, we conducted an estimation of DNA replication dynamics, as the statistic R, using the readily available data of replication timing. Small R-values can reflect 'slowed' replication, which could result from less fork initiation, reduced fork speed or fork barriers. We generated genome-wide profiles of R in the genomes of human, mouse and Drosophila. Intriguingly, the CNV breakpoints in all three genomes showed significantly biased distributions toward the genomic regions with small R-values, suggesting potential replication stress-induced CNV instability. Notably, among the human CNVs with distinct breakpoint junction characteristics, the homology-mediated and VNTR-mediated CNVs contribute the most to the correlation between CNV instability and the statistic R, consistent with the recent findings in the C. elegans and yeast genomes of repeat-induced DNA replication error and consequent CNV formation. The statistic R may reflect both replication stress and the effect of local genome architecture on fork progression. Our concordant observations suggest an important role for DNA replicative mechanisms in CNV mutagenesis and genome instability.
Collapse
Affiliation(s)
- Lu Chen
- State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology and
| | - Weichen Zhou
- State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology and
| | - Cheng Zhang
- State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology and
| | - James R Lupski
- Department of Molecular and Human Genetics and Department of Pediatrics, Baylor College of Medicine, Houston TX 77030, USA Texas Children's Hospital, Houston, TX 77030, USA
| | - Li Jin
- State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology and Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, China
| | - Feng Zhang
- State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology and Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, China,
| |
Collapse
|
13
|
Peng Z, Zhou W, Fu W, Du R, Jin L, Zhang F. Correlation between frequency of non-allelic homologous recombination and homology properties: evidence from homology-mediated CNV mutations in the human genome. Hum Mol Genet 2014; 24:1225-33. [PMID: 25324539 DOI: 10.1093/hmg/ddu533] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Non-allelic homologous recombination (NAHR) is one of the key mechanisms of DNA rearrangement. NAHR occurring between direct homologous repeats can generate genomic copy number variation (CNV) and make significant contributions to both genome evolution and human diseases such as cancer. Intriguingly, previous observations on the rare CNVs at certain genomic disorder loci suggested that NAHR frequency could be dependent on homology properties. However, such a correlation remains unclear at the other NAHR-mediated CNV loci, especially the common CNVs in human populations. Different from the rare CNVs associated with genomic disorders, it is challenging to identify de novo NAHR events at common CNV loci. Therefore, our previously proposed statistic M was employed in estimating relative mutation rate for the NAHR-mediated CNVs in human populations. By utilizing generalized regression neural network and principal component analysis in studying 4330 CNVs ascertained in 3 HapMap populations, we identified the CNVs mediated by NAHR between paired segmental duplications (SDs) and further revealed the correlations between SD properties and NAHR probability. SD length and inter-SD distance were shown to make major contributions to the occurrence of NAHR, whereas chromosomal position and sequence similarity of paired SDs are also involved in NAHR. An integrated effect of SD properties on NAHR frequency was revealed for the common CNVs in human populations. These observations can be well explained by ectopic synapsis in NAHR together with our proposed model of chromosomal compression/extension/looping (CCEL) for homology mis-pairing. Our findings showed the important roles of SDs in NAHR and human genomic evolution.
Collapse
Affiliation(s)
- Zhen Peng
- State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology and Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Weichen Zhou
- State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology and
| | - Wenqing Fu
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Renqian Du
- State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology and
| | - Li Jin
- State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology and Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, China
| | - Feng Zhang
- State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology and Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, China,
| |
Collapse
|
14
|
Human gene copy number variation and infectious disease. Hum Genet 2014; 133:1217-33. [PMID: 25110110 DOI: 10.1007/s00439-014-1457-x] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2014] [Accepted: 05/20/2014] [Indexed: 01/05/2023]
Abstract
Variability in the susceptibility to infectious disease and its clinical manifestation can be determined by variation in the environment and by genetic variation in the pathogen and the host. Despite several successes based on candidate gene studies, defining the host variation affecting infectious disease has not been as successful as for other multifactorial diseases. Both single nucleotide variation and copy number variation (CNV) of the host contribute to the host's susceptibility to infectious disease. In this review we focus on CNV, particularly on complex multiallelic CNV that is often not well characterised either directly by hybridisation methods or indirectly by analysis of genotypes and flanking single nucleotide variants. We summarise the well-known examples, such as α-globin deletion and susceptibility to severe malaria, as well as more recent controversies, such as the extensive CNV of the chemokine gene CCL3L1 and HIV infection. We discuss the potential biological mechanisms that could underly any genetic association and reflect on the extensive complexity and functional variation generated by a combination of CNV and sequence variation, as illustrated by the Fc gamma receptor genes FCGR3A, FCGR3B and FCGR2C. We also highlight some understudied areas that might prove fruitful areas for further research.
Collapse
|
15
|
Veerappa AM, N MM, Vishweswaraiah S, Lingaiah K, Suresh RV, Nachappa SA, Prashali N, Yadav SN, Srikanta MA, Manjegowda DS, Seshachalam KB, Ramachandra NB. Copy number variations burden on miRNA genes reveals layers of complexities involved in the regulation of pathways and phenotypic expression. PLoS One 2014; 9:e90391. [PMID: 24587348 PMCID: PMC3938728 DOI: 10.1371/journal.pone.0090391] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2013] [Accepted: 01/28/2014] [Indexed: 12/21/2022] Open
Abstract
MicroRNAs are involved in post-transcriptional down-regulation of gene expression. Variations in miRNA genes can severely affect downstream-regulated genes and their pathways. However, population-specific burden of CNVs on miRNA genes and the complexities created towards the phenotype is not known. From a total of 44109 CNVs investigated from 1715 individuals across 12 populations using high-throughput arrays, 4007 miRNA-CNVs (∼ 9%) consisting 6542 (∼ 5%) miRNA genes with a total of 333 (∼ 5%) singleton miRNA genes were identified. We found miRNA-CNVs across the genomes of individuals showing multiple hits in many targets, co-regulated under the same pathway. This study proposes four mechanisms unraveling the many complexities in miRNA genes, targets and co-regulated miRNA genes towards establishment of phenotypic diversity.
Collapse
Affiliation(s)
- Avinash M. Veerappa
- Genetics and Genomics Lab, Department of Studies in Zoology, University of Mysore, Manasagangotri, Mysore, Karnataka, India
| | - Megha Murthy N
- Genetics and Genomics Lab, Department of Studies in Zoology, University of Mysore, Manasagangotri, Mysore, Karnataka, India
| | - Sangeetha Vishweswaraiah
- Genetics and Genomics Lab, Department of Studies in Zoology, University of Mysore, Manasagangotri, Mysore, Karnataka, India
| | - Kusuma Lingaiah
- Genetics and Genomics Lab, Department of Studies in Zoology, University of Mysore, Manasagangotri, Mysore, Karnataka, India
| | - Raviraj V. Suresh
- Genetics and Genomics Lab, Department of Studies in Zoology, University of Mysore, Manasagangotri, Mysore, Karnataka, India
| | - Somanna Ajjamada Nachappa
- Genetics and Genomics Lab, Department of Studies in Zoology, University of Mysore, Manasagangotri, Mysore, Karnataka, India
| | - Nelchi Prashali
- Genetics and Genomics Lab, Department of Studies in Zoology, University of Mysore, Manasagangotri, Mysore, Karnataka, India
| | - Sangeetha Nuggehalli Yadav
- Genetics and Genomics Lab, Department of Studies in Zoology, University of Mysore, Manasagangotri, Mysore, Karnataka, India
| | - Manjula Arsikere Srikanta
- Genetics and Genomics Lab, Department of Studies in Zoology, University of Mysore, Manasagangotri, Mysore, Karnataka, India
| | - Dinesh S. Manjegowda
- Department of Anatomy, Yenepoya Medical College, Yenepoya University, Mangalore, Karnataka, India
- Nitte University Centre for Science Education & Research, K S Hegde Medical Academy, Nitte University, Deralakatte, Mangalore, Karnataka, India
| | | | - Nallur B. Ramachandra
- Genetics and Genomics Lab, Department of Studies in Zoology, University of Mysore, Manasagangotri, Mysore, Karnataka, India
| |
Collapse
|
16
|
Yang R, Chen B, Pfütze K, Buch S, Steinke V, Holinski-Feder E, Stöcker S, von Schönfels W, Becker T, Schackert HK, Royer-Pokora B, Kloor M, Schmiegel WH, Büttner R, Engel C, Lascorz Puertolas J, Försti A, Kunkel N, Bugert P, Schreiber S, Krawczak M, Schafmayer C, Propping P, Hampe J, Hemminki K, Burwinkel B. Genome-wide analysis associates familial colorectal cancer with increases in copy number variations and a rare structural variation at 12p12.3. Carcinogenesis 2013; 35:315-23. [PMID: 24127187 DOI: 10.1093/carcin/bgt344] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Colorectal cancer (CRC) is one of the most common cancer worldwide. However, a large number of genetic risk factors involved in CRC have not been understood. Copy number variations (CNVs) might partly contribute to the 'missing heritability' of CRC. An increased overall burden of CNV has been identified in several complex diseases, whereas the association between the overall CNV burden and CRC risk is largely unknown. We performed a genome-wide investigation of CNVs on genomic DNA from 384 familial CRC cases and 1285 healthy controls by the Affymetrix 6.0 array. An increase of overall CNV burden was observed in familial CRC patients compared with healthy controls, especially for CNVs larger than 50kb (case/control ratio = 1.66, P = 0.025). In addition, we discovered for the first time a novel structural variation at 12p12.3 and determined the breakpoints by strategic PCR and sequencing. This 12p12.3 structural variation was found in four of 2862 CRC cases but not in 6243 healthy controls (P = 0.0098). RERGL gene (RERG/RAS-like), the only gene influenced by the 12p12.3 structural variation, sharing most of the conserved regions with its close family member RERG tumor suppressor gene (RAS-like, estrogen-regulated, growth inhibitor), might be a novel CRC-related gene. In conclusion, this is the first study to reveal the contribution of the overall burden of CNVs to familial CRC risk and identify a novel rare structural variation at 12p12.3 containing RERGL gene to be associated with CRC.
Collapse
Affiliation(s)
- Rongxi Yang
- Molecular Epidemiology, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Zheng C, Miao X, Li Y, Huang Y, Ruan J, Ma X, Wang L, Wu CI, Cai J. Determination of genomic copy number alteration emphasizing a restriction site-based strategy of genome re-sequencing. Bioinformatics 2013; 29:2813-21. [DOI: 10.1093/bioinformatics/btt481] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
|
18
|
Ezawa K, Landan G, Graur D. Detecting negative selection on recurrent mutations using gene genealogy. BMC Genet 2013; 14:37. [PMID: 23651527 PMCID: PMC3661350 DOI: 10.1186/1471-2156-14-37] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2012] [Accepted: 04/13/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Whether or not a mutant allele in a population is under selection is an important issue in population genetics, and various neutrality tests have been invented so far to detect selection. However, detection of negative selection has been notoriously difficult, partly because negatively selected alleles are usually rare in the population and have little impact on either population dynamics or the shape of the gene genealogy. Recently, through studies of genetic disorders and genome-wide analyses, many structural variations were shown to occur recurrently in the population. Such "recurrent mutations" might be revealed as deleterious by exploiting the signal of negative selection in the gene genealogy enhanced by their recurrence. RESULTS Motivated by the above idea, we devised two new test statistics. One is the total number of mutants at a recurrently mutating locus among sampled sequences, which is tested conditionally on the number of forward mutations mapped on the sequence genealogy. The other is the size of the most common class of identical-by-descent mutants in the sample, again tested conditionally on the number of forward mutations mapped on the sequence genealogy. To examine the performance of these two tests, we simulated recurrently mutated loci each flanked by sites with neutral single nucleotide polymorphisms (SNPs), with no recombination. Using neutral recurrent mutations as null models, we attempted to detect deleterious recurrent mutations. Our analyses demonstrated high powers of our new tests under constant population size, as well as their moderate power to detect selection in expanding populations. We also devised a new maximum parsimony algorithm that, given the states of the sampled sequences at a recurrently mutating locus and an incompletely resolved genealogy, enumerates mutation histories with a minimum number of mutations while partially resolving genealogical relationships when necessary. CONCLUSIONS With their considerably high powers to detect negative selection, our new neutrality tests may open new venues for dealing with the population genetics of recurrent mutations as well as help identifying some types of genetic disorders that may have escaped identification by currently existing methods.
Collapse
Affiliation(s)
- Kiyoshi Ezawa
- Department of Biology and Biochemistry, University of Houston, Houston, TX 77204-5001, USA
- Present address: Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, Iizuka, Fukuoka 820-8502, Japan
| | - Giddy Landan
- Department of Biology and Biochemistry, University of Houston, Houston, TX 77204-5001, USA
- Present address: Institute of Genomic Microbiology, Heinrich-Heine University Düsseldorf, Universitätsstr. 1, Düsseldorf 40225, Germany
| | - Dan Graur
- Department of Biology and Biochemistry, University of Houston, Houston, TX 77204-5001, USA
| |
Collapse
|
19
|
Chang CL, Semyonov J, Cheng PJ, Huang SY, Park JI, Tsai HJ, Lin CY, Grützner F, Soong YK, Cai JJ, Hsu SYT. Widespread divergence of the CEACAM/PSG genes in vertebrates and humans suggests sensitivity to selection. PLoS One 2013; 8:e61701. [PMID: 23613906 PMCID: PMC3628338 DOI: 10.1371/journal.pone.0061701] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2012] [Accepted: 03/17/2013] [Indexed: 11/30/2022] Open
Abstract
In mammals, carcinoembryonic antigen cell adhesion molecules (CEACAMs) and pregnancy-specific glycoproteins (PSGs) play important roles in the regulation of pathogen transmission, tumorigenesis, insulin signaling turnover, and fetal–maternal interactions. However, how these genes evolved and to what extent they diverged in humans remain to be investigated specifically. Based on syntenic mapping of chordate genomes, we reveal that diverging homologs with a prototypic CEACAM architecture–including an extracellular domain with immunoglobulin variable and constant domain-like regions, and an intracellular domain containing ITAM motif–are present from cartilaginous fish to humans, but are absent in sea lamprey, cephalochordate or urochordate. Interestingly, the CEACAM/PSG gene inventory underwent radical divergence in various vertebrate lineages: from zero in avian species to dozens in therian mammals. In addition, analyses of genetic variations in human populations showed the presence of various types of copy number variations (CNVs) at the CEACAM/PSG locus. These copy number polymorphisms have 3–80% frequency in select populations, and encompass single to more than six PSG genes. Furthermore, we found that CEACAM/PSG genes contain a significantly higher density of nonsynonymous single nucleotide polymorphism (SNP) compared to the chromosome average, and many CEACAM/PSG SNPs exhibit high population differentiation. Taken together, our study suggested that CEACAM/PSG genes have had a more dynamic evolutionary history in vertebrates than previously thought. Given that CEACAM/PSGs play important roles in maternal–fetal interaction and pathogen recognition, these data have laid the groundwork for future analysis of adaptive CEACAM/PSG genotype-phenotypic relationships in normal and complicated pregnancies as well as other etiologies.
Collapse
Affiliation(s)
- Chia Lin Chang
- Department of Obstetrics and Gynecology, Chang Gung Memorial Hospital Linkou Medical Center, Chang Gung University, Kweishan, Taoyuan, Taiwan.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
20
|
Zhou W, Zhang F, Chen X, Shen Y, Lupski JR, Jin L. Increased genome instability in human DNA segments with self-chains: homology-induced structural variations via replicative mechanisms. Hum Mol Genet 2013; 22:2642-51. [PMID: 23474816 DOI: 10.1093/hmg/ddt113] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
Environmental factors including ionizing radiation and chemical agents have been known to be able to induce DNA rearrangements and cause genomic structural variations (SVs); however, the roles of intrinsic characteristics of the human genome, such as regional genome architecture, in SV formation and the potential mechanisms underlying genomic instability remain to be further elucidated. Recently, locus-specific observations showed that 'self-chain' (SC), a group of short low-copy repeats (LCRs) in the human genome, can induce autism-associated SV mutations of the MECP2 and NRXN1 genes. In this study, we conducted a genome-wide analysis to investigate SCs and their potential roles in genomic SV formation. Utilizing a vast amount of human SV data, we observed a significant biased distribution of human germline SV breakpoints to SC regions. Notably, the breakpoint distribution pattern is different between SV types across deletion, duplication, inversion and insertion. Our observations were coincident with a mechanism of SC-induced DNA replicative errors, whereas SC may sporadically be used as substrates of nonallelic homologous recombination (NAHR). This contention was further supported by our consistent findings in somatic SV mutations of cancer genomes, suggesting a general mechanism of SC-induced genome instability in human germ and somatic cells.
Collapse
Affiliation(s)
- Weichen Zhou
- State Key Laboratory of Genetic Engineering and Ministry of Education, Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China
| | | | | | | | | | | |
Collapse
|
21
|
Chen X, Shen Y, Zhang F, Chiang C, Pillalamarri V, Blumenthal I, Talkowski M, Wu BL, Gusella J. Molecular analysis of a deletion hotspot in the NRXN1 region reveals the involvement of short inverted repeats in deletion CNVs. Am J Hum Genet 2013; 92:375-86. [PMID: 23472757 PMCID: PMC3591860 DOI: 10.1016/j.ajhg.2013.02.006] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2012] [Revised: 12/04/2012] [Accepted: 02/12/2013] [Indexed: 01/07/2023] Open
Abstract
NRXN1 microdeletions occur at a relatively high frequency and confer increased risk for neurodevelopmental and neurobehavioral abnormalities. The mechanism that makes NRXN1 a deletion hotspot is unknown. Here, we identified deletions of the NRXN1 region in affected cohorts, confirming a strong association with the autism spectrum and other neurodevelopmental disorders. Interestingly, deletions in both affected and control individuals were clustered in the 5' portion of NRXN1 and its immediate upstream region. To explore the mechanism of deletion, we mapped and analyzed the breakpoints of 32 deletions. At the deletion breakpoints, frequent microhomology (68.8%, 2-19 bp) suggested predominant mechanisms of DNA replication error and/or microhomology-mediated end-joining. Long terminal repeat (LTR) elements, unique non-B-DNA structures, and MEME-defined sequence motifs were significantly enriched, but Alu and LINE sequences were not. Importantly, small-size inverted repeats (minus self chains, minus sequence motifs, and partial complementary sequences) were significantly overrepresented in the vicinity of NRXN1 region deletion breakpoints, suggesting that, although they are not interrupted by the deletion process, such inverted repeats can predispose a region to genomic instability by mediating single-strand DNA looping via the annealing of partially reverse complementary strands and the promoting of DNA replication fork stalling and DNA replication error. Our observations highlight the potential importance of inverted repeats of variable sizes in generating a rearrangement hotspot in which individual breakpoints are not recurrent. Mechanisms that involve short inverted repeats in initiating deletion may also apply to other deletion hotspots in the human genome.
Collapse
Affiliation(s)
- Xiaoli Chen
- Capital Institute of Pediatrics, Beijing 100020, China
- Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA 02114, USA
- Department of Laboratory Medicine, Children’s Hospital Boston, Boston, MA 02115, USA
| | - Yiping Shen
- Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA 02114, USA
- Department of Laboratory Medicine, Children’s Hospital Boston, Boston, MA 02115, USA
- Shanghai Children’s Medical Center, Shanghai Jiaotong University School of Medicine, Shanghai 200127, China
- Department of Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Feng Zhang
- State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai 200433, China
| | - Colby Chiang
- Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Vamsee Pillalamarri
- Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Ian Blumenthal
- Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Michael Talkowski
- Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA 02114, USA
- Department of Neurology, Harvard Medical School, Boston, MA 02115, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Bai-Lin Wu
- Department of Laboratory Medicine, Children’s Hospital Boston, Boston, MA 02115, USA
- Department of Pathology, Harvard Medical School, Boston, MA 02115, USA
- Children’s Hospital and Institutes of Biomedical Science, Fudan University, Shanghai 200032, China
| | - James F. Gusella
- Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA 02114, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Department of Genetics, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
22
|
Chen W, Xu Z, Nishitani M, Van Ryzin C, McDonnell NB, Merke DP. Complement component 4 copy number variation and CYP21A2 genotype associations in patients with congenital adrenal hyperplasia due to 21-hydroxylase deficiency. Hum Genet 2012; 131:1889-94. [PMID: 22886582 DOI: 10.1007/s00439-012-1217-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2012] [Accepted: 08/01/2012] [Indexed: 11/30/2022]
Abstract
Congenital adrenal hyperplasia (CAH) due to 21-hydroxylase deficiency (21-OHD) is an autosomal recessive disorder of cortisol biosynthesis caused by CYP21A2 mutations. An increase in gene copy number variation (CNV) exists at the CYP21A2 locus. CNV of C4, a neighboring gene that encodes complement component 4, is associated with autoimmune disease susceptibility. In this study, we performed comprehensive genetic analysis of the RP-C4-CYP21-TNX (RCCX) region in 127 unrelated 21-OHD patients (100 classic, 27 nonclassic). C4 copy number was determined by Southern blot. C4 CNV and serum C4 levels were evaluated in relation to CYP21A2 mutations and relevant phenotypes. We found that the most common CYP21A2 mutation associated with the nonclassic form of CAH, V281L, was associated with high C4 copy number (p = 7.13 × 10(-16)). Large CYP21A2 deletion, a common mutation associated with the classic form of CAH, was associated with low C4 copy number (p = 1.61 × 10(-14)). Monomodular RCCX with a short C4 gene, a risk factor for autoimmune disease, was significantly less frequent in CAH patients compared to population estimates (2.8 vs. 10.6 %; p = 1.08 × 10(-4)). In conclusion, CAH patients have increased C4 CNV, with mutation-specific associations that may be protective for autoimmune disease. The study of CYP21A2 in relation to neighboring genes provides insight into the genetics of CNV hotspots, an important determinant of human health.
Collapse
Affiliation(s)
- Wuyan Chen
- Laboratory of Clinical Investigation, National Institute on Aging, Baltimore, MD, USA.
| | | | | | | | | | | |
Collapse
|
23
|
Du R, Lu C, Jiang Z, Li S, Ma R, An H, Xu M, An Y, Xia Y, Jin L, Wang X, Zhang F. Efficient typing of copy number variations in a segmental duplication-mediated rearrangement hotspot using multiplex competitive amplification. J Hum Genet 2012; 57:545-551. [PMID: 22673690 DOI: 10.1038/jhg.2012.66] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Local genomic architecture, such as segmental duplications (SDs), can induce copy number variations (CNVs) hotspots in the human genome, many of which manifest as genomic disorders. Significant technological advances have been achieved for genome-wide CNV investigations, but these costly methods are not suitable for genotyping certain disease-associated CNVs or other loci of interest in populations. Recently, two independent studies showed that the murine meiosis expressed gene 1 (Meig1) was critical to spermatogenesis. We found that the human orthologue MEIG1 is flanked by an SD pair, between which non-allelic homologous recombination (NAHR) can cause recurrent CNVs. To study this potential CNV hotspot and its role in spermatogenesis, we developed a new CNV genotyping method, AccuCopy, based on multiplex competitive amplification to investigate 320 patients with spermatogenic impairment and 93 healthy controls. Three MEIG1 duplications (two in patients and one in controls) were identified, whereas no deletion was found. As NAHR results in more recurrent deletions than duplications at a locus, the over representation of recurrent MEIG1 duplications suggests a potential purifying selection operating on this hotspot, possibly via fecundity. We also showed that AccuCopy is an efficient and reliable method for multiplex CNV genotyping.
Collapse
Affiliation(s)
- Renqian Du
- MOE Key Laboratory of Contemporary Anthropology and State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
24
|
Machado LR, Hardwick RJ, Bowdrey J, Bogle H, Knowles TJ, Sironi M, Hollox EJ. Evolutionary history of copy-number-variable locus for the low-affinity Fcγ receptor: mutation rate, autoimmune disease, and the legacy of helminth infection. Am J Hum Genet 2012; 90:973-85. [PMID: 22608500 DOI: 10.1016/j.ajhg.2012.04.018] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2011] [Revised: 03/22/2012] [Accepted: 04/10/2012] [Indexed: 11/18/2022] Open
Abstract
Both sequence variation and copy-number variation (CNV) of the genes encoding receptors for immunoglobulin G (Fcγ receptors) have been genetically and functionally associated with a number of autoimmune diseases. However, the molecular nature and evolutionary context of this variation is unknown. Here, we describe the structure of the CNV, estimate its mutation rate and diversity, and place it in the context of the known functional alloantigen variation of these genes. Deletion of Fcγ receptor IIIB, associated with systemic lupus erythematosus, is a result of independent nonallelic homologous recombination events with a frequency of approximately 0.1%. We also show that pathogen diversity, in particular helminth diversity, has played a critical role in shaping the functional variation at these genes both between mammalian species and between human populations. Positively selected amino acids are involved in the interaction with IgG and include some amino acids that are known polymorphic alloantigens in humans. This supports a genetic contribution to the hygiene hypothesis, which states that past evolution in the context of helminth diversity has left humans with an array of susceptibility alleles for autoimmune disease in the context of a helminth-free environment. This approach shows the link between pathogens and autoimmune disease at the genetic level and provides a strategy for interrogating the genetic variation underlying autoimmune-disease risk and infectious-disease susceptibility.
Collapse
|
25
|
Iskow RC, Gokcumen O, Lee C. Exploring the role of copy number variants in human adaptation. Trends Genet 2012; 28:245-57. [PMID: 22483647 PMCID: PMC3533238 DOI: 10.1016/j.tig.2012.03.002] [Citation(s) in RCA: 97] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2011] [Revised: 03/05/2012] [Accepted: 03/06/2012] [Indexed: 11/18/2022]
Abstract
Over the past decade, the ubiquity of copy number variants (CNVs, the gain or loss of genomic material) in the genomes of healthy humans has become apparent. Although some of these variants are associated with disorders, a handful of studies documented an adaptive advantage conferred by CNVs. In this review, we propose that CNVs are substrates for human evolution and adaptation. We discuss the possible mechanisms and evolutionary processes in which CNVs are selected, outline the current challenges in identifying these loci, and highlight that copy number variable regions allow for the creation of novel genes that may diversify the repertoire of such genes in response to rapidly changing environments. We expect that many more adaptive CNVs will be discovered in the coming years, and we believe that these new findings will contribute to our understanding of human-specific phenotypes.
Collapse
Affiliation(s)
- Rebecca C Iskow
- Department of Pathology, Brigham and Women's Hospital, Boston, MA 02115, USA
| | | | | |
Collapse
|
26
|
Ortutay C, Vihinen M. Conserved and quickly evolving immunome genes have different evolutionary paths. Hum Mutat 2012; 33:1456-63. [PMID: 22623381 DOI: 10.1002/humu.22125] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2012] [Accepted: 05/15/2012] [Indexed: 12/11/2022]
Abstract
Genetic, transcript, and protein level variations have important functional and evolutionary consequences. We performed systematic data collection and analysis of copy-number variations, single-nucleotide polymorphisms, disease-causing variations, messenger RNA splicing variants, and protein posttranslational modifications for the genes and proteins essential for human immune system. Information about polymorphic and evolutionarily fixed genetic variations was used to group immunome genes to the most conserved and the most quickly changing ones under directed selection during the recent immunome evolution. Gene Ontology terms related to adaptive immunity are associated with gene groups subject to recent directing selection. In addition, several other characteristics of the immunome genes and proteins in these two categories have statistically significant differences. The presented findings question the usability of directed mouse genes as models for human diseases and conditions and shed light on the fine tuning of human immunity and its diverse functions.
Collapse
Affiliation(s)
- Csaba Ortutay
- Institute of Biomedical Technology, University of Tampere, Tampere, Finland
| | | |
Collapse
|
27
|
Su Z, Gu X. Revisit on the evolutionary relationship between alternative splicing and gene duplication. Gene 2012; 504:102-6. [PMID: 22621894 DOI: 10.1016/j.gene.2012.05.012] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2012] [Revised: 03/18/2012] [Accepted: 05/07/2012] [Indexed: 11/19/2022]
Abstract
Gene duplications and alternative splicing (AS) isoforms are two widespread types of genetic variations that can facilitate diversification of protein function. A number of studies claimed that after gene duplication, two AS isoforms with differential functions can be 'fixed', respectively, in each of the duplicate copies. This simple 'functional-sharing' hypothesis was recently challenged by Roux and Robinson-Rechavi (2011). Instead, they proposed a more sophisticated hypothesis, invoking that less alternative splicing genes tend to be duplicated more frequently, and single-copy genes are younger than duplicate genes, or the 'duplicability-age' hypothesis for short. In this letter, we show that all these genome-wide analyses of AS isoforms actually did not provide clear-cut evidence to nullify the basic idea of functional-sharing hypothesis. After updating our understanding of genome-wide alternative splicing, duplicability and CNV (copy number variation), we argue that the foundation of the duplicability-age hypothesis remains to be justified carefully. Finally, we suggest that a better approach to resolving this controversy is the correspondence analysis of indels (insertions and deletions) between duplicate genes to the genomic exon-intron structure, which can be used to experimentally test the effect of functional-sharing hypothesis.
Collapse
Affiliation(s)
- Zhixi Su
- MOE Key Laboratory of Contemporary Anthropology and Center for Evolutionary Biology, School of Life Sciences, Fudan University, Shanghai 200433, China
| | | |
Collapse
|
28
|
Lu P, Han X, Qi J, Yang J, Wijeratne AJ, Li T, Ma H. Analysis of Arabidopsis genome-wide variations before and after meiosis and meiotic recombination by resequencing Landsberg erecta and all four products of a single meiosis. Genome Res 2012; 22:508-18. [PMID: 22106370 PMCID: PMC3290786 DOI: 10.1101/gr.127522.111] [Citation(s) in RCA: 98] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2011] [Accepted: 11/17/2011] [Indexed: 11/24/2022]
Abstract
Meiotic recombination, including crossovers (COs) and gene conversions (GCs), impacts natural variation and is an important evolutionary force. COs increase genetic diversity by redistributing existing variation, whereas GCs can alter allelic frequency. Here, we sequenced Arabidopsis Landsberg erecta (Ler) and two sets of all four meiotic products from a Columbia (Col)/Ler hybrid to investigate genome-wide variation and meiotic recombination at nucleotide resolution. Comparing Ler and Col sequences uncovered 349,171 Single Nucleotide Polymorphisms (SNPs), 58,085 small and 2315 large insertions/deletions (indels), with highly correlated genome-wide distributions of SNPs, and small indels. A total of 443 genes have at least 10 nonsynonymous substitutions in protein-coding regions, with enrichment for disease-resistance genes. Another 316 genes are affected by large indels, including 130 genes with complete deletion of coding regions in Ler. Using the Arabidopsis qrt1 mutant, two sets of four meiotic products were generated and analyzed by sequencing for meiotic recombination, representing the first tetrad analysis with whole-genome sequencing in a nonfungal species. We detected 18 COs, six of which had an associated GC event, and four GCs without COs (NCOs), and revealed that Arabidopsis GCs are likely fewer and with shorter tracts than those in yeast. Meiotic recombination and chromosome assortment events dramatically redistributed genome variation in meiotic products, contributing to population diversity. In particular, meiosis provides a rapid mechanism to generate copy-number variation (CNV) of sequences that have different chromosomal positions in Col and Ler.
Collapse
Affiliation(s)
- Pingli Lu
- Department of Biology and the Huck Institutes of the Life Sciences, the Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Xinwei Han
- Department of Biology and the Huck Institutes of the Life Sciences, the Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Intercollege Graduate Program in Genetics, the Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Ji Qi
- State Key Laboratory of Genetic Engineering, Institute of Plant Biology, Center for Evolutionary Biology, School of Life Sciences, Fudan University, Shanghai 200433, China
- Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China
| | - Jiange Yang
- Department of Biology and the Huck Institutes of the Life Sciences, the Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Asela J. Wijeratne
- Department of Biology and the Huck Institutes of the Life Sciences, the Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Intercollege Graduate Program in Plant Biology, the Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Tao Li
- Institute of Hydrobiology, Chinese Academy of Science, Wuhan 430072, China
| | - Hong Ma
- State Key Laboratory of Genetic Engineering, Institute of Plant Biology, Center for Evolutionary Biology, School of Life Sciences, Fudan University, Shanghai 200433, China
- Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China
| |
Collapse
|
29
|
Cardoso-Moreira M, Emerson JJ, Clark AG, Long M. Drosophila duplication hotspots are associated with late-replicating regions of the genome. PLoS Genet 2011; 7:e1002340. [PMID: 22072977 PMCID: PMC3207856 DOI: 10.1371/journal.pgen.1002340] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2011] [Accepted: 08/26/2011] [Indexed: 12/22/2022] Open
Abstract
Duplications play a significant role in both extremes of the phenotypic spectrum of newly arising mutations: they can have severe deleterious effects (e.g. duplications underlie a variety of diseases) but can also be highly advantageous. The phenotypic potential of newly arisen duplications has stimulated wide interest in both the mutational and selective processes shaping these variants in the genome. Here we take advantage of the Drosophila simulans–Drosophila melanogaster genetic system to further our understanding of both processes. Regarding mutational processes, the study of two closely related species allows investigation of the potential existence of shared duplication hotspots, and the similarities and differences between the two genomes can be used to dissect its underlying causes. Regarding selection, the difference in the effective population size between the two species can be leveraged to ask questions about the strength of selection acting on different classes of duplications. In this study, we conducted a survey of duplication polymorphisms in 14 different lines of D. simulans using tiling microarrays and combined it with an analogous survey for the D. melanogaster genome. By integrating the two datasets, we identified duplication hotspots conserved between the two species. However, unlike the duplication hotspots identified in mammalian genomes, Drosophila duplication hotspots are not associated with sequences of high sequence identity capable of mediating non-allelic homologous recombination. Instead, Drosophila duplication hotspots are associated with late-replicating regions of the genome, suggesting a link between DNA replication and duplication rates. We also found evidence supporting a higher effectiveness of selection on duplications in D. simulans than in D. melanogaster. This is also true for duplications segregating at high frequency, where we find evidence in D. simulans that a sizeable fraction of these mutations is being driven to fixation by positive selection. DNA duplications are important contributors to the phenotypic differences observed between individuals. These mutations can disrupt the normal functioning of genes and so are often associated with disease. But because they can add genetic information they can also lead to evolutionary change. Understanding how selection and non-random mutation processes shape the distribution of duplications throughout the genome is important to elucidate both the medical and evolutionary impacts of these mutations. Here, we examined the roles of selection and mutation in shaping patterns of duplication polymorphisms across the genomes of the fruit fly Drosophila melanogaster and its sister species, D. simulans. We found that selection is pervasive in both genomes but is more efficient in D. simulans than in D. melanogaster. We also found that these two species have shared duplication hotspots, i.e. orthologous regions experiencing high rates of duplication in the two genomes. After excluding the hypothesis that Drosophila duplication hotspots are associated with regions of the genome rich in segmental duplications (as observed for mammalian genomes), we show that they are associated with late-replicating regions of the genome. Our work therefore proposes a link between DNA replication and rates of duplication across the genome.
Collapse
|
30
|
Cooper DN, Bacolla A, Férec C, Vasquez KM, Kehrer-Sawatzki H, Chen JM. On the sequence-directed nature of human gene mutation: the role of genomic architecture and the local DNA sequence environment in mediating gene mutations underlying human inherited disease. Hum Mutat 2011; 32:1075-99. [PMID: 21853507 PMCID: PMC3177966 DOI: 10.1002/humu.21557] [Citation(s) in RCA: 90] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2011] [Accepted: 06/17/2011] [Indexed: 12/21/2022]
Abstract
Different types of human gene mutation may vary in size, from structural variants (SVs) to single base-pair substitutions, but what they all have in common is that their nature, size and location are often determined either by specific characteristics of the local DNA sequence environment or by higher order features of the genomic architecture. The human genome is now recognized to contain "pervasive architectural flaws" in that certain DNA sequences are inherently mutation prone by virtue of their base composition, sequence repetitivity and/or epigenetic modification. Here, we explore how the nature, location and frequency of different types of mutation causing inherited disease are shaped in large part, and often in remarkably predictable ways, by the local DNA sequence environment. The mutability of a given gene or genomic region may also be influenced indirectly by a variety of noncanonical (non-B) secondary structures whose formation is facilitated by the underlying DNA sequence. Since these non-B DNA structures can interfere with subsequent DNA replication and repair and may serve to increase mutation frequencies in generalized fashion (i.e., both in the context of subtle mutations and SVs), they have the potential to serve as a unifying concept in studies of mutational mechanisms underlying human inherited disease.
Collapse
Affiliation(s)
- David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, United Kingdom.
| | | | | | | | | | | |
Collapse
|
31
|
Du RQ, Jin L, Zhang F. [Copy number variations in the human genome: their mutational mechanisms and roles in diseases]. YI CHUAN = HEREDITAS 2011; 33:857-69. [PMID: 21831802 DOI: 10.3724/sp.j.1005.2011.00857] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Copy number variation (CNV) is the main type of structure variation (SV) caused by genomic rearrangement, which mainly includes deletion and duplication of sub-microscopic but large (>1 kb) genomic segments. CNV has been recognized as one of the main genetic factors underlying human diseases. The mutation rate (per locus) of CNV is much higher than that of single nucleotide polymorphism (SNP). The genome-wide assays for CNV study include array-based comparative genomic hybridization (aCGH), SNP genotyping microarrays, and next-generation sequencing techniques. Various molecular mechanisms are involved in CNV formation, which can be divided into two main categories, DNA recombination-based and DNA replication-based mechanisms. CNVs can be associated with Mendelian diseases, sporadic diseases, and susceptibility to complex diseases. CNVs can convey clinical phenotypes by gene dosage, gene disruption, gene fusion, and position effects. Further studies on CNVs will shed new light on human genome structure, genetic variations between individuals, and missing heritability of human diseases.
Collapse
Affiliation(s)
- Ren-Qian Du
- MOE Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai 200433, China.
| | | | | |
Collapse
|
32
|
Multiple recurrent de novo CNVs, including duplications of the 7q11.23 Williams syndrome region, are strongly associated with autism. Neuron 2011; 70:863-85. [PMID: 21658581 DOI: 10.1016/j.neuron.2011.05.002] [Citation(s) in RCA: 902] [Impact Index Per Article: 69.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/18/2011] [Indexed: 11/22/2022]
Abstract
We have undertaken a genome-wide analysis of rare copy-number variation (CNV) in 1124 autism spectrum disorder (ASD) families, each comprised of a single proband, unaffected parents, and, in most kindreds, an unaffected sibling. We find significant association of ASD with de novo duplications of 7q11.23, where the reciprocal deletion causes Williams-Beuren syndrome, characterized by a highly social personality. We identify rare recurrent de novo CNVs at five additional regions, including 16p13.2 (encompassing genes USP7 and C16orf72) and Cadherin 13, and implement a rigorous approach to evaluating the statistical significance of these observations. Overall, large de novo CNVs, particularly those encompassing multiple genes, confer substantial risks (OR = 5.6; CI = 2.6-12.0, p = 2.4 × 10(-7)). We estimate there are 130-234 ASD-related CNV regions in the human genome and present compelling evidence, based on cumulative data, for association of rare de novo events at 7q11.23, 15q11.2-13.1, 16p11.2, and Neurexin 1.
Collapse
|
33
|
Dukes-Rimsky L, Guzauskas GF, Holden KR, Griggs R, Ladd S, Montoya MDC, DuPont BR, Srivastava AK. Microdeletion at 4q21.3 is associated with intellectual disability, dysmorphic facies, hypotonia, and short stature. Am J Med Genet A 2011; 155A:2146-53. [PMID: 21834054 DOI: 10.1002/ajmg.a.34137] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2010] [Accepted: 04/22/2011] [Indexed: 11/05/2022]
Abstract
Chromosomal imbalances are a major cause of intellectual disability (ID) and multiple congenital anomalies. We have clinically and molecularly characterized two patients with chromosome translocations and ID. Using whole genome array CGH analysis, we identified a microdeletion involving 4q21.3, unrelated to the translocations in both patients. We confirmed the 4q21.3 microdeletions using fluorescence in situ hybridization and quantitative genomic PCR. The corresponding deletion boundaries in the patients were further mapped and compared to previously reported 4q21 deletions and the associated clinical features. We determined a common region of deletion overlap that appears unique to ID, short stature, hypotonia, and dysmorphic facial features.
Collapse
Affiliation(s)
- Lynn Dukes-Rimsky
- J C Self Research Institute of Human Genetics, Greenwood Genetic Center, Greenwood, South Carolina 29646, USA
| | | | | | | | | | | | | | | |
Collapse
|