101
|
Chen W, Hayward C, Wright AF, Hicks AA, Vitart V, Knott S, Wild SH, Pramstaller PP, Wilson JF, Rudan I, Porteous DJ. Copy number variation across European populations. PLoS One 2011; 6:e23087. [PMID: 21829696 PMCID: PMC3150386 DOI: 10.1371/journal.pone.0023087] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2011] [Accepted: 07/12/2011] [Indexed: 12/13/2022] Open
Abstract
Genome analysis provides a powerful approach to test for evidence of genetic variation within and between geographical regions and local populations. Copy number variants which comprise insertions, deletions and duplications of genomic sequence provide one such convenient and informative source. Here, we investigate copy number variants from genome wide scans of single nucleotide polymorphisms in three European population isolates, the island of Vis in Croatia, the islands of Orkney in Scotland and the South Tyrol in Italy. We show that whereas the overall copy number variant frequencies are similar between populations, their distribution is highly specific to the population of origin, a finding which is supported by evidence for increased kinship correlation for specific copy number variants within populations.
Collapse
Affiliation(s)
- Wanting Chen
- Medical Genetics Section, Centre for Molecular Medicine, Institute of Genetics & Molecular Medicine, University of Edinburgh, Western General Hospital, Crewe Road South, Edinburgh, United Kingdom
| | - Caroline Hayward
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, United Kingdom
| | - Alan F. Wright
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, United Kingdom
| | - Andrew A. Hicks
- Institute of Genetic Medicine, European Academy Bozen/Bolzano (EURAC), Bolzano/Bozen, Italy - Affiliated Institute of the University of Lübeck, Lübeck, Germany
| | - Veronique Vitart
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, United Kingdom
| | - Sara Knott
- Institute of Evolutionary Biology, University of Edinburgh, Ashworth Laboratories, King's Buildings, Edinburgh, United Kingdom
| | - Sarah H. Wild
- Centre for Population Health Sciences, The University of Edinburgh Medical School, Edinburgh, United Kingdom
| | - Peter P. Pramstaller
- Institute of Genetic Medicine, European Academy Bozen/Bolzano (EURAC), Bolzano/Bozen, Italy - Affiliated Institute of the University of Lübeck, Lübeck, Germany
- Department of Neurology, General Central Hospital, Bolzano, Italy
- Department of Neurology, University of Lübeck, Lübeck, Germany
| | - James F. Wilson
- Centre for Population Health Sciences, The University of Edinburgh Medical School, Edinburgh, United Kingdom
| | - Igor Rudan
- Centre for Population Health Sciences, The University of Edinburgh Medical School, Edinburgh, United Kingdom
- Croatian Centre for Global Health, University of Split Medical School, Split, Croatia
| | - David J. Porteous
- Medical Genetics Section, Centre for Molecular Medicine, Institute of Genetics & Molecular Medicine, University of Edinburgh, Western General Hospital, Crewe Road South, Edinburgh, United Kingdom
- * E-mail:
| |
Collapse
|
102
|
Johansson ACV, Feuk L. Characterization of copy number-stable regions in the human genome. Hum Mutat 2011; 32:947-55. [PMID: 21542059 DOI: 10.1002/humu.21524] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2011] [Accepted: 04/20/2011] [Indexed: 01/25/2023]
Abstract
In the past few years the number of copy number variants (CNVs) identified in the human genome has increased significantly, but our understanding of the functional impact of CNVs is still limited. Clinically significant variations cannot easily be distinguished from benign, complicating interpretation of patient data. Multiple studies have focused on analysis of regions that vary in copy number in specific disorders. Here we use the opposite strategy and focus our analysis on regions that never seem to vary in the general population, hypothesizing that these are copy number stable because variations within them are deleterious. Our results show that copy number stable regions are characterized by correlation with a number of genomic features, allowing us to define a list of genomic regions that are dosage sensitive in humans. We find that these dosage-sensitive regions show significant overlap with de novo CNVs identified in patients with intellectual disability or autism. There is also a significant association between copy number stable regions and rare inherited variants in autism patients, but not in controls. Based on this predictive power, we propose that copy number stable regions can be used to complement maps of known CNVs to facilitate interpretation of patient data.
Collapse
Affiliation(s)
- Anna C V Johansson
- Department of Immunology, Genetics and Pathology, Rudbeck Laboratory, Uppsala University, Uppsala, Sweden
| | | |
Collapse
|
103
|
Mkrtchyan H, Gross M, Hinreiner S, Polytiko A, Manvelyan M, Mrasek K, Kosyakova N, Ewers E, Nelle H, Liehr T, Bhatt S, Thoma K, Gebhart E, Wilhelm S, Fahsold R, Volleth M, Weise A. The human genome puzzle - the role of copy number variation in somatic mosaicism. Curr Genomics 2011; 11:426-31. [PMID: 21358987 PMCID: PMC3018723 DOI: 10.2174/138920210793176047] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2010] [Revised: 05/01/2010] [Accepted: 05/24/2010] [Indexed: 11/30/2022] Open
Abstract
The discovery of copy number variations (CNV) in the human genome opened new perspectives in the study of the genetic causes of inherited disorders and the etiology of common diseases. Differently patterned instances of somatic mosaicism in CNV regions have been shown to be present in monozygotic twins and throughout different tissues within an individual. A single-cell-level investigation of CNV in different human cell types led us to uncover mitotically derived genomic mosaicism, which is stable in different cell types of one individual. A unique study of immortalized B-lymphoblastoid cell lines obtained with 20 year interval from the same two subjects shows that mitotic changes in CNV regions may happen early during embryonic development and seem to occur only once, as levels of mosaicism remained stable. This finding has the potential to change our concept of dynamic human genome variation. We propose that further genomic studies should focus on the single-cell level, to understand better the etiology and physiology of aging and diseases mediated by somatic variations.
Collapse
Affiliation(s)
- Hasmik Mkrtchyan
- Jena University Hospital, Institute of Human Genetics and Anthropology, Jena, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
104
|
Liu WW, Gao YX, Zhou LP, Duan A, Tan LL, Li WZ, Yan M, Yang HY, Yan SL, Wang MQ, Ding WJ. Observations on Copy Number Variations in a Kidney-yang Deficiency Syndrome Family. EVIDENCE-BASED COMPLEMENTARY AND ALTERNATIVE MEDICINE 2011; 2011:548358. [PMID: 21811512 PMCID: PMC3136678 DOI: 10.1093/ecam/neq069] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/01/2009] [Accepted: 05/19/2010] [Indexed: 11/17/2022]
Abstract
We have performed an analysis of a family with kidney-yang deficiency syndrome (KDS) in order to determine the structural genomic variations through a novel approach designated as “copy number variants” (CNVs). Twelve KDS subjects and three healthy spouses from this family were included in this study. Genomic DNA samples were genotyped utilizing an Affymetrix 100 K single nucleotide polymorphism array, and CNVs were identified by Copy Number Algorithm (CNAT4.0, Affymetrix). Our results demonstrate that 447 deleted and 476 duplicated CNVs are shared among KDS subjects within the family. The homologus ratio of deleted CNVs was as high as 99.78%. One-copy-duplicated CNVs display mid-range homology. For two copies of duplicated CNVs (CNV4), a markedly heterologous ratio was observed. Therefore, with the important exception of CNV4, our data shows that CNVs shared among KDS subjects display typical Mendelian inheritance. A total of 113 genes with established functions were identified from the CNV flanks; significantly enriched genes surrounding CNVs may contribute to certain adaptive benefit. These genes could be classified into categories including: binding and transporter, cell cycle, signal transduction, biogenesis, nerve development, metabolism regulation and immune response. They can also be included into three pathways, that is, signal transduction, metabolic processes and immunological networks. Particularly, the results reported here are consistent with the extensive impairments observed in KDS patients, involving the mass-energy-information-carrying network. In conclusion, this article provides the first set of CNVs from KDS patients that will facilitate our further understanding of the genetic basis of KDS and will allow novel strategies for a rational therapy of this disease.
Collapse
Affiliation(s)
- Wei Wei Liu
- Department of Fundamental Medicine, Chengdu University of Traditional Chinese Medicine, Chengdu 610075, China
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
105
|
Abstract
Despite the common assumption that orthologs usually share the same function, there have been various reports of divergence between orthologs, even among species as close as mammals. The comparison of mouse and human is of special interest, because mouse is often used as a model organism to understand human biology. We review the literature on evidence for divergence between human and mouse orthologous genes, and discuss it in the context of biomedical research.
Collapse
Affiliation(s)
- Walid H Gharib
- Department of Ecology and Evolution, Biophore, Swiss Institute of Bioinformatics, Lausanne University, CH-1015 Lausanne, Switzerland
| | | |
Collapse
|
106
|
Magi A, Benelli M, Yoon S, Roviello F, Torricelli F. Detecting common copy number variants in high-throughput sequencing data by using JointSLM algorithm. Nucleic Acids Res 2011; 39:e65. [PMID: 21321017 PMCID: PMC3105418 DOI: 10.1093/nar/gkr068] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
The discovery of genomic structural variants (SVs), such as copy number variants (CNVs), is essential to understand genetic variation of human populations and complex diseases. Over recent years, the advent of new high-throughput sequencing (HTS) platforms has opened many opportunities for SVs discovery, and a very promising approach consists in measuring the depth of coverage (DOC) of reads aligned to the human reference genome. At present, few computational methods have been developed for the analysis of DOC data and all of these methods allow to analyse only one sample at time. For these reasons, we developed a novel algorithm (JointSLM) that allows to detect common CNVs among individuals by analysing DOC data from multiple samples simultaneously. We test JointSLM performance on synthetic and real data and we show its unprecedented resolution that enables the detection of recurrent CNV regions as small as 500 bp in size. When we apply JointSLM to analyse chromosome one of eight genomes with different ancestry, we identify 3000 regions with recurrent CNVs of different frequency and size: hierarchical clustering on these regions segregates the eight individuals in two groups that reflect their ancestry, demonstrating the potential utility of JointSLM for population genetics studies.
Collapse
Affiliation(s)
- Alberto Magi
- Laboratory Department, Diagnostic Genetic Unit, Careggi Hospital, Florence 5014, Italy.
| | | | | | | | | |
Collapse
|
107
|
Pramanik S, Cui X, Wang HY, Chimge NO, Hu G, Shen L, Gao R, Li H. Segmental duplication as one of the driving forces underlying the diversity of the human immunoglobulin heavy chain variable gene region. BMC Genomics 2011; 12:78. [PMID: 21272357 PMCID: PMC3042411 DOI: 10.1186/1471-2164-12-78] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2010] [Accepted: 01/27/2011] [Indexed: 11/10/2022] Open
Abstract
Background Segmental duplication and deletion were implicated for a region containing the human immunoglobulin heavy chain variable (IGHV) gene segments, 1.9III/hv3005 (possible allelic variants of IGHV3-30) and hv3019b9 (a possible allelic variant of IGHV3-33). However, very little is known about the ranges of the duplication and the polymorphic region. This is mainly because of the difficulty associated with distinguishing between allelic and paralogous sequences in the IGHV region containing extensive repetitive sequences. Inability to separate the two parental haploid genomes in the subjects is another serious barrier. To address these issues, unique DNA sequence tags evenly distributed within and flanking the duplicated region implicated by the previous studies were selected. The selected tags in single sperm from six unrelated healthy donors were amplified by multiplex PCR followed by microarray detection. In this way, individual haplotypes of different parental origins in the sperm donors could be analyzed separately and precisely. The identified polymorphic region was further analyzed at the nucleotide sequence level using sequences from the three human genomic sequence assemblies in the database. Results A large polymorphic region was identified using the selected sequence tags. Four of the 12 haplotypes were shown to contain consecutively undetectable tags spanning in a variable range. Detailed analysis of sequences from the genomic sequence assemblies revealed two large duplicate sequence blocks of 24,696 bp and 24,387 bp, respectively, and an incomplete copy of 961 bp in this region. It contains up to 13 IGHV gene segments depending on haplotypes. A polymorphic region was found to be located within the duplicated blocks. The variants of this polymorphism unusually diverged at the nucleotide sequence level and in IGHV gene segment number, composition and organization, indicating a limited selection pressure in general. However, the divergence level within the gene segments is significantly different from that in the intergenic regions indicating that these regions may have been subject to different selection pressures and that the IGHV gene segments in this region are functionally important. Conclusions Non-reciprocal genetic rearrangements associated with large duplicate sequence blocks could substantially contribute to the IGHV region diversity. Since the resulting polymorphisms may affect the number, composition and organization of the gene segments in this region, it may have significant impact on the function of the IGHV gene segment repertoire, antibody diversity, and therefore, the immune system. Because one of the gene segments, 3-30 (1.9III), is associated with autoimmune diseases, it could be of diagnostic significance to learn about the variants in the haplotypes by using the multiplex haplotype analysis system used in the present study with DNA sequence tags specific for the variants of all gene segments in this region.
Collapse
Affiliation(s)
- Sreemanta Pramanik
- Department of Molecular Genetics, Microbiology, and Immunology, University of Medicine and Dentistry of New Jersey-Robert Wood Johnson Medical School, Piscataway, NJ 08854, USA
| | | | | | | | | | | | | | | |
Collapse
|
108
|
Huber AK, Concepcion ES, Gandhi A, Menconi F, Smith EP, Keddache M, Tomer Y. Analysis of immune regulatory genes' copy number variants in Graves' disease. Thyroid 2011; 21:69-74. [PMID: 21054240 PMCID: PMC3012451 DOI: 10.1089/thy.2010.0262] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
BACKGROUND Copy number variants (CNVs) have recently been reported to be associated with several autoimmune conditions. Moreover, loci involved in immunity are enriched in CNVs. Therefore, we hypothesized that CNVs in immune genes associated with Graves' disease (GD) may contribute to the etiology of disease. METHODS One hundred ninety-one North American Caucasian GD patients and 192 Caucasian controls were analyzed for CNVs in three major immune regulatory genes: CD40, PTPN22, and CTLA-4. Copy number was determined using quantitative-PCR (Q-PCR) assays specifically designed for determining copy numbers in genomic DNA. Additionally, a well-characterized CNV in the amylase gene was typed in a separate dataset of DNA samples that were derived from cell lines or blood. RESULTS No CNVs could be confirmed in the CD40 and CTLA-4 genes, even though a CD40 CNV is cataloged in the Database of Genomic Variants. Only the PTPN22 CNV was confirmed in our cohort, but it was rare and appeared in only two individuals. A key finding was that the source of DNA has a significant effect on CNV typing. There was a statistically significant increase in amylase locus deletions in cell line-derived DNA compared to blood-derived DNA samples. CONCLUSIONS We conclude that CNV analysis should be performed only using blood-derived DNA Samples. Additionally, the CTLA-4, CD40, and PTPN22 loci do not harbor CNVs that play a role in the etiology of GD.
Collapse
Affiliation(s)
- Amanda K Huber
- Division of Endocrinology, Department of Medicine, Mount Sinai School of Medicine, New York, New York 10029, USA.
| | | | | | | | | | | | | |
Collapse
|
109
|
Li X, Tan L, Liu X, Lei S, Yang T, Chen X, Zhang F, Fang Y, Guo Y, Zhang L, Yan H, Pan F, Zhang Z, Peng Y, Zhou Q, He L, Zhu X, Cheng J, Zhang L, Liu Y, Tian Q, Deng H. A genome wide association study between copy number variation (CNV) and human height in Chinese population. J Genet Genomics 2010; 37:779-85. [DOI: 10.1016/s1673-8527(09)60095-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2010] [Revised: 10/27/2010] [Accepted: 10/29/2010] [Indexed: 12/24/2022]
|
110
|
Fontanesi L, Martelli PL, Beretti F, Riggio V, Dall'Olio S, Colombo M, Casadio R, Russo V, Portolano B. An initial comparative map of copy number variations in the goat (Capra hircus) genome. BMC Genomics 2010; 11:639. [PMID: 21083884 PMCID: PMC3011854 DOI: 10.1186/1471-2164-11-639] [Citation(s) in RCA: 103] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2010] [Accepted: 11/17/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The goat (Capra hircus) represents one of the most important farm animal species. It is reared in all continents with an estimated world population of about 800 million of animals. Despite its importance, studies on the goat genome are still in their infancy compared to those in other farm animal species. Comparative mapping between cattle and goat showed only a few rearrangements in agreement with the similarity of chromosome banding. We carried out a cross species cattle-goat array comparative genome hybridization (aCGH) experiment in order to identify copy number variations (CNVs) in the goat genome analysing animals of different breeds (Saanen, Camosciata delle Alpi, Girgentana, and Murciano-Granadina) using a tiling oligonucleotide array with ~385,000 probes designed on the bovine genome. RESULTS We identified a total of 161 CNVs (an average of 17.9 CNVs per goat), with the largest number in the Saanen breed and the lowest in the Camosciata delle Alpi goat. By aggregating overlapping CNVs identified in different animals we determined CNV regions (CNVRs): on the whole, we identified 127 CNVRs covering about 11.47 Mb of the virtual goat genome referred to the bovine genome (0.435% of the latter genome). These 127 CNVRs included 86 loss and 41 gain and ranged from about 24 kb to about 1.07 Mb with a mean and median equal to 90,292 bp and 49,530 bp, respectively. To evaluate whether the identified goat CNVRs overlap with those reported in the cattle genome, we compared our results with those obtained in four independent cattle experiments. Overlapping between goat and cattle CNVRs was highly significant (P < 0.0001) suggesting that several chromosome regions might contain recurrent interspecies CNVRs. Genes with environmental functions were over-represented in goat CNVRs as reported in other mammals. CONCLUSIONS We describe a first map of goat CNVRs. This provides information on a comparative basis with the cattle genome by identifying putative recurrent interspecies CNVs between these two ruminant species. Several goat CNVs affect genes with important biological functions. Further studies are needed to evaluate the functional relevance of these CNVs and their effects on behavior, production, and disease resistance traits in goats.
Collapse
Affiliation(s)
- Luca Fontanesi
- DIPROVAL, Sezione di Allevamenti Zootecnici, University of Bologna, Via F.lli Rosselli 107, 42123 Reggio Emilia, Italy.
| | | | | | | | | | | | | | | | | |
Collapse
|
111
|
Huang N, Lee I, Marcotte EM, Hurles ME. Characterising and predicting haploinsufficiency in the human genome. PLoS Genet 2010; 6:e1001154. [PMID: 20976243 PMCID: PMC2954820 DOI: 10.1371/journal.pgen.1001154] [Citation(s) in RCA: 488] [Impact Index Per Article: 34.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2010] [Accepted: 09/10/2010] [Indexed: 01/21/2023] Open
Abstract
Haploinsufficiency, wherein a single functional copy of a gene is insufficient to maintain normal function, is a major cause of dominant disease. Human disease studies have identified several hundred haploinsufficient (HI) genes. We have compiled a map of 1,079 haplosufficient (HS) genes by systematic identification of genes unambiguously and repeatedly compromised by copy number variation among 8,458 apparently healthy individuals and contrasted the genomic, evolutionary, functional, and network properties between these HS genes and known HI genes. We found that HI genes are typically longer and have more conserved coding sequences and promoters than HS genes. HI genes exhibit higher levels of expression during early development and greater tissue specificity. Moreover, within a probabilistic human functional interaction network HI genes have more interaction partners and greater network proximity to other known HI genes. We built a predictive model on the basis of these differences and annotated 12,443 genes with their predicted probability of being haploinsufficient. We validated these predictions of haploinsufficiency by demonstrating that genes with a high predicted probability of exhibiting haploinsufficiency are enriched among genes implicated in human dominant diseases and among genes causing abnormal phenotypes in heterozygous knockout mice. We have transformed these gene-based haploinsufficiency predictions into haploinsufficiency scores for genic deletions, which we demonstrate to better discriminate between pathogenic and benign deletions than consideration of the deletion size or numbers of genes deleted. These robust predictions of haploinsufficiency support clinical interpretation of novel loss-of-function variants and prioritization of variants and genes for follow-up studies. Humans, like most complex organisms, have two copies of most genes in their genome, one from the mother and one from the father. This redundancy provides a back-up copy for most genes, should one copy be lost through mutation. For a minority of genes, one functional copy is not enough to sustain normal human function, and mutations causing the loss of function of one of the copies of such genes are a major cause of childhood developmental diseases. Over the past 20 years medical geneticists have identified over 300 such genes, but it is not known how many of the 22,000 genes in our genome may also be sensitive to gene loss. By comparing these ∼300 genes known to be sensitive to gene loss with over 1,000 genes where loss of a single copy does not result in disease, we have identified some key evolutionary and functional similarities between genes sensitive to loss of a single copy. We have used these similarities to predict for most genes in the genome, whether loss of a single copy is likely to result in disease. These predictions will help in the interpretation of mutations seen in patients.
Collapse
Affiliation(s)
- Ni Huang
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge, United Kingdom
| | - Insuk Lee
- Center for Systems and Synthetic Biology, Department of Chemistry and Biochemistry, Institute for Cellular and Molecular Biology, University of Texas, Austin, Texas, United States of America
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul, South Korea
| | - Edward M. Marcotte
- Center for Systems and Synthetic Biology, Department of Chemistry and Biochemistry, Institute for Cellular and Molecular Biology, University of Texas, Austin, Texas, United States of America
| | - Matthew E. Hurles
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge, United Kingdom
- * E-mail:
| |
Collapse
|
112
|
Genetic variation of the Fc gamma receptor 3B gene and association with rheumatoid arthritis. PLoS One 2010; 5. [PMID: 20957197 PMCID: PMC2950138 DOI: 10.1371/journal.pone.0013173] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2010] [Accepted: 09/07/2010] [Indexed: 01/24/2023] Open
Abstract
Background Fc gamma receptors (FcγRs) play a crucial role in immunity by linking IgG antibody-mediated responses with cellular effector and regulatory functions. Genetic variants in these receptors have been previously identified as risk factors for several chronic inflammatory conditions. The present study aimed to investigate the presence of copy number variations (CNVs) in the FCGR3B gene and its potential association with the autoimmune disease rheumatoid arthritis (RA). Methodology/Principal Findings CNV of the FCGR3B gene was studied using Multiplex Ligation Dependent Probe Amplification (MLPA) in 518 Dutch RA patients and 304 healthy controls. Surprisingly, three independent MLPA probes targeting the FCGR3B promoter measured different CNV frequencies, with probe#1 and #2 measuring 0 to 5 gene copies and probe#3 showing little evidence of CNV. Quantitative-PCR correlated with the copy number results from MLPA probe#2, which detected low copy number (1 copy) in 6.7% and high copy number (≥3 copies) in 9.4% of the control population. No significant difference was observed between RA patients and the healthy controls, neither in the low copy nor the high copy number groups (p-values = 0.36 and 0.71, respectively). Sequencing of the FCGR3B promoter region revealed an insertion/deletion (indel) that explained the disparate CNV results of MLPA probe#1. Finally, a non-significant trend was found between the novel -256A>TG indel and RA (40.7% in healthy controls versus 35.9% in RA patients; P = 0.08). Conclusions/Significance The current study highlights the complexity and poor characterization of the FCGR3B gene sequence, indicating that the design and interpretation of genotyping assays based on specific probe sequences must be performed with caution. Nonetheless, we confirmed the presence of CNV and identified novel polymorphisms in the FCGR3B gene in the Dutch population. Although no association was found between RA and FCGR3B CNV, the possible protective effect of the -256A>TG indel polymorphism must be addressed in larger studies.
Collapse
|
113
|
Comprehensive copy number variant (CNV) analysis of neuronal pathways genes in psychiatric disorders identifies rare variants within patients. J Psychiatr Res 2010; 44:971-8. [PMID: 20398908 DOI: 10.1016/j.jpsychires.2010.03.007] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/22/2010] [Revised: 03/12/2010] [Accepted: 03/16/2010] [Indexed: 01/11/2023]
Abstract
BACKGROUND Copy number variations (CNV) have become an important source of human genome variability noteworthy to consider when studying genetic susceptibility to complex diseases. As recent studies have found evidences for the potential involvement of CNVs in psychiatric disorders, we have studied the dosage effect of structural genome variants as a possible susceptibility factor for different psychiatric disorders in a candidate gene approach. METHODS After selection of 68 psychiatric disorders' candidate genes overlapping with CNVs, MLPA assays were designed to determine changes in copy number of these genes. The studied sample consisted of 724 patients with psychiatric disorders (accounting for anxiety disorders, mood disorders, eating disorders and schizophrenia) and 341 control individuals. RESULTS CNVs were detected in 30 out of the 68 genes screened, indicating that a considerable proportion of neuronal pathways genes contain CNVs. When testing the overall burden of rare structural genomic variants in the different psychiatric disorders compared to control individuals, there was no statistically significant difference in the total amount of gains and losses. However, 14 out of the 30 changes were only found in psychiatric disorder patients but not in control individuals. These genes include GRM7, previously associated to major depression disorder and bipolar disorder, SLC6A13, in anxiety disorders, and S100B, SSTR5 and COMT in schizophrenia. CONCLUSIONS Although we have not been able to found a clear association between the studied CNVs and psychiatric disorders, the rare variants found only within the patients could account for a step further towards understanding the pathophysiology of psychiatric disorders.
Collapse
|
114
|
Agam A, Yalcin B, Bhomra A, Cubin M, Webber C, Holmes C, Flint J, Mott R. Elusive copy number variation in the mouse genome. PLoS One 2010; 5:e12839. [PMID: 20877625 PMCID: PMC2943477 DOI: 10.1371/journal.pone.0012839] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2010] [Accepted: 08/16/2010] [Indexed: 11/18/2022] Open
Abstract
Background Array comparative genomic hybridization (aCGH) to detect copy number variants (CNVs) in mammalian genomes has led to a growing awareness of the potential importance of this category of sequence variation as a cause of phenotypic variation. Yet there are large discrepancies between studies, so that the extent of the genome affected by CNVs is unknown. We combined molecular and aCGH analyses of CNVs in inbred mouse strains to investigate this question. Principal Findings Using a 2.1 million probe array we identified 1,477 deletions and 499 gains in 7 inbred mouse strains. Molecular characterization indicated that approximately one third of the CNVs detected by the array were false positives and we estimate the false negative rate to be more than 50%. We show that low concordance between studies is largely due to the molecular nature of CNVs, many of which consist of a series of smaller deletions and gains interspersed by regions where the DNA copy number is normal. Conclusions Our results indicate that CNVs detected by arrays may be the coincidental co-localization of smaller CNVs, whose presence is more likely to perturb an aCGH hybridization profile than the effect of an isolated, small, copy number alteration. Our findings help explain the hitherto unexplored discrepancies between array-based studies of copy number variation in the mouse genome.
Collapse
Affiliation(s)
- Avigail Agam
- Wellcome Trust Centre For Human Genetics, Oxford, United Kingdom.
| | | | | | | | | | | | | | | |
Collapse
|
115
|
Colobran R, Pedrosa E, Carretero-Iglesia L, Juan M. Copy number variation in chemokine superfamily: the complex scene of CCL3L-CCL4L genes in health and disease. Clin Exp Immunol 2010; 162:41-52. [PMID: 20659124 DOI: 10.1111/j.1365-2249.2010.04224.x] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Genome copy number changes (copy number variations: CNVs) include inherited, de novo and somatically acquired deviations from a diploid state within a particular chromosomal segment. CNVs are frequent in higher eukaryotes and associated with a substantial portion of inherited and acquired risk for various human diseases. CNVs are distributed widely in the genomes of apparently healthy individuals and thus constitute significant amounts of population-based genomic variation. Human CNV loci are enriched for immune genes and one of the most striking examples of CNV in humans involves a genomic region containing the chemokine genes CCL3L and CCL4L. The CCL3L-CCL4L copy number variable region (CNVR) shows extensive architectural complexity, with smaller CNVs within the larger ones and with interindividual variation in breakpoints. Furthermore, the individual genes embedded in this CNVR account for an additional level of genetic and mRNA complexity: CCL4L1 and CCL4L2 have identical exonic sequences but produce a different pattern of mRNAs. CCL3L2 was considered previously as a CCL3L1 pseudogene, but is actually transcribed. Since 2005, CCL3L-CCL4L CNV has been associated extensively with various human immunodeficiency virus-related outcomes, but some recent studies called these associations into question. This controversy may be due in part to the differences in alternative methods for quantifying gene copy number and differentiating the individual genes. This review summarizes and discusses the current knowledge about CCL3L-CCL4L CNV and points out that elucidating their complete phenotypic impact requires dissecting the combinatorial genomic complexity posed by various proportions of distinct CCL3L and CCL4L genes among individuals.
Collapse
Affiliation(s)
- R Colobran
- Laboratory of Immunobiology for Research and Application to Diagnosis (LIRAD), Tissue and Blood Bank (BST), Institut d'Investigació en Ciències de la Salut Germans Trias i Pujol (IGTP) Servei d'Immunologia, Centre de Diagnòstic Biomèdic (CDB), Hospital Clínic, IDIBAPS (Institut d'Investigacions Biomèdiques August Pi i Sunyer), Barcelona, Spain
| | | | | | | |
Collapse
|
116
|
Amar S, Ovadia O, Maier W, Ebstein R, Belmaker RH, Mishmar D, Agam G. Copy number variation of the SELENBP1 gene in schizophrenia. Behav Brain Funct 2010; 6:40. [PMID: 20615253 PMCID: PMC2915948 DOI: 10.1186/1744-9081-6-40] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2010] [Accepted: 07/08/2010] [Indexed: 12/12/2022] Open
Abstract
Background Schizophrenia is associated with rare copy-number (CN) mutations. Screening for such alleles genome-wide, though comprehensive, cannot study in-depth the causality of particular loci, therefore cannot provide the functional interpretation for the disease etiology. We hypothesized that CN mutations in the SELENBP1 locus could associate with the disorder and that these mutations could alter the gene product's activity in patients. Methods We analyzed SELENBP1 CN variation (CNV) in blood DNA from 49 schizophrenia patients and 49 controls (cohort A). Since CN of genes may vary among tissues, we investigated SELENBP1 CN in age- sex- and postmortem interval-matched cerebellar DNA samples from 14 patients and 14 controls (cohort B). Since CNV may either be de-novo or inherited we analyzed CNV of the SELENBP1 locus in blood DNA from 26 trios of schizophrenia probands and their healthy parents (cohort C). SELENBP1 mRNA levels were measured by real-time PCR. Results In cohort A reduced CN of the SELENBP1 locus was found in four patients but in none of the controls. In cohort B we found reduced CN of the SELENBP1 locus in two patients but in none of the controls. In cohort C three patients exhibited drastic CN reduction, not present in their parents, indicating de-novo mutation. A reduction in SELENBP1 mRNA levels in the postmortem cerebellar samples of schizophrenia patients was found. Conclusions We report a focused study of CN mutations in the selenium binding-protein1 (SELENBP1) locus previously linked with schizophrenia. We provide evidence for recurrence of decreased CN of the SELENBP1 locus in three unrelated patients' cohorts but not in controls, raising the possibility of functional involvement of these mutations in the etiology of the disease.
Collapse
Affiliation(s)
- Shirly Amar
- Psychiatry Research Unit, Faculty of Health Sciences, Ben-Gurion University of the Negev, and Mental Health Center, Beersheva, Israel
| | | | | | | | | | | | | |
Collapse
|
117
|
Wang Z, Dong X, Ding G, Li Y. Comparing the retention mechanisms of tandem duplicates and retrogenes in human and mouse genomes. Genet Sel Evol 2010; 42:24. [PMID: 20584267 PMCID: PMC2902415 DOI: 10.1186/1297-9686-42-24] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2010] [Accepted: 06/28/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Multiple models have been proposed to interpret the retention of duplicated genes. In this study, we attempted to compare whether the duplicates arising from tandem duplications and retropositions are retained by the same mechanisms in human and mouse genomes. RESULTS Both sequence and expression similarity analyses revealed that tandem duplicates tend to be more conserved, whereas retrogenes tend to be more divergent. The duplicability of tandem duplicates is also higher than that of retrogenes. However, positive selection seems to play significant roles in the retention of both types of duplicates. CONCLUSIONS We propose that dosage effect is more prevalent in the retention of tandem duplicates, while 'escape from adaptive conflict' (EAC) effect is more prevalent in the retention of retrogenes.
Collapse
Affiliation(s)
- Zhen Wang
- Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 320 Yueyang Road, Shanghai, PR China
| | | | | | | |
Collapse
|
118
|
Ostrovnaya I, Nanjangud G, Olshen AB. A classification model for distinguishing copy number variants from cancer-related alterations. BMC Bioinformatics 2010; 11:297. [PMID: 20525196 PMCID: PMC2897829 DOI: 10.1186/1471-2105-11-297] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2009] [Accepted: 06/02/2010] [Indexed: 01/17/2023] Open
Abstract
BACKGROUND Both somatic copy number alterations (CNAs) and germline copy number variants (CNVs) that are prevalent in healthy individuals can appear as recurrent changes in comparative genomic hybridization (CGH) analyses of tumors. In order to identify important cancer genes CNAs and CNVs must be distinguished. Although the Database of Genomic Variants (DGV) contains a list of all known CNVs, there is no standard methodology to use the database effectively. RESULTS We develop a prediction model that distinguishes CNVs from CNAs based on the information contained in the DGV and several other variables, including segment's length, height, closeness to a telomere or centromere and occurrence in other patients. The models are fitted on data from glioblastoma and their corresponding normal samples that were collected as part of The Cancer Genome Atlas project and hybridized to Agilent 244 K arrays. CONCLUSIONS Using the DGV alone CNVs in the test set can be correctly identified with about 85% accuracy if the outliers are removed before segmentation and with 72% accuracy if the outliers are included, and additional variables improve the prediction by about 2-3% and 12%, respectively. Final models applied to data from ovarian tumors have about 90% accuracy with all the variables and 86% accuracy with the DGV alone.
Collapse
Affiliation(s)
- Irina Ostrovnaya
- Department of Epidemiology and Biostatistics, Memorial Sloan-Kettering Cancer Center, New York, NY, USA.
| | | | | |
Collapse
|
119
|
Fadista J, Thomsen B, Holm LE, Bendixen C. Copy number variation in the bovine genome. BMC Genomics 2010; 11:284. [PMID: 20459598 PMCID: PMC2902221 DOI: 10.1186/1471-2164-11-284] [Citation(s) in RCA: 126] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2009] [Accepted: 05/06/2010] [Indexed: 12/12/2022] Open
Abstract
Background Copy number variations (CNVs), which represent a significant source of genetic diversity in mammals, have been shown to be associated with phenotypes of clinical relevance and to be causative of disease. Notwithstanding, little is known about the extent to which CNV contributes to genetic variation in cattle. Results We designed and used a set of NimbleGen CGH arrays that tile across the assayable portion of the cattle genome with approximately 6.3 million probes, at a median probe spacing of 301 bp. This study reports the highest resolution map of copy number variation in the cattle genome, with 304 CNV regions (CNVRs) being identified among the genomes of 20 bovine samples from 4 dairy and beef breeds. The CNVRs identified covered 0.68% (22 Mb) of the genome, and ranged in size from 1.7 to 2,031 kb (median size 16.7 kb). About 20% of the CNVs co-localized with segmental duplications, while 30% encompass genes, of which the majority is involved in environmental response. About 10% of the human orthologous of these genes are associated with human disease susceptibility and, hence, may have important phenotypic consequences. Conclusions Together, this analysis provides a useful resource for assessment of the impact of CNVs regarding variation in bovine health and production traits.
Collapse
Affiliation(s)
- João Fadista
- Group of Molecular Genetics and Systems Biology, Department of Genetics and Biotechnology, Faculty of Agricultural Sciences, Aarhus University, Blichers Allé 20, DK-8830 Tjele, Denmark
| | | | | | | |
Collapse
|
120
|
Ohnologs in the human genome are dosage balanced and frequently associated with disease. Proc Natl Acad Sci U S A 2010; 107:9270-4. [PMID: 20439718 DOI: 10.1073/pnas.0914697107] [Citation(s) in RCA: 197] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
About 30% of protein-coding genes in the human genome are related through two whole genome duplication (WGD) events. Although WGD is often credited with great evolutionary importance, the processes governing the retention of these genes and their biological significance remain unclear. One increasingly popular hypothesis is that dosage balance constraints are a major determinant of duplicate gene retention. We test this hypothesis and show that WGD-duplicated genes (ohnologs) have rarely experienced subsequent small-scale duplication (SSD) and are also refractory to copy number variation (CNV) in human populations and are thus likely to be sensitive to relative quantities (i.e., they are dosage-balanced). By contrast, genes that have experienced SSD in the vertebrate lineage are more likely to also display CNV. This supports the hypothesis of biased retention of dosage-balanced genes after WGD. We also show that ohnologs have a strong association with human disease. In particular, Down Syndrome (DS) caused by trisomy 21 is widely assumed to be caused by dosage effects, and 75% of previously reported candidate genes for this syndrome are ohnologs that experienced no other copy number changes. We propose the remaining dosage-balanced ohnologs on chromosome 21 as candidate DS genes. These observations clearly show a persistent resistance to dose changes in genes duplicated by WGD. Dosage balance constraints simultaneously explain duplicate gene retention and essentiality after WGD.
Collapse
|
121
|
Sandve SR, Fjellheim S. Did gene family expansions during the Eocene-Oligocene boundary climate cooling play a role in Pooideae adaptation to cool climates? Mol Ecol 2010; 19:2075-88. [PMID: 20406386 DOI: 10.1111/j.1365-294x.2010.04629.x] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Adaptation to cool environments is a common feature in the core group of the grass subfamily Pooideae (Triticeae and Poeae). This suggest an ancient evolutionary origin of low temperature stress tolerance dating back prior to the initiation of taxonomic divergence of core Pooideae species. Viewing the Pooideae evolution in a palaeo-climatic perspective reveals that taxonomic divergence of the core Pooideae group initiated shortly after a global super-cooling period at the Eocene-Oligocene boundary (approximately 33.5-26 Ma). This global climate cooling altered distributions of plants and animals and must have imposed selection pressure for improved low temperature stress responses. Lineage-specific gene family expansions are known to be involved in adaptation to new environmental stresses. In Pooideae, two gene families involved in low temperature stress response, the C-repeat binding factor (CBF) and fructosyl transferase (FT) gene families, has undergone lineage-specific expansions. We investigated the timing of these gene family expansions by molecular dating and found that Pooideae-specific expansion events in CBF and FT gene families took place during Eocene-Oligocene super-cooling period. We hypothesize that the E-O super-cooling exerted selection pressure for improved low temperature stress response and frost tolerance in a core Pooideae ancestor, and that those individuals with multiple copies of CBF and FT genes were favoured.
Collapse
Affiliation(s)
- Simen Rød Sandve
- Department of Plant and Environmental Sciences, Norwegian University of Life Sciences, PO Box 5003, 1431 Aas, Norway.
| | | |
Collapse
|
122
|
Birchler JA, Veitia RA. The gene balance hypothesis: implications for gene regulation, quantitative traits and evolution. THE NEW PHYTOLOGIST 2010; 186:54-62. [PMID: 19925558 PMCID: PMC2858765 DOI: 10.1111/j.1469-8137.2009.03087.x] [Citation(s) in RCA: 205] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
The gene balance hypothesis states that the stoichiometry of members of multisubunit complexes affects the function of the whole because of the kinetics and mode of assembly. Gene regulatory mechanisms also would be governed by these principles. Here, we review the impact of this concept with regard to the effects on the genetics of quantitative traits, the fate of duplication of genes following polyploidization events or segmental duplication, the basis of aneuploid syndromes, the constraints on cis and trans variation in gene regulation and the potential involvement in hybrid incompatibilities.
Collapse
Affiliation(s)
- James A Birchler
- Division of Biological Sciences, University of Missouri, Columbia, MO 65211, USA.
| | | |
Collapse
|
123
|
Dosage sensitivity shapes the evolution of copy-number varied regions. PLoS One 2010; 5:e9474. [PMID: 20224824 PMCID: PMC2835737 DOI: 10.1371/journal.pone.0009474] [Citation(s) in RCA: 80] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2009] [Accepted: 01/12/2010] [Indexed: 01/31/2023] Open
Abstract
Dosage sensitivity is an important evolutionary force which impacts on gene dispensability and duplicability. The newly available data on human copy-number variation (CNV) allow an analysis of the most recent and ongoing evolution. Provided that heterozygous gene deletions and duplications actually change gene dosage, we expect to observe negative selection against CNVs encompassing dosage sensitive genes. In this study, we make use of several sources of population genetic data to identify selection on structural variations of dosage sensitive genes. We show that CNVs can directly affect expression levels of contained genes. We find that genes encoding members of protein complexes exhibit limited expression variation and overlap significantly with a manually derived set of dosage sensitive genes. We show that complexes and other dosage sensitive genes are underrepresented in CNV regions, with a particular bias against frequent variations and duplications. These results suggest that dosage sensitivity is a significant force of negative selection on regions of copy-number variation.
Collapse
|
124
|
Laland KN, Odling-Smee J, Myles S. How culture shaped the human genome: bringing genetics and the human sciences together. Nat Rev Genet 2010; 11:137-48. [PMID: 20084086 DOI: 10.1038/nrg2734] [Citation(s) in RCA: 366] [Impact Index Per Article: 26.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
Researchers from diverse backgrounds are converging on the view that human evolution has been shaped by gene-culture interactions. Theoretical biologists have used population genetic models to demonstrate that cultural processes can have a profound effect on human evolution, and anthropologists are investigating cultural practices that modify current selection. These findings are supported by recent analyses of human genetic variation, which reveal that hundreds of genes have been subject to recent positive selection, often in response to human activities. Here, we collate these data, highlighting the considerable potential for cross-disciplinary exchange to provide novel insights into how culture has shaped the human genome.
Collapse
Affiliation(s)
- Kevin N Laland
- School of Biology, University of St Andrews, Bute Building, Westburn Lane, St Andrews, Fife KY16 9TS, UK.
| | | | | |
Collapse
|
125
|
Brunson T, Wang Q, Chambers I, Song Q. A copy number variation in human NCF1 and its pseudogenes. BMC Genet 2010; 11:13. [PMID: 20178640 PMCID: PMC2846862 DOI: 10.1186/1471-2156-11-13] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2009] [Accepted: 02/23/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Neutrophil cytosolic factor-1 (NCF1) is a component of NADPH oxidase. The NCF1 gene colocalizes with two pseudogenes (NCF1B and NCF1C). These two pseudogenes have a GT deletion in exon 2, resulting in a frameshift and an early stop codon. Here, we report a copy number variation (CNV) of the NCF1 pseudogenes and their alternative spliced expressions. RESULTS We examined three normal populations (86 individuals). We observed the 2:2:2 pattern (NCF1B:NCF1:NCF1C) in only 26 individuals. On average, each African- American has 1.4 +/- 0.8 (Mean +/- SD) copies of NCF1B and 2.3 +/- 0.6 copies of NCF1C; each Caucasian has 1.8 +/- 0.7 copies of NCF1B and 1.9 +/- 0.4 copies of NCF1C; and each Mexican has 1.6 +/- 0.6 copies of NCF1B and 1.0 +/- 0.4 copies of NCF1C. Mexicans have significantly less NCF1C copies than African-Americans (p = 6e-15) and Caucasians (p = 3e-11). Mendelian transmission of this CNV was observed in two CEPH pedigrees. Moreover, we cloned two alternative spliced transcripts generated from these two pseudogenes that adopt alternative exon-2 instead of their defective exon 2. The NCF1 pseudogene expression responded robustly to PMA induction during macrophage differentiation. NCF1B decreased from 32.9% to 8.3% in the cDNA pool transcribed from 3 gene copies. NCF1Psis also displayed distinct expression patterns in different human tissues. CONCLUSIONS Our results suggest that these two pseudogenes may adopt an alternative exon-2 in different tissues and in response to external stimuli. The GT deletion is insufficient to define them as functionless pseudogenes; this CNV may have biological relevance.
Collapse
Affiliation(s)
- Tiffany Brunson
- Cardiovascular Research Institute, Morehouse School of Medicine, Atlanta, Georgia, USA
| | | | | | | |
Collapse
|
126
|
Maydan JS, Lorch A, Edgley ML, Flibotte S, Moerman DG. Copy number variation in the genomes of twelve natural isolates of Caenorhabditis elegans. BMC Genomics 2010; 11:62. [PMID: 20100350 PMCID: PMC2822765 DOI: 10.1186/1471-2164-11-62] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2009] [Accepted: 01/25/2010] [Indexed: 11/23/2022] Open
Abstract
Background Copy number variation is an important component of genetic variation in higher eukaryotes. The extent of natural copy number variation in C. elegans is unknown outside of 2 highly divergent wild isolates and the canonical N2 Bristol strain. Results We have used array comparative genomic hybridization (aCGH) to detect copy number variation in the genomes of 12 natural isolates of Caenorhabditis elegans. Deletions relative to the canonical N2 strain are more common in these isolates than duplications, and indels are enriched in multigene families on the autosome arms. Among the strains in our study, the Hawaiian and Madeiran strains (CB4856 and JU258) carry the largest number of deletions, followed by the Vancouver strain (KR314). Overall we detected 510 different deletions affecting 1136 genes, or over 5% of the genes in the canonical N2 genome. The indels we identified had a median length of 2.7 kb. Since many deletions are found in multiple isolates, deletion loci were used as markers to derive an unrooted tree to estimate genetic relatedness among the strains. Conclusion Copy number variation is extensive in C. elegans, affecting over 5% of the genes in the genome. The deletions we have detected in natural isolates of C. elegans contribute significantly to the number of deletion alleles available to researchers. The relationships between strains are complex and different regions of the genome possess different genealogies due to recombination throughout the natural history of the species, which may not be apparent in studies utilizing smaller numbers of genetic markers.
Collapse
Affiliation(s)
- Jason S Maydan
- Department of Zoology, University of British Columbia, British Columbia, Canada
| | | | | | | | | |
Collapse
|
127
|
Gaunt TR, Rodriguez S, Guthrie PA, Day IN. An expectation-maximization program for determining allelic spectrum from CNV data (CoNVEM): insights into population allelic architecture and its mutational history. Hum Mutat 2010; 31:414-20. [DOI: 10.1002/humu.21199] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
128
|
Vucic EA, Thu KL, Williams AC, Lam WL, Coe BP. Copy number variations in the human genome and strategies for analysis. Methods Mol Biol 2010; 628:103-117. [PMID: 20238078 DOI: 10.1007/978-1-60327-367-1_6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
The structure and sequence of the genome is immensely variable in the human population. Segmental copy number variants (CNVs) contribute to the extensive phenotypic diversity among humans and have been shown to associate with disease susceptibility. In this article, we provide a detailed review of human genetic variations and the experimental approaches used to discover, catalog, and genotype CNVs.
Collapse
Affiliation(s)
- Emily A Vucic
- Department of Cancer Genetics and Developmental Biology, British Columbia Cancer Research Centre, Vancouver, BC, Canada
| | | | | | | | | |
Collapse
|
129
|
Enhanced fixation and preservation of a newly arisen duplicate gene by masking deleterious loss-of-function mutations. Genet Res (Camb) 2009; 91:267-80. [PMID: 19640322 DOI: 10.1017/s0016672309000196] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
Abstract
Segmental duplications are enriched within many eukaryote genomes, and their potential consequence is gene duplication. While previous theoretical studies of gene duplication have mainly focused on the gene silencing process after fixation, the process leading to fixation is even more important for segmental duplications, because the majority of duplications would be lost before reaching a significant frequency in a population. Here, by a series of computer simulations, we show that purifying selection against loss-of-function mutations increases the fixation probability of a new duplicate gene, especially when the gene is haplo-insufficient. Theoretically, the probability of simultaneous preservation of both duplicate genes becomes twice the loss-of-function mutation rate (u(c)) when the population size (N), the degree of dominance of mutations (h) and the recombination rate between the duplicate genes (c) are all sufficiently large (Nu(c)>1, h>0.1 and c>u(c)). The preservation probability declines rapidly with h and becomes 0 when h=0 (haplo-sufficiency). We infer that masking deleterious loss-of-function mutations give duplicate genes an immediate selective advantage and, together with effects of increased gene dosage, would predominantly determine the fates of the duplicate genes in the early phase of their evolution.
Collapse
|
130
|
Copy-number variants in neurodevelopmental disorders: promises and challenges. Trends Genet 2009; 25:536-44. [DOI: 10.1016/j.tig.2009.10.006] [Citation(s) in RCA: 102] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2009] [Revised: 10/14/2009] [Accepted: 10/15/2009] [Indexed: 02/01/2023]
|
131
|
Armengol L, Villatoro S, González JR, Pantano L, García-Aragonés M, Rabionet R, Cáceres M, Estivill X. Identification of copy number variants defining genomic differences among major human groups. PLoS One 2009; 4:e7230. [PMID: 19789632 PMCID: PMC2747275 DOI: 10.1371/journal.pone.0007230] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2009] [Accepted: 08/20/2009] [Indexed: 12/14/2022] Open
Abstract
Background Understanding the genetic contribution to phenotype variation of human groups is necessary to elucidate differences in disease predisposition and response to pharmaceutical treatments in different human populations. Methodology/Principal Findings We have investigated the genome-wide profile of structural variation on pooled samples from the three populations studied in the HapMap project by comparative genome hybridization (CGH) in different array platforms. We have identified and experimentally validated 33 genomic loci that show significant copy number differences from one population to the other. Interestingly, we found an enrichment of genes related to environment adaptation (immune response, lipid metabolism and extracellular space) within these regions and the study of expression data revealed that more than half of the copy number variants (CNVs) translate into gene-expression differences among populations, suggesting that they could have functional consequences. In addition, the identification of single nucleotide polymorphisms (SNPs) that are in linkage disequilibrium with the copy number alleles allowed us to detect evidences of population differentiation and recent selection at the nucleotide variation level. Conclusions Overall, our results provide a comprehensive view of relevant copy number changes that might play a role in phenotypic differences among major human populations, and generate a list of interesting candidates for future studies.
Collapse
Affiliation(s)
- Lluís Armengol
- Genetic Causes of Disease Group, Genes and Disease Program, Center for Genomic Regulation (CRG-UPF) and CIBERESP, Barcelona, Catalonia, Spain
- Quantitative Genomic Medicine Laboratories (qGenomics), Barcelona, Catalonia, Spain
| | - Sergi Villatoro
- Genetic Causes of Disease Group, Genes and Disease Program, Center for Genomic Regulation (CRG-UPF) and CIBERESP, Barcelona, Catalonia, Spain
| | - Juan R. González
- Center for Research in Environmental Epidemiology (CREAL), Barcelona, Catalonia, Spain
| | - Lorena Pantano
- Genetic Causes of Disease Group, Genes and Disease Program, Center for Genomic Regulation (CRG-UPF) and CIBERESP, Barcelona, Catalonia, Spain
| | - Manel García-Aragonés
- Genetic Causes of Disease Group, Genes and Disease Program, Center for Genomic Regulation (CRG-UPF) and CIBERESP, Barcelona, Catalonia, Spain
| | - Raquel Rabionet
- Genetic Causes of Disease Group, Genes and Disease Program, Center for Genomic Regulation (CRG-UPF) and CIBERESP, Barcelona, Catalonia, Spain
| | - Mario Cáceres
- Genetic Causes of Disease Group, Genes and Disease Program, Center for Genomic Regulation (CRG-UPF) and CIBERESP, Barcelona, Catalonia, Spain
| | - Xavier Estivill
- Genetic Causes of Disease Group, Genes and Disease Program, Center for Genomic Regulation (CRG-UPF) and CIBERESP, Barcelona, Catalonia, Spain
- Genetics Unit, Department of Health and Experimental Life Sciences, Pompeu Fabra University (UPF), Barcelona, Catalonia, Spain
- National Genotyping Center (CeGen) Barcelona Genotyping Node, Center for Genomic Regulation (CRG-UPF), Barcelona, Catalonia, Spain
- * E-mail:
| |
Collapse
|
132
|
Abstract
To take full advantage of the mouse as a model organism, it is essential to distinguish lineage-specific biology from what is shared between human and mouse. Investigations into shared genetic elements common to both have been well served by the draft human and mouse genome sequences. More recently, the virtually complete euchromatic sequences of the two reference genomes have been finished. These reveal a high ( approximately 5%) level of sequence duplications that had previously been recalcitrant to sequencing and assembly. Within these duplications lie large numbers of rodent- or primate-specific genes. In the present paper, we review the sequence properties of the two genomes, dwelling most on the duplications, deletions and insertions that separate each of them from their most recent common ancestor, approx. 90 million years ago. We consider the differences in gene numbers and repertoires between the two species, and speculate on their contributions to lineage-specific biology. Loss of ancient single-copy genes are rare, as are gains of new functional genes through retrotransposition. Instead, most changes to the gene repertoire have occurred in large multicopy families. It has been proposed that numbers of such 'environmental genes' rise and fall, and their sequences change, as adaptive responses to infection and other environmental pressures, including conspecific competition. Nevertheless, many such genes may be under little or no selection.
Collapse
|
133
|
Association of copy number variation in the FCGR3B gene with risk of autoimmune diseases. Genes Immun 2009; 11:155-60. [PMID: 19741716 DOI: 10.1038/gene.2009.71] [Citation(s) in RCA: 69] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Copy number variation (CNV) in the human genome is an important determinant of susceptibility to autoimmune diseases. Many autoimmune diseases share similar clinical and pathogenic features. Thus, CNVs of genes involved in immunity may serve as shared determinants of multiple autoimmune diseases. Here, we determined the association between CNV in the gene encoding FCGR3B with the risk of developing autoimmune diseases and whether the observed associations are modified by the CNV in CCL3L1 (CC chemokine ligand 3-like 1), a gene encoding a potent chemokine. In a cross-sectional study of 774 subjects, we estimated FCGR3B and CCL3L1 gene copy number in 146, 158 and 61 subjects with systemic lupus erythematosus (SLE), rheumatoid arthritis (RA) and primary Sjögren's syndrome (SS), respectively, and 409 healthy controls. The median gene dose of FCGR3B in the study population was two. FCGR3B copy number < or >2 was associated with an increased risk of SLE and primary SS but not RA. This association was mostly evident in subjects who also had two copies of CCL3L1. Thus, our data suggest that epistatic interactions between CNV of FCGR3B and CCL3L1, two immune response genes, may influence phenotypically related autoimmune diseases.
Collapse
|
134
|
Cheeseman IH, Gomez-Escobar N, Carret CK, Ivens A, Stewart LB, Tetteh KKA, Conway DJ. Gene copy number variation throughout the Plasmodium falciparum genome. BMC Genomics 2009; 10:353. [PMID: 19653891 PMCID: PMC2732925 DOI: 10.1186/1471-2164-10-353] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2009] [Accepted: 08/04/2009] [Indexed: 11/17/2022] Open
Abstract
Background Gene copy number variation (CNV) is responsible for several important phenotypes of the malaria parasite Plasmodium falciparum, including drug resistance, loss of infected erythrocyte cytoadherence and alteration of receptor usage for erythrocyte invasion. Despite the known effects of CNV, little is known about its extent throughout the genome. Results We performed a whole-genome survey of CNV genes in P. falciparum using comparative genome hybridisation of a diverse set of 16 laboratory culture-adapted isolates to a custom designed high density Affymetrix GeneChip array. Overall, 186 genes showed hybridisation signals consistent with deletion or amplification in one or more isolate. There is a strong association of CNV with gene length, genomic location, and low orthology to genes in other Plasmodium species. Sub-telomeric regions of all chromosomes are strongly associated with CNV genes independent from members of previously described multigene families. However, ~40% of CNV genes were located in more central regions of the chromosomes. Among the previously undescribed CNV genes, several that are of potential phenotypic relevance are identified. Conclusion CNV represents a major form of genetic variation within the P. falciparum genome; the distribution of gene features indicates the involvement of highly non-random mutational and selective processes. Additional studies should be directed at examining CNV in natural parasite populations to extend conclusions to clinical settings.
Collapse
Affiliation(s)
- Ian H Cheeseman
- Department of Infectious & Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, UK.
| | | | | | | | | | | | | |
Collapse
|
135
|
Alaerts M, Del-Favero J. Searching genetic risk factors for schizophrenia and bipolar disorder: learn from the past and back to the future. Hum Mutat 2009; 30:1139-52. [DOI: 10.1002/humu.21042] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
|
136
|
Hastings PJ, Lupski JR, Rosenberg SM, Ira G. Mechanisms of change in gene copy number. Nat Rev Genet 2009. [PMID: 19597530 DOI: 10.1038/nrg2593.mechanisms] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Deletions and duplications of chromosomal segments (copy number variants, CNVs) are a major source of variation between individual humans and are an underlying factor in human evolution and in many diseases, including mental illness, developmental disorders and cancer. CNVs form at a faster rate than other types of mutation, and seem to do so by similar mechanisms in bacteria, yeast and humans. Here we review current models of the mechanisms that cause copy number variation. Non-homologous end-joining mechanisms are well known, but recent models focus on perturbation of DNA replication and replication of non-contiguous DNA segments. For example, cellular stress might induce repair of broken replication forks to switch from high-fidelity homologous recombination to non-homologous repair, thus promoting copy number change.
Collapse
Affiliation(s)
- P J Hastings
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA.
| | | | | | | |
Collapse
|
137
|
Hastings PJ, Lupski JR, Rosenberg SM, Ira G. Mechanisms of change in gene copy number. Nat Rev Genet 2009; 10:551-64. [PMID: 19597530 DOI: 10.1038/nrg2593] [Citation(s) in RCA: 853] [Impact Index Per Article: 56.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
Deletions and duplications of chromosomal segments (copy number variants, CNVs) are a major source of variation between individual humans and are an underlying factor in human evolution and in many diseases, including mental illness, developmental disorders and cancer. CNVs form at a faster rate than other types of mutation, and seem to do so by similar mechanisms in bacteria, yeast and humans. Here we review current models of the mechanisms that cause copy number variation. Non-homologous end-joining mechanisms are well known, but recent models focus on perturbation of DNA replication and replication of non-contiguous DNA segments. For example, cellular stress might induce repair of broken replication forks to switch from high-fidelity homologous recombination to non-homologous repair, thus promoting copy number change.
Collapse
Affiliation(s)
- P J Hastings
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA.
| | | | | | | |
Collapse
|
138
|
Wu ZJ, Jin W. [Copy-number variation: a new pattern of structural diversity in genome]. YI CHUAN = HEREDITAS 2009; 31:339-47. [PMID: 19586885 DOI: 10.3724/sp.j.1005.2009.00339] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Copy number variation (CNV) is increasingly recognized as a source of inter-individual differences in genome sequence and has been proposed as a driving force for genome evolution and phenotypic variation. Many CNVs resulted in different levels of gene expression, which may account for a significant proportion of normal phenotypic variation and human diseases. This review unveiled the research process and study strategy of CNVs. Subsequently, the potential mechanisms of CNV formation and its clinical implications were discussed. In addition, the first-generation copy number variation map of the human genome was introduced, which demonstrated that DNA copy number variation was associated with specific chromosomal rearrangements and genomic disorders.
Collapse
Affiliation(s)
- Zhi-Jun Wu
- Department of Cardiology, Rui Jin Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200025, China.
| | | |
Collapse
|
139
|
Webber C, Hehir-Kwa JY, Nguyen DQ, de Vries BBA, Veltman JA, Ponting CP. Forging links between human mental retardation-associated CNVs and mouse gene knockout models. PLoS Genet 2009; 5:e1000531. [PMID: 19557186 PMCID: PMC2694283 DOI: 10.1371/journal.pgen.1000531] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2009] [Accepted: 05/22/2009] [Indexed: 12/20/2022] Open
Abstract
Rare copy number variants (CNVs) are frequently associated with common neurological disorders such as mental retardation (MR; learning disability), autism, and schizophrenia. CNV screening in clinical practice is limited because pathological CNVs cannot be distinguished routinely from benign CNVs, and because genes underlying patients' phenotypes remain largely unknown. Here, we present a novel, statistically robust approach that forges links between 148 MR–associated CNVs and phenotypes from ∼5,000 mouse gene knockout experiments. These CNVs were found to be significantly enriched in two classes of genes, those whose mouse orthologues, when disrupted, result in either abnormal axon or dopaminergic neuron morphologies. Additional enrichments highlighted correspondences between relevant mouse phenotypes and secondary presentations such as brain abnormality, cleft palate, and seizures. The strength of these phenotype enrichments (>100% increases) greatly exceeded molecular annotations (<30% increases) and allowed the identification of 78 genes that may contribute to MR and associated phenotypes. This study is the first to demonstrate how the power of mouse knockout data can be systematically exploited to better understand genetically heterogeneous neurological disorders. Mental retardation (MR; also known as learning disability) affects 1%–3% of people and is often associated with the presence of genomic copy number variations (CNVs) such as deletions and duplications. Most of these CNVs are rare and they often involve tens, sometimes hundreds, of genes. Pinpointing exactly which particular gene or genes are responsible for MR in an individual patient is therefore challenging and limits diagnostic applications. In this study, the functions of genes present within a large collection of MR–associated CNVs were investigated by comparing them to data from large-scale mouse knock-out experiments. We found that MR–associated CNVs contain greater than expected numbers of genes that give specific nervous system phenotypes when disrupted in the mouse. Not only does this study confirm that CNVs frequently cause MR, but it narrows down the list of genes whose changes lead to this disorder from thousands to several dozen. This reduced list of genes brings wide-spread genetic testing for MR one step closer. It also provides a better understanding of the biology behind MR that could, eventually, yield medical treatments.
Collapse
Affiliation(s)
- Caleb Webber
- MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
| | - Jayne Y. Hehir-Kwa
- Department of Human Genetics, Nijmegen Centre for Molecular Life Sciences, Radboud University Nijmegen Medical Centre, Nijmegen, The Netherlands
| | - Duc-Quang Nguyen
- MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
| | - Bert B. A. de Vries
- Department of Human Genetics, Nijmegen Centre for Molecular Life Sciences, Radboud University Nijmegen Medical Centre, Nijmegen, The Netherlands
| | - Joris A. Veltman
- Department of Human Genetics, Nijmegen Centre for Molecular Life Sciences, Radboud University Nijmegen Medical Centre, Nijmegen, The Netherlands
- * E-mail: (JV); (CPP)
| | - Chris P. Ponting
- MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
- * E-mail: (JV); (CPP)
| |
Collapse
|
140
|
Faddah DA, Ganko EW, McCoach C, Pickrell JK, Hanlon SE, Mann FG, Mieczkowska JO, Jones CD, Lieb JD, Vision TJ. Systematic identification of balanced transposition polymorphisms in Saccharomyces cerevisiae. PLoS Genet 2009; 5:e1000502. [PMID: 19503594 PMCID: PMC2682701 DOI: 10.1371/journal.pgen.1000502] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2008] [Accepted: 05/04/2009] [Indexed: 01/22/2023] Open
Abstract
High-throughput techniques for detecting DNA polymorphisms generally do not identify changes in which the genomic position of a sequence, but not its copy number, varies among individuals. To explore such balanced structural polymorphisms, we used array-based Comparative Genomic Hybridization (aCGH) to conduct a genome-wide screen for single-copy genomic segments that occupy different genomic positions in the standard laboratory strain of Saccharomyces cerevisiae (S90) and a polymorphic wild isolate (Y101) through analysis of six tetrads from a cross of these two strains. Paired-end high-throughput sequencing of Y101 validated four of the predicted rearrangements. The transposed segments contained one to four annotated genes each, yet crosses between S90 and Y101 yielded mostly viable tetrads. The longest segment comprised 13.5 kb near the telomere of chromosome XV in the S288C reference strain and Southern blotting confirmed its predicted location on chromosome IX in Y101. Interestingly, inter-locus crossover events between copies of this segment occurred at a detectable rate. The presence of low-copy repetitive sequences at the junctions of this segment suggests that it may have arisen through ectopic recombination. Our methodology and findings provide a starting point for exploring the origins, phenotypic consequences, and evolutionary fate of this largely unexplored form of genomic polymorphism. Balanced structural polymorphisms are differences in the relative arrangement of genomic features within species that do not affect DNA copy number. Little is known about their prevalence or importance because they are difficult to observe. Here, we present a novel methodology for systematically identifying such polymorphisms based on the idea that single-copy DNA that occupies different genomic locations in two parents will segregate independently during meiosis and will therefore reveal itself as a copy number difference among a fraction of progeny. Comparative hybridization reveals multiple balanced structural polymorphisms that involve changes to gene order in two strains of yeast; the results are independently validated using paired-end whole genome shotgun sequencing. The longest transposed segment we identify comprises 13.5 kb near the telomere of chromosome XV in the S288C reference strain and contains several annotated genes. We map the location of this polymorphism in the non-reference strain using genome-wide genotypic data, which also reveals an appreciable frequency of ectopic recombination among transposed segment pairs. The breakpoints of the remaining polymorphisms are localized by the paired-end sequence data. Our work provides proof-of-principle for a very general approach to systematically identify all balanced genomic polymorphisms in two different genotypes and is a starting point for understanding the frequency, evolutionary origins, and functional consequences of this seldom-studied class of genomic structural variation in eukaryotes.
Collapse
Affiliation(s)
- Dina A. Faddah
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Eric W. Ganko
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Caroline McCoach
- Department of Biochemistry, Stanford University, Stanford, California, United States of America
| | - Joseph K. Pickrell
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Sean E. Hanlon
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Frederick G. Mann
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Joanna O. Mieczkowska
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Corbin D. Jones
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Jason D. Lieb
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- * E-mail: (JDL); (TJV)
| | - Todd J. Vision
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- * E-mail: (JDL); (TJV)
| |
Collapse
|
141
|
Zhao Y, Marotta M, Eichler EE, Eng C, Tanaka H. Linkage disequilibrium between two high-frequency deletion polymorphisms: implications for association studies involving the glutathione-S transferase (GST) genes. PLoS Genet 2009; 5:e1000472. [PMID: 19424424 PMCID: PMC2672168 DOI: 10.1371/journal.pgen.1000472] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2008] [Accepted: 04/07/2009] [Indexed: 11/18/2022] Open
Abstract
Copy number variations (CNVs) represent a large source of genetic variation in humans and have been increasingly studied for disease association. A deletion polymorphism of the gene encoding the cytosolic detoxification enzyme glutathione S-transferase theta 1 (GSTT1) has been extensively studied for cancer susceptibility (919 studies, from HuGE navigator, http://www.hugenavigator.net/). However, clear conclusions have not been reached. Since the GSTT1 gene is located within a genomic region of segmental duplications (SD), there may be a confounding effect from another, yet-uncharacterized CNV at the same locus. Here we describe a previously uncharacterized 38-kilo-base (kb) long deletion polymorphism of GSTT2B located within a 61-kb DNA inverted repeat. GSTT2B is a duplicated copy of GSTT2, the only paralogue of GSTT1 in humans. A newly developed PCR assay revealed that a microhomology-mediated breakpoint appears to be shared among individuals at high frequency. The GSTT2B deletion polymorphism was in strong linkage disequilibrium (LD) (D′ = 0.841) with the neighboring GSTT1 deletion polymorphism in the Caucasian population. Alleles harboring a single deletion were significantly overrepresented (p = 2.22×10−16), suggesting a selection against alleles with both deletions. The deletion alleles are almost certainly the derived ones, because the GSTT2B-GSTT2-GSTT1 genes were strictly retained in chimpanzees. Extremely low GSTT2 mRNA expression was associated with the GSTT2B deletion, suggesting an influence of the deletion on the flanking region and loss of GSTT2 function. Genome-wide LD analysis between deletion polymorphisms further points to the uniqueness of two deletions, because strong LD between deletion polymorphisms might be very rare in humans. These results show a complex genomic organization and unexpected biological functions of CNVs within segmental duplications and emphasize the importance of detailed structural characterization for disease association studies. Common diseases such as cancer are caused by interactions between multiple genetic and environmental factors. Glutathione S-transferases (GST) are key enzymes in eliminating carcinogens and harmful macromolecules from cells. Based on the assumption that individuals who do not have a particular type of GST genes are susceptible to cancers, a number of studies have been conducted to find a link between GST genotypes and cancer. However such associations remain inconclusive to date. Because GST genes are clustered in repetitive, complex regions in the genome, other previously uncharacterized variations/polymorphisms may have had an impact on the data. We describe here such a genotype, a 37-kb deletion of GSTT2B gene that is found very frequently among humans. The neighboring GSTT2 gene expression is greatly impaired by the GSTT2B deletion, conferring a potentially null allele at GSTT2. The GSTT2B deletion is non-randomly associated with another high frequency deletion of the GSTT1 gene. Therefore, a detailed characterization of this complex region of the genome revealed unexpected genetic and biological interactions of large deletion polymorphisms; this is essential to consider in future disease association studies.
Collapse
Affiliation(s)
- Yongzhong Zhao
- Department of Molecular Genetics, Cleveland Clinic, Cleveland, Ohio, United States of America
- Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, United States of America
| | - Michael Marotta
- Department of Molecular Genetics, Cleveland Clinic, Cleveland, Ohio, United States of America
- Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, United States of America
| | - Evan E. Eichler
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
- Howard Hughes Medical Institute, University of Washington, Seattle, Washington, United States of America
| | - Charis Eng
- Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, United States of America
- Genomic Medicine Institute, Cleveland Clinic, Cleveland, Ohio, United States of America
- Taussig Cancer Institute, Cleveland Clinic, Cleveland, Ohio, United States of America
- Department of Genetics, Case Western Reserve University School of Medicine, Cleveland, Ohio, United States of America
- CASE Comprehensive Cancer Center, Case Western Reserve University School of Medicine, Cleveland, Ohio, United States of America
| | - Hisashi Tanaka
- Department of Molecular Genetics, Cleveland Clinic, Cleveland, Ohio, United States of America
- Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, United States of America
- * E-mail:
| |
Collapse
|
142
|
Luo M, Cui X, Fredman D, Brookes AJ, Azaro MA, Greenawalt DM, Hu G, Wang HY, Tereshchenko IV, Lin Y, Shentu Y, Gao R, Shen L, Li H. Genetic structures of copy number variants revealed by genotyping single sperm. PLoS One 2009; 4:e5236. [PMID: 19384415 PMCID: PMC2668179 DOI: 10.1371/journal.pone.0005236] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2008] [Accepted: 03/12/2009] [Indexed: 11/19/2022] Open
Abstract
Background Copy number variants (CNVs) occupy a significant portion of the human genome and may have important roles in meiotic recombination, human genome evolution and gene expression. Many genetic diseases may be underlain by CNVs. However, because of the presence of their multiple copies, variability in copy numbers and the diploidy of the human genome, detailed genetic structure of CNVs cannot be readily studied by available techniques. Methodology/Principal Findings Single sperm samples were used as the primary subjects for the study so that CNV haplotypes in the sperm donors could be studied individually. Forty-eight CNVs characterized in a previous study were analyzed using a microarray-based high-throughput genotyping method after multiplex amplification. Seventeen single nucleotide polymorphisms (SNPs) were also included as controls. Two single-base variants, either allelic or paralogous, could be discriminated for all markers. Microarray data were used to resolve SNP alleles and CNV haplotypes, to quantitatively assess the numbers and compositions of the paralogous segments in each CNV haplotype. Conclusions/Significance This is the first study of the genetic structure of CNVs on a large scale. Resulting information may help understand evolution of the human genome, gain insight into many genetic processes, and discriminate between CNVs and SNPs. The highly sensitive high-throughput experimental system with haploid sperm samples as subjects may be used to facilitate detailed large-scale CNV analysis.
Collapse
Affiliation(s)
- Minjie Luo
- Department of Molecular Genetics, Microbiology, and Immunology/The Cancer Institute of New Jersey, University of Medicine and Dentistry of New Jersey Robert Wood Johnson Medical School, Piscataway, New Jersey, United States of America
| | - Xiangfeng Cui
- Department of Molecular Genetics, Microbiology, and Immunology/The Cancer Institute of New Jersey, University of Medicine and Dentistry of New Jersey Robert Wood Johnson Medical School, Piscataway, New Jersey, United States of America
| | - David Fredman
- Bergen Center for Computational Science, University of Bergen, Bergen, Norway
| | - Anthony J. Brookes
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - Marco A. Azaro
- Department of Molecular Genetics, Microbiology, and Immunology/The Cancer Institute of New Jersey, University of Medicine and Dentistry of New Jersey Robert Wood Johnson Medical School, Piscataway, New Jersey, United States of America
| | - Danielle M. Greenawalt
- Department of Molecular Genetics, Microbiology, and Immunology/The Cancer Institute of New Jersey, University of Medicine and Dentistry of New Jersey Robert Wood Johnson Medical School, Piscataway, New Jersey, United States of America
| | - Guohong Hu
- Department of Molecular Genetics, Microbiology, and Immunology/The Cancer Institute of New Jersey, University of Medicine and Dentistry of New Jersey Robert Wood Johnson Medical School, Piscataway, New Jersey, United States of America
| | - Hui-Yun Wang
- Department of Molecular Genetics, Microbiology, and Immunology/The Cancer Institute of New Jersey, University of Medicine and Dentistry of New Jersey Robert Wood Johnson Medical School, Piscataway, New Jersey, United States of America
| | - Irina V. Tereshchenko
- Department of Molecular Genetics, Microbiology, and Immunology/The Cancer Institute of New Jersey, University of Medicine and Dentistry of New Jersey Robert Wood Johnson Medical School, Piscataway, New Jersey, United States of America
| | - Yong Lin
- Department of Biometry, University of Medicine and Dentistry of New Jersey Robert Wood Johnson Medical School, Piscataway, New Jersey, United States of America
| | - Yue Shentu
- Department of Statistics, Rutgers University, Hill Center for the Mathematical Sciences, Piscataway, New Jersey, United States of America
| | - Richeng Gao
- Department of Molecular Genetics, Microbiology, and Immunology/The Cancer Institute of New Jersey, University of Medicine and Dentistry of New Jersey Robert Wood Johnson Medical School, Piscataway, New Jersey, United States of America
| | - Li Shen
- Department of Molecular Genetics, Microbiology, and Immunology/The Cancer Institute of New Jersey, University of Medicine and Dentistry of New Jersey Robert Wood Johnson Medical School, Piscataway, New Jersey, United States of America
| | - Honghua Li
- Department of Molecular Genetics, Microbiology, and Immunology/The Cancer Institute of New Jersey, University of Medicine and Dentistry of New Jersey Robert Wood Johnson Medical School, Piscataway, New Jersey, United States of America
- * E-mail:
| |
Collapse
|
143
|
Henrichsen CN, Chaignat E, Reymond A. Copy number variants, diseases and gene expression. Hum Mol Genet 2009; 18:R1-8. [PMID: 19297395 DOI: 10.1093/hmg/ddp011] [Citation(s) in RCA: 292] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Copy number variation (CNV) has recently gained considerable interest as a source of genetic variation likely to play a role in phenotypic diversity and evolution. Much effort has been put into the identification and mapping of regions that vary in copy number among seemingly normal individuals in humans and a number of model organisms, using bioinformatics or hybridization-based methods. These have allowed uncovering associations between copy number changes and complex diseases in whole-genome association studies, as well as identify new genomic disorders. At the genome-wide scale, however, the functional impact of CNV remains poorly studied. Here we review the current catalogs of CNVs, their association with diseases and how they link genotype and phenotype. We describe initial evidence which revealed that genes in CNV regions are expressed at lower and more variable levels than genes mapping elsewhere, and also that CNV not only affects the expression of genes varying in copy number, but also have a global influence on the transcriptome. Further studies are warranted for complete cataloguing and fine mapping of CNVs, as well as to elucidate the different mechanisms by which they influence gene expression.
Collapse
Affiliation(s)
- Charlotte N Henrichsen
- The Center for Integrative Genomics, Genopode Building, University of Lausanne, Lausanne, Switzerland
| | | | | |
Collapse
|
144
|
Abstract
The olfactory receptor gene (OR) superfamily is the largest in the human genome. The superfamily contains 390 putatively functional genes and 465 pseudogenes arranged into 18 gene families and 300 subfamilies. Even members within the same subfamily are often located on different chromosomes. OR genes are located on all autosomes except chromosome 20, plus the X chromosome but not the Y chromosome. The gene:pseudogene ratio is lowest in human, higher in chimpanzee and highest in rat and mouse -- most likely reflecting the greater need of olfaction for survival in the rodent than in the human. The OR genes undergo allelic exclusion, each sensory neurone expressing usually only one odourant receptor allele; the mechanism by which this phenomenon is regulated is not yet understood. The nomenclature system (based on evolutionary divergence of genes into families and subfamilies of the OR gene superfamily) has been designed similarly to that originally used for the CYP gene superfamily.
Collapse
Affiliation(s)
- Tsviya Olender
- The Crown Human Genome Center, Department of Molecular Genetics, The Weizmann Institute of Science, Rehovot, Israel
| | | | | |
Collapse
|
145
|
Cutter AD, Dey A, Murray RL. Evolution of the Caenorhabditis elegans genome. Mol Biol Evol 2009; 26:1199-234. [PMID: 19289596 DOI: 10.1093/molbev/msp048] [Citation(s) in RCA: 90] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
A fundamental problem in genome biology is to elucidate the evolutionary forces responsible for generating nonrandom patterns of genome organization. As the first metazoan to benefit from full-genome sequencing, Caenorhabditis elegans has been at the forefront of research in this area. Studies of genomic patterns, and their evolutionary underpinnings, continue to be augmented by the recent push to obtain additional full-genome sequences of related Caenorhabditis taxa. In the near future, we expect to see major advances with the onset of whole-genome resequencing of multiple wild individuals of the same species. In this review, we synthesize many of the important insights to date in our understanding of genome organization and function that derive from the evolutionary principles made explicit by theoretical population genetics and molecular evolution and highlight fertile areas for future research on unanswered questions in C. elegans genome evolution. We call attention to the need for C. elegans researchers to generate and critically assess nonadaptive hypotheses for genomic and developmental patterns, in addition to adaptive scenarios. We also emphasize the potential importance of evolution in the gonochoristic (female and male) ancestors of the androdioecious (hermaphrodite and male) C. elegans as the source for many of its genomic and developmental patterns.
Collapse
Affiliation(s)
- Asher D Cutter
- Department of Ecology & Evolutionary Biology and the Centre for the Analysis of Genome Evolution and Function, University of Toronto, Toronto, Ontario, Canada.
| | | | | |
Collapse
|
146
|
Bailey JA, Kidd JM, Eichler EE. Human copy number polymorphic genes. Cytogenet Genome Res 2009; 123:234-43. [PMID: 19287160 DOI: 10.1159/000184713] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/20/2008] [Indexed: 11/19/2022] Open
Abstract
Recent large-scale genomic studies within human populations have identified numerous genomic regions as copy number variant (CNV). As these CNV regions often overlap coding regions of the genome, large lists of potentially copy number polymorphic genes have been produced that are candidates for disease association. Most of the current data regarding normal genic variation, however, has been generated using BAC or SNP microarrays, which lack precision especially with respect to exons. To address this, we assessed 2,790 candidate CNV genes defined from available studies in nine well-characterized HapMap individuals by designing a customized oligonucleotide microarray targeted specifically to exons. Using exon array comparative genomic hybridization (aCGH), we detected 255 (9%) of the candidates as true CNVs including 134 with evidence of variation over the entire gene. Individuals differed in copy number from the control by an average of 100 gene loci. Both partial- and whole-gene CNVs were strongly associated with segmental duplications (55 and 71%, respectively) as well as regions of positive selection. We confirmed 37% of the whole-gene CNVs using the fosmid end sequence pair (ESP) structural variation map for these same individuals. If we modify the end sequence pair mapping strategy to include low-sequence identity ESPs (98-99.5%) and ESPs with an everted orientation, we can capture 82% of the missed genes leading to more complete ascertainment of structural variation within duplicated genes. Our results indicate that segmental duplications are the source of the majority of full-length copy number polymorphic genes, most of the variant genes are organized as tandem duplications, and a significant fraction of these genes will represent paralogs with levels of sequence diversity beyond thresholds of allelic variation. In addition, these data provide a targeted set of CNV genes enriched for regions likely to be associated with human phenotypic differences due to copy number changes and present a source of copy number responsive oligonucleotide probes for future association studies.
Collapse
Affiliation(s)
- J A Bailey
- Department of Pathology, Case Western University School of Medicine and University Hospitals of Cleveland, Cleveland, OH, USA.
| | | | | |
Collapse
|
147
|
Perry GH. The evolutionary significance of copy number variation in the human genome. Cytogenet Genome Res 2009; 123:283-7. [PMID: 19287166 DOI: 10.1159/000184719] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/22/2008] [Indexed: 12/27/2022] Open
Abstract
Copy number variation provides the raw material for gene family expansion and diversification, which is an important evolutionary force. Moreover, copy number variants (CNVs) can influence gene transcriptional and translational levels and have been associated with complex disease susceptibility. Therefore, natural selection may have affected at least some of the greater than one thousand CNVs thus far discovered among the genomes of phenotypically normal humans. While identifying and understanding particular instances of natural selection may shed light on important aspects of human evolutionary history, our ability to analyze CNVs in traditional population genetic frameworks has been limited. However, progress has been made by adapting some of these frameworks for use with copy number data. Moving forward, these efforts will be aided by non-human organism studies of the population genetics of copy number variation, and by more direct comparisons of within-species copy number variation and between-species copy number fixation.
Collapse
Affiliation(s)
- G H Perry
- Department of Human Genetics, University of Chicago, Chicago, IL, USA.
| |
Collapse
|
148
|
Lachman HM. Copy variations in schizophrenia and bipolar disorder. Cytogenet Genome Res 2009; 123:27-35. [PMID: 19287136 DOI: 10.1159/000184689] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/30/2008] [Indexed: 01/19/2023] Open
Abstract
The analysis of copy number variations (CNVs) is an emerging tool for identifying genetic factors underlying complex traits. In this chapter I will review studies that have been carried out showing that CNVs play a role in the development of two such complex traits; schizophrenia (SZ) and bipolar disorder (BD). There are two aspects to consider regarding the role of copy variations in these conditions. One is gene discovery in which DNA from patients is analyzed for the purpose of identifying rare, patient-specific CNVs that may be informative to a larger population of affected individuals. The model for this concept is based on the emergence of DISC1 as a SZ candidate gene, which was discovered in a single informative family with a rare chromosomal translocation. Another aspect revolves around the idea that polymorphic CNVs found in the general population, many of which appear to disrupt previously identified SZ and BD candidate genes, contribute to disease pathogenesis. Here, gene-disrupting CNVs are viewed in the same manner as functional SNPs and analyzed for involvement in disease susceptibility using genetic association. Although the analysis of CNVs in patients with psychiatric disorders is in its infancy, informative new findings have already been made, suggesting that this is a very promising line of research.
Collapse
Affiliation(s)
- H M Lachman
- Department of Psychiatry and Behavioral Sciences, Division of Basic Research Albert Einstein College of Medicine, Bronx, New York, USA.
| |
Collapse
|
149
|
de Smith AJ, Walters RG, Froguel P, Blakemore AI. Human genes involved in copy number variation: mechanisms of origin, functional effects and implications for disease. Cytogenet Genome Res 2009; 123:17-26. [PMID: 19287135 DOI: 10.1159/000184688] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/04/2008] [Indexed: 11/19/2022] Open
Abstract
Copy number variants (CNVs) overlap over 7000 genes, many of which are pivotal in biological pathways. The implications of this are profound, with consequences for evolutionary studies, population genetics, gene function and human phenotype, including elucidation of genetic susceptibility to major common diseases, the heritability of which has thus far defied full explanation. Even though this research is still in its infancy, CNVs have already been associated with a number of monogenic, syndromic and complex diseases: the development of high throughput and high resolution techniques for CNV screening is likely to bring further new insights into the contribution of copy number variation to common diseases. Amongst genes overlapped by CNVs, significant enrichments for certain gene ontology categories have been identified, including those related to immune responses and interactions with the environment. Genes in both of these categories are thought to be important in evolutionary adaptation and to be particular targets of natural selection. Thus, a full appreciation of copy number variation may be important for our understanding of human evolution.
Collapse
Affiliation(s)
- A J de Smith
- Section of Genomic Medicine, Imperial College London, Hammersmith Hospital, London, UK
| | | | | | | |
Collapse
|
150
|
Schaschl H, Aitman TJ, Vyse TJ. Copy number variation in the human genome and its implication in autoimmunity. Clin Exp Immunol 2009; 156:12-6. [PMID: 19220326 DOI: 10.1111/j.1365-2249.2008.03865.x] [Citation(s) in RCA: 69] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open
Abstract
The causes of autoimmune disease remain poorly defined. However, it is known that genetic factors contribute to disease susceptibility. Hitherto, studies have focused upon single nucleotide polymorphisms as both tools for mapping and as probable causal variants. Recent studies, using genome-wide analytical techniques, have revealed that, in the genome, segments of DNA ranging in size from kilobases to megabases can vary in copy number. These changes of DNA copy number represent an important element of genomic polymorphism in humans and in other species and may therefore make a substantial contribution to phenotypic variation and population differentiation. Furthermore, copy number variation (CNV) in genomic regions harbouring dosage-sensitive genes may cause or predispose to a variety of human genetic diseases. Several recent studies have reported an association between CNV and autoimmunity in humans such as systemic lupus, psoriasis, Crohn's disease, rheumatoid arthritis and type 1 diabetes. The use of novel analytical techniques facilitates the study of complex human genomic structures such as CNV, and allows new susceptibility loci for autoimmunity to be found that are not readily mappable by single nucleotide polymorphism-based association analyses alone.
Collapse
Affiliation(s)
- H Schaschl
- Imperial College London, Faculty of Medicine, Section of Molecular Genetics and Rheumatology, Hammersmith Campus, London, UK
| | | | | |
Collapse
|