451
|
Illuminating the functional and structural repertoire of human TBC/RABGAPs. Nat Rev Mol Cell Biol 2012; 13:67-73. [PMID: 22251903 DOI: 10.1038/nrm3267] [Citation(s) in RCA: 124] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
The Tre2-Bub2-Cdc16 (TBC) domain-containing RAB-specific GTPase-activating proteins (TBC/RABGAPs) are characterized by the presence of highly conserved TBC domains and act as negative regulators of RABs. The importance of TBC/RABGAPs in the regulation of specific intracellular trafficking routes is now emerging, as is their role in different diseases. Importantly, TBC/RABGAPs act as key regulatory nodes, integrating signalling between RABs and other small GTPases and ensuring the appropriate retrieval, transport and delivery of different intracellular vesicles.
Collapse
|
452
|
Pereira R, Phillips C, Pinto N, Santos C, dos Santos SEB, Amorim A, Carracedo Á, Gusmão L. Straightforward inference of ancestry and admixture proportions through ancestry-informative insertion deletion multiplexing. PLoS One 2012; 7:e29684. [PMID: 22272242 PMCID: PMC3260179 DOI: 10.1371/journal.pone.0029684] [Citation(s) in RCA: 189] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2011] [Accepted: 12/02/2011] [Indexed: 02/06/2023] Open
Abstract
Ancestry-informative markers (AIMs) show high allele frequency divergence between different ancestral or geographically distant populations. These genetic markers are especially useful in inferring the likely ancestral origin of an individual or estimating the apportionment of ancestry components in admixed individuals or populations. The study of AIMs is of great interest in clinical genetics research, particularly to detect and correct for population substructure effects in case-control association studies, but also in population and forensic genetics studies. This work presents a set of 46 ancestry-informative insertion deletion polymorphisms selected to efficiently measure population admixture proportions of four different origins (African, European, East Asian and Native American). All markers are analyzed in short fragments (under 230 basepairs) through a single PCR followed by capillary electrophoresis (CE) allowing a very simple one tube PCR-to-CE approach. HGDP-CEPH diversity panel samples from the four groups, together with Oceanians, were genotyped to evaluate the efficiency of the assay in clustering populations from different continental origins and to establish reference databases. In addition, other populations from diverse geographic origins were tested using the HGDP-CEPH samples as reference data. The results revealed that the AIM-INDEL set developed is highly efficient at inferring the ancestry of individuals and provides good estimates of ancestry proportions at the population level. In conclusion, we have optimized the multiplexed genotyping of 46 AIM-INDELs in a simple and informative assay, enabling a more straightforward alternative to the commonly available AIM-SNP typing methods dependent on complex, multi-step protocols or implementation of large-scale genotyping technologies.
Collapse
Affiliation(s)
- Rui Pereira
- IPATIMUP – Institute of Molecular Pathology and Immunology of the University of Porto, Porto, Portugal
- Institute of Forensic Sciences Luis Concheiro, University of Santiago de Compostela, Santiago de Compostela, Spain
| | - Christopher Phillips
- Institute of Forensic Sciences Luis Concheiro, University of Santiago de Compostela, Santiago de Compostela, Spain
| | - Nádia Pinto
- IPATIMUP – Institute of Molecular Pathology and Immunology of the University of Porto, Porto, Portugal
- Faculty of Sciences, University of Porto, Porto, Portugal
- Mathematics Research Centre, University of Porto, Porto, Portugal
| | - Carla Santos
- Institute of Forensic Sciences Luis Concheiro, University of Santiago de Compostela, Santiago de Compostela, Spain
| | | | - António Amorim
- IPATIMUP – Institute of Molecular Pathology and Immunology of the University of Porto, Porto, Portugal
- Faculty of Sciences, University of Porto, Porto, Portugal
| | - Ángel Carracedo
- Institute of Forensic Sciences Luis Concheiro, University of Santiago de Compostela, Santiago de Compostela, Spain
- Genomics Medicine Group, CIBERER, University of Santiago de Compostela, Santiago de Compostela, Spain
| | - Leonor Gusmão
- IPATIMUP – Institute of Molecular Pathology and Immunology of the University of Porto, Porto, Portugal
| |
Collapse
|
453
|
Reese ML, Boyle JP. Virulence without catalysis: how can a pseudokinase affect host cell signaling? Trends Parasitol 2012; 28:53-7. [PMID: 22257555 DOI: 10.1016/j.pt.2011.12.004] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2011] [Revised: 12/26/2011] [Accepted: 12/29/2011] [Indexed: 10/14/2022]
Abstract
A hallmark of the pathogenic lifestyle is the secretion of enzymes and other effectors that dysregulate host signaling. Intriguingly, the most potent virulence locus identified in the intracellular parasite Toxoplasma gondii encodes a family of related catalytically inactive protein kinases, or pseudokinases. Toxoplasma has in its kinome among the highest percentage of pseudokinases among all sequenced organisms, and the majority of these appear to be secreted into the host cell. We posit that the pseudokinase fold represents a particularly well-suited domain for functional diversification, discuss the relevance of gene expansion at these loci, and outline potential mechanisms by which a pseudokinase might affect host signaling.
Collapse
Affiliation(s)
- Michael L Reese
- Stanford University, Department of Microbiology and Immunology, 299 Campus Drive, Stanford, CA 94305-5124, USA.
| | | |
Collapse
|
454
|
Combination of Micronutrients for Bone (COMB) Study: bone density after micronutrient intervention. JOURNAL OF ENVIRONMENTAL AND PUBLIC HEALTH 2012; 2012:354151. [PMID: 22291722 PMCID: PMC3265100 DOI: 10.1155/2012/354151] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/10/2011] [Accepted: 10/04/2011] [Indexed: 12/31/2022]
Abstract
Along with other investigations, patients presenting to an environmental health clinic with various chronic conditions were assessed for bone health status. Individuals with compromised bone strength were educated about skeletal health issues and provided with therapeutic options for potential amelioration of their bone health. Patients who declined pharmacotherapy or who previously experienced failure of drug treatment were offered other options including supplemental micronutrients identified in the medical literature as sometimes having a positive impact on bone mineral density (BMD). After 12 months of consecutive supplemental micronutrient therapy with a combination that included vitamin D(3), vitamin K(2), strontium, magnesium and docosahexaenoic acid (DHA), repeat bone densitometry was performed. The results were analyzed in a group of compliant patients and demonstrate improved BMD in patients classified with normal, osteopenic and osteoporotic bone density. According to the results, this combined micronutrient supplementation regimen appears to be at least as effective as bisphosphonates or strontium ranelate in raising BMD levels in hip, spine, and femoral neck sites. No fractures occurred in the group taking the micronutrient protocol. This micronutrient regimen also appears to show efficacy in individuals where bisphosphonate therapy was previously unsuccessful in maintaining or raising BMD. Prospective clinical trials are required to confirm efficacy.
Collapse
|
455
|
Poaty H, Coullin P, Peko JF, Dessen P, Diatta AL, Valent A, Leguern E, Prévot S, Gombé-Mbalawa C, Candelier JJ, Picard JY, Bernheim A. Genome-wide high-resolution aCGH analysis of gestational choriocarcinomas. PLoS One 2012; 7:e29426. [PMID: 22253721 PMCID: PMC3253784 DOI: 10.1371/journal.pone.0029426] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2011] [Accepted: 11/28/2011] [Indexed: 11/28/2022] Open
Abstract
Eleven samples of DNA from choriocarcinomas were studied by high resolution CGH-array 244 K. They were studied after histopathological confirmation of the diagnosis, of the androgenic etiology and after a microsatellite marker analysis confirming the absence of contamination of tumor DNA from maternal DNA. Three cell lines, BeWo, JAR, JEG were also studied by this high resolution pangenomic technique. According to aCGH analysis, the de novo choriocarcinomas exhibited simple chromosomal rearrangements or normal profiles. The cell lines showed various and complex chromosomal aberrations. 23 Minimal Critical Regions were defined that allowed us to list the genes that were potentially implicated. Among them, unusually high numbers of microRNA clusters and imprinted genes were observed.
Collapse
Affiliation(s)
- Henriette Poaty
- INSERM U985, Institut Gustave Roussy, Villejuif, France
- Université Paris XI, Paris Sud, Orsay, France
- INSERM U782, Endocrinologie et génétique de la reproduction et du développement, Clamart, France
| | - Philippe Coullin
- Université Paris XI, Paris Sud, Orsay, France
- INSERM U782, Endocrinologie et génétique de la reproduction et du développement, Clamart, France
| | - Jean Félix Peko
- Service de carcinologie, service d'anatomie et de pathologie, CHU Brazzaville, Congo
| | - Philippe Dessen
- INSERM U985, Institut Gustave Roussy, Villejuif, France
- Université Paris XI, Paris Sud, Orsay, France
| | - Ange Lucien Diatta
- Laboratoire de cytogénétique et de la reproduction, service d'obstétrique, Hôpital A. Le Dantec, Dakar, Sénégal
| | | | - Eric Leguern
- UF de neurogénétique moléculaire et cellulaire, Hôpital de la Salpêtrière, Paris, France
| | - Sophie Prévot
- Université Paris XI, Paris Sud, Orsay, France
- INSERM U782, Endocrinologie et génétique de la reproduction et du développement, Clamart, France
| | - Charles Gombé-Mbalawa
- Service de carcinologie, service d'anatomie et de pathologie, CHU Brazzaville, Congo
| | - Jean-Jacques Candelier
- Université Paris XI, Paris Sud, Orsay, France
- INSERM U782, Endocrinologie et génétique de la reproduction et du développement, Clamart, France
| | - Jean-Yves Picard
- Université Paris XI, Paris Sud, Orsay, France
- INSERM U782, Endocrinologie et génétique de la reproduction et du développement, Clamart, France
| | - Alain Bernheim
- INSERM U985, Institut Gustave Roussy, Villejuif, France
- Université Paris XI, Paris Sud, Orsay, France
- Molecular Pathology, Institut Gustave Roussy, Villejuif, France
- * E-mail:
| |
Collapse
|
456
|
Khalak HG, Wakil SM, Imtiaz F, Ramzan K, Baz B, Almostafa A, Hagos S, Alzahrani F, Abu-Dhaim N, Abu Safieh L, Al-Jbali L, Al-Hamed MS, Monies D, Aldahmesh M, Al-Dosari MS, Kaya N, Shamseldin H, Shaheen R, Al-Rashed M, Hashem M, Al-Tassan N, Meyer B, Alazami AM, Alkuraya FS. Autozygome maps dispensable DNA and reveals potential selective bias against nullizygosity. Genet Med 2012; 14:515-9. [PMID: 22241088 DOI: 10.1038/gim.2011.28] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
PURPOSE Copy number variants are an important source of human genome diversity. The widespread distribution of hemizygous copy number variants in the DNA of healthy humans suggests that haploinsufficiency is largely tolerated. However, little is known about the extent to which corresponding nullizygosity (two-copy deletion) is similarly tolerated. METHODS We analyzed a cohort of first cousin unions to enrich for shared parental hemizygous events and tested their Mendelian inheritance in offspring. RESULTS Analysis of autozygous DNA blocks (autozygome) in the offspring not only proved an efficient method of mapping "dispensable" DNA but also revealed potential selective bias against the occurrence of nullizygous changes. This bias was not restricted to genic copy number variants and was not accounted for by a high rate of miscarriages. CONCLUSIONS The autozygome is an efficient way to map dispensable segments of DNA and may reveal selective bias against nullizygosity in healthy individuals.
Collapse
Affiliation(s)
- Hanif G Khalak
- Department of Genetics, King Faisal Specialist Hospital and Research Center, Riyadh, Saudi Arabia
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
457
|
Besenbacher S, Mailund T, Schierup MH. Association mapping and disease: evolutionary perspectives. Methods Mol Biol 2012; 856:275-91. [PMID: 22399463 DOI: 10.1007/978-1-61779-585-5_11] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
In this chapter, we give a short introduction to the genetics of complex disease with special emphasis on evolutionary models for disease genes and the effect of different models on the genetic architecture, and finally give a survey of the state-of-the-art of genome-wide association studies.
Collapse
|
458
|
Scott SA, Patel M, Martis S, Lubitz SA, van der Zee S, Yoo C, Edelmann L, Halperin JL, Desnick RJ. Copy number variation and warfarin dosing: evaluation of CYP2C9, VKORC1, CYP4F2, GGCX and CALU. Pharmacogenomics 2011; 13:297-307. [PMID: 22188360 DOI: 10.2217/pgs.11.156] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
AIM To determine if copy number variants contribute to warfarin dose requirements, we investigated CYP2C9, VKORC1, CYP4F2, GGCX and CALU for deletions and duplications in a multiethnic patient population treated with therapeutic doses of warfarin. PATIENTS & METHODS DNA samples from 178 patients were subjected to copy number analyses by multiplex ligation-dependent probe amplification or quantitative PCR assays. Additionally, the CYP2C9 exon 8 insertion/deletion polymorphism (rs71668942) was examined among the patient cohort and 1750 additional multiethnic healthy individuals. RESULTS All patients carried two copies of CYP2C9 by multiplex ligation-dependent probe amplification and no exon 8 deletion carriers were detected. Similarly, quantitative PCR assays for VKORC1, CYP4F2, GGCX and CALU identified two copies in all populations. CONCLUSION These data indicate that copy number variants in the principal genes involved in warfarin dose variability (CYP2C9, VKORC1), including genes with lesser effect (CYP4F2, GGCX), and those that may be more relevant among certain racial groups (CALU), are rare in multiethnic populations, including African-Americans.
Collapse
Affiliation(s)
- Stuart A Scott
- Department of Genetics & Genomic Sciences, Box 1497, Mount Sinai School of Medicine, 1428 Madison Avenue, New York, NY 10029, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
459
|
Uddin M, Sturge M, Peddle L, O'Rielly DD, Rahman P. Genome-wide signatures of 'rearrangement hotspots' within segmental duplications in humans. PLoS One 2011; 6:e28853. [PMID: 22194928 PMCID: PMC3237539 DOI: 10.1371/journal.pone.0028853] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2011] [Accepted: 11/16/2011] [Indexed: 11/19/2022] Open
Abstract
The primary objective of this study was to create a genome-wide high resolution map (i.e., >100 bp) of ‘rearrangement hotspots’ which can facilitate the identification of regions capable of mediating de novo deletions or duplications in humans. A hierarchical method was employed to fragment segmental duplications (SDs) into multiple smaller SD units. Combining an end space free pairwise alignment algorithm with a ‘seed and extend’ approach, we have exhaustively searched 409 million alignments to detect complex structural rearrangements within the reference-guided assembly of the NA18507 human genome (18× coverage), including the previously identified novel 4.8 Mb sequence from de novo assembly within this genome. We have identified 1,963 rearrangement hotspots within SDs which encompass 166 genes and display an enrichment of duplicated gene nucleotide variants (DNVs). These regions are correlated with increased non-allelic homologous recombination (NAHR) event frequency which presumably represents the origin of copy number variations (CNVs) and pathogenic duplications/deletions. Analysis revealed that 20% of the detected hotspots are clustered within the proximal and distal SD breakpoints flanked by the pathogenic deletions/duplications that have been mapped for 24 NAHR-mediated genomic disorders. FISH Validation of selected complex regions revealed 94% concordance with in silico localization of the highly homologous derivatives. Other results from this study indicate that intra-chromosomal recombination is enhanced in genic compared with agenic duplicated regions, and that gene desert regions comprising SDs may represent reservoirs for creation of novel genes. The generation of genome-wide signatures of ‘rearrangement hotspots’, which likely serve as templates for NAHR, may provide a powerful approach towards understanding the underlying mutational mechanism(s) for development of constitutional and acquired diseases.
Collapse
Affiliation(s)
- Mohammed Uddin
- Faculty of Medicine, Discipline of Medicine and Genetics, Memorial University, St. John's, Newfoundland, Canada
| | - Mitch Sturge
- Faculty of Medicine, Discipline of Medicine and Genetics, Memorial University, St. John's, Newfoundland, Canada
| | - Lynette Peddle
- Faculty of Medicine, Discipline of Medicine and Genetics, Memorial University, St. John's, Newfoundland, Canada
| | - Darren D. O'Rielly
- Faculty of Medicine, Discipline of Medicine and Genetics, Memorial University, St. John's, Newfoundland, Canada
| | - Proton Rahman
- Faculty of Medicine, Discipline of Medicine and Genetics, Memorial University, St. John's, Newfoundland, Canada
- * E-mail:
| |
Collapse
|
460
|
A commentary on Implication of gene copy number variation in health and diseases. J Hum Genet 2011; 57:79-80. [DOI: 10.1038/jhg.2011.140] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
|
461
|
Lilleoja R, Sarapik A, Reimann E, Reemann P, Jaakma Ü, Vasar E, Kõks S. Sequencing and annotated analysis of an Estonian human genome. Gene 2011; 493:69-76. [PMID: 22138481 DOI: 10.1016/j.gene.2011.11.022] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2011] [Accepted: 11/14/2011] [Indexed: 10/14/2022]
Abstract
In present study we describe the sequencing and annotated analysis of the individual genome of Estonian. Using SOLID technology we generated 2,449,441,916 of 50-bp reads. The Bioscope version 1.3 was used for mapping and pairing of reads to the NCBI human genome reference (build 36, hg18). Bioscope enables also the annotation of the results of variant (tertiary) analysis. The average mapping of reads was 75.5% with total coverage of 107.72 Gb. resulting in mean fold coverage of 34.6. We found 3,482,975 SNPs out of which 352,492 were novel. 21,222 SNPs were in coding region: 10,649 were synonymous SNPs, 10,360 were nonsynonymous missense SNPs, 155 were nonsynonymous nonsense SNPs and 58 were nonsynonymous frameshifts. We identified 219 CNVs with total base pair coverage of 37,326,300 bp and 87,451 large insertion/deletion polymorphisms covering 10,152,256 bp of the genome. In addition, we found 285,864 small size insertion/deletion polymorphisms out of which 133,969 were novel. Finally, we identified 53 inversions, 19 overlapped genes and 2 overlapped exons. Interestingly, we found the region in chromosome 6 to be enriched with the coding SNPs and CNVs. This study confirms previous findings, that our genomes are more complex and variable as thought before. Therefore, sequencing of the personal genomes followed by annotation would improve the analysis of heritability of phenotypes and our understandings on the functions of genome.
Collapse
Affiliation(s)
- Rutt Lilleoja
- Department of Reproductive Biology, Estonian University of Life Sciences, Estonia
| | | | | | | | | | | | | |
Collapse
|
462
|
Mills JD, Janitz M. Alternative splicing of mRNA in the molecular pathology of neurodegenerative diseases. Neurobiol Aging 2011; 33:1012.e11-24. [PMID: 22118946 DOI: 10.1016/j.neurobiolaging.2011.10.030] [Citation(s) in RCA: 83] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2011] [Revised: 10/23/2011] [Accepted: 10/24/2011] [Indexed: 12/15/2022]
Abstract
Alternative splicing (AS) is a post-transcriptional process that occurs in multiexon genes, and errors in this process have been implicated in many human diseases. Until recently, technological limitations prevented AS from being examined at the genome-wide scale. With the advent of new technologies, including exon arrays and next-generation sequencing (NGS) techniques (e.g., RNA-Seq), a higher resolution view of the human transcriptome is now available. This is particularly applicable in the study of neurodegenerative brain diseases (NBDs), such as Alzheimer's disease and Parkinson's disease, because the brain has the greatest amount of alternative splicing of all human tissues. Although many of the AS events associated with these disorders were initially identified using low-throughput methodologies, genome-wide analysis allows for more in-depth studies, marking a new chapter in transcript exploration. In this review, the latest technologies used to study the transcriptome and the AS genes that have been associated with a number of neurodegenerative brain diseases are discussed.
Collapse
Affiliation(s)
- James Dominic Mills
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales 2052, Australia
| | | |
Collapse
|
463
|
Characterising chromosome rearrangements: recent technical advances in molecular cytogenetics. Heredity (Edinb) 2011; 108:75-85. [PMID: 22086080 PMCID: PMC3238113 DOI: 10.1038/hdy.2011.100] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
Genomic rearrangements can result in losses, amplifications, translocations and inversions of DNA fragments thereby modifying genome architecture, and potentially having clinical consequences. Many genomic disorders caused by structural variation have initially been uncovered by early cytogenetic methods. The last decade has seen significant progression in molecular cytogenetic techniques, allowing rapid and precise detection of structural rearrangements on a whole-genome scale. The high resolution attainable with these recently developed techniques has also uncovered the role of structural variants in normal genetic variation alongside single-nucleotide polymorphisms (SNPs). We describe how array-based comparative genomic hybridisation, SNP arrays, array painting and next-generation sequencing analytical methods (read depth, read pair and split read) allow the extensive characterisation of chromosome rearrangements in human genomes.
Collapse
|
464
|
Wang RT, Ahn S, Park CC, Khan AH, Lange K, Smith DJ. Effects of genome-wide copy number variation on expression in mammalian cells. BMC Genomics 2011; 12:562. [PMID: 22085887 PMCID: PMC3287593 DOI: 10.1186/1471-2164-12-562] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2011] [Accepted: 11/16/2011] [Indexed: 11/17/2022] Open
Abstract
Background There is only a limited understanding of the relation between copy number and expression for mammalian genes. We fine mapped cis and trans regulatory loci due to copy number change for essentially all genes using a human-hamster radiation hybrid (RH) panel. These loci are called copy number expression quantitative trait loci (ceQTLs). Results Unexpected findings from a previous study of a mouse-hamster RH panel were replicated. These findings included decreased expression as a result of increased copy number for 30% of genes and an attenuated relationship between expression and copy number on the X chromosome suggesting an Xist independent form of dosage compensation. In a separate glioblastoma dataset, we found conservation of genes in which dosage was negatively correlated with gene expression. These genes were enriched in signaling and receptor activities. The observation of attenuated X-linked gene expression in response to increased gene number was also replicated in the glioblastoma dataset. Of 523 gene deserts of size > 600 kb in the human RH panel, 325 contained trans ceQTLs with -log10 P > 4.1. Recently discovered genes, ultra conserved regions, noncoding RNAs and microRNAs explained only a small fraction of the results, suggesting a substantial portion of gene deserts harbor as yet unidentified functional elements. Conclusion Radiation hybrids are a useful tool for high resolution mapping of cis and trans loci capable of affecting gene expression due to copy number change. Analysis of two independent radiation hybrid panels show agreement in their findings and may serve as a discovery source for novel regulatory loci in noncoding regions of the genome.
Collapse
Affiliation(s)
- Richard T Wang
- Department of Molecular and Medical Pharmacology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
| | | | | | | | | | | |
Collapse
|
465
|
Yamashita R, Sugano S, Suzuki Y, Nakai K. DBTSS: DataBase of Transcriptional Start Sites progress report in 2012. Nucleic Acids Res 2011; 40:D150-4. [PMID: 22086958 PMCID: PMC3245115 DOI: 10.1093/nar/gkr1005] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
To support transcriptional regulation studies, we have constructed DBTSS (DataBase of Transcriptional Start Sites), which contains exact positions of transcriptional start sites (TSSs), determined with our own technique named TSS-seq, in the genomes of various species. In its latest version, DBTSS covers the data of the majority of human adult and embryonic tissues: it now contains 418 million TSS tag sequences from 28 tissues/cell cultures. Moreover, we integrated a series of our own transcriptomic data, such as the RNA-seq data of subcellular-fractionated RNAs as well as the ChIP-seq data of histone modifications and the binding of RNA polymerase II/several transcription factors in cultured cell lines into our original TSS information. We also included several external epigenomic data, such as the chromatin map of the ENCODE project. We further associated our TSS information with public or original single-nucleotide variation (SNV) data, in order to identify SNVs in the regulatory regions. These data can be browsed in our new viewer, which supports versatile search conditions of users. We believe that our new DBTSS will be an invaluable resource for interpreting the differential uses of TSSs and for identifying human genetic variations that are associated with disordered transcriptional regulation. DBTSS can be accessed at http://dbtss.hgc.jp.
Collapse
Affiliation(s)
- Riu Yamashita
- Frontier Research Initiative, Institute of Medical Science, Human Genome Center, Institute of Medical Science, The University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo 108-8639, Japan
| | | | | | | |
Collapse
|
466
|
Hindson BJ, Ness KD, Masquelier DA, Belgrader P, Heredia NJ, Makarewicz AJ, Bright IJ, Lucero MY, Hiddessen AL, Legler TC, Kitano TK, Hodel MR, Petersen JF, Wyatt PW, Steenblock ER, Shah PH, Bousse LJ, Troup CB, Mellen JC, Wittmann DK, Erndt NG, Cauley TH, Koehler RT, So AP, Dube S, Rose KA, Montesclaros L, Wang S, Stumbo DP, Hodges SP, Romine S, Milanovich FP, White HE, Regan JF, Karlin-Neumann GA, Hindson CM, Saxonov S, Colston BW. High-throughput droplet digital PCR system for absolute quantitation of DNA copy number. Anal Chem 2011. [PMID: 22035192 DOI: 10.1021/ac202028g+[doi]] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Digital PCR enables the absolute quantitation of nucleic acids in a sample. The lack of scalable and practical technologies for digital PCR implementation has hampered the widespread adoption of this inherently powerful technique. Here we describe a high-throughput droplet digital PCR (ddPCR) system that enables processing of ~2 million PCR reactions using conventional TaqMan assays with a 96-well plate workflow. Three applications demonstrate that the massive partitioning afforded by our ddPCR system provides orders of magnitude more precision and sensitivity than real-time PCR. First, we show the accurate measurement of germline copy number variation. Second, for rare alleles, we show sensitive detection of mutant DNA in a 100,000-fold excess of wildtype background. Third, we demonstrate absolute quantitation of circulating fetal and maternal DNA from cell-free plasma. We anticipate this ddPCR system will allow researchers to explore complex genetic landscapes, discover and validate new disease associations, and define a new era of molecular diagnostics.
Collapse
Affiliation(s)
- Benjamin J Hindson
- Bio-Rad Laboratories, Inc., Pleasanton, California 94566, United States.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
467
|
Hindson BJ, Ness KD, Masquelier DA, Belgrader P, Heredia NJ, Makarewicz AJ, Bright IJ, Lucero MY, Hiddessen AL, Legler TC, Kitano TK, Hodel MR, Petersen JF, Wyatt PW, Steenblock ER, Shah PH, Bousse LJ, Troup CB, Mellen JC, Wittmann DK, Erndt NG, Cauley TH, Koehler RT, So AP, Dube S, Rose KA, Montesclaros L, Wang S, Stumbo DP, Hodges SP, Romine S, Milanovich FP, White HE, Regan JF, Karlin-Neumann GA, Hindson CM, Saxonov S, Colston BW. High-throughput droplet digital PCR system for absolute quantitation of DNA copy number. Anal Chem 2011; 83:8604-10. [PMID: 22035192 PMCID: PMC3216358 DOI: 10.1021/ac202028g] [Citation(s) in RCA: 1857] [Impact Index Per Article: 142.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2011] [Accepted: 10/05/2011] [Indexed: 11/30/2022]
Abstract
Digital PCR enables the absolute quantitation of nucleic acids in a sample. The lack of scalable and practical technologies for digital PCR implementation has hampered the widespread adoption of this inherently powerful technique. Here we describe a high-throughput droplet digital PCR (ddPCR) system that enables processing of ~2 million PCR reactions using conventional TaqMan assays with a 96-well plate workflow. Three applications demonstrate that the massive partitioning afforded by our ddPCR system provides orders of magnitude more precision and sensitivity than real-time PCR. First, we show the accurate measurement of germline copy number variation. Second, for rare alleles, we show sensitive detection of mutant DNA in a 100,000-fold excess of wildtype background. Third, we demonstrate absolute quantitation of circulating fetal and maternal DNA from cell-free plasma. We anticipate this ddPCR system will allow researchers to explore complex genetic landscapes, discover and validate new disease associations, and define a new era of molecular diagnostics.
Collapse
Affiliation(s)
- Benjamin J Hindson
- Bio-Rad Laboratories, Inc., Pleasanton, California 94566, United States.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
468
|
Lou H, Li S, Yang Y, Kang L, Zhang X, Jin W, Wu B, Jin L, Xu S. A map of copy number variations in Chinese populations. PLoS One 2011; 6:e27341. [PMID: 22087296 PMCID: PMC3210162 DOI: 10.1371/journal.pone.0027341] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2011] [Accepted: 10/14/2011] [Indexed: 12/02/2022] Open
Abstract
It has been shown that the human genome contains extensive copy number variations (CNVs). Investigating the medical and evolutionary impacts of CNVs requires the knowledge of locations, sizes and frequency distribution of them within and between populations. However, CNV study of Chinese minorities, which harbor the majority of genetic diversity of Chinese populations, has been underrepresented considering the same efforts in other populations. Here we constructed, to our knowledge, a first CNV map in seven Chinese populations representing the major linguistic groups in China with 1,440 CNV regions identified using Affymetrix SNP 6.0 Array. Considerable differences in distributions of CNV regions between populations and substantial population structures were observed. We showed that ∼35% of CNV regions identified in minority ethnic groups are not shared by Han Chinese population, indicating that the contribution of the minorities to genetic architecture of Chinese population could not be ignored. We further identified highly differentiated CNV regions between populations. For example, a common deletion in Dong and Zhuang (44.4% and 50%), which overlaps two keratin-associated protein genes contributing to the structure of hair fibers, was not observed in Han Chinese. Interestingly, the most differentiated CNV deletion between HapMap CEU and YRI containing CCL3L1 gene reported in previous studies was also the highest differentiated regions between Tibetan and other populations. Besides, by jointly analyzing CNVs and SNPs, we found a CNV region containing gene CTDSPL were in almost perfect linkage disequilibrium between flanking SNPs in Tibetan while not in other populations except HapMap CHD. Furthermore, we found the SNP taggability of CNVs in Chinese populations was much lower than that in European populations. Our results suggest the necessity of a full characterization of CNVs in Chinese populations, and the CNV map we constructed serves as a useful resource in further evolutionary and medical studies.
Collapse
Affiliation(s)
- Haiyi Lou
- Chinese Academy of Sciences Key Laboratory of Computational Biology, Chinese Academy of Sciences and Max Planck Society (CAS-MPG) Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Shilin Li
- Ministry of Education (MOE) Key Laboratory of Contemporary Anthropology, School of Life Sciences and Institutes of Biomedical Sciences, Fudan University, Shanghai, China
| | - Yajun Yang
- Ministry of Education (MOE) Key Laboratory of Contemporary Anthropology, School of Life Sciences and Institutes of Biomedical Sciences, Fudan University, Shanghai, China
| | - Longli Kang
- Ministry of Education (MOE) Key Laboratory of Contemporary Anthropology, School of Life Sciences and Institutes of Biomedical Sciences, Fudan University, Shanghai, China
| | - Xin Zhang
- Ministry of Education (MOE) Key Laboratory of Contemporary Anthropology, School of Life Sciences and Institutes of Biomedical Sciences, Fudan University, Shanghai, China
| | - Wenfei Jin
- Chinese Academy of Sciences Key Laboratory of Computational Biology, Chinese Academy of Sciences and Max Planck Society (CAS-MPG) Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Bailin Wu
- Ministry of Education (MOE) Key Laboratory of Contemporary Anthropology, School of Life Sciences and Institutes of Biomedical Sciences, Fudan University, Shanghai, China
- Children's Hospital Boston, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Li Jin
- Chinese Academy of Sciences Key Laboratory of Computational Biology, Chinese Academy of Sciences and Max Planck Society (CAS-MPG) Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
- Ministry of Education (MOE) Key Laboratory of Contemporary Anthropology, School of Life Sciences and Institutes of Biomedical Sciences, Fudan University, Shanghai, China
| | - Shuhua Xu
- Chinese Academy of Sciences Key Laboratory of Computational Biology, Chinese Academy of Sciences and Max Planck Society (CAS-MPG) Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
| |
Collapse
|
469
|
Kasaian K, Jones SJ. A new frontier in personalized cancer therapy: mapping molecular changes. Future Oncol 2011; 7:873-94. [PMID: 21732758 DOI: 10.2217/fon.11.63] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Mutations in the genome of a normal cell can affect the function of its many genes and pathways. These alterations could eventually transform the cell from a normal to a malignant state by allowing an uncontrolled proliferation of the cell and formation of a cancer tumor. Each tumor in an individual patient can have hundreds of mutated genes and perturbed pathways. Cancers clinically presenting as the same type or subtype could potentially be very different at the molecular level and thus behave differently in response to therapy. The challenge is to distinguish the key mutations driving the cancer from the background of mutational noise and find ways to effectively target them. The promise is that such a molecular approach to classifying cancer will lead to better diagnostic, prognostic and personalized treatment strategies. This article provides an overview of advances in the molecular characterization of cancers and their applications in therapy.
Collapse
Affiliation(s)
- Katayoon Kasaian
- Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada
| | | |
Collapse
|
470
|
Ames RM, Money D, Ghatge VP, Whelan S, Lovell SC. Determining the evolutionary history of gene families. ACTA ACUST UNITED AC 2011; 28:48-55. [PMID: 22039210 DOI: 10.1093/bioinformatics/btr592] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
MOTIVATION Recent large-scale studies of individuals within a population have demonstrated that there is widespread variation in copy number in many gene families. In addition, there is increasing evidence that the variation in gene copy number can give rise to substantial phenotypic effects. In some cases, these variations have been shown to be adaptive. These observations show that a full understanding of the evolution of biological function requires an understanding of gene gain and gene loss. Accurate, robust evolutionary models of gain and loss events are, therefore, required. RESULTS We have developed weighted parsimony and maximum likelihood methods for inferring gain and loss events. To test these methods, we have used Markov models of gain and loss to simulate data with known properties. We examine three models: a simple birth-death model, a single rate model and a birth-death innovation model with parameters estimated from Drosophila genome data. We find that for all simulations maximum likelihood-based methods are very accurate for reconstructing the number of duplication events on the phylogenetic tree, and that maximum likelihood and weighted parsimony have similar accuracy for reconstructing the ancestral state. Our implementations are robust to different model parameters and provide accurate inferences of ancestral states and the number of gain and loss events. For ancestral reconstruction, we recommend weighted parsimony because it has similar accuracy to maximum likelihood, but is much faster. For inferring the number of individual gene loss or gain events, maximum likelihood is noticeably more accurate, albeit at greater computational cost. AVAILABILITY www.bioinf.manchester.ac.uk/dupliphy CONTACT simon.lovell@manchester.ac.uk; simon.whelan@manchester.ac.uk SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ryan M Ames
- Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester M13 9PL, UK
| | | | | | | | | |
Collapse
|
471
|
Hindson BJ, Ness KD, Masquelier DA, Belgrader P, Heredia NJ, Makarewicz AJ, Bright IJ, Lucero MY, Hiddessen AL, Legler TC, Kitano TK, Hodel MR, Petersen JF, Wyatt PW, Steenblock ER, Shah PH, Bousse LJ, Troup CB, Mellen JC, Wittmann DK, Erndt NG, Cauley TH, Koehler RT, So AP, Dube S, Rose KA, Montesclaros L, Wang S, Stumbo DP, Hodges SP, Romine S, Milanovich FP, White HE, Regan JF, Karlin-Neumann GA, Hindson CM, Saxonov S, Colston BW. High-Throughput Droplet Digital PCR System for Absolute Quantitation of DNA Copy Number. Anal Chem 2011. [DOI: 10.1021/ac202028g [doi]] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Affiliation(s)
- Benjamin J. Hindson
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Kevin D. Ness
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Donald A. Masquelier
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Phillip Belgrader
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Nicholas J. Heredia
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Anthony J. Makarewicz
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Isaac J. Bright
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Michael Y. Lucero
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Amy L. Hiddessen
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Tina C. Legler
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Tyler K. Kitano
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Michael R. Hodel
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Jonathan F. Petersen
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Paul W. Wyatt
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Erin R. Steenblock
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Pallavi H. Shah
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Luc J. Bousse
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Camille B. Troup
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Jeffrey C. Mellen
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Dean K. Wittmann
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Nicholas G. Erndt
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Thomas H. Cauley
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Ryan T. Koehler
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Austin P. So
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Simant Dube
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Klint A. Rose
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Luz Montesclaros
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Shenglong Wang
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - David P. Stumbo
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Shawn P. Hodges
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Steven Romine
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Fred P. Milanovich
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Helen E. White
- National Genetics Reference Laboratory, Wessex Regional Genetics, Salisbury District Hospital, Odstock, Salisbury, Wiltshire, SP2 8BJ, United Kingdom
| | - John F. Regan
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - George A. Karlin-Neumann
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Christopher M. Hindson
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Serge Saxonov
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| | - Bill W. Colston
- Bio-Rad Laboratories, Inc., 7068 Koll Center Parkway, Pleasanton, California 94566, United States
| |
Collapse
|
472
|
Abstract
Copy number variants (CNVs) are widely distributed throughout the human genome, where they contribute to genetic variation and phenotypic diversity. Spontaneous CNVs are also a major cause of genetic and developmental disorders and arise frequently in cancer cells. As with all mutation classes, genetic and environmental factors almost certainly increase the risk for new and deleterious CNVs. However, despite the importance of CNVs, there is limited understanding of these precipitating risk factors and the mechanisms responsible for a large percentage of CNVs. Here we report that low doses of hydroxyurea, an inhibitor of ribonucleotide reductase and an important drug in the treatment of sickle cell disease and other diseases induces a high frequency of de novo CNVs in cultured human cells that resemble pathogenic and aphidicolin-induced CNVs in size and breakpoint structure. These CNVs are distributed throughout the genome, with some hotspots of de novo CNV formation. Sequencing revealed that CNV breakpoint junctions are characterized by short microhomologies, blunt ends, and short insertions. These data provide direct experimental support for models of replication-error origins of CNVs and suggest that any agent or condition that leads to replication stress has the potential to induce deleterious CNVs. In addition, they point to a need for further study of the genomic consequences of the therapeutic use of hydroxyurea.
Collapse
|
473
|
Grant RW, Meigs JB, Florez JC, Park ER, Green RC, Waxler JL, Delahanty LM, O'Brien KE. Design of a randomized trial of diabetes genetic risk testing to motivate behavior change: the Genetic Counseling/lifestyle Change (GC/LC) Study for Diabetes Prevention. Clin Trials 2011; 8:609-15. [PMID: 22013171 PMCID: PMC3763721 DOI: 10.1177/1740774511414159] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
BACKGROUND The efficacy of diabetes genetic risk testing to motivate behavior change for diabetes prevention is currently unknown. PURPOSE This paper presents key issues in the design and implementation of one of the first randomized trials (The Genetic Counseling/Lifestyle Change (GC/LC) Study for Diabetes Prevention) to test whether knowledge of diabetes genetic risk can motivate patients to adopt healthier behaviors. METHODS Because individuals may react differently to receiving 'higher' vs 'lower' genetic risk results, we designed a 3-arm parallel group study to separately test the hypotheses that: (1) patients receiving 'higher' diabetes genetic risk results will increase healthy behaviors compared to untested controls, and (2) patients receiving 'lower' diabetes genetic risk results will decrease healthy behaviors compared to untested controls. In this paper we describe several challenges to implementing this study, including: (1) the application of a novel diabetes risk score derived from genetic epidemiology studies to a clinical population, (2) the use of the principle of Mendelian randomization to efficiently exclude 'average' diabetes genetic risk patients from the intervention, and (3) the development of a diabetes genetic risk counseling intervention that maintained the ethical need to motivate behavior change in both 'higher' and 'lower' diabetes genetic risk result recipients. RESULTS Diabetes genetic risk scores were developed by aggregating the results of 36 diabetes-associated single nucleotide polymorphisms. Relative risk for type 2 diabetes was calculated using Framingham Offspring Study outcomes, grouped by quartiles into 'higher', 'average' (middle two quartiles) and 'lower' genetic risk. From these relative risks, revised absolute risks were estimated using the overall absolute risk for the study group. For study efficiency, we excluded all patients receiving 'average' diabetes risk results from the subsequent intervention. This post-randomization allocation strategy was justified because genotype represents a random allocation of parental alleles ('Mendelian randomization'). Finally, because it would be unethical to discourage participants to participate in diabetes prevention behaviors, we designed our two diabetes genetic risk counseling interventions (for 'higher' and 'lower' result recipients) so that both groups would be motivated despite receiving opposing results. LIMITATIONS For this initial assessment of the clinical implementation of genetic risk testing we assessed intermediate outcomes of attendance at a 12-week diabetes prevention course and changes in self-reported motivation. If effective, longer term studies with larger sample sizes will be needed to assess whether knowledge of diabetes genetic risk can help patients prevent diabetes. CONCLUSIONS We designed a randomized clinical trial designed to explore the motivational impact of disclosing both higher than average and lower than average genetic risk for type 2 diabetes. This design allowed exploration of both increased risk and false reassurance, and has implications for future studies in translational genomics.
Collapse
Affiliation(s)
- Richard W Grant
- Division of General Medicine, Massachusetts General Hospital, Boston, MA, USA.
| | | | | | | | | | | | | | | |
Collapse
|
474
|
Onishi-Seebacher M, Korbel JO. Challenges in studying genomic structural variant formation mechanisms: The short-read dilemma and beyond. Bioessays 2011; 33:840-50. [DOI: 10.1002/bies.201100075] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
|
475
|
Yeo RA, Gangestad SW, Gasparovic C, Liu J, Calhoun VD, Thoma RJ, Mayer AR, Kalyanam R, Hutchison KE. Rare copy number deletions predict individual variation in human brain metabolite concentrations in individuals with alcohol use disorders. Biol Psychiatry 2011; 70:537-44. [PMID: 21684527 PMCID: PMC3162096 DOI: 10.1016/j.biopsych.2011.04.019] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/09/2011] [Revised: 03/30/2011] [Accepted: 04/19/2011] [Indexed: 01/23/2023]
Abstract
BACKGROUND Although variations in neurometabolite concentrations occur in diverse neuropsychiatric and neurodegenerative disorders, little is known about the nature of underlying genetic influences. The current study investigated the importance of a specific type of genetic mutation, copy number variation (CNV), for neurometabolite concentrations in a bilateral anterior cingulate voxel. METHODS These neurometabolic signals were quantified using proton magnetic resonance spectroscopy ((1)H-MRS): N-acetylaspartate (NAA), creatine-phosphocreatine (Cre), glutamate/glutamine (Glx), myoinositol (mI), and phosphorylcholine-glycerol phosphorylcholine (Cho). Genetic data were collected using the Illumina 1MDuoBeadChip Array from a sample adults with alcohol use disorders (n = 146). RESULTS The number of base pairs lost through rare copy number deletions (occurring in less than 5% of our sample) predicted lower NAA, Cre, mI, and Glx. More total rare deletions also predicted lower NAA, Cre, and Glx. Principal components analyses of the five neurometabolites identified two correlated components, the first comprised of NAA, Glx, and Cre, and the second comprised of Cho, mI, and to a lesser extent, Cre. The number and length of rare deletions were correlated with the first component, capturing approximately 10% of phenotypic variance, but not the second component. CONCLUSIONS These results suggest that mutation load affects neurometabolite concentrations, potentially increasing risk for neuropsychiatric disorders. The greater effect of CVNs on NAA, Glx, and Cre may reflect a greater sensitivity to the effects of mutations (i.e., reduced canalization) for neurometabolites related to metabolic activity and cellular energetics, due to extensive recent selection pressure on these phenotypes in the human lineage.
Collapse
Affiliation(s)
- Ronald A. Yeo
- Department of Psychology, University of New Mexico, Albuquerque, NM, USA
| | | | - Charles Gasparovic
- Department of Psychology, University of New Mexico, Albuquerque, NM, USA,The Mind Research Network, Albuquerque, NM, USA
| | - Jingyu Liu
- The Mind Research Network, Albuquerque, NM, USA,Department of Electrical and Computer Engineering, University of New Mexico, Albuquerque, NM, USA
| | - Vince D. Calhoun
- The Mind Research Network, Albuquerque, NM, USA,Department of Electrical and Computer Engineering, University of New Mexico, Albuquerque, NM, USA
| | - Robert J. Thoma
- The Mind Research Network, Albuquerque, NM, USA,Department of Psychiatry, University of New Mexico, Albuquerque, NM, USA
| | | | | | - Kent E. Hutchison
- Department of Psychology, University of New Mexico, Albuquerque, NM, USA,The Mind Research Network, Albuquerque, NM, USA,Department of Psychology and Neuroscience, University of Colorado, Boulder, CO, USA
| |
Collapse
|
476
|
Fieuw A, Kumps C, Schramm A, Pattyn F, Menten B, Antonacci F, Sudmant P, Schulte JH, Van Roy N, Vergult S, Buckley PG, De Paepe A, Noguera R, Versteeg R, Stallings R, Eggert A, Vandesompele J, De Preter K, Speleman F. Identification of a novel recurrent 1q42.2-1qter deletion in high risk MYCN single copy 11q deleted neuroblastomas. Int J Cancer 2011; 130:2599-606. [PMID: 21796619 DOI: 10.1002/ijc.26317] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2011] [Accepted: 07/12/2011] [Indexed: 01/04/2023]
Abstract
Neuroblastoma is an aggressive embryonal tumor that accounts for ∼15% of childhood cancer deaths. Hitherto, despite the availability of comprehensive genomic data on DNA copy number changes in neuroblastoma, relatively little is known about the genes driving neuroblastoma tumorigenesis. In this study, high resolution array comparative genome hybridization (CGH) was performed on 188 primary neuroblastoma tumors and 33 neuroblastoma cell lines to search for previously undetected recurrent DNA copy number gains and losses. A new recurrent distal chromosome 1q deletion (del(1)(q42.2qter)) was detected in seven cases. Further analysis of available array CGH datasets revealed 13 additional similar distal 1q deletions. The majority of all detected 1q deletions was found in high risk 11q deleted tumors without MYCN amplification (Fisher exact test p = 5.61 × 10(-5) ). Using ultra-high resolution (∼115 bp resolution) custom arrays covering the breakpoints on 1q for 11 samples, clustering of nine breakpoints was observed within a 12.5-kb region, of which eight were found in a 7-kb copy number variable region, whereas the remaining two breakpoints were colocated 1.4-Mb proximal. The commonly deleted region contains one miRNA (hsa-mir-1537), four transcribed ultra conserved region elements (uc.43-uc.46) and 130 protein coding genes including at least two bona fide tumor suppressor genes, EGLN1 (or PHD2) and FH. This finding further contributes to the delineation of the genomic profile of aggressive neuroblastoma, offers perspectives for the identification of genes contributing to the disease phenotype and may be relevant in the light of assessment of response to new molecular treatments.
Collapse
Affiliation(s)
- Annelies Fieuw
- Center for Medical Genetics, Ghent University Hospital, Ghent, Belgium
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
477
|
Schlattl A, Anders S, Waszak SM, Huber W, Korbel JO. Relating CNVs to transcriptome data at fine resolution: assessment of the effect of variant size, type, and overlap with functional regions. Genome Res 2011; 21:2004-13. [PMID: 21862627 DOI: 10.1101/gr.122614.111] [Citation(s) in RCA: 81] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Copy-number variants (CNVs) form an abundant class of genetic variation with a presumed widespread impact on individual traits. While recent advances, such as the population-scale sequencing of human genomes, facilitated the fine-scale mapping of CNVs, the phenotypic impact of most of these CNVs remains unclear. By relating copy-number genotypes to transcriptome sequencing data, we have evaluated the impact of CNVs, mapped at fine scale, on gene expression. Based on data from 129 individuals with ancestry from two populations, we identified CNVs associated with the expression of 110 genes, with 13% of the associations involving complex, multiallelic CNVs. Categorization of CNVs according to variant type, size, and gene overlap enabled us to examine the impact of different CNV classes on expression variation. While many small (<4 kb) CNVs were associated with expression variation, overall we observed an enrichment of large duplications and deletions, including large intergenic CNVs, relative to the entire set of expression-associated CNVs. Furthermore, the copy number of genes intersecting with CNVs typically correlated positively with the genes' expression, and also was more strongly correlated with expression than nearby single nucleotide polymorphisms, suggesting a frequent causal role of CNVs in expression quantitative trait loci (eQTLs). We also elucidated unexpected cases of negative correlations between copy number and expression by assessing the CNVs' effects on the structure and regulation of genes. Finally, we examined dosage compensation of transcript levels. Our results suggest that association studies can gain in resolution and power by including fine-scale CNV information, such as those obtained from population-scale sequencing.
Collapse
Affiliation(s)
- Andreas Schlattl
- European Molecular Biology Laboratory (EMBL), Genome Biology Research Unit, 69117 Heidelberg, Germany
| | | | | | | | | |
Collapse
|
478
|
Abstract
Copy number variants (CNVs) play an important role in human disease and population diversity. Advancements in technology have allowed for the analysis of CNVs in thousands of individuals with disease in addition to thousands of controls. These studies have identified rare CNVs associated with neuropsychiatric diseases such as autism, schizophrenia, and intellectual disability. In addition, copy number polymorphisms (CNPs) are present at higher frequencies in the population, show high diversity in copy number, sequence, and structure, and have been associated with multiple phenotypes, primarily related to immune or environmental response. However, the landscape of copy number variation still remains largely unexplored, especially for smaller CNVs and those embedded within complex regions of the human genome. An integrated approach including characterization of single nucleotide variants and CNVs in a large number of individuals with disease and normal genomes holds the promise of thoroughly elucidating the genetic basis of human disease and diversity.
Collapse
Affiliation(s)
- Santhosh Girirajan
- Department of Genome Sciences and Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, USA.
| | | | | |
Collapse
|
479
|
Nedelkova M, Maresca M, Fu J, Rostovskaya M, Chenna R, Thiede C, Anastassiadis K, Sarov M, Stewart AF. Targeted isolation of cloned genomic regions by recombineering for haplotype phasing and isogenic targeting. Nucleic Acids Res 2011; 39:e137. [PMID: 21852329 PMCID: PMC3203589 DOI: 10.1093/nar/gkr668] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
Studying genetic variations in the human genome is important for understanding phenotypes and complex traits, including rare personal variations and their associations with disease. The interpretation of polymorphisms requires reliable methods to isolate natural genetic variations, including combinations of variations, in a format suitable for downstream analysis. Here, we describe a strategy for targeted isolation of large regions (∼35 kb) from human genomes that is also applicable to any genome of interest. The method relies on recombineering to fish out target fosmid clones from pools and thereby circumvents the laborious need to plate and screen thousands of individual clones. To optimize the method, a new highly recombineering-efficient bacterial host, including inducible TrfA for fosmid copy number amplification, was developed. Various regions were isolated from human embryonic stem cell lines and a personal genome, including highly repetitive and duplicated ones. The maternal and paternal alleles at the MECP2/IRAK 1 loci were distinguished based on identification of novel allele-specific single-nucleotide polymorphisms in regulatory regions. Additionally, we applied further recombineering to construct isogenic targeting vectors for patient-specific applications. These methods will facilitate work to understand the linkage between personal variations and disease propensity, as well as possibilities for personal genome surgery.
Collapse
Affiliation(s)
- Marta Nedelkova
- Technische Universitaet Dresden, Genomics, BioInnovationsZentrum, Dresden, Germany
| | | | | | | | | | | | | | | | | |
Collapse
|
480
|
Nicholas TJ, Baker C, Eichler EE, Akey JM. A high-resolution integrated map of copy number polymorphisms within and between breeds of the modern domesticated dog. BMC Genomics 2011; 12:414. [PMID: 21846351 PMCID: PMC3166287 DOI: 10.1186/1471-2164-12-414] [Citation(s) in RCA: 67] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2011] [Accepted: 08/16/2011] [Indexed: 01/22/2023] Open
Abstract
Background Structural variation contributes to the rich genetic and phenotypic diversity of the modern domestic dog, Canis lupus familiaris, although compared to other organisms, catalogs of canine copy number variants (CNVs) are poorly defined. To this end, we developed a customized high-density tiling array across the canine genome and used it to discover CNVs in nine genetically diverse dogs and a gray wolf. Results In total, we identified 403 CNVs that overlap 401 genes, which are enriched for defense/immunity, oxidoreductase, protease, receptor, signaling molecule and transporter genes. Furthermore, we performed detailed comparisons between CNVs located within versus outside of segmental duplications (SDs) and find that CNVs in SDs are enriched for gene content and complexity. Finally, we compiled all known dog CNV regions and genotyped them with a custom aCGH chip in 61 dogs from 12 diverse breeds. These data allowed us to perform the first population genetics analysis of canine structural variation and identify CNVs that potentially contribute to breed specific traits. Conclusions Our comprehensive analysis of canine CNVs will be an important resource in genetically dissecting canine phenotypic and behavioral variation.
Collapse
Affiliation(s)
- Thomas J Nicholas
- Department of Genome Sciences, University of Washington, 1705 NE Pacific, Seattle, WA 98195, USA
| | | | | | | |
Collapse
|
481
|
Sathirapongsasuti JF, Lee H, Horst BAJ, Brunner G, Cochran AJ, Binder S, Quackenbush J, Nelson SF. Exome sequencing-based copy-number variation and loss of heterozygosity detection: ExomeCNV. ACTA ACUST UNITED AC 2011; 27:2648-54. [PMID: 21828086 DOI: 10.1093/bioinformatics/btr462] [Citation(s) in RCA: 300] [Impact Index Per Article: 23.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
MOTIVATION The ability to detect copy-number variation (CNV) and loss of heterozygosity (LOH) from exome sequencing data extends the utility of this powerful approach that has mainly been used for point or small insertion/deletion detection. RESULTS We present ExomeCNV, a statistical method to detect CNV and LOH using depth-of-coverage and B-allele frequencies, from mapped short sequence reads, and we assess both the method's power and the effects of confounding variables. We apply our method to a cancer exome resequencing dataset. As expected, accuracy and resolution are dependent on depth-of-coverage and capture probe design. AVAILABILITY CRAN package 'ExomeCNV'. CONTACT fsathira@fas.harvard.edu; snelson@ucla.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
482
|
Copy number variation analysis in the great apes reveals species-specific patterns of structural variation. Genome Res 2011; 21:1626-39. [PMID: 21824994 DOI: 10.1101/gr.117242.110] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
Copy number variants (CNVs) are increasingly acknowledged as an important source of evolutionary novelties in the human lineage. However, our understanding of their significance is still hindered by the lack of primate CNV data. We performed intraspecific comparative genomic hybridizations to identify loci harboring copy number variants in each of the four great apes: bonobos, chimpanzees, gorillas, and orangutans. For the first time, we could analyze differences in CNV location and frequency in these four species, and compare them with human CNVs and primate segmental duplication (SD) maps. In addition, for bonobo and gorilla, patterns of CNV and nucleotide diversity were studied in the same individuals. We show that CNVs have been subject to different selective pressures in different lineages. Evidence for purifying selection is stronger in gorilla CNVs overlapping genes, while positive selection appears to have driven the fixation of structural variants in the orangutan lineage. In contrast, chimpanzees and bonobos present high levels of common structural polymorphism, which is indicative of relaxed purifying selection together with the higher mutation rates induced by the known burst of segmental duplication in the ancestor of the African apes. Indeed, the impact of the duplication burst is noticeable by the fact that bonobo and chimpanzee share more CNVs with gorilla than expected. Finally, we identified a number of interesting genomic regions that present high-frequency CNVs in all great apes, while containing only very rare or even pathogenic structural variants in humans.
Collapse
|
483
|
Dassanayake M, Oh DH, Haas JS, Hernandez A, Hong H, Ali S, Yun DJ, Bressan RA, Zhu JK, Bohnert HJ, Cheeseman JM. The genome of the extremophile crucifer Thellungiella parvula. Nat Genet 2011; 43:913-8. [PMID: 21822265 DOI: 10.1038/ng.889] [Citation(s) in RCA: 220] [Impact Index Per Article: 16.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2011] [Accepted: 06/24/2011] [Indexed: 01/02/2023]
Abstract
Thellungiella parvula is related to Arabidopsis thaliana and is endemic to saline, resource-poor habitats, making it a model for the evolution of plant adaptation to extreme environments. Here we present the draft genome for this extremophile species. Exclusively by next generation sequencing, we obtained the de novo assembled genome in 1,496 gap-free contigs, closely approximating the estimated genome size of 140 Mb. We anchored these contigs to seven pseudo chromosomes without the use of maps. We show that short reads can be assembled to a near-complete chromosome level for a eukaryotic species lacking prior genetic information. The sequence identifies a number of tandem duplications that, by the nature of the duplicated genes, suggest a possible basis for T. parvula's extremophile lifestyle. Our results provide essential background for developing genomically influenced testable hypotheses for the evolution of environmental stress tolerance.
Collapse
Affiliation(s)
- Maheshi Dassanayake
- Department of Plant Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
484
|
Chen W, Hayward C, Wright AF, Hicks AA, Vitart V, Knott S, Wild SH, Pramstaller PP, Wilson JF, Rudan I, Porteous DJ. Copy number variation across European populations. PLoS One 2011; 6:e23087. [PMID: 21829696 PMCID: PMC3150386 DOI: 10.1371/journal.pone.0023087] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2011] [Accepted: 07/12/2011] [Indexed: 12/13/2022] Open
Abstract
Genome analysis provides a powerful approach to test for evidence of genetic variation within and between geographical regions and local populations. Copy number variants which comprise insertions, deletions and duplications of genomic sequence provide one such convenient and informative source. Here, we investigate copy number variants from genome wide scans of single nucleotide polymorphisms in three European population isolates, the island of Vis in Croatia, the islands of Orkney in Scotland and the South Tyrol in Italy. We show that whereas the overall copy number variant frequencies are similar between populations, their distribution is highly specific to the population of origin, a finding which is supported by evidence for increased kinship correlation for specific copy number variants within populations.
Collapse
Affiliation(s)
- Wanting Chen
- Medical Genetics Section, Centre for Molecular Medicine, Institute of Genetics & Molecular Medicine, University of Edinburgh, Western General Hospital, Crewe Road South, Edinburgh, United Kingdom
| | - Caroline Hayward
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, United Kingdom
| | - Alan F. Wright
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, United Kingdom
| | - Andrew A. Hicks
- Institute of Genetic Medicine, European Academy Bozen/Bolzano (EURAC), Bolzano/Bozen, Italy - Affiliated Institute of the University of Lübeck, Lübeck, Germany
| | - Veronique Vitart
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, United Kingdom
| | - Sara Knott
- Institute of Evolutionary Biology, University of Edinburgh, Ashworth Laboratories, King's Buildings, Edinburgh, United Kingdom
| | - Sarah H. Wild
- Centre for Population Health Sciences, The University of Edinburgh Medical School, Edinburgh, United Kingdom
| | - Peter P. Pramstaller
- Institute of Genetic Medicine, European Academy Bozen/Bolzano (EURAC), Bolzano/Bozen, Italy - Affiliated Institute of the University of Lübeck, Lübeck, Germany
- Department of Neurology, General Central Hospital, Bolzano, Italy
- Department of Neurology, University of Lübeck, Lübeck, Germany
| | - James F. Wilson
- Centre for Population Health Sciences, The University of Edinburgh Medical School, Edinburgh, United Kingdom
| | - Igor Rudan
- Centre for Population Health Sciences, The University of Edinburgh Medical School, Edinburgh, United Kingdom
- Croatian Centre for Global Health, University of Split Medical School, Split, Croatia
| | - David J. Porteous
- Medical Genetics Section, Centre for Molecular Medicine, Institute of Genetics & Molecular Medicine, University of Edinburgh, Western General Hospital, Crewe Road South, Edinburgh, United Kingdom
- * E-mail:
| |
Collapse
|
485
|
Jiang C, Mithani A, Gan X, Belfield EJ, Klingler JP, Zhu JK, Ragoussis J, Mott R, Harberd NP. Regenerant Arabidopsis lineages display a distinct genome-wide spectrum of mutations conferring variant phenotypes. Curr Biol 2011; 21:1385-90. [PMID: 21802297 PMCID: PMC3162137 DOI: 10.1016/j.cub.2011.07.002] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2011] [Revised: 05/27/2011] [Accepted: 07/01/2011] [Indexed: 11/30/2022]
Abstract
Multicellular organisms can be regenerated from totipotent differentiated somatic cell or nuclear founders [1–3]. Organisms regenerated from clonally related isogenic founders might a priori have been expected to be phenotypically invariant. However, clonal regenerant animals display variant phenotypes caused by defective epigenetic reprogramming of gene expression [2], and clonal regenerant plants exhibit poorly understood heritable phenotypic (“somaclonal”) variation [4–7]. Here we show that somaclonal variation in regenerant Arabidopsis lineages is associated with genome-wide elevation in DNA sequence mutation rate. We also show that regenerant mutations comprise a distinctive molecular spectrum of base substitutions, insertions, and deletions that probably results from decreased DNA repair fidelity. Finally, we show that while regenerant base substitutions are a likely major genetic cause of the somaclonal variation of regenerant Arabidopsis lineages, transposon movement is unlikely to contribute substantially to that variation. We conclude that the phenotypic variation of regenerant plants, unlike that of regenerant animals, is substantially due to DNA sequence mutation.
Collapse
Affiliation(s)
- Caifu Jiang
- Department of Plant Sciences, University of Oxford, South Parks Road, Oxford OX1 3RB, UK
| | | | | | | | | | | | | | | | | |
Collapse
|
486
|
Johansson ACV, Feuk L. Characterization of copy number-stable regions in the human genome. Hum Mutat 2011; 32:947-55. [PMID: 21542059 DOI: 10.1002/humu.21524] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2011] [Accepted: 04/20/2011] [Indexed: 01/25/2023]
Abstract
In the past few years the number of copy number variants (CNVs) identified in the human genome has increased significantly, but our understanding of the functional impact of CNVs is still limited. Clinically significant variations cannot easily be distinguished from benign, complicating interpretation of patient data. Multiple studies have focused on analysis of regions that vary in copy number in specific disorders. Here we use the opposite strategy and focus our analysis on regions that never seem to vary in the general population, hypothesizing that these are copy number stable because variations within them are deleterious. Our results show that copy number stable regions are characterized by correlation with a number of genomic features, allowing us to define a list of genomic regions that are dosage sensitive in humans. We find that these dosage-sensitive regions show significant overlap with de novo CNVs identified in patients with intellectual disability or autism. There is also a significant association between copy number stable regions and rare inherited variants in autism patients, but not in controls. Based on this predictive power, we propose that copy number stable regions can be used to complement maps of known CNVs to facilitate interpretation of patient data.
Collapse
Affiliation(s)
- Anna C V Johansson
- Department of Immunology, Genetics and Pathology, Rudbeck Laboratory, Uppsala University, Uppsala, Sweden
| | | |
Collapse
|
487
|
Yu P, Wang C, Xu Q, Feng Y, Yuan X, Yu H, Wang Y, Tang S, Wei X. Detection of copy number variations in rice using array-based comparative genomic hybridization. BMC Genomics 2011; 12:372. [PMID: 21771342 PMCID: PMC3156786 DOI: 10.1186/1471-2164-12-372] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2011] [Accepted: 07/20/2011] [Indexed: 01/02/2023] Open
Abstract
Background Copy number variations (CNVs) can create new genes, change gene dosage, reshape gene structures, and modify elements regulating gene expression. As with all types of genetic variation, CNVs may influence phenotypic variation and gene expression. CNVs are thus considered major sources of genetic variation. Little is known, however, about their contribution to genetic variation in rice. Results To detect CNVs, we used a set of NimbleGen whole-genome comparative genomic hybridization arrays containing 718,256 oligonucleotide probes with a median probe spacing of 500 bp. We compiled a high-resolution map of CNVs in the rice genome, showing 641 CNVs between the genomes of the rice cultivars 'Nipponbare' (from O. sativa ssp. japonica) and 'Guang-lu-ai 4' (from O. sativa ssp. indica). The CNVs identified vary in size from 1.1 kb to 180.7 kb, and encompass approximately 7.6 Mb of the rice genome. The largest regions showing copy gain and loss are of 37.4 kb on chromosome 4, and 180.7 kb on chromosome 8. In addition, 85 DNA segments were identified, including some genic sequences. Contracted genes greatly outnumbered duplicated ones. Many of the contracted genes corresponded to either the same genes or genes involved in the same biological processes; this was also the case for genes involved in disease and defense. Conclusion We detected CNVs in rice by array-based comparative genomic hybridization. These CNVs contain known genes. Further discussion of CNVs is important, as they are linked to variation among rice varieties, and are likely to contribute to subspecific characteristics.
Collapse
Affiliation(s)
- Ping Yu
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, China
| | | | | | | | | | | | | | | | | |
Collapse
|
488
|
Church DM, Schneider VA, Graves T, Auger K, Cunningham F, Bouk N, Chen HC, Agarwala R, McLaren WM, Ritchie GRS, Albracht D, Kremitzki M, Rock S, Kotkiewicz H, Kremitzki C, Wollam A, Trani L, Fulton L, Fulton R, Matthews L, Whitehead S, Chow W, Torrance J, Dunn M, Harden G, Threadgold G, Wood J, Collins J, Heath P, Griffiths G, Pelan S, Grafham D, Eichler EE, Weinstock G, Mardis ER, Wilson RK, Howe K, Flicek P, Hubbard T. Modernizing reference genome assemblies. PLoS Biol 2011; 9:e1001091. [PMID: 21750661 PMCID: PMC3130012 DOI: 10.1371/journal.pbio.1001091] [Citation(s) in RCA: 327] [Impact Index Per Article: 25.2] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Affiliation(s)
- Deanna M Church
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
489
|
|
490
|
Jiang Q, Ho YY, Hao L, Nichols Berrios C, Chakravarti A. Copy number variants in candidate genes are genetic modifiers of Hirschsprung disease. PLoS One 2011; 6:e21219. [PMID: 21712996 PMCID: PMC3119685 DOI: 10.1371/journal.pone.0021219] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2011] [Accepted: 05/23/2011] [Indexed: 01/15/2023] Open
Abstract
Hirschsprung disease (HSCR) is a neurocristopathy characterized by absence of intramural ganglion cells along variable lengths of the gastrointestinal tract. The HSCR phenotype is highly variable with respect to gender, length of aganglionosis, familiality and the presence of additional anomalies. By molecular genetic analysis, a minimum of 11 neuro-developmental genes (RET, GDNF, NRTN, SOX10, EDNRB, EDN3, ECE1, ZFHX1B, PHOX2B, KIAA1279, TCF4) are known to harbor rare, high-penetrance mutations that confer a large risk to the bearer. In addition, two other genes (RET, NRG1) harbor common, low-penetrance polymorphisms that contribute only partially to risk and can act as genetic modifiers. To broaden this search, we examined whether a set of 67 proven and candidate HSCR genes harbored additional modifier alleles. In this pilot study, we utilized a custom-designed array CGH with ∼33,000 test probes at an average resolution of ∼185 bp to detect gene-sized or smaller copy number variants (CNVs) within these 67 genes in 18 heterogeneous HSCR patients. Using stringent criteria, we identified CNVs at three loci (MAPK10, ZFHX1B, SOX2) that are novel, involve regulatory and coding sequences of neuro-developmental genes, and show association with HSCR in combination with other congenital anomalies. Additional CNVs are observed under relaxed criteria. Our research suggests a role for CNVs in HSCR and, importantly, emphasizes the role of variation in regulatory sequences. A much larger study will be necessary both for replication and for identifying the full spectrum of small CNV effects.
Collapse
Affiliation(s)
- Qian Jiang
- Center for Complex Disease Genomics, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
| | - Yen-Yi Ho
- Center for Complex Disease Genomics, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
| | - Li Hao
- Center for Complex Disease Genomics, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
| | - Courtney Nichols Berrios
- Center for Complex Disease Genomics, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
| | - Aravinda Chakravarti
- Center for Complex Disease Genomics, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
- * E-mail:
| |
Collapse
|
491
|
Roudnitzky N, Bufe B, Thalmann S, Kuhn C, Gunn HC, Xing C, Crider BP, Behrens M, Meyerhof W, Wooding SP. Genomic, genetic and functional dissection of bitter taste responses to artificial sweeteners. Hum Mol Genet 2011; 20:3437-49. [PMID: 21672920 DOI: 10.1093/hmg/ddr252] [Citation(s) in RCA: 80] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Bitter taste perception is initiated by TAS2R receptors, which respond to agonists by triggering depolarization of taste bud cells. Mutations in TAS2Rs are known to affect taste phenotypes by altering receptor function. Evidence that TAS2Rs overlap in ligand specificity suggests that they may also contribute joint effects. To explore this aspect of gustation, we examined bitter perception of saccharin and acesulfame K, widely used artificial sweeteners with aversive aftertastes. Both substances are agonists of TAS2R31 and -43, which belong to a five-member subfamily (TAS2R30-46) responsive to a diverse constellation of compounds. We analyzed sequence variation and linkage structure in the ∼140 kb genomic region encoding TAS2R30-46, taste responses to the two sweeteners in subjects, and functional characteristics of receptor alleles. Whole-gene sequences from TAS2R30-46 in 60 Caucasian subjects revealed extensive diversity including 34 missense mutations, two nonsense mutations and high-frequency copy-number variants. Thirty markers, including non-synonymous variants in all five genes, were associated (P< 0.001) with responses to saccharin and acesulfame K. However, linkage disequilibrium (LD) in the region was high (D', r(2) > 0.95). Haplotype analyses revealed that most associations were spurious, arising from LD with variants in TAS2R31. In vitro assays confirmed the functional importance of four TAS2R31 mutations, which had independent effects on receptor response. The existence of high LD spanning functionally distinct TAS2R loci predicts that bitter taste responses to many compounds will be strongly correlated even when they are mediated by different genes. Integrative approaches combining phenotypic, genetic and functional analysis will be essential in dissecting these complex relationships.
Collapse
Affiliation(s)
- Natacha Roudnitzky
- Department of Molecular Genetics, German Institute of Human Nutrition Potsdam-Rehbruecke, Arthur-Scheunert- Allee 114-116, 14558 Nuthetal, Germany
| | | | | | | | | | | | | | | | | | | |
Collapse
|
492
|
O’Roak BJ, Deriziotis P, Lee C, Vives L, Schwartz JJ, Girirajan S, Karakoc E, MacKenzie AP, Ng SB, Baker C, Rieder MJ, Nickerson DA, Bernier R, Fisher SE, Shendure J, Eichler EE. Exome sequencing in sporadic autism spectrum disorders identifies severe de novo mutations. Nat Genet 2011; 43:585-9. [PMID: 21572417 PMCID: PMC3115696 DOI: 10.1038/ng.835] [Citation(s) in RCA: 845] [Impact Index Per Article: 65.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2011] [Accepted: 04/21/2011] [Indexed: 12/14/2022]
Abstract
Evidence for the etiology of autism spectrum disorders (ASDs) has consistently pointed to a strong genetic component complicated by substantial locus heterogeneity. We sequenced the exomes of 20 individuals with sporadic ASD (cases) and their parents, reasoning that these families would be enriched for de novo mutations of major effect. We identified 21 de novo mutations, 11 of which were protein altering. Protein-altering mutations were significantly enriched for changes at highly conserved residues. We identified potentially causative de novo events in 4 out of 20 probands, particularly among more severely affected individuals, in FOXP1, GRIN2B, SCN1A and LAMC3. In the FOXP1 mutation carrier, we also observed a rare inherited CNTNAP2 missense variant, and we provide functional support for a multi-hit model for disease risk. Our results show that trio-based exome sequencing is a powerful approach for identifying new candidate genes for ASDs and suggest that de novo mutations may contribute substantially to the genetic etiology of ASDs.
Collapse
Affiliation(s)
- Brian J. O’Roak
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Pelagia Deriziotis
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Choli Lee
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Laura Vives
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Jerrod J. Schwartz
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Santhosh Girirajan
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Emre Karakoc
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Alexandra P. MacKenzie
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Sarah B. Ng
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Carl Baker
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Mark J. Rieder
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Deborah A. Nickerson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Raphael Bernier
- Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, USA
| | - Simon E. Fisher
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
- Language and Genetics Department, Max Planck Institute for Psycholinguistics, Nijmegen, the Netherlands
| | - Jay Shendure
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Evan E. Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, Seattle, WA, USA
| |
Collapse
|
493
|
Hormozdiari F, Hach F, Sahinalp SC, Eichler EE, Alkan C. Sensitive and fast mapping of di-base encoded reads. Bioinformatics 2011; 27:1915-21. [PMID: 21586516 DOI: 10.1093/bioinformatics/btr303] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
MOTIVATION Discovering variation among high-throughput sequenced genomes relies on efficient and effective mapping of sequence reads. The speed, sensitivity and accuracy of read mapping are crucial to determining the full spectrum of single nucleotide variants (SNVs) as well as structural variants (SVs) in the donor genomes analyzed. RESULTS We present drFAST, a read mapper designed for di-base encoded 'color-space' sequences generated with the AB SOLiD platform. drFAST is specially designed for better delineation of structural variants, including segmental duplications, and is able to return all possible map locations and underlying sequence variation of short reads within a user-specified distance threshold. We show that drFAST is more sensitive in comparison to all commonly used aligners such as Bowtie, BFAST and SHRiMP. drFAST is also faster than both BFAST and SHRiMP and achieves a mapping speed comparable to Bowtie. AVAILABILITY The source code for drFAST is available at http://drfast.sourceforge.net
Collapse
Affiliation(s)
- Farhad Hormozdiari
- Department of Genome Sciences, Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195-5065, USA
| | | | | | | | | |
Collapse
|
494
|
Abstract
DNA sequence variants are major components of the "causal field" for virtually all medical phenotypes, whether single gene familial disorders or complex traits without a clear familial aggregation. The causal variants in single gene disorders are necessary and sufficient to impart large effects. In contrast, complex traits are attributable to a much more complicated network of contributory components that in aggregate increase the probability of disease. The conventional approach to identification of the causal variants for single gene disorders is genetic linkage. However, it does not offer sufficient resolution to map the causal genes in small families or sporadic cases. The approach to genetic studies of complex traits entails candidate gene or genome-wide association studies. Genome-wide association studies provide an unbiased survey of the effects of common genetic variants (common disease-common variant hypothesis). Genome-wide association studies have led to identification of a large number of alleles for various cardiovascular diseases. However, common alleles account for a relatively small fraction of the total heritability of the traits. Accordingly, the focus has shifted toward identification of rare variants that might impart larger effect sizes (rare variant-common disease hypothesis). This shift is made feasible by recent advances in massively parallel DNA sequencing platforms, which afford the opportunity to identify virtually all common as well as rare alleles in individuals. In this review, we discuss various strategies that are used to delineate the genetic contribution to medically important cardiovascular phenotypes, emphasizing the utility of the new deep sequencing approaches.
Collapse
Affiliation(s)
- A J Marian
- Center for Cardiovascular Genetics, The Brown Foundation Institute of Molecular Medicine, The University of Texas Health Sciences Center, 6770 Bertner Street, Suite C900A, Houston, TX 77030, USA.
| | | |
Collapse
|
495
|
Cooper DN, Kehrer-Sawatzki H. Exploring the potential relevance of human-specific genes to complex disease. Hum Genomics 2011; 5:99-107. [PMID: 21296743 PMCID: PMC3525227 DOI: 10.1186/1479-7364-5-2-99] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Although human disease genes generally tend to be evolutionarily more ancient than non-disease genes, complex disease genes appear to be represented more frequently than Mendelian disease genes among genes of more recent evolutionary origin. It is therefore proposed that the analysis of human-specific genes might provide new insights into the genetics of complex disease. Cross-comparison with the Human Gene Mutation Database (http://www.hgmd.org) revealed a number of examples of disease-causing and disease-associated mutations in putatively human-specific genes. A sizeable proportion of these were missense polymorphisms associated with complex disease. Since both human-specific genes and genes associated with complex disease have often experienced particularly rapid rates of evolutionary change, either due to weaker purifying selection or positive selection, it is proposed that a significant number of human-specific genes may play a role in complex disease.
Collapse
Affiliation(s)
- David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, UK.
| | | |
Collapse
|
496
|
Hardwick RJ, Machado LR, Zuccherato LW, Antolinos S, Xue Y, Shawa N, Gilman RH, Cabrera L, Berg DE, Tyler-Smith C, Kelly P, Tarazona-Santos E, Hollox EJ. A worldwide analysis of beta-defensin copy number variation suggests recent selection of a high-expressing DEFB103 gene copy in East Asia. Hum Mutat 2011; 32:743-50. [PMID: 21387465 PMCID: PMC3263423 DOI: 10.1002/humu.21491] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2010] [Accepted: 02/10/2011] [Indexed: 11/21/2022]
Abstract
Beta-defensins are a family of multifunctional genes with roles in defense against pathogens, reproduction, and pigmentation. In humans, six beta-defensin genes are clustered in a repeated region which is copy-number variable (CNV) as a block, with a diploid copy number between 1 and 12. The role in host defense makes the evolutionary history of this CNV particularly interesting, because morbidity due to infectious disease is likely to have been an important selective force in human evolution, and to have varied between geographical locations. Here, we show CNV of the beta-defensin region in chimpanzees, and identify a beta-defensin block in the human lineage that contains rapidly evolving noncoding regulatory sequences. We also show that variation at one of these rapidly evolving sequences affects expression levels and cytokine responsiveness of DEFB103, a key inhibitor of influenza virus fusion at the cell surface. A worldwide analysis of beta-defensin CNV in 67 populations shows an unusually high frequency of high-DEFB103-expressing copies in East Asia, the geographical origin of historical and modern influenza epidemics, possibly as a result of selection for increased resistance to influenza in this region. Hum Mutat 32:743–750, 2011. © 2011 Wiley-Liss, Inc.
Collapse
Affiliation(s)
- Robert J Hardwick
- Department of Genetics, University of Leicester, University Road, Leicester, United Kingdom
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
497
|
|
498
|
Van Hul W. Identification of genetic modifiers of monogenic (bone) diseases: new tools available, but with limitations. J Bone Miner Res 2011; 26:918-9. [PMID: 21541993 DOI: 10.1002/jbmr.391] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
- Wim Van Hul
- Department of Medical Genetics, University of Antwerp, Antwerp, Belgium.
| |
Collapse
|
499
|
Smith–Magenis syndrome: haploinsufficiency of RAI1 results in altered gene regulation in neurological and metabolic pathways. Expert Rev Mol Med 2011; 13:e14. [DOI: 10.1017/s1462399411001827] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Smith–Magenis syndrome (SMS) is a complex neurobehavioural disorder characterised by intellectual disability, self-injurious behaviours, sleep disturbance, obesity, and craniofacial and skeletal anomalies. Diagnostic strategies are focused towards identification of a 17p11.2 microdeletion encompassing the gene RAI1 (retinoic acid induced 1) or a mutation of RAI1. Molecular evidence shows that most SMS features are due to RAI1 haploinsufficiency, whereas variability and severity are modified by other genes in the 17p11.2 region for 17p11.2 deletion cases. The functional role of RAI1 is not completely understood, but it is probably a transcription factor acting in several different biological pathways that are dysregulated in SMS. Functional studies based on the hypothesis that RAI1 acts through phenotype-specific pathways involving several downstream genes have shown that RAI1 gene dosage is crucial for normal regulation of circadian rhythm, lipid metabolism and neurotransmitter function. Here, we review the clinical and molecular features of SMS and explore more recent studies supporting possible therapeutic strategies for behavioural management.
Collapse
|
500
|
Campbell C, Sampas N, Tsalenko A, Sudmant P, Kidd J, Malig M, Vu T, Vives L, Tsang P, Bruhn L, Eichler E. Population-genetic properties of differentiated human copy-number polymorphisms. Am J Hum Genet 2011; 88:317-32. [PMID: 21397061 DOI: 10.1016/j.ajhg.2011.02.004] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2010] [Revised: 02/07/2011] [Accepted: 02/15/2011] [Indexed: 01/19/2023] Open
Abstract
Copy-number variants (CNVs) can reach appreciable frequencies in the human population, and recent discoveries have shown that several of these copy-number polymorphisms (CNPs) are associated with human diseases, including lupus, psoriasis, Crohn disease, and obesity. Despite new advances, significant biases remain in terms of CNP discovery and genotyping. We developed a method based on single-channel intensity data and benchmarked against copy numbers determined from sequencing read depth to successfully obtain CNP genotypes for 1495 CNPs from 487 human DNA samples of diverse ethnic backgrounds. This microarray contained CNPs in segmental duplication-rich regions and insertions of sequences not represented in the reference genome assembly or on standard SNP microarray platforms. We observe that CNPs in segmental duplications are more likely to be population differentiated than CNPs in unique regions (p = 0.015) and that biallelic CNPs show greater stratification when compared to frequency-matched SNPs (p = 0.0026). Although biallelic CNPs show a strong correlation of copy number with flanking SNP genotypes, the majority of multicopy CNPs do not (40% with r > 0.8). We selected a subset of CNPs for further characterization in 1876 additional samples from 62 populations; this revealed striking population-differentiated structural variants in genes of clinical significance such as OCLN, a tight junction protein involved in hepatitis C viral entry. Our microarray design allows these variants to be rapidly tested for disease association and our results suggest that CNPs (especially those that cannot be imputed from SNP genotypes) might have contributed disproportionately to human diversity and selection.
Collapse
|