1
|
Birnbaum R, Mahjani B, Loos RJF, Sharp AJ. Clinical Characterization of Copy Number Variants Associated With Neurodevelopmental Disorders in a Large-scale Multiancestry Biobank. JAMA Psychiatry 2022; 79:250-259. [PMID: 35080590 PMCID: PMC8792794 DOI: 10.1001/jamapsychiatry.2021.4080] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Accepted: 11/30/2021] [Indexed: 01/28/2023]
Abstract
IMPORTANCE Past studies identified rare copy number variants (CNVs) as risk factors for neurodevelopmental disorders (NDDs), including autism spectrum disorder and schizophrenia. However, the clinical characterization of NDD CNVs is understudied in population cohorts unselected for neuropsychiatric disorders and in cohorts of diverse ancestry. OBJECTIVE To identify individuals harboring NDD CNVs in a multiancestry biobank and to query their enrichment for select neuropsychiatric disorders as well as association with multiple medical disorders. DESIGN, SETTINGS, AND PARTICIPANTS In a series of phenotypic enrichment and association analyses, NDD CNVs were clinically characterized among 24 877 participants in the BioMe biobank, an electronic health record-linked biobank derived from the Mount Sinai Health System, New York, New York. Participants were recruited into the biobank since September 2007 across diverse ancestry and medical and neuropsychiatric specialties. For the current analyses, electronic health record data were analyzed from May 2004 through May 2019. MAIN OUTCOMES AND MEASURES NDD CNVs were identified using a consensus of 2 CNV calling algorithms, based on whole-exome sequencing and genotype array data, followed by novel in-silico clinical assessments. RESULTS Of 24 877 participants, 14 586 (58.7%) were female; self-reported ancestry categories included 5965 (24.0%) who were of African ancestry, 7892 (31.7%) who were of European ancestry, and 8536 (34.3%) who were of Hispanic ancestry; and the mean (SD) age was 50.5 (17.3) years. Among 24 877 individuals, the prevalence of 64 NDD CNVs was 2.5% (n = 627), with prevalence varying by locus, corroborating the presence of some relatively highly prevalent NDD CNVs (eg, 15q11.2 deletion/duplication). An aggregate set of NDD CNVs were enriched for congenital disorders (odds ratio, 2.0; 95% CI, 1.1-3.5; P = .01) and major depressive disorder (odds ratio, 1.5; 95% CI, 1.1-2.0; P = .01). In a meta-analysis of medical diagnoses (n = 195 hierarchically clustered diagnostic codes), NDD CNVs were significantly associated with several medical outcomes, including essential hypertension (z score = 3.6; P = 2.8 × 10-4), kidney failure (z score = 3.3; P = 1.1 × 10-3), and obstructive sleep apnea (z score = 3.4; P = 8.1 × 10-4) and, in another analysis, morbid obesity (z score = 3.8; P = 1.3 × 10-4). Further, NDD CNVs were associated with increased body mass index in a multiancestry analysis (β = 0.19; 95% CI, 0.10-0.31; P = .003). For 36 common serum tests, there was no association with NDD CNVs. CONCLUSIONS AND RELEVANCE Clinical features of individuals harboring NDD CNVs were elucidated in a large-scale, multiancestry biobank, identifying enrichments for congenital disorders and major depressive disorder as well as associations with several medical outcomes, including hypertension, kidney failure, and obesity and obesity-related phenotypes, specifically obstructive sleep apnea and increased body mass index. The association between NDD CNVs and obesity outcomes indicate further potential pleiotropy of NDD CNVs beyond neurodevelopmental outcomes previously reported. Future clinical genetic investigations may lead to insights of at-risk individuals and therapeutic strategies targeting specific genetic variants. The importance of diverse inclusion within biobanks and considering the effect of rare genetic variants in a multiancestry context is evident.
Collapse
Affiliation(s)
- Rebecca Birnbaum
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, New York
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York
| | - Behrang Mahjani
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, New York
- Seaver Autism Center for Research and Treatment, Icahn School of Medicine at Mount Sinai, New York, New York
| | - Ruth J. F. Loos
- Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York
- NovoNordisk Foundation Center for Basic Metabolic Research, University of Copenhagen, Copenhagen, Denmark
| | - Andrew J. Sharp
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York
- The Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, New York
| |
Collapse
|
2
|
Woo XY, Giordano J, Srivastava A, Zhao ZM, Lloyd MW, de Bruijn R, Suh YS, Patidar R, Chen L, Scherer S, Bailey MH, Yang CH, Cortes-Sanchez E, Xi Y, Wang J, Wickramasinghe J, Kossenkov AV, Rebecca VW, Sun H, Mashl RJ, Davies SR, Jeon R, Frech C, Randjelovic J, Rosains J, Galimi F, Bertotti A, Lafferty A, O'Farrell AC, Modave E, Lambrechts D, Ter Brugge P, Serra V, Marangoni E, El Botty R, Kim H, Kim JI, Yang HK, Lee C, Dean DA, Davis-Dusenbery B, Evrard YA, Doroshow JH, Welm AL, Welm BE, Lewis MT, Fang B, Roth JA, Meric-Bernstam F, Herlyn M, Davies MA, Ding L, Li S, Govindan R, Isella C, Moscow JA, Trusolino L, Byrne AT, Jonkers J, Bult CJ, Medico E, Chuang JH. Conservation of copy number profiles during engraftment and passaging of patient-derived cancer xenografts. Nat Genet 2021; 53:86-99. [PMID: 33414553 PMCID: PMC7808565 DOI: 10.1038/s41588-020-00750-6] [Citation(s) in RCA: 109] [Impact Index Per Article: 36.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2019] [Accepted: 11/18/2020] [Indexed: 02/03/2023]
Abstract
Patient-derived xenografts (PDXs) are resected human tumors engrafted into mice for preclinical studies and therapeutic testing. It has been proposed that the mouse host affects tumor evolution during PDX engraftment and propagation, affecting the accuracy of PDX modeling of human cancer. Here, we exhaustively analyze copy number alterations (CNAs) in 1,451 PDX and matched patient tumor (PT) samples from 509 PDX models. CNA inferences based on DNA sequencing and microarray data displayed substantially higher resolution and dynamic range than gene expression-based inferences, and they also showed strong CNA conservation from PTs through late-passage PDXs. CNA recurrence analysis of 130 colorectal and breast PT/PDX-early/PDX-late trios confirmed high-resolution CNA retention. We observed no significant enrichment of cancer-related genes in PDX-specific CNAs across models. Moreover, CNA differences between patient and PDX tumors were comparable to variations in multiregion samples within patients. Our study demonstrates the lack of systematic copy number evolution driven by the PDX mouse host.
Collapse
Grants
- NC/T001267/1 National Centre for the Replacement, Refinement and Reduction of Animals in Research
- P30 CA016672 NCI NIH HHS
- 29567 Cancer Research UK
- U54 CA233223 NCI NIH HHS
- P30 CA034196 NCI NIH HHS
- P01 CA114046 NCI NIH HHS
- HHSN261201400008C NCI NIH HHS
- P30 CA091842 NCI NIH HHS
- U24 CA224067 NCI NIH HHS
- P50 CA196510 NCI NIH HHS
- U54 CA224070 NCI NIH HHS
- U54 CA224076 NCI NIH HHS
- U54 CA224065 NCI NIH HHS
- U54 CA233306 NCI NIH HHS
- P30 CA010815 NCI NIH HHS
- U24 CA204781 NCI NIH HHS
- U54 CA224083 NCI NIH HHS
- HHSN261201500003C NCI NIH HHS
- HHSN261200800001C NCI NIH HHS
- T32 HG008962 NHGRI NIH HHS
- R50 CA211199 NCI NIH HHS
- P30 CA125123 NCI NIH HHS
- P50 CA070907 NCI NIH HHS
- HHSN261201500003I NCI NIH HHS
- HHSN261200800001E NCI NIH HHS
- P30 CA042014 NCI NIH HHS
- U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)
- KWF Kankerbestrijding (Dutch Cancer Society)
- Oncode Institute
- Fondazione AIRC under 5 per Mille 2018 - ID. 21091 EU H2020 Research and Innovation Programme, grant agreement no. 731105 European Research Council Consolidator Grant 724748
- EU H2020 Research and Innovation Programme, grant Agreement No. 754923
- EU H2020 Research and Innovation Programme, grant agreement no. 731105 ISCIII - Miguel Servet program CP14/00228 GHD-Pink/FERO Foundation grant
- Fondazione Piemontese per la Ricerca sul Cancro-ONLUS 5 per mille Ministero della Salute 2015
- Korean Health Industry Development Institute HI13C2148
- Korean Health Industry Development Institute HI13C2148 The First Affiliated Hospital of Xi’an Jiaotong University Ewha Womans University Research Grant
- CPRIT RP170691
- SCU | Ignatian Center for Jesuit Education, Santa Clara University
- Breast Cancer Research Foundation (BCRF)
- Fashion Footwear Charitable Foundation of New York The Foundation for Barnes-Jewish Hospital’s Cancer Frontier Fund
- My First AIRC Grant 19047
- Fondazione AIRC under 5 per Mille 2018 - ID. 21091 AIRC Investigator Grants 18532 and 20697 AIRC/CRUK/FC AECC Accelerator Award 22795 Fondazione Piemontese per la Ricerca sul Cancro-ONLUS 5 per mille Ministero della Salute 2015, 2014, 2016 EU H2020 Research and Innovation Programme, grant Agreement No. 754923 EU H2020 Research and Innovation Programme, grant agreement no. 731105
- Science Foundation Ireland (SFI)
- EU H2020 Research and Innovation Programme, grant agreement no. 731105 EU H2020 Research and Innovation Programme, grant Agreement No. 754923 Irish Health Research Board grant ILP-POR-2019-066
- Nederlandse Organisatie voor Wetenschappelijk Onderzoek (Netherlands Organisation for Scientific Research)
- EU H2020 Research and Innovation Programme, grant agreement no. 731105 European Research Council (ERC) Synergy project CombatCancer Oncode Institute
Collapse
Affiliation(s)
- Xing Yi Woo
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Jessica Giordano
- Department of Oncology, University of Turin, Turin, Italy
- Candiolo Cancer Institute, FPO-IRCCS, Turin, Italy
| | - Anuj Srivastava
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Zi-Ming Zhao
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Michael W Lloyd
- The Jackson Laboratory for Mammalian Genetics, Bar Harbor, ME, USA
| | | | - Yun-Suhk Suh
- College of Medicine, Seoul National University, Seoul, Republic of Korea
| | - Rajesh Patidar
- Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Li Chen
- Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Sandra Scherer
- Department of Oncological Sciences, Huntsman Cancer Institute, University of Utah, Salt Lake City, UT, USA
| | - Matthew H Bailey
- Department of Oncological Sciences, Huntsman Cancer Institute, University of Utah, Salt Lake City, UT, USA
- Department of Human Genetics, University of Utah, Salt Lake City, UT, USA
| | - Chieh-Hsiang Yang
- Department of Oncological Sciences, Huntsman Cancer Institute, University of Utah, Salt Lake City, UT, USA
| | - Emilio Cortes-Sanchez
- Department of Oncological Sciences, Huntsman Cancer Institute, University of Utah, Salt Lake City, UT, USA
| | - Yuanxin Xi
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Jing Wang
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | | | | | | | - Hua Sun
- Department of Medicine, Washington University School of Medicine in St. Louis, St. Louis, MO, USA
| | - R Jay Mashl
- Department of Medicine, Washington University School of Medicine in St. Louis, St. Louis, MO, USA
| | - Sherri R Davies
- Department of Medicine, Washington University School of Medicine in St. Louis, St. Louis, MO, USA
| | - Ryan Jeon
- Seven Bridges Genomics, Charlestown, MA, USA
| | | | | | | | - Francesco Galimi
- Department of Oncology, University of Turin, Turin, Italy
- Candiolo Cancer Institute, FPO-IRCCS, Turin, Italy
| | - Andrea Bertotti
- Department of Oncology, University of Turin, Turin, Italy
- Candiolo Cancer Institute, FPO-IRCCS, Turin, Italy
| | - Adam Lafferty
- Department of Physiology and Medical Physics, Centre for Systems Medicine, Royal College of Surgeons in Ireland, Dublin, Ireland
| | - Alice C O'Farrell
- Department of Physiology and Medical Physics, Centre for Systems Medicine, Royal College of Surgeons in Ireland, Dublin, Ireland
| | - Elodie Modave
- Center for Cancer Biology, VIB, Leuven, Belgium
- Laboratory of Translational Genetics, Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Diether Lambrechts
- Center for Cancer Biology, VIB, Leuven, Belgium
- Laboratory of Translational Genetics, Department of Human Genetics, KU Leuven, Leuven, Belgium
| | | | - Violeta Serra
- Vall d´Hebron Institute of Oncology, Barcelona, Spain
| | - Elisabetta Marangoni
- Department of Translational Research, Institut Curie, PSL Research University, Paris, France
| | - Rania El Botty
- Department of Translational Research, Institut Curie, PSL Research University, Paris, France
| | - Hyunsoo Kim
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Jong-Il Kim
- College of Medicine, Seoul National University, Seoul, Republic of Korea
| | - Han-Kwang Yang
- College of Medicine, Seoul National University, Seoul, Republic of Korea
| | - Charles Lee
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
- Precision Medicine Center, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, People's Republic of China
- Department of Life Sciences, Ewha Womans University, Seoul, Republic of Korea
| | | | | | - Yvonne A Evrard
- Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - James H Doroshow
- Division of Cancer Treatment and Diagnosis, National Cancer Institute, Bethesda, MD, USA
| | - Alana L Welm
- Department of Oncological Sciences, Huntsman Cancer Institute, University of Utah, Salt Lake City, UT, USA
| | - Bryan E Welm
- Department of Oncological Sciences, Huntsman Cancer Institute, University of Utah, Salt Lake City, UT, USA
- Department of Surgery, Huntsman Cancer Institute, University of Utah, Salt Lake City, UT, USA
| | - Michael T Lewis
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, USA
| | - Bingliang Fang
- Department of Thoracic and Cardiovascular Surgery, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Jack A Roth
- Department of Thoracic and Cardiovascular Surgery, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Funda Meric-Bernstam
- Department of Investigational Cancer Therapeutics, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | | | - Michael A Davies
- Department of Melanoma Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Li Ding
- Department of Medicine, Washington University School of Medicine in St. Louis, St. Louis, MO, USA
| | - Shunqiang Li
- Department of Medicine, Washington University School of Medicine in St. Louis, St. Louis, MO, USA
| | - Ramaswamy Govindan
- Department of Medicine, Washington University School of Medicine in St. Louis, St. Louis, MO, USA
| | - Claudio Isella
- Department of Oncology, University of Turin, Turin, Italy
- Candiolo Cancer Institute, FPO-IRCCS, Turin, Italy
| | - Jeffrey A Moscow
- Investigational Drug Branch, National Cancer Institute, Bethesda, MD, USA
| | - Livio Trusolino
- Department of Oncology, University of Turin, Turin, Italy
- Candiolo Cancer Institute, FPO-IRCCS, Turin, Italy
| | - Annette T Byrne
- Department of Physiology and Medical Physics, Centre for Systems Medicine, Royal College of Surgeons in Ireland, Dublin, Ireland
| | - Jos Jonkers
- Netherlands Cancer Institute, Amsterdam, the Netherlands
| | - Carol J Bult
- The Jackson Laboratory for Mammalian Genetics, Bar Harbor, ME, USA
| | - Enzo Medico
- Department of Oncology, University of Turin, Turin, Italy.
- Candiolo Cancer Institute, FPO-IRCCS, Turin, Italy.
| | - Jeffrey H Chuang
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
| |
Collapse
|
3
|
Billingsley KJ, Bandres-Ciga S, Ding J, Hernandez D, Gibbs JR, Blauwendraat C. MIDN locus structural variants and Parkinson's Disease risk. Ann Clin Transl Neurol 2020; 7:602-603. [PMID: 32212230 PMCID: PMC7187709 DOI: 10.1002/acn3.51012] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Accepted: 02/21/2020] [Indexed: 01/24/2023] Open
Affiliation(s)
- Kimberley J Billingsley
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, 35 Convent Drive, Bethesda, 20892, Maryland, United states
| | - Sara Bandres-Ciga
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, 35 Convent Drive, Bethesda, 20892, Maryland, United states
| | - Jinhui Ding
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, 35 Convent Drive, Bethesda, 20892, Maryland, United states
| | - Dena Hernandez
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, 35 Convent Drive, Bethesda, 20892, Maryland, United states
| | - J Raphael Gibbs
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, 35 Convent Drive, Bethesda, 20892, Maryland, United states
| | - Cornelis Blauwendraat
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, 35 Convent Drive, Bethesda, 20892, Maryland, United states
| | | |
Collapse
|
4
|
Uebe S, Ehrlicher M, Ekici AB, Behrens F, Böhm B, Homuth G, Schurmann C, Völker U, Jünger M, Nauck M, Völzke H, Traupe H, Krawczak M, Burkhardt H, Reis A, Hüffmeier U. Genome-wide association and targeted analysis of copy number variants with psoriatic arthritis in German patients. BMC MEDICAL GENETICS 2017; 18:92. [PMID: 28835222 PMCID: PMC5569473 DOI: 10.1186/s12881-017-0447-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/04/2016] [Accepted: 07/31/2017] [Indexed: 01/16/2023]
Abstract
Background Psoriatic Arthritis (PsA) is a chronic inflammatory disease of the joints. PsA is etiologically complex, and 11 susceptibility loci have been identified so far. Most of these overlap with loci associated with psoriasis vulgaris (PsV), the most common psoriatic skin manifestation which is also frequently seen in PsA patients. In addition, two copy number variants (CNVs) are associated with PsV, one of which, located within the LCE3 gene cluster, is also associated with PsA. Finally, an intergenic deletion has been reported as a PsA-specific CNV. Methods We performed a genome-wide association study (GWAS) of CNVs in PsA and assessed the contribution to disease risk by CNVs at known psoriasis susceptibility loci. Results After stringent quality assessment and validation of CNVs of the GWAS with an alternative quantitative method, two significantly associated CNVs remained, one near UXS1, the other one at the TRB locus. However, MLPA analysis did not confirm the CN state in ~1/3 of individuals, and an analysis of an independent case-control-study failed to confirm the initial associations. Furthermore, detailed PCR-based analysis of the sequence at TRB revealed the existence of a more complex genomic sequence most accurately represented by freeze hg18 which accordingly failed to confirm the hg19 sequence. Only rare CNVs were detected at psoriasis susceptibility loci. At three of 12 susceptibility loci with CNVs (CSMD1, IL12B, RYR2), CN variability was confirmed independently by MLPA. Overall, the rate of CNV confirmation by MLPA was strongly dependent upon CNV type, CNV size and the number of array markers involved in a CNV. Conclusion Although we identified PsA associations at several loci and confirmed that the common CNVs at these sites were real, ~1/3 of the common CNV states could not be reproduced. Furthermore, replication analysis failed to confirm the original association. Furthermore, SNP array-based analyses of CNVs were found to be more reliable for deletions than duplications, independent of the respective CNV allele frequency. CNVs are thus good candidate disease variants, while the methods to detect them should be applied cautiously and reproduced by an independent method. Electronic supplementary material The online version of this article (doi:10.1186/s12881-017-0447-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Steffen Uebe
- Institute of Human Genetics, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Schwabachanlage 10, 91054, Erlangen, Germany
| | - Maria Ehrlicher
- Institute of Human Genetics, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Schwabachanlage 10, 91054, Erlangen, Germany
| | - Arif Bülent Ekici
- Institute of Human Genetics, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Schwabachanlage 10, 91054, Erlangen, Germany
| | - Frank Behrens
- Division of Rheumatology and IME Fraunhofer Project Group Translational Medicine & Pharmacology, Goethe University, Frankfurt/Main, Germany
| | - Beate Böhm
- Division of Rheumatology and IME Fraunhofer Project Group Translational Medicine & Pharmacology, Goethe University, Frankfurt/Main, Germany
| | - Georg Homuth
- Interfaculty Institute for Genetics and Functional Genomics, University Medicine and Ernst-Moritz-Arndt University Greifswald, Greifswald, Germany
| | - Claudia Schurmann
- Interfaculty Institute for Genetics and Functional Genomics, University Medicine and Ernst-Moritz-Arndt University Greifswald, Greifswald, Germany
| | - Uwe Völker
- Interfaculty Institute for Genetics and Functional Genomics, University Medicine and Ernst-Moritz-Arndt University Greifswald, Greifswald, Germany
| | - Michael Jünger
- Clinic of Dermatology, University of Greifswald, Greifswald, Germany
| | - Matthias Nauck
- Institute of Clinical Chemistry and Laboratory Medicine, University of Greifswald, Greifswald, Germany
| | - Henry Völzke
- Institute for Community Medicine, University of Greifswald, Greifswald, Germany
| | - Heiko Traupe
- Department of Dermatology, University of Münster, Münster, Germany
| | - Michael Krawczak
- Institute for Medical Informatics and Statistics, Christian-Albrechts University Kiel, Kiel, Germany
| | - Harald Burkhardt
- Division of Rheumatology and IME Fraunhofer Project Group Translational Medicine & Pharmacology, Goethe University, Frankfurt/Main, Germany
| | - André Reis
- Institute of Human Genetics, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Schwabachanlage 10, 91054, Erlangen, Germany
| | - Ulrike Hüffmeier
- Institute of Human Genetics, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Schwabachanlage 10, 91054, Erlangen, Germany.
| |
Collapse
|
5
|
Castellani CA, Melka MG, Wishart AE, Locke MEO, Awamleh Z, O'Reilly RL, Singh SM. Biological relevance of CNV calling methods using familial relatedness including monozygotic twins. BMC Bioinformatics 2014; 15:114. [PMID: 24750645 PMCID: PMC4021055 DOI: 10.1186/1471-2105-15-114] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2014] [Accepted: 04/14/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Studies involving the analysis of structural variation including Copy Number Variation (CNV) have recently exploded in the literature. Furthermore, CNVs have been associated with a number of complex diseases and neurodevelopmental disorders. Common methods for CNV detection use SNP, CNV, or CGH arrays, where the signal intensities of consecutive probes are used to define the number of copies associated with a given genomic region. These practices pose a number of challenges that interfere with the ability of available methods to accurately call CNVs. It has, therefore, become necessary to develop experimental protocols to test the reliability of CNV calling methods from microarray data so that researchers can properly discriminate biologically relevant data from noise. RESULTS We have developed a workflow for the integration of data from multiple CNV calling algorithms using the same array results. It uses four CNV calling programs: PennCNV (PC), Affymetrix® Genotyping Console™ (AGC), Partek® Genomics Suite™ (PGS) and Golden Helix SVS™ (GH) to analyze CEL files from the Affymetrix® Human SNP 6.0 Array™. To assess the relative suitability of each program, we used individuals of known genetic relationships. We found significant differences in CNV calls obtained by different CNV calling programs. CONCLUSIONS Although the programs showed variable patterns of CNVs in the same individuals, their distribution in individuals of different degrees of genetic relatedness has allowed us to offer two suggestions. The first involves the use of multiple algorithms for the detection of the largest possible number of CNVs, and the second suggests the use of PennCNV over all other methods when the use of only one software program is desirable.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Shiva M Singh
- Department of Biology, The University of Western Ontario, London N6A 5B7, ON, Canada.
| |
Collapse
|
6
|
Chen YH, Lu RB, Hung H, Kuo PH. Identifying Potential Regions of Copy Number Variation for Bipolar Disorder. MICROARRAYS 2014; 3:52-71. [PMID: 27605030 PMCID: PMC5003455 DOI: 10.3390/microarrays3010052] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/01/2013] [Revised: 02/10/2014] [Accepted: 02/12/2014] [Indexed: 11/16/2022]
Abstract
Bipolar disorder is a complex psychiatric disorder with high heritability, but its genetic determinants are still largely unknown. Copy number variation (CNV) is one of the sources to explain part of the heritability. However, it is a challenge to estimate discrete values of the copy numbers using continuous signals calling from a set of markers, and to simultaneously perform association testing between CNVs and phenotypic outcomes. The goal of the present study is to perform a series of data filtering and analysis procedures using a DNA pooling strategy to identify potential CNV regions that are related to bipolar disorder. A total of 200 normal controls and 200 clinically diagnosed bipolar patients were recruited in this study, and were randomly divided into eight control and eight case pools. Genome-wide genotyping was employed using Illumina Human Omni1-Quad array with approximately one million markers for CNV calling. We aimed at setting a series of criteria to filter out the signal noise of marker data and to reduce the chance of false-positive findings for CNV regions. We first defined CNV regions for each pool. Potential CNV regions were reported based on the different patterns of CNV status between cases and controls. Genes that were mapped into the potential CNV regions were examined with association testing, Gene Ontology enrichment analysis, and checked with existing literature for their associations with bipolar disorder. We reported several CNV regions that are related to bipolar disorder. Two CNV regions on chromosome 11 and 22 showed significant signal differences between cases and controls (p < 0.05). Another five CNV regions on chromosome 6, 9, and 19 were overlapped with results in previous CNV studies. Experimental validation of two CNV regions lent some support to our reported findings. Further experimental and replication studies could be designed for these selected regions.
Collapse
Affiliation(s)
- Yi-Hsuan Chen
- Department of Public Health & Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei 100, Taiwan.
| | - Ru-Band Lu
- Department of Psychiatry, College of Medicine & Hospital, National Cheng Kung University, Tainan 704, Taiwan.
| | - Hung Hung
- Department of Public Health & Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei 100, Taiwan.
- Research Center for Genes, Environment and Human Health, National Taiwan University, Taipei 100, Taiwan.
| | - Po-Hsiu Kuo
- Department of Public Health & Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei 100, Taiwan.
- Research Center for Genes, Environment and Human Health, National Taiwan University, Taipei 100, Taiwan.
| |
Collapse
|
7
|
Copy number variation distribution in six monozygotic twin pairs discordant for schizophrenia. Twin Res Hum Genet 2014; 17:108-20. [PMID: 24556202 DOI: 10.1017/thg.2014.6] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
We have evaluated copy number variants (CNVs) in six monozygotic twin pairs discordant for schizophrenia. The data from Affymetrix® Human SNP 6.0 arrays™ were analyzed using Affymetrix® Genotyping Console™, Partek® Genomics Suite™, PennCNV, and Golden Helix SVS™. This yielded both program-specific and overlapping results. Only CNVs called by Affymetrix Genotyping Console, Partek Genomics Suite, and PennCNV were used in further analysis. This analysis included an assessment of calls in each of the six twin pairs towards identification of unique CNVs in affected and unaffected co-twins. Real time polymerase chain reaction (PCR) experiments confirmed one CNV loss at 7q11.21 that was found in the affected patient but not in the unaffected twin. The results identified CNVs and genes that were previously implicated in mental abnormalities in four of the six twin pairs. It included PYY (twin pairs 1 and 5), EPHA3 (twin pair 3), KIAA1211L (twin pair 4), and GPR139 (twin pair 5). They represent likely candidate genes and CNVs for the discordance of four of the six monozygotic twin pairs for this heterogeneous neurodevelopmental disorder. An explanation for these differences is ontogenetic de novo events that differentiate in the monozygotic twins during development.
Collapse
|
8
|
Abstract
MOTIVATION Data quality is a critical issue in the analyses of DNA copy number alterations obtained from microarrays. It is commonly assumed that copy number alteration data can be modeled as piecewise constant and the measurement errors of different probes are independent. However, these assumptions do not always hold in practice. In some published datasets, we find that measurement errors are highly correlated between probes that interrogate nearby genomic loci, and the piecewise-constant model does not fit the data well. The correlated errors cause problems in downstream analysis, leading to a large number of DNA segments falsely identified as having copy number gains and losses. METHOD We developed a simple tool, called autocorrelation scanning profile, to assess the dependence of measurement error between neighboring probes. RESULTS Autocorrelation scanning profile can be used to check data quality and refine the analysis of DNA copy number data, which we demonstrate in some typical datasets. CONTACT lzhangli@mdanderson.org. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Liangcai Zhang
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX 77230, USA and Department of Biophysics, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | | |
Collapse
|
9
|
Kim YK, Moon S, Hwang MY, Kim DJ, Oh JH, Kim YJ, Han BG, Lee JY, Kim BJ. Gene-based copy number variation study reveals a microdeletion at 12q24 that influences height in the Korean population. Genomics 2012; 101:134-8. [PMID: 23147675 DOI: 10.1016/j.ygeno.2012.11.002] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2012] [Revised: 10/25/2012] [Accepted: 11/01/2012] [Indexed: 11/18/2022]
Abstract
Height is a classic polygenic trait with high heritability (h(2)=0.8). Recent genome-wide association studies have revealed many independent loci associated with human height. In addition, although many studies have reported an association between copy number variation (CNV) and complex diseases, few have explored the relationship between CNV and height. Recent studies reported that single nucleotide polymorphisms (SNPs) are highly correlated with common CNVs, suggesting that it is warranted to survey CNVs to identify additional genetic factors affecting heritable traits such as height. This study tested the hypothesis that there would be CNV regions (CNVRs) associated with height nearby genes from the GWASs known to affect height. We identified regions containing >1% copy number deletion frequency from 3667 population-based cohort samples using the Illumina HumanOmni1-Quad BeadChip. Among the identified CNVRs, we selected 15 candidate regions that were located within 1Mb of 283 previously reported genes. To assess the effect of these CNVRs on height, statistical analyses were conducted with samples from a case group of 370 taller (upper 10%) individuals and a control group of 1828 individuals (lower 50%). We found that a newly identified 17.7 kb deletion at chromosomal position 12q24.33, approximately 171.6 kb downstream of GPR133, significantly correlated with height; this finding was validated using quantitative PCR. These results suggest that CNVs are potentially important in determining height and may contribute to height variation in human populations.
Collapse
Affiliation(s)
- Yun Kyoung Kim
- Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health, Chungcheongbuk-do, 363-951, Republic of Korea
| | - Sanghoon Moon
- Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health, Chungcheongbuk-do, 363-951, Republic of Korea
| | - Mi Yeong Hwang
- Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health, Chungcheongbuk-do, 363-951, Republic of Korea
| | - Dong-Joon Kim
- Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health, Chungcheongbuk-do, 363-951, Republic of Korea
| | - Ji Hee Oh
- Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health, Chungcheongbuk-do, 363-951, Republic of Korea
| | - Young Jin Kim
- Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health, Chungcheongbuk-do, 363-951, Republic of Korea
| | - Bok-Ghee Han
- Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health, Chungcheongbuk-do, 363-951, Republic of Korea
| | - Jong-Young Lee
- Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health, Chungcheongbuk-do, 363-951, Republic of Korea
| | - Bong-Jo Kim
- Division of Structural and Functional Genomics, Center for Genome Science, National Institute of Health, Chungcheongbuk-do, 363-951, Republic of Korea.
| |
Collapse
|
10
|
Kim SY, Kim JH, Chung YJ. Effect of Combining Multiple CNV Defining Algorithms on the Reliability of CNV Calls from SNP Genotyping Data. Genomics Inform 2012; 10:194-9. [PMID: 23166530 PMCID: PMC3492655 DOI: 10.5808/gi.2012.10.3.194] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2012] [Revised: 08/20/2012] [Accepted: 08/23/2012] [Indexed: 01/11/2023] Open
Abstract
In addition to single-nucleotide polymorphisms (SNP), copy number variation (CNV) is a major component of human genetic diversity. Among many whole-genome analysis platforms, SNP arrays have been commonly used for genomewide CNV discovery. Recently, a number of CNV defining algorithms from SNP genotyping data have been developed; however, due to the fundamental limitation of SNP genotyping data for the measurement of signal intensity, there are still concerns regarding the possibility of false discovery or low sensitivity for detecting CNVs. In this study, we aimed to verify the effect of combining multiple CNV calling algorithms and set up the most reliable pipeline for CNV calling with Affymetrix Genomewide SNP 5.0 data. For this purpose, we selected the 3 most commonly used algorithms for CNV segmentation from SNP genotyping data, PennCNV, QuantiSNP; and BirdSuite. After defining the CNV loci using the 3 different algorithms, we assessed how many of them overlapped with each other, and we also validated the CNVs by genomic quantitative PCR. Through this analysis, we proposed that for reliable CNV-based genomewide association study using SNP array data, CNV calls must be performed with at least 3 different algorithms and that the CNVs consistently called from more than 2 algorithms must be used for association analysis, because they are more reliable than the CNVs called from a single algorithm. Our result will be helpful to set up the CNV analysis protocols for Affymetrix Genomewide SNP 5.0 genotyping data.
Collapse
Affiliation(s)
- Soon-Young Kim
- Integrated Research Center for Genome Polymorphism, The Catholic University of Korea School of Medicine, Seoul 137-701, Korea. ; Department of Microbiology, The Catholic University of Korea School of Medicine, Seoul 137-701, Korea
| | | | | |
Collapse
|
11
|
D'Angelo CS, Koiffmann CP. Copy number variants in obesity-related syndromes: review and perspectives on novel molecular approaches. J Obes 2012; 2012:845480. [PMID: 23316347 PMCID: PMC3534325 DOI: 10.1155/2012/845480] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/30/2012] [Accepted: 10/09/2012] [Indexed: 02/07/2023] Open
Abstract
In recent decades, obesity has reached epidemic proportions worldwide and became a major concern in public health. Despite heritability estimates of 40 to 70% and the long-recognized genetic basis of obesity in a number of rare cases, the list of common obesity susceptibility variants by the currently published genome-wide association studies (GWASs) only explain a small proportion of the individual variation in risk of obesity. It was not until very recently that GWASs of copy number variants (CNVs) in individuals with extreme phenotypes reported a number of large and rare CNVs conferring high risk to obesity, and specifically deletions on chromosome 16p11.2. In this paper, we comment on the recent advances in the field of genetics of obesity with an emphasis on the genes and genomic regions implicated in highly penetrant forms of obesity associated with developmental disorders. Array genomic hybridization in this patient population has afforded discovery opportunities for CNVs that have not previously been detectable. This information can be used to generate new diagnostic arrays and sequencing platforms, which will likely enhance detection of known genetic conditions with the potential to elucidate new disease genes and ultimately help in developing a next-generation sequencing protocol relevant to clinical practice.
Collapse
Affiliation(s)
- Carla Sustek D'Angelo
- Human Genome and Stem Cell Center, Department of Genetics and Evolutionary Biology, Institute of Biosciences, University of Sao Paulo, 277 Rua do Matao, Rooms 204 and 209, 05508-090 Sao Paulo, SP, Brazil.
| | | |
Collapse
|