1
|
Mougeot JLC, Beckman MF, Alexander AS, Hovan AJ, Hasséus B, Legert KG, Johansson JE, von Bültzingslöwen I, Brennan MT, Mougeot FB. Single nucleotide polymorphisms conferring susceptibility to leukemia and oral mucositis: a multi-center pilot study of patients prior to conditioning therapy for hematopoietic cell transplant. Support Care Cancer 2024; 32:220. [PMID: 38467943 DOI: 10.1007/s00520-024-08408-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Accepted: 02/26/2024] [Indexed: 03/13/2024]
Abstract
PURPOSE Leukemias have been associated with oral manifestations, reflecting susceptibility to cancer therapy-induced oral mucositis. We sought to identify SNPs associated with both leukemia and oral mucositis (OM). METHODS Whole exome sequencing was performed on leukemia and non-cancer blood disorder (ncBD) patients' saliva samples (N = 50) prior to conditioning therapy. WHO OM grading scores were determined: moderate to severe (OM2-4) vs. none to mild (OM0-1). Reads were processed using Trim Galorev0.6.7, Bowtie2v2.4.1, Samtoolsv1.10, Genome Analysis Toolkit (GATK)v4.2.6.1, and DeepVariantv1.4.0. We utilized the following pipelines: P1 analysis with PLINK2v3.7, SNP2GENEv1.4.1 and MAGMAv1.07b, and P2 [leukemia (N = 42) vs. ncBDs (N = 8)] and P3 [leukemia + OM2-4 (N = 18) vs. leukemia + OM0-1 (N = 24)] with Z-tests of genotypes and protein-protein interaction determination. GeneCardsSuitev5.14 was used to identify phenotypes (P1 and P2, leukemia; P3, oral mucositis) and average disease-causing likelihood and DGIdb for drug interactions. P1 and P2 genes were analyzed with CytoScape plugin BiNGOv3.0.3 to retrieve overrepresented Gene Ontology (GO) terms and Ensembl's VEP for SNP outcomes. RESULTS In P1, 457 candidate SNPs (28 genes) were identified and 21,604 SNPs (1016 genes) by MAGMAv1.07b. Eighteen genes were associated with "leukemia" per VarElectv5.14 analysis and predicted to be deleterious. In P2 and P3, 353 and 174 SNPs were significant, respectively. STRINGv12.0 returned 77 and 32 genes (C.L. = 0.7) for P2 and P3, respectively. VarElectv5.14 determined 60 genes from P2 associated with "leukemia" and 11 with "oral mucositis" from P3. Overrepresented GO terms included "cellular process," "signaling," "hemopoiesis," and "regulation of immune response." CONCLUSIONS We identified candidate SNPs possibly conferring susceptibility to develop leukemia and oral mucositis.
Collapse
Affiliation(s)
- Jean-Luc C Mougeot
- Translational Research Laboratories, Department of Oral Medicine/Oral & Maxillofacial Surgery, Atrium Health Carolinas Medical Center, Charlotte, NC, USA.
- Department of Otolaryngology/Head & Neck Surgery, Wake Forest University School of Medicine, Winston-Salem, NC, USA.
| | - Micaela F Beckman
- Translational Research Laboratories, Department of Oral Medicine/Oral & Maxillofacial Surgery, Atrium Health Carolinas Medical Center, Charlotte, NC, USA
- Department of Otolaryngology/Head & Neck Surgery, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - Adam S Alexander
- Translational Research Laboratories, Department of Oral Medicine/Oral & Maxillofacial Surgery, Atrium Health Carolinas Medical Center, Charlotte, NC, USA
- Department of Otolaryngology/Head & Neck Surgery, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - Allan J Hovan
- BC Cancer, Oral Oncology and Dentistry, Vancouver, BC, Canada
| | - Bengt Hasséus
- Department of Oral Medicine and Pathology, University of Gothenburg, Gothenburg, Sweden
| | - Karin Garming Legert
- Department of Dental Medicine, University Dental Clinic, Karolinska Institutet, Huddinge, Sweden
| | - Jan-Erik Johansson
- Department of Hematology and Coagulation, Sahlgrenska University Hospital, Gothenburg, Sweden
| | | | - Michael T Brennan
- Department of Otolaryngology/Head & Neck Surgery, Wake Forest University School of Medicine, Winston-Salem, NC, USA
- Department of Oral Medicine/Oral & Maxillofacial Surgery, Atrium Health Carolinas Medical Center, Charlotte, NC, USA
| | - Farah Bahrani Mougeot
- Translational Research Laboratories, Department of Oral Medicine/Oral & Maxillofacial Surgery, Atrium Health Carolinas Medical Center, Charlotte, NC, USA.
- Department of Otolaryngology/Head & Neck Surgery, Wake Forest University School of Medicine, Winston-Salem, NC, USA.
| |
Collapse
|
2
|
Song C, Pan S, Li D, Hao B, Lu Z, Lai K, Li N, Geng Q. Comprehensive analysis reveals the potential value of inflammatory response genes in the prognosis, immunity, and drug sensitivity of lung adenocarcinoma. BMC Med Genomics 2022; 15:198. [PMID: 36117156 PMCID: PMC9484176 DOI: 10.1186/s12920-022-01340-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Accepted: 08/16/2022] [Indexed: 11/18/2022] Open
Abstract
Background Although the relationship between inflammatory response and tumor has been gradually recognized, the potential implications of of inflammatory response genes in lung adenocarcinoma (LUAD) remains poorly investigated. Methods RNA sequencing and clinical data were obtained from multiple independent datasets (GSE29013, GSE30219, GSE31210, GSE37745, GSE42127, GSE50081, GSE68465, GSE72094, TCGA and GTEx). Unsupervised clustering analysis was used to identify different tumor subtypes, and LASSO and Cox regression analysis were applied to construct a novel scoring tool. We employed multiple algorithms (ssGSEA, CIBERSORT, MCP counter, and ESTIMATE) to better characterize the LUAD tumor microenvironment (TME) and immune landscapes. GSVA and Metascape analysis were performed to investigate the biological processes and pathway activity. Furthermore, ‘pRRophetic’ R package was used to evaluate the half inhibitory concentration (IC50) of each sample to infer drug sensitivity. Results We identified three distinct tumor subtypes, which were related to different clinical outcomes, biological pathways, and immune characteristics. A scoring tool called inflammatory response gene score (IRGS) was established and well validated in multiple independent cohorts, which could well divide patients into two subgroups with significantly different prognosis. High IRGS patients, characterized by increased genomic variants and mutation burden, presented a worse prognosis, and might show a more favorable response to immunotherapy and chemotherapy. Additionally, based on the cross-talk between TNM stage, IRGS and patients clinical outcomes, we redefined the LUAD stage, which was called ‘IRGS-Stage’. The novel staging system could distinguish patients with different prognosis, with better predictive ability than the conventional TNM staging. Conclusions Inflammatory response genes present important potential value in the prognosis, immunity and drug sensitivity of LUAD. The proposed IRGS and IRGS-Stage may be promising biomarkers for estimating clinical outcomes in LUAD patients. Supplementary Information The online version contains supplementary material available at 10.1186/s12920-022-01340-7.
Collapse
|
3
|
CNVs Associated with Different Clinical Phenotypes of Psoriasis and Anti-TNF-Induced Palmoplantar Pustulosis. J Pers Med 2022; 12:jpm12091452. [PMID: 36143237 PMCID: PMC9506507 DOI: 10.3390/jpm12091452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 09/01/2022] [Accepted: 09/02/2022] [Indexed: 11/16/2022] Open
Abstract
Background: Psoriasis can present different phenotypes and could affect diverse body areas. In contrast to the high effectiveness of biological drugs in the treatment of trunk and extremities plaque psoriasis, in palmoplantar phenotypes and in plaque scalp psoriasis, these same drugs usually have reduced efficacy. Anti-TNF drugs could induce the appearance of palmoplantar pustulosis (PPP) in patients with other inflammatory diseases. The objective of this study is to identify if there are DNA Copy Number Variations (CNVs) associated with these different clinical phenotypes, which could justify the differences found in clinical practice. Moreover, we intend to elucidate if anti-TNF-induced PPP has a similar genetic background to idiopathic PPP. Methods: Skin samples were collected from 39 patients with different patterns of psoriasis and six patients with anti-TNF-induced PPP. The CNVs were obtained from methylation array data (Illumina Infinium Human Methylation) using the conumee R package. Results: No significant CNVs were found between the different phenotypes and the locations of psoriasis compared. Nevertheless, we found two significant bins harboring five different genes associated with anti-TNF-induced PPP in patients with a different background other than psoriasis. Conclusions: Our results may help to predict which patients could develop anti-TNF-induced PPP.
Collapse
|
4
|
Zhu C, Fei W, Wang W, Tang L, Gao J, Zhou F. Copy Number Variation Analysis of IL22 and LCE3C in Different Subtypes of Psoriasis in a Chinese Han Population. Med Sci Monit 2021; 27:e934927. [PMID: 34853291 PMCID: PMC8650389 DOI: 10.12659/msm.934927] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2021] [Accepted: 11/04/2021] [Indexed: 12/19/2022] Open
Abstract
BACKGROUND Psoriasis is a chronic, immune-mediated and hyperproliferative skin disease with both genetic and environmental components. Copy number variations (CNV) of IL22 and LCE3C-LCE3B deletion have been confirmed to be predisposed to psoriasis vulgaris (PsV) in several ethnic groups. However, it remains to be clarified whether CNVs of IL22 and LCE3C are associated with different subtypes of psoriasis (psoriatic arthritis, PsA; erythrodermic psoriasis, EP; and generalized pustular psoriasis, GPP). MATERIAL AND METHODS We enrolled 897 Han Chinese individuals, including 478 patients and 419 healthy controls, and detected CNVs of IL22 and LCE3C using the comparative CT method by real-time PCR, and Pearson's χ² test was used to evaluated the copy number difference among subtypes. RESULTS CNVs of IL22 were significantly higher in PsV than in healthy controls (P<0.001). CNV of LCE3C in PsV, PsA, and GPP groups were significantly lower compared to healthy controls. When linked with clinical parameters, mild psoriasis carried less IL22 copy numbers than that in severe psoriasis (P=0.043). Neither IL22 or LCE3C CNVs were associated with age of onset. CONCLUSIONS CNVs of LCE3C and IL22 might differentially contribute to subtypes of psoriasis. These findings suggest complex and diverse genetic variations in and among different clinical subtypes of psoriasis.
Collapse
Affiliation(s)
- Caihong Zhu
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, Anhui, PR China
- Institute of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, Anhui, PR China
- The Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, Anhui, PR China
- Collaborative Innovation Center for Complex and Severe Dermatosis, Anhui Medical University, Hefei, Anhui, PR China
| | - Wenmin Fei
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, Anhui, PR China
- Collaborative Innovation Center for Complex and Severe Dermatosis, Anhui Medical University, Hefei, Anhui, PR China
| | - Wenjun Wang
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, Anhui, PR China
- Institute of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, Anhui, PR China
- The Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, Anhui, PR China
| | - Lili Tang
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, Anhui, PR China
- Institute of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, Anhui, PR China
- The Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, Anhui, PR China
| | - Jinping Gao
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, Anhui, PR China
- Institute of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, Anhui, PR China
- The Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, Anhui, PR China
- Collaborative Innovation Center for Complex and Severe Dermatosis, Anhui Medical University, Hefei, Anhui, PR China
| | - Fusheng Zhou
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, Anhui, PR China
- Institute of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, Anhui, PR China
- The Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, Anhui, PR China
- Collaborative Innovation Center for Complex and Severe Dermatosis, Anhui Medical University, Hefei, Anhui, PR China
| |
Collapse
|
5
|
Zhou J, Li Y, Guo X. Predicting psoriasis using routine laboratory tests with random forest. PLoS One 2021; 16:e0258768. [PMID: 34665828 PMCID: PMC8525763 DOI: 10.1371/journal.pone.0258768] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2020] [Accepted: 10/05/2021] [Indexed: 11/18/2022] Open
Abstract
Psoriasis is a chronic inflammatory skin disease that affects approximately 125 million people worldwide. It has significant impacts on both physical and emotional health-related quality of life comparable to other major illnesses. Accurately prediction of psoriasis using biomarkers from routine laboratory tests has important practical values. Our goal is to derive a powerful predictive model for psoriasis disease based on only routine hospital tests. We collected a data set including 466 psoriasis patients and 520 healthy controls with 81 variables from only laboratory routine tests, such as age, total cholesterol, HDL cholesterol, blood pressure, albumin, and platelet distribution width. In this study, Boruta feature selection method was applied to select the most relevant features, with which a Random Forest model was constructed. The model was tested with 30 repetitions of 10-fold cross-validation. Our classification model yielded an average accuracy of 86.9%. 26 notable features were selected by Boruta, among which 15 features are confirmed from previous studies, and the rest are worth further investigations. The experimental results demonstrate that the machine learning approach has good potential in predictive modeling for the psoriasis disease given the information only from routine hospital tests.
Collapse
Affiliation(s)
- Jing Zhou
- Department of Dermatology, Second Affiliated Hospital of Harbin Medical University, Harbin, PR China
| | - Yuzhen Li
- Department of Dermatology, Second Affiliated Hospital of Harbin Medical University, Harbin, PR China
- * E-mail: (YL); (XG)
| | - Xuan Guo
- Department of Computer Science and Engineering, University of North Texas, Denton, Texas, United States of America
- * E-mail: (YL); (XG)
| |
Collapse
|
6
|
Bartha Á, Győrffy B. Comprehensive Outline of Whole Exome Sequencing Data Analysis Tools Available in Clinical Oncology. Cancers (Basel) 2019; 11:E1725. [PMID: 31690036 PMCID: PMC6895801 DOI: 10.3390/cancers11111725] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Revised: 10/31/2019] [Accepted: 11/01/2019] [Indexed: 12/17/2022] Open
Abstract
Whole exome sequencing (WES) enables the analysis of all protein coding sequences in the human genome. This technology enables the investigation of cancer-related genetic aberrations that are predominantly located in the exonic regions. WES delivers high-throughput results at a reasonable price. Here, we review analysis tools enabling utilization of WES data in clinical and research settings. Technically, WES initially allows the detection of single nucleotide variants (SNVs) and copy number variations (CNVs), and data obtained through these methods can be combined and further utilized. Variant calling algorithms for SNVs range from standalone tools to machine learning-based combined pipelines. Tools for CNV detection compare the number of reads aligned to a dedicated segment. Both SNVs and CNVs help to identify mutations resulting in pharmacologically druggable alterations. The identification of homologous recombination deficiency enables the use of PARP inhibitors. Determining microsatellite instability and tumor mutation burden helps to select patients eligible for immunotherapy. To pave the way for clinical applications, we have to recognize some limitations of WES, including its restricted ability to detect CNVs, low coverage compared to targeted sequencing, and the missing consensus regarding references and minimal application requirements. Recently, Galaxy became the leading platform in non-command line-based WES data processing. The maturation of next-generation sequencing is reinforced by Food and Drug Administration (FDA)-approved methods for cancer screening, detection, and follow-up. WES is on the verge of becoming an affordable and sufficiently evolved technology for everyday clinical use.
Collapse
Affiliation(s)
- Áron Bartha
- Semmelweis University, Department of Bioinformatics and 2nd Department of Pediatrics, H-1094 Budapest, Hungary.
- TTK Cancer Biomarker Research Group, Institute of Enzymology, Magyar tudósokkörútja 2., H-1117 Budapest, Hungary.
| | - Balázs Győrffy
- Semmelweis University, Department of Bioinformatics and 2nd Department of Pediatrics, H-1094 Budapest, Hungary.
- TTK Cancer Biomarker Research Group, Institute of Enzymology, Magyar tudósokkörútja 2., H-1117 Budapest, Hungary.
| |
Collapse
|
7
|
Zhou F, Shen C, Hsu YH, Gao J, Dou J, Ko R, Zheng X, Sun L, Cui Y, Zhang X. DNA methylation-based subclassification of psoriasis in the Chinese Han population. Front Med 2018; 12:717-725. [PMID: 29623515 DOI: 10.1007/s11684-017-0588-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2017] [Accepted: 08/24/2017] [Indexed: 02/08/2023]
Abstract
Psoriasis (Ps) is an inflammatory skin disease caused by genetic and environmental factors. Previous studies on DNA methylation (DNAm) found genetic markers that are closely associated with Ps, and evidence has shown that DNAm mediates genetic risk in Ps. In this study, Consensus Clustering was used to analyze DNAm data, and 114 Ps patients were divided into three subclassifications. Investigation of the clinical characteristics and copy number variations (CNVs) of DEFB4, IL22, and LCE3C in the three subclassifications revealed no significant differences in gender ratio and in Ps area and severity index (PASI) score. The proportion of late-onset ( ≥ 40 years) Ps patients was significantly higher in type I than in types II and III (P = 0.035). Type III contained the smallest proportion of smokers and the largest proportion of non-smoking Ps patients (P = 0.086). The CNVs of DEFB4 and LCE3C showed no significant differences but the CNV of IL22 significantly differed among the three subclassifications (P = 0.044). This study is the first to profile Ps subclassifications based on DNAm data in the Chinese Han population. These results are useful in the treatment and management of Ps from the molecular and genetic perspectives.
Collapse
Affiliation(s)
- Fusheng Zhou
- Institute of Dermatology, The First Affiliated Hospital, Anhui Medical University, Hefei, 230032, China.
- The Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, 230032, China.
- Collaborative Innovation Center for Complex and Severe Dermatosis, Anhui Medical University, Hefei, 230032, China.
| | - Changbing Shen
- Department of Dermatology, China-Japan Friendship Hospital, Beijing, 100029, China
- Graduate School, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing, 100730, China
- Department of Dermatology, The First Affiliated Hospital, Anhui Medical University, Hefei, 230032, China
- Molecular and Integrative Physiological Sciences, Harvard T.H. CHAN School of Public Health, Boston, MA, 02115, USA
- Hebrew SeniorLife Institute for Aging Research and Harvard Medical School, Boston, MA, 02131, USA
| | - Yi-Hsiang Hsu
- Molecular and Integrative Physiological Sciences, Harvard T.H. CHAN School of Public Health, Boston, MA, 02115, USA
- Hebrew SeniorLife Institute for Aging Research and Harvard Medical School, Boston, MA, 02131, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Jing Gao
- Department of Dermatology, The Second Affiliated Hospital, Anhui Medical University, Hefei, 230601, China
| | - Jinfa Dou
- Institute of Dermatology, The First Affiliated Hospital, Anhui Medical University, Hefei, 230032, China
- The Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, 230032, China
- Collaborative Innovation Center for Complex and Severe Dermatosis, Anhui Medical University, Hefei, 230032, China
- Department of Dermatology, The First Affiliated Hospital, Anhui Medical University, Hefei, 230032, China
| | - Randy Ko
- Department of Biochemistry, University of New Mexico, Albuquerque, NM, 87131, USA
| | - Xiaodong Zheng
- Institute of Dermatology, The First Affiliated Hospital, Anhui Medical University, Hefei, 230032, China
- The Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, 230032, China
- Collaborative Innovation Center for Complex and Severe Dermatosis, Anhui Medical University, Hefei, 230032, China
- Department of Dermatology, The First Affiliated Hospital, Anhui Medical University, Hefei, 230032, China
| | - Liangdan Sun
- Institute of Dermatology, The First Affiliated Hospital, Anhui Medical University, Hefei, 230032, China
- The Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, 230032, China
- Collaborative Innovation Center for Complex and Severe Dermatosis, Anhui Medical University, Hefei, 230032, China
- Department of Dermatology, The First Affiliated Hospital, Anhui Medical University, Hefei, 230032, China
| | - Yong Cui
- Department of Dermatology, China-Japan Friendship Hospital, Beijing, 100029, China
- Graduate School, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing, 100730, China
| | - Xuejun Zhang
- Institute of Dermatology, The First Affiliated Hospital, Anhui Medical University, Hefei, 230032, China.
- The Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, 230032, China.
- Collaborative Innovation Center for Complex and Severe Dermatosis, Anhui Medical University, Hefei, 230032, China.
- Department of Dermatology, The First Affiliated Hospital, Anhui Medical University, Hefei, 230032, China.
- Department of Dermatology, The Second Affiliated Hospital, Anhui Medical University, Hefei, 230601, China.
| |
Collapse
|
8
|
Genomic alterations driving psoriasis pathogenesis. Gene 2018; 683:61-71. [PMID: 30287254 DOI: 10.1016/j.gene.2018.09.042] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2018] [Revised: 08/22/2018] [Accepted: 09/22/2018] [Indexed: 11/23/2022]
Abstract
Psoriasis is an immune mediated inflammatory skin disease with complex etiology involving interplay between environmental and genetic risk factors as disease initiating event. Enhanced understanding on genetic risk factors, differentially expressed genes, deregulated proteins and pathway-targeted therapeutics have established multiple axis of psoriasis pathogenesis. So far, loci in 424 genes are reported to be associated with psoriasis alongside copy number variations and epigenetic alterations. From clinical perspective, presence of specific genetic trigger(s) in individual psoriasis patient could aid in devising a personalized therapeutic strategy. Therefore, the review presents an updates on reported genomic alterations and their subsequent course of cutaneous inflammations that potentially drive to psoriasis.
Collapse
|
9
|
|
10
|
Abstract
Differences between genomes can be due to single nucleotide variants (SNPs), translocations, inversions and copy number variants (CNVs, gain or loss of DNA). The latter can range from sub-microscopic events to complete chromosomal aneuploidies. Small CNVs are often benign but those larger than 250 kb are strongly associated with morbid consequences such as developmental disorders and cancer. Detecting CNVs within and between populations is essential to better understand the plasticity of our genome and to elucidate its possible contribution to disease or phenotypic traits.While the link between SNPs and disease susceptibility has been well studied, to date there are still very few published CNV genome-wide association studies; probably owing to the fact that CNV analysis remains a slightly more complex task than SNP analysis (both in term of bioinformatics workflow and uncertainty in the CNV calling leading to high false positive rates and unknown false negative rates). This chapter aims at explaining computational methods for the analysis of CNVs, ranging from study design, data processing and quality control, up to genome-wide association study with clinical traits.
Collapse
Affiliation(s)
- Aurélien Macé
- Institute of Social and Preventive Medicine, University Hospital of Lausanne, Lausanne, Switzerland.,Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Zoltán Kutalik
- Institute of Social and Preventive Medicine, University Hospital of Lausanne, Lausanne, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | | |
Collapse
|
11
|
Tom JA, Reeder J, Forrest WF, Graham RR, Hunkapiller J, Behrens TW, Bhangale TR. Identifying and mitigating batch effects in whole genome sequencing data. BMC Bioinformatics 2017; 18:351. [PMID: 28738841 PMCID: PMC5525370 DOI: 10.1186/s12859-017-1756-z] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2017] [Accepted: 07/12/2017] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Large sample sets of whole genome sequencing with deep coverage are being generated, however assembling datasets from different sources inevitably introduces batch effects. These batch effects are not well understood and can be due to changes in the sequencing protocol or bioinformatics tools used to process the data. No systematic algorithms or heuristics exist to detect and filter batch effects or remove associations impacted by batch effects in whole genome sequencing data. RESULTS We describe key quality metrics, provide a freely available software package to compute them, and demonstrate that identification of batch effects is aided by principal components analysis of these metrics. To mitigate batch effects, we developed new site-specific filters that identified and removed variants that falsely associated with the phenotype due to batch effect. These include filtering based on: a haplotype based genotype correction, a differential genotype quality test, and removing sites with missing genotype rate greater than 30% after setting genotypes with quality scores less than 20 to missing. This method removed 96.1% of unconfirmed genome-wide significant SNP associations and 97.6% of unconfirmed genome-wide significant indel associations. We performed analyses to demonstrate that: 1) These filters impacted variants known to be disease associated as 2 out of 16 confirmed associations in an AMD candidate SNP analysis were filtered, representing a reduction in power of 12.5%, 2) In the absence of batch effects, these filters removed only a small proportion of variants across the genome (type I error rate of 3%), and 3) in an independent dataset, the method removed 90.2% of unconfirmed genome-wide SNP associations and 89.8% of unconfirmed genome-wide indel associations. CONCLUSIONS Researchers currently do not have effective tools to identify and mitigate batch effects in whole genome sequencing data. We developed and validated methods and filters to address this deficiency.
Collapse
Affiliation(s)
- Jennifer A Tom
- Bioinformatics and Computational Biology Department, Genentech Inc, 1 DNA Way, South San Francisco, CA, 94080, USA.
| | - Jens Reeder
- Bioinformatics and Computational Biology Department, Genentech Inc, 1 DNA Way, South San Francisco, CA, 94080, USA
| | - William F Forrest
- Bioinformatics and Computational Biology Department, Genentech Inc, 1 DNA Way, South San Francisco, CA, 94080, USA
| | - Robert R Graham
- Human Genetics Department, Genentech Inc, 1 DNA Way, South San Francisco, CA, 94080, USA
| | - Julie Hunkapiller
- Human Genetics Department, Genentech Inc, 1 DNA Way, South San Francisco, CA, 94080, USA
| | - Timothy W Behrens
- Human Genetics Department, Genentech Inc, 1 DNA Way, South San Francisco, CA, 94080, USA
| | - Tushar R Bhangale
- Bioinformatics and Computational Biology Department, Genentech Inc, 1 DNA Way, South San Francisco, CA, 94080, USA.,Human Genetics Department, Genentech Inc, 1 DNA Way, South San Francisco, CA, 94080, USA
| |
Collapse
|
12
|
Peng CH, Liao CT, Ng KP, Tai AS, Peng SC, Yeh JP, Chen SJ, Tsao KC, Yen TC, Hsieh WP. Somatic copy number alterations detected by ultra-deep targeted sequencing predict prognosis in oral cavity squamous cell carcinoma. Oncotarget 2016; 6:19891-906. [PMID: 26087196 PMCID: PMC4637328 DOI: 10.18632/oncotarget.4336] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2015] [Accepted: 05/23/2015] [Indexed: 12/20/2022] Open
Abstract
Background Ultra-deep targeted sequencing (UDT-Seq) has advanced our knowledge on the incidence and functional significance of somatic mutations. However, the utility of UDT-Seq in detecting copy number alterations (CNAs) remains unclear. With the goal of improving molecular prognostication and identifying new therapeutic targets, we designed this study to assess whether UDT-Seq may be useful for detecting CNA in oral cavity squamous cell carcinoma (OSCC). Methods We sequenced a panel of clinically actionable cancer mutations in 310 formalin-fixed paraffin-embedded OSCC specimens. A linear model was developed to overcome uneven coverage across target regions and multiple samples. The 5-year rates of secondary primary tumors, local recurrence, neck recurrence, distant metastases, and survival served as the outcome measures. We confirmed the prognostic significance of the CNA signatures in an independent sample of 105 primary OSCC specimens. Results The CNA burden across 10 targeted genes was found to predict prognosis in two independent cohorts. FGFR1 and PIK3CAamplifications were associated with prognosis independent of clinical risk factors. Genes exhibiting CNA were clustered in the proteoglycan metabolism, the FOXO signaling, and the PI3K-AKT signaling pathways, for which targeted drugs are already available or currently under development. Conclusions UDT-Seq is clinically useful to identify CNA, which significantly improve the prognostic information provided by traditional clinicopathological risk factors in OSCC patients.
Collapse
Affiliation(s)
- Chien-Hua Peng
- Departments of Resource Center for Clinical Research, Chang Gung Memorial Hospital, Taoyuan, Taiwan, R.O.C
| | - Chun-Ta Liao
- Otorhinolaryngology, Head and Neck Surgery, Chang Gung Memorial Hospital, Taoyuan, Taiwan, R.O.C.,Head and Neck Oncology Group, Chang Gung Memorial Hospital, Taoyuan, Taiwan, R.O.C
| | - Ka-Pou Ng
- Institute of Statistics, National Tsing Hua University, Hsinchu, Taiwan, R.O.C
| | - An-Shun Tai
- Institute of Statistics, National Tsing Hua University, Hsinchu, Taiwan, R.O.C
| | - Shih-Chi Peng
- Department of Nuclear Medicine and Molecular Imaging Center, Chang Gung Memorial Hospital, Taoyuan, Taiwan, R.O.C
| | - Jen-Pao Yeh
- Institute of Statistics, National Tsing Hua University, Hsinchu, Taiwan, R.O.C
| | - Shu-Jen Chen
- Department of Biomedical Sciences, School of Medicine, Chang Gung University, Taoyuan, Taiwan, R.O.C
| | - Kuo-Chien Tsao
- Medical Biotechnology and Laboratory Science, Research Center for Emerging Viral Infections, Chang Gung Memorial Hospital, Taoyuan, Taiwan, R.O.C.,Laboratory Medicine, Chang Gung Memorial Hospital, Taoyuan, Taiwan, R.O.C
| | - Tzu-Chen Yen
- Department of Nuclear Medicine and Molecular Imaging Center, Chang Gung Memorial Hospital, Taoyuan, Taiwan, R.O.C
| | - Wen-Ping Hsieh
- Otorhinolaryngology, Head and Neck Surgery, Chang Gung Memorial Hospital, Taoyuan, Taiwan, R.O.C
| |
Collapse
|
13
|
Nam JY, Kim NKD, Kim SC, Joung JG, Xi R, Lee S, Park PJ, Park WY. Evaluation of somatic copy number estimation tools for whole-exome sequencing data. Brief Bioinform 2015. [PMID: 26210357 DOI: 10.1093/bib/bbv055] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Whole-exome sequencing (WES) has become a standard method for detecting genetic variants in human diseases. Although the primary use of WES data has been the identification of single nucleotide variations and indels, these data also offer a possibility of detecting copy number variations (CNVs) at high resolution. However, WES data have uneven read coverage along the genome owing to the target capture step, and the development of a robust WES-based CNV tool is challenging. Here, we evaluate six WES somatic CNV detection tools: ADTEx, CONTRA, Control-FREEC, EXCAVATOR, ExomeCNV and Varscan2. Using WES data from 50 kidney chromophobe, 50 bladder urothelial carcinoma, and 50 stomach adenocarcinoma patients from The Cancer Genome Atlas, we compared the CNV calls from the six tools with a reference CNV set that was identified by both single nucleotide polymorphism array 6.0 and whole-genome sequencing data. We found that these algorithms gave highly variable results: visual inspection reveals significant differences between the WES-based segmentation profiles and the reference profile, as well as among the WES-based profiles. Using a 50% overlap criterion, 13-77% of WES CNV calls were covered by CNVs from the reference set, up to 21% of the copy gains were called as losses or vice versa, and dramatic differences in CNV sizes and CNV numbers were observed. Overall, ADTEx and EXCAVATOR had the best performance with relatively high precision and sensitivity. We suggest that the current algorithms for somatic CNV detection from WES data are limited in their performance and that more robust algorithms are needed.
Collapse
|
14
|
Nutsua ME, Fischer A, Nebel A, Hofmann S, Schreiber S, Krawczak M, Nothnagel M. Family-Based Benchmarking of Copy Number Variation Detection Software. PLoS One 2015. [PMID: 26197066 PMCID: PMC4510559 DOI: 10.1371/journal.pone.0133465] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
The analysis of structural variants, in particular of copy-number variations (CNVs), has proven valuable in unraveling the genetic basis of human diseases. Hence, a large number of algorithms have been developed for the detection of CNVs in SNP array signal intensity data. Using the European and African HapMap trio data, we undertook a comparative evaluation of six commonly used CNV detection software tools, namely Affymetrix Power Tools (APT), QuantiSNP, PennCNV, GLAD, R-gada and VEGA, and assessed their level of pair-wise prediction concordance. The tool-specific CNV prediction accuracy was assessed in silico by way of intra-familial validation. Software tools differed greatly in terms of the number and length of the CNVs predicted as well as the number of markers included in a CNV. All software tools predicted substantially more deletions than duplications. Intra-familial validation revealed consistently low levels of prediction accuracy as measured by the proportion of validated CNVs (34-60%). Moreover, up to 20% of apparent family-based validations were found to be due to chance alone. Software using Hidden Markov models (HMM) showed a trend to predict fewer CNVs than segmentation-based algorithms albeit with greater validity. PennCNV yielded the highest prediction accuracy (60.9%). Finally, the pairwise concordance of CNV prediction was found to vary widely with the software tools involved. We recommend HMM-based software, in particular PennCNV, rather than segmentation-based algorithms when validity is the primary concern of CNV detection. QuantiSNP may be used as an additional tool to detect sets of CNVs not detectable by the other tools. Our study also reemphasizes the need for laboratory-based validation, such as qPCR, of CNVs predicted in silico.
Collapse
Affiliation(s)
- Marcel Elie Nutsua
- Institute of Clinical Molecular Biology, Christian-Albrechts University, Kiel, Germany
| | - Annegret Fischer
- Institute of Clinical Molecular Biology, Christian-Albrechts University, Kiel, Germany
| | - Almut Nebel
- Institute of Clinical Molecular Biology, Christian-Albrechts University, Kiel, Germany
| | - Sylvia Hofmann
- Institute of Clinical Molecular Biology, Christian-Albrechts University, Kiel, Germany
| | - Stefan Schreiber
- Institute of Clinical Molecular Biology, Christian-Albrechts University, Kiel, Germany
| | - Michael Krawczak
- Institute of Medical Informatics and Statistics, Christian-Albrechts University, Kiel, Germany
| | - Michael Nothnagel
- Institute of Medical Informatics and Statistics, Christian-Albrechts University, Kiel, Germany; Cologne Center for Genomics, University of Cologne, Cologne, Germany
| |
Collapse
|
15
|
Identifying Human Genome-Wide CNV, LOH and UPD by Targeted Sequencing of Selected Regions. PLoS One 2015; 10:e0123081. [PMID: 25919136 PMCID: PMC4412667 DOI: 10.1371/journal.pone.0123081] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2014] [Accepted: 02/27/2015] [Indexed: 01/03/2023] Open
Abstract
Copy-number variations (CNV), loss of heterozygosity (LOH), and uniparental disomy (UPD) are large genomic aberrations leading to many common inherited diseases, cancers, and other complex diseases. An integrated tool to identify these aberrations is essential in understanding diseases and in designing clinical interventions. Previous discovery methods based on whole-genome sequencing (WGS) require very high depth of coverage on the whole genome scale, and are cost-wise inefficient. Another approach, whole exome genome sequencing (WEGS), is limited to discovering variations within exons. Thus, we are lacking efficient methods to detect genomic aberrations on the whole genome scale using next-generation sequencing technology. Here we present a method to identify genome-wide CNV, LOH and UPD for the human genome via selectively sequencing a small portion of genome termed Selected Target Regions (SeTRs). In our experiments, the SeTRs are covered by 99.73%~99.95% with sufficient depth. Our developed bioinformatics pipeline calls genome-wide CNVs with high confidence, revealing 8 credible events of LOH and 3 UPD events larger than 5M from 15 individual samples. We demonstrate that genome-wide CNV, LOH and UPD can be detected using a cost-effective SeTRs sequencing approach, and that LOH and UPD can be identified using just a sample grouping technique, without using a matched sample or familial information.
Collapse
|
16
|
Wang W, Wang W, Sun W, Crowley JJ, Szatkiewicz JP. Allele-specific copy-number discovery from whole-genome and whole-exome sequencing. Nucleic Acids Res 2015; 43:e90. [PMID: 25883151 PMCID: PMC4538801 DOI: 10.1093/nar/gkv319] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2014] [Accepted: 03/27/2015] [Indexed: 11/14/2022] Open
Abstract
Copy-number variants (CNVs) are a major form of genetic variation and a risk factor for various human diseases, so it is crucial to accurately detect and characterize them. It is conceivable that allele-specific reads from high-throughput sequencing data could be leveraged to both enhance CNV detection and produce allele-specific copy number (ASCN) calls. Although statistical methods have been developed to detect CNVs using whole-genome sequence (WGS) and/or whole-exome sequence (WES) data, information from allele-specific read counts has not yet been adequately exploited. In this paper, we develop an integrated method, called AS-GENSENG, which incorporates allele-specific read counts in CNV detection and estimates ASCN using either WGS or WES data. To evaluate the performance of AS-GENSENG, we conducted extensive simulations, generated empirical data using existing WGS and WES data sets and validated predicted CNVs using an independent methodology. We conclude that AS-GENSENG not only predicts accurate ASCN calls but also improves the accuracy of total copy number calls, owing to its unique ability to exploit information from both total and allele-specific read counts while accounting for various experimental biases in sequence data. Our novel, user-friendly and computationally efficient method and a complete analytic protocol is freely available at https://sourceforge.net/projects/asgenseng/.
Collapse
Affiliation(s)
- WeiBo Wang
- Department of Computer Science, University of North Carolina at Chapel Hill, NC 27599-3175, USA
| | - Wei Wang
- Department of Computer Science, University of California, Los Angeles, CA 90095, USA
| | - Wei Sun
- Department of Biostatistics, University of North Carolina at Chapel Hill, NC 27599-7400, USA
| | - James J Crowley
- Department of Genetics, University of North Carolina at Chapel Hill, NC 27599-7264, USA
| | - Jin P Szatkiewicz
- Department of Genetics, University of North Carolina at Chapel Hill, NC 27599-7264, USA
| |
Collapse
|
17
|
Jiang Y, Oldridge DA, Diskin SJ, Zhang NR. CODEX: a normalization and copy number variation detection method for whole exome sequencing. Nucleic Acids Res 2015; 43:e39. [PMID: 25618849 PMCID: PMC4381046 DOI: 10.1093/nar/gku1363] [Citation(s) in RCA: 100] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2014] [Accepted: 12/19/2014] [Indexed: 01/24/2023] Open
Abstract
High-throughput sequencing of DNA coding regions has become a common way of assaying genomic variation in the study of human diseases. Copy number variation (CNV) is an important type of genomic variation, but detecting and characterizing CNV from exome sequencing is challenging due to the high level of biases and artifacts. We propose CODEX, a normalization and CNV calling procedure for whole exome sequencing data. The Poisson latent factor model in CODEX includes terms that specifically remove biases due to GC content, exon capture and amplification efficiency, and latent systemic artifacts. CODEX also includes a Poisson likelihood-based recursive segmentation procedure that explicitly models the count-based exome sequencing data. CODEX is compared to existing methods on a population analysis of HapMap samples from the 1000 Genomes Project, and shown to be more accurate on three microarray-based validation data sets. We further evaluate performance on 222 neuroblastoma samples with matched normals and focus on a well-studied rare somatic CNV within the ATRX gene. We show that the cross-sample normalization procedure of CODEX removes more noise than normalizing the tumor against the matched normal and that the segmentation procedure performs well in detecting CNVs with nested structures.
Collapse
Affiliation(s)
- Yuchao Jiang
- Genomics and Computational Biology Graduate Program, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Derek A Oldridge
- Medical Scientist Training Program, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA Division of Oncology and Center for Childhood Cancer Research, The Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Sharon J Diskin
- Division of Oncology and Center for Childhood Cancer Research, The Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA Abramson Family Cancer Research Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Nancy R Zhang
- Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
18
|
Bellos E, Coin LJM. cnvOffSeq: detecting intergenic copy number variation using off-target exome sequencing data. ACTA ACUST UNITED AC 2015; 30:i639-45. [PMID: 25161258 PMCID: PMC4147927 DOI: 10.1093/bioinformatics/btu475] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
MOTIVATION Exome sequencing technologies have transformed the field of Mendelian genetics and allowed for efficient detection of genomic variants in protein-coding regions. The target enrichment process that is intrinsic to exome sequencing is inherently imperfect, generating large amounts of unintended off-target sequence. Off-target data are characterized by very low and highly heterogeneous coverage and are usually discarded by exome analysis pipelines. We posit that off-target read depth is a rich, but overlooked, source of information that could be mined to detect intergenic copy number variation (CNV). We propose cnvOffseq, a novel normalization framework for off-target read depth that is based on local adaptive singular value decomposition (SVD). This method is designed to address the heterogeneity of the underlying data and allows for accurate and precise CNV detection and genotyping in off-target regions. RESULTS cnvOffSeq was benchmarked on whole-exome sequencing samples from the 1000 Genomes Project. In a set of 104 gold standard intergenic deletions, our method achieved a sensitivity of 57.5% and a specificity of 99.2%, while maintaining a low FDR of 5%. For gold standard deletions longer than 5 kb, cnvOffSeq achieves a sensitivity of 90.4% without increasing the FDR. cnvOffSeq outperforms both whole-genome and whole-exome CNV detection methods considerably and is shown to offer a substantial improvement over naïve local SVD. AVAILABILITY AND IMPLEMENTATION cnvOffSeq is available at http://sourceforge.net/p/cnvoffseq/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Evangelos Bellos
- Department of Genomics of Common Disease, Imperial College London, London W12 0NN, UK and Institute for Molecular Bioscience, University of Queensland, St Lucia, QLD 4072, Australia
| | - Lachlan J M Coin
- Department of Genomics of Common Disease, Imperial College London, London W12 0NN, UK and Institute for Molecular Bioscience, University of Queensland, St Lucia, QLD 4072, Australia Department of Genomics of Common Disease, Imperial College London, London W12 0NN, UK and Institute for Molecular Bioscience, University of Queensland, St Lucia, QLD 4072, Australia
| |
Collapse
|
19
|
Association of the late cornified envelope-3 genes with psoriasis and psoriatic arthritis: a systematic review. J Genet Genomics 2015; 42:49-56. [PMID: 25697099 DOI: 10.1016/j.jgg.2015.01.001] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2014] [Revised: 12/28/2014] [Accepted: 01/04/2015] [Indexed: 01/10/2023]
Abstract
Psoriasis (Ps) and psoriatic arthritis (PsA) are genetically complex diseases with strong genetic evidence. Recently, susceptibility genes for Ps and PsA have been identified within the late cornified envelop (LCE) gene cluster, especially the cluster 3 (LCE3) genes. It is noteworthy that the deletion of LCE3B and LCE3C (LCE3C_LCE3B-del) is significantly associated with these two diseases. Gene-gene interactions between LCE3 genes and other genes are associated with Ps and PsA. LCE3 genes also have pleiotropic effect on some autoimmune diseases, such as rheumatoid arthritis, atopic dermatitis and systemic lupus erythematosus. Further studies need to focus on the potential function of LCE3 genes in the pathogenesis of Ps and PsA in the future.
Collapse
|
20
|
Kadalayil L, Rafiq S, Rose-Zerilli MJJ, Pengelly RJ, Parker H, Oscier D, Strefford JC, Tapper WJ, Gibson J, Ennis S, Collins A. Exome sequence read depth methods for identifying copy number changes. Brief Bioinform 2014; 16:380-92. [PMID: 25169955 DOI: 10.1093/bib/bbu027] [Citation(s) in RCA: 65] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2014] [Accepted: 07/10/2014] [Indexed: 01/04/2023] Open
Abstract
Copy number variants (CNVs) play important roles in a number of human diseases and in pharmacogenetics. Powerful methods exist for CNV detection in whole genome sequencing (WGS) data, but such data are costly to obtain. Many disease causal CNVs span or are found in genome coding regions (exons), which makes CNV detection using whole exome sequencing (WES) data attractive. If reliably validated against WGS-based CNVs, exome-derived CNVs have potential applications in a clinical setting. Several algorithms have been developed to exploit exome data for CNV detection and comparisons made to find the most suitable methods for particular data samples. The results are not consistent across studies. Here, we review some of the exome CNV detection methods based on depth of coverage profiles and examine their performance to identify problems contributing to discrepancies in published results. We also present a streamlined strategy that uses a single metric, the likelihood ratio, to compare exome methods, and we demonstrated its utility using the VarScan 2 and eXome Hidden Markov Model (XHMM) programs using paired normal and tumour exome data from chronic lymphocytic leukaemia patients. We use array-based somatic CNV (SCNV) calls as a reference standard to compute prevalence-independent statistics, such as sensitivity, specificity and likelihood ratio, for validation of the exome-derived SCNVs. We also account for factors known to influence the performance of exome read depth methods, such as CNV size and frequency, while comparing our findings with published results.
Collapse
|
21
|
Tan R, Wang Y, Kleinstein SE, Liu Y, Zhu X, Guo H, Jiang Q, Allen AS, Zhu M. An evaluation of copy number variation detection tools from whole-exome sequencing data. Hum Mutat 2014; 35:899-907. [PMID: 24599517 DOI: 10.1002/humu.22537] [Citation(s) in RCA: 151] [Impact Index Per Article: 15.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2013] [Accepted: 02/21/2014] [Indexed: 01/11/2023]
Abstract
Copy number variation (CNV) has been found to play an important role in human disease. Next-generation sequencing technology, including whole-genome sequencing (WGS) and whole-exome sequencing (WES), has become a primary strategy for studying the genetic basis of human disease. Several CNV calling tools have recently been developed on the basis of WES data. However, the comparative performance of these tools using real data remains unclear. An objective evaluation study of these tools in practical research situations would be beneficial. Here, we evaluated four well-known WES-based CNV detection tools (XHMM, CoNIFER, ExomeDepth, and CONTRA) using real data generated in house. After evaluation using six metrics, we found that the sensitive and accurate detection of CNVs in WES data remains challenging despite the many algorithms available. Each algorithm has its own strengths and weaknesses. None of the exome-based CNV calling methods performed well in all situations; in particular, compared with CNVs identified from high coverage WGS data from the same samples, all tools suffered from limited power. Our evaluation provides a comprehensive and objective comparison of several well-known detection tools designed for WES data, which will assist researchers in choosing the most suitable tools for their research needs.
Collapse
Affiliation(s)
- Renjie Tan
- Center for Biomedical Informatics, School of Computer Science and Technology, Harbin Institute Technology, Harbin, Heilongjiang, China; Center for Human Genome Variation, Duke University School of Medicine, Durham, North Carolina
| | | | | | | | | | | | | | | | | |
Collapse
|
22
|
Lango Allen H, Caswell R, Xie W, Xu X, Wragg C, Turnpenny PD, Turner CLS, Weedon MN, Ellard S. Next generation sequencing of chromosomal rearrangements in patients with split-hand/split-foot malformation provides evidence for DYNC1I1 exonic enhancers of DLX5/6 expression in humans. J Med Genet 2014; 51:264-7. [PMID: 24459211 PMCID: PMC3963551 DOI: 10.1136/jmedgenet-2013-102142] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
Objective Split-hand/foot malformation type 1 is an autosomal dominant condition with reduced penetrance and variable expression. We report three individuals from two families with split-hand/split-foot malformation (SHFM) in whom next generation sequencing was performed to investigate the cause of their phenotype. Methods and results The first proband has a de novo balanced translocation t(2;7)(p25.1;q22) identified by karyotyping. Whole genome sequencing showed that the chromosome 7 breakpoint is situated within the SHFM1 locus on chromosome 7q21.3. This separates the DYNC1I1 exons recently identified as limb enhancers in mouse studies from their target genes, DLX5 and DLX6. In the second family, X-linked recessive inheritance was suspected and exome sequencing was performed to search for a mutation in the affected proband and his uncle. No coding mutation was found within the SHFM2 locus at Xq26 or elsewhere in the exome, but a 106 kb deletion within the SHFM1 locus was detected through copy number analysis. Genome sequencing of the deletion breakpoints showed that the DLX5 and DLX6 genes are disomic but the putative DYNC1I1 exon 15 and 17 enhancers are deleted. Conclusions Exome sequencing identified a 106 kb deletion that narrows the SHFM1 critical region from 0.9 to 0.1 Mb and confirms a key role of DYNC1I1 exonic enhancers in normal limb formation in humans.
Collapse
Affiliation(s)
- Hana Lango Allen
- Institute of Biomedical and Clinical Science, University of Exeter Medical School, Exeter, UK
| | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Zhao M, Wang Q, Wang Q, Jia P, Zhao Z. Computational tools for copy number variation (CNV) detection using next-generation sequencing data: features and perspectives. BMC Bioinformatics 2013; 14 Suppl 11:S1. [PMID: 24564169 PMCID: PMC3846878 DOI: 10.1186/1471-2105-14-s11-s1] [Citation(s) in RCA: 331] [Impact Index Per Article: 30.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
Copy number variation (CNV) is a prevalent form of critical genetic variation that leads to an abnormal number of copies of large genomic regions in a cell. Microarray-based comparative genome hybridization (arrayCGH) or genotyping arrays have been standard technologies to detect large regions subject to copy number changes in genomes until most recently high-resolution sequence data can be analyzed by next-generation sequencing (NGS). During the last several years, NGS-based analysis has been widely applied to identify CNVs in both healthy and diseased individuals. Correspondingly, the strong demand for NGS-based CNV analyses has fuelled development of numerous computational methods and tools for CNV detection. In this article, we review the recent advances in computational methods pertaining to CNV detection using whole genome and whole exome sequencing data. Additionally, we discuss their strengths and weaknesses and suggest directions for future development.
Collapse
|