1
|
Genome-Wide Association Study of Susceptibility Loci for Radiation-Induced Brain Injury. J Natl Cancer Inst 2019; 111:620-628. [PMID: 30299488 PMCID: PMC6579742 DOI: 10.1093/jnci/djy150] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2018] [Revised: 05/24/2018] [Accepted: 07/29/2018] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Radiation-induced brain injury is a nonnegligible issue in the management of cancer patients treated by partial or whole brain irradiation. In particular, temporal lobe injury (TLI), a deleterious late complication in nasopharyngeal carcinoma, greatly affects the long-term life quality of these patients. Although genome-wide association studies (GWASs) have successfully identified single nucleotide polymorphisms (SNPs) associated with radiation toxicity, genetic variants contributing to the radiation-induced brain injury have not yet been assessed. METHODS We recruited and performed follow-up for a prospective observational cohort, Genetic Architecture of Radiotherapy Toxicity and Prognosis, using magnetic resonance imaging for TLI diagnosis. We conducted genome-wide association analysis in 1082 patients and validated the top associations in two independent cohorts of 1119 and 741 patients, respectively. All statistical tests were two-sided. RESULTS We identified a promoter variant rs17111237 (A > G, minor allele frequency [MAF] = 0.14) in CEP128 associated with TLI risk (hazard ratio = 1.45, 95% confidence interval = 1.26 to 1.66, Pcombined=3.18 × 10-7) which is in moderate linkage disequilibrium (LD) with rs162171 (MAF = 0.18, R2 = 0.69), the top signal in CEP128 (hazard ratio = 1.46, 95% confidence interval = 1.29-1.66, Pcombined= 6.17 × 10-9). Combining the clinical variables with the top SNP, we divided the patients into different subgroups with varying risk with 5-year TLI-free rates ranging from 33.7% to 95.5%. CEP128, a key component of mother centriole, tightly interacts with multiple radiation-resistant genes and plays an important role in maintaining the functional cilia, which otherwise will lead to a malfunction of the neural network. We found that A > G alteration at rs17111237 impaired the promoter activity of CEP128 and knockdown of CEP128 decreased the clonogenic cell survival of U87 cells under radiation. Noteworthy, 12.7% (27/212) of the GWAS-based associated genes (P < .001) were enriched in the neurogenesis pathway. CONCLUSIONS This three-stage study is the first GWAS of radiation-induced brain injury that implicates the genetic susceptibility gene CEP128 involved in TLI development and provides the novel insight into the underlying mechanisms of radiation-induced brain injury.
Collapse
|
2
|
Robust Reference Powered Association Test of Genome-Wide Association Studies. Front Genet 2019; 10:319. [PMID: 31024629 PMCID: PMC6465778 DOI: 10.3389/fgene.2019.00319] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2019] [Accepted: 03/21/2019] [Indexed: 12/28/2022] Open
Abstract
Genome-wide association studies (GWASs) have identified abundant genetic susceptibility loci, GWAS of small sample size are far less from meeting the previous expectations due to low statistical power and false positive results. Effective statistical methods are required to further improve the analyses of massive GWAS data. Here we presented a new statistic (Robust Reference Powered Association Test) to use large public database (gnomad) as reference to reduce concern of potential population stratification. To evaluate the performance of this statistic for various situations, we simulated multiple sets of sample size and frequencies to compute statistical power. Furthermore, we applied our method to several real datasets (psoriasis genome-wide association datasets and schizophrenia genome-wide association dataset) to evaluate the performance. Careful analyses indicated that our newly developed statistic outperformed several previously developed GWAS applications. Importantly, this statistic is more robust than naive merging method in the presence of small control-reference differentiation, therefore likely to detect more association signals.
Collapse
|
3
|
Exploratory genome-wide association analysis of response to ketamine and a polygenic analysis of response to scopolamine in depression. Transl Psychiatry 2018; 8:280. [PMID: 30552317 PMCID: PMC6294748 DOI: 10.1038/s41398-018-0311-7] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/27/2018] [Revised: 07/30/2018] [Accepted: 09/07/2018] [Indexed: 12/13/2022] Open
Abstract
Growing evidence suggests that the glutamatergic modulator ketamine has rapid antidepressant effects in treatment-resistant depressed subjects. The anticholinergic agent scopolamine has also shown promise as a rapid-acting antidepressant. This study applied genome-wide markers to investigate the role of genetic variants in predicting acute antidepressant response to both agents. The ketamine-treated sample included 157 unrelated European subjects with major depressive disorder (MDD) or bipolar disorder (BD). The scopolamine-treated sample comprised 37 unrelated European subjects diagnosed with either MDD or BD who had a current Major Depressive Episode (MDE), and had failed at least two adequate treatment trials for depression. Change in Montgomery-Asberg Depression Rating Scale (MADRS) or the 17-item Hamilton Depression Rating Scale (HAM-D) scale scores at day 1 (24 h post-treatment) was considered the primary outcome. Here, we conduct pilot genome-wide association study (GWAS) analyses to identify potential markers of ketamine response and dissociative side effects. Polygenic risk score analysis of SNPs ranked by the strength of their association with ketamine response was then calculated in order to assess whether common genetic markers from the ketamine study could predict response to scopolamine. Findings require replication in larger samples in light of low power of analyses of these small samples. Neverthless, these data provide a promising illustration of our future potential to identify genetic variants underlying rapid treatment response in mood disorders and may ultimately guide individual patient treatment selection in the future.
Collapse
|
4
|
knnAUC: an open-source R package for detecting nonlinear dependence between one continuous variable and one binary variable. BMC Bioinformatics 2018; 19:448. [PMID: 30466390 PMCID: PMC6249767 DOI: 10.1186/s12859-018-2427-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2018] [Accepted: 10/10/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Testing the dependence of two variables is one of the fundamental tasks in statistics. In this work, we developed an open-source R package (knnAUC) for detecting nonlinear dependence between one continuous variable X and one binary dependent variables Y (0 or 1). RESULTS We addressed this problem by using knnAUC (k-nearest neighbors AUC test, the R package is available at https://sourceforge.net/projects/knnauc/ ). In the knnAUC software framework, we first resampled a dataset to get the training and testing dataset according to the sample ratio (from 0 to 1), and then constructed a k-nearest neighbors algorithm classifier to get the yhat estimator (the probability of y = 1) of testy (the true label of testing dataset). Finally, we calculated the AUC (area under the curve of receiver operating characteristic) estimator and tested whether the AUC estimator is greater than 0.5. To evaluate the advantages of knnAUC compared to seven other popular methods, we performed extensive simulations to explore the relationships between eight different methods and compared the false positive rates and statistical power using both simulated and real datasets (Chronic hepatitis B datasets and kidney cancer RNA-seq datasets). CONCLUSIONS We concluded that knnAUC is an efficient R package to test non-linear dependence between one continuous variable and one binary dependent variable especially in computational biology area.
Collapse
|
5
|
Decreased Gray Matter Volume of Cuneus and Lingual Gyrus in Schizophrenia Patients with Tardive Dyskinesia is Associated with Abnormal Involuntary Movement. Sci Rep 2018; 8:12884. [PMID: 30150749 PMCID: PMC6110787 DOI: 10.1038/s41598-018-31186-y] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2017] [Accepted: 08/06/2018] [Indexed: 12/29/2022] Open
Abstract
Tardive dyskinesia (TD) is a devastating motor disorder associated with the etiological process of schizophrenia or antipsychotic medication treatments. To examine whether cerebral morphological changes may manifest in TD, we used voxel-based morphometry to analyze high-resolution T1-weighted brain structural magnetic resonance images from 32 schizophrenics with TD (TD group), 31 schizophrenics without TD (non-TD group), and 32 healthy controls (HC group). We also assessed psychopathological symptoms with the Positive and Negative Syndrome Scale (PANSS), and TD severity with the Abnormal Involuntary Movement Scale (AIMS). We compared gray matter volumes (GMVs) among groups, and tested for correlations between GMV changes and psychopathological symptoms or TD severity. The results showed significant differences in GMV in the frontal and temporal cortices, insula and cerebellum among the three groups. Brainstem and inferior frontal and precentral gyri GMVs were significantly larger, whereas cuneus and lingual gyrus GMVs were significantly smaller in the TD group as compared to non-TD group. Further, the cuneus and lingual gyrus GMVs were positively correlated with AIMS scores in the TD group. The current results suggest that TD may be associated with the alterations in GMV that are different from that of schizophrenics without TD. Further studies are needed to confirm and to examine the functional significance of these structural findings.
Collapse
|
6
|
Nuclear Norm Clustering: a promising alternative method for clustering tasks. Sci Rep 2018; 8:10873. [PMID: 30022093 PMCID: PMC6052164 DOI: 10.1038/s41598-018-29246-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2017] [Accepted: 07/02/2018] [Indexed: 11/09/2022] Open
Abstract
Clustering techniques are widely used in many applications. The goal of clustering is to identify patterns or groups of similar objects within a dataset of interest. However, many cluster methods are neither robust nor sensitive to noises and outliers in real data. In this paper, we present Nuclear Norm Clustering (NNC, available at https://sourceforge.net/projects/nnc/), an algorithm that can be used in various fields as a promising alternative to the k-means clustering method. The NNC algorithm requires users to provide a data matrix M and a desired number of cluster K. We employed simulated annealing techniques to choose an optimal label vector that minimizes nuclear norm of the pooled within cluster residual matrix. To evaluate the performance of the NNC algorithm, we compared the performance of both 15 public datasets and 2 genome-wide association studies (GWAS) on psoriasis, comparing our method with other classic methods. The results indicate that NNC method has a competitive performance in terms of F-score on 15 benchmarked public datasets and 2 psoriasis GWAS datasets. So NNC is a promising alternative method for clustering tasks.
Collapse
|
7
|
Bagging Nearest-Neighbor Prediction independence Test: an efficient method for nonlinear dependence of two continuous variables. Sci Rep 2017; 7:12736. [PMID: 28986523 PMCID: PMC5630623 DOI: 10.1038/s41598-017-12783-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2017] [Accepted: 09/15/2017] [Indexed: 12/03/2022] Open
Abstract
Testing dependence/correlation of two variables is one of the fundamental tasks in statistics. In this work, we proposed an efficient method for nonlinear dependence of two continuous variables (X and Y). We addressed this research question by using BNNPT (Bagging Nearest-Neighbor Prediction independence Test, software available at https://sourceforge.net/projects/bnnpt/). In the BNNPT framework, we first used the value of X to construct a bagging neighborhood structure. We then obtained the out of bag estimator of Y based on the bagging neighborhood structure. The square error was calculated to measure how well Y is predicted by X. Finally, a permutation test was applied to determine the significance of the observed square error. To evaluate the strength of BNNPT compared to seven other methods, we performed extensive simulations to explore the relationship between various methods and compared the false positive rates and statistical power using both simulated and real datasets (Rugao longevity cohort mitochondrial DNA haplogroups and kidney cancer RNA-seq datasets). We concluded that BNNPT is an efficient computational approach to test nonlinear correlation in real world applications.
Collapse
|
8
|
An Integrative Computational Approach to Evaluate Genetic Markers for Bipolar Disorder. Sci Rep 2017; 7:6745. [PMID: 28751646 PMCID: PMC5532256 DOI: 10.1038/s41598-017-05846-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2016] [Accepted: 06/05/2017] [Indexed: 01/08/2023] Open
Abstract
Studies to date have reported hundreds of genes connected to bipolar disorder (BP). However, many studies identifying candidate genes have lacked replication, and their results have, at times, been inconsistent with one another. This paper, therefore, offers a computational workflow that can curate and evaluate BP-related genetic data. Our method integrated large-scale literature data and gene expression data that were acquired from both postmortem human brain regions (BP case/control: 45/50) and peripheral blood mononuclear cells (BP case/control: 193/593). To assess the pathogenic profiles of candidate genes, we conducted Pathway Enrichment, Sub-Network Enrichment, and Gene-Gene Interaction analyses, with 4 metrics proposed and validated for each gene. Our approach developed a scalable BP genetic database (BP_GD), including BP related genes, drugs, pathways, diseases and supporting references. The 4 metrics successfully identified frequently-studied BP genes (e.g. GRIN2A, DRD1, DRD2, HTR2A, CACNA1C, TH, BDNF, SLC6A3, P2RX7, DRD3, and DRD4) and also highlighted several recently reported BP genes (e.g. GRIK5, GRM1 and CACNA1A). The computational biology approach and the BP database developed in this study could contribute to a better understanding of the current stage of BP genetic research and assist further studies in the field.
Collapse
|
9
|
Whole-genome sequencing of monozygotic twins discordant for schizophrenia indicates multiple genetic risk factors for schizophrenia. J Genet Genomics 2017. [PMID: 28645778 DOI: 10.1016/j.jgg.2017.05.005] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Schizophrenia is a common disorder with a high heritability, but its genetic architecture is still elusive. We implemented whole-genome sequencing (WGS) analysis of 8 families with monozygotic (MZ) twin pairs discordant for schizophrenia to assess potential association of de novo mutations (DNMs) or inherited variants with susceptibility to schizophrenia. Eight non-synonymous DNMs (including one splicing site) were identified and shared by twins, which were either located in previously reported schizophrenia risk genes (p.V24689I mutation in TTN, p.S2506T mutation in GCN1L1, IVS3+1G > T in DOCK1) or had a benign to damaging effect according to in silico prediction analysis. By searching the inherited rare damaging or loss-of-function (LOF) variants and common susceptible alleles from three classes of schizophrenia candidate genes, we were able to distill genetic alterations in several schizophrenia risk genes, including GAD1, PLXNA2, RELN and FEZ1. Four inherited copy number variations (CNVs; including a large deletion at 16p13.11) implicated for schizophrenia were identified in four families, respectively. Most of families carried both missense DNMs and inherited risk variants, which might suggest that DNMs, inherited rare damaging variants and common risk alleles together conferred to schizophrenia susceptibility. Our results support that schizophrenia is caused by a combination of multiple genetic factors, with each DNM/variant showing a relatively small effect size.
Collapse
|
10
|
Genome Wide Association Study (GWAS) between Attention Deficit Hyperactivity Disorder (ADHD) and Obsessive Compulsive Disorder (OCD). Front Mol Neurosci 2017; 10:83. [PMID: 28386217 PMCID: PMC5362635 DOI: 10.3389/fnmol.2017.00083] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2016] [Accepted: 03/09/2017] [Indexed: 11/13/2022] Open
Abstract
Objective: The aim of this study was to identify any potential genetic overlap between attention deficit hyperactivity disorder (ADHD) and obsessive compulsive disorder (OCD). We hypothesized that since these disorders share a sub-phenotype, they may share common risk alleles. In this manuscript, we report the overlap found between these two disorders. Methods: A meta-analysis was conducted between ADHD and OCD, and polygenic risk scores (PRS) were calculated for both disorders. In addition, a protein-protein analysis was completed in order to examine the interactions between proteins; p-values for the protein-protein interaction analysis was calculated using permutation. Conclusion: None of the single nucleotide polymorphisms (SNPs) reached genome wide significance and there was little evidence of genetic overlap between ADHD and OCD.
Collapse
|
11
|
ADAR2 functions as a tumor suppressor via editing IGFBP7 in esophageal squamous cell carcinoma. Int J Oncol 2016; 50:622-630. [PMID: 28035363 DOI: 10.3892/ijo.2016.3823] [Citation(s) in RCA: 58] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2016] [Accepted: 11/28/2016] [Indexed: 01/07/2023] Open
Abstract
Esophageal squamous cell carcinoma (ESCC), one of the most aggressive cancers, is characterized by heterogeneous genetic and epigenetic changes. Recently, A-to-I RNA editing, catalyzed by adenosine deaminases acting on RNA (ADARs), was found to be aberrantly regulated during tumorigenesis. We previously reported that ADAR2 was downregulated in ESCC but its role was unclear. Thus, we report here that overexpression of ADAR2 can induce apoptosis in ESCC cell lines and inhibit tumor growth in vitro and in vivo. ADAR2 knockdown inhibited apoptosis in ADAR2 highly expressing tumor cells. RNA-seq assay showed that ADAR2, not ADAR1 or active-site-mutated ADAR2, could edit insulin-like growth factor binding protein 7 (IGFBP7) mRNA in ESCC. IGFBP7 knockdown or ADAR2 catalytic activity destruction abolished the pro-apoptotic function of ADAR2. Mechanistically, RNA editing may stabilize IGFBP7 protein by changing the protease recognition site of matriptase and this is essential for IGFBP7 to induce apoptosis. Western blotting revealed that ADAR2 overexpression could induce IGFBP7-dependent inhibition of Akt signaling. Thus, our data indicate that ADAR2 suppresses tumor growth and induces apoptosis by editing and stabilizing IGFBP7 in ESCC, and this may represent a novel therapeutic target for treating ESCC.
Collapse
|
12
|
|
13
|
Abstract
BACKGROUND Transcriptional factors (TFs) and microRNAs (miRNAs) have been recognized as 2 classes of principal gene regulators that may be responsible for genome coexpression changes observed in schizophrenia (SZ). METHODS This study aims to (1) identify differentially coexpressed genes (DCGs) in 3 mRNA expression microarray datasets; (2) explore potential interactions among the DCGs, and differentially expressed miRNAs identified in our dataset composed of early-onset SZ patients and healthy controls; (3) validate expression levels of some key transcripts; and (4) explore the druggability of DCGs using the curated database. RESULTS We detected a differential coexpression network associated with SZ and found that 9 out of the 12 regulators were replicated in either of the 2 other datasets. Leveraging the differentially expressed miRNAs identified in our previous dataset, we constructed a miRNA-TF-gene network relevant to SZ, including an EGR1-miR-124-3p-SKIL feed-forward loop. Our real-time quantitative PCR analysis indicated the overexpression of miR-124-3p, the under expression of SKIL and EGR1 in the blood of SZ patients compared with controls, and the direction of change of miR-124-3p and SKIL mRNA levels in SZ cases were reversed after a 12-week treatment cycle. Our druggability analysis revealed that many of these genes have the potential to be drug targets. CONCLUSIONS Together, our results suggest that coexpression network abnormalities driven by combinatorial and interactive action from TFs and miRNAs may contribute to the development of SZ and be relevant to the clinical treatment of the disease.
Collapse
|
14
|
Genomic Characterization of Esophageal Squamous Cell Carcinoma Reveals Critical Genes Underlying Tumorigenesis and Poor Prognosis. Am J Hum Genet 2016; 98:709-27. [PMID: 27058444 DOI: 10.1016/j.ajhg.2016.02.021] [Citation(s) in RCA: 108] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2015] [Accepted: 02/24/2016] [Indexed: 12/17/2022] Open
Abstract
The genetic mechanisms underlying the poor prognosis of esophageal squamous cell carcinoma (ESCC) are not well understood. Here, we report somatic mutations found in ESCC from sequencing 10 whole-genome and 57 whole-exome matched tumor-normal sample pairs. Among the identified genes, we characterized mutations in VANGL1 and showed that they accelerated cell growth in vitro. We also found that five other genes, including three coding genes (SHANK2, MYBL2, FADD) and two non-coding genes (miR-4707-5p, PCAT1), were involved in somatic copy-number alterations (SCNAs) or structural variants (SVs). A survival analysis based on the expression profiles of 321 individuals with ESCC indicated that these genes were significantly associated with poorer survival. Subsequently, we performed functional studies, which showed that miR-4707-5p and MYBL2 promoted proliferation and metastasis. Together, our results shed light on somatic mutations and genomic events that contribute to ESCC tumorigenesis and prognosis and might suggest therapeutic targets.
Collapse
|
15
|
No association between ZNF804A rs1344706 and schizophrenia in a case-control study of Han Chinese. Neurosci Lett 2016; 618:14-18. [DOI: 10.1016/j.neulet.2016.02.048] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2015] [Revised: 02/17/2016] [Accepted: 02/25/2016] [Indexed: 01/21/2023]
|
16
|
Evidence for contribution of common genetic variants within chromosome 8p21.2-8p21.1 to restricted and repetitive behaviors in autism spectrum disorders. BMC Genomics 2016; 17:163. [PMID: 26931105 PMCID: PMC4774106 DOI: 10.1186/s12864-016-2475-y] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2015] [Accepted: 02/15/2016] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND Restricted and Repetitive Behaviors (RRB), one of the core symptom categories for Autism Spectrum Disorders (ASD), comprises heterogeneous groups of behaviors. Previous research indicates that there are two or more factors (subcategories) within the RRB domain. In an effort to identify common variants associated with RRB, we have carried out a genome-wide association study (GWAS) using the Autism Genetic Resource Exchange (AGRE) dataset (n = 1,335, all ASD probands of European ancestry) for each identified RRB subcategory, while allowing for comparisons of associated single nucleotide polymorphisms (SNPs) with associated SNPs in the same set of probands analyzed using all the RRB subcategories as phenotypes in a multivariate linear mixed model. The top ranked SNPs were then explored in an independent dataset. RESULTS Using principal component analysis of item scores obtained from Autism Diagnostic Interview-Revised (ADI-R), two distinct subcategories within Restricted and Repetitive Behaviors were identified: Repetitive Sensory Motor (RSM) and Insistence on Sameness (IS). Quantitative RSM and IS scores were subsequently used as phenotypes in a GWAS using the AGRE ASD cohort. Although no associated SNPs with genome-wide significance (P < 5.0E-08) were detected when RSM or IS were analyzed independently, three SNPs approached genome-wide significance when RSM and IS were considered together using multivariate association analysis. These included the top IS-associated SNP, rs62503729 (P-value = 6.48E-08), which is located within chromosome 8p21.2-8p21.1, a locus previously linked to schizophrenia. Notably, all of the most significantly associated SNPs are located in close proximity to STMN4 and PTK2B, genes previously shown to function in neuron development. In addition, several of the top-ranked SNPs showed correlations with STMN4 mRNA expression in adult CEU (Caucasian and European descent) human prefrontal cortex. However, the association signals within chromosome 8p21.2-8p21.1 failed to replicate in an independent sample of 2,588 ASD probands; the insufficient sample size and between-study heterogeneity are possible explanations for the non-replication. CONCLUSIONS Our analysis indicates that RRB in ASD can be represented by two distinct subcategories: RSM and IS. Subsequent univariate and multivariate genome-wide association studies of these RRB subcategories enabled the detection of associated SNPs at 8p21.2-8p21.1. Although these results did not replicate in an independent ASD dataset, genomic features of this region and pathway analysis suggest that common variants in 8p21.2-8p21.1 may contribute to RRB, particularly IS. Together, these observations warrant future studies to elucidate the possible contributions of common variants in 8p21.2-8p21.1 to the etiology of RSM and IS in ASD.
Collapse
|
17
|
Whole-genome association analysis of treatment response in obsessive-compulsive disorder. Mol Psychiatry 2016; 21:270-6. [PMID: 25824302 PMCID: PMC5027902 DOI: 10.1038/mp.2015.32] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/12/2014] [Revised: 01/16/2015] [Accepted: 02/11/2015] [Indexed: 12/16/2022]
Abstract
Up to 30% of patients with obsessive-compulsive disorder (OCD) exhibit an inadequate response to serotonin reuptake inhibitors (SRIs). To date, genetic predictors of OCD treatment response have not been systematically investigated using genome-wide association study (GWAS). To identify specific genetic variations potentially influencing SRI response, we conducted a GWAS study in 804 OCD patients with information on SRI response. SRI response was classified as 'response' (n=514) or 'non-response' (n=290), based on self-report. We used the more powerful Quasi-Likelihood Score Test (the MQLS test) to conduct a genome-wide association test correcting for relatedness, and then used an adjusted logistic model to evaluate the effect size of the variants in probands. The top single-nucleotide polymorphism (SNP) was rs17162912 (P=1.76 × 10(-8)), which is near the DISP1 gene on 1q41-q42, a microdeletion region implicated in neurological development. The other six SNPs showing suggestive evidence of association (P<10(-5)) were rs9303380, rs12437601, rs16988159, rs7676822, rs1911877 and rs723815. Among them, two SNPs in strong linkage disequilibrium, rs7676822 and rs1911877, located near the PCDH10 gene, gave P-values of 2.86 × 10(-6) and 8.41 × 10(-6), respectively. The other 35 variations with signals of potential significance (P<10(-4)) involve multiple genes expressed in the brain, including GRIN2B, PCDH10 and GPC6. Our enrichment analysis indicated suggestive roles of genes in the glutamatergic neurotransmission system (false discovery rate (FDR)=0.0097) and the serotonergic system (FDR=0.0213). Although the results presented may provide new insights into genetic mechanisms underlying treatment response in OCD, studies with larger sample sizes and detailed information on drug dosage and treatment duration are needed.
Collapse
|
18
|
OBSESSIVE-COMPULSIVE PERSONALITY DISORDER: EVIDENCE FOR TWO DIMENSIONS. Depress Anxiety 2016; 33:128-35. [PMID: 26594839 DOI: 10.1002/da.22452] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/22/2015] [Revised: 09/23/2015] [Accepted: 10/18/2015] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND To determine possible dimensions that underlie obsessive-compulsive personality disorder (OCPD) and to investigate their clinical correlates, familiality, and genetic linkage. METHODS Participants were selected from 844 adults assessed with the Structured Instrument for the Diagnosis of DSM-IV Personality Disorders (SIDP) in the OCD Collaborative Genetics Study (OCGS) that targeted families with obsessive-compulsive disorder (OCD) affected sibling pairs. We conducted an exploratory factor analysis, which included the eight SIDP-derived DSM-IV OCPD traits and the indecision trait from the DSM-III, assessed clinical correlates, and estimated sib-sib correlations to evaluate familiality of the factors. Using MERLIN and MINX, we performed genome-wide quantitative trait locus (QTL) linkage analysis to test for allele sharing among individuals. RESULTS Two factors were identified: Factor 1: order/control (perfectionism, excessive devotion to work, overconscientiousness, reluctance to delegate, and rigidity); and Factor 2: hoarding/indecision (inability to discard and indecisiveness). Factor 1 score was associated with poor insight, whereas Factor 2 score was associated with task incompletion. A significant sib-sib correlation was found for Factor 2 (rICC = .354, P < .0001) but not Factor 1 (rICC = .129, P = .084). The linkage findings were different for the two factors. When Factor 2 was analyzed as a quantitative trait, a strong signal was detected on chromosome 10 at marker d10s1221: KAC LOD = 2.83, P = .0002; and marker d10s1225: KAC LOD = 1.35, P = .006. CONCLUSIONS The results indicate two factors of OCPD, order/control and hoarding/indecision. The hoarding/indecision factor is familial and shows modest linkage to a region on chromosome 10.
Collapse
|
19
|
Common polygenic variation and risk for childhood-onset schizophrenia. Mol Psychiatry 2016; 21:94-6. [PMID: 25510512 DOI: 10.1038/mp.2014.158] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/18/2014] [Revised: 10/02/2014] [Accepted: 10/09/2014] [Indexed: 12/16/2022]
Abstract
Childhood-onset schizophrenia (COS) is a rare and severe form of the disorder, with more striking abnormalities with respect to prepsychotic developmental disorders and abnormities in the brain development compared with later-onset schizophrenia. We previously documented that COS patients, compared with their healthy siblings and with adult-onset patients (AOS), carry significantly more rare chromosomal copy number variations, spanning large genomic regions (>100 kb) (Ahn et al. 2014). Here, we interrogated the contribution of common polygenic variation to the genetic susceptibility for schizophrenia. We examined the association between a direct measure of genetic risk of schizophrenia in 130 COS probands and 103 healthy siblings. Using data from the schizophrenia and autism GWAS of the Psychiatric Genomic Consortia, we selected three risk-related sets of single nucleotide polymorphisms from which we conducted polygenic risk score comparisons for COS probands and their healthy siblings. COS probands had higher genetic risk scores of both schizophrenia and autism than their siblings (P<0.05). Given the small sample size, these findings suggest that COS patients have more salient genetic risk than do AOS.
Collapse
|
20
|
Heterogeneity revealed through meta-analysis might link geographical differences with nasopharyngeal carcinoma incidence in Han Chinese populations. BMC Cancer 2015; 15:598. [PMID: 26307051 PMCID: PMC4549009 DOI: 10.1186/s12885-015-1607-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2015] [Accepted: 08/18/2015] [Indexed: 11/29/2022] Open
Abstract
Background Nasopharyngeal carcinoma (NPC) is an epithelial malignancy highly prevalent in southern China, and incidence rates among Han Chinese people vary according to geographic region. Recently, three independent genome-wide association studies (GWASs) confirmed that HLA-A is the main risk gene for NPC. However, the results of studies conducted in regions with dissimilar incidence rates contradicted the claims that HLA-A is the sole risk gene and that the association of rs29232 is independent of the HLA-A effect in the chromosome 6p21.3 region. Methods We performed a meta-analysis, selecting five single-nucleotide polymorphisms (SNPs) in chromosome 6p21.3 mapped in three published GWASs and four case–control studies. The studies involved 8994 patients with NPC and 11,157 healthy controls, all of whom were Han Chinese. Results The rs2517713 SNP located downstream of HLA-A was significantly associated with NPC (P = 1.08 × 10−91, odds ratio [OR] = 0.58, 95 % confidence interval [CI] = 0.55–0.61). The rs29232 SNP exhibited a moderate level of heterogeneity (I2 = 47 %) that disappeared (I2 = 0 %) after stratification by moderate- and high-incidence NPC regions. Conclusions Our results suggested that the HLA-A gene is strongly associated with NPC risk. In addition, the heterogeneity revealed by the meta-analysis of rs29232 might be associated with regional differences in NPC incidence among Han Chinese people. The higher OR of rs29232 and the fact that rs29232 was independent of the HLA-A effect in the moderate-incidence population suggested that rs29232 might have greater relevance to NPC incidence in a moderate-incidence population than in a high-incidence population. Electronic supplementary material The online version of this article (doi:10.1186/s12885-015-1607-0) contains supplementary material, which is available to authorized users.
Collapse
|
21
|
Efficient test for nonlinear dependence of two continuous variables. BMC Bioinformatics 2015; 16:260. [PMID: 26283601 PMCID: PMC4539721 DOI: 10.1186/s12859-015-0697-7] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2014] [Accepted: 08/06/2015] [Indexed: 01/28/2023] Open
Abstract
Background Testing dependence/correlation of two variables is one of the fundamental tasks in statistics. In this work, we proposed a new way of testing nonlinear dependence between two continuous variables (X and Y). Results We addressed this research question by using CANOVA (continuous analysis of variance, software available at https://sourceforge.net/projects/canova/). In the CANOVA framework, we first defined a neighborhood for each data point related to its X value, and then calculated the variance of the Y value within the neighborhood. Finally, we performed permutations to evaluate the significance of the observed values within the neighborhood variance. To evaluate the strength of CANOVA compared to six other methods, we performed extensive simulations to explore the relationship between methods and compared the false positive rates and statistical power using both simulated and real datasets (kidney cancer RNA-seq dataset). Conclusions We concluded that CANOVA is an efficient method for testing nonlinear correlation with several advantages in real data applications. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0697-7) contains supplementary material, which is available to authorized users.
Collapse
|
22
|
Abstract 4630: Heterogeneity revealed through meta-analysis might link geographical differences with nasopharyngeal carcinoma incidence in Han Chinese. Cancer Res 2015. [DOI: 10.1158/1538-7445.am2015-4630] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Nasopharyngeal carcinoma (NPC) is an epithelial malignancy highly prevalent in Southern China with varying incidence rates among Han Chinese in different geographic regions. Recently, three independent genome-wide association studies (GWAS) confirmed that the HLA-A was the main risk gene of NPC. However, studies from regions with different incidence rates led to contradictory claims regarding whether HLA-A was the sole risk gene and the association of rs29232 was independent from the HLA-A effect in chromosome 6p21.3 region. To explore whether the differences were caused by genetic factors underlying the varying incidence rates by region in NPC incidence, we performed a meta-analysis by selecting 5 single-nucleotide polymorphisms (SNPs) in chromosome 6p21.3 from 3 published GWAS studies and 3 published case-control studies from Southern China (Taiwan, Guangdong and Guangxi) involving 7 828 NPC patients and 8 813 healthy controls from Han Chinese. The rs29232 SNP exhibited a moderate level of heterogeneity (I2 = 43%) that disappeared (I2 = 0% and 4%) after stratifying the studies according to moderate- and high-incidence NPC regions. The estimated odds ratio (OR) of rs29232 (OR: 1.70, 95% confidence interval [CI]: 1.47-1.95) was higher in the moderate-incidence region than in the high-incidence region (OR: 1.42, 95% CI: 1.30-1.56). The heterogeneity revealed by meta-analysis of rs29232 might be associated with differences in NPC incidence in Han Chinese between geographic regions. Therefore, we postulate that moderate heterogeneity in rs29232 might contribute to the varying incidence rates between regions due to a genetic cause in NPC incidence. The higher OR of rs29232 and the fact that it was independent of the HLA-A effect in the moderate-incidence population suggested that rs29232 might have greater relevance to NPC in a moderate-incidence population than to a high-incidence population. To our knowledge, this is the first study to demonstrate that genetic factors might involve in the varying disease incidence rates by region. This further suggests that the disease incidence rate should be considered as a potential confounding factor when conducting large, cross-region meta-analysis in NPC.
Citation Format: Wen-Hui Su, Chi-Cking Chiu, Yin Yao Shugart. Heterogeneity revealed through meta-analysis might link geographical differences with nasopharyngeal carcinoma incidence in Han Chinese. [abstract]. In: Proceedings of the 106th Annual Meeting of the American Association for Cancer Research; 2015 Apr 18-22; Philadelphia, PA. Philadelphia (PA): AACR; Cancer Res 2015;75(15 Suppl):Abstract nr 4630. doi:10.1158/1538-7445.AM2015-4630
Collapse
|
23
|
Genetic variants in five novel loci including CFB and CD40 predispose to chronic hepatitis B. Hepatology 2015; 62:118-28. [PMID: 25802187 DOI: 10.1002/hep.27794] [Citation(s) in RCA: 64] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/30/2014] [Accepted: 03/16/2015] [Indexed: 12/15/2022]
Abstract
UNLABELLED Hepatitis B virus affects more than 2 billion people worldwide, 350 million of which have developed chronic hepatitis B (CHB). The genetic factors that confer CHB risk are still largely unknown. We sought to identify genetic variants for CHB susceptibility in the Chinese population. We undertook a genome-wide association study (GWAS) in 2,514 CHB cases and 1,130 normal controls from eastern China. We replicated 33 of the most promising signals and eight previously reported CHB risk loci through a two-stage validation totaling 6,600 CHB cases and 8,127 controls in four independent populations, of which two populations were recruited from eastern China, one from northern China and one from southern China. The joint analyses of 9,114 CHB cases and 9,257 controls revealed significant association of CHB risk with five novel loci. Four loci are located in the human leukocyte antigen (HLA) region at 6p21.3, including two nonsynonymous variants (rs12614 [R32W] in complement factor B [CFB], Pmeta =1.28 × 10(-34) ; and rs422951 [T320A] in NOTCH4, Pmeta = 5.33 × 10(-16) ); one synonymous variant (rs378352 in HLA-DOA corresponding to HLA-DOA*010101, Pmeta = 1.04 × 10(-23) ); and one noncoding variant (rs2853953 near HLA-C, Pmeta = 5.06 × 10(-20) ). Another locus is located at 20q13.1 (rs1883832 in the Kozak sequence of CD40, Pmeta = 2.95 × 10(-15) ). Additionally, we validated seven of eight previously reported CHB susceptibility loci (rs3130542 at HLA-C, rs1419881 at TCF19, rs652888 at EHMT2, rs2856718 at HLA-DQB1, rs7453920 at HLA-DQB2, rs3077 at HLA-DPA1, and rs9277535 at HLA-DPA2, which are all located in the HLA region, 9.84 × 10(-71) ≤ Pmeta ≤ 9.92 × 10(-7) ). CONCLUSION Our GWAS identified five novel susceptibility loci for CHB. These findings improve the understanding of CHB etiology and may provide new targets for prevention and treatment of this disease.
Collapse
|
24
|
Abstract
BACKGROUND Previous findings are inconsistent; yet, converging evidence suggests an association between schizophrenia (SZ) and the impairment of posttranscriptional regulation of brain development through microRNA (miRNA) systems. METHODS This study aims to (1) compare the overall frequency of 121 rare variants (RVs) in 59 genes associated with the miRNA system in genome-wide association studies (GWAS)-derived data including 768 SZ cases and 1348 healthy controls and validated in an independent GWAS data including 1802 SZ cases and 1447 controls; (2) profile genome-wide miRNA expression in blood collected from 15 early-onset SZ (EOS) cases and 15 healthy controls; and (3) construct a miRNA-messenger RNA (mRNA) regulatory network using our previous genome-wide mRNA expression data generated from a separate sample of 18 EOS cases and 12 healthy controls. RESULTS Our findings indicate that: (1) In genes associated with the control of miRNAs, there are approximately 50% more RVs in SZ cases than in controls (P ≤ 2.62E-10); (2) The observed lower miRNA activity in EOS patients compared with the healthy controls suggests that miRNAs are abnormally downregulated; (3) There exists a predicted regulatory network among some downregulated miRNAs and some upregulated mRNAs. CONCLUSIONS Collectively, results from all 3 lines of evidence, suggest that the genetically based dysregulation of miRNA systems undermines miRNAs' inhibitory effects, resulting in the abnormal upregulation of genome transcription in the development of SZ.
Collapse
|
25
|
Mapsnp: an R package to plot a genomic map for single nucleotide polymorphisms. PLoS One 2015; 10:e0123609. [PMID: 25853637 PMCID: PMC4390227 DOI: 10.1371/journal.pone.0123609] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2014] [Accepted: 03/04/2015] [Indexed: 01/10/2023] Open
Abstract
Single-nucleotide polymorphism (SNP) is one of the most common sources of genetic variations of the genome. Currently, SNPs are a main target for most genetic association studies. Visualizing genomic coordinates of SNPs, including their physical location relative to their host gene, and the structure of the relevant transcripts, may provide intuitive supplements to the understanding of their functions. Nevertheless, to date, no such easy-to-use programming tools exist. Therefore, we developed an R package, "mapsnp", to plot genomic map for a panel of SNPs within a genome region of interest, including the relative chromosome location and the transcripts in the region. mapsnp is a simple and flexible software package which can be used to visualize a genomic map for SNPs, integrating a chromosome ideogram, genomic coordinates, SNP locations and SNP labels.
Collapse
|
26
|
Three dysconnectivity patterns in treatment-resistant schizophrenia patients and their unaffected siblings. NEUROIMAGE-CLINICAL 2015; 8:95-103. [PMID: 26106532 PMCID: PMC4473730 DOI: 10.1016/j.nicl.2015.03.017] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/13/2015] [Revised: 03/17/2015] [Accepted: 03/19/2015] [Indexed: 01/17/2023]
Abstract
Among individuals diagnosed with schizophrenia, approximately 20%–33% are recognized as treatment-resistant schizophrenia (TRS) patients. These TRS patients suffer more severely from the disease but struggle to benefit from existing antipsychotic treatments. A few recent studies suggested that schizophrenia may be caused by impaired synaptic plasticity that manifests as functional dysconnectivity in the brain, however, few of those studies focused on the functional connectivity changes in the brains of TRS groups. In this study, we compared the whole brain connectivity variations in TRS patients, their unaffected siblings, and healthy controls. Connectivity network features between and within the 116 automated anatomical labeling (AAL) brain regions were calculated and compared using maps created with three contrasts: patient vs. control, patient vs. sibling, and sibling vs. control. To evaluate the predictive power of the selected features, we performed a multivariate classification approach. We also evaluated the influence of six important clinical measures (e.g. age, education level) on the connectivity features. This study identified abnormal significant connectivity changes of three patterns in TRS patients and their unaffected siblings: 1) 69 patient-specific connectivity (PCN); 2) 102 shared connectivity (SCN); and 3) 457 unshared connectivity (UCN). While the first two patterns were widely reported by previous non-TRS specific studies, we were among the first to report widespread significant connectivity differences between TRS patient groups and their healthy sibling groups. Observations of this study may provide new insights for the understanding of the neurophysiological mechanisms of TRS. We first compared global functional connectivity in treatment-resistant schizophrenia and their unaffected siblings. Widespread unshared significant functional connectivity in unaffected siblings of treatment-resistant schizophrenia We studied the association of brain connectivity to clinical measures.
Collapse
|
27
|
plot2groups: an R package to plot scatter points for two groups of values. SOURCE CODE FOR BIOLOGY AND MEDICINE 2014. [PMCID: PMC4413984 DOI: 10.1186/1751-0473-9-23] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Background Researchers usually employ bar graphs to show two groups of data, which can be easily manipulated to yield false impressions. To some extent, scatterplot can retain the real data values and the spread of the data. However, for groups of numeric data, scatterplot may cause over-plotting problems. As a result, many values all stack on top of each other. Results We recently implemented an R package, plot2groups, to plot scatter points for two groups values, jittering the adjacent points side by side to avoid overlapping in the plot. The functions simultaneously calculate a P value of two group t- or rank-test and incorporated the P value into the plot. Conclusions plot2groups is a simple and flexible software package which can be used to visualize two groups of values within the statistical programming environment R.
Collapse
|
28
|
Copy number variation in obsessive-compulsive disorder and tourette syndrome: a cross-disorder study. J Am Acad Child Adolesc Psychiatry 2014; 53:910-9. [PMID: 25062598 PMCID: PMC4218748 DOI: 10.1016/j.jaac.2014.04.022] [Citation(s) in RCA: 97] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/11/2013] [Revised: 03/16/2014] [Accepted: 06/18/2014] [Indexed: 12/31/2022]
Abstract
OBJECTIVE Obsessive-compulsive disorder (OCD) and Tourette syndrome (TS) are heritable neurodevelopmental disorders with a partially shared genetic etiology. This study represents the first genome-wide investigation of large (>500 kb), rare (<1%) copy number variants (CNVs) in OCD and the largest genome-wide CNV analysis in TS to date. METHOD The primary analyses used a cross-disorder design for 2,699 case patients (1,613 ascertained for OCD, 1,086 ascertained for TS) and 1,789 controls. Parental data facilitated a de novo analysis in 348 OCD trios. RESULTS Although no global CNV burden was detected in the cross-disorder analysis or in secondary, disease-specific analyses, there was a 3.3-fold increased burden of large deletions previously associated with other neurodevelopmental disorders (p = .09). Half of these neurodevelopmental deletions were located in a single locus, 16p13.11 (5 case patient deletions: 0 control deletions, p = .08 in the current study, p = .025 compared to published controls). Three 16p13.11 deletions were confirmed de novo, providing further support for the etiological significance of this region. The overall OCD de novo rate was 1.4%, which is intermediate between published rates in controls (0.7%) and in individuals with autism or schizophrenia (2-4%). CONCLUSION Several converging lines of evidence implicate 16p13.11 deletions in OCD, with weaker evidence for a role in TS. The trend toward increased overall neurodevelopmental CNV burden in TS and OCD suggests that deletions previously associated with other neurodevelopmental disorders may also contribute to these phenotypes.
Collapse
|
29
|
The power comparison of the haplotype-based collapsing tests and the variant-based collapsing tests for detecting rare variants in pedigrees. BMC Genomics 2014; 15:632. [PMID: 25070353 PMCID: PMC4131059 DOI: 10.1186/1471-2164-15-632] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2013] [Accepted: 07/18/2014] [Indexed: 11/20/2022] Open
Abstract
Background Both common and rare genetic variants have been shown to contribute to the etiology of complex diseases. Recent genome-wide association studies (GWAS) have successfully investigated how common variants contribute to the genetic factors associated with common human diseases. However, understanding the impact of rare variants, which are abundant in the human population (one in every 17 bases), remains challenging. A number of statistical tests have been developed to analyze collapsed rare variants identified by association tests. Here, we propose a haplotype-based approach. This work inspired by an existing statistical framework of the pedigree disequilibrium test (PDT), which uses genetic data to assess the effects of variants in general pedigrees. We aim to compare the performance between the haplotype-based approach and the rare variant-based approach for detecting rare causal variants in pedigrees. Results Extensive simulations in the sequencing setting were carried out to evaluate and compare the haplotype-based approach with the rare variant methods that drew on a more conventional collapsing strategy. As assessed through a variety of scenarios, the haplotype-based pedigree tests had enhanced statistical power compared with the rare variants based pedigree tests when the disease of interest was mainly caused by rare haplotypes (with multiple rare alleles), and vice versa when disease was caused by rare variants acting independently. For most of other situations when disease was caused both by haplotypes with multiple rare alleles and by rare variants with similar effects, these two approaches provided similar power in testing for association. Conclusions The haplotype-based approach was designed to assess the role of rare and potentially causal haplotypes. The proposed rare variants-based pedigree tests were designed to assess the role of rare and potentially causal variants. This study clearly documented the situations under which either method performs better than the other. All tests have been implemented in a software, which was submitted to the Comprehensive R Archive Network (CRAN) for general use as a computer program named rvHPDT.
Collapse
|
30
|
Clinical correlates and genetic linkage of social and communication difficulties in families with obsessive-compulsive disorder: Results from the OCD Collaborative Genetics Study. Am J Med Genet B Neuropsychiatr Genet 2014; 165B:326-36. [PMID: 24798771 DOI: 10.1002/ajmg.b.32235] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/05/2013] [Accepted: 04/11/2014] [Indexed: 11/09/2022]
Abstract
Some individuals with obsessive-compulsive disorder (OCD) have autistic-like traits, including deficits in social and communication behaviors (pragmatics). The objective of this study was to determine if pragmatic impairment aggregates in OCD families and discriminates a clinically and genetically distinct subtype of OCD. We conducted clinical examinations on, and collected DNA samples from, 706 individuals with OCD in 221 multiply affected OCD families. Using the Pragmatic Rating Scale (PRS), we compared the prevalence of pragmatic impairment in OCD-affected relatives of probands with and without pragmatic impairment. We also compared clinical features of OCD-affected individuals in families having at least one, versus no, individual with pragmatic impairment, and assessed for linkage to OCD in the two groups of families. The odds of pragmatic impairment were substantially greater in OCD-affected relatives of probands with pragmatic impairment. Individuals in high-PRS families had greater odds of separation anxiety disorder and social phobia, and a greater number of schizotypal personality traits. In high-PRS families, there was suggestive linkage to OCD on chromosome 12 at marker D12S1064 and on chromosome X at marker DXS7132 whereas, in low-PRS families, there was suggestive linkage to chromosome 3 at marker D3S2398. Pragmatic impairment aggregates in OCD families. Separation anxiety disorder, social phobia, and schizotypal personality traits are part of a clinical spectrum associated with pragmatic impairment in these families. Specific regions of chromosomes 12 and X are linked to OCD in high-PRS families. Thus, pragmatic impairment may distinguish a clinically and genetically homogeneous subtype of OCD.
Collapse
|
31
|
Sparse representation based biomarker selection for schizophrenia with integrated analysis of fMRI and SNPs. Neuroimage 2014; 102 Pt 1:220-8. [PMID: 24530838 DOI: 10.1016/j.neuroimage.2014.01.021] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2013] [Revised: 01/10/2014] [Accepted: 01/13/2014] [Indexed: 11/15/2022] Open
Abstract
Integrative analysis of multiple data types can take advantage of their complementary information and therefore may provide higher power to identify potential biomarkers that would be missed using individual data analysis. Due to different natures of diverse data modality, data integration is challenging. Here we address the data integration problem by developing a generalized sparse model (GSM) using weighting factors to integrate multi-modality data for biomarker selection. As an example, we applied the GSM model to a joint analysis of two types of schizophrenia data sets: 759,075 SNPs and 153,594 functional magnetic resonance imaging (fMRI) voxels in 208 subjects (92 cases/116 controls). To solve this small-sample-large-variable problem, we developed a novel sparse representation based variable selection (SRVS) algorithm, with the primary aim to identify biomarkers associated with schizophrenia. To validate the effectiveness of the selected variables, we performed multivariate classification followed by a ten-fold cross validation. We compared our proposed SRVS algorithm with an earlier sparse model based variable selection algorithm for integrated analysis. In addition, we compared with the traditional statistics method for uni-variant data analysis (Chi-squared test for SNP data and ANOVA for fMRI data). Results showed that our proposed SRVS method can identify novel biomarkers that show stronger capability in distinguishing schizophrenia patients from healthy controls. Moreover, better classification ratios were achieved using biomarkers from both types of data, suggesting the importance of integrative analysis.
Collapse
|
32
|
How genome-wide SNP-SNP interactions relate to nasopharyngeal carcinoma susceptibility. PLoS One 2013; 8:e83034. [PMID: 24376627 PMCID: PMC3871583 DOI: 10.1371/journal.pone.0083034] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2013] [Accepted: 10/29/2013] [Indexed: 11/18/2022] Open
Abstract
This study is the first to use genome-wide association study (GWAS) data to evaluate the multidimensional genetic architecture underlying nasopharyngeal cancer. Since analysis of data from GWAS confirms a close and consistent association between elevated risk for nasopharyngeal carcinoma (NPC) and major histocompatibility complex class 1 genes, our goal here was to explore lesser effects of gene-gene interactions. We conducted an exhaustive genome-wide analysis of GWAS data of NPC, revealing two-locus interactions occurring between single nucleotide polymorphisms (SNPs), and identified a number of suggestive interaction loci which were missed by traditional GWAS analyses. Although none of the interaction pairs we identified passed the genome-wide Bonferroni-adjusted threshold for significance, using independent GWAS data from the same population (Stage 2), we selected 66 SNP pairs in 39 clusters with P<0.01. We identified that in several chromosome regions, multiple suggestive interactions group to form a block-like signal, effectively reducing the rate of false discovery. The strongest cluster of interactions involved the CREB5 gene and a SNP rs1607979 on chromosome 17q22 (P = 9.86×10(-11)) which also show trans-expression quantitative loci (eQTL) association in Chinese population. We then detected a complicated cis-interaction pattern around the NPC-associated HLA-B locus, which is immediately adjacent to copy-number variations implicated in male susceptibility for NPC. While it remains to be seen exactly how and to what degree SNP-SNP interactions such as these affect susceptibility for nasopharyngeal cancer, future research on these questions holds great promise for increasing our understanding of this disease's genetic etiology, and possibly also that of other gene-related cancers.
Collapse
|
33
|
Partitioning the heritability of Tourette syndrome and obsessive compulsive disorder reveals differences in genetic architecture. PLoS Genet 2013; 9:e1003864. [PMID: 24204291 PMCID: PMC3812053 DOI: 10.1371/journal.pgen.1003864] [Citation(s) in RCA: 171] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2013] [Accepted: 08/21/2013] [Indexed: 11/18/2022] Open
Abstract
The direct estimation of heritability from genome-wide common variant data as implemented in the program Genome-wide Complex Trait Analysis (GCTA) has provided a means to quantify heritability attributable to all interrogated variants. We have quantified the variance in liability to disease explained by all SNPs for two phenotypically-related neurobehavioral disorders, obsessive-compulsive disorder (OCD) and Tourette Syndrome (TS), using GCTA. Our analysis yielded a heritability point estimate of 0.58 (se = 0.09, p = 5.64e-12) for TS, and 0.37 (se = 0.07, p = 1.5e-07) for OCD. In addition, we conducted multiple genomic partitioning analyses to identify genomic elements that concentrate this heritability. We examined genomic architectures of TS and OCD by chromosome, MAF bin, and functional annotations. In addition, we assessed heritability for early onset and adult onset OCD. Among other notable results, we found that SNPs with a minor allele frequency of less than 5% accounted for 21% of the TS heritability and 0% of the OCD heritability. Additionally, we identified a significant contribution to TS and OCD heritability by variants significantly associated with gene expression in two regions of the brain (parietal cortex and cerebellum) for which we had available expression quantitative trait loci (eQTLs). Finally we analyzed the genetic correlation between TS and OCD, revealing a genetic correlation of 0.41 (se = 0.15, p = 0.002). These results are very close to previous heritability estimates for TS and OCD based on twin and family studies, suggesting that very little, if any, heritability is truly missing (i.e., unassayed) from TS and OCD GWAS studies of common variation. The results also indicate that there is some genetic overlap between these two phenotypically-related neuropsychiatric disorders, but suggest that the two disorders have distinct genetic architectures.
Collapse
|
34
|
New insights into the genetic mechanism of IQ in autism spectrum disorders. Front Genet 2013; 4:195. [PMID: 24151499 PMCID: PMC3799005 DOI: 10.3389/fgene.2013.00195] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2013] [Accepted: 09/13/2013] [Indexed: 11/13/2022] Open
Abstract
Autism spectrum disorders (ASD) comprise a number of underlying sub-types with various symptoms and presumably different genetic causes. One important difference between these sub-phenotypes is IQ. Some forms of ASD such as Asperger’s have relatively intact intelligence while the majority does not. In this study, we explored the role of genetic factors that might account for this difference. Using a case–control study based on IQ status in 1657 ASD probands, we analyzed both common and rare variants provided by the Autism Genome Project (AGP) consortium via dbGaP (database of Genotypes and Phenotypes). We identified a set of genes, among them HLA-DRB1 and KIAA0319L, which are strongly associated with IQ within a population of ASD patients.
Collapse
|
35
|
Quantitative association of tobacco smoking with the risk of nasopharyngeal carcinoma: a comprehensive meta-analysis of studies conducted between 1979 and 2011. Am J Epidemiol 2013; 178:325-38. [PMID: 23785114 DOI: 10.1093/aje/kws479] [Citation(s) in RCA: 72] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Over the years, many studies have attempted to establish a link between tobacco smoking and an increased risk of nasopharyngeal carcinoma (NPC), but their results have been inconsistent. To clarify this link, we first conducted a comprehensive meta-analysis to integrate the findings of epidemiologic studies from the last half-century. The methodology used for this study followed the checklist proposed by the Meta-analysis of Observational Studies in Epidemiology (MOOSE) Group. Pooled risk estimates were generated using a random-effects model. Twenty-eight case-control studies and 4 cohort studies involving a total of 10,274 NPC cases and 415,266 comparison subjects were included. A substantial effect of smoking on the risk of NPC was identified in this study. The results showed that ever smokers had a 60% greater risk of developing the disease than never smokers (95% confidence interval: 1.38, 1.87); this was a robust dose-dependent association. More importantly, stronger associations were observed in low-risk populations and among persons with the predominant histological type of differentiated NPC than in high-risk populations and persons with an undifferentiated type; the odds ratios were 1.76 and 2.20, respectively, versus 1.29 and 1.27. In this comprehensive meta-analysis, well-established statistical evidence was provided about the role of tobacco smoking in the etiology of NPC.
Collapse
|
36
|
Meta-analysis of association between obsessive-compulsive disorder and the 3' region of neuronal glutamate transporter gene SLC1A1. Am J Med Genet B Neuropsychiatr Genet 2013; 162B:367-79. [PMID: 23606572 DOI: 10.1002/ajmg.b.32137] [Citation(s) in RCA: 74] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/22/2012] [Accepted: 01/15/2013] [Indexed: 12/12/2022]
Abstract
The neuronal glutamate transporter gene SLC1A1 is a candidate gene for obsessive-compulsive disorder (OCD) based on linkage studies and convergent evidence implicating glutamate in OCD etiology. The 3' end of SLC1A1 is the only genomic region with consistently demonstrated OCD association, especially when analyzing male-only probands. However, specific allele associations have not been consistently replicated, and recent OCD genome-wide association and meta-analysis studies have not incorporated all previously associated SLC1A1 SNPs. To clarify the nature of association between SLC1A1 and OCD, pooled analysis was performed on all available relevant raw study data, comprising a final sample of 815 trios, 306 cases and 634 controls. This revealed weak association between OCD and one of nine tested SLC1A1 polymorphisms (rs301443; uncorrected P = 0.046; non-significant corrected P). Secondary analyses of male-affecteds only (N = 358 trios and 133 cases) demonstrated modest association between OCD and a different SNP (rs12682807; uncorrected P = 0.012; non-significant corrected P). Findings of this meta-analysis are consistent with the trend of previous candidate gene studies in psychiatry and do not clarify the putative role of SLC1A1 in OCD pathophysiology. Nonetheless, it may be important to further examine the potential associations demonstrated in this amalgamated sample, especially since the SNPs with modest associations were not included in the more highly powered recent GWAS or in a past meta-analysis including five SLC1A1 polymorphisms. This study underscores the need for much larger sample sizes in future genetic association studies and suggests that next-generation sequencing may be beneficial in examining the potential role of rare variants in OCD.
Collapse
|
37
|
Functional polymorphism in the 5'-UTR of CR2 is associated with susceptibility to nasopharyngeal carcinoma. Oncol Rep 2013; 30:11-6. [PMID: 23612877 PMCID: PMC3729234 DOI: 10.3892/or.2013.2421] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2012] [Accepted: 02/18/2013] [Indexed: 12/27/2022] Open
Abstract
Epstein-Barr virus (EBV)-associated nasopharyngeal carcinoma (NPC) is a squamous cell cancer endemic in Southern China and Southeast Asia. It has been shown that inflammatory and immune responses during EBV infection contribute to the development of NPC. The complement receptor 2 (CR2) gene plays central roles during inflammatory and immune responses and, therefore, is a good candidate susceptibility gene for NPC. We performed PCR-based sequencing to identify multiple single-nucleotide polymorphisms (SNPs) within the exon regions of the CR2 gene in a Cantonese population. Two SNPs were screened in 528 NPC patients and 408 normal individuals to perform a case-control study matched according to age, gender and residence. Furthermore, we cloned the entire 5′-UTR and entire CR2 promoter into a luciferase report system and compared the luciferase activities between the different allelic constructs. A SNP in the 5′-UTR of CR2 (24 T/C, rs3813946) showed a significant association (P<0.01) with NPC in the Cantonese population studied. The subjects were categorized into 2 age groups: group 1, age ≤45 years and group 2, age >45 years. In group 1, the allelic frequencies of 24 T/C in the patients were significantly different from those of the controls (P=0.0034). The odds ratio (OR=1.81) also indicated a higher risk of NPC in individuals who carried the minor allele C. All constructs exerted allelic differences on luciferase activities, but only the susceptible allele +24C construct showed increased activity. Our findings implicate CR2 as a susceptibility gene for NPC and suggest that enhanced CR2 expression may be involved in the oncogenesis and development of NPC.
Collapse
|
38
|
Abstract 4838: Exploration of genome-wide two-locus SNP-SNP interactions relate to nasopharyngeal carcinoma susceptibility. Cancer Res 2013. [DOI: 10.1158/1538-7445.am2013-4838] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Most of the genome-wide association studies (GWAS) are concentrated on the detection main effects and can only explain only a small fraction of the genetic risk. One possible explanation might be the existence of interactions between loci. This effect might be missed in traditional GWAS analysis. Testing for SNP-SNP interaction within a single GWAS study can identify the stronger results that are revealed when genes interact. Here, we have completed the genome-wide two-locus SNP-SNP interaction analysis in nasopharyngeal carcinoma (NPC). We started with the NPC GWAS dataset as our Discovery dataset, then we performed an exhaustive genome-wide two-locus search by pairing each SNP sets according to chromosome positions for PLINK epistasis analysis. The analysises were also performed within each SNP sets to ensure every SNP has performed epistasis analysis with all the other SNPs in the dataset. In total, there were 4,244,943 unique interactions after the genome-wide two-locus epistasis analysis. The top 10,000 interaction pairs from the Discovery dataset were selected for validation test in a subset of independent samples. Finally, 67 SNP pairs remain significant in Validation and Combined analysis and can successfully passed permutation tests for 10,000 times. The 67 significant interaction pairs can be clustered into 39 interaction groups which 12 (32%) of the 39 clusters were with at least two nearby SNPs from one of the interaction chromosome. This study represents one of the first attempts to explore the high-dimensional genetic architecture of NPC on a genome-wide scale.
Citation Format: Wen-Hui Su, Kai-Ping Chang, Yin Yao Shugart, Yu-Sun Chang. Exploration of genome-wide two-locus SNP-SNP interactions relate to nasopharyngeal carcinoma susceptibility. [abstract]. In: Proceedings of the 104th Annual Meeting of the American Association for Cancer Research; 2013 Apr 6-10; Washington, DC. Philadelphia (PA): AACR; Cancer Res 2013;73(8 Suppl):Abstract nr 4838. doi:10.1158/1538-7445.AM2013-4838
Collapse
|
39
|
Replication study confirms link between TSPAN18 mutation and schizophrenia in Han Chinese. PLoS One 2013; 8:e58785. [PMID: 23505562 PMCID: PMC3591373 DOI: 10.1371/journal.pone.0058785] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2012] [Accepted: 02/06/2013] [Indexed: 01/21/2023] Open
Abstract
Schizophrenia (SCZ) is a severe psychiatric disorder associated with many different risk factors, both genetic and environmental. A recent genome-wide association study (GWAS) of Han Chinese identified three single-nucleotide polymorphisms (SNPs rs11038167, rs11038172, and rs835784) in the tetraspanins gene TSPAN18 as possible susceptibility loci for schizophrenia. Hoping to validate these findings, we conducted a case-control study of Han Chinese with 1093 schizophrenia cases and 1022 healthy controls. Using the LDR-PCR method to genotype polymorphisms in TSPAN18, we found no significant differences (P>0.05) between patients and controls in either the allele or genotype frequency of the SNPs rs11038167 and rs11038172. We did find, however, that the frequency of the ‘A’ allele of SNP rs835784 is significantly higher in patients than in controls. We further observed a significant association (OR = 1.197, 95%CI = 1.047–1.369) between risk for SCZ and this ‘A’ allele. These results confirm the significant association, in Han Chinese populations, of increased SCZ risk and the variant of the TSPAN18 gene containing the ‘A’ allele of SNP rs835784.
Collapse
|
40
|
Developing genetic epidemiological models to predict risk for nasopharyngeal carcinoma in high-risk population of China. PLoS One 2013; 8:e56128. [PMID: 23457511 PMCID: PMC3574061 DOI: 10.1371/journal.pone.0056128] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2012] [Accepted: 01/04/2013] [Indexed: 01/12/2023] Open
Abstract
To date, the only established model for assessing risk for nasopharyngeal carcinoma (NPC) relies on the sero-status of the Epstein-Barr virus (EBV). By contrast, the risk assessment models proposed here include environmental risk factors, family history of NPC, and information on genetic variants. The models were developed using epidemiological and genetic data from a large case-control study, which included 1,387 subjects with NPC and 1,459 controls of Cantonese origin. The predictive accuracy of the models were then assessed by calculating the area under the receiver-operating characteristic curves (AUC). To compare the discriminatory improvement of models with and without genetic information, we estimated the net reclassification improvement (NRI) and integrated discrimination index (IDI). Well-established environmental risk factors for NPC include consumption of salted fish and preserved vegetables and cigarette smoking (in pack years). The environmental model alone shows modest discriminatory ability (AUC = 0.68; 95% CI: 0.66, 0.70), which is only slightly increased by the addition of data on family history of NPC (AUC = 0.70; 95% CI: 0.68, 0.72). With the addition of data on genetic variants, however, our model’s discriminatory ability rises to 0.74 (95% CI: 0.72, 0.76). The improvements in NRI and IDI also suggest the potential usefulness of considering genetic variants when screening for NPC in endemic areas. If these findings are confirmed in larger cohort and population-based case-control studies, use of the new models to analyse data from NPC-endemic areas could well lead to earlier detection of NPC.
Collapse
|
41
|
Genome-wide association study meta-analysis of European and Asian-ancestry samples identifies three novel loci associated with bipolar disorder. Mol Psychiatry 2013; 18:195-205. [PMID: 22182935 DOI: 10.1038/mp.2011.157] [Citation(s) in RCA: 162] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Meta-analyses of bipolar disorder (BD) genome-wide association studies (GWAS) have identified several genome-wide significant signals in European-ancestry samples, but so far account for little of the inherited risk. We performed a meta-analysis of ∼750,000 high-quality genetic markers on a combined sample of ∼14,000 subjects of European and Asian-ancestry (phase I). The most significant findings were further tested in an extended sample of ∼17,700 cases and controls (phase II). The results suggest novel association findings near the genes TRANK1 (LBA1), LMAN2L and PTGFR. In phase I, the most significant single nucleotide polymorphism (SNP), rs9834970 near TRANK1, was significant at the P=2.4 × 10(-11) level, with no heterogeneity. Supportive evidence for prior association findings near ANK3 and a locus on chromosome 3p21.1 was also observed. The phase II results were similar, although the heterogeneity test became significant for several SNPs. On the basis of these results and other established risk loci, we used the method developed by Park et al. to estimate the number, and the effect size distribution, of BD risk loci that could still be found by GWAS methods. We estimate that >63,000 case-control samples would be needed to identify the ∼105 BD risk loci discoverable by GWAS, and that these will together explain <6% of the inherited risk. These results support previous GWAS findings and identify three new candidate genes for BD. Further studies are needed to replicate these findings and may potentially lead to identification of functional variants. Sample size will remain a limiting factor in the discovery of common alleles associated with BD.
Collapse
|
42
|
Genetic variants in STAT4 and HLA-DQ genes confer risk of hepatitis B virus-related hepatocellular carcinoma. Nat Genet 2012; 45:72-5. [PMID: 23242368 DOI: 10.1038/ng.2483] [Citation(s) in RCA: 226] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2012] [Accepted: 11/06/2012] [Indexed: 02/08/2023]
Abstract
To identify genetic susceptibility loci for hepatitis B virus (HBV)-related hepatocellular carcinoma (HCC) in the Chinese population, we carried out a genome-wide association study (GWAS) in 2,514 chronic HBV carriers (1,161 HCC cases and 1,353 controls) followed by a 2-stage validation among 6 independent populations of chronic HBV carriers (4,319 cases and 4,966 controls). The joint analyses showed that HCC risk was significantly associated with two independent loci: rs7574865 at STAT4, P(meta) = 2.48 × 10(-10), odds ratio (OR) = 1.21; and rs9275319 at HLA-DQ, P(meta) = 2.72 × 10(-17), OR = 1.49. The risk allele G at rs7574865 was significantly associated with lower mRNA levels of STAT4 in both the HCC tissues and nontumor tissues of 155 individuals with HBV-related HCC (P(trend) = 0.0008 and 0.0002, respectively). We also found significantly lower mRNA expression of STAT4 in HCC tumor tissues compared with paired adjacent nontumor tissues (P = 2.33 × 10(-14)).
Collapse
|
43
|
Weighted pedigree-based statistics for testing the association of rare variants. BMC Genomics 2012; 13:667. [PMID: 23176082 PMCID: PMC3827928 DOI: 10.1186/1471-2164-13-667] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2012] [Accepted: 11/12/2012] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND With the advent of next-generation sequencing (NGS) technologies, researchers are now generating a deluge of data on high dimensional genomic variations, whose analysis is likely to reveal rare variants involved in the complex etiology of disease. Standing in the way of such discoveries, however, is the fact that statistics for rare variants are currently designed for use with population-based data. In this paper, we introduce a pedigree-based statistic specifically designed to test for rare variants in family-based data. The additional power of pedigree-based statistics stems from the fact that while rare variants related to diseases or traits of interest occur only infrequently in populations, in families with multiple affected individuals, such variants are enriched. Note that while the proposed statistic can be applied with and without statistical weighting, our simulations show that its power increases when weighting (WSS and VT) are applied. RESULTS Our working hypothesis was that, since rare variants are concentrated in families with multiple affected individuals, pedigree-based statistics should detect rare variants more powerfully than population-based statistics. To evaluate how well our new pedigree-based statistics perform in association studies, we develop a general framework for sequence-based association studies capable of handling data from pedigrees of various types and also from unrelated individuals. In short, we developed a procedure for transforming population-based statistics into tests for family-based associations. Furthermore, we modify two existing tests, the weighted sum-square test and the variable-threshold test, and apply both to our family-based collapsing methods. We demonstrate that the new family-based tests are more powerful than corresponding population-based test and they generate a reasonable type I error rate.To demonstrate feasibility, we apply the newly developed tests to a pedigree-based GWAS data set from the Framingham Heart Study (FHS). FHS-GWAS data contain approximately 5000 uncommon variants with frequencies less than 0.05. Potential association findings in these data demonstrate the feasibility of the software PB-STAR (note, PB-STAR is now freely available to the public). CONCLUSION Our tests show that when analyzing for rare variants, a pedigree-based design is more powerful than a population-based case-control design. We further demonstrate that a pedigree-based statistic's power to detect rare variants increases in direct relation to the proportion of affected individuals within the pedigree.
Collapse
|
44
|
DNA repair gene variants in relation to overall cancer risk: a population-based study. Carcinogenesis 2012; 34:86-92. [PMID: 23027618 DOI: 10.1093/carcin/bgs304] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
The hypothesis that germ-line polymorphisms in DNA repair genes influence cancer risk has previously been tested primarily on a cancer site-specific basis. The purpose of this study was to test the hypothesis that DNA repair gene allelic variants contribute to globally elevated cancer risk by measuring associations with risk of all cancers that occurred within a population-based cohort. In the CLUE II cohort study established in 1989 in Washington County, MD, this study was comprised of all 3619 cancer cases ascertained through 2007 compared with a sample of 2296 with no cancer. Associations were measured between 759 DNA repair gene single nucleotide polymorphisms (SNPs) and risk of all cancers. A SNP in O(6)-methylguanine-DNA methyltransferase, MGMT, (rs2296675) was significantly associated with overall cancer risk [per minor allele odds ratio (OR) 1.30, 95% confidence interval (CI) 1.19-1.43 and P-value: 4.1 × 10(-8)]. The association between rs2296675 and cancer risk was stronger among those aged ≤54 years old than those who were ≥55 years at baseline (P-for-(interaction) = 0.021). OR were in the direction of increased risk for all 15 categories of malignancies studied (P < 0.0001), ranging from 1.22 (P = 0.42) for ovarian cancer to 2.01 (P = 0.008) for urinary tract cancers; the smallest P-value was for breast cancer (OR 1.45, P = 0.0002). The results indicate that the minor allele of MGMT SNP rs2296675, a common genetic marker with 37% carriers, was significantly associated with increased risk of cancer across multiple tissues. Replication is needed to more definitively determine the scientific and public health significance of this observed association.
Collapse
|
45
|
An epidemiological and molecular study of the relationship between smoking, risk of nasopharyngeal carcinoma, and Epstein-Barr virus activation. J Natl Cancer Inst 2012; 104:1396-410. [PMID: 22972969 DOI: 10.1093/jnci/djs320] [Citation(s) in RCA: 131] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Elevated levels of antibodies against antigens in the Epstein-Barr virus (EBV) lytic phase are important predictive markers for nasopharyngeal carcinoma (NPC) risk. Several lifestyle factors, including smoking, have also been associated with NPC risk. We hypothesized that some specific lifestyle factors induce transformation of EBV from the latent to the lytic stage and contribute to NPC occurrence. METHODS We conducted a case-control study using data from male case patients (n = 1316) and control subjects (n = 1571) living in Guangdong Province, an area in China at high risk for NPC, to study potential NPC risk factors and EBV inducers. Two independent healthy male populations from a second high-risk area (n = 1657) and a low-risk area (n = 1961) were also included in the analysis of potential EBV inducers using logistic regression models. In vitro assays were performed to investigate the effect of cigarette smoke extract on EBV activation in two EBV-positive cell lines. All statistical tests were two-sided. RESULTS Smoking was associated with an increased risk of NPC among the Guangdong participants with 20-40 and 40 or more pack-years vs never smokers (OR = 1.52, 95% CI = 1.22 to 1.88 and OR = 1.76, 95% CI = 1.34 to 2.32, respectively; P (trend) < .001). Smoking was the only factor linked to EBV seropositivity among the expanded control group and the independent low-risk population. In vitro experiments showed that cigarette smoke extract promoted EBV replication, induced the expression of the immediate-early transcriptional activators Zta and Rta, and increased transcriptional expression levels of BFRF3 and gp350 in the lytic phase. CONCLUSION Smoking is not only associated with NPC risk in individuals from China but is also associated with EBV seropositivity in healthy males and is involved in EBV activation.
Collapse
|
46
|
A population-based study of DNA repair gene variants in relation to non-melanoma skin cancer as a marker of a cancer-prone phenotype. Carcinogenesis 2012; 33:1692-8. [PMID: 22581838 PMCID: PMC3514896 DOI: 10.1093/carcin/bgs170] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2011] [Revised: 05/02/2012] [Accepted: 05/08/2012] [Indexed: 11/13/2022] Open
Abstract
For unknown reasons, non-melanoma skin cancer (NMSC) is associated with increased risk of other malignancies. Focusing solely on DNA repair or DNA repair-related genes, this study tested the hypothesis that DNA repair gene variants contribute to the increased cancer risk associated with a personal history of NMSC. From the parent CLUE II cohort study, established in 1989 in Washington County, MD, the study consisted of a cancer-free control group (n 5 2296) compared with three mutually exclusive groups of cancer cases ascertained through 2007: (i) Other (non-NMSC) cancer only (n 5 2349); (ii) NMSC only (n 5 694) and (iii) NMSC plus other cancer (n 5 577). The frequency of minor alleles in 759 DNA repair gene single nucleotide polymorphisms (SNPs) was compared in these four groups. Comparing those with both NMSC and other cancer versus those with no cancer, 10 SNPs had allelic trend P-values <0.01. The two top-ranked SNPs were both within the thymine DNA glycosylase gene (TDG). One was a non-synonymous coding SNP (rs2888805) [per allele odds ratio (OR) 1.40, 95% confidence interval (CI) 1.16-1.70; P-value 5 0.0006] and the other was an intronic SNP in high linkage disequilibrium with rs2888805 (rs4135150). None of the associations had a P-value <6.6310(-5), the threshold for statistical significance after correcting for multiple comparisons. The results pinpoint DNA repair genes most likely to contribute to the NMSC cancer-prone phenotype. A promising lead is genetic variants in TDG, important not only in base excision repair but also in regulating the epigenome and gene expression, which may contribute to the NMSC-associated increase in overall cancer risk.
Collapse
|
47
|
GWAS signals across the HLA regions: revealing a clue for common etiology underlying infectious tumors and other immunity diseases. CHINESE JOURNAL OF CANCER 2012; 30:226-30. [PMID: 21439243 PMCID: PMC3381606 DOI: 10.5732/cjc.011.10075] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Increasing evidence suggests that multiple genes in the human leukocyte antigen (HLA) regions play an important role in development of cancers and immunity disorders. However, the biological mechanisms of the HLA associations are not well understood. We recently conducted a survey of all genome-wide association studies (GWAS) with significant findings in the HLA regions and concluded that diseases such as cancer and immune disorders are more likely to be associated with genetic variants located in the HLA regions than other diseases. This finding is suggestive for testing a hypothesis of a common etiology of infectious tumors and other immunity diseases.
Collapse
|
48
|
Detecting rare variants for quantitative traits using nuclear families. Hum Hered 2012; 73:148-58. [PMID: 22699804 DOI: 10.1159/000338439] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2011] [Accepted: 03/30/2012] [Indexed: 01/17/2023] Open
Abstract
With the advent of sequencing technology opening up a new era of personal genome sequencing, huge amounts of rare variant data have suddenly become available to researchers seeking genetic variants related to human complex disorders. There is an urgent need for the development of novel statistical methods to analyze rare variants in a statistically powerful manner. While a number of statistical tests have already been developed to analyze collapsed rare variants identified by association tests in case-control studies, to date, only two FBAT tests-for-rare (described in the updated FBAT version v2.0.4) have applied collapsing methods analogously in family-based designs. For further research in this area, this study aims to introduce three new beta-determined weight tests for detecting rare variants for quantitative traits in nuclear families. In addition to evaluating the performance of these new methods, it also evaluates that of the two FBAT tests-for-rare, using extensive simulations of situations with and without linkage disequilibrium. Results from these simulations suggest that the four tests using beta-determined weights outperform the two collapsing methods used in FBAT (-v0 and -v1). In addition, both the linear combination method (detailed in the FBAT menu v2.0.4) and the multiple regression method (mixing LASSO and Ridge penalties) performed better than the other two beta-determined weight tests we proposed. Following testing and evaluation, we submitted four new beta-determined weight methods of statistical analysis in a computer program to the Comprehensive R Archive Network (CRAN) for general use.
Collapse
|
49
|
A population-based study of hedgehog pathway gene variants in relation to the dual risk of basal cell carcinoma plus another cancer. Cancer Epidemiol 2012; 36:e288-93. [PMID: 22677152 DOI: 10.1016/j.canep.2012.05.001] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2012] [Revised: 05/01/2012] [Accepted: 05/03/2012] [Indexed: 11/28/2022]
Abstract
INTRODUCTION A personal history of basal cell carcinoma (BCC) is associated with increased risk of other malignancies, but the reason is unknown. The hedgehog pathway is critical to the etiology of BCC, and is also believed to contribute to susceptibility to other cancers. This study tested the hypothesis that hedgehog pathway and pathway-related gene variants contribute to the increased risk of subsequent cancers among those with a history of BCC. METHODS The study was nested within the ongoing CLUE II cohort study, established in 1989 in Washington County, Maryland, USA. The study consisted of a cancer-free control group (n=2296) compared to three different groups of cancer cases ascertained through 2007, those diagnosed with: (1) Other (non-BCC) cancer only (n=2349); (2) BCC only (n=534); and (3) BCC plus other cancer (n=446). The frequencies of variant alleles were compared among these four groups for 20 single nucleotide polymorphisms (SNPs) in 6 hedgehog pathway genes (SHH, IHH, PTCH2, SMO, GLI1, SUFU), and also 22 SNPs in VDR and 8 SNPs in FAS, which have cross-talk with the hedgehog pathway. RESULTS Comparing those with both BCC and other cancer versus those with no cancer, no significant associations were observed for any of the hedgehog pathway SNPs, or for the FAS SNPs. One VDR SNP was nominally significantly associated with the BCC cancer-prone phenotype, rs11574085 [per minor allele odds ratio (OR) 1.38, 95% confidence interval (CI) 1.05-1.82; p-value=0.02]. CONCLUSION The hedgehog pathway gene SNPs studied, along with the VDR and FAS SNPs studied, are not strongly associated with the BCC cancer-prone phenotype.
Collapse
|
50
|
Abstract 2636: Nonmelanoma skin cancer as a marker of a cancer-prone phenotype: Potential role of DNA repair gene variants. Cancer Res 2012. [DOI: 10.1158/1538-7445.am2012-2636] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Purpose: For unknown reasons, nonmelanoma skin cancer (NMSC) is associated with increased risk of other malignancies. This study tested the hypothesis that DNA repair gene variants contribute to the increased cancer risk associated with a personal history of NMSC. Methods: From the parent CLUE II cohort study, established in 1989 in Washington County MD, the study consisted of a cancer-free control group (n=2,296) compared to three mutually exclusive groups of cancer cases ascertained through 2007: 1) Other (non-NMSC) cancer only (n=2,349); 2) NMSC only (n=694); and 3) NMSC plus other cancer (n=577). The frequency of minor alleles in 759 DNA repair single nucleotide polymorphisms (SNPs) was compared in these four groups. Results: Comparing those with both NMSC and other cancer versus those with no cancer, 10 SNPs had additive model p-values<0.01. The two top-ranked SNPs were both within the thymine DNA glycosylase gene (TDG). One was a nonsynonymous coding SNP (rs2888805) (per allele odds ratio (OR) 1.40, 95% confidence interval (CI) 1.16-1.70; p-value=0.0006) and the other was an intronic SNP in high linkage disequilibrium with rs2888805 (rs4135150). Conclusion: The results pinpoint DNA repair genes most likely to contribute to the NMSC cancer-prone phenotype. A promising lead is genetic variants in TDG, important not only in base excision repair but also in regulating the epigenome and gene expression, which may contribute to the NMSC-associated increase in overall cancer risk.
Citation Format: {Authors}. {Abstract title} [abstract]. In: Proceedings of the 103rd Annual Meeting of the American Association for Cancer Research; 2012 Mar 31-Apr 4; Chicago, IL. Philadelphia (PA): AACR; Cancer Res 2012;72(8 Suppl):Abstract nr 2636. doi:1538-7445.AM2012-2636
Collapse
|