1
|
Mitchell J, Camacho N, Shea P, Stopsack KH, Joseph V, Burren O, Dhindsa R, Nag A, Berchuck JE, O'Neill A, Abbasi A, Zoghbi AW, Alegre-Díaz J, Kuri-Morales P, Berumen J, Tapia-Conyer R, Emberson J, Torres JM, Collins R, Wang Q, Goldstein D, Matakidou A, Haefliger C, Anderson-Dring L, March R, Jobanputra V, Dougherty B, Carss K, Petrovski S, Kantoff PW, Offit K, Mucci LA, Pomerantz M, Fabre MA. Characterising the contribution of rare protein-coding germline variants to prostate cancer risk and severity in 37,184 cases. medRxiv 2024:2024.05.10.24307164. [PMID: 38766261 PMCID: PMC11100931 DOI: 10.1101/2024.05.10.24307164] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2024]
Abstract
The etiology of prostate cancer, the second most common cancer in men globally, has a strong heritable component. While rare coding germline variants in several genes have been identified as risk factors from candidate gene and linkage studies, the exome-wide spectrum of causal rare variants remains to be fully explored. To more comprehensively address their contribution, we analysed data from 37,184 prostate cancer cases and 331,329 male controls from five cohorts with germline exome/genome sequencing and one cohort with imputed array data from a population enriched in low-frequency deleterious variants. Our gene-level collapsing analysis revealed that rare damaging variants in SAMHD1 as well as genes in the DNA damage response pathway ( BRCA2 , ATM and CHEK2 ) are associated with the risk of overall prostate cancer. We also found that rare damaging variants in AOX1 and BRCA2 were associated with increased severity of prostate cancer in a case-only analysis of aggressive versus non-aggressive prostate cancer. At the single-variant level, we found rare non-synonymous variants in three genes ( HOXB13 , CHEK2 , BIK ) significantly associated with increased risk of overall prostate cancer and in four genes ( ANO7 , SPDL1 , AR , TERT ) with decreased risk. Altogether, this study provides deeper insights into the genetic architecture and biological basis of prostate cancer risk and severity.
Collapse
|
2
|
Hayeck TJ, Stong N, Baugh E, Dhindsa R, Turner TN, Malakar A, Mosbruger TL, Shaw GTW, Duan Y, Ionita-Laza I, Goldstein D, Allen AS. Ancestry adjustment improves genome-wide estimates of regional intolerance. Genetics 2022; 221:iyac050. [PMID: 35385101 PMCID: PMC9157129 DOI: 10.1093/genetics/iyac050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2022] [Accepted: 02/24/2022] [Indexed: 11/12/2022] Open
Abstract
Genomic regions subject to purifying selection are more likely to carry disease-causing mutations than regions not under selection. Cross species conservation is often used to identify such regions but with limited resolution to detect selection on short evolutionary timescales such as that occurring in only one species. In contrast, genetic intolerance looks for depletion of variation relative to expectation within a species, allowing species-specific features to be identified. When estimating the intolerance of noncoding sequence, methods strongly leverage variant frequency distributions. As the expected distributions depend on ancestry, if not properly controlled for, ancestral population source may obfuscate signals of selection. We demonstrate that properly incorporating ancestry in intolerance estimation greatly improved variant classification. We provide a genome-wide intolerance map that is conditional on ancestry and likely to be particularly valuable for variant prioritization.
Collapse
Affiliation(s)
- Tristan J Hayeck
- Department of Pathology and Laboratory Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Nicholas Stong
- Institute for Genomic Medicine, Columbia University Medical Center, New York, NY 10032, USA
| | - Evan Baugh
- Institute for Genomic Medicine, Columbia University Medical Center, New York, NY 10032, USA
| | - Ryan Dhindsa
- Institute for Genomic Medicine, Columbia University Medical Center, New York, NY 10032, USA
| | - Tychele N Turner
- Department of Genetics, Washington University in St. Louis, St. Louis, MO 63110, USA
| | - Ayan Malakar
- Institute for Genomic Medicine, Columbia University Medical Center, New York, NY 10032, USA
| | - Timothy L Mosbruger
- Department of Pathology and Laboratory Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Grace Tzun-Wen Shaw
- Department of Pathology and Laboratory Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Yuncheng Duan
- Department of Biostatistics and Bioinformatics, Duke University, Durham, NC 27710, USA
| | | | - David Goldstein
- Institute for Genomic Medicine, Columbia University Medical Center, New York, NY 10032, USA
| | - Andrew S Allen
- Department of Biostatistics and Bioinformatics, Duke University, Durham, NC 27710, USA
| |
Collapse
|
3
|
Gelfman S, Wang Q, Lu YF, Hall D, Bostick CD, Dhindsa R, Halvorsen M, McSweeney KM, Cotterill E, Edinburgh T, Beaumont MA, Frankel WN, Petrovski S, Allen AS, Boland MJ, Goldstein DB, Eglen SJ. meaRtools: An R package for the analysis of neuronal networks recorded on microelectrode arrays. PLoS Comput Biol 2018; 14:e1006506. [PMID: 30273353 PMCID: PMC6181426 DOI: 10.1371/journal.pcbi.1006506] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2018] [Revised: 10/11/2018] [Accepted: 09/12/2018] [Indexed: 12/22/2022] Open
Abstract
Here we present an open-source R package 'meaRtools' that provides a platform for analyzing neuronal networks recorded on Microelectrode Arrays (MEAs). Cultured neuronal networks monitored with MEAs are now being widely used to characterize in vitro models of neurological disorders and to evaluate pharmaceutical compounds. meaRtools provides core algorithms for MEA spike train analysis, feature extraction, statistical analysis and plotting of multiple MEA recordings with multiple genotypes and treatments. meaRtools functionality covers novel solutions for spike train analysis, including algorithms to assess electrode cross-correlation using the spike train tiling coefficient (STTC), mutual information, synchronized bursts and entropy within cultured wells. Also integrated is a solution to account for bursts variability originating from mixed-cell neuronal cultures. The package provides a statistical platform built specifically for MEA data that can combine multiple MEA recordings and compare extracted features between different genetic models or treatments. We demonstrate the utilization of meaRtools to successfully identify epilepsy-like phenotypes in neuronal networks from Celf4 knockout mice. The package is freely available under the GPL license (GPL> = 3) and is updated frequently on the CRAN web-server repository. The package, along with full documentation can be downloaded from: https://cran.r-project.org/web/packages/meaRtools/.
Collapse
Affiliation(s)
- Sahar Gelfman
- Institute for Genomic Medicine, Columbia University Medical Center, New York, NY, United States of America
- Department of Genetics and Development, Columbia University Medical Center, New York, NY, United States of America
| | - Quanli Wang
- Institute for Genomic Medicine, Columbia University Medical Center, New York, NY, United States of America
- Department of Genetics and Development, Columbia University Medical Center, New York, NY, United States of America
- Simcere Diagnostics Co, Ltd, Nanjing, China
| | - Yi-Fan Lu
- Institute for Genomic Medicine, Columbia University Medical Center, New York, NY, United States of America
- Department of Genetics and Development, Columbia University Medical Center, New York, NY, United States of America
- Department of Biology, Westmont College, Santa Barbara, CA, United States of America
| | - Diana Hall
- Institute for Genomic Medicine, Columbia University Medical Center, New York, NY, United States of America
- Department of Genetics and Development, Columbia University Medical Center, New York, NY, United States of America
| | - Christopher D. Bostick
- Institute for Genomic Medicine, Columbia University Medical Center, New York, NY, United States of America
- Department of Genetics and Development, Columbia University Medical Center, New York, NY, United States of America
| | - Ryan Dhindsa
- Institute for Genomic Medicine, Columbia University Medical Center, New York, NY, United States of America
- Department of Genetics and Development, Columbia University Medical Center, New York, NY, United States of America
| | - Matt Halvorsen
- Institute for Genomic Medicine, Columbia University Medical Center, New York, NY, United States of America
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - K. Melodi McSweeney
- Institute for Genomic Medicine, Columbia University Medical Center, New York, NY, United States of America
- Department of Genetics and Development, Columbia University Medical Center, New York, NY, United States of America
- University Program in Genetics and Genomics, Duke University, Durham, North Carolina, United States of America
| | - Ellese Cotterill
- Cambridge Computational Biology Institute, University of Cambridge, Cambridge, United Kingdom
| | - Tom Edinburgh
- Cambridge Computational Biology Institute, University of Cambridge, Cambridge, United Kingdom
| | - Michael A. Beaumont
- Institute for Genomic Medicine, Columbia University Medical Center, New York, NY, United States of America
- Axion BioSystems, Inc., Atlanta, GA, United States of America
| | - Wayne N. Frankel
- Institute for Genomic Medicine, Columbia University Medical Center, New York, NY, United States of America
- Department of Genetics and Development, Columbia University Medical Center, New York, NY, United States of America
| | - Slavé Petrovski
- Institute for Genomic Medicine, Columbia University Medical Center, New York, NY, United States of America
- Department of Medicine, Austin Health and Royal Melbourne Hospital, University of Melbourne, Melbourne, Australia
| | - Andrew S. Allen
- Institute for Genomic Medicine, Columbia University Medical Center, New York, NY, United States of America
- Department of Biostatistics and Bioinformatics, Duke University, Durham, NC, United States of America
| | - Michael J. Boland
- Institute for Genomic Medicine, Columbia University Medical Center, New York, NY, United States of America
- Department of Neurology, Columbia University, New York, NY, United States of America
| | - David B. Goldstein
- Institute for Genomic Medicine, Columbia University Medical Center, New York, NY, United States of America
- Department of Genetics and Development, Columbia University Medical Center, New York, NY, United States of America
| | - Stephen J. Eglen
- Cambridge Computational Biology Institute, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
4
|
Abstract
Codon preference and asymmetry in usage in the DNA sequences encoding the mature enzyme protein of 24 plant peroxidases from 12 different species were examined. Codon usage in highly conserved/non-conserved areas of the sequences was analysed, as well as possible deficiency/excess in CpG dinucleotides in the pairs of codon positions. Sequence relationships displayed by overall codon usage, dinucleotide frequencies within codons, and amino acid sequences were also studied. The main findings were: (1) Monocots clustered separately from dicots for overall codon usage and dinucleotide frequencies in codon positions 2 and 3, with six and seven clusters respectively discernible among these 24 peroxidase sequences. The monocot/dicot distinction disappeared in the four clusters among the mature protein amino acid sequences. Overall codon usage in sequences from monocotyledon and dicotyledon species differed, the monocots favouring codons with C or G in the third position. (2) Codon usage was biassed in many sequences, asymmetry was particularly noticeable in the monocots. (3) For repeated amino acids within conserved areas, codon preference appeared dependent on the order in which the repeated amino acid occurred, so that its usage of synonymous codons frequently balanced out.
Collapse
Affiliation(s)
- H Tyson
- Biology Department, McGill University, Montreal, Quebec, Canada
| | | |
Collapse
|