1
|
Mews P, Sosnick L, Gurung A, Sidoli S, Nestler EJ. Decoding cocaine-induced proteomic adaptations in the mouse nucleus accumbens. Sci Signal 2024; 17:eadl4738. [PMID: 38626009 PMCID: PMC11170322 DOI: 10.1126/scisignal.adl4738] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2023] [Accepted: 03/28/2024] [Indexed: 04/18/2024]
Abstract
Cocaine use disorder (CUD) is a chronic neuropsychiatric condition that results from enduring cellular and molecular adaptations. Among substance use disorders, CUD is notable for its rising prevalence and the lack of approved pharmacotherapies. The nucleus accumbens (NAc), a region that is integral to the brain's reward circuitry, plays a crucial role in the initiation and continuation of maladaptive behaviors that are intrinsic to CUD. Leveraging advancements in neuroproteomics, we undertook a proteomic analysis that spanned membrane, cytosolic, nuclear, and chromatin compartments of the NAc in a mouse model. The results unveiled immediate and sustained proteomic modifications after cocaine exposure and during prolonged withdrawal. We identified congruent protein regulatory patterns during initial cocaine exposure and reexposure after withdrawal, which contrasted with distinct patterns during withdrawal. Pronounced proteomic shifts within the membrane compartment indicated adaptive and long-lasting molecular responses prompted by cocaine withdrawal. In addition, we identified potential protein translocation events between soluble-nuclear and chromatin-bound compartments, thus providing insight into intracellular protein dynamics after cocaine exposure. Together, our findings illuminate the intricate proteomic landscape that is altered in the NAc by cocaine use and provide a dataset for future research toward potential therapeutics.
Collapse
Affiliation(s)
- Philipp Mews
- Nash Family Department of Neuroscience and Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Lucas Sosnick
- Nash Family Department of Neuroscience and Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Ashik Gurung
- Nash Family Department of Neuroscience and Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Simone Sidoli
- Department of Biochemistry, Albert Einstein College of Medicine, New York, NY 10461, USA
| | - Eric J. Nestler
- Nash Family Department of Neuroscience and Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| |
Collapse
|
2
|
Chen NC, Paulin LF, Sedlazeck FJ, Koren S, Phillippy AM, Langmead B. Improved sequence mapping using a complete reference genome and lift-over. Nat Methods 2024; 21:41-49. [PMID: 38036856 DOI: 10.1038/s41592-023-02069-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Accepted: 10/09/2023] [Indexed: 12/02/2023]
Abstract
Complete, telomere-to-telomere (T2T) genome assemblies promise improved analyses and the discovery of new variants, but many essential genomic resources remain associated with older reference genomes. Thus, there is a need to translate genomic features and read alignments between references. Here we describe a method called levioSAM2 that performs fast and accurate lift-over between assemblies using a whole-genome map. In addition to enabling the use of several references, we demonstrate that aligning reads to a high-quality reference (for example, T2T-CHM13) and lifting to an older reference (for example, Genome reference Consortium (GRC)h38) improves the accuracy of the resulting variant calls on the old reference. By leveraging the quality improvements of T2T-CHM13, levioSAM2 reduces small and structural variant calling errors compared with GRC-based mapping using real short- and long-read datasets. Performance is especially improved for a set of complex medically relevant genes, where the GRC references are lower quality.
Collapse
Affiliation(s)
- Nae-Chyun Chen
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA.
| | - Luis F Paulin
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Fritz J Sedlazeck
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Adam M Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Ben Langmead
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA.
| |
Collapse
|
3
|
Tsiamis V, Schwämmle V. VIQoR: a web service for visually supervised protein inference and protein quantification. Bioinformatics 2022; 38:2757-2764. [PMID: 35561162 DOI: 10.1093/bioinformatics/btac182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Revised: 03/07/2022] [Accepted: 03/22/2022] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION In quantitative bottom-up mass spectrometry (MS)-based proteomics, the reliable estimation of protein concentration changes from peptide quantifications between different biological samples is essential. This estimation is not a single task but comprises the two processes of protein inference and protein abundance summarization. Furthermore, due to the high complexity of proteomics data and associated uncertainty about the performance of these processes, there is a demand for comprehensive visualization methods able to integrate protein with peptide quantitative data including their post-translational modifications. Hence, there is a lack of a suitable tool that provides post-identification quantitative analysis of proteins with simultaneous interactive visualization. RESULTS In this article, we present VIQoR, a user-friendly web service that accepts peptide quantitative data of both labeled and label-free experiments and accomplishes the crucial components protein inference and summarization and interactive visualization modules, including the novel VIQoR plot. We implemented two different parsimonious algorithms to solve the protein inference problem, while protein summarization is facilitated by a well-established factor analysis algorithm called fast-FARMS followed by a weighted average summarization function that minimizes the effect of missing values. In addition, summarization is optimized by the so-called Global Correlation Indicator (GCI). We test the tool on three publicly available ground truth datasets and demonstrate the ability of the protein inference algorithms to handle shared peptides. We furthermore show that GCI increases the accuracy of the quantitative analysis in datasets with replicated design. AVAILABILITY AND IMPLEMENTATION VIQoR is accessible at: http://computproteomics.bmb.sdu.dk/Apps/VIQoR/. The source code is available at: https://bitbucket.org/veitveit/viqor/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Vasileios Tsiamis
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense, Denmark
| | - Veit Schwämmle
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense, Denmark
| |
Collapse
|
4
|
Biswas A, Chakraborty S, Baruah VJ. Estimation of the proportion of true null hypotheses under sparse dependence: Adaptive FDR controlling in microarray data. Stat Methods Med Res 2022; 31:917-927. [PMID: 35133933 DOI: 10.1177/09622802221074164] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
The proportion of non-differentially expressed genes is an important quantity in microarray data analysis and an appropriate estimate of the same is used to construct adaptive multiple testing procedures. Most of the estimators for the proportion of true null hypotheses based on the thresholding, maximum likelihood and density estimation approaches assume independence among the gene expressions. Usually, sparse dependence structure is natural in modelling associations in microarray gene expression data and hence it is necessary to develop methods for accommodating the sparse dependence well within the framework of existing estimators. We propose a clustering based method to put genes in the same group that are not coexpressed using the estimated high dimensional correlation structure under sparse assumption as dissimilarity matrix. This novel method is applied to three existing estimators for the proportion of true null hypotheses. Extensive simulation study shows that the proposed method improves an existing estimator by making it less conservative and the corresponding adaptive Benjamini-Hochberg algorithm more powerful. The proposed method is applied to a microarray gene expression dataset of colorectal cancer patients and the results show gain in terms of number of differentially expressed genes. The R code is available at https://github.com/aniketstat/Proportiontion-of-true-null-under-sparse-dependence-2021.
Collapse
Affiliation(s)
- Aniket Biswas
- Department of Statistics, 28675Dibrugarh University, Dibrugarh, Assam, India
| | - Subrata Chakraborty
- Department of Statistics, 28675Dibrugarh University, Dibrugarh, Assam, India
| | - Vishwa Jyoti Baruah
- Center for Biotechnology and Bioinformatics, 28675Dibrugarh University, Dibrugarh, Assam, India
| |
Collapse
|
5
|
A Tutorial for Variance-Sensitive Clustering and the Quantitative Analysis of Protein Complexes. Methods Mol Biol 2021. [PMID: 33950508 DOI: 10.1007/978-1-0716-1024-4_30] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]
Abstract
Data clustering facilitates the identification of biologically relevant molecular features in quantitative proteomics experiments with thousands of measurements over multiple conditions. It finds groups of proteins or peptides with similar quantitative behavior across multiple experimental conditions. This co-regulatory behavior suggests that the proteins of such a group share their functional behavior and thus often can be mapped to the same biological processes and molecular subnetworks.While usual clustering approaches dismiss the variance of the measured proteins, VSClust combines statistical testing with pattern recognition into a common algorithm. Here, we show how to use the VSClust web service on a large proteomics data set and present further tools to assess the quantitative behavior of protein complexes.
Collapse
|
6
|
van Belzen IAEM, Schönhuth A, Kemmeren P, Hehir-Kwa JY. Structural variant detection in cancer genomes: computational challenges and perspectives for precision oncology. NPJ Precis Oncol 2021; 5:15. [PMID: 33654267 PMCID: PMC7925608 DOI: 10.1038/s41698-021-00155-6] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2020] [Accepted: 01/12/2021] [Indexed: 01/31/2023] Open
Abstract
Cancer is generally characterized by acquired genomic aberrations in a broad spectrum of types and sizes, ranging from single nucleotide variants to structural variants (SVs). At least 30% of cancers have a known pathogenic SV used in diagnosis or treatment stratification. However, research into the role of SVs in cancer has been limited due to difficulties in detection. Biological and computational challenges confound SV detection in cancer samples, including intratumor heterogeneity, polyploidy, and distinguishing tumor-specific SVs from germline and somatic variants present in healthy cells. Classification of tumor-specific SVs is challenging due to inconsistencies in detected breakpoints, derived variant types and biological complexity of some rearrangements. Full-spectrum SV detection with high recall and precision requires integration of multiple algorithms and sequencing technologies to rescue variants that are difficult to resolve through individual methods. Here, we explore current strategies for integrating SV callsets and to enable the use of tumor-specific SVs in precision oncology.
Collapse
Affiliation(s)
| | - Alexander Schönhuth
- Genome Data Science, Faculty of Technology, Bielefeld University, Bielefeld, Germany
| | - Patrick Kemmeren
- Princess Máxima Center for Pediatric Oncology, Utrecht, The Netherlands
| | - Jayne Y Hehir-Kwa
- Princess Máxima Center for Pediatric Oncology, Utrecht, The Netherlands.
| |
Collapse
|
7
|
Mass Spectrometry to Study Chromatin Compaction. BIOLOGY 2020; 9:biology9060140. [PMID: 32604817 PMCID: PMC7345930 DOI: 10.3390/biology9060140] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/01/2020] [Revised: 06/19/2020] [Accepted: 06/23/2020] [Indexed: 12/26/2022]
Abstract
Chromatin accessibility is a major regulator of gene expression. Histone writers/erasers have a critical role in chromatin compaction, as they “flag” chromatin regions by catalyzing/removing covalent post-translational modifications on histone proteins. Anomalous chromatin decondensation is a common phenomenon in cells experiencing aging and viral infection. Moreover, about 50% of cancers have mutations in enzymes regulating chromatin state. Numerous genomics methods have evolved to characterize chromatin state, but the analysis of (in)accessible chromatin from the protein perspective is not yet in the spotlight. We present an overview of the most used approaches to generate data on chromatin accessibility and then focus on emerging methods that utilize mass spectrometry to quantify the accessibility of histones and the rest of the chromatin bound proteome. Mass spectrometry is currently the method of choice to quantify entire proteomes in an unbiased large-scale manner; accessibility on chromatin of proteins and protein modifications adds an extra quantitative layer to proteomics dataset that assist more informed data-driven hypotheses in chromatin biology. We speculate that this emerging new set of methods will enhance predictive strength on which proteins and histone modifications are critical in gene regulation, and which proteins occupy different chromatin states in health and disease.
Collapse
|
8
|
Michalak W, Tsiamis V, Schwämmle V, Rogowska-Wrzesińska A. ComplexBrowser: A Tool for Identification and Quantification of Protein Complexes in Large-scale Proteomics Datasets. Mol Cell Proteomics 2019; 18:2324-2334. [PMID: 31447428 PMCID: PMC6823858 DOI: 10.1074/mcp.tir119.001434] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2019] [Revised: 07/27/2019] [Indexed: 12/25/2022] Open
Abstract
We have developed ComplexBrowser, an open source, online platform for supervised analysis of quantitative proteomic data (label free and isobaric mass tag based) that focuses on protein complexes. The software uses manually curated information from CORUM and Complex Portal databases to identify protein complex components. For the first time, we provide a Complex Fold Change (CFC) factor that identifies up- and downregulated complexes based on the level of complex subunits coregulation. The software provides interactive visualization of protein complexes' composition and expression for exploratory analysis and incorporates a quality control step that includes normalization and statistical analysis based on the limma package. ComplexBrowser was tested on two published studies identifying changes in protein expression within either human adenocarcinoma tissue or activated mouse T-cells. The analysis revealed 1519 and 332 protein complexes, of which 233 and 41 were found coordinately regulated in the respective studies. The adopted approach provided evidence for a shift to glucose-based metabolism and high proliferation in adenocarcinoma tissues, and the identification of chromatin remodeling complexes involved in mouse T-cell activation. The results correlate with the original interpretation of the experiments and provide novel biological details about the protein complexes affected. ComplexBrowser is, to our knowledge, the first tool to automate quantitative protein complex analysis for high-throughput studies, providing insights into protein complex regulation within minutes of analysis.
Collapse
Affiliation(s)
- Wojciech Michalak
- Department of Biochemistry & Molecular Biology and VILLUM Center for Bioanalytical Sciences, University of Southern Denmark, Campusvej 55, DK-5230, Odense M, Denmark
| | - Vasileios Tsiamis
- Department of Biochemistry & Molecular Biology and VILLUM Center for Bioanalytical Sciences, University of Southern Denmark, Campusvej 55, DK-5230, Odense M, Denmark
| | - Veit Schwämmle
- Department of Biochemistry & Molecular Biology and VILLUM Center for Bioanalytical Sciences, University of Southern Denmark, Campusvej 55, DK-5230, Odense M, Denmark
| | - Adelina Rogowska-Wrzesińska
- Department of Biochemistry & Molecular Biology and VILLUM Center for Bioanalytical Sciences, University of Southern Denmark, Campusvej 55, DK-5230, Odense M, Denmark.
| |
Collapse
|