1
|
van Wijk KJ, Leppert T, Sun Z, Guzchenko I, Debley E, Sauermann G, Routray P, Mendoza L, Sun Q, Deutsch EW. The Zea mays PeptideAtlas: A New Maize Community Resource. J Proteome Res 2024. [PMID: 39101213 DOI: 10.1021/acs.jproteome.4c00320] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/06/2024]
Abstract
This study presents the Maize PeptideAtlas resource (www.peptideatlas.org/builds/maize) to help solve questions about the maize proteome. Publicly available raw tandem mass spectrometry (MS/MS) data for maize collected from ProteomeXchange were reanalyzed through a uniform processing and metadata annotation pipeline. These data are from a wide range of genetic backgrounds and many sample types and experimental conditions. The protein search space included different maize genome annotations for the B73 inbred line from MaizeGDB, UniProtKB, NCBI RefSeq, and for the W22 inbred line. 445 million MS/MS spectra were searched, of which 120 million were matched to 0.37 million distinct peptides. Peptides were matched to 66.2% of proteins in the most recent B73 nuclear genome annotation. Furthermore, most conserved plastid- and mitochondrial-encoded proteins (NCBI RefSeq annotations) were identified. Peptides and proteins identified in the other B73 genome annotations will improve maize genome annotation. We also illustrate the high-confidence detection of unique W22 proteins. N-terminal acetylation, phosphorylation, ubiquitination, and three lysine acylations (K-acetyl, K-malonyl, and K-hydroxyisobutyryl) were identified and can be inspected through a PTM viewer in PeptideAtlas. All matched MS/MS-derived peptide data are linked to spectral, technical, and biological metadata. This new PeptideAtlas is integrated in MaizeGDB with a peptide track in JBrowse.
Collapse
Affiliation(s)
- Klaas J van Wijk
- Section of Plant Biology, School of Integrative Plant Sciences (SIPS), Cornell University, Ithaca, New York 14853, United States
| | - Tami Leppert
- Institute for Systems Biology (ISB), Seattle, Washington 98109, United States
| | - Zhi Sun
- Institute for Systems Biology (ISB), Seattle, Washington 98109, United States
| | - Isabell Guzchenko
- Section of Plant Biology, School of Integrative Plant Sciences (SIPS), Cornell University, Ithaca, New York 14853, United States
| | - Erica Debley
- Section of Plant Biology, School of Integrative Plant Sciences (SIPS), Cornell University, Ithaca, New York 14853, United States
| | - Georgia Sauermann
- Section of Plant Biology, School of Integrative Plant Sciences (SIPS), Cornell University, Ithaca, New York 14853, United States
| | - Pratyush Routray
- Section of Plant Biology, School of Integrative Plant Sciences (SIPS), Cornell University, Ithaca, New York 14853, United States
| | - Luis Mendoza
- Institute for Systems Biology (ISB), Seattle, Washington 98109, United States
| | - Qi Sun
- Computational Biology Service Unit, Cornell University, Ithaca, New York 14853, United States
| | - Eric W Deutsch
- Institute for Systems Biology (ISB), Seattle, Washington 98109, United States
| |
Collapse
|
2
|
Pandi B, Brenman S, Black A, Ng DCM, Lau E, Lam MPY. Tissue Usage Preference and Intrinsically Disordered Region Remodeling of Alternative Splicing Derived Proteoforms in the Heart. J Proteome Res 2024; 23:3161-3173. [PMID: 38456420 PMCID: PMC11296937 DOI: 10.1021/acs.jproteome.3c00789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Revised: 02/08/2024] [Accepted: 02/27/2024] [Indexed: 03/09/2024]
Abstract
A computational analysis of mass spectrometry data was performed to uncover alternative splicing derived protein variants across chambers of the human heart. Evidence for 216 non-canonical isoforms was apparent in the atrium and the ventricle, including 52 isoforms not documented on SwissProt and recovered using an RNA sequencing derived database. Among non-canonical isoforms, 29 show signs of regulation based on statistically significant preferences in tissue usage, including a ventricular enriched protein isoform of tensin-1 (TNS1) and an atrium-enriched PDZ and LIM Domain 3 (PDLIM3) isoform 2 (PDLIM3-2/ALP-H). Examined variant regions that differ between alternative and canonical isoforms are highly enriched with intrinsically disordered regions. Moreover, over two-thirds of such regions are predicted to function in protein binding and RNA binding. The analysis here lends further credence to the notion that alternative splicing diversifies the proteome by rewiring intrinsically disordered regions, which are increasingly recognized to play important roles in the generation of biological function from protein sequences.
Collapse
Affiliation(s)
- Boomathi Pandi
- Department
of Medicine/Division of Cardiology, Department of Biochemistry &
Molecular Genetics, and Consortium for Fibrosis Research and Translation (CFReT), University of Colorado School of Medicine, Aurora, Colorado 80045, United States
| | - Stella Brenman
- Department
of Medicine/Division of Cardiology, Department of Biochemistry &
Molecular Genetics, and Consortium for Fibrosis Research and Translation (CFReT), University of Colorado School of Medicine, Aurora, Colorado 80045, United States
| | - Alexander Black
- Department
of Medicine/Division of Cardiology, Department of Biochemistry &
Molecular Genetics, and Consortium for Fibrosis Research and Translation (CFReT), University of Colorado School of Medicine, Aurora, Colorado 80045, United States
| | - Dominic C. M. Ng
- Department
of Medicine/Division of Cardiology, Department of Biochemistry &
Molecular Genetics, and Consortium for Fibrosis Research and Translation (CFReT), University of Colorado School of Medicine, Aurora, Colorado 80045, United States
| | - Edward Lau
- Department
of Medicine/Division of Cardiology, Department of Biochemistry &
Molecular Genetics, and Consortium for Fibrosis Research and Translation (CFReT), University of Colorado School of Medicine, Aurora, Colorado 80045, United States
| | - Maggie P. Y. Lam
- Department
of Medicine/Division of Cardiology, Department of Biochemistry &
Molecular Genetics, and Consortium for Fibrosis Research and Translation (CFReT), University of Colorado School of Medicine, Aurora, Colorado 80045, United States
| |
Collapse
|
3
|
Korchak JA, Jeffery ED, Bandyopadhyay S, Jordan BT, Lehe MD, Watts EF, Fenix A, Wilhelm M, Sheynkman GM. IS-PRM-Based Peptide Targeting Informed by Long-Read Sequencing for Alternative Proteome Detection. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2024. [PMID: 39012054 DOI: 10.1021/jasms.4c00119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/17/2024]
Abstract
Alternative splicing is a major contributor of transcriptomic complexity, but the extent to which transcript isoforms are translated into stable, functional protein isoforms is unclear. Furthermore, detection of relatively scarce isoform-specific peptides is challenging, with many protein isoforms remaining uncharted due to technical limitations. Recently, a family of advanced targeted MS strategies, termed internal standard parallel reaction monitoring (IS-PRM), have demonstrated multiplexed, sensitive detection of predefined peptides of interest. Such approaches have not yet been used to confirm existence of novel peptides. Here, we present a targeted proteogenomic approach that leverages sample-matched long-read RNA sequencing (lrRNA-seq) data to predict potential protein isoforms with prior transcript evidence. Predicted tryptic isoform-specific peptides, which are specific to individual gene product isoforms, serve as "triggers" and "targets" in the IS-PRM method, Tomahto. Using the model human stem cell line WTC11, LR RNaseq data were generated and used to inform the generation of synthetic standards for 192 isoform-specific peptides (114 isoforms from 55 genes). These synthetic "trigger" peptides were labeled with super heavy tandem mass tags (TMT) and spiked into TMT-labeled WTC11 tryptic digest, predicted to contain corresponding endogenous "target" peptides. Compared to DDA mode, Tomahto increased detectability of isoforms by 3.6-fold, resulting in the identification of five previously unannotated isoforms. Our method detected protein isoform expression for 43 out of 55 genes corresponding to 54 resolved isoforms. This lrRNA-seq-informed Tomahto targeted approach is a new modality for generating protein-level evidence of alternative isoforms─a critical first step in designing functional studies and eventually clinical assays.
Collapse
Affiliation(s)
- Jennifer A Korchak
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia 22903, United States
| | - Erin D Jeffery
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia 22903, United States
| | - Saikat Bandyopadhyay
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia 22903, United States
- Center for Public Health Genomics, University of Virginia, Charlottesville, Virginia 22903, United States
| | - Ben T Jordan
- Cancer Genomics Research Laboratory, Frederick National Laboratory for Cancer Research, Frederick, Maryland 21701, United States
| | - Micah D Lehe
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia 22903, United States
| | - Emily F Watts
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia 22903, United States
| | - Aidan Fenix
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, Washington 98195, United States
| | - Mathias Wilhelm
- Computational Mass Spectrometry, Technical University of Munich (TUM), D-85354 Freising, Germany
| | - Gloria M Sheynkman
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia 22903, United States
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, Virginia 22903, United States
- UVA Comprehensive Cancer Center, University of Virginia, Charlottesville, Virginia 22903, United States
| |
Collapse
|
4
|
Gallo CM, Kistler SA, Natrakul A, Labadorf AT, Beffert U, Ho A. APOER2 splicing repertoire in Alzheimer's disease: Insights from long-read RNA sequencing. PLoS Genet 2024; 20:e1011348. [PMID: 39038048 PMCID: PMC11293713 DOI: 10.1371/journal.pgen.1011348] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2024] [Revised: 08/01/2024] [Accepted: 06/21/2024] [Indexed: 07/24/2024] Open
Abstract
Disrupted alternative splicing plays a determinative role in neurological diseases, either as a direct cause or as a driver in disease susceptibility. Transcriptomic profiling of aged human postmortem brain samples has uncovered hundreds of aberrant mRNA splicing events in Alzheimer's disease (AD) brains, associating dysregulated RNA splicing with disease. We previously identified a complex array of alternative splicing combinations across apolipoprotein E receptor 2 (APOER2), a transmembrane receptor that interacts with both the neuroprotective ligand Reelin and the AD-associated risk factor, APOE. Many of the human APOER2 isoforms, predominantly featuring cassette splicing events within functionally important domains, are critical for the receptor's function and ligand interaction. However, a comprehensive repertoire and the functional implications of APOER2 isoforms under both physiological and AD conditions are not fully understood. Here, we present an in-depth analysis of the splicing landscape of human APOER2 isoforms in normal and AD states. Using single-molecule, long-read sequencing, we profiled the entire APOER2 transcript from the parietal cortex and hippocampus of Braak stage IV AD brain tissues along with age-matched controls and investigated several functional properties of APOER2 isoforms. Our findings reveal diverse patterns of cassette exon skipping for APOER2 isoforms, with some showing region-specific expression and others unique to AD-affected brains. Notably, exon 15 of APOER2, which encodes the glycosylation domain, showed less inclusion in AD compared to control in the parietal cortex of females with an APOE ɛ3/ɛ3 genotype. Also, some of these APOER2 isoforms demonstrated changes in cell surface expression, APOE-mediated receptor processing, and synaptic number. These variations are likely critical in inducing synaptic alterations and may contribute to the neuronal dysfunction underlying AD pathogenesis.
Collapse
Affiliation(s)
- Christina M. Gallo
- Department of Biology, Boston University, Boston, Massachusetts, United States of America
- Department of Pharmacology, Physiology & Biophysics, Boston University Chobanian & Avedisian School of Medicine, Boston, Massachusetts, United States of America
| | - Sabrina A. Kistler
- Department of Biology, Boston University, Boston, Massachusetts, United States of America
- Department of Pharmacology, Physiology & Biophysics, Boston University Chobanian & Avedisian School of Medicine, Boston, Massachusetts, United States of America
| | - Anna Natrakul
- Department of Biology, Boston University, Boston, Massachusetts, United States of America
| | - Adam T. Labadorf
- Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America
- Department of Neurology, Boston University Chobanian & Avedisian School of Medicine, Boston, Massachusetts, United States of America
| | - Uwe Beffert
- Department of Biology, Boston University, Boston, Massachusetts, United States of America
| | - Angela Ho
- Department of Biology, Boston University, Boston, Massachusetts, United States of America
- Department of Pharmacology, Physiology & Biophysics, Boston University Chobanian & Avedisian School of Medicine, Boston, Massachusetts, United States of America
| |
Collapse
|
5
|
Crowl S, Coleman MB, Chaphiv A, Naegle KM. A systematic analysis of the effects of splicing on the diversity of post-translational modifications in protein isoforms. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.10.575062. [PMID: 38260432 PMCID: PMC10802621 DOI: 10.1101/2024.01.10.575062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/24/2024]
Abstract
Post-translational modifications (PTMs) and splicing are known to be important regulatory processes for controlling protein function and activity. However, there have been limitations in analyzing the interplay of alternative splicing and PTMs, which stems from the deep differences in genomic and proteomic databases. In this work, we bridged the protein- and genome-centric world views to map PTMs to genomic locations for subsequent projection of PTMs onto alternative isoforms. We then performed a systematic analysis of the diversification of PTMs by alternative splicing, including exploration of the modification-specific rates of inclusion across isoforms and how often the regulatory sequences directly flanking a PTM are impacted by splicing, which might indicate altered regulatory or binding interactions in the alternatively spliced isoform. We found that 6-51% of PTMs are excluded from at least one isoform, depending on the modification type. Further, approximately 2% of prospective PTM sites exhibited altered regulatory sequences surrounding the modification site, suggesting that regulatory or binding interactions might be diversified in these proteoforms. Lastly, we applied this PTM-to-isoform mapping approach to explore the impacts of disease-related splicing in prostate cancer, identifying possible new hypotheses explaining the variable consequences of ESRP1 expression in different cancers. As a part of this work, we have provided an easily implementable tool for annotating splice events identified from RNA-sequencing with PTMs and their functional consequences, called PTM-POSE.
Collapse
|
6
|
Li N, Jia W, Wang J, Shao Q, Feng X, Li Z, Sun W, Kang M, Hu D, Xing L, Zhan X. Clinically relevant immune subtypes based on alternative splicing landscape of immune-related genes for lung cancer advanced PPPM approach. EPMA J 2024; 15:345-373. [PMID: 38841624 PMCID: PMC11147996 DOI: 10.1007/s13167-024-00366-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2024] [Accepted: 05/03/2024] [Indexed: 06/07/2024]
Abstract
Background Alternative splicing (AS) occurs in the process of gene post-transcriptional process, which is very important for the correct synthesis and function of protein. The change of AS pattern may lead to the change of expression level or function of lung cancer-related genes, and then affect the occurrence and development of lung cancers. The specific AS pattern might be used as a biomarker for early warning and prognostic assessment of a cancer in the framework of predictive, preventive, and personalized medicine (PPPM; 3PM). AS events of immune-related genes (IRGs) were closely associated with tumor progression and immunotherapy. We hypothesize that IRG-AS events are significantly different in lung adenocarcinomas (LUADs) vs. controls or in lung squamous cell carcinomas (LUSCs) vs. controls. IRG-AS alteration profiling was identified to construct IRG-differentially expressed AS (IRG-DEAS) signature models. Study on the selective AS events of specific IRGs in lung cancer patients might be of great significance for further exploring the pathogenesis of lung cancer, realizing early detection and effective monitoring of lung cancer, finding new therapeutic targets, overcoming drug resistance, and developing more effective therapeutic strategies, and better used for the prediction, diagnosis, prevention, and personalized medicine of lung cancer. Methods The transcriptomic, clinical, and AS data of LUADs and LUSCs were downloaded from TCGA and its SpliceSeq databases. IRG-DEAS events were identified in LUAD and LUSC, followed by their functional characteristics, and overall survival (OS) analyses. OS-related IRG-DEAS prognostic models were constructed for LUAD and LUSC with Lasso regression, which were used to classify LUADs and LUSCs into low- and high-risk score groups. Furthermore, the immune cell distribution, immune-related scores, drug sensitivity, mutation status, and GSEA/GSVA status were analyzed between low- and high-risk score groups. Also, low- and high-immunity clusters and AS factor (SF)-OS-related-AS co-expression network and verification of cell function of CELF6 were analyzed in LUAD and LUSC. Results Comprehensive analysis of transcriptomic, clinical, and AS data of LUADs and LUSCs identified IRG-AS events in LUAD (n = 1607) and LUSC (n = 1656), including OS-related IRG-AS events in LUAD (n = 127) and LUSC (n = 105). A total of 66 IRG-DEAS events in LUAD and 89 IRG-DEAS events in LUSC were identified compared to controls. The overlapping analysis between IRG-DEASs and OS-related IRG-AS events revealed 14 OS-related IRG-DEAS events for LUAD and 16 OS-related IRG-DEAS events for LUSC, which were used to identify and optimize a 12-OS-related-IRG-DEAS signature prognostic model for LUAD and an 11-OS-related-IRG-DEAS signature prognostic model for LUSC. These two prognostic models effectively divided LUAD or LUSC samples into low- and high-risk score groups that were closely associated with OS, clinical characteristics, and tumor immune microenvironment, with significant gene sets and pathways enriched in the two groups. Moreover, weighted gene co-expression network (WGCNA) and nonnegative matrix factorization method (NMF) analyses identified four OS-relevant subtypes of LUAD and six OS-relevant subtypes of LUSC, and ssGSEA identified five immunity-relevant subtypes of LUAD and five immunity-relevant subtypes of LUSC. Interestingly, splicing factors-OS-related-AS network revealed hub molecule CELF6 was significantly related to the malignant phenotype in lung cancer cells. Conclusions This study established two reliable IRG-DEAS signature prognostic models and constructed interesting splicing factor-splicing event networks in LUAD and LUSC, which can be used to construct clinically relevant immune subtypes, patient stratification, prognostic prediction, and personalized medical services in the PPPM practice. Supplementary Information The online version contains supplementary material available at 10.1007/s13167-024-00366-4.
Collapse
Affiliation(s)
- Na Li
- Shandong Key Laboratory of Radiation Oncology, Shandong Cancer Hospital and Institute, Shandong First Medical University & Shandong Academy of Medical Sciences, 440 Jiyan Road, Jinan, Shandong 250117 People’s Republic of China
| | - Wenshuang Jia
- Shandong Key Laboratory of Radiation Oncology, Shandong Cancer Hospital and Institute, Shandong First Medical University & Shandong Academy of Medical Sciences, 440 Jiyan Road, Jinan, Shandong 250117 People’s Republic of China
| | - Jiahong Wang
- Cancer Research Institute, School of Basic Medical Sciences, Southern Medical University, Baiyun Road 1083, Guangzhou, Guangdong 510515 People’s Republic of China
| | - Qianwen Shao
- Shandong Key Laboratory of Radiation Oncology, Shandong Cancer Hospital and Institute, Shandong First Medical University & Shandong Academy of Medical Sciences, 440 Jiyan Road, Jinan, Shandong 250117 People’s Republic of China
| | - Xiaoxia Feng
- Shandong Key Laboratory of Radiation Oncology, Shandong Cancer Hospital and Institute, Shandong First Medical University & Shandong Academy of Medical Sciences, 440 Jiyan Road, Jinan, Shandong 250117 People’s Republic of China
| | - Zhijun Li
- Shandong Key Laboratory of Radiation Oncology, Shandong Cancer Hospital and Institute, Shandong First Medical University & Shandong Academy of Medical Sciences, 440 Jiyan Road, Jinan, Shandong 250117 People’s Republic of China
| | - Wenhao Sun
- Shandong Key Laboratory of Radiation Oncology, Shandong Cancer Hospital and Institute, Shandong First Medical University & Shandong Academy of Medical Sciences, 440 Jiyan Road, Jinan, Shandong 250117 People’s Republic of China
| | - Ming Kang
- Shandong Key Laboratory of Radiation Oncology, Shandong Cancer Hospital and Institute, Shandong First Medical University & Shandong Academy of Medical Sciences, 440 Jiyan Road, Jinan, Shandong 250117 People’s Republic of China
| | - Dongming Hu
- Shandong Key Laboratory of Radiation Oncology, Shandong Cancer Hospital and Institute, Shandong First Medical University & Shandong Academy of Medical Sciences, 440 Jiyan Road, Jinan, Shandong 250117 People’s Republic of China
| | - Ligang Xing
- Shandong Key Laboratory of Radiation Oncology, Shandong Cancer Hospital and Institute, Shandong First Medical University & Shandong Academy of Medical Sciences, 440 Jiyan Road, Jinan, Shandong 250117 People’s Republic of China
| | - Xianquan Zhan
- Shandong Key Laboratory of Radiation Oncology, Shandong Cancer Hospital and Institute, Shandong First Medical University & Shandong Academy of Medical Sciences, 440 Jiyan Road, Jinan, Shandong 250117 People’s Republic of China
| |
Collapse
|
7
|
Kurgan N, Kjærgaard Larsen J, Deshmukh AS. Harnessing the power of proteomics in precision diabetes medicine. Diabetologia 2024; 67:783-797. [PMID: 38345659 DOI: 10.1007/s00125-024-06097-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Accepted: 12/20/2023] [Indexed: 03/21/2024]
Abstract
Precision diabetes medicine (PDM) aims to reduce errors in prevention programmes, diagnosis thresholds, prognosis prediction and treatment strategies. However, its advancement and implementation are difficult due to the heterogeneity of complex molecular processes and environmental exposures that influence an individual's disease trajectory. To address this challenge, it is imperative to develop robust screening methods for all areas of PDM. Innovative proteomic technologies, alongside genomics, have proven effective in precision cancer medicine and are showing promise in diabetes research for potential translation. This narrative review highlights how proteomics is well-positioned to help improve PDM. Specifically, a critical assessment of widely adopted affinity-based proteomic technologies in large-scale clinical studies and evidence of the benefits and feasibility of using MS-based plasma proteomics is presented. We also present a case for the use of proteomics to identify predictive protein panels for type 2 diabetes subtyping and the development of clinical prediction models for prevention, diagnosis, prognosis and treatment strategies. Lastly, we discuss the importance of plasma and tissue proteomics and its integration with genomics (proteogenomics) for identifying unique type 2 diabetes intra- and inter-subtype aetiology. We conclude with a call for action formed on advancing proteomics technologies, benchmarking their performance and standardisation across sites, with an emphasis on data sharing and the inclusion of diverse ancestries in large cohort studies. These efforts should foster collaboration with key stakeholders and align with ongoing academic programmes such as the Precision Medicine in Diabetes Initiative consortium.
Collapse
Affiliation(s)
- Nigel Kurgan
- Novo Nordisk Foundation Center for Basic Metabolic Research, University of Copenhagen, Copenhagen, Denmark
| | - Jeppe Kjærgaard Larsen
- Novo Nordisk Foundation Center for Basic Metabolic Research, University of Copenhagen, Copenhagen, Denmark
| | - Atul S Deshmukh
- Novo Nordisk Foundation Center for Basic Metabolic Research, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
8
|
Han Y, Wennersten SA, Pandi BP, Ng DCM, Lau E, Lam MPY. A Ratiometric Catalog of Protein Isoform Shifts in the Cardiac Fetal Gene Program. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.09.588716. [PMID: 38645170 PMCID: PMC11030362 DOI: 10.1101/2024.04.09.588716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/23/2024]
Abstract
The fetal genetic program orchestrates cardiac development and the re-expression of fetal genes is thought to underlie cardiac disease and adaptation. Here, a proteomics ratio test using mass spectrometry is applied to find protein isoforms with statistically significant usage differences in the fetal vs. postnatal mouse heart. Changes in isoform usage ratios are pervasive at the protein level, with 104 significant events observed, including 88 paralog-derived isoform switching events and 16 splicing-derived isoform switching events between fetal and postnatal hearts. The ratiometric proteomic comparisons rediscovered hallmark fetal gene signatures including a postnatal switch from fetal β (MYH7) toward ɑ (MYH6) myosin heavy chains and from slow skeletal muscle (TNNI1) toward cardiac (TNNI3) troponin I. Altered usages in metabolic proteins are prominent, including a platelet to muscle phosphofructokinase (PFKP - PFKM), enolase 1 to 3 (ENO1 - ENO3), and alternative splicing of pyruvate kinase M2 toward M1 (PKM2 - PKM1) isoforms in glycolysis. The data also revealed a parallel change in mitochondrial proteins in cardiac development, suggesting the shift toward aerobic respiration involves also a remodeling of the mitochondrial protein isoform proportion. Finally, a number of glycolytic protein isoforms revert toward their fetal forms in adult hearts under pathological cardiac hypertrophy, suggesting their functional roles in adaptive or maladaptive response, but this reversal is partial. In summary, this work presents a catalog of ratiometric protein markers of the fetal genetic program of the mouse heart, including previously unreported splice isoform markers.
Collapse
Affiliation(s)
- Yu Han
- Department of Medicine, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Sara A Wennersten
- Department of Medicine, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Boomathi P Pandi
- Department of Medicine, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Dominic C M Ng
- Department of Medicine, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Edward Lau
- Department of Medicine, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Maggie P Y Lam
- Department of Medicine, University of Colorado School of Medicine, Aurora, CO 80045, USA
- Department of Biochemistry and Molecular Genetics, University of Colorado School of Medicine, Aurora, CO 80045, USA
- Consortium for Fibrosis Research and Translation, University of Colorado School of Medicine, Aurora, CO 80045, USA
| |
Collapse
|
9
|
Korchak JA, Jeffery ED, Bandyopadhyay S, Jordan BT, Lehe M, Watts EF, Fenix A, Wilhelm M, Sheynkman GM. IS-PRM-based peptide targeting informed by long-read sequencing for alternative proteome detection. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.01.587549. [PMID: 38617311 PMCID: PMC11014528 DOI: 10.1101/2024.04.01.587549] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/16/2024]
Abstract
Alternative splicing is a major contributor of transcriptomic complexity, but the extent to which transcript isoforms are translated into stable, functional protein isoforms is unclear. Furthermore, detection of relatively scarce isoform-specific peptides is challenging, with many protein isoforms remaining uncharted due to technical limitations. Recently, a family of advanced targeted MS strategies, termed internal standard parallel reaction monitoring (IS-PRM), have demonstrated multiplexed, sensitive detection of pre-defined peptides of interest. Such approaches have not yet been used to confirm existence of novel peptides. Here, we present a targeted proteogenomic approach that leverages sample-matched long-read RNA sequencing (LR RNAseq) data to predict potential protein isoforms with prior transcript evidence. Predicted tryptic isoform-specific peptides, which are specific to individual gene product isoforms, serve as "triggers" and "targets" in the IS-PRM method, Tomahto. Using the model human stem cell line WTC11, LR RNAseq data were generated and used to inform the generation of synthetic standards for 192 isoform-specific peptides (114 isoforms from 55 genes). These synthetic "trigger" peptides were labeled with super heavy tandem mass tags (TMT) and spiked into TMT-labeled WTC11 tryptic digest, predicted to contain corresponding endogenous "target" peptides. Compared to DDA mode, Tomahto increased detectability of isoforms by 3.6-fold, resulting in the identification of five previously unannotated isoforms. Our method detected protein isoform expression for 43 out of 55 genes corresponding to 54 resolved isoforms. This LR RNA seq-informed Tomahto targeted approach, called LRP-IS-PRM, is a new modality for generating protein-level evidence of alternative isoforms - a critical first step in designing functional studies and eventually clinical assays.
Collapse
Affiliation(s)
- Jennifer A. Korchak
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
| | - Erin D. Jeffery
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
| | - Saikat Bandyopadhyay
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
| | - Ben T. Jordan
- Cancer Genomics Research Laboratory, Frederick National Laboratory for Cancer Research, Frederick, MD USA
| | - Micah Lehe
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
| | - Emily F. Watts
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
| | - Aidan Fenix
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA, USA
| | - Mathias Wilhelm
- Computational Mass Spectrometry, Technical University of Munich (TUM), D-85354 Freising, Germany
| | - Gloria M. Sheynkman
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, USA
- UVA Comprehensive Cancer Center, University of Virginia, Charlottesville, VA, USA
| |
Collapse
|
10
|
Solovyeva EM, Utzinger S, Vissières A, Mitchelmore J, Ahrné E, Hermes E, Poetsch T, Ronco M, Bidinosti M, Merkl C, Serluca FC, Fessenden J, Naumann U, Voshol H, Meyer AS, Hoersch S. Integrative Proteogenomics for Differential Expression and Splicing Variation in a DM1 Mouse Model. Mol Cell Proteomics 2024; 23:100683. [PMID: 37993104 PMCID: PMC10770608 DOI: 10.1016/j.mcpro.2023.100683] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2022] [Revised: 09/02/2023] [Accepted: 11/17/2023] [Indexed: 11/24/2023] Open
Abstract
Dysregulated mRNA splicing is involved in the pathogenesis of many diseases including cancer, neurodegenerative diseases, and muscular dystrophies such as myotonic dystrophy type 1 (DM1). Comprehensive assessment of dysregulated splicing on the transcriptome and proteome level has been methodologically challenging, and thus investigations have often been targeting only few genes. Here, we performed a large-scale coordinated transcriptomic and proteomic analysis to characterize a DM1 mouse model (HSALR) in comparison to wild type. Our integrative proteogenomics approach comprised gene- and splicing-level assessments for mRNAs and proteins. It recapitulated many known instances of aberrant mRNA splicing in DM1 and identified new ones. It enabled the design and targeting of splicing-specific peptides and confirmed the translation of known instances of aberrantly spliced disease-related genes (e.g., Atp2a1, Bin1, Ryr1), complemented by novel findings (Flnc and Ywhae). Comparative analysis of large-scale mRNA and protein expression data showed quantitative agreement of differentially expressed genes and splicing patterns between disease and wild type. We hence propose this work as a suitable blueprint for a robust and scalable integrative proteogenomic strategy geared toward advancing our understanding of splicing-based disorders. With such a strategy, splicing-based biomarker candidates emerge as an attractive and accessible option, as they can be efficiently asserted on the mRNA and protein level in coordinated fashion.
Collapse
Affiliation(s)
- Elizaveta M Solovyeva
- Research Informatics, Biomedical Research at Novartis, Basel, Switzerland; V.L. Talrose Institute for Energy Problems of Chemical Physics, N.N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, Moscow, Russia.
| | - Stephan Utzinger
- Diseases of Aging and Regenerative Medicine, Biomedical Research at Novartis, Basel, Switzerland
| | | | - Joanna Mitchelmore
- Diseases of Aging and Regenerative Medicine, Biomedical Research at Novartis, Basel, Switzerland
| | - Erik Ahrné
- Discovery Sciences, Biomedical Research at Novartis, Basel, Switzerland
| | - Erwin Hermes
- Discovery Sciences, Biomedical Research at Novartis, Basel, Switzerland
| | - Tania Poetsch
- Discovery Sciences, Biomedical Research at Novartis, Basel, Switzerland
| | - Marie Ronco
- Diseases of Aging and Regenerative Medicine, Biomedical Research at Novartis, Basel, Switzerland
| | - Michael Bidinosti
- Diseases of Aging and Regenerative Medicine, Biomedical Research at Novartis, Basel, Switzerland
| | - Claudia Merkl
- Diseases of Aging and Regenerative Medicine, Biomedical Research at Novartis, Basel, Switzerland
| | - Fabrizio C Serluca
- Research Informatics, Biomedical Research at Novartis, Cambridge, Massachusetts, USA
| | - James Fessenden
- Neurodegenerative Diseases, Biomedical Research at Novartis, Cambridge, Massachusetts, USA
| | - Ulrike Naumann
- Discovery Sciences, Biomedical Research at Novartis, Basel, Switzerland
| | - Hans Voshol
- Discovery Sciences, Biomedical Research at Novartis, Basel, Switzerland
| | - Angelika S Meyer
- Diseases of Aging and Regenerative Medicine, Biomedical Research at Novartis, Basel, Switzerland
| | - Sebastian Hoersch
- Research Informatics, Biomedical Research at Novartis, Basel, Switzerland.
| |
Collapse
|
11
|
Humphrey J, Brophy E, Kosoy R, Zeng B, Coccia E, Mattei D, Ravi A, Efthymiou AG, Navarro E, Muller BZ, Snijders GJLJ, Allan A, Münch A, Kitata RB, Kleopoulos SP, Argyriou S, Shao Z, Francoeur N, Tsai CF, Gritsenko MA, Monroe ME, Paurus VL, Weitz KK, Shi T, Sebra R, Liu T, de Witte LD, Goate AM, Bennett DA, Haroutunian V, Hoffman GE, Fullard JF, Roussos P, Raj T. Long-read RNA-seq atlas of novel microglia isoforms elucidates disease-associated genetic regulation of splicing. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.12.01.23299073. [PMID: 38076956 PMCID: PMC10705658 DOI: 10.1101/2023.12.01.23299073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/21/2023]
Abstract
Microglia, the innate immune cells of the central nervous system, have been genetically implicated in multiple neurodegenerative diseases. We previously mapped the genetic regulation of gene expression and mRNA splicing in human microglia, identifying several loci where common genetic variants in microglia-specific regulatory elements explain disease risk loci identified by GWAS. However, identifying genetic effects on splicing has been challenging due to the use of short sequencing reads to identify causal isoforms. Here we present the isoform-centric microglia genomic atlas (isoMiGA) which leverages the power of long-read RNA-seq to identify 35,879 novel microglia isoforms. We show that the novel microglia isoforms are involved in stimulation response and brain region specificity. We then quantified the expression of both known and novel isoforms in a multi-ethnic meta-analysis of 555 human microglia short-read RNA-seq samples from 391 donors, the largest to date, and found associations with genetic risk loci in Alzheimer's disease and Parkinson's disease. We nominate several loci that may act through complex changes in isoform and splice site usage.
Collapse
Affiliation(s)
- Jack Humphrey
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Ronald M. Loeb Center for Alzheimer’s Disease, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Estelle and Daniel Maggin Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Erica Brophy
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Ronald M. Loeb Center for Alzheimer’s Disease, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Estelle and Daniel Maggin Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Roman Kosoy
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, USA
| | - Biao Zeng
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, USA
| | - Elena Coccia
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Ronald M. Loeb Center for Alzheimer’s Disease, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Estelle and Daniel Maggin Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Daniele Mattei
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Ronald M. Loeb Center for Alzheimer’s Disease, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Estelle and Daniel Maggin Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Ashvin Ravi
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Ronald M. Loeb Center for Alzheimer’s Disease, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Estelle and Daniel Maggin Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Anastasia G. Efthymiou
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Ronald M. Loeb Center for Alzheimer’s Disease, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Elisa Navarro
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Ronald M. Loeb Center for Alzheimer’s Disease, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Estelle and Daniel Maggin Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Biochemistry and Molecular Biology, Faculty of Medicine (Universidad Complutense de Madrid), Madrid, Spain
- Centro de Investigación Biomédica en Red sobre Enfermedades Neurodegenerativas (CIBERNED), Madrid, Spain
- Instituto Ramon y Cajal de Investigacion Sanitaria (IRYCIS), Madrid, Spain
| | - Benjamin Z. Muller
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Ronald M. Loeb Center for Alzheimer’s Disease, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Estelle and Daniel Maggin Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Gijsje JLJ Snijders
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Ronald M. Loeb Center for Alzheimer’s Disease, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, USA
| | - Amanda Allan
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Ronald M. Loeb Center for Alzheimer’s Disease, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Estelle and Daniel Maggin Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Alexandra Münch
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Ronald M. Loeb Center for Alzheimer’s Disease, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Reta Birhanu Kitata
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Steven P Kleopoulos
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, USA
| | - Stathis Argyriou
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, USA
| | - Zhiping Shao
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, USA
| | - Nancy Francoeur
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Chia-Feng Tsai
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Marina A Gritsenko
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Matthew E Monroe
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Vanessa L Paurus
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Karl K Weitz
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Tujin Shi
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Robert Sebra
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Black Family Stem Cell Institute, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
- Global Health and Emerging Pathogens Institute, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| | - Tao Liu
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Lot D. de Witte
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Ronald M. Loeb Center for Alzheimer’s Disease, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, USA
| | - Alison M. Goate
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Ronald M. Loeb Center for Alzheimer’s Disease, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Estelle and Daniel Maggin Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - David A. Bennett
- Rush Alzheimer’s Disease Center, Rush University Medical Center, Chicago, Illinois, USA
| | - Vahram Haroutunian
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Mental Illness Research Education, and Clinical Center (VISN 2 South), James J. Peters VA Medical Center, Bronx, NY, USA
| | - Gabriel E. Hoffman
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, USA
| | - John F. Fullard
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, USA
| | - Panos Roussos
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, USA
- Mental Illness Research Education, and Clinical Center (VISN 2 South), James J. Peters VA Medical Center, Bronx, NY, USA
| | - Towfique Raj
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Ronald M. Loeb Center for Alzheimer’s Disease, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Icahn Genomics Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Estelle and Daniel Maggin Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| |
Collapse
|
12
|
Sinitcyn P, Richards AL, Weatheritt RJ, Brademan DR, Marx H, Shishkova E, Meyer JG, Hebert AS, Westphall MS, Blencowe BJ, Cox J, Coon JJ. Global detection of human variants and isoforms by deep proteome sequencing. Nat Biotechnol 2023; 41:1776-1786. [PMID: 36959352 PMCID: PMC10713452 DOI: 10.1038/s41587-023-01714-x] [Citation(s) in RCA: 43] [Impact Index Per Article: 43.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2022] [Accepted: 02/15/2023] [Indexed: 03/25/2023]
Abstract
An average shotgun proteomics experiment detects approximately 10,000 human proteins from a single sample. However, individual proteins are typically identified by peptide sequences representing a small fraction of their total amino acids. Hence, an average shotgun experiment fails to distinguish different protein variants and isoforms. Deeper proteome sequencing is therefore required for the global discovery of protein isoforms. Using six different human cell lines, six proteases, deep fractionation and three tandem mass spectrometry fragmentation methods, we identify a million unique peptides from 17,717 protein groups, with a median sequence coverage of approximately 80%. Direct comparison with RNA expression data provides evidence for the translation of most nonsynonymous variants. We have also hypothesized that undetected variants likely arise from mutation-induced protein instability. We further observe comparable detection rates for exon-exon junction peptides representing constitutive and alternative splicing events. Our dataset represents a resource for proteoform discovery and provides direct evidence that most frame-preserving alternatively spliced isoforms are translated.
Collapse
Affiliation(s)
- Pavel Sinitcyn
- Computational Systems Biochemistry Research Group, Max Planck Institute of Biochemistry, Martinsried, Germany
- Morgridge Institute for Research, Madison, WI, USA
| | - Alicia L Richards
- National Center for Quantitative Biology of Complex Systems, University of Wisconsin-Madison, Madison, WI, USA
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Robert J Weatheritt
- EMBL Australia and Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Dain R Brademan
- Morgridge Institute for Research, Madison, WI, USA
- Department of Biomolecular Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Harald Marx
- National Center for Quantitative Biology of Complex Systems, University of Wisconsin-Madison, Madison, WI, USA
- Department of Biomolecular Chemistry, University of Wisconsin-Madison, Madison, WI, USA
- Department of Microbiology and Ecosystem Science, University of Vienna, Vienna, Austria
| | - Evgenia Shishkova
- National Center for Quantitative Biology of Complex Systems, University of Wisconsin-Madison, Madison, WI, USA
- Department of Biomolecular Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Jesse G Meyer
- National Center for Quantitative Biology of Complex Systems, University of Wisconsin-Madison, Madison, WI, USA
- Department of Biomolecular Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Alexander S Hebert
- National Center for Quantitative Biology of Complex Systems, University of Wisconsin-Madison, Madison, WI, USA
| | - Michael S Westphall
- National Center for Quantitative Biology of Complex Systems, University of Wisconsin-Madison, Madison, WI, USA
- Department of Biomolecular Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Benjamin J Blencowe
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Jürgen Cox
- Computational Systems Biochemistry Research Group, Max Planck Institute of Biochemistry, Martinsried, Germany.
| | - Joshua J Coon
- Morgridge Institute for Research, Madison, WI, USA.
- National Center for Quantitative Biology of Complex Systems, University of Wisconsin-Madison, Madison, WI, USA.
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA.
- Department of Biomolecular Chemistry, University of Wisconsin-Madison, Madison, WI, USA.
| |
Collapse
|
13
|
Dam SH, Olsen LR, Vitting-Seerup K. Expression and splicing mediate distinct biological signals. BMC Biol 2023; 21:220. [PMID: 37858135 PMCID: PMC10588054 DOI: 10.1186/s12915-023-01724-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Accepted: 10/04/2023] [Indexed: 10/21/2023] Open
Abstract
BACKGROUND Through alternative splicing, most human genes produce multiple isoforms in a cell-, tissue-, and disease-specific manner. Numerous studies show that alternative splicing is essential for development, diseases, and their treatments. Despite these important examples, the extent and biological relevance of splicing are currently unknown. RESULTS To solve this problem, we developed pairedGSEA and used it to profile transcriptional changes in 100 representative RNA-seq datasets. Our systematic analysis demonstrates that changes in splicing, on average, contribute to 48.1% of the biological signal in expression analyses. Gene-set enrichment analysis furthermore indicates that expression and splicing both convey shared and distinct biological signals. CONCLUSIONS These findings establish alternative splicing as a major regulator of the human condition and suggest that most contemporary RNA-seq studies likely miss out on critical biological insights. We anticipate our results will contribute to the transition from a gene-centric to an isoform-centric research paradigm.
Collapse
Affiliation(s)
- Søren Helweg Dam
- Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Lars Rønn Olsen
- Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Kristoffer Vitting-Seerup
- Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, Kgs. Lyngby, Denmark.
| |
Collapse
|
14
|
Jin L, Wang F, Wang X, Harvey BP, Bi Y, Hu C, Cui B, Darcy AT, Maull JW, Phillips BR, Kim Y, Jenkins GJ, Sornasse TR, Tian Y. Identification of Plasma Biomarkers from Rheumatoid Arthritis Patients Using an Optimized Sequential Window Acquisition of All THeoretical Mass Spectra (SWATH) Proteomics Workflow. Proteomes 2023; 11:32. [PMID: 37873874 PMCID: PMC10594463 DOI: 10.3390/proteomes11040032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Revised: 09/28/2023] [Accepted: 10/02/2023] [Indexed: 10/25/2023] Open
Abstract
Rheumatoid arthritis (RA) is a systemic autoimmune and inflammatory disease. Plasma biomarkers are critical for understanding disease mechanisms, treatment effects, and diagnosis. Mass spectrometry-based proteomics is a powerful tool for unbiased biomarker discovery. However, plasma proteomics is significantly hampered by signal interference from high-abundance proteins, low overall protein coverage, and high levels of missing data from data-dependent acquisition (DDA). To achieve quantitative proteomics analysis for plasma samples with a balance of throughput, performance, and cost, we developed a workflow incorporating plate-based high abundance protein depletion and sample preparation, comprehensive peptide spectral library building, and data-independent acquisition (DIA) SWATH mass spectrometry-based methodology. In this study, we analyzed plasma samples from both RA patients and healthy donors. The results showed that the new workflow performance exceeded that of the current state-of-the-art depletion-based plasma proteomic platforms in terms of both data quality and proteome coverage. Proteins from biological processes related to the activation of systemic inflammation, suppression of platelet function, and loss of muscle mass were enriched and differentially expressed in RA. Some plasma proteins, particularly acute-phase reactant proteins, showed great power to distinguish between RA patients and healthy donors. Moreover, protein isoforms in the plasma were also analyzed, providing even deeper proteome coverage. This workflow can serve as a basis for further application in discovering plasma biomarkers of other diseases.
Collapse
Affiliation(s)
- Liang Jin
- Research & Development, AbbVie, North Chicago, IL 60064, USA; (L.J.); (B.P.H.); (B.C.); (A.T.D.); (J.W.M.); (T.R.S.)
| | - Fei Wang
- Research & Development, AbbVie, North Chicago, IL 60064, USA; (L.J.); (B.P.H.); (B.C.); (A.T.D.); (J.W.M.); (T.R.S.)
| | - Xue Wang
- Research & Development, AbbVie, North Chicago, IL 60064, USA; (L.J.); (B.P.H.); (B.C.); (A.T.D.); (J.W.M.); (T.R.S.)
| | - Bohdan P. Harvey
- Research & Development, AbbVie, North Chicago, IL 60064, USA; (L.J.); (B.P.H.); (B.C.); (A.T.D.); (J.W.M.); (T.R.S.)
| | - Yingtao Bi
- Research & Development, AbbVie, North Chicago, IL 60064, USA; (L.J.); (B.P.H.); (B.C.); (A.T.D.); (J.W.M.); (T.R.S.)
| | - Chenqi Hu
- DMPK, Takeda Development Center Americas Inc., Cambridge, MA 02142, USA; (C.H.)
| | - Baoliang Cui
- Research & Development, AbbVie, North Chicago, IL 60064, USA; (L.J.); (B.P.H.); (B.C.); (A.T.D.); (J.W.M.); (T.R.S.)
| | - Anhdao T. Darcy
- Research & Development, AbbVie, North Chicago, IL 60064, USA; (L.J.); (B.P.H.); (B.C.); (A.T.D.); (J.W.M.); (T.R.S.)
| | - John W. Maull
- Research & Development, AbbVie, North Chicago, IL 60064, USA; (L.J.); (B.P.H.); (B.C.); (A.T.D.); (J.W.M.); (T.R.S.)
| | - Ben R. Phillips
- Research & Development, AbbVie, North Chicago, IL 60064, USA; (L.J.); (B.P.H.); (B.C.); (A.T.D.); (J.W.M.); (T.R.S.)
| | - Youngjae Kim
- DMPK, Takeda Development Center Americas Inc., Cambridge, MA 02142, USA; (C.H.)
| | - Gary J. Jenkins
- Research & Development, AbbVie, North Chicago, IL 60064, USA; (L.J.); (B.P.H.); (B.C.); (A.T.D.); (J.W.M.); (T.R.S.)
| | - Thierry R. Sornasse
- Research & Development, AbbVie, North Chicago, IL 60064, USA; (L.J.); (B.P.H.); (B.C.); (A.T.D.); (J.W.M.); (T.R.S.)
| | - Yu Tian
- Research & Development, AbbVie, North Chicago, IL 60064, USA; (L.J.); (B.P.H.); (B.C.); (A.T.D.); (J.W.M.); (T.R.S.)
| |
Collapse
|
15
|
Pandi B, Brenman S, Black A, Ng DCM, Lau E, Lam MPY. Tissue Usage Preference and Intrinsically Disordered Region Remodeling of Alternative Splicing Derived Proteoforms in the Heart. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.08.561375. [PMID: 37873130 PMCID: PMC10592692 DOI: 10.1101/2023.10.08.561375] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
A computational analysis of mass spectrometry data was performed to uncover alternative splicing derived protein variants across chambers of the human heart. Evidence for 216 non-canonical isoforms was apparent in the atrium and the ventricle, including 52 isoforms not documented on SwissProt and recovered using an RNA sequencing derived database. Among non-canonical isoforms, 29 show signs of regulation based on statistically significant preferences in tissue usage, including a ventricular enriched protein isoform of tensin-1 (TNS1) and an atrium-enriched PDZ and LIM Domain 3 (PDLIM3) isoform 2 (PDLIM3-2/ALP-H). Examined variant regions that differ between alternative and canonical isoforms are highly enriched in intrinsically disordered regions, and over two-thirds of such regions are predicted to function in protein binding and/or RNA binding. The analysis here lends further credence to the notion that alternative splicing diversifies the proteome by rewiring intrinsically disordered regions, which are increasingly recognized to play important roles in the generation of biological function from protein sequences.
Collapse
Affiliation(s)
- Boomathi Pandi
- Department of Medicine/Division of Cardiology, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Stella Brenman
- Department of Medicine/Division of Cardiology, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Alexander Black
- Department of Medicine/Division of Cardiology, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Dominic C. M. Ng
- Department of Medicine/Division of Cardiology, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Edward Lau
- Department of Medicine/Division of Cardiology, University of Colorado School of Medicine, Aurora, CO 80045, USA
- Consortium for Fibrosis Research and Translation (CFReT), University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Maggie P. Y. Lam
- Department of Medicine/Division of Cardiology, University of Colorado School of Medicine, Aurora, CO 80045, USA
- Department of Biochemistry & Molecular Genetics, University of Colorado School of Medicine, Aurora, CO 80045, USA
- Consortium for Fibrosis Research and Translation (CFReT), University of Colorado School of Medicine, Aurora, CO 80045, USA
| |
Collapse
|
16
|
Puliasis SS, Lewandowska D, Hemsley PA, Zhang R. ProtView: A Versatile Tool for In Silico Protease Evaluation and Selection in a Proteomic and Proteogenomic Context. J Proteome Res 2023. [PMID: 37248202 DOI: 10.1021/acs.jproteome.3c00135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Many tools have been created to generate in silico proteome digests with different protease enzymes and provide useful information for selecting optimal digest schemes for specific needs. This can save on time and resources and generate insights on the observable proteome. However, there remains a need for a tool that evaluates digest schemes beyond protein and amino acid coverages in the proteomic domain. Here, we present ProtView, a versatile in silico protease combination digest evaluation workflow that maps in silico-digested peptides to both protein and genome references, so that the potential observable portions of the proteome, transcriptome, and genome can be identified. The proteomic identification and quantification of evidence for transcriptional, co-transcriptional, post-transcriptional, translational, and post-translational regulation can all be examined in silico with ProtView prior to an experiment. Benchmarking against biological data comparing multiple proteases shows that ProtView can correctly estimate performances among the digest schemes. ProtView provides this information in a way that is easy to interpret, allowing for digest schemes to be evaluated before carrying out an experiment, in context that can optimize both proteomic and proteogenomic experiments. ProtView is available at https://github.com/SSPuliasis/ProtView.
Collapse
Affiliation(s)
- Sophia S Puliasis
- Division of Plant Sciences, School of Life Sciences, University of Dundee, Dow Street, Dundee DD1 5EH, Scotland, UK
- Information and Computational Sciences, The James Hutton Institute, Invergowrie, Dundee DD2 5DA, Scotland, UK
| | - Dominika Lewandowska
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee DD2 5DA, Scotland, UK
| | - Piers A Hemsley
- Division of Plant Sciences, School of Life Sciences, University of Dundee, Dow Street, Dundee DD1 5EH, Scotland, UK
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee DD2 5DA, Scotland, UK
| | - Runxuan Zhang
- Information and Computational Sciences, The James Hutton Institute, Invergowrie, Dundee DD2 5DA, Scotland, UK
| |
Collapse
|
17
|
Oreper D, Klaeger S, Jhunjhunwala S, Delamarre L. The peptide woods are lovely, dark and deep: Hunting for novel cancer antigens. Semin Immunol 2023; 67:101758. [PMID: 37027981 DOI: 10.1016/j.smim.2023.101758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/31/2022] [Revised: 03/22/2023] [Accepted: 03/22/2023] [Indexed: 04/08/2023]
Abstract
Harnessing the patient's immune system to control a tumor is a proven avenue for cancer therapy. T cell therapies as well as therapeutic vaccines, which target specific antigens of interest, are being explored as treatments in conjunction with immune checkpoint blockade. For these therapies, selecting the best suited antigens is crucial. Most of the focus has thus far been on neoantigens that arise from tumor-specific somatic mutations. Although there is clear evidence that T-cell responses against mutated neoantigens are protective, the large majority of these mutations are not immunogenic. In addition, most somatic mutations are unique to each individual patient and their targeting requires the development of individualized approaches. Therefore, novel antigen types are needed to broaden the scope of such treatments. We review high throughput approaches for discovering novel tumor antigens and some of the key challenges associated with their detection, and discuss considerations when selecting tumor antigens to target in the clinic.
Collapse
Affiliation(s)
- Daniel Oreper
- Genentech, 1 DNA way, South San Francisco, 94080 CA, USA.
| | - Susan Klaeger
- Genentech, 1 DNA way, South San Francisco, 94080 CA, USA.
| | | | | |
Collapse
|
18
|
Yin L, Zander M, Huang SSC, Xie M, Song L, Saldierna Guzmán JP, Hann E, Shanbhag BK, Ng S, Jain S, Janssen BJ, Clark NM, Walley JW, Beddoe T, Bar-Joseph Z, Lewsey MG, Ecker JR. Transcription Factor Dynamics in Cross-Regulation of Plant Hormone Signaling Pathways. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.07.531630. [PMID: 36945593 PMCID: PMC10028877 DOI: 10.1101/2023.03.07.531630] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/11/2023]
Abstract
Cross-regulation between hormone signaling pathways is indispensable for plant growth and development. However, the molecular mechanisms by which multiple hormones interact and co-ordinate activity need to be understood. Here, we generated a cross-regulation network explaining how hormone signals are integrated from multiple pathways in etiolated Arabidopsis (Arabidopsis thaliana) seedlings. To do so we comprehensively characterized transcription factor activity during plant hormone responses and reconstructed dynamic transcriptional regulatory models for six hormones; abscisic acid, brassinosteroid, ethylene, jasmonic acid, salicylic acid and strigolactone/karrikin. These models incorporated target data for hundreds of transcription factors and thousands of protein-protein interactions. Each hormone recruited different combinations of transcription factors, a subset of which were shared between hormones. Hub target genes existed within hormone transcriptional networks, exhibiting transcription factor activity themselves. In addition, a group of MITOGEN-ACTIVATED PROTEIN KINASES (MPKs) were identified as potential key points of cross-regulation between multiple hormones. Accordingly, the loss of function of one of these (MPK6) disrupted the global proteome, phosphoproteome and transcriptome during hormone responses. Lastly, we determined that all hormones drive substantial alternative splicing that has distinct effects on the transcriptome compared with differential gene expression, acting in early hormone responses. These results provide a comprehensive understanding of the common features of plant transcriptional regulatory pathways and how cross-regulation between hormones acts upon gene expression.
Collapse
Affiliation(s)
- Lingling Yin
- La Trobe Institute for Agriculture and Food, Department of Animal, Plant and Soil Sciences, School of Agriculture Biomedicine and Environment, AgriBio Building, La Trobe University, Melbourne, VIC 3086, Australia
- Australian Research Council Industrial Transformation Research Hub for Medicinal Agriculture, AgriBio Building, La Trobe University, Bundoora, VIC 3086, Australia
| | - Mark Zander
- Plant Biology Laboratory, Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Genomic Analysis Laboratory, Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Howard Hughes Medical Institute, Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Present address: Waksman Institute of Microbiology, Department of Plant Biology, Rutgers, The State University of New Jersey, NJ 08854, USA
| | - Shao-shan Carol Huang
- Plant Biology Laboratory, Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Genomic Analysis Laboratory, Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Howard Hughes Medical Institute, Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Present address: Department of Biology, New York University, New York, NY 10003, USA
| | - Mingtang Xie
- Plant Biology Laboratory, Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Genomic Analysis Laboratory, Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Howard Hughes Medical Institute, Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Present address: Cibus, San Diego, CA 92121, USA
| | - Liang Song
- Plant Biology Laboratory, Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Genomic Analysis Laboratory, Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Howard Hughes Medical Institute, Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Present address: Department of Botany, The University of British Columbia, Vancouver, British Columbia, Canada
| | - J. Paola Saldierna Guzmán
- Plant Biology Laboratory, Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Howard Hughes Medical Institute, Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Present address: Department of Soil and Crop Sciences, Colorado State University, Fort Collins, Colorado, USA
| | - Elizabeth Hann
- Plant Biology Laboratory, Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Present address: Department of Chemical and Environmental Engineering, Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA
| | - Bhuvana K. Shanbhag
- La Trobe Institute for Agriculture and Food, Department of Animal, Plant and Soil Sciences, School of Agriculture Biomedicine and Environment, AgriBio Building, La Trobe University, Melbourne, VIC 3086, Australia
- Australian Research Council Industrial Transformation Research Hub for Medicinal Agriculture, AgriBio Building, La Trobe University, Bundoora, VIC 3086, Australia
| | - Sophia Ng
- La Trobe Institute for Agriculture and Food, Department of Animal, Plant and Soil Sciences, School of Agriculture Biomedicine and Environment, AgriBio Building, La Trobe University, Melbourne, VIC 3086, Australia
- Australian Research Council Industrial Transformation Research Hub for Medicinal Agriculture, AgriBio Building, La Trobe University, Bundoora, VIC 3086, Australia
| | - Siddhartha Jain
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Bart J. Janssen
- The New Zealand Institute for Plant & Food Research Limited, Auckland, New Zealand
| | - Natalie M. Clark
- Proteomics Platform, Broad Institute of MIT and Harvard, Cambridge, MA, 02142 USA
- Department of Plant Pathology, Entomology, and Microbiology, Iowa State University, Ames, IA, 50011 USA
| | - Justin W. Walley
- Department of Plant Pathology, Entomology, and Microbiology, Iowa State University, Ames, IA, 50011 USA
| | - Travis Beddoe
- La Trobe Institute for Agriculture and Food, Department of Animal, Plant and Soil Sciences, School of Agriculture Biomedicine and Environment, AgriBio Building, La Trobe University, Melbourne, VIC 3086, Australia
- Australian Research Council Industrial Transformation Research Hub for Medicinal Agriculture, AgriBio Building, La Trobe University, Bundoora, VIC 3086, Australia
| | - Ziv Bar-Joseph
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Mathew G. Lewsey
- La Trobe Institute for Agriculture and Food, Department of Animal, Plant and Soil Sciences, School of Agriculture Biomedicine and Environment, AgriBio Building, La Trobe University, Melbourne, VIC 3086, Australia
- Australian Research Council Industrial Transformation Research Hub for Medicinal Agriculture, AgriBio Building, La Trobe University, Bundoora, VIC 3086, Australia
- Australian Research Council Centre of Excellence in Plants For Space, AgriBio Building, La Trobe University, Bundoora, VIC 3086, Australia
| | - Joseph R. Ecker
- Plant Biology Laboratory, Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Genomic Analysis Laboratory, Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Howard Hughes Medical Institute, Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| |
Collapse
|
19
|
Manuel JM, Guilloy N, Khatir I, Roucou X, Laurent B. Re-evaluating the impact of alternative RNA splicing on proteomic diversity. Front Genet 2023; 14:1089053. [PMID: 36845399 PMCID: PMC9947481 DOI: 10.3389/fgene.2023.1089053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Accepted: 01/23/2023] [Indexed: 02/11/2023] Open
Abstract
Alternative splicing (AS) constitutes a mechanism by which protein-coding genes and long non-coding RNA (lncRNA) genes produce more than a single mature transcript. From plants to humans, AS is a powerful process that increases transcriptome complexity. Importantly, splice variants produced from AS can potentially encode for distinct protein isoforms which can lose or gain specific domains and, hence, differ in their functional properties. Advances in proteomics have shown that the proteome is indeed diverse due to the presence of numerous protein isoforms. For the past decades, with the help of advanced high-throughput technologies, numerous alternatively spliced transcripts have been identified. However, the low detection rate of protein isoforms in proteomic studies raised debatable questions on whether AS contributes to proteomic diversity and on how many AS events are really functional. We propose here to assess and discuss the impact of AS on proteomic complexity in the light of the technological progress, updated genome annotation, and current scientific knowledge.
Collapse
Affiliation(s)
- Jeru Manoj Manuel
- Research Center on Aging, Centre Intégré Universitaire de Santé et Services Sociaux de l’Estrie-Centre Hospitalier Universitaire de Sherbrooke, Sherbrooke, QC, Canada,Department of Biochemistry and Functional Genomics, Faculty of Medicine and Health Sciences, Université de Sherbrooke, Sherbrooke, QC, Canada
| | - Noé Guilloy
- Department of Biochemistry and Functional Genomics, Faculty of Medicine and Health Sciences, Université de Sherbrooke, Sherbrooke, QC, Canada
| | - Inès Khatir
- Research Center on Aging, Centre Intégré Universitaire de Santé et Services Sociaux de l’Estrie-Centre Hospitalier Universitaire de Sherbrooke, Sherbrooke, QC, Canada,Department of Biochemistry and Functional Genomics, Faculty of Medicine and Health Sciences, Université de Sherbrooke, Sherbrooke, QC, Canada
| | - Xavier Roucou
- Department of Biochemistry and Functional Genomics, Faculty of Medicine and Health Sciences, Université de Sherbrooke, Sherbrooke, QC, Canada,Centre de Recherche du Centre Hospitalier Universitaire de Sherbrooke (CRCHUS), Sherbrooke, QC, Canada,Quebec Network for Research on Protein Function Structure and Engineering, PROTEO, Québec, QC, Canada
| | - Benoit Laurent
- Research Center on Aging, Centre Intégré Universitaire de Santé et Services Sociaux de l’Estrie-Centre Hospitalier Universitaire de Sherbrooke, Sherbrooke, QC, Canada,Department of Biochemistry and Functional Genomics, Faculty of Medicine and Health Sciences, Université de Sherbrooke, Sherbrooke, QC, Canada,*Correspondence: Benoit Laurent,
| |
Collapse
|
20
|
Identification of Alternative Splicing in Proteomes of Human Melanoma Cell Lines without RNA Sequencing Data. Int J Mol Sci 2023; 24:ijms24032466. [PMID: 36768787 PMCID: PMC9916885 DOI: 10.3390/ijms24032466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Revised: 01/06/2023] [Accepted: 01/13/2023] [Indexed: 01/31/2023] Open
Abstract
Alternative splicing is one of the main regulation pathways in living cells beyond simple changes in the level of protein expression. Most of the approaches proposed in proteomics for the identification of specific splicing isoforms require a preliminary deep transcriptomic analysis of the sample under study, which is not always available, especially in the case of the re-analysis of previously acquired data. Herein, we developed new algorithms for the identification and validation of protein splice isoforms in proteomic data in the absence of RNA sequencing of the samples under study. The bioinformatic approaches were tested on the results of proteome analysis of human melanoma cell lines, obtained earlier by high-resolution liquid chromatography and mass spectrometry (LC-MS). A search for alternative splicing events for each of the cell lines studied was performed against the database generated from all known transcripts (RefSeq) and the one composed of peptide sequences, which included all biologically possible combinations of exons. The identifications were filtered using the prediction of both retention times and relative intensities of fragment ions in the corresponding mass spectra. The fragmentation mass spectra corresponding to the discovered alternative splicing events were additionally examined for artifacts. Selected splicing events were further validated at the mRNA level by quantitative PCR.
Collapse
|
21
|
Shaw TI, Zhao B, Li Y, Wang H, Wang L, Manley B, Stewart PA, Karolak A. Multi-omics approach to identifying isoform variants as therapeutic targets in cancer patients. Front Oncol 2022; 12:1051487. [PMID: 36505834 PMCID: PMC9730332 DOI: 10.3389/fonc.2022.1051487] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Accepted: 11/07/2022] [Indexed: 11/25/2022] Open
Abstract
Cancer-specific alternatively spliced events (ASE) play a role in cancer pathogenesis and can be targeted by immunotherapy, oligonucleotide therapy, and small molecule inhibition. However, identifying actionable ASE targets remains challenging due to the uncertainty of its protein product, structure impact, and proteoform (protein isoform) function. Here we argue that an integrated multi-omics profiling strategy can overcome these challenges, allowing us to mine this untapped source of targets for therapeutic development. In this review, we will provide an overview of current multi-omics strategies in characterizing ASEs by utilizing the transcriptome, proteome, and state-of-art algorithms for protein structure prediction. We will discuss limitations and knowledge gaps associated with each technology and informatics analytics. Finally, we will discuss future directions that will enable the full integration of multi-omics data for ASE target discovery.
Collapse
Affiliation(s)
- Timothy I. Shaw
- Department of Biostatistics and Bioinformatics, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, United States,*Correspondence: Timothy I. Shaw,
| | - Bi Zhao
- Department of Machine Learning, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, United States
| | - Yuxin Li
- Center for Proteomics and Metabolomics, St. Jude Children’s Research Hospital, Memphis, TN, United States
| | - Hong Wang
- Center for Proteomics and Metabolomics, St. Jude Children’s Research Hospital, Memphis, TN, United States
| | - Liang Wang
- Department of Tumor Biology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, United States
| | - Brandon Manley
- Department of Genitourinary Oncology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, United States
| | - Paul A. Stewart
- Department of Biostatistics and Bioinformatics, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, United States
| | - Aleksandra Karolak
- Department of Machine Learning, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, United States
| |
Collapse
|
22
|
Martinez-Gomez L, Cerdán-Vélez D, Abascal F, Tress ML. Origins and Evolution of Human Tandem Duplicated Exon Substitution Events. Genome Biol Evol 2022; 14:6809199. [PMID: 36346145 PMCID: PMC9741552 DOI: 10.1093/gbe/evac162] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Revised: 10/25/2022] [Accepted: 10/29/2022] [Indexed: 11/10/2022] Open
Abstract
The mutually exclusive splicing of tandem duplicated exons produces protein isoforms that are identical save for a homologous region that allows for the fine tuning of protein function. Tandem duplicated exon substitution events are rare, yet highly important alternative splicing events. Most events are ancient, their isoforms are highly expressed, and they have significantly more pathogenic mutations than other splice events. Here, we analyzed the physicochemical properties and functional roles of the homologous polypeptide regions produced by the 236 tandem duplicated exon substitutions annotated in the human gene set. We find that the most important structural and functional residues in these homologous regions are maintained, and that most changes are conservative rather than drastic. Three quarters of the isoforms produced from tandem duplicated exon substitution events are tissue-specific, particularly in nervous and cardiac tissues, and tandem duplicated exon substitution events are enriched in functional terms related to structures in the brain and skeletal muscle. We find considerable evidence for the convergent evolution of tandem duplicated exon substitution events in vertebrates, arthropods, and nematodes. Twelve human gene families have orthologues with tandem duplicated exon substitution events in both Drosophila melanogaster and Caenorhabditis elegans. Six of these gene families are ion transporters, suggesting that tandem exon duplication in genes that control the flow of ions into the cell has an adaptive benefit. The ancient origins, the strong indications of tissue-specific functions, and the evidence of convergent evolution suggest that these events may have played important roles in the evolution of animal tissues and organs.
Collapse
Affiliation(s)
- Laura Martinez-Gomez
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), C. Melchor Fernandez Almagro, 3, 28029 Madrid, Spain
| | - Daniel Cerdán-Vélez
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), C. Melchor Fernandez Almagro, 3, 28029 Madrid, Spain
| | - Federico Abascal
- Somatic Evolution Group, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SA, United Kingdom
| | | |
Collapse
|
23
|
Pozo F, Rodriguez JM, Martínez Gómez L, Vázquez J, Tress ML. APPRIS principal isoforms and MANE Select transcripts define reference splice variants. Bioinformatics 2022; 38:ii89-ii94. [PMID: 36124785 PMCID: PMC9486585 DOI: 10.1093/bioinformatics/btac473] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
MOTIVATION Selecting the splice variant that best represents a coding gene is a crucial first step in many experimental analyses, and vital for mapping clinically relevant variants. This study compares the longest isoforms, MANE Select transcripts, APPRIS principal isoforms, and expression data, and aims to determine which method is best for selecting biological important reference splice variants for large-scale analyses. RESULTS Proteomics analyses and human genetic variation data suggest that most coding genes have a single main protein isoform. We show that APPRIS principal isoforms and MANE Select transcripts best describe these main cellular isoforms, and find that using the longest splice variant as the representative is a poor strategy. Exons unique to the longest splice isoforms are not under selective pressure, and so are unlikely to be functionally relevant. Expression data are also a poor means of selecting the main splice variant. APPRIS principal and MANE Select exons are under purifying selection, while exons specific to alternative transcripts are not. There are MANE and APPRIS representatives for almost 95% of genes, and where they agree they are particularly effective, coinciding with the main proteomics isoform for over 98.2% of genes. AVAILABILITY AND IMPLEMENTATION APPRIS principal isoforms for human, mouse and other model species can be downloaded from the APPRIS database (https://appris.bioinfo.cnio.es), GENCODE genes (https://www.gencodegenes.org/) and the Ensembl website (https://www.ensembl.org). MANE Select transcripts for the human reference set are available from the Ensembl, GENCODE and RefSeq databases (https://www.ncbi.nlm.nih.gov/refseq/). Lists of splice variants where MANE and APPRIS coincide are available from the APPRIS database. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Fernando Pozo
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), 28029 Madrid, Spain
| | - José Manuel Rodriguez
- Cardiovascular Proteomics Laboratory, Centro Nacional de Investigaciones Cardiovasculares Carlos III (CNIC), 28029 Madrid, Spain
| | - Laura Martínez Gómez
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), 28029 Madrid, Spain
| | - Jesús Vázquez
- Cardiovascular Proteomics Laboratory, Centro Nacional de Investigaciones Cardiovasculares Carlos III (CNIC), 28029 Madrid, Spain,CIBER de Investigaciones Cardiovasculares (CIBERCV), 28029 Madrid, Spain
| | | |
Collapse
|
24
|
Han Y, Wennersten SA, Wright JM, Ludwig RW, Lau E, Lam MPY. Proteogenomics reveals sex-biased aging genes and coordinated splicing in cardiac aging. Am J Physiol Heart Circ Physiol 2022; 323:H538-H558. [PMID: 35930447 PMCID: PMC9448281 DOI: 10.1152/ajpheart.00244.2022] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Revised: 07/20/2022] [Accepted: 07/31/2022] [Indexed: 01/24/2023]
Abstract
The risks of heart diseases are significantly modulated by age and sex, but how these factors influence baseline cardiac gene expression remains incompletely understood. Here, we used RNA sequencing and mass spectrometry to compare gene expression in female and male young adult (4 mo) and early aging (20 mo) mouse hearts, identifying thousands of age- and sex-dependent gene expression signatures. Sexually dimorphic cardiac genes are broadly distributed, functioning in mitochondrial metabolism, translation, and other processes. In parallel, we found over 800 genes with differential aging response between male and female, including genes in cAMP and PKA signaling. Analysis of the sex-adjusted aging cardiac transcriptome revealed a widespread remodeling of exon usage patterns that is largely independent from differential gene expression, concomitant with upstream changes in RNA-binding protein and splice factor transcripts. To evaluate the impact of the splicing events on cardiac proteoform composition, we applied an RNA-guided proteomics computational pipeline to analyze the mass spectrometry data and detected hundreds of putative splice variant proteins that have the potential to rewire the cardiac proteome. Taken together, the results here suggest that cardiac aging is associated with 1) widespread sex-biased aging genes and 2) a rewiring of RNA splicing programs, including sex- and age-dependent changes in exon usages and splice patterns that have the potential to influence cardiac protein structure and function. These changes contribute to the emerging evidence for considerable sexual dimorphism in the cardiac aging process that should be considered in the search for disease mechanisms.NEW & NOTEWORTHY Han et al. used proteogenomics to compare male and female mouse hearts at 4 and 20 mo. Sex-biased cardiac genes function in mitochondrial metabolism, translation, autophagy, and other processes. Hundreds of cardiac genes show sex-by-age interactions, that is, sex-biased aging genes. Cardiac aging is accompanied with a remodeling of exon usage in functionally coordinated genes, concomitant with differential expression of RNA-binding proteins and splice factors. These features represent an underinvestigated aspect of cardiac aging that may be relevant to the search for disease mechanisms.
Collapse
Grants
- R21-HL150456 HHS | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- R00-HL144829 HHS | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- R00 HL127302 NHLBI NIH HHS
- R03-OD032666 HHS | NIH | NIH Office of the Director (OD)
- R01 HL141278 NHLBI NIH HHS
- F32 HL149191 NHLBI NIH HHS
- F32-HL149191 HHS | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- R00-HL127302 HHS | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- R21 HL150456 NHLBI NIH HHS
- R03 OD032666 NIH HHS
- R00 HL144829 NHLBI NIH HHS
- R01-HL141278 HHS | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- University of Colorado
- University of Colorado School of Medicine, Anschutz Medical Campus
Collapse
Affiliation(s)
- Yu Han
- Department of Medicine, Anschutz Medical Campus, University of Colorado School of Medicine, Aurora, Colorado
| | - Sara A Wennersten
- Department of Medicine, Anschutz Medical Campus, University of Colorado School of Medicine, Aurora, Colorado
| | - Julianna M Wright
- Department of Medicine, Anschutz Medical Campus, University of Colorado School of Medicine, Aurora, Colorado
| | - R W Ludwig
- Department of Medicine, Anschutz Medical Campus, University of Colorado School of Medicine, Aurora, Colorado
| | | | - Maggie P Y Lam
- Department of Medicine, Anschutz Medical Campus, University of Colorado School of Medicine, Aurora, Colorado
- Department of Biochemistry and Molecular Genetics, Anschutz Medical Campus, University of Colorado School of Medicine, Aurora, Colorado
| |
Collapse
|
25
|
Fan X, Yang Y, Chen C, Wang Z. Pervasive translation of circular RNAs driven by short IRES-like elements. Nat Commun 2022; 13:3751. [PMID: 35768398 PMCID: PMC9242994 DOI: 10.1038/s41467-022-31327-y] [Citation(s) in RCA: 56] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2021] [Accepted: 06/09/2022] [Indexed: 12/30/2022] Open
Abstract
Some circular RNAs (circRNAs) were found to be translated through IRES-driven mechanism, however the scope and functions of circRNA translation are unclear because endogenous IRESs are rare. To determine the prevalence and mechanism of circRNA translation, we develop a cell-based system to screen random sequences and identify 97 overrepresented hexamers that drive cap-independent circRNA translation. These IRES-like short elements are significantly enriched in endogenous circRNAs and sufficient to drive circRNA translation. We further identify multiple trans-acting factors that bind these IRES-like elements to initiate translation. Using mass-spectrometry data, hundreds of circRNA-coded peptides are identified, most of which have low abundance due to rapid degradation. As judged by mass-spectrometry, 50% of translatable endogenous circRNAs undergo rolling circle translation, several of which are experimentally validated. Consistently, mutations of the IRES-like element in one circRNA reduce its translation. Collectively, our findings suggest a pervasive translation of circRNAs, providing profound implications in translation control. Unbiased screen of random sequences identified many short IRES-like elements to drive circular RNA translation and hundreds of rolling circle translation events, suggesting a pervasive cap-independent translation in human transcriptome.
Collapse
Affiliation(s)
- Xiaojuan Fan
- Bio-med Big Data Center, CAS Key Laboratory of Computational Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Nutrition and Health, Shanghai, China.,University of Chinese Academy of Sciences, Chinese Academy of Sciences, Beijing, China
| | - Yun Yang
- Bio-med Big Data Center, CAS Key Laboratory of Computational Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Nutrition and Health, Shanghai, China.,CirCode BioMedicine, Pudong, Shanghai, China
| | - Chuyun Chen
- Bio-med Big Data Center, CAS Key Laboratory of Computational Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Nutrition and Health, Shanghai, China
| | - Zefeng Wang
- Bio-med Big Data Center, CAS Key Laboratory of Computational Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Nutrition and Health, Shanghai, China. .,University of Chinese Academy of Sciences, Chinese Academy of Sciences, Beijing, China.
| |
Collapse
|
26
|
Chen S, Downs ML. Proteomic Analysis of Oil-Roasted Cashews Using a Customized Allergen-Focused Protein Database. J Proteome Res 2022; 21:1694-1706. [PMID: 35658452 DOI: 10.1021/acs.jproteome.2c00095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Cashews are one of the most prevalent causes of tree nut allergies. However, the cashew proteome is far from complete, which limits the quality of peptide identification in mass spectrometric analyses. In this study, bioinformatics tools were utilized to construct a customized cashew protein database and improve sequence quality for proteins of interest, based on a publicly available cashew genome database. As a result, two additional isoforms for cashew 2S albumins and five other isoforms for cashew 11S proteins were identified, along with several other potential allergens. Using the optimized protein database, the protein profiles of cashew nuts subjected to different oil-roasting conditions (139 and 166 °C for 2-10 min) were analyzed using discovery LC-MS/MS analysis. The results showed that the cashew 2S protein is most heat-stable, followed by 11S and 7S proteins, though protein isoforms might be affected differently. Preliminary target peptide selection indicated that out of the 29 potential targets, 18 peptides were derived from the newly developed database. In the evaluation of thermal processing effects on cashew proteins, several Maillard reaction adducts were also identified. The cashew protein database developed in this study allows for comprehensive analyses of cashew proteins and development of high-quality allergen detection methods.
Collapse
Affiliation(s)
- Shimin Chen
- Food Allergy Research and Resource Program, Department of Food Science and Technology, University of Nebraska-Lincoln, Lincoln, Nebraska 68588, United States
| | - Melanie L Downs
- Food Allergy Research and Resource Program, Department of Food Science and Technology, University of Nebraska-Lincoln, Lincoln, Nebraska 68588, United States
| |
Collapse
|
27
|
Levitsky LI, Kuznetsova KG, Kliuchnikova AA, Ilina IY, Goncharov AO, Lobas AA, Ivanov MV, Lazarev VN, Ziganshin RH, Gorshkov MV, Moshkovskii SA. Validating Amino Acid Variants in Proteogenomics Using Sequence Coverage by Multiple Reads. J Proteome Res 2022; 21:1438-1448. [PMID: 35536917 DOI: 10.1021/acs.jproteome.2c00033] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Mass spectrometry-based proteome analysis implies matching the mass spectra of proteolytic peptides to amino acid sequences predicted from genomic sequences. Reliability of peptide variant identification in proteogenomic studies is often lacking. We propose a way to interpret shotgun proteomics results, specifically in the data-dependent acquisition mode, as protein sequence coverage by multiple reads as it is done in nucleic acid sequencing for calling of single nucleotide variants. Multiple reads for each sequence position could be provided by overlapping distinct peptides, thus confirming the presence of certain amino acid residues in the overlapping stretch with a lower false discovery rate. Overlapping distinct peptides originate from miscleaved tryptic peptides in combination with their properly cleaved counterparts and from peptides generated by multiple proteases after the same specimen is subject to parallel digestion and analyzed separately. We illustrate this approach using publicly available multiprotease data sets and our own data generated for the HEK-293 cell line digests obtained using trypsin, LysC, and GluC proteases. Totally, up to 30% of the whole proteome was covered by tryptic peptides with up to 7% covered twofold and more. The proteogenomic analysis of the HEK-293 cell line revealed 36 single amino acid variants, seven of which were supported by multiple reads.
Collapse
Affiliation(s)
- Lev I Levitsky
- V.L. Talrose Institute for Energy Problems of Chemical Physics, N.N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 38, bld. 2, Leninsky Prospect, Moscow 119334, Russia
| | - Ksenia G Kuznetsova
- Federal Research and Clinical Center of Physical-Chemical Medicine, 1a, Malaya Pirogovskaya, Moscow 119435, Russia
| | - Anna A Kliuchnikova
- Federal Research and Clinical Center of Physical-Chemical Medicine, 1a, Malaya Pirogovskaya, Moscow 119435, Russia.,Pirogov Russian National Research Medical University, 1, Ostrovityanova, Moscow 117997, Russia
| | - Irina Y Ilina
- Federal Research and Clinical Center of Physical-Chemical Medicine, 1a, Malaya Pirogovskaya, Moscow 119435, Russia
| | - Anton O Goncharov
- Federal Research and Clinical Center of Physical-Chemical Medicine, 1a, Malaya Pirogovskaya, Moscow 119435, Russia.,Pirogov Russian National Research Medical University, 1, Ostrovityanova, Moscow 117997, Russia
| | - Anna A Lobas
- V.L. Talrose Institute for Energy Problems of Chemical Physics, N.N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 38, bld. 2, Leninsky Prospect, Moscow 119334, Russia
| | - Mark V Ivanov
- V.L. Talrose Institute for Energy Problems of Chemical Physics, N.N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 38, bld. 2, Leninsky Prospect, Moscow 119334, Russia
| | - Vassili N Lazarev
- Federal Research and Clinical Center of Physical-Chemical Medicine, 1a, Malaya Pirogovskaya, Moscow 119435, Russia.,Moscow Institute of Physics and Technology (State University), 9, Institutskiy per., Dolgoprudny, Moscow Region 141701, Russia
| | - Rustam H Ziganshin
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 16/10, Miklukho-Maklaya, Moscow 117997, Russia
| | - Mikhail V Gorshkov
- V.L. Talrose Institute for Energy Problems of Chemical Physics, N.N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 38, bld. 2, Leninsky Prospect, Moscow 119334, Russia
| | - Sergei A Moshkovskii
- Federal Research and Clinical Center of Physical-Chemical Medicine, 1a, Malaya Pirogovskaya, Moscow 119435, Russia.,Pirogov Russian National Research Medical University, 1, Ostrovityanova, Moscow 117997, Russia
| |
Collapse
|
28
|
Miller RM, Jordan BT, Mehlferber MM, Jeffery ED, Chatzipantsiou C, Kaur S, Millikin RJ, Dai Y, Tiberi S, Castaldi PJ, Shortreed MR, Luckey CJ, Conesa A, Smith LM, Deslattes Mays A, Sheynkman GM. Enhanced protein isoform characterization through long-read proteogenomics. Genome Biol 2022; 23:69. [PMID: 35241129 PMCID: PMC8892804 DOI: 10.1186/s13059-022-02624-y] [Citation(s) in RCA: 33] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Accepted: 02/02/2022] [Indexed: 02/04/2023] Open
Abstract
BACKGROUND The detection of physiologically relevant protein isoforms encoded by the human genome is critical to biomedicine. Mass spectrometry (MS)-based proteomics is the preeminent method for protein detection, but isoform-resolved proteomic analysis relies on accurate reference databases that match the sample; neither a subset nor a superset database is ideal. Long-read RNA sequencing (e.g., PacBio or Oxford Nanopore) provides full-length transcripts which can be used to predict full-length protein isoforms. RESULTS We describe here a long-read proteogenomics approach for integrating sample-matched long-read RNA-seq and MS-based proteomics data to enhance isoform characterization. We introduce a classification scheme for protein isoforms, discover novel protein isoforms, and present the first protein inference algorithm for the direct incorporation of long-read transcriptome data to enable detection of protein isoforms previously intractable to MS-based detection. We have released an open-source Nextflow pipeline that integrates long-read sequencing in a proteomic workflow for isoform-resolved analysis. CONCLUSIONS Our work suggests that the incorporation of long-read sequencing and proteomic data can facilitate improved characterization of human protein isoform diversity. Our first-generation pipeline provides a strong foundation for future development of long-read proteogenomics and its adoption for both basic and translational research.
Collapse
Affiliation(s)
- Rachel M. Miller
- grid.14003.360000 0001 2167 3675Department of Chemistry, University of Wisconsin-Madison, Madison, WI USA
| | - Ben T. Jordan
- grid.27755.320000 0000 9136 933XDepartment of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA USA
| | - Madison M. Mehlferber
- grid.27755.320000 0000 9136 933XDepartment of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA USA ,grid.27755.320000 0000 9136 933XDepartment of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA USA
| | - Erin D. Jeffery
- grid.27755.320000 0000 9136 933XDepartment of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA USA
| | | | - Simi Kaur
- grid.14003.360000 0001 2167 3675Department of Chemistry, University of Wisconsin-Madison, Madison, WI USA
| | - Robert J. Millikin
- grid.14003.360000 0001 2167 3675Department of Chemistry, University of Wisconsin-Madison, Madison, WI USA
| | - Yunxiang Dai
- grid.14003.360000 0001 2167 3675Department of Chemistry, University of Wisconsin-Madison, Madison, WI USA
| | - Simone Tiberi
- grid.7400.30000 0004 1937 0650Department of Molecular Life Sciences, University of Zurich, Zurich, Switzerland ,grid.7400.30000 0004 1937 0650Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland
| | - Peter J. Castaldi
- grid.62560.370000 0004 0378 8294Channing Division of Network Medicine, Brigham and Women’s Hospital, Boston, MA USA ,grid.62560.370000 0004 0378 8294Division of General Medicine and Primary Care, Brigham and Women’s Hospital, Boston, MA USA
| | - Michael R. Shortreed
- grid.14003.360000 0001 2167 3675Department of Chemistry, University of Wisconsin-Madison, Madison, WI USA
| | - Chance John Luckey
- grid.27755.320000 0000 9136 933XDepartment of Pathology, University of Virginia, Charlottesville, VA USA
| | - Ana Conesa
- grid.4711.30000 0001 2183 4846Institute for Integrative Systems Biology, Spanish National Research Council (CSIC), Paterna, Spain ,grid.15276.370000 0004 1936 8091Microbiology and Cell Science Department, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL USA
| | - Lloyd M. Smith
- grid.14003.360000 0001 2167 3675Department of Chemistry, University of Wisconsin-Madison, Madison, WI USA
| | - Anne Deslattes Mays
- grid.420089.70000 0000 9635 8082 Office of Data Science and Sharing, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Rockville, MD USA
| | - Gloria M. Sheynkman
- grid.27755.320000 0000 9136 933XDepartment of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA USA ,grid.27755.320000 0000 9136 933XCenter for Public Health Genomics, University of Virginia, Charlottesville, VA USA ,grid.27755.320000 0000 9136 933XUVA Cancer Center, University of Virginia, Charlottesville, VA USA
| |
Collapse
|
29
|
Reixachs-Solé M, Eyras E. Uncovering the impacts of alternative splicing on the proteome with current omics techniques. WILEY INTERDISCIPLINARY REVIEWS. RNA 2022; 13:e1707. [PMID: 34979593 PMCID: PMC9542554 DOI: 10.1002/wrna.1707] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/26/2021] [Revised: 11/27/2021] [Accepted: 11/29/2021] [Indexed: 12/15/2022]
Abstract
The high‐throughput sequencing of cellular RNAs has underscored a broad effect of isoform diversification through alternative splicing on the transcriptome. Moreover, the differential production of transcript isoforms from gene loci has been recognized as a critical mechanism in cell differentiation, organismal development, and disease. Yet, the extent of the impact of alternative splicing on protein production and cellular function remains a matter of debate. Multiple experimental and computational approaches have been developed in recent years to address this question. These studies have unveiled how molecular changes at different steps in the RNA processing pathway can lead to differences in protein production and have functional effects. New and emerging experimental technologies open exciting new opportunities to develop new methods to fully establish the connection between messenger RNA expression and protein production and to further investigate how RNA variation impacts the proteome and cell function. This article is categorized under:RNA Processing > Splicing Regulation/Alternative Splicing Translation > Regulation RNA Evolution and Genomics > Computational Analyses of RNA
Collapse
Affiliation(s)
- Marina Reixachs-Solé
- The John Curtin School of Medical Research, Australian National University, Canberra, Australian Capital Territory, Australia.,EMBL Australia Partner Laboratory Network and the Australian National University, Canberra, Australian Capital Territory, Australia
| | - Eduardo Eyras
- The John Curtin School of Medical Research, Australian National University, Canberra, Australian Capital Territory, Australia.,EMBL Australia Partner Laboratory Network and the Australian National University, Canberra, Australian Capital Territory, Australia.,Catalan Institution for Research and Advanced Studies, Barcelona, Spain.,Hospital del Mar Medical Research Institute (IMIM), Barcelona, Spain
| |
Collapse
|
30
|
Mehlferber MM, Jeffery ED, Saquing J, Jordan BT, Sheynkman L, Murali M, Genet G, Acharya BR, Hirschi KK, Sheynkman GM. Characterization of protein isoform diversity in human umbilical vein endothelial cells via long-read proteogenomics. RNA Biol 2022; 19:1228-1243. [PMID: 36457147 PMCID: PMC9721438 DOI: 10.1080/15476286.2022.2141938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
Endothelial cells (ECs) comprise the lumenal lining of all blood vessels and are critical for the functioning of the cardiovascular system. Their phenotypes can be modulated by alternative splicing of RNA to produce distinct protein isoforms. To characterize the RNA and protein isoform landscape within ECs, we applied a long read proteogenomics approach to analyse human umbilical vein endothelial cells (HUVECs). Transcripts delineated from PacBio sequencing serve as the basis for a sample-specific protein database used for downstream mass-spectrometry (MS) analysis to infer protein isoform expression. We detected 53,863 transcript isoforms from 10,426 genes, with 22,195 of those transcripts being novel. Furthermore, the predominant isoform in HUVECs does not correspond with the accepted "reference isoform" 25% of the time, with vascular pathway-related genes among this group. We found 2,597 protein isoforms supported through unique peptides, with an additional 2,280 isoforms nominated upon incorporation of long-read transcript evidence. We characterized a novel alternative acceptor for endothelial-related gene CDH5, suggesting potential changes in its associated signalling pathways. Finally, we identified novel protein isoforms arising from a diversity of RNA splicing mechanisms supported by uniquely mapped novel peptides. Our results represent a high-resolution atlas of known and novel isoforms of potential relevance to endothelial phenotypes and function.[Figure: see text].
Collapse
Affiliation(s)
- Madison M. Mehlferber
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, USA,Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
| | - Erin D. Jeffery
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
| | - Jamie Saquing
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
| | - Ben T. Jordan
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
| | - Leon Sheynkman
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
| | - Mayank Murali
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
| | - Gael Genet
- Department of Cell Biology, University of Virginia School of Medicine, Charlottesville, VA, USA
| | - Bipul R. Acharya
- Department of Cell Biology, University of Virginia School of Medicine, Charlottesville, VA, USA,Cardiovascular Research Center, University of Virginia, Charlottesville, VA, USA,Wellcome Centre for Cell-Matrix Research, Faculty of Biology, Medicine and Health, the University of Manchester, UK
| | - Karen K. Hirschi
- Department of Cell Biology, University of Virginia School of Medicine, Charlottesville, VA, USA,Cardiovascular Research Center, University of Virginia, Charlottesville, VA, USA
| | - Gloria M. Sheynkman
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, USA,Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA,Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA,UVA Comprehensive Cancer Center, University of Virginia, Charlottesville, Virginia, USA,CONTACT Gloria M. Sheynkman The Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
| |
Collapse
|
31
|
Hümmer S, Borao S, Guerra-Moreno A, Cozzuto L, Hidalgo E, Ayté J. Cross talk between the upstream exon-intron junction and Prp2 facilitates splicing of non-consensus introns. Cell Rep 2021; 37:109893. [PMID: 34706246 DOI: 10.1016/j.celrep.2021.109893] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2020] [Revised: 04/27/2021] [Accepted: 10/06/2021] [Indexed: 10/20/2022] Open
Abstract
Splicing of mRNA precursors is essential in the regulation of gene expression. U2AF65 recognizes the poly-pyrimidine tract and helps in the recognition of the branch point. Inactivation of fission yeast U2AF65 (Prp2) blocks splicing of most, but not all, pre-mRNAs, for reasons that are not understood. Here, we have determined genome-wide the splicing efficiency of fission yeast cells as they progress into synchronous meiosis in the presence or absence of functional Prp2. Our data indicate that in addition to the splicing elements at the 3' end of any intron, the nucleotides immediately upstream the intron will determine whether Prp2 is required or dispensable for splicing. By changing those nucleotides in any given intron, we regulate its Prp2 dependency. Our results suggest a model in which Prp2 is required for the coordinated recognition of both intronic ends, placing Prp2 as a key regulatory element in the determination of the exon-intron boundaries.
Collapse
Affiliation(s)
- Stefan Hümmer
- Oxidative Stress and Cell Cycle Group, Universitat Pompeu Fabra, 08003 Barcelona, Spain.
| | - Sonia Borao
- Oxidative Stress and Cell Cycle Group, Universitat Pompeu Fabra, 08003 Barcelona, Spain
| | - Angel Guerra-Moreno
- Oxidative Stress and Cell Cycle Group, Universitat Pompeu Fabra, 08003 Barcelona, Spain
| | - Luca Cozzuto
- CRG Bioinformatics Core, Centre de Regulació Genòmica (CRG), 08003 Barcelona, Spain
| | - Elena Hidalgo
- Oxidative Stress and Cell Cycle Group, Universitat Pompeu Fabra, 08003 Barcelona, Spain
| | - José Ayté
- Oxidative Stress and Cell Cycle Group, Universitat Pompeu Fabra, 08003 Barcelona, Spain.
| |
Collapse
|
32
|
Ran X, Hu F, Mao N, Ruan Y, Yi F, Niu X, Huang S, Li S, You L, Zhang F, Tang L, Wang J, Liu J. Differences in gene expression and variable splicing events of ovaries between large and small litter size in Chinese Xiang pigs. Porcine Health Manag 2021; 7:52. [PMID: 34470660 PMCID: PMC8411529 DOI: 10.1186/s40813-021-00226-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Accepted: 07/20/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Although lots of quantitative trait loci (QTLs) and genes present roles in litter size of some breeds, the information might not make it clear for the huge diversity of reproductive capability in pig breeds. To elucidate the inherent mechanisms of heterogeneity of reproductive capability in litter size of Xiang pig, we performed transcriptome analysis for the expression profile in ovaries using RNA-seq method. RESULTS We identified 1,419 up-regulated and 1,376 down-regulated genes in Xiang pigs with large litter size. Among them, 1,010 differentially expressed genes (DEGs) were differently spliced between two groups with large or small litter sizes. Based on GO and KEGG analysis, numerous members of genes were gathered in ovarian steroidogenesis, steroid biosynthesis, oocyte maturation and reproduction processes. CONCLUSIONS Combined with gene biological function, twelve genes were found out that might be related with the reproductive capability of Xiang pig, of which, eleven genes were recognized as hub genes. These genes may play a role in promoting litter size by elevating steroid and peptide hormones supply through the ovary and facilitating the processes of ovulation and in vivo fertilization.
Collapse
Affiliation(s)
- Xueqin Ran
- College of Animal Science, Institute of Agro-Bioengineering and Key Laboratory of Plant Resource Conservative and Germplam Innovation in Mountainous Region (Ministry of Education), Guizhou University, 550025, Guiyang, China
| | - Fengbin Hu
- College of Animal Science, Institute of Agro-Bioengineering and Key Laboratory of Plant Resource Conservative and Germplam Innovation in Mountainous Region (Ministry of Education), Guizhou University, 550025, Guiyang, China
| | - Ning Mao
- College of Animal Science, Institute of Agro-Bioengineering and Key Laboratory of Plant Resource Conservative and Germplam Innovation in Mountainous Region (Ministry of Education), Guizhou University, 550025, Guiyang, China
| | - Yiqi Ruan
- College of Animal Science, Institute of Agro-Bioengineering and Key Laboratory of Plant Resource Conservative and Germplam Innovation in Mountainous Region (Ministry of Education), Guizhou University, 550025, Guiyang, China
| | - Fanli Yi
- College of Animal Science, Institute of Agro-Bioengineering and Key Laboratory of Plant Resource Conservative and Germplam Innovation in Mountainous Region (Ministry of Education), Guizhou University, 550025, Guiyang, China
| | - Xi Niu
- College of Animal Science, Institute of Agro-Bioengineering and Key Laboratory of Plant Resource Conservative and Germplam Innovation in Mountainous Region (Ministry of Education), Guizhou University, 550025, Guiyang, China
| | - Shihui Huang
- College of Animal Science, Institute of Agro-Bioengineering and Key Laboratory of Plant Resource Conservative and Germplam Innovation in Mountainous Region (Ministry of Education), Guizhou University, 550025, Guiyang, China
| | - Sheng Li
- College of Animal Science, Institute of Agro-Bioengineering and Key Laboratory of Plant Resource Conservative and Germplam Innovation in Mountainous Region (Ministry of Education), Guizhou University, 550025, Guiyang, China
| | - Longjiang You
- College of Animal Science, Institute of Agro-Bioengineering and Key Laboratory of Plant Resource Conservative and Germplam Innovation in Mountainous Region (Ministry of Education), Guizhou University, 550025, Guiyang, China
| | - Fuping Zhang
- College of Animal Science, Institute of Agro-Bioengineering and Key Laboratory of Plant Resource Conservative and Germplam Innovation in Mountainous Region (Ministry of Education), Guizhou University, 550025, Guiyang, China
| | - Liangting Tang
- College of Animal Science, Institute of Agro-Bioengineering and Key Laboratory of Plant Resource Conservative and Germplam Innovation in Mountainous Region (Ministry of Education), Guizhou University, 550025, Guiyang, China
| | - Jiafu Wang
- College of Animal Science, Institute of Agro-Bioengineering and Key Laboratory of Plant Resource Conservative and Germplam Innovation in Mountainous Region (Ministry of Education), Guizhou University, 550025, Guiyang, China.
| | - Jianfeng Liu
- National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, 100193, Beijing, China
| |
Collapse
|
33
|
Zea DJ, Laskina S, Baudin A, Richard H, Laine E. Assessing conservation of alternative splicing with evolutionary splicing graphs. Genome Res 2021; 31:1462-1473. [PMID: 34266979 PMCID: PMC8327911 DOI: 10.1101/gr.274696.120] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2020] [Accepted: 06/11/2021] [Indexed: 12/29/2022]
Abstract
Understanding how protein function has evolved and diversified is of great importance for human genetics and medicine. Here, we tackle the problem of describing the whole transcript variability observed in several species by generalizing the definition of splicing graph. We provide a practical solution to construct parsimonious evolutionary splicing graphs where each node is a minimal transcript building block defined across species. We show a clear link between the functional relevance, tissue regulation, and conservation of alternative transcripts on a set of 50 genes. By scaling up to the whole human protein-coding genome, we identify a few thousand genes where alternative splicing modulates the number and composition of pseudorepeats. We have implemented our approach in ThorAxe, an efficient, versatile, robust, and freely available computational tool.
Collapse
Affiliation(s)
- Diego Javier Zea
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 75005 Paris, France
| | - Sofya Laskina
- Bioinformatics Unit (MF1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, 13353 Berlin, Germany
| | - Alexis Baudin
- Sorbonne Université, CNRS, LIP6, F-75005 Paris, France
| | - Hugues Richard
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 75005 Paris, France
- Bioinformatics Unit (MF1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, 13353 Berlin, Germany
| | - Elodie Laine
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 75005 Paris, France
| |
Collapse
|
34
|
Pozo F, Martinez-Gomez L, Walsh TA, Rodriguez JM, Di Domenico T, Abascal F, Vazquez J, Tress ML. Assessing the functional relevance of splice isoforms. NAR Genom Bioinform 2021; 3:lqab044. [PMID: 34046593 PMCID: PMC8140736 DOI: 10.1093/nargab/lqab044] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2020] [Revised: 04/22/2021] [Accepted: 05/17/2021] [Indexed: 12/20/2022] Open
Abstract
Alternative splicing of messenger RNA can generate an array of mature transcripts, but it is not clear how many go on to produce functionally relevant protein isoforms. There is only limited evidence for alternative proteins in proteomics analyses and data from population genetic variation studies indicate that most alternative exons are evolving neutrally. Determining which transcripts produce biologically important isoforms is key to understanding isoform function and to interpreting the real impact of somatic mutations and germline variations. Here we have developed a method, TRIFID, to classify the functional importance of splice isoforms. TRIFID was trained on isoforms detected in large-scale proteomics analyses and distinguishes these biologically important splice isoforms with high confidence. Isoforms predicted as functionally important by the algorithm had measurable cross species conservation and significantly fewer broken functional domains. Additionally, exons that code for these functionally important protein isoforms are under purifying selection, while exons from low scoring transcripts largely appear to be evolving neutrally. TRIFID has been developed for the human genome, but it could in principle be applied to other well-annotated species. We believe that this method will generate valuable insights into the cellular importance of alternative splicing.
Collapse
Affiliation(s)
- Fernando Pozo
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
| | - Laura Martinez-Gomez
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
| | - Thomas A Walsh
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
| | - José Manuel Rodriguez
- Cardiovascular Proteomics Laboratory, Centro Nacional de Investigaciones Cardiovasculares (CNIC), Madrid, Spain
| | - Tomas Di Domenico
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
| | - Federico Abascal
- Somatic Evolution Group, Wellcome Sanger Institute, Hinxton CB10 1SA, UK
| | - Jesús Vazquez
- Cardiovascular Proteomics Laboratory, Centro Nacional de Investigaciones Cardiovasculares (CNIC), Madrid, Spain
| | - Michael L Tress
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
| |
Collapse
|
35
|
An extended catalogue of tandem alternative splice sites in human tissue transcriptomes. PLoS Comput Biol 2021; 17:e1008329. [PMID: 33826604 PMCID: PMC8055015 DOI: 10.1371/journal.pcbi.1008329] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2020] [Revised: 04/19/2021] [Accepted: 03/22/2021] [Indexed: 12/18/2022] Open
Abstract
Tandem alternative splice sites (TASS) is a special class of alternative splicing events that are characterized by a close tandem arrangement of splice sites. Most TASS lack functional characterization and are believed to arise from splicing noise. Based on the RNA-seq data from the Genotype Tissue Expression project, we present an extended catalogue of TASS in healthy human tissues and analyze their tissue-specific expression. The expression of TASS is usually dominated by one major splice site (maSS), while the expression of minor splice sites (miSS) is at least an order of magnitude lower. Among 46k miSS with sufficient read support, 9k (20%) are significantly expressed above the expected noise level, and among them 2.5k are expressed tissue-specifically. We found significant correlations between tissue-specific expression of RNA-binding proteins (RBP), tissue-specific expression of miSS, and miSS response to RBP inactivation by shRNA. In combination with RBP profiling by eCLIP, this allowed prediction of novel cases of tissue-specific splicing regulation including a miSS in QKI mRNA that is likely regulated by PTBP1. The analysis of human primary cell transcriptomes suggested that both tissue-specific and cell-type-specific factors contribute to the regulation of miSS expression. More than 20% of tissue-specific miSS affect structured protein regions and may adjust protein-protein interactions or modify the stability of the protein core. The significantly expressed miSS evolve under the same selection pressure as maSS, while other miSS lack signatures of evolutionary selection and conservation. Using mixture models, we estimated that not more than 15% of maSS and not more than 54% of tissue-specific miSS are noisy, while the proportion of noisy splice sites among non-significantly expressed miSS is above 63%. Pre-mRNA splicing is an important step in the processing of the genomic information during gene expression. During splicing, introns are excised from a gene transcript, and the remaining exons are ligated. Our work concerns one its particular subtype, which involves the so-called tandem alternative splice sites, a group of closely located exon borders that are used alternatively. We analyzed RNA-seq measurements of gene expression provided by the Genotype-Tissue Expression (GTEx) project, the largest to-date collection of such measurements in healthy human tissues, and constructed a detailed catalogue of tandem alternative splice sites. Within this catalogue, we characterized patterns of tissue-specific expression, regulation, impact on protein structure, and evolutionary selection acting on tandem alternative splice sites. In a number of genes, we predicted regulatory mechanisms that could be responsible for choosing one of many tandem alternative splice sites. The results of this study provide an invaluable resource for molecular biologists studying alternative splicing.
Collapse
|
36
|
Moaddel R, Ubaida‐Mohien C, Tanaka T, Lyashkov A, Basisty N, Schilling B, Semba RD, Franceschi C, Gorospe M, Ferrucci L. Proteomics in aging research: A roadmap to clinical, translational research. Aging Cell 2021; 20:e13325. [PMID: 33730416 PMCID: PMC8045948 DOI: 10.1111/acel.13325] [Citation(s) in RCA: 62] [Impact Index Per Article: 20.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2020] [Revised: 12/31/2020] [Accepted: 01/18/2021] [Indexed: 02/06/2023] Open
Abstract
The identification of plasma proteins that systematically change with age and, independent of chronological age, predict accelerated decline of health is an expanding area of research. Circulating proteins are ideal translational "omics" since they are final effectors of physiological pathways and because physicians are accustomed to use information of plasma proteins as biomarkers for diagnosis, prognosis, and tracking the effectiveness of treatments. Recent technological advancements, including mass spectrometry (MS)-based proteomics, multiplexed proteomic assay using modified aptamers (SOMAscan), and Proximity Extension Assay (PEA, O-Link), have allowed for the assessment of thousands of proteins in plasma or other biological matrices, which are potentially translatable into new clinical biomarkers and provide new clues about the mechanisms by which aging is associated with health deterioration and functional decline. We carried out a detailed literature search for proteomic studies performed in different matrices (plasma, serum, urine, saliva, tissues) and species using multiple platforms. Herein, we identified 232 proteins that were age-associated across studies. Enrichment analysis of the 232 age-associated proteins revealed metabolic pathways previously connected with biological aging both in animal models and in humans, most remarkably insulin-like growth factor (IGF) signaling, mitogen-activated protein kinases (MAPK), hypoxia-inducible factor 1 (HIF1), cytokine signaling, Forkhead Box O (FOXO) metabolic pathways, folate metabolism, advance glycation end products (AGE), and receptor AGE (RAGE) metabolic pathway. Information on these age-relevant proteins, likely expanded and validated in longitudinal studies and examined in mechanistic studies, will be essential for patient stratification and the development of new treatments aimed at improving health expectancy.
Collapse
Affiliation(s)
- Ruin Moaddel
- Biomedical Research Centre National Institute on Aging, NIH Baltimore MD USA
| | | | - Toshiko Tanaka
- Biomedical Research Centre National Institute on Aging, NIH Baltimore MD USA
| | - Alexey Lyashkov
- Biomedical Research Centre National Institute on Aging, NIH Baltimore MD USA
| | | | | | - Richard D Semba
- Wilmer Eye Institute Johns Hopkins University School of Medicine Baltimore MD USA
| | - Claudio Franceschi
- University of Bologna and IRCCS Institute of Neurological Sciences Bologna Italy
| | - Myriam Gorospe
- Biomedical Research Centre National Institute on Aging, NIH Baltimore MD USA
| | - Luigi Ferrucci
- Biomedical Research Centre National Institute on Aging, NIH Baltimore MD USA
| |
Collapse
|
37
|
Salovska B, Zhu H, Gandhi T, Frank M, Li W, Rosenberger G, Wu C, Germain PL, Zhou H, Hodny Z, Reiter L, Liu Y. Isoform-resolved correlation analysis between mRNA abundance regulation and protein level degradation. Mol Syst Biol 2021; 16:e9170. [PMID: 32175694 PMCID: PMC7073818 DOI: 10.15252/msb.20199170] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2019] [Revised: 02/06/2020] [Accepted: 02/12/2020] [Indexed: 12/15/2022] Open
Abstract
Profiling of biological relationships between different molecular layers dissects regulatory mechanisms that ultimately determine cellular function. To thoroughly assess the role of protein post‐translational turnover, we devised a strategy combining pulse stable isotope‐labeled amino acids in cells (pSILAC), data‐independent acquisition mass spectrometry (DIA‐MS), and a novel data analysis framework that resolves protein degradation rate on the level of mRNA alternative splicing isoforms and isoform groups. We demonstrated our approach by the genome‐wide correlation analysis between mRNA amounts and protein degradation across different strains of HeLa cells that harbor a high grade of gene dosage variation. The dataset revealed that specific biological processes, cellular organelles, spatial compartments of organelles, and individual protein isoforms of the same genes could have distinctive degradation rate. The protein degradation diversity thus dissects the corresponding buffering or concerting protein turnover control across cancer cell lines. The data further indicate that specific mRNA splicing events such as intron retention significantly impact the protein abundance levels. Our findings support the tight association between transcriptome variability and proteostasis and provide a methodological foundation for studying functional protein degradation.
Collapse
Affiliation(s)
- Barbora Salovska
- Yale Cancer Biology Institute, Yale University, West Haven, CT, USA.,Department of Genome Integrity, Institute of Molecular Genetics of the Czech Academy of Sciences, Prague, Czech Republic
| | - Hongwen Zhu
- Department of Analytical Chemistry and CAS Key Laboratory of Receptor Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
| | | | - Max Frank
- European Molecular Biology Laboratory, Heidelberg, Germany
| | - Wenxue Li
- Yale Cancer Biology Institute, Yale University, West Haven, CT, USA
| | | | - Chongde Wu
- Yale Cancer Biology Institute, Yale University, West Haven, CT, USA
| | - Pierre-Luc Germain
- Institute for Neuroscience, D-HEST, ETH Zurich, Zurich, Switzerland.,Statistical Bioinformatics Lab, DMLS, University of Zürich, Zurich, Switzerland
| | - Hu Zhou
- Department of Analytical Chemistry and CAS Key Laboratory of Receptor Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
| | - Zdenek Hodny
- Department of Genome Integrity, Institute of Molecular Genetics of the Czech Academy of Sciences, Prague, Czech Republic
| | | | - Yansheng Liu
- Yale Cancer Biology Institute, Yale University, West Haven, CT, USA.,Department of Pharmacology, Yale University School of Medicine, New Haven, CT, USA
| |
Collapse
|
38
|
Wang L, Shamardani K, Babikir H, Catalan F, Nejo T, Chang S, Phillips JJ, Okada H, Diaz AA. The evolution of alternative splicing in glioblastoma under therapy. Genome Biol 2021; 22:48. [PMID: 33499924 PMCID: PMC7835670 DOI: 10.1186/s13059-021-02259-5] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2020] [Accepted: 01/04/2021] [Indexed: 12/03/2022] Open
Abstract
Background Alternative splicing is a rich source of tumor-specific neoantigen targets for immunotherapy. This holds promise for glioblastomas (GBMs), the most common primary tumors of the adult brain, which are resistant to standard-of-care therapy. Although most clinical trials enroll patients at recurrence, most preclinical studies have been done with specimens from primary disease. There are limited expression data from GBMs at recurrence and surprisingly little is known about the evolution of splicing patterns under therapy. Result We profile 37 primary-recurrent paired human GBM specimens via RNA sequencing. We describe the landscape of alternative splicing in GBM at recurrence and contrast that to primary and non-malignant brain-tissue specimens. By screening single-cell atlases, we identify cell-type-specific splicing patterns and novel splicing events in cell-surface proteins that are suitable targets for engineered T cell therapies. We identify recurrent-specific isoforms of mitogen-activated kinase pathway genes that enhance invasiveness and are preferentially expressed by stem-like cells. Conclusion These studies shed light on gene expression in recurrent GBM and identify novel targets for therapeutic development. Supplementary Information The online version contains supplementary material available at 10.1186/s13059-021-02259-5.
Collapse
Affiliation(s)
- Lin Wang
- Department of Neurological Surgery, University of California, San Francisco, 1450 3rd Street, San Francisco, CA, 94158, USA.,Helen Diller Family Comprehensive Cancer Center, 1450, 3rd Street, San Francisco, CA, 94158, USA
| | - Karin Shamardani
- Department of Neurological Surgery, University of California, San Francisco, 1450 3rd Street, San Francisco, CA, 94158, USA.,Helen Diller Family Comprehensive Cancer Center, 1450, 3rd Street, San Francisco, CA, 94158, USA
| | - Husam Babikir
- Department of Neurological Surgery, University of California, San Francisco, 1450 3rd Street, San Francisco, CA, 94158, USA.,Helen Diller Family Comprehensive Cancer Center, 1450, 3rd Street, San Francisco, CA, 94158, USA
| | - Francisca Catalan
- Department of Neurological Surgery, University of California, San Francisco, 1450 3rd Street, San Francisco, CA, 94158, USA.,Helen Diller Family Comprehensive Cancer Center, 1450, 3rd Street, San Francisco, CA, 94158, USA
| | - Takahide Nejo
- Department of Neurological Surgery, University of California, San Francisco, 1450 3rd Street, San Francisco, CA, 94158, USA.,Helen Diller Family Comprehensive Cancer Center, 1450, 3rd Street, San Francisco, CA, 94158, USA
| | - Susan Chang
- Department of Neurological Surgery, University of California, San Francisco, 1450 3rd Street, San Francisco, CA, 94158, USA.,Helen Diller Family Comprehensive Cancer Center, 1450, 3rd Street, San Francisco, CA, 94158, USA
| | - Joanna J Phillips
- Department of Neurological Surgery, University of California, San Francisco, 1450 3rd Street, San Francisco, CA, 94158, USA.,Helen Diller Family Comprehensive Cancer Center, 1450, 3rd Street, San Francisco, CA, 94158, USA
| | - Hideho Okada
- Department of Neurological Surgery, University of California, San Francisco, 1450 3rd Street, San Francisco, CA, 94158, USA.,Helen Diller Family Comprehensive Cancer Center, 1450, 3rd Street, San Francisco, CA, 94158, USA.,Parker Institute for Cancer Immunotherapy , 1 Letterman Dr Suite D3500, Building D, San Francisco, CA, 94129, USA
| | - Aaron A Diaz
- Department of Neurological Surgery, University of California, San Francisco, 1450 3rd Street, San Francisco, CA, 94158, USA. .,Helen Diller Family Comprehensive Cancer Center, 1450, 3rd Street, San Francisco, CA, 94158, USA.
| |
Collapse
|
39
|
Hekman RM, Hume AJ, Goel RK, Abo KM, Huang J, Blum BC, Werder RB, Suder EL, Paul I, Phanse S, Youssef A, Alysandratos KD, Padhorny D, Ojha S, Mora-Martin A, Kretov D, Ash PEA, Verma M, Zhao J, Patten JJ, Villacorta-Martin C, Bolzan D, Perea-Resa C, Bullitt E, Hinds A, Tilston-Lunel A, Varelas X, Farhangmehr S, Braunschweig U, Kwan JH, McComb M, Basu A, Saeed M, Perissi V, Burks EJ, Layne MD, Connor JH, Davey R, Cheng JX, Wolozin BL, Blencowe BJ, Wuchty S, Lyons SM, Kozakov D, Cifuentes D, Blower M, Kotton DN, Wilson AA, Mühlberger E, Emili A. Actionable Cytopathogenic Host Responses of Human Alveolar Type 2 Cells to SARS-CoV-2. Mol Cell 2020; 80:1104-1122.e9. [PMID: 33259812 PMCID: PMC7674017 DOI: 10.1016/j.molcel.2020.11.028] [Citation(s) in RCA: 63] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Revised: 10/16/2020] [Accepted: 11/11/2020] [Indexed: 12/11/2022]
Abstract
Human transmission of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), causative pathogen of the COVID-19 pandemic, exerts a massive health and socioeconomic crisis. The virus infects alveolar epithelial type 2 cells (AT2s), leading to lung injury and impaired gas exchange, but the mechanisms driving infection and pathology are unclear. We performed a quantitative phosphoproteomic survey of induced pluripotent stem cell-derived AT2s (iAT2s) infected with SARS-CoV-2 at air-liquid interface (ALI). Time course analysis revealed rapid remodeling of diverse host systems, including signaling, RNA processing, translation, metabolism, nuclear integrity, protein trafficking, and cytoskeletal-microtubule organization, leading to cell cycle arrest, genotoxic stress, and innate immunity. Comparison to analogous data from transformed cell lines revealed respiratory-specific processes hijacked by SARS-CoV-2, highlighting potential novel therapeutic avenues that were validated by a high hit rate in a targeted small molecule screen in our iAT2 ALI system.
Collapse
Affiliation(s)
- Ryan M Hekman
- Center for Network Systems Biology, Boston University, Boston, MA, USA; Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Adam J Hume
- Department of Microbiology, Boston University School of Medicine, Boston, MA, USA; National Emerging Infectious Diseases Laboratories, Boston University, Boston, MA, USA
| | - Raghuveera Kumar Goel
- Center for Network Systems Biology, Boston University, Boston, MA, USA; Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Kristine M Abo
- Center for Regenerative Medicine of Boston University and Boston Medical Center, Boston, MA, USA; The Pulmonary Center, Department of Medicine, Boston University School of Medicine, Boston, MA, USA
| | - Jessie Huang
- Center for Regenerative Medicine of Boston University and Boston Medical Center, Boston, MA, USA; The Pulmonary Center, Department of Medicine, Boston University School of Medicine, Boston, MA, USA
| | - Benjamin C Blum
- Center for Network Systems Biology, Boston University, Boston, MA, USA; Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Rhiannon B Werder
- Center for Regenerative Medicine of Boston University and Boston Medical Center, Boston, MA, USA; The Pulmonary Center, Department of Medicine, Boston University School of Medicine, Boston, MA, USA
| | - Ellen L Suder
- Department of Microbiology, Boston University School of Medicine, Boston, MA, USA; National Emerging Infectious Diseases Laboratories, Boston University, Boston, MA, USA
| | - Indranil Paul
- Center for Network Systems Biology, Boston University, Boston, MA, USA; Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Sadhna Phanse
- Center for Network Systems Biology, Boston University, Boston, MA, USA
| | - Ahmed Youssef
- Center for Network Systems Biology, Boston University, Boston, MA, USA; Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA; Bioinformatics Program, Boston University, Boston, MA, USA
| | - Konstantinos D Alysandratos
- Center for Regenerative Medicine of Boston University and Boston Medical Center, Boston, MA, USA; The Pulmonary Center, Department of Medicine, Boston University School of Medicine, Boston, MA, USA
| | - Dzmitry Padhorny
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY, USA; Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, USA
| | - Sandeep Ojha
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | | | - Dmitry Kretov
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Peter E A Ash
- Department of Pharmacology, Boston University School of Medicine, Boston, MA, USA
| | - Mamta Verma
- Department of Pharmacology, Boston University School of Medicine, Boston, MA, USA
| | - Jian Zhao
- Department of Electrical and Computer Engineering, Boston University, Boston, MA, USA
| | - J J Patten
- Department of Microbiology, Boston University School of Medicine, Boston, MA, USA; National Emerging Infectious Diseases Laboratories, Boston University, Boston, MA, USA
| | - Carlos Villacorta-Martin
- Center for Regenerative Medicine of Boston University and Boston Medical Center, Boston, MA, USA
| | - Dante Bolzan
- Department of Computer Science, University of Miami, Miami, FL, USA
| | - Carlos Perea-Resa
- Department of Molecular Biology, Harvard Medical School, Boston, MA, USA
| | - Esther Bullitt
- Department of Physiology and Biophysics, Boston University, Boston, MA, USA
| | - Anne Hinds
- The Pulmonary Center, Department of Medicine, Boston University School of Medicine, Boston, MA, USA
| | - Andrew Tilston-Lunel
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Xaralabos Varelas
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Shaghayegh Farhangmehr
- Donnelly Centre, University of Toronto, Toronto, ON, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | | | - Julian H Kwan
- Center for Network Systems Biology, Boston University, Boston, MA, USA; Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Mark McComb
- Center for Network Systems Biology, Boston University, Boston, MA, USA; Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA; Center for Biomedical Mass Spectrometry, Boston University School of Medicine, Boston, MA, USA
| | - Avik Basu
- Center for Network Systems Biology, Boston University, Boston, MA, USA; Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Mohsan Saeed
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA; National Emerging Infectious Diseases Laboratories, Boston University, Boston, MA, USA
| | - Valentina Perissi
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Eric J Burks
- Department of Pathology and Laboratory Medicine, Boston University School of Medicine, Boston, MA, USA
| | - Matthew D Layne
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - John H Connor
- Department of Microbiology, Boston University School of Medicine, Boston, MA, USA; National Emerging Infectious Diseases Laboratories, Boston University, Boston, MA, USA
| | - Robert Davey
- Department of Microbiology, Boston University School of Medicine, Boston, MA, USA; National Emerging Infectious Diseases Laboratories, Boston University, Boston, MA, USA
| | - Ji-Xin Cheng
- Department of Biomedical Engineering, Boston University, Boston, MA, USA
| | - Benjamin L Wolozin
- Department of Pharmacology, Boston University School of Medicine, Boston, MA, USA
| | - Benjamin J Blencowe
- Donnelly Centre, University of Toronto, Toronto, ON, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Miami, FL, USA; Department of Biology, University of Miami, Miami, FL, USA; Miami Institute of Data Science and Computing, Miami, FL, USA
| | - Shawn M Lyons
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Dima Kozakov
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY, USA; Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, USA
| | - Daniel Cifuentes
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Michael Blower
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA; Department of Molecular Biology, Harvard Medical School, Boston, MA, USA
| | - Darrell N Kotton
- Center for Regenerative Medicine of Boston University and Boston Medical Center, Boston, MA, USA; The Pulmonary Center, Department of Medicine, Boston University School of Medicine, Boston, MA, USA.
| | - Andrew A Wilson
- Center for Regenerative Medicine of Boston University and Boston Medical Center, Boston, MA, USA; The Pulmonary Center, Department of Medicine, Boston University School of Medicine, Boston, MA, USA.
| | - Elke Mühlberger
- Department of Microbiology, Boston University School of Medicine, Boston, MA, USA; National Emerging Infectious Diseases Laboratories, Boston University, Boston, MA, USA.
| | - Andrew Emili
- Center for Network Systems Biology, Boston University, Boston, MA, USA; Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA; Department of Biology, Boston University, Boston, MA, USA.
| |
Collapse
|
40
|
Lau E, Han Y, Williams DR, Thomas CT, Shrestha R, Wu JC, Lam MPY. Splice-Junction-Based Mapping of Alternative Isoforms in the Human Proteome. Cell Rep 2020; 29:3751-3765.e5. [PMID: 31825849 PMCID: PMC6961840 DOI: 10.1016/j.celrep.2019.11.026] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2019] [Revised: 09/24/2019] [Accepted: 11/06/2019] [Indexed: 12/18/2022] Open
Abstract
The protein-level translational status and function of many alternative splicing events remain poorly understood. We use an RNA sequencing (RNA-seq)-guided proteomics method to identify protein alternative splicing isoforms in the human proteome by constructing tissue-specific protein databases that prioritize transcript splice junction pairs with high translational potential. Using the custom databases to reanalyze ~80 million mass spectra in public proteomics datasets, we identify more than 1,500 noncanonical protein isoforms across 12 human tissues, including ~400 sequences undocumented on TrEMBL and RefSeq databases. We apply the method to original quantitative mass spectrometry experiments and observe widespread isoform regulation during human induced pluripotent stem cell cardiomyocyte differentiation. On a proteome scale, alternative isoform regions overlap frequently with disordered sequences and post-translational modification sites, suggesting that alternative splicing may regulate protein function through modulating intrinsically disordered regions. The described approach may help elucidate functional consequences of alternative splicing and expand the scope of proteomics investigations in various systems. The translation and function of many alternative splicing events await confirmation at the protein level. Lau et al. use an integrated proteotranscriptomics approach to identify non-canonical and undocumented isoforms from 12 organs in the human proteome. Alternative isoforms interfere with functional sequence features and are differentially regulated during iPSC cardiomyocyte differentiation.
Collapse
Affiliation(s)
- Edward Lau
- Stanford Cardiovascular Institute, Department of Medicine, Stanford University, Palo Alto, CA, USA
| | - Yu Han
- Consortium for Fibrosis Research and Translation, Anschutz Medical Campus, University of Colorado, Aurora, CO, USA; Departments of Medicine-Cardiology and Biochemistry and Molecular Genetics, Anschutz Medical Campus, University of Colorado, Aurora, CO, USA
| | - Damon R Williams
- Stanford Cardiovascular Institute, Department of Medicine, Stanford University, Palo Alto, CA, USA
| | - Cody T Thomas
- Departments of Medicine-Cardiology and Biochemistry and Molecular Genetics, Anschutz Medical Campus, University of Colorado, Aurora, CO, USA
| | - Rajani Shrestha
- Stanford Cardiovascular Institute, Department of Medicine, Stanford University, Palo Alto, CA, USA
| | - Joseph C Wu
- Stanford Cardiovascular Institute, Department of Medicine, Stanford University, Palo Alto, CA, USA; Department of Radiology, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - Maggie P Y Lam
- Consortium for Fibrosis Research and Translation, Anschutz Medical Campus, University of Colorado, Aurora, CO, USA; Departments of Medicine-Cardiology and Biochemistry and Molecular Genetics, Anschutz Medical Campus, University of Colorado, Aurora, CO, USA.
| |
Collapse
|
41
|
Gaddi GM, Gisonno RA, Rosú SA, Cortez MF, Finarelli GS, Ramella NA, Tricerri MA. Data regarding the sensibility to proteolysis of a natural apolipoprotein A-I mutant. Data Brief 2020; 31:105960. [PMID: 32676531 PMCID: PMC7352074 DOI: 10.1016/j.dib.2020.105960] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2020] [Revised: 06/22/2020] [Accepted: 06/29/2020] [Indexed: 11/18/2022] Open
Abstract
The article shows dataset of the proteolysis of a natural variant of apolipoprotein A-I (apoA-I) with a substitution of a leucine by and arginine in position 60 (L60R), in comparison with the protein with the native sequence (Wt). This information demonstrates the potential of in vitro partial proteolysis experiments as it may be applicable to different approaches in the biophysical field. We have analyzed by different electrophoresis techniques apoA-I variants, quantified the degree of proteolysis after staining and compared the proteolysis efficiency with the computed cleavage patterns. The data shown here clearly strengthen the usefulness of this approach to test protein flexibility, as it may be attained with enzymes which are not expected to modify in vivo this protein but have a well-known digestion pattern. In addition it is appropriate for evaluating protein catabolism, as it is exemplified here by the evidence with metalloproteinase 12 (MMP-12), which is a physiological protease that may elicit the pro-inflammatory processing of this variant within the lesions. We support the work “Structural analysis of a natural apolipoprotein A-I variant (L60R) associated with amyloidosis” (Gaddi, et al., 2020), gaining insights on protein folding from a characterization by proteolysis analysis [1].
Collapse
Affiliation(s)
- Gisela M. Gaddi
- Instituto de Investigaciones Bioquímicas de La Plata (INIBIOLP), CONICET, La Plata, Buenos Aires, Argentina
- Facultad de Ciencias Médicas, Universidad Nacional de La Plata, Calle 60 y 120. La Plata, Buenos Aires, Argentina
| | - Romina A. Gisonno
- Instituto de Investigaciones Bioquímicas de La Plata (INIBIOLP), CONICET, La Plata, Buenos Aires, Argentina
- Facultad de Ciencias Médicas, Universidad Nacional de La Plata, Calle 60 y 120. La Plata, Buenos Aires, Argentina
| | - Silvana A. Rosú
- Instituto de Investigaciones Bioquímicas de La Plata (INIBIOLP), CONICET, La Plata, Buenos Aires, Argentina
- Facultad de Ciencias Médicas, Universidad Nacional de La Plata, Calle 60 y 120. La Plata, Buenos Aires, Argentina
| | - M. Fernanda Cortez
- Instituto de Investigaciones Bioquímicas de La Plata (INIBIOLP), CONICET, La Plata, Buenos Aires, Argentina
| | - Gabriela S. Finarelli
- Instituto de Investigaciones Bioquímicas de La Plata (INIBIOLP), CONICET, La Plata, Buenos Aires, Argentina
| | - Nahuel A. Ramella
- Instituto de Investigaciones Bioquímicas de La Plata (INIBIOLP), CONICET, La Plata, Buenos Aires, Argentina
- Facultad de Ciencias Médicas, Universidad Nacional de La Plata, Calle 60 y 120. La Plata, Buenos Aires, Argentina
- Corresponding authors at: Instituto de Investigaciones Bioquímicas de La Plata (INIBIOLP), CONICET, La Plata, Buenos Aires, Argentina
| | - M. Alejandra Tricerri
- Instituto de Investigaciones Bioquímicas de La Plata (INIBIOLP), CONICET, La Plata, Buenos Aires, Argentina
- Facultad de Ciencias Médicas, Universidad Nacional de La Plata, Calle 60 y 120. La Plata, Buenos Aires, Argentina
- Corresponding authors at: Instituto de Investigaciones Bioquímicas de La Plata (INIBIOLP), CONICET, La Plata, Buenos Aires, Argentina
| |
Collapse
|
42
|
Characterization of the protein and peptide of excipient zein by the multi-enzyme digestion coupled with nano-LC-MS/MS. Food Chem 2020; 321:126712. [DOI: 10.1016/j.foodchem.2020.126712] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2019] [Revised: 03/11/2020] [Accepted: 03/29/2020] [Indexed: 12/24/2022]
|
43
|
Dau T, Bartolomucci G, Rappsilber J. Proteomics Using Protease Alternatives to Trypsin Benefits from Sequential Digestion with Trypsin. Anal Chem 2020; 92:9523-9527. [PMID: 32628831 PMCID: PMC7377536 DOI: 10.1021/acs.analchem.0c00478] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2020] [Accepted: 06/16/2020] [Indexed: 12/15/2022]
Abstract
Trypsin is the most used enzyme in proteomics. Nevertheless, proteases with complementary cleavage specificity have been applied in special circumstances. In this work, we analyzed the characteristics of five protease alternatives to trypsin for protein identification and sequence coverage when applied to S. pombe whole cell lysates. The specificity of the protease heavily impacted the number of proteins identified. Proteases with higher specificity led to the identification of more proteins than proteases with lower specificity. However, AspN, GluC, chymotrypsin, and proteinase K largely benefited from being paired with trypsin in sequential digestion, as had been shown by us for elastase before. In the most extreme case, predigesting with trypsin improves the number of identified proteins for proteinase K by 731%. Trypsin predigestion also improved the protein identifications of other proteases, AspN (+62%), GluC (+80%), and chymotrypsin (+21%). Interestingly, the sequential digest with trypsin and AspN yielded even a higher number of protein identifications than digesting with trypsin alone.
Collapse
Affiliation(s)
- Therese Dau
- Bioanalytics,
Institute of Biotechnology, Technische Universität Berlin, 13355, Berlin, Germany
- Wellcome
Centre for Cell Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, EH9 3BF, Scotland U.K.
| | - Giulia Bartolomucci
- Wellcome
Centre for Cell Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, EH9 3BF, Scotland U.K.
| | - Juri Rappsilber
- Bioanalytics,
Institute of Biotechnology, Technische Universität Berlin, 13355, Berlin, Germany
- Wellcome
Centre for Cell Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, EH9 3BF, Scotland U.K.
| |
Collapse
|
44
|
Reixachs-Solé M, Ruiz-Orera J, Albà MM, Eyras E. Ribosome profiling at isoform level reveals evolutionary conserved impacts of differential splicing on the proteome. Nat Commun 2020; 11:1768. [PMID: 32286305 PMCID: PMC7156646 DOI: 10.1038/s41467-020-15634-w] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Accepted: 03/11/2020] [Indexed: 12/14/2022] Open
Abstract
The differential production of transcript isoforms from gene loci is a key cellular mechanism. Yet, its impact in protein production remains an open question. Here, we describe ORQAS (ORF quantification pipeline for alternative splicing), a pipeline for the translation quantification of individual transcript isoforms using ribosome-protected mRNA fragments (ribosome profiling). We find evidence of translation for 40-50% of the expressed isoforms in human and mouse, with 53% of the expressed genes having more than one translated isoform in human, and 33% in mouse. Differential splicing analysis revealed that about 40% of the splicing changes at RNA level are concordant with changes in translation. Furthermore, orthologous cassette exons between human and mouse preserve the directionality of the change, and are enriched in microexons in a comparison between glia and glioma. ORQAS leverages ribosome profiling to uncover a widespread and evolutionarily conserved impact of differential splicing on translation, particularly of microexon-containing isoforms.
Collapse
Affiliation(s)
- Marina Reixachs-Solé
- The John Curtin School of Medical Research, Australian National University, Canberra, ACT, 2601, Australia
- EMBL Australia Partner Laboratory Network at the Australian National University, Canberra, ACT, 2601, Australia
| | - Jorge Ruiz-Orera
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, 13125, Germany
| | - M Mar Albà
- IMIM - Hospital del Mar Medical Research Institute, E08003, Barcelona, Spain
- Catalan Institution for Research and Advanced Studies, E08010, Barcelona, Spain
- Pompeu Fabra University, E08003, Barcelona, Spain
| | - Eduardo Eyras
- The John Curtin School of Medical Research, Australian National University, Canberra, ACT, 2601, Australia.
- EMBL Australia Partner Laboratory Network at the Australian National University, Canberra, ACT, 2601, Australia.
- IMIM - Hospital del Mar Medical Research Institute, E08003, Barcelona, Spain.
- Catalan Institution for Research and Advanced Studies, E08010, Barcelona, Spain.
| |
Collapse
|
45
|
Abstract
High-throughput sequencing-based methods and their applications in the study of transcriptomes have revolutionized our understanding of alternative splicing. Networks of functionally coordinated and biologically important alternative splicing events continue to be discovered in an ever-increasing diversity of cell types in the context of physiologically normal and disease states. These studies have been complemented by efforts directed at defining sequence codes governing splicing and their cognate trans-acting factors, which have illuminated important combinatorial principles of regulation. Additional studies have revealed critical roles of position-dependent, multivalent protein-RNA interactions that direct splicing outcomes. Investigations of evolutionary changes in RNA binding proteins, splice variants, and associated cis elements have further shed light on the emergence, mechanisms, and functions of splicing networks. Progress in these areas has emphasized the need for a coordinated, community-based effort to systematically address the functions of individual splice variants associated with normal and disease biology.
Collapse
|
46
|
Li K, Vaudel M, Zhang B, Ren Y, Wen B. PDV: an integrative proteomics data viewer. Bioinformatics 2020; 35:1249-1251. [PMID: 30169737 DOI: 10.1093/bioinformatics/bty770] [Citation(s) in RCA: 57] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2018] [Revised: 08/22/2018] [Accepted: 08/29/2018] [Indexed: 12/20/2022] Open
Abstract
SUMMARY Data visualization plays critical roles in proteomics studies, ranging from quality control of MS/MS data to validation of peptide identification results. Herein, we present PDV, an integrative proteomics data viewer that can be used to visualize a wide range of proteomics data, including database search results, de novo sequencing results, proteogenomics files, MS/MS data in mzML/mzXML format and data from public proteomics repositories. PDV is a lightweight visualization tool that enables intuitive and fast exploration of diverse, large-scale proteomics datasets on standard desktop computers in both graphical user interface and command line modes. AVAILABILITY AND IMPLEMENTATION PDV software and the user manual are freely available at http://pdv.zhang-lab.org. The source code is available at https://github.com/wenbostar/PDV and is released under the GPL-3 license. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Kai Li
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Marc Vaudel
- Department of Clinical Science, KG Jebsen Center for Diabetes Research, University of Bergen, Norway.,Center for Medical Genetics and Molecular Medicine, Haukeland University Hospital, Bergen, Norway
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Yan Ren
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| |
Collapse
|
47
|
Nikonova E, Kao SY, Spletter ML. Contributions of alternative splicing to muscle type development and function. Semin Cell Dev Biol 2020; 104:65-80. [PMID: 32070639 DOI: 10.1016/j.semcdb.2020.02.003] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Revised: 02/05/2020] [Accepted: 02/07/2020] [Indexed: 12/30/2022]
Abstract
Animals possess a wide variety of muscle types that support different kinds of movements. Different muscles have distinct locations, morphologies and contractile properties, raising the question of how muscle diversity is generated during development. Normal aging processes and muscle disorders differentially affect particular muscle types, thus understanding how muscles normally develop and are maintained provides insight into alterations in disease and senescence. As muscle structure and basic developmental mechanisms are highly conserved, many important insights into disease mechanisms in humans as well as into basic principles of muscle development have come from model organisms such as Drosophila, zebrafish and mouse. While transcriptional regulation has been characterized to play an important role in myogenesis, there is a growing recognition of the contributions of alternative splicing to myogenesis and the refinement of muscle function. Here we review our current understanding of muscle type specific alternative splicing, using examples of isoforms with distinct functions from both vertebrates and Drosophila. Future exploration of the vast potential of alternative splicing to fine-tune muscle development and function will likely uncover novel mechanisms of isoform-specific regulation and a more holistic understanding of muscle development, disease and aging.
Collapse
Affiliation(s)
- Elena Nikonova
- Biomedical Center, Department of Physiological Chemistry, Ludwig-Maximilians-Universität München, Großhaderner Str. 9, 82152 Martinsried-Planegg, Germany
| | - Shao-Yen Kao
- Biomedical Center, Department of Physiological Chemistry, Ludwig-Maximilians-Universität München, Großhaderner Str. 9, 82152 Martinsried-Planegg, Germany
| | - Maria L Spletter
- Biomedical Center, Department of Physiological Chemistry, Ludwig-Maximilians-Universität München, Großhaderner Str. 9, 82152 Martinsried-Planegg, Germany; Center for Integrated Protein Science Munich (CIPSM) at the Department of Chemistry, Ludwig-Maximilians-Universität München, Munich, Germany.
| |
Collapse
|
48
|
Ait-Hamlat A, Zea DJ, Labeeuw A, Polit L, Richard H, Laine E. Transcripts' Evolutionary History and Structural Dynamics Give Mechanistic Insights into the Functional Diversity of the JNK Family. J Mol Biol 2020; 432:2121-2140. [PMID: 32067951 DOI: 10.1016/j.jmb.2020.01.032] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Revised: 01/03/2020] [Accepted: 01/28/2020] [Indexed: 12/14/2022]
Abstract
Alternative splicing and alternative initiation/termination transcription sites have the potential to greatly expand the proteome in eukaryotes by producing several transcript isoforms from the same gene. Although these mechanisms are well described at the genomic level, little is known about their contribution to protein evolution and their impact at the protein structure level. Here, we address both issues by reconstructing the evolutionary history of transcripts and by modeling the tertiary structures of the corresponding protein isoforms. We reconstruct phylogenetic forests relating 60 protein-coding transcripts from the c-Jun N-terminal kinase (JNK) family observed in seven species. We identify two alternative splicing events of ancient origin and show that they induce subtle changes in the protein's structural dynamics. We highlight a previously uncharacterized transcript whose predicted structure seems stable in solution. We further demonstrate that orphan transcripts, for which no phylogeny could be reconstructed, display peculiar sequence and structural properties. Our approach is implemented in PhyloSofS (Phylogenies of Splicing Isoforms Structures), a fully automated computational tool freely available at https://github.com/PhyloSofS-Team/PhyloSofS.
Collapse
Affiliation(s)
- Adel Ait-Hamlat
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), Paris, 75005, France
| | - Diego Javier Zea
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), Paris, 75005, France
| | - Antoine Labeeuw
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), Paris, 75005, France
| | - Lélia Polit
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), Paris, 75005, France
| | - Hugues Richard
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), Paris, 75005, France.
| | - Elodie Laine
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), Paris, 75005, France.
| |
Collapse
|
49
|
Søreide K, Roalsø M, Aunan JR. Is There a Trojan Horse to Aggressive Pancreatic Cancer Biology? A Review of the Trypsin-PAR2 Axis to Proliferation, Early Invasion, and Metastasis. J Pancreat Cancer 2020; 6:12-20. [PMID: 32064449 PMCID: PMC7014313 DOI: 10.1089/pancan.2019.0014] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
Purpose: Pancreatic cancer is one of the most lethal of solid tumors and is associated with aggressive cancer biology. The purpose is to review the role of trypsin and effect on molecular and cellular processes potentially explaining the aggressive biology in pancreatic cancer. Methods: A narrative literature review of studies investigating trypsin and its effect on protease systems in cancer, with special reference to pancreatic cancer biology. Results: Proteases, such as trypsin, provides a significant advantage to developing tumors through the ability to remodel the extracellular matrix, promote cell invasion and migration, and facilitate angiogenesis. Trypsin is a digestive enzyme produced by the exocrine pancreas that is also related to mechanisms of proliferation, invasion and metastasis. Several of these mechanisms may be co-regulated or influenced by activation of proteinase-activated receptor 2 (PAR-2). The current role in pancreatic cancer is not clear but emerging data suggest several potential mechanisms. Trypsin may act as a Trojan horse in the pancreatic gland, facilitating several molecular pathways from the onset, which leads to rapid progression of the disease. Pancreatic cancer cell lines containing PAR-2 proliferate upon exposure to trypsin, whereas cancer cell lines not containing PAR-2 fail to proliferate upon trypsin expression. Several mechanisms of action include a proinflammatory environment, signals inducing proliferation and migration, and direct and indirect evidence for mechanisms promoting invasion and metastasis. Novel techniques (such as organoid models) and increased understanding of mechanisms (such as the microbiome) may yield improved understanding into the role of trypsin in pancreatic carcinogenesis. Conclusion: Trypsin is naturally present in the pancreatic gland and may experience pathological activation intracellularly and in the neoplastic environment, which speeds up molecular mechanisms of proliferation, invasion, and metastasis. Further investigation of these processes will provide important insights into how pancreatic cancer evolves, and suggest new ways for treatment.
Collapse
Affiliation(s)
- Kjetil Søreide
- Gastrointestinal Translational Research Unit, Laboratory for Molecular Medicine, Stavanger University Hospital, Stavanger, Norway.,Department of Gastrointestinal Surgery, HPB Unit, Stavanger University Hospital, Stavanger, Norway.,Department of Clinical Medicine, University of Bergen, Bergen, Norway
| | - Marcus Roalsø
- Gastrointestinal Translational Research Unit, Laboratory for Molecular Medicine, Stavanger University Hospital, Stavanger, Norway.,Department of Gastrointestinal Surgery, HPB Unit, Stavanger University Hospital, Stavanger, Norway.,Faculty of Health and Medicine, University of Stavanger, Stavanger, Norway
| | - Jan Rune Aunan
- Gastrointestinal Translational Research Unit, Laboratory for Molecular Medicine, Stavanger University Hospital, Stavanger, Norway.,Department of Gastrointestinal Surgery, HPB Unit, Stavanger University Hospital, Stavanger, Norway
| |
Collapse
|
50
|
Hornburg D, Kruse T, Anderl F, Daschkin C, Semper RP, Klar K, Guenther A, Mejías-Luque R, Schneiderhan-Marra N, Mann M, Meissner F, Gerhard M. A mass spectrometry guided approach for the identification of novel vaccine candidates in gram-negative pathogens. Sci Rep 2019; 9:17401. [PMID: 31758014 PMCID: PMC6874673 DOI: 10.1038/s41598-019-53493-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2019] [Accepted: 10/27/2019] [Indexed: 12/20/2022] Open
Abstract
Vaccination is the most effective method to prevent infectious diseases. However, approaches to identify novel vaccine candidates are commonly laborious and protracted. While surface proteins are suitable vaccine candidates and can elicit antibacterial antibody responses, systematic approaches to define surfomes from gram-negatives have rarely been successful. Here we developed a combined discovery-driven mass spectrometry and computational strategy to identify bacterial vaccine candidates and validate their immunogenicity using a highly prevalent gram-negative pathogen, Helicobacter pylori, as a model organism. We efficiently isolated surface antigens by enzymatic cleavage, with a design of experiment based strategy to experimentally dissect cell surface-exposed from cytosolic proteins. From a total of 1,153 quantified bacterial proteins, we thereby identified 72 surface exposed antigens and further prioritized candidates by computational homology inference within and across species. We next tested candidate-specific immune responses. All candidates were recognized in sera from infected patients, and readily induced antibody responses after vaccination of mice. The candidate jhp_0775 induced specific B and T cell responses and significantly reduced colonization levels in mouse therapeutic vaccination studies. In infected humans, we further show that jhp_0775 is immunogenic and activates IFNγ secretion from peripheral CD4+ and CD8+ T cells. Our strategy provides a generic preclinical screening, selection and validation process for novel vaccine candidates against gram-negative bacteria, which could be employed to other gram-negative pathogens.
Collapse
Affiliation(s)
- Daniel Hornburg
- Max-Planck-Institute for Biochemistry, Martinsried, Germany
- Stanford University, School of Medicine, San Francisco, USA
| | - Tobias Kruse
- Institut für Medizinische Mikrobiologie, Immunologie und Hygiene, Technische Universität München, Munich, Germany
- ImevaX GmbH, Munich, Germany
| | - Florian Anderl
- Institut für Medizinische Mikrobiologie, Immunologie und Hygiene, Technische Universität München, Munich, Germany
- ImevaX GmbH, Munich, Germany
| | - Christina Daschkin
- Institut für Medizinische Mikrobiologie, Immunologie und Hygiene, Technische Universität München, Munich, Germany
| | - Raphaela P Semper
- Institut für Medizinische Mikrobiologie, Immunologie und Hygiene, Technische Universität München, Munich, Germany
- German Center for infection research, partner site Munich, Munich, Germany
| | | | - Anna Guenther
- NMI Natural and Medical Sciences Institute, University of Tübingen, Reutlingen, Germany
| | - Raquel Mejías-Luque
- Institut für Medizinische Mikrobiologie, Immunologie und Hygiene, Technische Universität München, Munich, Germany
- German Center for infection research, partner site Munich, Munich, Germany
| | | | - Matthias Mann
- Max-Planck-Institute for Biochemistry, Martinsried, Germany
| | - Felix Meissner
- Max-Planck-Institute for Biochemistry, Martinsried, Germany.
| | - Markus Gerhard
- Institut für Medizinische Mikrobiologie, Immunologie und Hygiene, Technische Universität München, Munich, Germany.
- ImevaX GmbH, Munich, Germany.
- German Center for infection research, partner site Munich, Munich, Germany.
| |
Collapse
|