1
|
Chen G, Chen J, Qi L, Yin Y, Lin Z, Wen H, Zhang S, Xiao C, Bello SF, Zhang X, Nie Q, Luo W. Bulk and single-cell alternative splicing analyses reveal roles of TRA2B in myogenic differentiation. Cell Prolif 2024; 57:e13545. [PMID: 37705195 PMCID: PMC10849790 DOI: 10.1111/cpr.13545] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Revised: 08/08/2023] [Accepted: 08/28/2023] [Indexed: 09/15/2023] Open
Abstract
Alternative splicing (AS) disruption has been linked to disorders of muscle development, as well as muscular atrophy. However, the precise changes in AS patterns that occur during myogenesis are not well understood. Here, we employed isoform long-reads RNA-seq (Iso-seq) and single-cell RNA-seq (scRNA-seq) to investigate the AS landscape during myogenesis. Our Iso-seq data identified 61,146 full-length isoforms representing 11,682 expressed genes, of which over 52% were novel. We identified 38,022 AS events, with most of these events altering coding sequences and exhibiting stage-specific splicing patterns. We identified AS dynamics in different types of muscle cells through scRNA-seq analysis, revealing genes essential for the contractile muscle system and cytoskeleton that undergo differential splicing across cell types. Gene-splicing analysis demonstrated that AS acts as a regulator, independent of changes in overall gene expression. Two isoforms of splicing factor TRA2B play distinct roles in myogenic differentiation by triggering AS of TGFBR2 to regulate canonical TGF-β signalling cascades differently. Our study provides a valuable transcriptome resource for myogenesis and reveals the complexity of AS and its regulation during myogenesis.
Collapse
Affiliation(s)
- Genghua Chen
- College of Animal ScienceSouth China Agricultural UniversityGuangzhouGuangdongChina
- Guangdong Provincial Key Lab of Agro‐Animal Genomics and Molecular Breeding, Lingnan Guangdong Laboratory of Modern Agriculture & State Key Laboratory for Conservation and Utilization of Subtropical Agro‐Bioresources, Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of AgricultureGuangzhouGuangdongChina
- State Key Laboratory of Livestock and Poultry Breeding, and Lingnan Guangdong Laboratory of AgricultureSouth China Agricultural UniversityGuangzhouChina
| | - Jiahui Chen
- College of Animal ScienceSouth China Agricultural UniversityGuangzhouGuangdongChina
- Guangdong Provincial Key Lab of Agro‐Animal Genomics and Molecular Breeding, Lingnan Guangdong Laboratory of Modern Agriculture & State Key Laboratory for Conservation and Utilization of Subtropical Agro‐Bioresources, Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of AgricultureGuangzhouGuangdongChina
- State Key Laboratory of Livestock and Poultry Breeding, and Lingnan Guangdong Laboratory of AgricultureSouth China Agricultural UniversityGuangzhouChina
| | - Lin Qi
- College of Animal ScienceSouth China Agricultural UniversityGuangzhouGuangdongChina
- Guangdong Provincial Key Lab of Agro‐Animal Genomics and Molecular Breeding, Lingnan Guangdong Laboratory of Modern Agriculture & State Key Laboratory for Conservation and Utilization of Subtropical Agro‐Bioresources, Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of AgricultureGuangzhouGuangdongChina
- State Key Laboratory of Livestock and Poultry Breeding, and Lingnan Guangdong Laboratory of AgricultureSouth China Agricultural UniversityGuangzhouChina
| | - Yunqian Yin
- College of Animal ScienceSouth China Agricultural UniversityGuangzhouGuangdongChina
- Guangdong Provincial Key Lab of Agro‐Animal Genomics and Molecular Breeding, Lingnan Guangdong Laboratory of Modern Agriculture & State Key Laboratory for Conservation and Utilization of Subtropical Agro‐Bioresources, Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of AgricultureGuangzhouGuangdongChina
- State Key Laboratory of Livestock and Poultry Breeding, and Lingnan Guangdong Laboratory of AgricultureSouth China Agricultural UniversityGuangzhouChina
| | - Zetong Lin
- College of Animal ScienceSouth China Agricultural UniversityGuangzhouGuangdongChina
- Guangdong Provincial Key Lab of Agro‐Animal Genomics and Molecular Breeding, Lingnan Guangdong Laboratory of Modern Agriculture & State Key Laboratory for Conservation and Utilization of Subtropical Agro‐Bioresources, Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of AgricultureGuangzhouGuangdongChina
- State Key Laboratory of Livestock and Poultry Breeding, and Lingnan Guangdong Laboratory of AgricultureSouth China Agricultural UniversityGuangzhouChina
| | - Huaqiang Wen
- College of Animal ScienceSouth China Agricultural UniversityGuangzhouGuangdongChina
- Guangdong Provincial Key Lab of Agro‐Animal Genomics and Molecular Breeding, Lingnan Guangdong Laboratory of Modern Agriculture & State Key Laboratory for Conservation and Utilization of Subtropical Agro‐Bioresources, Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of AgricultureGuangzhouGuangdongChina
- State Key Laboratory of Livestock and Poultry Breeding, and Lingnan Guangdong Laboratory of AgricultureSouth China Agricultural UniversityGuangzhouChina
| | - Shuai Zhang
- College of Animal ScienceSouth China Agricultural UniversityGuangzhouGuangdongChina
- Guangdong Provincial Key Lab of Agro‐Animal Genomics and Molecular Breeding, Lingnan Guangdong Laboratory of Modern Agriculture & State Key Laboratory for Conservation and Utilization of Subtropical Agro‐Bioresources, Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of AgricultureGuangzhouGuangdongChina
- State Key Laboratory of Livestock and Poultry Breeding, and Lingnan Guangdong Laboratory of AgricultureSouth China Agricultural UniversityGuangzhouChina
| | - Chuanyun Xiao
- Human and Animal PhysiologyWageningen UniversityWageningenThe Netherlands
| | - Semiu Folaniyi Bello
- College of Animal ScienceSouth China Agricultural UniversityGuangzhouGuangdongChina
- Guangdong Provincial Key Lab of Agro‐Animal Genomics and Molecular Breeding, Lingnan Guangdong Laboratory of Modern Agriculture & State Key Laboratory for Conservation and Utilization of Subtropical Agro‐Bioresources, Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of AgricultureGuangzhouGuangdongChina
- State Key Laboratory of Livestock and Poultry Breeding, and Lingnan Guangdong Laboratory of AgricultureSouth China Agricultural UniversityGuangzhouChina
| | - Xiquan Zhang
- College of Animal ScienceSouth China Agricultural UniversityGuangzhouGuangdongChina
- Guangdong Provincial Key Lab of Agro‐Animal Genomics and Molecular Breeding, Lingnan Guangdong Laboratory of Modern Agriculture & State Key Laboratory for Conservation and Utilization of Subtropical Agro‐Bioresources, Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of AgricultureGuangzhouGuangdongChina
- State Key Laboratory of Livestock and Poultry Breeding, and Lingnan Guangdong Laboratory of AgricultureSouth China Agricultural UniversityGuangzhouChina
| | - Qinghua Nie
- College of Animal ScienceSouth China Agricultural UniversityGuangzhouGuangdongChina
- Guangdong Provincial Key Lab of Agro‐Animal Genomics and Molecular Breeding, Lingnan Guangdong Laboratory of Modern Agriculture & State Key Laboratory for Conservation and Utilization of Subtropical Agro‐Bioresources, Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of AgricultureGuangzhouGuangdongChina
- State Key Laboratory of Livestock and Poultry Breeding, and Lingnan Guangdong Laboratory of AgricultureSouth China Agricultural UniversityGuangzhouChina
| | - Wen Luo
- College of Animal ScienceSouth China Agricultural UniversityGuangzhouGuangdongChina
- Guangdong Provincial Key Lab of Agro‐Animal Genomics and Molecular Breeding, Lingnan Guangdong Laboratory of Modern Agriculture & State Key Laboratory for Conservation and Utilization of Subtropical Agro‐Bioresources, Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of AgricultureGuangzhouGuangdongChina
- State Key Laboratory of Livestock and Poultry Breeding, and Lingnan Guangdong Laboratory of AgricultureSouth China Agricultural UniversityGuangzhouChina
| |
Collapse
|
2
|
Ilgisonis E, Lisitsa A, Kudryavtseva V, Ponomarenko E. Creation of Individual Scientific Concept-Centered Semantic Maps Based on Automated Text-Mining Analysis of PubMed. Adv Bioinformatics 2018; 2018:4625394. [PMID: 30147721 PMCID: PMC6083525 DOI: 10.1155/2018/4625394] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2018] [Accepted: 07/05/2018] [Indexed: 01/22/2023] Open
Abstract
Concept-centered semantic maps were created based on a text-mining analysis of PubMed using the BiblioEngine_v2018 software. The objects ("concepts") of a semantic map can be MeSH-terms or other terms (names of proteins, diseases, chemical compounds, etc.) structured in the form of controlled vocabularies. The edges between the two objects were automatically calculated based on the index of semantic similarity, which is proportional to the number of publications related to both objects simultaneously. On the one hand, an individual semantic map created based on the already published papers allows us to trace scientific inquiry. On the other hand, a prospective analysis based on the study of PubMed search history enables us to determine the possible directions for future research.
Collapse
|
3
|
Peffers MJ, Goljanek-Whysall K, Collins J, Fang Y, Rushton M, Loughlin J, Proctor C, Clegg PD. Decoding the Regulatory Landscape of Ageing in Musculoskeletal Engineered Tissues Using Genome-Wide DNA Methylation and RNASeq. PLoS One 2016; 11:e0160517. [PMID: 27533049 PMCID: PMC4988628 DOI: 10.1371/journal.pone.0160517] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2016] [Accepted: 07/20/2016] [Indexed: 12/11/2022] Open
Abstract
Mesenchymal stem cells (MSC) are capable of multipotent differentiation into connective tissues and as such are an attractive source for autologous cell-based regenerative medicine and tissue engineering. Epigenetic mechanisms, like DNA methylation, contribute to the changes in gene expression in ageing. However there was a lack of sufficient knowledge of the role that differential methylation plays during chondrogenic, osteogenic and tenogenic differentiation from ageing MSCs. This study undertook genome level determination of the effects of DNA methylation on expression in engineered tissues from chronologically aged MSCs. We compiled unique DNA methylation signatures from chondrogenic, osteogenic, and tenogenic engineered tissues derived from young; n = 4 (21.8 years ± 2.4 SD) and old; n = 4 (65.5 years±8.3SD) human MSCs donors using the Illumina HumanMethylation 450 Beadchip arrays and compared these to gene expression by RNA sequencing. Unique and common signatures of global DNA methylation were identified. There were 201, 67 and 32 chondrogenic, osteogenic and tenogenic age-related DE protein-coding genes respectively. Findings inferred the nature of the transcript networks was predominantly for 'cell death and survival', 'cell morphology', and 'cell growth and proliferation'. Further studies are required to validate if this gene expression effect translates to cell events. Alternative splicing (AS) was dysregulated in ageing with 119, 21 and 9 differential splicing events identified in chondrogenic, osteogenic and tenogenic respectively, and enrichment in genes associated principally with metabolic processes. Gene ontology analysis of differentially methylated loci indicated age-related enrichment for all engineered tissue types in 'skeletal system morphogenesis', 'regulation of cell proliferation' and 'regulation of transcription' suggesting that dynamic epigenetic modifications may occur in genes associated with shared and distinct pathways dependent upon engineered tissue type. An altered phenotype in engineered tissues was observed with ageing at numerous levels. These changes represent novel insights into the ageing process, with implications for stem cell therapies in older patients. In addition we have identified a number of tissue-dependant pathways, which warrant further studies.
Collapse
Affiliation(s)
- Mandy Jayne Peffers
- Institute of Ageing and Chronic Disease, University of Liverpool, Leahurst, Chester High Road, Neston, Wirral, UK, CH64 7TE
| | - Katarzyna Goljanek-Whysall
- Institute of Ageing and Chronic Disease, University of Liverpool, Leahurst, Chester High Road, Neston, Wirral, UK, CH64 7TE
| | - John Collins
- Thurston Arthritis Research Centre, School Of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA, 27599
| | - Yongxiang Fang
- Centre for Genomic Research, Institute of Integrative Biology, Biosciences Building, Crown Street, University of Liverpool, Liverpool, UK, L69 7ZB
| | - Michael Rushton
- Musculoskeletal Research Group, Institute of Cellular Medicine, Newcastle University, Newcastle upon Tyne, UK, NE2 4HH
| | - John Loughlin
- Musculoskeletal Research Group, Institute of Cellular Medicine, Newcastle University, Newcastle upon Tyne, UK, NE2 4HH
| | - Carole Proctor
- Musculoskeletal Research Group, Institute of Cellular Medicine, Newcastle University, Newcastle upon Tyne, UK, NE2 4HH
- Newcastle University Institute for Ageing, Newcastle University, Campus for Ageing and Vitality, Newcastle upon Tyne, UK, NE4 5PL
| | - Peter David Clegg
- Institute of Ageing and Chronic Disease, University of Liverpool, Leahurst, Chester High Road, Neston, Wirral, UK, CH64 7TE
| |
Collapse
|
4
|
Li HD, Omenn GS, Guan Y. A proteogenomic approach to understand splice isoform functions through sequence and expression-based computational modeling. Brief Bioinform 2016; 17:1024-1031. [PMID: 26740460 DOI: 10.1093/bib/bbv109] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2015] [Revised: 11/03/2015] [Indexed: 01/23/2023] Open
Abstract
The products of multi-exon genes are a mixture of alternatively spliced isoforms, from which the translated proteins can have similar, different or even opposing functions. It is therefore essential to differentiate and annotate functions for individual isoforms. Computational approaches provide an efficient complement to expensive and time-consuming experimental studies. The input data of these methods range from DNA sequence, to RNA selection pressure, to expressed sequence tags, to full-length complementary DNA, to exon array, to RNA-seq expression, to proteomic data. Notably, RNA-seq technology generates quantitative profiling of transcript expression at the genome scale, with an unprecedented amount of expression data available for developing isoform function prediction methods. Integrative analysis of these data at different molecular levels enables a proteogenomic approach to systematically interrogate isoform functions. Here, we briefly review the state-of-the-art methods according to their input data sources, discuss their advantages and limitations and point out potential ways to improve prediction accuracies.
Collapse
|
5
|
Study of the activated macrophage transcriptome. Exp Mol Pathol 2015; 99:575-80. [PMID: 26439118 DOI: 10.1016/j.yexmp.2015.09.014] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2015] [Accepted: 09/30/2015] [Indexed: 11/22/2022]
Abstract
Transcriptome analysis is a powerful modern tool to study possible alterations of gene expression associated with human diseases. It turns out to be especially promising for evaluation of gene expression changes in immunopathology, as immune cells have flexible gene expression patterns that can be switched in response to infection, inflammatory stimuli and exposure to various cytokines. In particular, macrophage polarization towards pro-inflammatory (M1) and anti-inflammatory (M2) phenotypes can be successfully studied using the modern transcriptome analysis approaches. The two mostly used techniques for transcriptome analysis are microarray and next generation sequencing. In this review we will provide an overview of known gene expression changes associated with immunopathology and discuss the advantage and limitations of different methods of transcriptome analysis.
Collapse
|
6
|
Li HD, Menon R, Govindarajoo B, Panwar B, Zhang Y, Omenn GS, Guan Y. Functional Networks of Highest-Connected Splice Isoforms: From The Chromosome 17 Human Proteome Project. J Proteome Res 2015. [PMID: 26216192 DOI: 10.1021/acs.jproteome.5b00494] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Alternative splicing allows a single gene to produce multiple transcript-level splice isoforms from which the translated proteins may show differences in their expression and function. Identifying the major functional or canonical isoform is important for understanding gene and protein functions. Identification and characterization of splice isoforms is a stated goal of the HUPO Human Proteome Project and of neXtProt. Multiple efforts have catalogued splice isoforms as "dominant", "principal", or "major" isoforms based on expression or evolutionary traits. In contrast, we recently proposed highest connected isoforms (HCIs) as a new class of canonical isoforms that have the strongest interactions in a functional network and revealed their significantly higher (differential) transcript-level expression compared to nonhighest connected isoforms (NCIs) regardless of tissues/cell lines in the mouse. HCIs and their expression behavior in the human remain unexplored. Here we identified HCIs for 6157 multi-isoform genes using a human isoform network that we constructed by integrating a large compendium of heterogeneous genomic data. We present examples for pairs of transcript isoforms of ABCC3, RBM34, ERBB2, and ANXA7. We found that functional networks of isoforms of the same gene can show large differences. Interestingly, differential expression between HCIs and NCIs was also observed in the human on an independent set of 940 RNA-seq samples across multiple tissues, including heart, kidney, and liver. Using proteomic data from normal human retina and placenta, we showed that HCIs are a promising indicator of expressed protein isoforms exemplified by NUDFB6 and M6PR. Furthermore, we found that a significant percentage (20%, p = 0.0003) of human and mouse HCIs are homologues, suggesting their conservation between species. Our identified HCIs expand the repertoire of canonical isoforms and are expected to facilitate studying main protein products, understanding gene regulation, and possibly evolution. The network is available through our web server as a rich resource for investigating isoform functional relationships (http://guanlab.ccmb.med.umich.edu/hisonet). All MS/MS data were available at ProteomeXchange Web site (http://www.proteomexchange.org) through their identifiers (retina: PXD001242, placenta: PXD000754).
Collapse
Affiliation(s)
- Hong-Dong Li
- Department of Computational Medicine and Bioinformatics, ‡Department of Internal Medicine, §Department of Human Genetics and School of Public Health, ∥Department of Electrical Engineering and Computer Science University of Michigan , Ann Arbor, Michigan 48109, United States
| | - Rajasree Menon
- Department of Computational Medicine and Bioinformatics, ‡Department of Internal Medicine, §Department of Human Genetics and School of Public Health, ∥Department of Electrical Engineering and Computer Science University of Michigan , Ann Arbor, Michigan 48109, United States
| | - Brandon Govindarajoo
- Department of Computational Medicine and Bioinformatics, ‡Department of Internal Medicine, §Department of Human Genetics and School of Public Health, ∥Department of Electrical Engineering and Computer Science University of Michigan , Ann Arbor, Michigan 48109, United States
| | - Bharat Panwar
- Department of Computational Medicine and Bioinformatics, ‡Department of Internal Medicine, §Department of Human Genetics and School of Public Health, ∥Department of Electrical Engineering and Computer Science University of Michigan , Ann Arbor, Michigan 48109, United States
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, ‡Department of Internal Medicine, §Department of Human Genetics and School of Public Health, ∥Department of Electrical Engineering and Computer Science University of Michigan , Ann Arbor, Michigan 48109, United States
| | - Gilbert S Omenn
- Department of Computational Medicine and Bioinformatics, ‡Department of Internal Medicine, §Department of Human Genetics and School of Public Health, ∥Department of Electrical Engineering and Computer Science University of Michigan , Ann Arbor, Michigan 48109, United States
| | - Yuanfang Guan
- Department of Computational Medicine and Bioinformatics, ‡Department of Internal Medicine, §Department of Human Genetics and School of Public Health, ∥Department of Electrical Engineering and Computer Science University of Michigan , Ann Arbor, Michigan 48109, United States
| |
Collapse
|
7
|
Krasnov GS, Dmitriev AA, Kudryavtseva AV, Shargunov AV, Karpov DS, Uroshlev LA, Melnikova NV, Blinov VM, Poverennaya EV, Archakov AI, Lisitsa AV, Ponomarenko EA. PPLine: An Automated Pipeline for SNP, SAP, and Splice Variant Detection in the Context of Proteogenomics. J Proteome Res 2015; 14:3729-37. [DOI: 10.1021/acs.jproteome.5b00490] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- George Sergeevich Krasnov
- Engelhardt
Institute of Molecular Biology, Russian Academy of Sciences, Moscow, 111991 Russia
- Orekhovich
Institute of Biomedical Chemistry, Russian Academy of Medical Sciences, Moscow, 119121 Russia
- Mechnikov Research Institute of Vaccines and Sera, Moscow, 105064 Russia
| | | | - Anna Viktorovna Kudryavtseva
- Engelhardt
Institute of Molecular Biology, Russian Academy of Sciences, Moscow, 111991 Russia
- Herzen
Moscow Cancer Research Institute, Ministry of Healthcare of the Russian Federation, Moscow, 125284 Russia
| | - Alexander Valerievich Shargunov
- Orekhovich
Institute of Biomedical Chemistry, Russian Academy of Medical Sciences, Moscow, 119121 Russia
- Mechnikov Research Institute of Vaccines and Sera, Moscow, 105064 Russia
| | - Dmitry Sergeevich Karpov
- Engelhardt
Institute of Molecular Biology, Russian Academy of Sciences, Moscow, 111991 Russia
- Orekhovich
Institute of Biomedical Chemistry, Russian Academy of Medical Sciences, Moscow, 119121 Russia
| | | | | | - Vladimir Mikhailovich Blinov
- Orekhovich
Institute of Biomedical Chemistry, Russian Academy of Medical Sciences, Moscow, 119121 Russia
- Mechnikov Research Institute of Vaccines and Sera, Moscow, 105064 Russia
| | | | | | - Andrey Valerievich Lisitsa
- Orekhovich
Institute of Biomedical Chemistry, Russian Academy of Medical Sciences, Moscow, 119121 Russia
| | | |
Collapse
|
8
|
Horvatovich P, Lundberg EK, Chen YJ, Sung TY, He F, Nice EC, Goode RJ, Yu S, Ranganathan S, Baker MS, Domont GB, Velasquez E, Li D, Liu S, Wang Q, He QY, Menon R, Guan Y, Corrales FJ, Segura V, Casal JI, Pascual-Montano A, Albar JP, Fuentes M, Gonzalez-Gonzalez M, Diez P, Ibarrola N, Degano RM, Mohammed Y, Borchers CH, Urbani A, Soggiu A, Yamamoto T, Salekdeh GH, Archakov A, Ponomarenko E, Lisitsa A, Lichti CF, Mostovenko E, Kroes RA, Rezeli M, Végvári Á, Fehniger TE, Bischoff R, Vizcaíno JA, Deutsch EW, Lane L, Nilsson CL, Marko-Varga G, Omenn GS, Jeong SK, Lim JS, Paik YK, Hancock WS. Quest for Missing Proteins: Update 2015 on Chromosome-Centric Human Proteome Project. J Proteome Res 2015; 14:3415-31. [DOI: 10.1021/pr5013009] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Affiliation(s)
- Péter Horvatovich
- Analytical
Biochemistry, Department of Pharmacy, University of Groningen, A. Deusinglaan
1, 9713 AV Groningen, The Netherlands
| | - Emma K. Lundberg
- Science
for Life Laboratory, KTH - Royal Institute of Technology, SE-171 21 Stockholm, Sweden
| | - Yu-Ju Chen
- Institute
of Chemistry, Academia Sinica, 128 Academia Road Sec. 2, Taipei 115, Taiwan
| | - Ting-Yi Sung
- Institute
of Information Science, Academia Sinica, 128 Academia Road Sec. 2, Taipei 115, Taiwan
| | - Fuchu He
- The State Key Laboratory of Proteomics, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine, No. 27 Taiping Road, Haidian District, Beijing 100850, China
| | - Edouard C. Nice
- Department
of Biochemistry and Molecular Biology, Monash University, Clayton, Victoria 3800, Australia
| | - Robert J. Goode
- Department
of Biochemistry and Molecular Biology, Monash University, Clayton, Victoria 3800, Australia
| | - Simon Yu
- Department
of Biochemistry and Molecular Biology, Monash University, Clayton, Victoria 3800, Australia
| | - Shoba Ranganathan
- Department
of Chemistry and Biomolecular Sciences and ARC Centre of Excellence
in Bioinformatics, Macquarie University, Sydney, New South Wales 2109, Australia
| | - Mark S. Baker
- Australian
School of Advanced Medicine, Macquarie University, Sydney, NSW 2109, Australia
| | - Gilberto B. Domont
- Proteomics Unit, Institute of Chemistry, Federal University of Rio de Janeiro, Cidade Universitária, Av Athos da Silveira Ramos 149, CT-A542, 21941-909 Rio de Janeriro, Rj, Brazil
| | - Erika Velasquez
- Proteomics Unit, Institute of Chemistry, Federal University of Rio de Janeiro, Cidade Universitária, Av Athos da Silveira Ramos 149, CT-A542, 21941-909 Rio de Janeriro, Rj, Brazil
| | - Dong Li
- The State Key Laboratory of Proteomics, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine, No. 27 Taiping Road, Haidian District, Beijing 100850, China
| | - Siqi Liu
- Beijing Institute of Genomics and BGI Shenzhen, No. 1 Beichen West Road, Chaoyang District, Beijing 100101, China
- BGI Shenzhen, Beishan Road, Yantian District, Shenzhen, 518083, China
| | - Quanhui Wang
- Beijing Institute of Genomics and BGI Shenzhen, No. 1 Beichen West Road, Chaoyang District, Beijing 100101, China
| | - Qing-Yu He
- Key Laboratory of Functional Protein
Research of Guangdong
Higher Education Institutes, College of Life Science and Technology, Jinan University, Guangzhou 510632, China
| | - Rajasree Menon
- Department of Computational Medicine & Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, Michigan 48109-2218, United States
| | - Yuanfang Guan
- Departments of Computational Medicine & Bioinformatics and Computer Sciences, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, Michigan 48109-2218, United States
| | - Fernando J. Corrales
- ProteoRed-ISCIII,
Biomolecular and Bioinformatics Resources Platform (PRB2), Spanish
Consortium of C-HPP (Chr-16), CIMA, University of Navarra, 31008 Pamplona, Spain
- Chr16 SpHPP Consortium, CIMA, University of Navarra, 31008 Pamplona, Spain
| | - Victor Segura
- ProteoRed-ISCIII,
Biomolecular and Bioinformatics Resources Platform (PRB2), Spanish
Consortium of C-HPP (Chr-16), CIMA, University of Navarra, 31008 Pamplona, Spain
- Chr16 SpHPP Consortium, CIMA, University of Navarra, 31008 Pamplona, Spain
| | - J. Ignacio Casal
- Department
of Cellular and Molecular Medicine, Centro de Investigaciones Biológicas (CIB-CSIC), 28040 Madrid, Spain
| | | | - Juan P. Albar
- Centro Nacional de Biotecnologia (CNB-CSIC), Cantoblanco, 28049 Madrid, Spain
| | - Manuel Fuentes
- Cancer
Research Center. Proteomics Unit and General Service of Cytometry,
Department of Medicine, University of Salmanca-CSIC, IBSAL, Campus Miguel de Unamuno
s/n, 37007 Salamanca, Spain
| | - Maria Gonzalez-Gonzalez
- Cancer
Research Center. Proteomics Unit and General Service of Cytometry,
Department of Medicine, University of Salmanca-CSIC, IBSAL, Campus Miguel de Unamuno
s/n, 37007 Salamanca, Spain
| | - Paula Diez
- Cancer
Research Center. Proteomics Unit and General Service of Cytometry,
Department of Medicine, University of Salmanca-CSIC, IBSAL, Campus Miguel de Unamuno
s/n, 37007 Salamanca, Spain
| | - Nieves Ibarrola
- Cancer
Research Center. Proteomics Unit and General Service of Cytometry,
Department of Medicine, University of Salmanca-CSIC, IBSAL, Campus Miguel de Unamuno
s/n, 37007 Salamanca, Spain
| | - Rosa M. Degano
- Cancer
Research Center. Proteomics Unit and General Service of Cytometry,
Department of Medicine, University of Salmanca-CSIC, IBSAL, Campus Miguel de Unamuno
s/n, 37007 Salamanca, Spain
| | - Yassene Mohammed
- University of Victoria-Genome British Columbia Proteomics
Centre, Vancouver Island
Technology Park, #3101−4464 Markham Street, Victoria, British Columbia V8Z 7X8, Canada
- Center
for Proteomics and Metabolomics, Leiden University Medical Center, 2333 ZA Leiden, The Netherlands
| | - Christoph H. Borchers
- University of Victoria-Genome British Columbia Proteomics
Centre, Vancouver Island
Technology Park, #3101−4464 Markham Street, Victoria, British Columbia V8Z 7X8, Canada
| | - Andrea Urbani
- Proteomics
and Metabonomic, Laboratory, Fondazione Santa Lucia, Rome, Italy
- Department
of Experimental Medicine and Surgery, University of Rome “Tor Vergata”, Rome, Italy
| | - Alessio Soggiu
- Department
of Veterinary Science and Public Health (DIVET), University of Milano, via Celoria 10, 20133 Milano, Italy
| | - Tadashi Yamamoto
- Institute
of Nephrology, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan
| | - Ghasem Hosseini Salekdeh
- Department of Molecular Systems Biology at Cell Science Research Center, Royan Institute for Stem Cell Biology and Technology, ACECR, Tehran, Iran
- Department of Systems Biology, Agricultural Biotechnology Research Institute of Iran, Karaj, Iran
| | | | | | - Andrey Lisitsa
- Orechovich Institute of Biomedical Chemistry, Moscow, Russia
| | - Cheryl F. Lichti
- Department
of Pharmacology and Toxicology, The University of Texas Medical Branch, Galveston, Texas 77555-0617, United States
| | - Ekaterina Mostovenko
- Department
of Pharmacology and Toxicology, The University of Texas Medical Branch, Galveston, Texas 77555-0617, United States
| | - Roger A. Kroes
- Falk Center for Molecular Therapeutics, Department of Biomedical Engineering, Northwestern University, 1801 Maple Ave., Suite 4300, Evanston, Illinois 60201, United States
| | - Melinda Rezeli
- Clinical Protein Science & Imaging, Department of Biomedical Engineering, Lund University, BMC D13, 221 84 Lund, Sweden
| | - Ákos Végvári
- Clinical Protein Science & Imaging, Department of Biomedical Engineering, Lund University, BMC D13, 221 84 Lund, Sweden
| | - Thomas E. Fehniger
- Clinical Protein Science & Imaging, Department of Biomedical Engineering, Lund University, BMC D13, 221 84 Lund, Sweden
| | - Rainer Bischoff
- Analytical
Biochemistry, Department of Pharmacy, University of Groningen, A. Deusinglaan
1, 9713 AV Groningen, The Netherlands
| | - Juan Antonio Vizcaíno
- European Molecular
Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, CB10 1SD, Hinxton, Cambridge, United Kingdom
| | - Eric W. Deutsch
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, Washington 98109, United States
| | - Lydie Lane
- SIB Swiss Institute of Bioinformatics, Geneva, Switzerland
- Department
of Human Protein Science, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| | - Carol L. Nilsson
- Department
of Pharmacology and Toxicology, The University of Texas Medical Branch, Galveston, Texas 77555-0617, United States
| | - György Marko-Varga
- Clinical Protein Science & Imaging, Department of Biomedical Engineering, Lund University, BMC D13, 221 84 Lund, Sweden
| | - Gilbert S. Omenn
- Departments of Computational Medicine & Bioinformatics, Internal Medicine, Human Genetics and School of Public Health, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, Michigan 48109-2218, United States
| | - Seul-Ki Jeong
- Departments of Integrated Omics for Biomedical Science & Biochemistry, College of Life Science and Technology, Yonsei Proteome Research Center, Yonsei University, Seoul, 120-749, Korea
| | - Jong-Sun Lim
- Departments of Integrated Omics for Biomedical Science & Biochemistry, College of Life Science and Technology, Yonsei Proteome Research Center, Yonsei University, Seoul, 120-749, Korea
| | - Young-Ki Paik
- Departments of Integrated Omics for Biomedical Science & Biochemistry, College of Life Science and Technology, Yonsei Proteome Research Center, Yonsei University, Seoul, 120-749, Korea
| | - William S. Hancock
- The
Barnett Institute of Chemical and Biological Analysis, Northeastern University, 140 The Fenway, Boston, Massachusetts 02115, United States
| |
Collapse
|
9
|
Tay AP, Pang CNI, Twine NA, Hart-Smith G, Harkness L, Kassem M, Wilkins MR. Proteomic Validation of Transcript Isoforms, Including Those Assembled from RNA-Seq Data. J Proteome Res 2015; 14:3541-54. [PMID: 25961807 DOI: 10.1021/pr5011394] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Human proteome analysis now requires an understanding of protein isoforms. We recently published the PG Nexus pipeline, which facilitates high confidence validation of exons and splice junctions by integrating genomics and proteomics data. Here we comprehensively explore how RNA-seq transcriptomics data, and proteomic analysis of the same sample, can identify protein isoforms. RNA-seq data from human mesenchymal (hMSC) stem cells were analyzed with our new TranscriptCoder tool to generate a database of protein isoform sequences. MS/MS data from matching hMSC samples were then matched against the TranscriptCoder-derived database, along with Ensembl and the neXtProt database. Querying the TranscriptCoder-derived or Ensembl database could unambiguously identify ∼450 protein isoforms, with isoform-specific proteotypic peptides, including candidate hMSC-specific isoforms for the genes DPYSL2 and FXR1. Where isoform-specific peptides did not exist, groups of nonisoform-specific proteotypic peptides could specifically identify many isoforms. In both the above cases, isoforms will be detectable with targeted MS/MS assays. Unfortunately, our analysis also revealed that some isoforms will be difficult to identify unambiguously as they do not have peptides that are sufficiently distinguishing. We covisualize mRNA isoforms and peptides in a genome browser to illustrate the above situations. Mass spectrometry data is available via ProteomeXchange (PXD001449).
Collapse
Affiliation(s)
- Aidan P Tay
- Systems Biology Initiative, The University of New South Wales , Sydney, New South Wales 2052, Australia.,School of Biotechnology and Biomolecular Sciences, The University of New South Wales , Sydney, New South Wales 2052, Australia
| | - Chi Nam Ignatius Pang
- Systems Biology Initiative, The University of New South Wales , Sydney, New South Wales 2052, Australia.,School of Biotechnology and Biomolecular Sciences, The University of New South Wales , Sydney, New South Wales 2052, Australia
| | - Natalie A Twine
- Systems Biology Initiative, The University of New South Wales , Sydney, New South Wales 2052, Australia.,School of Biotechnology and Biomolecular Sciences, The University of New South Wales , Sydney, New South Wales 2052, Australia
| | - Gene Hart-Smith
- Systems Biology Initiative, The University of New South Wales , Sydney, New South Wales 2052, Australia.,School of Biotechnology and Biomolecular Sciences, The University of New South Wales , Sydney, New South Wales 2052, Australia
| | - Linda Harkness
- Endocrine Research Laboratory (KMEB), Department of Endocrinology and Metabolism, Odense University Hospital & University of Southern Denmark , Odense 5230, Denmark
| | - Moustapha Kassem
- Endocrine Research Laboratory (KMEB), Department of Endocrinology and Metabolism, Odense University Hospital & University of Southern Denmark , Odense 5230, Denmark
| | - Marc R Wilkins
- Systems Biology Initiative, The University of New South Wales , Sydney, New South Wales 2052, Australia.,School of Biotechnology and Biomolecular Sciences, The University of New South Wales , Sydney, New South Wales 2052, Australia
| |
Collapse
|
10
|
Quintáns B, Ordóñez-Ugalde A, Cacheiro P, Carracedo A, Sobrido MJ. Medical genomics: The intricate path from genetic variant identification to clinical interpretation. Appl Transl Genom 2014; 3:60-7. [PMID: 27284505 PMCID: PMC4887840 DOI: 10.1016/j.atg.2014.06.001] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2014] [Accepted: 06/02/2014] [Indexed: 01/23/2023]
Abstract
The field of medical genomics involves translating high throughput genetic methods to the clinic, in order to improve diagnostic efficiency and treatment decision making. Technical questions related to sample enrichment, sequencing methodologies and variant identification and calling algorithms, still need careful investigation in order to validate the analytical step of next generation sequencing techniques for clinical applications. However, the main foreseeable challenge will be interpreting the clinical significance of the variants observed in a given patient, as well as their significance for family members and for other patients. Every step in the variant interpretation process has limitations and difficulties, and its quote of contribution to false positive and false negative results. There is no single piece of evidence enough on its own to make firm conclusions on the pathogenicity and disease causality of a given variant. A plethora of automated analysis software tools is being developed that will enhance efficiency and accuracy. However a risk of misinterpretation could derive from biased biorepository content, facilitated by annotation of variant functional consequences using previous datasets stored in the same or linked repositories. In order to improve variant interpretation and avoid an exponential accumulation of confounding noise in the medical literature, the use of terms in a standard way should be sought and requested when reporting genetic variants and their consequences. Generally, stepwise and linear interpretation processes are likely to overrate some pieces of evidence while underscoring others. Algorithms are needed that allow a multidimensional, parallel analysis of diverse lines of evidence to be carried out by expert teams for specific genes, cellular pathways or disorders.
Collapse
Affiliation(s)
- B Quintáns
- Fundación Pública Galega de Medicina Xenómica and Instituto de Investigación Sanitaria, SERGAS, Santiago de Compostela, Spain; Centro para Investigación Biomédica en red de Enfermedades Raras (CIBERER), Instituto de Salud Carlos III, Spain
| | - A Ordóñez-Ugalde
- Fundación Pública Galega de Medicina Xenómica and Instituto de Investigación Sanitaria, SERGAS, Santiago de Compostela, Spain; Universidade de Santiago de Compostela, Spain
| | - P Cacheiro
- Fundación Pública Galega de Medicina Xenómica and Instituto de Investigación Sanitaria, SERGAS, Santiago de Compostela, Spain; Universidade de Santiago de Compostela, Spain
| | - A Carracedo
- Fundación Pública Galega de Medicina Xenómica and Instituto de Investigación Sanitaria, SERGAS, Santiago de Compostela, Spain; Centro para Investigación Biomédica en red de Enfermedades Raras (CIBERER), Instituto de Salud Carlos III, Spain; Universidade de Santiago de Compostela, Spain
| | - M J Sobrido
- Fundación Pública Galega de Medicina Xenómica and Instituto de Investigación Sanitaria, SERGAS, Santiago de Compostela, Spain; Centro para Investigación Biomédica en red de Enfermedades Raras (CIBERER), Instituto de Salud Carlos III, Spain
| |
Collapse
|