1
|
da Silva EMG, Rebello KM, Choi YJ, Gregorio V, Paschoal AR, Mitreva M, McKerrow JH, Neves-Ferreira AGDC, Passetti F. Identification of Novel Genes and Proteoforms in Angiostrongylus costaricensis through a Proteogenomic Approach. Pathogens 2022; 11:1273. [PMID: 36365024 PMCID: PMC9694666 DOI: 10.3390/pathogens11111273] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Revised: 10/15/2022] [Accepted: 10/20/2022] [Indexed: 07/22/2023] Open
Abstract
RNA sequencing (RNA-Seq) and mass-spectrometry-based proteomics data are often integrated in proteogenomic studies to assist in the prediction of eukaryote genome features, such as genes, splicing, single-nucleotide (SNVs), and single-amino-acid variants (SAAVs). Most genomes of parasite nematodes are draft versions that lack transcript- and protein-level information and whose gene annotations rely only on computational predictions. Angiostrongylus costaricensis is a roundworm species that causes an intestinal inflammatory disease, known as abdominal angiostrongyliasis (AA). Currently, there is no drug available that acts directly on this parasite, mostly due to the sparse understanding of its molecular characteristics. The available genome of A. costaricensis, specific to the Costa Rica strain, is a draft version that is not supported by transcript- or protein-level evidence. This study used RNA-Seq and MS/MS data to perform an in-depth annotation of the A. costaricensis genome. Our prediction improved the reference annotation with (a) novel coding and non-coding genes; (b) pieces of evidence of alternative splicing generating new proteoforms; and (c) a list of SNVs between the Brazilian (Crissiumal) and the Costa Rica strain. To the best of our knowledge, this is the first time that a multi-omics approach has been used to improve the genome annotation of A. costaricensis. We hope this improved genome annotation can assist in the future development of drugs, kits, and vaccines to treat, diagnose, and prevent AA caused by either the Brazil strain (Crissiumal) or the Costa Rica strain.
Collapse
Affiliation(s)
- Esdras Matheus Gomes da Silva
- Instituto Carlos Chagas, Fiocruz, Curitiba 81350-010, PR, Brazil
- Laboratory of Toxinology, Oswaldo Cruz Institute, Fiocruz, Rio de Janeiro 21040-900, RJ, Brazil
| | - Karina Mastropasqua Rebello
- Laboratory of Toxinology, Oswaldo Cruz Institute, Fiocruz, Rio de Janeiro 21040-900, RJ, Brazil
- Laboratory of Integrated Studies in Protozoology, Oswaldo Cruz Institute, Fiocruz, Rio de Janeiro 21040-360, RJ, Brazil
| | - Young-Jun Choi
- Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Vitor Gregorio
- Bioinformatics and Pattern Recognition Group (Bioinfo-CP), Department of Computer Science (DACOM), Federal University of Technology-Parana (UTFPR), Cornélio Procópio 86300-000, PR, Brazil
| | - Alexandre Rossi Paschoal
- Bioinformatics and Pattern Recognition Group (Bioinfo-CP), Department of Computer Science (DACOM), Federal University of Technology-Parana (UTFPR), Cornélio Procópio 86300-000, PR, Brazil
| | - Makedonka Mitreva
- Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - James H. McKerrow
- Center for Discovery and Innovation in Parasitic Diseases, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, CA 92093, USA
| | | | - Fabio Passetti
- Instituto Carlos Chagas, Fiocruz, Curitiba 81350-010, PR, Brazil
| |
Collapse
|
2
|
Hamid Z, Zimmerman KD, Guillen-Ahlers H, Li C, Nathanielsz P, Cox LA, Olivier M. Assessment of label-free quantification and missing value imputation for proteomics in non-human primates. BMC Genomics 2022; 23:496. [PMID: 35804317 PMCID: PMC9264528 DOI: 10.1186/s12864-022-08723-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2021] [Accepted: 06/23/2022] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND Reliable and effective label-free quantification (LFQ) analyses are dependent not only on the method of data acquisition in the mass spectrometer, but also on the downstream data processing, including software tools, query database, data normalization and imputation. In non-human primates (NHP), LFQ is challenging because the query databases for NHP are limited since the genomes of these species are not comprehensively annotated. This invariably results in limited discovery of proteins and associated Post Translational Modifications (PTMs) and a higher fraction of missing data points. While identification of fewer proteins and PTMs due to database limitations can negatively impact uncovering important and meaningful biological information, missing data also limits downstream analyses (e.g., multivariate analyses), decreases statistical power, biases statistical inference, and makes biological interpretation of the data more challenging. In this study we attempted to address both issues: first, we used the MetaMorphues proteomics search engine to counter the limits of NHP query databases and maximize the discovery of proteins and associated PTMs, and second, we evaluated different imputation methods for accurate data inference. We used a generic approach for missing data imputation analysis without distinguising the potential source of missing data (either non-assigned m/z or missing values across runs). RESULTS Using the MetaMorpheus proteomics search engine we obtained quantitative data for 1622 proteins and 10,634 peptides including 58 different PTMs (biological, metal and artifacts) across a diverse age range of NHP brain frontal cortex. However, among the 1622 proteins identified, only 293 proteins were quantified across all samples with no missing values, emphasizing the importance of implementing an accurate and statiscaly valid imputation method to fill in missing data. In our imputation analysis we demonstrate that Single Imputation methods that borrow information from correlated proteins such as Generalized Ridge Regression (GRR), Random Forest (RF), local least squares (LLS), and a Bayesian Principal Component Analysis methods (BPCA), are able to estimate missing protein abundance values with great accuracy. CONCLUSIONS Overall, this study offers a detailed comparative analysis of LFQ data generated in NHP and proposes strategies for improved LFQ in NHP proteomics data.
Collapse
Affiliation(s)
- Zeeshan Hamid
- Center for Precision Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - Kip D Zimmerman
- Center for Precision Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - Hector Guillen-Ahlers
- Center for Precision Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - Cun Li
- Southwest National Primate Research Center, San Antonio, TX, USA
- Department of Animal Science, University of Wyoming, Laramie, WY, USA
| | - Peter Nathanielsz
- Southwest National Primate Research Center, San Antonio, TX, USA
- Department of Animal Science, University of Wyoming, Laramie, WY, USA
| | - Laura A Cox
- Center for Precision Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA
- Southwest National Primate Research Center, San Antonio, TX, USA
| | - Michael Olivier
- Center for Precision Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA.
| |
Collapse
|
3
|
Cox LA, Chan J, Rao P, Hamid Z, Glenn JP, Jadhav A, Das V, Karere GM, Quillen E, Kavanagh K, Olivier M. Integrated omics analysis reveals sirtuin signaling is central to hepatic response to a high fructose diet. BMC Genomics 2021; 22:870. [PMID: 34861817 PMCID: PMC8641221 DOI: 10.1186/s12864-021-08166-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Accepted: 11/08/2021] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Dietary high fructose (HFr) is a known metabolic disruptor contributing to development of obesity and diabetes in Western societies. Initial molecular changes from exposure to HFr on liver metabolism may be essential to understand the perturbations leading to insulin resistance and abnormalities in lipid and carbohydrate metabolism. We studied vervet monkeys (Clorocebus aethiops sabaeus) fed a HFr (n=5) or chow diet (n=5) for 6 weeks, and obtained clinical measures of liver function, blood insulin, cholesterol and triglycerides. In addition, we performed untargeted global transcriptomics, proteomics, and metabolomics analyses on liver biopsies to determine the molecular impact of a HFr diet on coordinated pathways and networks that differed by diet. RESULTS We show that integration of omics data sets improved statistical significance for some pathways and networks, and decreased significance for others, suggesting that multiple omics datasets enhance confidence in relevant pathway and network identification. Specifically, we found that sirtuin signaling and a peroxisome proliferator activated receptor alpha (PPARA) regulatory network were significantly altered in hepatic response to HFr. Integration of metabolomics and miRNAs data further strengthened our findings. CONCLUSIONS Our integrated analysis of three types of omics data with pathway and regulatory network analysis demonstrates the usefulness of this approach for discovery of molecular networks central to a biological response. In addition, metabolites aspartic acid and docosahexaenoic acid (DHA), protein ATG3, and genes ATG7, and HMGCS2 link sirtuin signaling and the PPARA network suggesting molecular mechanisms for altered hepatic gluconeogenesis from consumption of a HFr diet.
Collapse
Affiliation(s)
- Laura A Cox
- Center for Precision Medicine, Department of Internal Medicine, Section on Molecular Medicine, Wake Forest School of Medicine, Medical Center Boulevard, NRC, G-floor, NC, 27157, Winston-Salem, USA.
- Department of Genetics, Texas Biomedical Research Institute, 78245, San Antonio, TX, USA.
- Southwest National Primate Research Center, Texas Biomedical Research Institute, 78245, San Antonio, TX, USA.
- Department of Pathology, Section on Comparative Medicine, Wake Forest School of Medicine, 27157, Winston-Salem, NC, USA.
| | - Jeannie Chan
- Center for Precision Medicine, Department of Internal Medicine, Section on Molecular Medicine, Wake Forest School of Medicine, Medical Center Boulevard, NRC, G-floor, NC, 27157, Winston-Salem, USA
- Department of Genetics, Texas Biomedical Research Institute, 78245, San Antonio, TX, USA
| | - Prahlad Rao
- University of Tennessee Health Science Center, TN, Memphis, USA
| | - Zeeshan Hamid
- Center for Precision Medicine, Department of Internal Medicine, Section on Molecular Medicine, Wake Forest School of Medicine, Medical Center Boulevard, NRC, G-floor, NC, 27157, Winston-Salem, USA
| | - Jeremy P Glenn
- Department of Genetics, Texas Biomedical Research Institute, 78245, San Antonio, TX, USA
- Southwest National Primate Research Center, Texas Biomedical Research Institute, 78245, San Antonio, TX, USA
| | - Avinash Jadhav
- Center for Precision Medicine, Department of Internal Medicine, Section on Molecular Medicine, Wake Forest School of Medicine, Medical Center Boulevard, NRC, G-floor, NC, 27157, Winston-Salem, USA
- Department of Genetics, Texas Biomedical Research Institute, 78245, San Antonio, TX, USA
| | - Vivek Das
- Novo Nordisk Research Center, Seattle, WA, USA
| | - Genesio M Karere
- Center for Precision Medicine, Department of Internal Medicine, Section on Molecular Medicine, Wake Forest School of Medicine, Medical Center Boulevard, NRC, G-floor, NC, 27157, Winston-Salem, USA
- Department of Genetics, Texas Biomedical Research Institute, 78245, San Antonio, TX, USA
| | - Ellen Quillen
- Center for Precision Medicine, Department of Internal Medicine, Section on Molecular Medicine, Wake Forest School of Medicine, Medical Center Boulevard, NRC, G-floor, NC, 27157, Winston-Salem, USA
- Department of Genetics, Texas Biomedical Research Institute, 78245, San Antonio, TX, USA
| | - Kylie Kavanagh
- Center for Precision Medicine, Department of Internal Medicine, Section on Molecular Medicine, Wake Forest School of Medicine, Medical Center Boulevard, NRC, G-floor, NC, 27157, Winston-Salem, USA
- Department of Pathology, Section on Comparative Medicine, Wake Forest School of Medicine, 27157, Winston-Salem, NC, USA
| | - Michael Olivier
- Center for Precision Medicine, Department of Internal Medicine, Section on Molecular Medicine, Wake Forest School of Medicine, Medical Center Boulevard, NRC, G-floor, NC, 27157, Winston-Salem, USA
- Department of Genetics, Texas Biomedical Research Institute, 78245, San Antonio, TX, USA
| |
Collapse
|
4
|
Ramesh P, Nagarajan V, Khanchandani V, Desai VK, Niranjan V. Proteomic variations of esophageal squamous cell carcinoma revealed by combining RNA-seq proteogenomics and G-PTM search strategy. Heliyon 2020; 6:e04813. [PMID: 32913912 PMCID: PMC7472856 DOI: 10.1016/j.heliyon.2020.e04813] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2020] [Revised: 07/10/2020] [Accepted: 08/25/2020] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Cancer that arises from epithelial cells of the esophagus is called esophagus squamous cell carcinoma (ESCC) and is mostly observed in developing nations. Evaluation of cancer genomes and its regulation into proteins plays a predominant role in understanding the cancer progressions. Mass-spectrometry-based proteomics is a consequential tool to estimate proteomic variation and posttranslational modifications (PTMs) from standard protein databases. Post-translational modifications play a crucial role in protein folding and PTMs can be accounted for as a biological signal to interpret the structural changes and transition order of proteins. Functional validation of cancer-related mutations can explain the effects of mutations on genes and the identification of Oncogenes and tumor suppressor genes. Therefore, we present a study on protein variations to interpret the structural changes and transition order of proteins in ESCC carcinogenesis. METHODOLOGY We are using a bottom-up proteomics approach with Galaxy-P framework and RNA sequence data analysis to generate the sample-specific databases containing details of RNA splicing and variant peptides. Once the database generated with information on variable modification, only the curated PTMs at specific positions are considered to perform spectral matching. Proteogenomics mapping was performed to identify protein variations in ESCC. RESULTS RNA-sequence proteogenomics with G-PTM (Global Post-Translational Modification) searching strategy has revealed proteomic events including several peptides that contain single amino acid variations, novel splice junction peptides and posttranslationally modified peptides. Proteogenomic mapping exhibited the splice junction peptides mapped predominantly for Malic enzyme exon type (ME-3) and MCM7 protein-coding genes that promote cancer progression, found to be exhibited in ESCC samples. Approximately 25 ± types of PTM modifications were recorded, and Protein Phosphorylation was largely noted. CONCLUSION ESCC cancer prognosis at the molecular level enables a better understanding of cancer carcinogenesis and protein modifications can be used as potential biomarkers.
Collapse
Affiliation(s)
- Pooja Ramesh
- Department of Biotechnology, RV College of Engineering, Bangalore, Karnataka, India
| | | | - Vartika Khanchandani
- Department of Biotechnology, RV College of Engineering, Bangalore, Karnataka, India
| | - Vasanth Kumar Desai
- Department of Biotechnology, RV College of Engineering, Bangalore, Karnataka, India
| | - Vidya Niranjan
- Department of Biotechnology, RV College of Engineering, Bangalore, Karnataka, India
| |
Collapse
|
5
|
Housman G, Gilad Y. Prime time for primate functional genomics. Curr Opin Genet Dev 2020; 62:1-7. [PMID: 32544775 DOI: 10.1016/j.gde.2020.04.007] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Revised: 04/21/2020] [Accepted: 04/24/2020] [Indexed: 12/14/2022]
Abstract
Functional genomics research is continually improving our understanding of genotype-phenotype relationships in humans, and comparative genomics perspectives can provide additional insight into the evolutionary histories of such relationships. To specifically identify conservation or species-specific divergence in humans, we must look to our closest extant evolutionary relatives. Primate functional genomics research has been steadily advancing and expanding, in spite of several limitations and challenges that this field faces. New technologies and cheaper sequencing provide a unique opportunity to enhance and expand primate comparative studies, and we outline possible paths going forward. The potential human-specific insights that can be gained from primate functional genomics research are substantial, and we propose that now is a prime time to expand such endeavors.
Collapse
Affiliation(s)
- Genevieve Housman
- Section of Genetic Medicine, Department of Medicine, University of Chicago, 5841 S. Maryland Ave., N417, MC6091, Chicago, IL 60637 USA.
| | - Yoav Gilad
- Section of Genetic Medicine, Department of Medicine, University of Chicago, 5841 S. Maryland Ave., N417, MC6091, Chicago, IL 60637 USA; Department of Human Genetics, University of Chicago, Cummings Life Science Center, 928 E. 58th St., Chicago, IL 60637 USA
| |
Collapse
|
6
|
Li H, Zhou R, Xu S, Chen X, Hong Y, Lu Q, Liu H, Zhou B, Liang X. Improving Gene Annotation of the Peanut Genome by Integrated Proteogenomics Workflow. J Proteome Res 2020; 19:2226-2235. [DOI: 10.1021/acs.jproteome.9b00723] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Affiliation(s)
- Haifen Li
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| | - Ruo Zhou
- Deepxomics Co., Ltd., Shenzhen 518000, China
| | - Shaohang Xu
- Deepxomics Co., Ltd., Shenzhen 518000, China
| | - Xiaoping Chen
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| | - Yanbin Hong
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| | - Qing Lu
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| | - Hao Liu
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| | - Baojin Zhou
- Deepxomics Co., Ltd., Shenzhen 518000, China
| | - Xuanqiang Liang
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| |
Collapse
|
7
|
Pfeuffer J, Sachsenberg T, Dijkstra TMH, Serang O, Reinert K, Kohlbacher O. EPIFANY: A Method for Efficient High-Confidence Protein Inference. J Proteome Res 2020; 19:1060-1072. [PMID: 31975601 DOI: 10.1021/acs.jproteome.9b00566] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Accurate protein inference in the presence of shared peptides is still one of the key problems in bottom-up proteomics. Most protein inference tools employing simple heuristic inference strategies are efficient but exhibit reduced accuracy. More advanced probabilistic methods often exhibit better inference quality but tend to be too slow for large data sets. Here, we present a novel protein inference method, EPIFANY, combining a loopy belief propagation algorithm with convolution trees for efficient processing of Bayesian networks. We demonstrate that EPIFANY combines the reliable protein inference of Bayesian methods with significantly shorter runtimes. On the 2016 iPRG protein inference benchmark data, EPIFANY is the only tested method that finds all true-positive proteins at a 5% protein false discovery rate (FDR) without strict prefiltering on the peptide-spectrum match (PSM) level, yielding an increase in identification performance (+10% in the number of true positives and +14% in partial AUC) compared to previous approaches. Even very large data sets with hundreds of thousands of spectra (which are intractable with other Bayesian and some non-Bayesian tools) can be processed with EPIFANY within minutes. The increased inference quality including shared peptides results in better protein inference results and thus increased robustness of the biological hypotheses generated. EPIFANY is available as open-source software for all major platforms at https://OpenMS.de/epifany.
Collapse
Affiliation(s)
- Julianus Pfeuffer
- Applied Bioinformatics, Department of Computer Science, University of Tübingen, 72076 Tübingen, Germany.,Institute for Bioinformatics and Medical Informatics, University of Tübingen, 72076 Tübingen, Germany.,Algorithmic Bioinformatics, Department of Bioinformatics, Freie Universität Berlin, 14195 Berlin, Germany
| | - Timo Sachsenberg
- Applied Bioinformatics, Department of Computer Science, University of Tübingen, 72076 Tübingen, Germany.,Institute for Bioinformatics and Medical Informatics, University of Tübingen, 72076 Tübingen, Germany
| | - Tjeerd M H Dijkstra
- Biomolecular Interactions, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Oliver Serang
- Department of Computer Science, University of Montana, Missoula, Montana 59812, United States
| | - Knut Reinert
- Algorithmic Bioinformatics, Department of Bioinformatics, Freie Universität Berlin, 14195 Berlin, Germany
| | - Oliver Kohlbacher
- Applied Bioinformatics, Department of Computer Science, University of Tübingen, 72076 Tübingen, Germany.,Institute for Bioinformatics and Medical Informatics, University of Tübingen, 72076 Tübingen, Germany.,Biomolecular Interactions, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany.,Institute for Translational Bioinformatics, University Hospital Tübingen, 72076 Tübingen, Germany.,Quantitative Biology Center, University of Tübingen, 72076 Tübingen, Germany
| |
Collapse
|
8
|
Ricci CA, Kamal AHM, Chakrabarty JK, Fuess LE, Mann WT, Jinks LR, Brinkhuis V, Chowdhury SM, Mydlarz LD. Proteomic Investigation of a Diseased Gorgonian Coral Indicates Disruption of Essential Cell Function and Investment in Inflammatory and Other Immune Processes. Integr Comp Biol 2019; 59:830-844. [DOI: 10.1093/icb/icz107] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Abstract
As scleractinian coral cover declines in the face of increased frequency in disease outbreaks, future reefs may become dominated by octocorals. Understanding octocoral disease responses and consequences is therefore necessary if we are to gain insight into the future of ecosystem services provided by coral reefs. In Florida, populations of the octocoral Eunicea calyculata infected with Eunicea black disease (EBD) were observed in the field in the fall of 2011. This disease was recognized by a stark, black pigmentation caused by heavy melanization. Histological preparations of E. calyculata infected with EBD demonstrated granular amoebocyte (GA) mobilization, melanin granules in much of the GA population, and the presence of fungal hyphae penetrating coral tissue. Previous transcriptomic analysis also identified immune trade-offs evidenced by increased immune investment at the expense of growth. Our investigation utilized proteogenomic techniques to reveal decreased investment in general cell signaling while increasing energy production for immune responses. Inflammation was also prominent in diseased E. calyculata and sheds light on factors driving the extreme phenotype observed with EBD. With disease outbreaks continuing to increase in frequency, our results highlight new targets within the cnidarian immune system and provide a framework for understanding transcriptomics in the context of an organismal disease phenotype and its protein expression.
Collapse
Affiliation(s)
- Contessa A Ricci
- Department of Biology, University of Texas at Arlington, Arlington, 501 S Nedderman Dr., TX 76010, USA
| | - Abu Hena Mostafa Kamal
- Department of Chemistry and Biochemistry, University of Texas at Arlington, 700 Planetarium Pl, Arlington, TX 76010, USA
| | - Jayanta Kishor Chakrabarty
- Department of Chemistry and Biochemistry, University of Texas at Arlington, 700 Planetarium Pl, Arlington, TX 76010, USA
| | - Lauren E Fuess
- Department of Ecology and Evolutionary Biology University of Connecticut, Storrs, CT 06269, USA
| | - Whitney T Mann
- Department of Biology, University of Texas at Arlington, Arlington, 501 S Nedderman Dr., TX 76010, USA
| | - Lea R Jinks
- Department of Biology, University of Texas at Arlington, Arlington, 501 S Nedderman Dr., TX 76010, USA
| | - Vanessa Brinkhuis
- Washington State Department of Ecology—Central Regional Office, 1250 Alder Street, Union Gap, WA 98903, USA
| | - Saiful M Chowdhury
- Department of Chemistry and Biochemistry, University of Texas at Arlington, 700 Planetarium Pl, Arlington, TX 76010, USA
| | - Laura D Mydlarz
- Department of Biology, University of Texas at Arlington, Arlington, 501 S Nedderman Dr., TX 76010, USA
| |
Collapse
|
9
|
Rendleman J, Choi H, Vogel C. Integration of large-scale multi-omic datasets: a protein-centric view. ACTA ACUST UNITED AC 2018; 11:74-81. [PMID: 30906903 DOI: 10.1016/j.coisb.2018.09.001] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Innovative mass spectrometry-based proteomics has enabled routine measurements of protein abundance, localization, interactions, and modifications, covering unique aspects of gene expression regulation and function. It is now time to move from isolated analyses of these datasets toward true integration of proteomics with other data types to gain insights from the interactions and interdependencies of biomolecules. When combined with genomic or transcriptomic data, proteomics expands genome annotation to identify variant or missing genes. Dynamic proteomic measurements can move analysis from predominantly concentration-based framework to that of synthesis and degradation of proteins. Proteomic data from thousands of cancer patients can foster identification of novel pathogenic mutations via detection of protein sequence changes that lead to dysregulated pathways in various tumors. Such comprehensive efforts can exploit the synergy arising from large and complex datasets to advance virtually every field of biology.
Collapse
Affiliation(s)
- Justin Rendleman
- Center for Genomics and Systems Biology, New York University, Department of Biology, New York, USA
| | - Hyungwon Choi
- Department of Medicine, Yong Loo Lin School of Medicine, National University Singapore, Singapore.,Institute of Molecular and Cell Biology, Agency for Science, Technology, and Research, Singapore
| | - Christine Vogel
- Center for Genomics and Systems Biology, New York University, Department of Biology, New York, USA
| |
Collapse
|
10
|
Misra BB, Langefeld CD, Olivier M, Cox LA. Integrated Omics: Tools, Advances, and Future Approaches. J Mol Endocrinol 2018; 62:JME-18-0055. [PMID: 30006342 DOI: 10.1530/jme-18-0055] [Citation(s) in RCA: 219] [Impact Index Per Article: 36.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/24/2018] [Revised: 07/02/2018] [Accepted: 07/12/2018] [Indexed: 12/13/2022]
Abstract
With the rapid adoption of high-throughput omic approaches to analyze biological samples such as genomics, transcriptomics, proteomics, and metabolomics, each analysis can generate tera- to peta-byte sized data files on a daily basis. These data file sizes, together with differences in nomenclature among these data types, make the integration of these multi-dimensional omics data into biologically meaningful context challenging. Variously named as integrated omics, multi-omics, poly-omics, trans-omics, pan-omics, or shortened to just 'omics', the challenges include differences in data cleaning, normalization, biomolecule identification, data dimensionality reduction, biological contextualization, statistical validation, data storage and handling, sharing, and data archiving. The ultimate goal is towards the holistic realization of a 'systems biology' understanding of the biological question in hand. Commonly used approaches in these efforts are currently limited by the 3 i's - integration, interpretation, and insights. Post integration, these very large datasets aim to yield unprecedented views of cellular systems at exquisite resolution for transformative insights into processes, events, and diseases through various computational and informatics frameworks. With the continued reduction in costs and processing time for sample analyses, and increasing types of omics datasets generated such as glycomics, lipidomics, microbiomics, and phenomics, an increasing number of scientists in this interdisciplinary domain of bioinformatics face these challenges. We discuss recent approaches, existing tools, and potential caveats in the integration of omics datasets for development of standardized analytical pipelines that could be adopted by the global omics research community.
Collapse
Affiliation(s)
- Biswapriya B Misra
- B Misra, Internal Medicine, Wake Forest University School of Medicine, Winston-Salem, United States
| | - Carl D Langefeld
- C Langefeld, Biostatistical Sciences, Wake Forest University School of Medicine, Winston-Salem, United States
| | - Michael Olivier
- M Olivier, Internal Medicine, Wake Forest University School of Medicine, Winston-Salem, United States
| | - Laura A Cox
- L Cox, Internal Medicine, Wake Forest University School of Medicine, Winston-Salem, United States
| |
Collapse
|