Reference Citation Analysis

For: Nie L, Wu G, Brockman FJ, Zhang W. Integrated analysis of transcriptomic and proteomic data of Desulfovibrio vulgaris: zero-inflated Poisson regression models to predict abundance of undetected proteins. Bioinformatics 2006;22:1641-7. [PMID: 16675466 DOI: 10.1093/bioinformatics/btl134] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.2] [Indexed: 11/13/2022]

Cited by Other Article(s)

Fadhil SH, Saheb EJ. Relationship between the serum level, polymorphism and gene expression of IL-33 in samples of recurrent miscarriage Iraqi women infected with toxoplasmosis. Exp Parasitol 2024;263-264:108799. [PMID: 39025462 DOI: 10.1016/j.exppara.2024.108799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/04/2023] [Revised: 05/20/2024] [Accepted: 07/11/2024] [Indexed: 07/20/2024]

Abstract

Humans are one of the many warm-blooded hosts that the intracellular protozoan parasite Toxoplasma gondii, the cause of toxoplasmosis, can infect. Cytokines are crucial to stimulate an effective immune response against T. gondii. Interleukin-33 (IL-33) is a unique anti-inflammatory cytokine that suppresses the immune response. The levels of cytokine gene expression are regulated by genetics, and the genetic polymorphisms of these cytokines play a functional role in this process. Single nucleotide polymorphisms (SNPs) are prognostic indicators of illnesses. This study aimed to determine whether toxoplasmosis interacts with serum levels of IL-33 and its SNP in women with miscarriage, as well as whether serum levels and IL-33 gene expression are related in toxoplasmosis-positive women with miscarriage. Two hundred blood samples from patients and controls were collected from AL-Alawiya Maternity Teaching Hospital and AL-Yarmouk Teaching Hospital in Baghdad, Iraq, from 2021 to 2022 to evaluate the serum level of IL-33 by ELISA. For the SNP of IL-33, the allelic high-resolution approach was utilized, and real-time PCR was performed to assess gene expression. The results showed that, compared to healthy and pregnant women, women with recurrent miscarriage and toxoplasmosis and women with recurrent miscarriage had lower IL-33 concentrations. Additionally, there were significant differences among healthy women, pregnant women, and women with repeated miscarriage who experienced toxoplasmosis. Furthermore, no differences between patients and controls were revealed by gene expression data. The results revealed that women with recurrent miscarriage, pregnant women, and healthy women all had a slightly higher IL-33 gene expression fold change. Additionally, the IL-33 SNP data demonstrated that there was no significant genetic relationship between patients and controls. Women with recurrent miscarriage and toxoplasmosis showed significant differences from pregnant women in the genotypes GG and AA as well as the alleles A and G. There were notable variations between recurrent miscarriage with and without toxoplasmosis in terms of the genotypes AA and AC. The genotypes GG and AA and the allele A are protective factors in women with recurrent miscarriage and toxoplasmosis and in women with recurrent miscarriage. Taken together, there was a statistically significant negative correlation between toxoplasmosis and IL-33 gene expression, which calls for more quantitative investigation in order to fully comprehend the interaction of mRNA and protein.

Ferreira MADM, Silveira WBD, Nikoloski Z. Protein constraints in genome-scale metabolic models: Data integration, parameter estimation, and prediction of metabolic phenotypes. Biotechnol Bioeng 2024;121:915-930. [PMID: 38178617 DOI: 10.1002/bit.28650] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/17/2022] [Revised: 10/24/2023] [Accepted: 12/18/2023] [Indexed: 01/06/2024]

Monteiro GA, Duarte SOD. The Effect of Recombinant Protein Production in Lactococcus lactis Transcriptome and Proteome. Microorganisms 2022;10:microorganisms10020267. [PMID: 35208722 PMCID: PMC8877491 DOI: 10.3390/microorganisms10020267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/19/2021] [Revised: 01/18/2022] [Accepted: 01/20/2022] [Indexed: 11/18/2022]

Wang M, Gong C, Amakye W, Ren J. Exploring the Mechanisms of Anti-Aβ42 Aggregation Activity of Walnut-derived Peptides using Transcriptomics and Proteomics in vitro. EFOOD 2022. [DOI: 10.53365/efood.k/144885] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 11/05/2022]

Zhu X, Wang J, Sun B, Ren C, Yang T, Ding J. An efficient ensemble method for missing value imputation in microarray gene expression data. BMC Bioinformatics 2021;22:188. [PMID: 33849444 PMCID: PMC8045198 DOI: 10.1186/s12859-021-04109-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 03/30/2020] [Accepted: 03/29/2021] [Indexed: 11/10/2022]

Bai B, van der Horst N, Cordewener JH, America AHP, Nijveen H, Bentsink L. Delayed Protein Changes During Seed Germination. FRONTIERS IN PLANT SCIENCE 2021;12:735719. [PMID: 34603360 PMCID: PMC8480309 DOI: 10.3389/fpls.2021.735719] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/03/2021] [Accepted: 08/05/2021] [Indexed: 05/12/2023]

Jiang D, Armour CR, Hu C, Mei M, Tian C, Sharpton TJ, Jiang Y. Microbiome Multi-Omics Network Analysis: Statistical Considerations, Limitations, and Opportunities. Front Genet 2019;10:995. [PMID: 31781153 PMCID: PMC6857202 DOI: 10.3389/fgene.2019.00995] [Citation(s) in RCA: 86] [Impact Index Per Article: 17.2] [Received: 02/13/2019] [Accepted: 09/18/2019] [Indexed: 12/21/2022]

Li Y, Fan TWM, Lane AN, Kang WY, Arnold SM, Stromberg AJ, Wang C, Chen L. SDA: a semi-parametric differential abundance analysis method for metabolomics and proteomics data. BMC Bioinformatics 2019;20:501. [PMID: 31623550 PMCID: PMC6798423 DOI: 10.1186/s12859-019-3067-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Received: 03/19/2019] [Accepted: 09/03/2019] [Indexed: 12/21/2022]

Du Y, Clair GC, Al Alam D, Danopoulos S, Schnell D, Kitzmiller JA, Misra RS, Bhattacharya S, Warburton D, Mariani TJ, Pryhuber GS, Whitsett JA, Ansong C, Xu Y. Integration of transcriptomic and proteomic data identifies biological functions in cell populations from human infant lung. Am J Physiol Lung Cell Mol Physiol 2019;317:L347-L360. [PMID: 31268347 DOI: 10.1152/ajplung.00475.2018] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Indexed: 12/13/2022]

Affiliation(s)

Yina Du: The Perinatal Institute and Section of Neonatology, Perinatal and Pulmonary Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio
Geremy C Clair: Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington
Denise Al Alam: Developmental Biology and Regenerative Medicine Program, Department of Pediatric Surgery, The Saban Research Institute, Children's Hospital Los Angeles, Los Angeles, California; Keck School of Medicine, University of Southern California, Los Angeles, California
Soula Danopoulos: Developmental Biology and Regenerative Medicine Program, Department of Pediatric Surgery, The Saban Research Institute, Children's Hospital Los Angeles, Los Angeles, California; Keck School of Medicine, University of Southern California, Los Angeles, California
Daniel Schnell: Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio; Heart Institute and Center for Translational Fibrosis Research, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio
Joseph A Kitzmiller: The Perinatal Institute and Section of Neonatology, Perinatal and Pulmonary Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio
Ravi S Misra: Department of Pediatrics, University of Rochester Medical Center, Rochester, New York
Soumyaroop Bhattacharya: Department of Pediatrics, University of Rochester Medical Center, Rochester, New York; Division of Neonatology and Program in Pediatric Molecular and Personalized Medicine, University of Rochester Medical Center, Rochester, New York
David Warburton: Developmental Biology and Regenerative Medicine Program, Department of Pediatric Surgery, The Saban Research Institute, Children's Hospital Los Angeles, Los Angeles, California; Keck School of Medicine, University of Southern California, Los Angeles, California
Thomas J Mariani: Department of Pediatrics, University of Rochester Medical Center, Rochester, New York; Division of Neonatology and Program in Pediatric Molecular and Personalized Medicine, University of Rochester Medical Center, Rochester, New York
Gloria S Pryhuber: Department of Pediatrics, University of Rochester Medical Center, Rochester, New York
Jeffrey A Whitsett: The Perinatal Institute and Section of Neonatology, Perinatal and Pulmonary Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio
Charles Ansong: Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington
Yan Xu: The Perinatal Institute and Section of Neonatology, Perinatal and Pulmonary Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio; Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio

Lin D, Zhang J, Li J, Xu C, Deng HW, Wang YP. An integrative imputation method based on multi-omics datasets. BMC Bioinformatics 2016;17:247. [PMID: 27329642 PMCID: PMC4915152 DOI: 10.1186/s12859-016-1122-6] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Received: 09/30/2015] [Accepted: 06/05/2016] [Indexed: 12/26/2022]

Bundy JL, Inouye BD, Mercer RS, Nowakowski RS. Fractionation-dependent improvements in proteome resolution in the mouse hippocampus by IEF LC-MS/MS. Electrophoresis 2016;37:2054-62. [DOI: 10.1002/elps.201600076] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Received: 10/07/2015] [Revised: 04/03/2016] [Accepted: 04/20/2016] [Indexed: 01/19/2023]

Qi F, Zhao X, Kitahara Y, Li T, Ou X, Du W, Liu D, Huang J. Integrative transcriptomic and proteomic analysis of the mutant lignocellulosic hydrolyzate-tolerant Rhodosporidium toruloides. Eng Life Sci 2016;17:249-261. [PMID: 32624772 DOI: 10.1002/elsc.201500143] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Received: 10/14/2015] [Revised: 11/15/2015] [Accepted: 01/14/2016] [Indexed: 12/15/2022]

Lazar C, Gatto L, Ferro M, Bruley C, Burger T. Accounting for the Multiple Natures of Missing Values in Label-Free Quantitative Proteomics Data Sets to Compare Imputation Strategies. J Proteome Res 2016;15:1116-25. [DOI: 10.1021/acs.jproteome.5b00981] [Citation(s) in RCA: 232] [Impact Index Per Article: 29.0] [Indexed: 01/21/2023]

Wang J, Wu G, Chen L, Zhang W. Integrated Analysis of Transcriptomic and Proteomic Datasets Reveals Information on Protein Expressivity and Factors Affecting Translational Efficiency. Methods Mol Biol 2016;1375:123-136. [PMID: 25762301 DOI: 10.1007/7651_2015_242] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Indexed: 06/04/2023]

Gao L, Pei G, Chen L, Zhang W. A global network-based protocol for functional inference of hypothetical proteins in Synechocystis sp. PCC 6803. J Microbiol Methods 2015;116:44-52. [DOI: 10.1016/j.mimet.2015.06.013] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Received: 04/19/2015] [Revised: 06/24/2015] [Accepted: 06/25/2015] [Indexed: 01/15/2023]

A Post-Genomic View of the Ecophysiology, Catabolism and Biotechnological Relevance of Sulphate-Reducing Prokaryotes. Adv Microb Physiol 2015. [PMID: 26210106 DOI: 10.1016/bs.ampbs.2015.05.002] [Citation(s) in RCA: 174] [Impact Index Per Article: 19.3] [Indexed: 12/29/2022]

Guindani M, Sepúlveda N, Paulino CD, Müller P. A Bayesian Semi-parametric Approach for the Differential Analysis of Sequence Counts Data. J R Stat Soc Ser C Appl Stat 2014;63:385-404. [PMID: 24833809 PMCID: PMC4017673 DOI: 10.1111/rssc.12041] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Indexed: 12/31/2022]

Tomescu OA, Mattanovich D, Thallinger GG. Integrative omics analysis. A study based on Plasmodium falciparum mRNA and protein data. BMC SYSTEMS BIOLOGY 2014;8 Suppl 2:S4. [PMID: 25033389 PMCID: PMC4101701 DOI: 10.1186/1752-0509-8-s2-s4] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Indexed: 11/10/2022]

Abstract

Background

Technological improvements have shifted the focus from data generation to data analysis. The availability of large amounts of data from transcriptomics, proteomics and metabolomics experiments raises new questions concerning suitable integrative analysis methods. We compare three integrative analysis techniques (co-inertia analysis, generalized singular value decomposition and integrative biclustering) by applying them to gene and protein abundance data from the six life cycle stages of Plasmodium falciparum. Co-inertia analysis is an analysis method used to visualize and explore gene and protein data. The generalized singular value decomposition has shown its potential in the analysis of two transcriptome data sets. Integrative Biclustering applies biclustering to gene and protein data.

Results

Using CIA, we visualize the six life cycle stages of Plasmodium falciparum, as well as GO terms in a 2D plane and interpret the spatial configuration. With GSVD, we decompose the transcriptomic and proteomic data sets into matrices with biologically meaningful interpretations and explore the processes captured by the data sets. IBC identifies groups of genes, proteins, GO Terms and life cycle stages of Plasmodium falciparum. We show method-specific results as well as a network view of the life cycle stages based on the results common to all three methods. Additionally, by combining the results of the three methods, we create a three-fold validated network of life cycle stage specific GO terms: Sporozoites are associated with transcription and transport; merozoites with entry into host cell as well as biosynthetic and metabolic processes; rings with oxidation-reduction processes; trophozoites with glycolysis and energy production; schizonts with antigenic variation and immune response; gametocytes with DNA packaging and mitochondrial transport. Furthermore, the network connectivity underlines the separation of the intraerythrocytic cycle from the gametocyte and sporozoite stages.

Conclusion

Using integrative analysis techniques, we can integrate knowledge from different levels and obtain a wider view of the system under study. The overlap between method-specific and common results is considerable, even if the basic mathematical assumptions are very different. The three-fold validated network of life cycle stage characteristics of Plasmodium falciparum could identify a large amount of the known associations from literature in only one study.

Proteomic and transcriptomic analyses of "Candidatus Pelagibacter ubique" describe the first PII-independent response to nitrogen limitation in a free-living Alphaproteobacterium. mBio 2013;4:e00133-12. [PMID: 24281717 PMCID: PMC3870248 DOI: 10.1128/mbio.00133-12] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Indexed: 11/20/2022]

Abstract

Nitrogen is one of the major nutrients limiting microbial productivity in the ocean, and as a result, most marine microorganisms have evolved systems for responding to nitrogen stress. The highly abundant alphaproteobacterium "Candidatus Pelagibacter ubique," a cultured member of the order Pelagibacterales (SAR11), lacks the canonical GlnB, GlnD, GlnK, and NtrB/NtrC genes for regulating nitrogen assimilation, raising questions about how these organisms respond to nitrogen limitation. A survey of 266 Alphaproteobacteria genomes found these five regulatory genes nearly universally conserved, absent only in intracellular parasites and members of the order Pelagibacterales, including "Ca. Pelagibacter ubique." Global differences in mRNA and protein expression between nitrogen-limited and nitrogen-replete cultures were measured to identify nitrogen stress responses in "Ca. Pelagibacter ubique" strain HTCC1062. Transporters for ammonium (AmtB), taurine (TauA), amino acids (YhdW), and opines (OccT) were all elevated in nitrogen-limited cells, indicating that they devote increased resources to the assimilation of nitrogenous organic compounds. Enzymes for assimilating amine into glutamine (GlnA), glutamate (GltBD), and glycine (AspC) were similarly upregulated. Differential regulation of the transcriptional regulator NtrX in the two-component signaling system NtrY/NtrX was also observed, implicating it in control of the nitrogen starvation response. Comparisons of the transcriptome and proteome supported previous observations of uncoupling between transcription and translation in nutrient-deprived "Ca. Pelagibacter ubique" cells. Overall, these data reveal a streamlined, PII-independent response to nitrogen stress in "Ca. Pelagibacter ubique," and likely other Pelagibacterales, and show that they respond to nitrogen stress by allocating more resources to the assimilation of nitrogen-rich organic compounds.

IMPORTANCE

Pelagibacterales are extraordinarily abundant and play a pivotal role in marine geochemical cycles, as one of the major recyclers of labile dissolved organic matter. They are also models for understanding how streamlining selection can reshape chemoheterotroph metabolism. Streamlining and its broad importance to environmental microbiology are emerging slowly from studies that reveal the complete genomes of uncultured organisms. Here, we report another remarkable example of streamlined metabolism in Pelagibacterales, this time in systems that control nitrogen assimilation. Pelagibacterales are major contributors to metatranscriptomes and metaproteomes from ocean systems, where patterns of gene expression are used to gain insight into ocean conditions and geochemical cycles. The data presented here supply background that is essential to interpreting data from field studies.

Haider S, Pal R. Integrated analysis of transcriptomic and proteomic data. Curr Genomics 2013;14:91-110. [PMID: 24082820 PMCID: PMC3637682 DOI: 10.2174/1389202911314020003] [Citation(s) in RCA: 258] [Impact Index Per Article: 23.5] [Received: 09/10/2012] [Revised: 01/09/2013] [Accepted: 01/22/2013] [Indexed: 12/14/2022]

A practical data processing workflow for multi-OMICS projects. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2013;1844:52-62. [PMID: 23501674 DOI: 10.1016/j.bbapap.2013.02.029] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.9] [Received: 12/03/2012] [Revised: 02/15/2013] [Accepted: 02/20/2013] [Indexed: 12/11/2022]

Abstract

Multi-OMICS approaches aim at the integration of quantitative data obtained for different biological molecules in order to understand their interrelation and the functioning of larger systems. This paper deals with several data integration and data processing issues that frequently occur within this context. To this end, the data processing workflow within the PROFILE project is presented, a multi-OMICS project that aims at the identification of novel biomarkers and the development of new therapeutic targets for seven important liver diseases. Furthermore, a software tool called CrossPlatformCommander is sketched, which facilitates several steps of the proposed workflow in a semi-automatic manner. Application of the software is presented for the detection of novel biomarkers, their ranking and annotation with existing knowledge, using the example of corresponding Transcriptomics and Proteomics data sets obtained from patients suffering from hepatocellular carcinoma. Additionally, a linear regression analysis of Transcriptomics vs. Proteomics data is presented and its performance assessed. It was shown that, for capturing profound relations between Transcriptomics and Proteomics data, a simple linear regression analysis is not sufficient and implementation and evaluation of alternative statistical approaches are needed. Additionally, the integration of multivariate variable selection and classification approaches is intended for further development of the software. Although this paper focuses only on the combination of data obtained from quantitative Proteomics and Transcriptomics experiments, several approaches and data integration steps are also applicable for other OMICS technologies. Keeping specific restrictions in mind, the suggested workflow (or at least parts of it) may be used as a template for similar projects that make use of different high throughput techniques.
This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan.
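
The linear regression of transcript versus protein abundance mentioned in this abstract can be illustrated with a minimal sketch. It is not the CrossPlatformCommander implementation, which the abstract does not describe; the function name `fit_line` and the abundance values are hypothetical and chosen only for illustration. A low R² from such a fit is one way a simple linear model can show up as insufficient.

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y ~ slope * x + intercept.

    Returns (slope, intercept, r2). Hypothetical helper for illustration.
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # R^2 compares the residual scatter with the total scatter of y.
    ss_res = sum((yi - (slope * xi + intercept)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    r2 = 1.0 - ss_res / ss_tot
    return slope, intercept, r2


# Made-up log2 transcript and protein abundances for six genes.
log_mrna = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
log_protein = [2.1, 2.0, 4.5, 3.2, 6.8, 5.1]
slope, intercept, r2 = fit_line(log_mrna, log_protein)
print(f"slope={slope:.2f} intercept={intercept:.2f} R^2={r2:.2f}")
```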

Wang J, Chen L, Huang S, Liu J, Ren X, Tian X, Qiao J, Zhang W. RNA-seq based identification and mutant validation of gene targets related to ethanol resistance in cyanobacterial Synechocystis sp. PCC 6803. BIOTECHNOLOGY FOR BIOFUELS 2012;5:89. [PMID: 23259593 PMCID: PMC3564720 DOI: 10.1186/1754-6834-5-89] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.7] [Received: 09/21/2012] [Accepted: 12/04/2012] [Indexed: 05/03/2023]

Prediction and Characterization of Missing Proteomic Data in Desulfovibrio vulgaris. Comp Funct Genomics 2011;2011:780973. [PMID: 21687592 PMCID: PMC3114432 DOI: 10.1155/2011/780973] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Received: 10/25/2010] [Revised: 12/17/2010] [Accepted: 03/01/2011] [Indexed: 11/17/2022]

Torres-García W, Brown SD, Johnson RH, Zhang W, Runger GC, Meldrum DR. Integrative analysis of transcriptomic and proteomic data of Shewanella oneidensis: missing value imputation using temporal datasets. MOLECULAR BIOSYSTEMS 2011;7:1093-104. [PMID: 21212895 DOI: 10.1039/c0mb00260g] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Indexed: 11/21/2022]

Abstract

Despite significant improvements in recent years, proteomic datasets currently available still suffer from a large number of missing values. Integrative analyses based upon incomplete proteomic and transcriptomic datasets could seriously bias the biological interpretation. In this study, we applied a non-linear data-driven stochastic gradient boosted trees (GBT) model to impute missing proteomic values using a temporal transcriptomic and proteomic dataset of Shewanella oneidensis. In this dataset, gene expression was measured after the cells were exposed to 1 mM potassium chromate for 5, 30, 60, and 90 min, while protein abundance was measured for 45 and 90 min. With the ultimate objective to impute protein values for experimentally undetected samples at 45 and 90 min, we applied a serial set of algorithms to capture relationships between temporal gene and protein expression. This work follows four main steps: (1) a quality control step for gene expression reliability, (2) mRNA imputation, (3) protein prediction, and (4) validation. Initially, an S control chart approach is performed on gene expression replicates to remove unwanted variability. Then, we focused on the missing measurements of gene expression through a nonlinear Smoothing Splines Curve Fitting. This method identifies temporal relationships among transcriptomic data at different time points and enables imputation of mRNA abundance at 45 min. After mRNA imputation was validated by biological constraints (i.e. operons), we used a data-driven GBT model to impute protein abundance for the proteins experimentally undetected in the 45 and 90 min samples, based on relevant predictors such as temporal mRNA gene expression data and cellular functional roles. 
The imputed protein values were validated using biological constraints such as operon and pathway information through a permutation test to investigate whether dispersion measures are indeed smaller for known biological groups than for any set of random genes. Finally, we demonstrated that such missing value imputation improved characterization of the temporal response of S. oneidensis to chromate.
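
The permutation test described in this abstract, which asks whether a known biological group is less dispersed than random gene sets, can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not name the dispersion measure, so the population standard deviation is assumed here, and the function name, gene labels and abundance values are all hypothetical.

```python
import random
import statistics


def dispersion_permutation_test(values, group, n_perm=2000, seed=0):
    """Estimate how often a random gene set is at least as tight as `group`.

    values: dict mapping gene id -> imputed abundance (hypothetical data).
    group:  gene ids forming a known biological group, e.g. one operon.
    Dispersion is measured as the population standard deviation (assumption).
    """
    rng = random.Random(seed)
    observed = statistics.pstdev([values[g] for g in group])
    genes = list(values)
    hits = 0
    for _ in range(n_perm):
        # Draw a random gene set of the same size as the biological group.
        sample = rng.sample(genes, len(group))
        if statistics.pstdev([values[g] for g in sample]) <= observed:
            hits += 1
    # Add-one correction keeps the estimated p-value strictly positive.
    return (hits + 1) / (n_perm + 1)


# Made-up imputed abundances; g1..g3 stand in for genes of one operon.
abundance = {
    "g1": 5.0, "g2": 5.1, "g3": 4.9, "g4": 0.5, "g5": 9.0, "g6": 2.0,
    "g7": 7.5, "g8": 3.3, "g9": 8.1, "g10": 1.2, "g11": 6.4, "g12": 10.2,
}
p_value = dispersion_permutation_test(abundance, ["g1", "g2", "g3"])
print(f"estimated p-value: {p_value:.4f}")
```

A small p-value means few random gene sets are as tight as the operon, which is the pattern the abstract uses as support for the imputed values.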

Rogers S. Statistical methods and models for bridging Omics data levels. Methods Mol Biol 2011;719:133-51. [PMID: 21370082 DOI: 10.1007/978-1-61779-027-0_6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Indexed: 01/21/2023]

Transcriptome and proteome exploration to model translation efficiency and protein stability in Lactococcus lactis. PLoS Comput Biol 2009;5:e1000606. [PMID: 20019804 PMCID: PMC2787624 DOI: 10.1371/journal.pcbi.1000606] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Received: 03/25/2009] [Accepted: 11/12/2009] [Indexed: 11/19/2022]

Abstract

This genome-scale study analysed the various parameters influencing protein levels in cells. To achieve this goal, the model bacterium Lactococcus lactis was grown at steady state in continuous cultures at different growth rates, and proteomic and transcriptomic data were thoroughly compared. Ratios of mRNA to protein were highly variable among proteins but also, for a given gene, between the different growth conditions. The modeling of cellular processes combined with a data fitting modeling approach allowed both translation efficiencies and degradation rates to be estimated for each protein in each growth condition. Estimated translational efficiencies and degradation rates strongly differed between proteins and were tested for their biological significance through statistical correlations with relevant parameters such as codon or amino acid bias. These efficiencies and degradation rates were not constant in all growth conditions and were inversely proportional to the growth rate, indicating a more efficient translation at low growth rate but an antagonistic higher rate of protein degradation. Estimated protein median half-lives ranged from 23 to 224 min, underlining the importance of protein degradation notably at low growth rates. The regulation of intracellular protein level was analysed through regulatory coefficient calculations, revealing a complex control depending on protein and growth conditions. The modeling approach enabled translational efficiencies and protein degradation rates to be estimated, two biological parameters extremely difficult to determine experimentally and generally lacking in bacteria. This method is generic and can now be extended to other environments and/or other micro-organisms.

This work is in the field of systems biology. Via an in-depth comparison of proteomic and transcriptomic data in various culture conditions, our objective was to better understand the regulation of protein levels. We have demonstrated that bacteria exert a tight control on intracellular protein levels, through a multi-level regulation involving translation but also dilution due to growth and protein degradation. We have estimated translational efficiencies and protein degradation rates by modeling. These two biological parameters are extremely difficult to measure experimentally and have not been previously determined in bacteria. We have found that they are growth rate dependent, indicating a fine control of translation and degradation processes. We have worked with the small genome bacterium Lactococcus lactis on a limited number of mRNA-protein couples but keeping in mind that this approach could be extended to other micro-organisms and biological phenomena. We have shown that mathematical modeling combined with experimental steady-state cultures is a powerful tool to understand microbial physiology.
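
The abstract does not state the model equations, so the following is only a sketch of one common first-order balance consistent with the description above (translation, dilution by growth, and degradation). It assumes dP/dt = k_translation * mRNA - (k_degradation + growth_rate) * P, which at steady state gives P = k_translation * mRNA / (k_degradation + growth_rate). All function names, parameter names and input values are hypothetical; only the 23 min and 224 min half-lives come from the abstract.

```python
import math


def steady_state_protein(mrna, k_translation, k_degradation, growth_rate):
    """Steady-state protein level under the assumed first-order balance.

    Protein is produced in proportion to mRNA and lost through degradation
    plus dilution by growth (rates in 1/min, hypothetical units).
    """
    return k_translation * mrna / (k_degradation + growth_rate)


def degradation_rate_from_half_life(half_life_min):
    """First-order rate constant (1/min) for a given half-life in minutes."""
    return math.log(2) / half_life_min


# Half-lives at the two ends of the range quoted in the abstract.
k_fast = degradation_rate_from_half_life(23.0)
k_slow = degradation_rate_from_half_life(224.0)

# Made-up mRNA level, translation efficiency and growth rate.
for label, k_deg in (("short-lived", k_fast), ("long-lived", k_slow)):
    level = steady_state_protein(
        mrna=10.0, k_translation=2.0, k_degradation=k_deg, growth_rate=0.005
    )
    print(f"{label}: k_deg={k_deg:.4f} per min, steady-state protein={level:.1f}")
```

Under this assumed balance, a lower growth rate makes degradation a larger share of total protein loss, which is in line with the abstract's remark on the importance of degradation at low growth rates.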

Zhang W, Li F, Nie L. Integrating multiple 'omics' analysis for microbial biology: application and methodologies. MICROBIOLOGY-SGM 2009;156:287-301. [PMID: 19910409 DOI: 10.1099/mic.0.034793-0] [Citation(s) in RCA: 281] [Impact Index Per Article: 18.7] [Indexed: 01/08/2023]

Sun A, Zhang J, Wang C, Yang D, Wei H, Zhu Y, Jiang Y, He F. Modified Spectral Count Index (mSCI) for Estimation of Protein Abundance by Protein Relative Identification Possibility (RIPpro): A New Proteomic Technological Parameter. J Proteome Res 2009;8:4934-42. [DOI: 10.1021/pr900252n] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Indexed: 11/29/2022]

Affiliation(s)

Aihua Sun, Jiyang Zhang, Chunping Wang, Dong Yang, Handong Wei, Yunping Zhu, Ying Jiang, Fuchu He: Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100730, P. R. China, State Key Laboratory of Proteomics, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine, Beijing 102206, P. R. China, and Institutes of Biomedical Sciences and Department of Chemistry, Fudan University, Shanghai 200032, P. R. China

de Sousa Abreu R, Penalva LO, Marcotte EM, Vogel C. Global signatures of protein and mRNA expression levels. MOLECULAR BIOSYSTEMS 2009;5:1512-26. [PMID: 20023718 DOI: 10.1039/b908315d] [Citation(s) in RCA: 578] [Impact Index Per Article: 38.5] [Indexed: 12/18/2022]

Tan CS, Salim A, Ploner A, Lehtiö J, Chia KS, Pawitan Y. Correlating gene and protein expression data using Correlated Factor Analysis. BMC Bioinformatics 2009;10:272. [PMID: 19723309 PMCID: PMC2744708 DOI: 10.1186/1471-2105-10-272] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Received: 11/04/2008] [Accepted: 09/01/2009] [Indexed: 01/06/2023] Open

Torres-García W, Zhang W, Runger GC, Johnson RH, Meldrum DR. Integrative analysis of transcriptomic and proteomic data of Desulfovibrio vulgaris: a non-linear model to predict abundance of undetected proteins. Bioinformatics 2009;25:1905-14. [PMID: 19447782 DOI: 10.1093/bioinformatics/btp325] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.8] [Indexed: 11/13/2022]

Abstract

MOTIVATION

Gene expression profiling technologies can generally produce mRNA abundance data for all genes in a genome. A dearth of proteomic data persists because identification range and sensitivity of proteomic measurements lag behind those of transcriptomic measurements. Using partial proteomic data, it is likely that integrative transcriptomic and proteomic analysis may introduce significant bias. Developing methodologies to accurately estimate missing proteomic data will allow better integration of transcriptomic and proteomic datasets and provide deeper insight into metabolic mechanisms underlying complex biological systems.

RESULTS

In this study, we present a non-linear data-driven model to predict abundance for undetected proteins using two independent datasets of cognate transcriptomic and proteomic data collected from Desulfovibrio vulgaris. We use stochastic gradient boosted trees (GBT) to uncover possible non-linear relationships between transcriptomic and proteomic data, and to predict protein abundance for the proteins not experimentally detected based on relevant predictors such as mRNA abundance, cellular role, molecular weight, sequence length, protein length, guanine-cytosine (GC) content and triple codon counts. Initially, we constructed a GBT model using all possible variables to assess their relative importance and characterize the behavior of the predictive model. A strong plateau effect in the regions of high mRNA values and sparse data occurred in this model. Hence, we removed genes in those areas based on thresholds estimated from the partial dependency plots where this behavior was captured. At this stage, only the strongest predictors of protein abundance were retained to reduce the complexity of the GBT model. After removing genes in the plateau region, mRNA abundance, main cellular functional categories and a few triple codon counts emerged as the top-ranked predictors of protein abundance. We then created a new tuned GBT model using the five most significant predictors. Our non-linear model is built from a series of regression tree models with implicit strength in variable selection. The model provides variable relative importance measures using mean square error as the criterion. The results showed that coefficients of determination for our non-linear models ranged from 0.393 to 0.582 in both datasets, providing better results than linear regression used in the past. We evaluated the validity of this non-linear model using biological information of operons, regulons and pathways, and the results demonstrated that the coefficients of variation of estimated protein abundance values within operons, regulons or pathways are indeed smaller than those for random groups of proteins.
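
To make the idea of stochastic gradient boosting concrete, here is a minimal sketch under strong simplifying assumptions; it is not the authors' implementation. It uses a single numeric predictor (standing in for mRNA abundance), depth-one regression trees (stumps) instead of full trees, squared-error loss, and a random subsample of the data for each tree. All function names and the toy data are hypothetical.

```python
import random

def fit_stump(xs, residuals):
    """Find the single threshold split that minimises squared error."""
    best = None
    for t in sorted(set(xs))[:-1]:  # exclude the maximum so the right side is never empty
        left = [r for x, r in zip(xs, residuals) if x <= t]
        right = [r for x, r in zip(xs, residuals) if x > t]
        lm, rm = sum(left) / len(left), sum(right) / len(right)
        sse = sum((r - lm) ** 2 for r in left) + sum((r - rm) ** 2 for r in right)
        if best is None or sse < best[0]:
            best = (sse, t, lm, rm)
    if best is None:  # all x values identical: fall back to a constant
        m = sum(residuals) / len(residuals)
        return (xs[0], m, m)
    return best[1:]

def predict_stump(stump, x):
    t, lm, rm = stump
    return lm if x <= t else rm

def fit_gbt(xs, ys, n_trees=50, learning_rate=0.1, subsample=0.5, seed=0):
    """Boosting loop: each stump is fitted to residuals on a random subsample."""
    rng = random.Random(seed)
    base = sum(ys) / len(ys)
    preds = [base] * len(ys)
    stumps = []
    k = max(1, min(len(xs), int(subsample * len(xs))))
    for _ in range(n_trees):
        idx = rng.sample(range(len(xs)), k)
        stump = fit_stump([xs[i] for i in idx], [ys[i] - preds[i] for i in idx])
        stumps.append(stump)
        preds = [p + learning_rate * predict_stump(stump, x) for p, x in zip(preds, xs)]
    return base, learning_rate, stumps

def predict_gbt(model, x):
    base, lr, stumps = model
    return base + lr * sum(predict_stump(s, x) for s in stumps)

def mse(model, xs, ys):
    return sum((predict_gbt(model, x) - y) ** 2 for x, y in zip(xs, ys)) / len(ys)

# Toy data (made up): the response rises with the predictor, then plateaus.
toy_x = [i / 10 for i in range(100)]
toy_y = [min(x, 5.0) for x in toy_x]
toy_model = fit_gbt(toy_x, toy_y)
```

The subsampling step is what makes the procedure "stochastic"; a practical analysis would more likely rely on an established library and on several predictors, as the abstract describes.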

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Baíllo A, Berrendero J, Cárcamo J. Tests for zero-inflation and overdispersion: A new approach based on the stochastic convex order. Comput Stat Data Anal 2009. [DOI: 10.1016/j.csda.2008.12.012] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Indexed: 10/21/2022]

Nie L, Wu G, Zhang W. Statistical Application and Challenges in Global Gel-Free Proteomic Analysis by Mass Spectrometry. Crit Rev Biotechnol 2008;28:297-307. [DOI: 10.1080/07388550802543158] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Indexed: 10/21/2022]

Mishra Y, Chaurasia N, Rai LC. Heat pretreatment alleviates UV-B toxicity in the cyanobacterium Anabaena doliolum: A proteomic analysis of cross tolerance. Photochem Photobiol 2008;85:824-33. [PMID: 19076303 DOI: 10.1111/j.1751-1097.2008.00469.x] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.6] [Indexed: 12/01/2022]

Bhargava P, Kumar A, Mishra Y, Rai LC. Copper pretreatment augments ultraviolet B toxicity in the cyanobacterium Anabaena doliolum: a proteomic analysis of cell death. FUNCTIONAL PLANT BIOLOGY : FPB 2008;35:360-372. [PMID: 32688793 DOI: 10.1071/fp07267] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Received: 11/15/2007] [Accepted: 05/22/2008] [Indexed: 06/11/2023]

Guo Y, Xiao P, Lei S, Deng F, Xiao GG, Liu Y, Chen X, Li L, Wu S, Chen Y, Jiang H, Tan L, Xie J, Zhu X, Liang S, Deng H. How is mRNA expression predictive for protein expression? A correlation study on human circulating monocytes. Acta Biochim Biophys Sin (Shanghai) 2008;40:426-36. [PMID: 18465028 DOI: 10.1111/j.1745-7270.2008.00418.x] [Citation(s) in RCA: 311] [Impact Index Per Article: 19.4] [Indexed: 01/31/2023] Open

Rohmer L, Guina T, Chen J, Gallis B, Taylor GK, Shaffer SA, Miller SI, Brittnacher MJ, Goodlett DR. Determination and Comparison of the Francisella tularensis subsp. novicida U112 Proteome to Other Bacterial Proteomes. J Proteome Res 2008;7:2016-24. [DOI: 10.1021/pr700760z] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Indexed: 11/30/2022]

Affiliation(s)

Laurence Rohmer, Tina Guina, Jinzhi Chen, Byron Gallis, Greg K. Taylor, Scott A. Shaffer, Samuel I. Miller, Mitchell J. Brittnacher, David R. Goodlett: Department of Genome Sciences, Microbiology, Medicine, Medicinal Chemistry, and Department of Pediatrics, Division of Infectious Diseases, University of Washington, Seattle, Washington 98195

Nie L, Wu G, Culley DE, Scholten JCM, Zhang W. Integrative analysis of transcriptomic and proteomic data: challenges, solutions and applications. Crit Rev Biotechnol 2007;27:63-75. [PMID: 17578703 DOI: 10.1080/07388550701334212] [Citation(s) in RCA: 170] [Impact Index Per Article: 10.0] [Indexed: 10/23/2022]

Comulada WS, Weiss RE, Cumberland W, Rotheram-Borus MJ. Reductions in drug use among young people living with HIV. THE AMERICAN JOURNAL OF DRUG AND ALCOHOL ABUSE 2007;33:493-501. [PMID: 17613977 PMCID: PMC2819808 DOI: 10.1080/00952990701301921] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Indexed: 10/23/2022]

Fagan A, Culhane AC, Higgins DG. A multivariate analysis approach to the integration of proteomic and gene expression data. Proteomics 2007;7:2162-71. [PMID: 17549791 DOI: 10.1002/pmic.200600898] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.3] [Indexed: 12/12/2022]

Nie L, Wu G, Zhang W. Correlation of mRNA expression and protein abundance affected by multiple sequence features related to translational efficiency in Desulfovibrio vulgaris: a quantitative analysis. Genetics 2006;174:2229-43. [PMID: 17028312 PMCID: PMC1698625 DOI: 10.1534/genetics.106.065862] [Citation(s) in RCA: 163] [Impact Index Per Article: 9.1] [Indexed: 11/18/2022] Open

Abstract

The modest correlation between mRNA expression and protein abundance in large-scale data sets is explained in part by experimental challenges, such as technological limitations, and in part by fundamental biological factors in the transcription and translation processes. Among various factors affecting the mRNA-protein correlation, the roles of biological factors related to translation are poorly understood. In this study, using experimental mRNA expression and protein abundance data collected from Desulfovibrio vulgaris by DNA microarray and liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) proteomic analysis, we quantitatively examined the effects of several translational-efficiency-related sequence features on mRNA-protein correlation. Three classes of sequence features were investigated according to different translational stages: (i) initiation, Shine-Dalgarno sequences, start codon identity, and start codon context; (ii) elongation, codon usage and amino acid usage; and (iii) termination, stop codon identity and stop codon context. Surprisingly, although it is widely accepted that translation initiation is the rate-limiting step for translation, our results showed that the mRNA-protein correlation was affected the most by the features at elongation stages, i.e., codon usage and amino acid composition (5.3-15.7% and 5.8-11.9% of the total variation of mRNA-protein correlation, respectively), followed by stop codon context and the Shine-Dalgarno sequence (3.7-5.1% and 1.9-3.8%, respectively). Taken together, all sequence features contributed to 15.2-26.2% of the total variation of mRNA-protein correlation. This study provides the first comprehensive quantitative analysis of the mRNA-protein correlation in bacterial D. vulgaris and adds new insights into the relative importance of various sequence features in prokaryotic protein translation.
