1
|
Burger T. Controlling for false discoveries subsequently to large scale one-way ANOVA testing in proteomics: Practical considerations. Proteomics 2023; 23:e2200406. [PMID: 37357151 DOI: 10.1002/pmic.202200406] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Revised: 05/30/2023] [Accepted: 05/31/2023] [Indexed: 06/27/2023]
Abstract
In discovery proteomics, as well as many other "omic" approaches, the possibility to test for the differential abundance of hundreds (or of thousands) of features simultaneously is appealing, despite requiring specific statistical safeguards, among which controlling for the false discovery rate (FDR) has become standard. Moreover, when more than two biological conditions or group treatments are considered, it has become customary to rely on the one-way analysis of variance (ANOVA) framework, where a first global differential abundance landscape provided by an omnibus test can be subsequently refined using various post-hoc tests (PHTs). However, the interactions between the FDR control procedures and the PHTs are complex, because both correspond to different types of multiple test corrections (MTCs). This article surveys various ways to orchestrate them in a data processing workflow and discusses their pros and cons.
Collapse
Affiliation(s)
- Thomas Burger
- Univ. Grenoble Alpes, CNRS, CEA, INSERM, ProFI, EDyP, Grenoble, France
| |
Collapse
|
2
|
Thavarajah T, Dos Santos CC, Slutsky AS, Marshall JC, Bowden P, Romaschin A, Marshall JG. The plasma peptides of sepsis. Clin Proteomics 2020; 17:26. [PMID: 32636717 PMCID: PMC7331219 DOI: 10.1186/s12014-020-09288-5] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2020] [Accepted: 06/15/2020] [Indexed: 12/28/2022] Open
Abstract
Background A practical strategy to discover sepsis specific proteins may be to compare the plasma peptides and proteins from patients in the intensive care unit with and without sepsis. The aim was to discover proteins and/or peptides that show greater observation frequency and/or precursor intensity in sepsis. The endogenous tryptic peptides of ICU-Sepsis were compared to ICU Control, ovarian cancer, breast cancer, female normal, sepsis, heart attack, Alzheimer’s and multiple sclerosis along with their institution-matched controls, female normals and normal samples collected directly onto ice. Methods Endogenous tryptic peptides were extracted from individual sepsis and control EDTA plasma samples in a step gradient of acetonitrile for random and independent sampling by LC–ESI–MS/MS with a set of robust and sensitive linear quadrupole ion traps. The MS/MS spectra were fit to fully tryptic peptides within proteins using the X!TANDEM algorithm. The protein observation frequency was counted using the SEQUEST algorithm after selecting the single best charge state and peptide sequence for each MS/MS spectra. The protein observation frequency of ICU-sepsis versus ICU Control was subsequently tested by Chi square analysis. The average protein or peptide log10 precursor intensity was compared across disease and control treatments by ANOVA in the R statistical system. Results Peptides and/or phosphopeptides of common plasma proteins such as ITIH3, SAA2, SAA1, and FN1 showed increased observation frequency by Chi square (χ2 > 9, p < 0.003) and/or precursor intensity in sepsis. Cellular gene symbols with large Chi square values from tryptic peptides included POTEB, CTNNA1, U2SURP, KIF24, NLGN2, KSR1, GTF2H1, KIT, RPS6KL1, VAV2, HSPA7, SMC2, TCEB3B, ZNF300, SUPV3L1, ADAMTS20, LAMB4, MCCC1, SUPT6H, SCN9A, SBNO1, EPHA1, ABLIM2, cB5E3.2, EPHA10, GRIN2B, HIVEP2, CCL16, TKT, LRP2 and TMF1 amongst others showed increased observation frequency. Similarly, increased frequency of tryptic phosphopeptides were observed from POM121C, SCN8A, TMED8, NSUN7, SLX4, MADD, DNLZ, PDE3B, UTY, DEPDC7, MTX1, MYO1E, RXRB, SYDE1, FN1, PUS7L, FYCO1, USP26, ACAP2, AHI1, KSR2, LMAN1, ZNF280D and SLC8A2 amongst others. Increases in mean precursor intensity in peptides from common plasma proteins such as ITIH3, SAA2, SAA1, and FN1 as well as cellular proteins such as COL24A1, POTEB, KANK1, SDCBP2, DNAH11, ADAMTS7, MLLT1, TTC21A, TSHR, SLX4, MTCH1, and PUS7L among others were associated with sepsis. The processing of SAA1 included the cleavage of the terminal peptide D/PNHFRPAGLPEKY from the most hydrophilic point of SAA1 on the COOH side of the cystatin C binding that was most apparent in ICU-Sepsis patients compared to all other diseases and controls. Additional cleavage of SAA1 on the NH2 terminus side of the cystatin binding site were observed in ICU-Sepsis. Thus there was disease associated variation in the processing of SAA1 in ICU-Sepsis versus ICU controls or other diseases and controls. Conclusion Specific proteins and peptides that vary between diseases might be discovered by the random and independent sampling of multiple disease and control plasma from different hospital and clinics by LC–ESI–MS/MS for storage in a relational SQL Server database and analysis with the R statistical system that will be a powerful tool for clinical research. The processing of SAA1 may play an unappreciated role in the inflammatory response to Sepsis.
Collapse
Affiliation(s)
- Thanusi Thavarajah
- Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St., Toronto, ON Canada
| | - Claudia C Dos Santos
- St. Michael's Hospital, Keenan Research Centre for Biomedical Science, Toronto, Canada
| | - Arthur S Slutsky
- St. Michael's Hospital, Keenan Chair in Medicine, University of Toronto, Toronto, Canada
| | - John C Marshall
- International Biobank of Luxembourg (IBBL), Institute of Health (formerly CRP Sante Luxembourg), Dudelange, Luxembourg
| | - Pete Bowden
- Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St., Toronto, ON Canada
| | - Alexander Romaschin
- St. Michael's Hospital, Keenan Research Centre for Biomedical Science, Toronto, Canada
| | - John G Marshall
- Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St., Toronto, ON Canada.,International Biobank of Luxembourg (IBBL), Institute of Health (formerly CRP Sante Luxembourg), Dudelange, Luxembourg
| |
Collapse
|
3
|
Yu Y, Liu X, Liu L, Sief M. The adaptivity of thresholding wavelet estimators in heteroscedastic nonparametric model with negatively super-additive dependent errors. J Korean Stat Soc 2020. [DOI: 10.1007/s42952-020-00049-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
4
|
Dufresne J, Bowden P, Thavarajah T, Florentinus-Mefailoski A, Chen ZZ, Tucholska M, Norzin T, Ho MT, Phan M, Mohamed N, Ravandi A, Stanton E, Slutsky AS, Dos Santos CC, Romaschin A, Marshall JC, Addison C, Malone S, Heyland D, Scheltens P, Killestein J, Teunissen C, Diamandis EP, Siu KWM, Marshall JG. The plasma peptides of breast versus ovarian cancer. Clin Proteomics 2019; 16:43. [PMID: 31889940 PMCID: PMC6927194 DOI: 10.1186/s12014-019-9262-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2019] [Accepted: 12/05/2019] [Indexed: 02/07/2023] Open
Abstract
Background There is a need to demonstrate a proof of principle that proteomics has the capacity to analyze plasma from breast cancer versus other diseases and controls in a multisite clinical trial design. The peptides or proteins that show a high observation frequency, and/or precursor intensity, specific to breast cancer plasma might be discovered by comparison to other diseases and matched controls. The endogenous tryptic peptides of breast cancer plasma were compared to ovarian cancer, female normal, sepsis, heart attack, Alzheimer's and multiple sclerosis along with the institution-matched normal and control samples collected directly onto ice. Methods Endogenous tryptic peptides were extracted from individual breast cancer and control EDTA plasma samples in a step gradient of acetonitrile, and collected over preparative C18 for LC-ESI-MS/MS with a set of LTQ XL linear quadrupole ion traps working together in parallel to randomly and independently sample clinical populations. The MS/MS spectra were fit to fully tryptic peptides or phosphopeptides within proteins using the X!TANDEM algorithm. The protein observation frequency was counted using the SEQUEST algorithm after selecting the single best charge state and peptide sequence for each MS/MS spectra. The observation frequency was subsequently tested by Chi Square analysis. The log10 precursor intensity was compared by ANOVA in the R statistical system. Results Peptides and/or phosphopeptides of common plasma proteins such as APOE, C4A, C4B, C3, APOA1, APOC2, APOC4, ITIH3 and ITIH4 showed increased observation frequency and/or precursor intensity in breast cancer. Many cellular proteins also showed large changes in frequency by Chi Square (χ2 > 100, p < 0.0001) in the breast cancer samples such as CPEB1, LTBP4, HIF-1A, IGHE, RAB44, NEFM, C19orf82, SLC35B1, 1D12A, C8orf34, HIF1A, OCLN, EYA1, HLA-DRB1, LARS, PTPDC1, WWC1, ZNF562, PTMA, MGAT1, NDUFA1, NOGOC, OR1E1, OR1E2, CFI, HSA12, GCSH, ELTD1, TBX15, NR2C2, FLJ00045, PDLIM1, GALNT9, ASH2L, PPFIBP1, LRRC4B, SLCO3A1, BHMT2, CS, FAM188B2, LGALS7, SAT2, SFRS8, SLC22A12, WNT9B, SLC2A4, ZNF101, WT1, CCDC47, ERLIN1, SPFH1, EID2, THOC1, DDX47, MREG, PTPRE, EMILIN1, DKFZp779G1236 and MAP3K8 among others. The protein gene symbols with large Chi Square values were significantly enriched in proteins that showed a complex set of previously established functional and structural relationships by STRING analysis. An increase in mean precursor intensity of peptides was observed for QSER1 as well as SLC35B1, IQCJ-SCHIP1, MREG, BHMT2, LGALS7, THOC1, ANXA4, DHDDS, SAT2, PTMA and FYCO1 among others. In contrast, the QSER1 peptide QPKVKAEPPPK was apparently specific to ovarian cancer. Conclusion There was striking agreement between the breast cancer plasma peptides and proteins discovered by LC-ESI-MS/MS with previous biomarkers from tumors, cells lines or body fluids by genetic or biochemical methods. The results indicate that variation in plasma peptides from breast cancer versus ovarian cancer may be directly discovered by LC-ESI-MS/MS that will be a powerful tool for clinical research. It may be possible to use a battery of sensitive and robust linear quadrupole ion traps for random and independent sampling of plasma from a multisite clinical trial.
Collapse
Affiliation(s)
- Jaimie Dufresne
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St., Toronto, ON Canada
| | - Pete Bowden
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St., Toronto, ON Canada
| | - Thanusi Thavarajah
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St., Toronto, ON Canada
| | - Angelique Florentinus-Mefailoski
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St., Toronto, ON Canada
| | - Zhuo Zhen Chen
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St., Toronto, ON Canada
| | - Monika Tucholska
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St., Toronto, ON Canada
| | - Tenzin Norzin
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St., Toronto, ON Canada
| | - Margaret Truc Ho
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St., Toronto, ON Canada
| | - Morla Phan
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St., Toronto, ON Canada
| | - Nargiz Mohamed
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St., Toronto, ON Canada
| | - Amir Ravandi
- 2Institute of Cardiovascular Sciences, St. Boniface Hospital Research Center, University of Manitoba, Winnipeg, Canada
| | - Eric Stanton
- 3Division of Cardiology, Department of Medicine, McMaster University, Hamilton, Canada
| | - Arthur S Slutsky
- 4St. Michael's Hospital, Keenan Chair in Medicine, Professor of Medicine, Surgery & Biomedical Engineering, University of Toronto, Toronto, Canada
| | - Claudia C Dos Santos
- 5St. Michael's Hospital, Keenan Research Centre for Biomedical Science, Toronto, Canada
| | - Alexander Romaschin
- 5St. Michael's Hospital, Keenan Research Centre for Biomedical Science, Toronto, Canada
| | - John C Marshall
- 5St. Michael's Hospital, Keenan Research Centre for Biomedical Science, Toronto, Canada
| | - Christina Addison
- 6Program for Cancer Therapeutics, Ottawa Hospital Research Institute, Ottawa, Canada
| | - Shawn Malone
- 6Program for Cancer Therapeutics, Ottawa Hospital Research Institute, Ottawa, Canada
| | - Daren Heyland
- 7Clinical Evaluation Research Unit, Kingston General Hospital, Kingston, Canada
| | - Philip Scheltens
- 8Alzheimer Center, Dept of Neurology, Amsterdam University Medical Centers, Vrije Universiteit, Amsterdam Neuroscience, Amsterdam, The Netherlands
| | - Joep Killestein
- 9MS Center, Dept of Neurology, Amsterdam University Medical Centers, Vrije Universiteit, Amsterdam Neuroscience, Amsterdam, The Netherlands
| | - Charlotte Teunissen
- 10Neurochemistry Lab and Biobank, Dept of Clinical Chemsitry, Amsterdam University Medical Centers, Vrije Universiteit, Amsterdam Neuroscience, Amsterdam, The Netherlands
| | | | - K W M Siu
- 12University of Windsor, Windsor, Canada
| | - John G Marshall
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St., Toronto, ON Canada.,13International Biobank of Luxembourg (IBBL), Luxembourg Institute of Health (formerly CRP Sante Luxembourg), Strassen, Luxembourg
| |
Collapse
|
5
|
Yocum KM, Smith HH, Todd EW, Mora L, Gerakines PA, Milam SN, Widicus Weaver SL. Millimeter/Submillimeter Spectroscopic Detection of Desorbed Ices: A New Technique in Laboratory Astrochemistry. J Phys Chem A 2019; 123:8702-8708. [PMID: 31556610 DOI: 10.1021/acs.jpca.9b04587] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
A new laboratory technique has been developed that utilizes gas-phase, direct-absorption millimeter and submillimeter spectroscopy to detect and identify desorbed species from interstellar and cometary ice analogues. Rotational spectroscopy is a powerful structure-specific technique for detecting isomers and other species possessing the same mass that are indistinguishable with mass spectrometry. Furthermore, the resultant laboratory spectra are directly comparable to observational data from far-infrared and millimeter telescopes. Here, we present the proof-of-concept measurements of the detection of thermally desorbed H2O, D2O, and CH3OH originating in a solid film created at low temperature (∼12 K). The surface binding energy of H2O is reported and compared to results from traditional techniques, including mass spectrometry and quartz-crystal microbalance measurements of mass loss. Lastly, we demonstrate that this technique can be used to derive thermodynamic values including the sublimation enthalpy and entropy of H2O.
Collapse
Affiliation(s)
- Katarina M Yocum
- Department of Chemistry , Emory University , Atlanta 30322 , Georgia , United States
| | - Houston H Smith
- Department of Chemistry , Emory University , Atlanta 30322 , Georgia , United States
| | - Ethan W Todd
- Department of Chemistry , Emory University , Atlanta 30322 , Georgia , United States
| | - Leslie Mora
- Department of Chemistry , Emory University , Atlanta 30322 , Georgia , United States
| | - Perry A Gerakines
- Astrochemistry Laboratory , NASA Goddard Space Flight Center , Greenbelt 20771 , Maryland , United States
| | - Stefanie N Milam
- Astrochemistry Laboratory , NASA Goddard Space Flight Center , Greenbelt 20771 , Maryland , United States
| | | |
Collapse
|
6
|
Vidal SEL, Tamamoto KA, Nguyen H, Abbott RD, Cairns DM, Kaplan DL. 3D biomaterial matrix to support long term, full thickness, immuno-competent human skin equivalents with nervous system components. Biomaterials 2019; 198:194-203. [PMID: 29709325 PMCID: PMC6200656 DOI: 10.1016/j.biomaterials.2018.04.044] [Citation(s) in RCA: 44] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2018] [Revised: 04/07/2018] [Accepted: 04/21/2018] [Indexed: 01/16/2023]
Abstract
Current commercially available human skin equivalents (HSEs) are used for relatively short term studies (∼1 week) due in part to the time-dependent contraction of the collagen gel-based matrix and the limited cell types and skin tissue components utilized. In contrast, here we describe a new matrix consisting of a silk-collagen composite system that provides long term, stable cultivation with reduced contraction and degradation over time. This matrix supports full thickness skin equivalents which include nerves. The unique silk-collagen composite system preserves cell-binding domains of collagen while maintaining the stability and mechanics of the skin system for long-term culture with silk. The utility of this new composite protein-based biomaterial was demonstrated by bioengineering full thickness human skin systems using primary cells, including nerves and immune cells to establish an HSE with a neuro-immuno-cutaneous system. The HSEs with neurons and hypodermis, compared to in vitro skin-only HSEs controls, demonstrated higher secretion of pro-inflammatory cytokines. Proteomics analysis confirmed the presence of several proteins associated with inflammation across all sample groups, but HSEs with neurons had the highest amount of detected protein due to the complexity of the model. This improved, in vitro full thickness HSE model system utilizes cross-linked silk-collagen as the biomaterial and allows reduced reliance on animal models and provides a new in vitro tissue system for the assessment of chronic responses related to skin diseases and drug discovery.
Collapse
Affiliation(s)
| | - Kasey A Tamamoto
- Tufts University, Department of Chemistry, Medford, MA 02155, USA
| | - Hanh Nguyen
- Tufts University, Department of Child Studies and Human Development, Medford, MA 02155, USA
| | - Rosalyn D Abbott
- Carnegie Mellon University, Department of Biomedical Engineering, Pittsburg, PA 15213, USA
| | - Dana M Cairns
- Tufts University, Department of Biomedical Engineering, 4 Colby St., Medford, MA 02155, USA
| | - David L Kaplan
- Tufts University, Department of Biomedical Engineering, 4 Colby St., Medford, MA 02155, USA.
| |
Collapse
|
7
|
Dufresne J, Bowden P, Thavarajah T, Florentinus-Mefailoski A, Chen ZZ, Tucholska M, Norzin T, Ho MT, Phan M, Mohamed N, Ravandi A, Stanton E, Slutsky AS, Dos Santos CC, Romaschin A, Marshall JC, Addison C, Malone S, Heyland D, Scheltens P, Killestein J, Teunissen CE, Diamandis EP, Michael Siu KW, Marshall JG. The plasma peptides of ovarian cancer. Clin Proteomics 2018; 15:41. [PMID: 30598658 PMCID: PMC6302491 DOI: 10.1186/s12014-018-9215-z] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2018] [Accepted: 11/19/2018] [Indexed: 12/13/2022] Open
Abstract
Background It may be possible to discover new diagnostic or therapeutic peptides or proteins from blood plasma by using liquid chromatography and tandem mass spectrometry to identify, quantify and compare the peptides cleaved ex vivo from different clinical populations. The endogenous tryptic peptides of ovarian cancer plasma were compared to breast cancer and female cancer normal controls, other diseases with their matched or normal controls, plus ice cold plasma to control for pre-analytical variation. Methods The endogenous tryptic peptides or tryptic phospho peptides (i.e. without exogenous digestion) were analyzed from 200 μl of EDTA plasma. The plasma peptides were extracted by a step gradient of organic/water with differential centrifugation, dried, and collected over C18 for analytical HPLC nano electrospray ionization and tandem mass spectrometry (LC–ESI–MS/MS) with a linear quadrupole ion trap. The endogenous peptides of ovarian cancer were compared to multiple disease and normal samples from different institutions alongside ice cold controls. Peptides were randomly and independently sampled by LC–ESI–MS/MS. Precursor ions from peptides > E4 counts were identified by the SEQUEST and X!TANDEM algorithms, filtered in SQL Server, before testing of frequency counts by Chi Square (χ2), for analysis with the STRING algorithm, and comparison of precursor intensity by ANOVA in the R statistical system with the Tukey-Kramer Honestly Significant Difference (HSD) test. Results Peptides and/or phosphopeptides of common plasma proteins such as HPR, HP, HPX, and SERPINA1 showed increased observation frequency and/or precursor intensity in ovarian cancer. Many cellular proteins showed large changes in frequency by Chi Square (χ2 > 60, p < 0.0001) in the ovarian cancer samples such as ZNF91, ZNF254, F13A1, LOC102723511, ZNF253, QSER1, P4HA1, GPC6, LMNB2, PYGB, NBR1, CCNI2, LOC101930455, TRPM5, IGSF1, ITGB1, CHD6, SIRT1, NEFM, SKOR2, SUPT20HL1, PLCE1, CCDC148, CPSF3, MORN3, NMI, XTP11, LOC101927572, SMC5, SEMA6B, LOXL3, SEZ6L2, and DHCR24. The protein gene symbols with large Chi Square values were significantly enriched in proteins that showed a complex set of previously established functional and structural relationships by STRING analysis. Analysis of the frequently observed proteins by ANOVA confirmed increases in mean precursor intensity in ZFN91, TRPM5, SIRT1, CHD6, RIMS1, LOC101930455 (XP_005275896), CCDC37 and GIMAP4 between ovarian cancer versus normal female and other diseases or controls by the Tukey–Kramer HSD test. Conclusion Here we show that separation of endogenous peptides with a step gradient of organic/water and differential centrifugation followed by random and independent sampling by LC–ESI–MS/MS with analysis of peptide frequency and intensity by SQL Server and R revealed significant difference in the ex vivo cleavage of peptides between ovarian cancer and other clinical treatments. There was striking agreement between the proteins discovered from cancer plasma versus previous biomarkers discovered in tumors by genetic or biochemical methods. The results indicate that variation in plasma proteins from ovarian cancer may be directly discovered by LC–ESI–MS/MS that will be a powerful tool for clinical research. Electronic supplementary material The online version of this article (10.1186/s12014-018-9215-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jaimie Dufresne
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Ryerson University, Toronto, Canada
| | - Pete Bowden
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Ryerson University, Toronto, Canada
| | - Thanusi Thavarajah
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Ryerson University, Toronto, Canada
| | | | - Zhuo Zhen Chen
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Ryerson University, Toronto, Canada
| | - Monika Tucholska
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Ryerson University, Toronto, Canada
| | - Tenzin Norzin
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Ryerson University, Toronto, Canada
| | - Margaret Truc Ho
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Ryerson University, Toronto, Canada
| | - Morla Phan
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Ryerson University, Toronto, Canada
| | - Nargiz Mohamed
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Ryerson University, Toronto, Canada
| | - Amir Ravandi
- 2Institute of Cardiovascular Sciences, St Boniface Hospital Research Center, University of Manitoba, Winnipeg, Canada
| | - Eric Stanton
- 3Division of Cardiology, Department of Medicine, McMaster University, Hamilton, Canada
| | - Arthur S Slutsky
- 4Keenan Chair in Medicine, St. Michael's Hospital, University of Toronto, Toronto, Canada
| | - Claudia C Dos Santos
- 5Keenan Research Centre for Biomedical Science, St. Michael's Hospital, Toronto, Canada
| | - Alexander Romaschin
- 5Keenan Research Centre for Biomedical Science, St. Michael's Hospital, Toronto, Canada
| | - John C Marshall
- 5Keenan Research Centre for Biomedical Science, St. Michael's Hospital, Toronto, Canada
| | - Christina Addison
- 6Program for Cancer Therapeutics, Ottawa Hospital Research Institute, Ottawa, Canada
| | - Shawn Malone
- 6Program for Cancer Therapeutics, Ottawa Hospital Research Institute, Ottawa, Canada
| | - Daren Heyland
- 7Clinical Evaluation Research Unit, Kingston General Hospital, Kingston, Canada
| | - Philip Scheltens
- 8Alzheimer Center, Department of Neurology, Amsterdam Neuroscience, Amsterdam University Medical Centers, Vrije Universiteit, Amsterdam, The Netherlands
| | - Joep Killestein
- 9MS Center, Department of Neurology, Amsterdam Neuroscience, Amsterdam University Medical Centers, Vrije Universiteit, Amsterdam, The Netherlands
| | - Charlotte E Teunissen
- 10Neurochemistry Lab and Biobank, Department of Clinical Chemistry, Amsterdam Neuroscience, Amsterdam University Medical Centers, Vrije Universiteit, Amsterdam, The Netherlands
| | | | | | - John G Marshall
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Ryerson University, Toronto, Canada.,13International Biobank of Luxembourg (IBBL), Luxembourg Institute of Health (formerly CRP Sante Luxembourg), Strassen, Luxembourg.,14Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St., Toronto, ON Canada
| |
Collapse
|
8
|
Dufresne J, Bowden P, Thavarajah T, Florentinus-Mefailoski A, Chen ZZ, Tucholska M, Norzin T, Ho MT, Phan M, Mohamed N, Ravandi A, Stanton E, Slutsky AS, Dos Santos CC, Romaschin A, Marshall JC, Addison C, Malone S, Heyland D, Scheltens P, Killestein J, Teunissen C, Diamandis EP, Siu KWM, Marshall JG. The plasma peptidome. Clin Proteomics 2018; 15:39. [PMID: 30519149 PMCID: PMC6271647 DOI: 10.1186/s12014-018-9211-3] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2018] [Accepted: 10/23/2018] [Indexed: 02/07/2023] Open
Abstract
Background It may be possible to discover new diagnostic or therapeutic peptides or proteins from blood plasma using LC–ESI–MS/MS to identify, with a linear quadrupole ion trap to identify, quantify and compare the statistical distributions of peptides cleaved ex vivo from plasma samples from different clinical populations. Methods A systematic method for the organic fractionation of plasma peptides was applied to identify and quantify the endogenous tryptic peptides from human plasma from multiple institutions by C18 HPLC followed nano electrospray ionization and tandem mass spectrometry (LC–ESI–MS/MS) with a linear quadrupole ion trap. The endogenous tryptic peptides, or tryptic phospho peptides (i.e. without exogenous digestion), were extracted in a mixture of organic solvent and water, dried and collected by preparative C18. The tryptic peptides from 6 institutions with 12 different disease and normal EDTA plasma populations, alongside ice cold controls for pre-analytical variation, were characterized by mass spectrometry. Each patient plasma was precipitated in 90% acetonitrile and the endogenous tryptic peptides extracted by a stepwise gradient of increasing water and then formic acid resulting in 10 sub-fractions. The fractionated peptides were manually collected over preparative C18 and injected for 1508 LC–ESI–MS/MS experiments analyzed in SQL Server R. Results Peptides that were cleaved in human plasma by a tryptic activity ex vivo provided convenient and sensitive access to most human proteins in plasma that show differences in the frequency or intensity of proteins observed across populations that may have clinical significance. Combination of step wise organic extraction of 200 μL of plasma with nano electrospray resulted in the confident identification and quantification ~ 14,000 gene symbols by X!TANDEM that is the largest number of blood proteins identified to date and shows that you can monitor the ex vivo proteolysis of most human proteins, including interleukins, from blood. A total of 15,968,550 MS/MS spectra ≥ E4 intensity counts were correlated by the SEQUEST and X!TANDEM algorithms to a federated library of 157,478 protein sequences that were filtered for best charge state (2+ or 3+) and peptide sequence in SQL Server resulting in 1,916,672 distinct best-fit peptide correlations for analysis with the R statistical system. SEQUEST identified some 140,054 protein accessions, or some ~ 26,000 gene symbols, proteins or loci, with at least 5 independent correlations. The X!TANDEM algorithm made at least 5 best fit correlations to more than 14,000 protein gene symbols with p-values and FDR corrected q-values of ~ 0.001 or less. Log10 peptide intensity values showed a Gaussian distribution from E8 to E4 arbitrary counts by quantile plot, and significant variation in average precursor intensity across the disease and controls treatments by ANOVA with means compared by the Tukey–Kramer test. STRING analysis of the top 2000 gene symbols showed a tight association of cellular proteins that were apparently present in the plasma as protein complexes with related cellular components, molecular functions and biological processes. Conclusions The random and independent sampling of pre-fractionated blood peptides by LC-ESI-MS/MS with SQL Server-R analysis revealed the largest plasma proteome to date and was a practical method to quantify and compare the frequency or log10 intensity of individual proteins cleaved ex vivo across populations of plasma samples from multiple clinical locations to discover treatment-specific variation using classical statistics suitable for clinical science. It was possible to identify and quantify nearly all human proteins from EDTA plasma and compare the results of thousands of LC–ESI–MS/MS experiments from multiple clinical populations using standard database methods in SQL Server and classical statistical strategies in the R data analysis system. Electronic supplementary material The online version of this article (10.1186/s12014-018-9211-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jaimie Dufresne
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St, Toronto, ON Canada
| | - Pete Bowden
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St, Toronto, ON Canada
| | - Thanusi Thavarajah
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St, Toronto, ON Canada
| | - Angelique Florentinus-Mefailoski
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St, Toronto, ON Canada
| | - Zhuo Zhen Chen
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St, Toronto, ON Canada
| | - Monika Tucholska
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St, Toronto, ON Canada
| | - Tenzin Norzin
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St, Toronto, ON Canada
| | - Margaret Truc Ho
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St, Toronto, ON Canada
| | - Morla Phan
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St, Toronto, ON Canada
| | - Nargiz Mohamed
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St, Toronto, ON Canada
| | - Amir Ravandi
- 2Institute of Cardiovascular Sciences, St Boniface Hospital Research Center, University of Manitoba, Winnipeg, Canada
| | - Eric Stanton
- 3Division of Cardiology, Department of Medicine, McMaster University, Hamilton, Canada
| | - Arthur S Slutsky
- 4St. Michael's Hospital, Keenan Chair in Medicine, University of Toronto, Toronto, Canada
| | - Claudia C Dos Santos
- 5St. Michael's Hospital, Keenan Research Centre for Biomedical Science, Toronto, Canada
| | - Alexander Romaschin
- 5St. Michael's Hospital, Keenan Research Centre for Biomedical Science, Toronto, Canada
| | - John C Marshall
- 5St. Michael's Hospital, Keenan Research Centre for Biomedical Science, Toronto, Canada
| | - Christina Addison
- 6Program for Cancer Therapeutics, Ottawa Hospital Research Institute, Ottawa, Canada
| | - Shawn Malone
- 6Program for Cancer Therapeutics, Ottawa Hospital Research Institute, Ottawa, Canada
| | - Daren Heyland
- 7Clinical Evaluation Research Unit, Kingston General Hospital, Kingston, Canada
| | - Philip Scheltens
- 8Alzheimer Center, Department of Neurology, Amsterdam University Medical Centers, Vrije Universiteit, Amsterdam Neuroscience, Amsterdam, The Netherlands
| | - Joep Killestein
- 9MS Center, Department of Neurology, Amsterdam University Medical Centers, Vrije Universiteit, Amsterdam Neuroscience, Amsterdam, The Netherlands
| | - Charlotte Teunissen
- 10Neurochemistry Lab and Biobank, Department of Clinical Chemistry, Amsterdam University Medical Centers, Vrije Universiteit, Amsterdam Neuroscience, Amsterdam, The Netherlands
| | | | - K W M Siu
- 12University of Windsor, Windsor, Canada
| | - John G Marshall
- 1Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, 350 Victoria St, Toronto, ON Canada.,13International Biobank of Luxembourg (IBBL), Luxembourg Institute of Health (formerly CRP Sante Luxembourg), Strassen, Luxembourg
| |
Collapse
|
9
|
Dufresne J, Florentinus-Mefailoski A, Zhu PH, Bowden P, Marshall JG. Re-evaluation of the rabbit myosin protein standard used to create the empirical statistical model for decoy library searching. Anal Biochem 2018; 560:39-49. [PMID: 30171831 DOI: 10.1016/j.ab.2018.08.025] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2017] [Revised: 08/22/2018] [Accepted: 08/28/2018] [Indexed: 01/28/2023]
Abstract
A Rabbit myosin standard, like that used to create the empirical statistical model, was randomly and independently sampled by liquid chromatography micro electrospray ionization and tandem mass spectrometry (LC-ESI-MS/MS) with a linear quadrupole ion trap. The rabbit myosin protein standard appeared pure by SDS-PAGE and CBBR staining but showed many other proteins by silver staining. The LC-MS intensity from myosin and IgG samples were above the 99% safe limit of detection and quantification computed from 36 blank LC-ESI-MS/MS runs. The myosin contained ≤406 Gene Symbols, open reading frames or loci where 79 protein types showed ≥3 peptides from X!TANDEM. Myosins, actin, troponin, other proteins showed 95%-100% homology between the rabbit versus the human decoy library. The myosin protein complex from STRING was true positive compared to random or noise spectra MS/MS with a low type I error (p-value) and low FDR (q-value) computed in R. SDS-PAGE, Western blot, comparison to random and noise MS/MS spectra, X!TANDEM p-values, FDR corrected q-values, and STRING all agreed that the error rate of LC-ESI-MS/MS with a quadrupole ion trap is far below that assumed a priori by the design of the empirical statistical model for decoy library searching.
Collapse
Affiliation(s)
- Jaimie Dufresne
- Ryerson Analytical Biochemistry Laboratory (RABL), Kerr Hall East Rm 332b, Department of Chemistry and Biology, Faculty of Science, Ryerson University, Canada
| | - Angelique Florentinus-Mefailoski
- Ryerson Analytical Biochemistry Laboratory (RABL), Kerr Hall East Rm 332b, Department of Chemistry and Biology, Faculty of Science, Ryerson University, Canada
| | - Pei-Hong Zhu
- Ryerson Analytical Biochemistry Laboratory (RABL), Kerr Hall East Rm 332b, Department of Chemistry and Biology, Faculty of Science, Ryerson University, Canada
| | - Peter Bowden
- Ryerson Analytical Biochemistry Laboratory (RABL), Kerr Hall East Rm 332b, Department of Chemistry and Biology, Faculty of Science, Ryerson University, Canada
| | - John G Marshall
- Ryerson Analytical Biochemistry Laboratory (RABL), Kerr Hall East Rm 332b, Department of Chemistry and Biology, Faculty of Science, Ryerson University, Canada.
| |
Collapse
|
10
|
Klich A, Mercier C, Gerfault L, Grangeat P, Beaulieu C, Degout-Charmette E, Fortin T, Mahé P, Giovannelli JF, Charrier JP, Giremus A, Maucort-Boulch D, Roy P. Variance component analysis to assess protein quantification in biomarker validation: application to selected reaction monitoring-mass spectrometry. BMC Bioinformatics 2018; 19:73. [PMID: 29490628 PMCID: PMC5831836 DOI: 10.1186/s12859-018-2075-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2017] [Accepted: 02/20/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In the field of biomarker validation with mass spectrometry, controlling the technical variability is a critical issue. In selected reaction monitoring (SRM) measurements, this issue provides the opportunity of using variance component analysis to distinguish various sources of variability. However, in case of unbalanced data (unequal number of observations in all factor combinations), the classical methods cannot correctly estimate the various sources of variability, particularly in presence of interaction. The present paper proposes an extension of the variance component analysis to estimate the various components of the variance, including an interaction component in case of unbalanced data. RESULTS We applied an experimental design that uses a serial dilution to generate known relative protein concentrations and estimated these concentrations by two processing algorithms, a classical and a more recent one. The extended method allowed estimating the variances explained by the dilution and the technical process by each algorithm in an experiment with 9 proteins: L-FABP, 14.3.3 sigma, Calgi, Def.A6, Villin, Calmo, I-FABP, Peroxi-5, and S100A14. Whereas, the recent algorithm gave a higher dilution variance and a lower technical variance than the classical one in two proteins with three peptides (L-FABP and Villin), there were no significant difference between the two algorithms on all proteins. CONCLUSIONS The extension of the variance component analysis was able to estimate correctly the variance components of protein concentration measurement in case of unbalanced design.
Collapse
Affiliation(s)
- Amna Klich
- Service de Biostatistique-Bioinformatique, Hospices Civils de Lyon, 162, avenue Lacassagne, F-69003, Lyon, France. .,Université de Lyon, Lyon, France. .,PRABI, Université Lyon 1, Villeurbanne, France. .,CNRS UMR 5558, LBBE, Équipe Biostatistique Santé, Villeurbanne, France.
| | - Catherine Mercier
- Service de Biostatistique-Bioinformatique, Hospices Civils de Lyon, 162, avenue Lacassagne, F-69003, Lyon, France.,Université de Lyon, Lyon, France.,PRABI, Université Lyon 1, Villeurbanne, France.,CNRS UMR 5558, LBBE, Équipe Biostatistique Santé, Villeurbanne, France
| | - Laurent Gerfault
- Université Grenoble-Alpes, F-38000, Grenoble, France.,Commissariat à l'Énergie Atomique, Laboratoire d'Électronique et de Technologie de l'Information, MINATEC Campus, Département Micro-technologies pour la Biologie et la Santé, F-38054, Grenoble, France
| | - Pierre Grangeat
- Université Grenoble-Alpes, F-38000, Grenoble, France.,Commissariat à l'Énergie Atomique, Laboratoire d'Électronique et de Technologie de l'Information, MINATEC Campus, Département Micro-technologies pour la Biologie et la Santé, F-38054, Grenoble, France
| | - Corinne Beaulieu
- Innovation Unit, Technology Research Department, bioMérieux, F-69280, Marcy l'Étoile, France
| | - Elodie Degout-Charmette
- Innovation Unit, Technology Research Department, bioMérieux, F-69280, Marcy l'Étoile, France
| | - Tanguy Fortin
- Innovation Unit, Technology Research Department, bioMérieux, F-69280, Marcy l'Étoile, France.,, Present Address: Villeurbanne, France
| | - Pierre Mahé
- Innovation Unit, Technology Research Department, bioMérieux, F-38000, Grenoble, France
| | - Jean-François Giovannelli
- Intégration du Matériau au Système (Université de Bordeaux, CNRS, Bordeaux Aquitaine INP), F-33400, Talence, France
| | - Jean-Philippe Charrier
- Innovation Unit, Technology Research Department, bioMérieux, F-69280, Marcy l'Étoile, France
| | - Audrey Giremus
- Intégration du Matériau au Système (Université de Bordeaux, CNRS, Bordeaux Aquitaine INP), F-33400, Talence, France
| | - Delphine Maucort-Boulch
- Service de Biostatistique-Bioinformatique, Hospices Civils de Lyon, 162, avenue Lacassagne, F-69003, Lyon, France.,Université de Lyon, Lyon, France.,PRABI, Université Lyon 1, Villeurbanne, France.,CNRS UMR 5558, LBBE, Équipe Biostatistique Santé, Villeurbanne, France
| | - Pascal Roy
- Service de Biostatistique-Bioinformatique, Hospices Civils de Lyon, 162, avenue Lacassagne, F-69003, Lyon, France.,Université de Lyon, Lyon, France.,PRABI, Université Lyon 1, Villeurbanne, France.,CNRS UMR 5558, LBBE, Équipe Biostatistique Santé, Villeurbanne, France
| |
Collapse
|
11
|
Dufresne J, Florentinus-Mefailoski A, Bowden P, Marshall JG. A method for the extraction of the endogenous tryptic peptides (peptidome) from human EDTA plasma. Anal Biochem 2018; 549:188-196. [PMID: 29486203 DOI: 10.1016/j.ab.2018.02.025] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2017] [Revised: 02/21/2018] [Accepted: 02/23/2018] [Indexed: 10/18/2022]
Abstract
The proteins identified from endogenous peptides agreed between serum versus plasma, and tryptic versus non-tryptic peptides, when collected by C18 alone and analyzed by liquid chromatography electrospray ionization and tandem mass spectrometry (LC-ESI-MS/MS) including amyloids, apolipoproteins, haptoglobin, complements, fibrinogens, hemopexin, antitrypsin and alpha 2 macroglobulin. Precipitation of polypeptides from plasma in 9 vol of 100% organic solvent followed by stepwise extraction of the insoluble pellet with an increasing fraction of water identified thousands of proteins. A Coomassie-blue protein binding assay, and tricine SDS-PAGE, showed that Acetonitrile-Water (AH) resulted in a greater relative enrichment of low molecular weight plasma polypeptides than Acetonitrile-Methanol Water (AMH). A total of 905,386 MS/MS spectra greater than ~10,000 (E4) counts were correlated by X!TANDEM to a federated human protein library of 153,124 different protein sequences that resulted in 58,223 fully tryptic peptides from 3463 Gene Symbols of which 1880 had ≥ 5 independent peptides (p ≤ 0.00001). The results were filtered and organized in an SQL database for analysis using the generic R statistical analysis system. Cellular proteins including secreted and exosome proteins, signaling factors, nucleic acid binding proteins, metabolic enzymes and uncharacterized factors were observed with a significant enrichment of expected protein-protein interactions by STRING analysis.
Collapse
Affiliation(s)
- Jaimie Dufresne
- Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, Toronto, ON, M5B 2K3, Canada
| | - Angelique Florentinus-Mefailoski
- Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, Toronto, ON, M5B 2K3, Canada
| | - Pete Bowden
- Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, Toronto, ON, M5B 2K3, Canada
| | - John G Marshall
- Ryerson Analytical Biochemistry Laboratory (RABL), Department of Chemistry and Biology, Faculty of Science, Ryerson University, Toronto, ON, M5B 2K3, Canada.
| |
Collapse
|
12
|
Giacofci M, Lambert-Lacroix S, Picard F. Minimax wavelet estimation for multisample heteroscedastic nonparametric regression. J Nonparametr Stat 2017. [DOI: 10.1080/10485252.2017.1406091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- Madison Giacofci
- Laboratoire LJK, Université de Grenoble et CNRS, Grenoble, France
| | | | | |
Collapse
|
13
|
Dufresne J, Hoang T, Ajambo J, Florentinus-Mefailoski A, Bowden P, Marshall J. Freeze-dried plasma proteins are stable at room temperature for at least 1 year. Clin Proteomics 2017; 14:35. [PMID: 29093647 PMCID: PMC5659006 DOI: 10.1186/s12014-017-9170-0] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2017] [Accepted: 10/11/2017] [Indexed: 12/23/2022] Open
Abstract
Thirty human EDTA plasma samples from male and female subjects ranging in age from 24 to 74 years were collected on ice, processed ice cold and stored frozen at -80 °C, in liquid nitrogen (LN2), or freeze dried and stored at room temperature in a desiccator (FDRT) or freeze dried and stored at -20 °C for 1 year (FD-20). In a separate experiment, EDTA plasma samples were collected onto ice, processed ice cold and maintained on ice ± protease inhibitors versus incubated at room temperature for up to 96 h. Random and independent sampling by liquid chromatography and tandem mass spectrometry (LC-ESI-MS/MS), as correlated by the MASCOT, OMSSA, X!TANDEM and SEQUEST algorithms, showed that tryptic peptides from complement component 4B (C4B) were rapidly released in plasma at room temperature. Random sampling by LC-ESI-MS/MS showed that peptides from C4B were undetectable on ice, but peptides were cleaved from the mature C4B protein including NGFKSHALQLNNR within as little as 1 h at room temperature. The frequency and intensity of precursors within ± 3 m/z of the C4B peptide NGFKSHALQLNNR was confirmed by automated targeted analysis where the precursors from MS/MS spectra that correlated to the target sequence were analyzed in SQL/R. The C4B preproprotein was processed at the N terminus to release the mature chain that was cleaved on the carboxyl side of the isoprene C2 domain within a polar C terminal sequence of the mature C4B protein, to reveal the thioester reaction site, consistent with LC-ESI-MS/MS and Western blot. Random sampling showed that proteolytic peptides from complement component C4B were rarely observed with long term storage at - 80 °C in a freezer or in liquid nitrogen (LN2), freeze drying with storage at - 20 °C (FD-20 °C) or freeze drying and storage at room temperature (FDRT). Plasma samples maintained at room temperature (RT) showed at least 10-fold to 100-fold greater frequency of peptide correlation to C4B and measured peptide intensity compared to samples on ice for up to 72 h or stored at - 80 °C, LN2, FDRT or FD-20 °C for up to a year.
Collapse
Affiliation(s)
- Jaimie Dufresne
- Ryerson University, 350 Victoria Street, Toronto, ON M5B 2K3 Canada
| | - Trung Hoang
- Ryerson University, 350 Victoria Street, Toronto, ON M5B 2K3 Canada
| | - Juliet Ajambo
- Ryerson University, 350 Victoria Street, Toronto, ON M5B 2K3 Canada
| | | | - Peter Bowden
- Ryerson University, 350 Victoria Street, Toronto, ON M5B 2K3 Canada
| | - John Marshall
- Ryerson University, 350 Victoria Street, Toronto, ON M5B 2K3 Canada.,Integrated BioBank of Luxembourg, 6 r. Nicolas-Ernest Barblé, 1210 Luxembourg, Luxembourg
| |
Collapse
|
14
|
Kong A, Azencott R. Binary Markov Random Fields and interpretable mass spectra discrimination. Stat Appl Genet Mol Biol 2017; 16:/j/sagmb.ahead-of-print/sagmb-2016-0019/sagmb-2016-0019.xml. [PMID: 28475101 DOI: 10.1515/sagmb-2016-0019] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
For mass spectra acquired from cancer patients by MALDI or SELDI techniques, automated discrimination between cancer types or stages has often been implemented by machine learning algorithms. Nevertheless, these techniques typically lack interpretability in terms of biomarkers. In this paper, we propose a new mass spectra discrimination algorithm by parameterized Markov Random Fields to automatically generate interpretable classifiers with small groups of scored biomarkers. A dataset of 238 MALDI colorectal mass spectra and two datasets of 216 and 253 SELDI ovarian mass spectra respectively were used to test our approach. The results show that our approach reaches accuracies of 81% to 100% to discriminate between patients from different colorectal and ovarian cancer stages, and performs as well or better than previous studies on similar datasets. Moreover, our approach enables efficient planar-displays to visualize mass spectra discrimination and has good asymptotic performance for large datasets. Thus, our classifiers should facilitate the choice and planning of further experiments for biological interpretation of cancer discriminating signatures. In our experiments, the number of mass spectra for each colorectal cancer stage is roughly half of that for each ovarian cancer stage, so that we reach lower discrimination accuracy for colorectal cancer than for ovarian cancer.
Collapse
|
15
|
Sikdar S, Gill R, Datta S. Improving protein identification from tandem mass spectrometry data by one-step methods and integrating data from other platforms. Brief Bioinform 2015; 17:262-9. [PMID: 26141827 DOI: 10.1093/bib/bbv043] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2015] [Indexed: 01/28/2023] Open
Abstract
MOTIVATION Many approaches have been proposed for the protein identification problem based on tandem mass spectrometry (MS/MS) data. In these experiments, proteins are digested into peptides and the resulting peptide mixture is subjected to mass spectrometry. Some interesting putative peptide features (peaks) are selected from the mass spectra. Following that, the precursor ions undergo fragmentation and are analyzed by MS/MS. The process of identification of peptides from the mass spectra and the constituent proteins in the sample is called protein identification from MS/MS data. There are many two-step protein identification procedures, reviewed in the literature, which first attempt to identify the peptides in a separate process and then use these results to infer the proteins. However, in recent years, there have been attempts to provide a one-step solution to protein identification, which simultaneously identifies the proteins and the peptides in the sample. RESULTS In this review, we briefly introduce the most popular two-step protein identification procedure, PeptideProphet coupled with ProteinProphet. Following that, we describe the difficulties with two-step procedures and review some recently introduced one-step protein/peptide identification procedures that do not suffer from these issues. The focus of this review is on one-step procedures that are based on statistical likelihood-based models, but some discussion of other one-step procedures is also included. We report comparative performances of one-step and two-step methods, which support the overall superiorities of one-step procedures. We also cover some recent efforts to improve protein identification by incorporating other molecular data along with MS/MS data.
Collapse
|
16
|
Cairns DA. Statistical issues in the design and planning of proteomic profiling experiments. Methods Mol Biol 2015; 1243:223-236. [PMID: 25384749 DOI: 10.1007/978-1-4939-1872-0_13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
The statistical design of a clinical proteomics experiment is a critical part of well-undertaken investigation. Standard concepts from experimental design such as randomization, replication and blocking should be applied in all experiments, and this is possible when the experimental conditions are well understood by the investigator. The large number of proteins simultaneously considered in proteomic discovery experiments means that determining the number of required replicates to perform a powerful experiment is more complicated than in simple experiments. However, by using information about the nature of an experiment and making simple assumptions this is achievable for a variety of experiments useful for biomarker discovery and initial validation.
Collapse
Affiliation(s)
- David A Cairns
- Section of Oncology and Clinical Research, Leeds Institute of Cancer and Pathology, St. James's University Hospital, Beckett Street, Leeds, LS9 7TF, UK,
| |
Collapse
|
17
|
Mandel M, Askenazi M, Zhang Y, Marto JA. Variance function estimation in quantitative mass spectrometry with application to iTRAQ labeling. Ann Appl Stat 2013. [DOI: 10.1214/12-aoas572] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
18
|
Giacofci M, Lambert-Lacroix S, Marot G, Picard F. Wavelet-Based Clustering for Mixed-Effects Functional Models in High Dimension. Biometrics 2013; 69:31-40. [DOI: 10.1111/j.1541-0420.2012.01828.x] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Affiliation(s)
- M. Giacofci
- Laboratoire LJK, BP 53; Université de Grenoble et CNRS; 38041 Grenoble cedex 9 France
| | | | - G. Marot
- Projet BAMBOO; INRIA Rhône-Alpes; F-38330 Montbonnot Saint-Martin France
- Biostatistics, EA 2694; UDSL, Université Lille Nord de France
- MODAL; INRIA Lille Nord Europe; F-59650 Villeneuve d'Ascq France
| | - F. Picard
- Biostatistics, EA 2694; UDSL, Université Lille Nord de France
| |
Collapse
|
19
|
Oberg AL, Mahoney DW. Statistical methods for quantitative mass spectrometry proteomic experiments with labeling. BMC Bioinformatics 2012; 13 Suppl 16:S7. [PMID: 23176383 PMCID: PMC3489540 DOI: 10.1186/1471-2105-13-s16-s7] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Mass Spectrometry utilizing labeling allows multiple specimens to be subjected to mass spectrometry simultaneously. As a result, between-experiment variability is reduced. Here we describe use of fundamental concepts of statistical experimental design in the labeling framework in order to minimize variability and avoid biases. We demonstrate how to export data in the format that is most efficient for statistical analysis. We demonstrate how to assess the need for normalization, perform normalization, and check whether it worked. We describe how to build a model explaining the observed values and test for differential protein abundance along with descriptive statistics and measures of reliability of the findings. Concepts are illustrated through the use of three case studies utilizing the iTRAQ 4-plex labeling protocol.
Collapse
Affiliation(s)
- Ann L Oberg
- Division of Biomedical Statistics and Informatics, Department of Health Sciences Research, Mayo Clinic, 200 First Street SW, Rochester, MN 55905, USA.
| | | |
Collapse
|
20
|
Chen LS, Paul D, Prentice RL, Wang P. A regularized Hotelling's T2 test for pathway analysis in proteomic studies. J Am Stat Assoc 2011; 106:1345-1360. [PMID: 23997374 DOI: 10.1198/jasa.2011.ap10599] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Recent proteomic studies have identified proteins related to specific phenotypes. In addition to marginal association analysis for individual proteins, analyzing pathways (functionally related sets of proteins) may yield additional valuable insights. Identifying pathways that differ between phenotypes can be conceptualized as a multivariate hypothesis testing problem: whether the mean vector μ of a p-dimensional random vector X is μ0 . Proteins within the same biological pathway may correlate with one another in a complicated way, and type I error rates can be inflated if such correlations are incorrectly assumed to be absent. The inflation tends to be more pronounced when the sample size is very small or there is a large amount of missingness in the data, as is frequently the case in proteomic discovery studies. To tackle these challenges, we propose a regularized Hotelling's T2 (RHT) statistic together with a non-parametric testing procedure, which effectively controls the type I error rate and maintains good power in the presence of complex correlation structures and missing data patterns. We investigate asymptotic properties of the RHT statistic under pertinent assumptions and compare the test performance with four existing methods through simulation examples. We apply the RHT test to a hormone therapy proteomics data set, and identify several interesting biological pathways for which blood serum concentrations changed following hormone therapy initiation.
Collapse
Affiliation(s)
- Lin S Chen
- Department of Health Studies, The University of Chicago, IL
| | | | | | | |
Collapse
|
21
|
Florentinus AK, Bowden P, Sardana G, Diamandis EP, Marshall JG. Identification and quantification of peptides and proteins secreted from prostate epithelial cells by unbiased liquid chromatography tandem mass spectrometry using goodness of fit and analysis of variance. J Proteomics 2011; 75:1303-17. [PMID: 22120120 DOI: 10.1016/j.jprot.2011.11.002] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2011] [Revised: 10/29/2011] [Accepted: 11/05/2011] [Indexed: 10/15/2022]
Abstract
The proteins secreted by prostate cancer cells (PC3(AR)6) were separated by strong anion exchange chromatography, digested with trypsin and analyzed by unbiased liquid chromatography tandem mass spectrometry with an ion trap. The spectra were matched to peptides within proteins using a goodness of fit algorithm that showed a low false positive rate. The parent ions for MS/MS were randomly and independently sampled from a log-normal population and therefore could be analyzed by ANOVA. Normal distribution analysis confirmed that the parent and fragment ion intensity distributions were sampled over 99.9% of their range that was above the background noise. Arranging the ion intensity data with the identified peptide and protein sequences in structured query language (SQL) permitted the quantification of ion intensity across treatments, proteins and peptides. The intensity of 101,905 fragment ions from 1421 peptide precursors of 583 peptides from 233 proteins separated over 11 sample treatments were computed together in one ANOVA model using the statistical analysis system (SAS) prior to Tukey-Kramer honestly significant difference (HSD) testing. Thus complex mixtures of proteins were identified and quantified with a high degree of confidence using an ion trap without isotopic labels, multivariate analysis or comparing chromatographic retention times.
Collapse
|
22
|
Cairns DA. Statistical issues in quality control of proteomic analyses: Good experimental design and planning. Proteomics 2011; 11:1037-48. [DOI: 10.1002/pmic.201000579] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2010] [Revised: 11/26/2010] [Accepted: 11/29/2010] [Indexed: 12/21/2022]
|
23
|
Xia JQ, Sedransk N, Feng X. Variance component analysis of a multi-site study for the reproducibility of multiple reaction monitoring measurements of peptides in human plasma. PLoS One 2011; 6:e14590. [PMID: 21298095 PMCID: PMC3027641 DOI: 10.1371/journal.pone.0014590] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2010] [Accepted: 12/17/2010] [Indexed: 11/18/2022] Open
Abstract
Background In the Addona et al. paper (Nature Biotechnology 2009), a large-scale multi-site study was performed to quantify Multiple Reaction Monitoring (MRM) measurements of proteins spiked in human plasma. The unlabeled signature peptides derived from the seven target proteins were measured at nine different concentration levels, and their isotopic counterparts were served as the internal standards. Methodology/Principal Findings In this paper, the sources of variation are analyzed by decomposing the variance into parts attributable to specific experimental factors: technical replicates, sites, peptides, transitions within each peptide, and higher-order interaction terms based on carefully built mixed effects models. The factors of peptides and transitions are shown to be major contributors to the variance of the measurements considering heavy (isotopic) peptides alone. For the light (12C) peptides alone, in addition to these factors, the factor of study*peptide also contributes significantly to the variance of the measurements. Heterogeneous peptide component models as well as influence analysis identify the outlier peptides in the study, which are then excluded from the analysis. Using a log-log scale transformation and subtracting the heavy/isotopic peptide [internal standard] measurement from the peptide measurements (i.e., taking the logarithm of the peak area ratio in the original scale establishes that), the MRM measurements are overall consistent across laboratories following the same standard operating procedures, and the variance components related to sites, transitions and higher-order interaction terms involving sites have greatly reduced impact. Thus the heavy peptides have been effective in reducing apparent inter-site variability. In addition, the estimates of intercepts and slopes of the calibration curves are calculated for the sub-studies. Conclusions/Significance The MRM measurements are overall consistent across laboratories following the same standard operating procedures, and heavy peptides can be used as an effective internal standard for reducing apparent inter-site variability. Mixed effects modeling is a valuable tool in mass spectrometry-based proteomics research.
Collapse
Affiliation(s)
- Jessie Q Xia
- National Institute of Statistical Sciences, Research Triangle Park, North Carolina, United States of America.
| | | | | |
Collapse
|
24
|
Borgaonkar SP, Hocker H, Shin H, Markey MK. Comparison of Normalization Methods for the Identification of Biomarkers Using MALDI-TOF and SELDI-TOF Mass Spectra. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2010; 14:115-26. [DOI: 10.1089/omi.2009.0082] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Affiliation(s)
| | - Harrison Hocker
- The University of Texas, Department of Biomedical Engineering, Austin, Texas
| | - Hyunjin Shin
- Harvard School of Public Health, Department of Biostatistics, Dana-Farber Cancer Institute, Boston, Massachusetts
| | - Mia K. Markey
- The University of Texas, Department of Biomedical Engineering, Austin, Texas
| |
Collapse
|
25
|
Fröbel J, Lehr S, Haas R, Czibere A. Mass spectrometry-based proteomics and its potential use in haematological research. Arch Physiol Biochem 2009; 115:286-97. [PMID: 19916740 DOI: 10.3109/13813450903428086] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
In the last decade proteomics has made great progress reaching throughput and comprehensiveness comparable to genomics technologies. Mass spectrometry plays a key role in proteomics and has become an indispensable method for molecular and cellular biology because many cellular changes in response to internal or external stimuli can only be detected at the proteome level. Furthermore, different from genomics which depends on the availability of DNA or RNA, proteomics is not restricted to cellular samples, but also allows the analysis of biological fluids like serum, plasma or urine. This article provides an overview of the recent developments in proteomics techniques useful for haematological research.
Collapse
Affiliation(s)
- Julia Fröbel
- Department of Haematology, Oncology and Clinical Immunology, Heinrich-Heine-University, Moorenstrasse 5, Düsseldorf, Germany.
| | | | | | | |
Collapse
|