1
|
Kaulich PT, Jeong K, Kohlbacher O, Tholey A. Influence of different sample preparation approaches on proteoform identification by top-down proteomics. Nat Methods 2024; 21:2397-2407. [PMID: 39438734 PMCID: PMC11621018 DOI: 10.1038/s41592-024-02481-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Accepted: 09/23/2024] [Indexed: 10/25/2024]
Abstract
Top-down proteomics using mass spectrometry facilitates the identification of intact proteoforms, that is, all molecular forms of proteins. Multiple past advances have lead to the development of numerous sample preparation workflows. Here we systematically investigated the influence of different sample preparation steps on proteoform and protein identifications, including cell lysis, reduction and alkylation, proteoform enrichment, purification and fractionation. We found that all steps in sample preparation influence the subset of proteoforms identified (for example, their number, confidence, physicochemical properties and artificially generated modifications). The various sample preparation strategies resulted in complementary identifications, substantially increasing the proteome coverage. Overall, we identified 13,975 proteoforms from 2,720 proteins of human Caco-2 cells. The results presented can serve as suggestions for designing and adapting top-down proteomics sample preparation strategies to particular research questions. Moreover, we expect that the sampling bias and modifications identified at the intact protein level will also be useful in improving bottom-up proteomics approaches.
Collapse
Affiliation(s)
- Philipp T Kaulich
- Systematic Proteome Research and Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
| | - Kyowon Jeong
- Applied Bioinformatics, Computer Science Department, University of Tübingen, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, Tübingen, Germany
| | - Oliver Kohlbacher
- Applied Bioinformatics, Computer Science Department, University of Tübingen, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, Tübingen, Germany
- Translational Bioinformatics, University Hospital Tübingen, Tübingen, Germany
| | - Andreas Tholey
- Systematic Proteome Research and Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany.
| |
Collapse
|
2
|
Gant MS, Chamot-Rooke J. Present and future perspectives on mass spectrometry for clinical microbiology. Microbes Infect 2024; 26:105296. [PMID: 38199266 DOI: 10.1016/j.micinf.2024.105296] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Revised: 12/01/2023] [Accepted: 01/05/2024] [Indexed: 01/12/2024]
Abstract
In the last decade, MALDI-TOF Mass Spectrometry (MALDI-TOF MS) has been introduced and broadly accepted by clinical laboratory laboratories throughout the world as a powerful and efficient tool for rapid microbial identification. During the MALDI-TOF MS process, microbes are identified using either intact cells or cell extracts. The process is rapid, sensitive, and economical in terms of both labor and costs involved. Whilst MALDI-TOF MS is currently the gold-standard, it suffers from several shortcomings such as lack of direct information on antibiotic resistance, poor depth of analysis and insufficient discriminatory power for the distinction of closely related bacterial species or for reliably sub-differentiating isolates to the level of clones or strains. Thus, new approaches targeting proteins and allowing a better characterization of bacterial strains are strongly needed, if possible, on a very short time scale after sample collection in the hospital. Bottom-up proteomics (BUP) is a nice alternative to MALDI-TOF MS, offering the possibility for in-depth proteome analysis. Top-down proteomics (TDP) provides the highest molecular precision in proteomics, allowing the characterization of proteins at the proteoform level. A number of studies have already demonstrated the potential of these techniques in clinical microbiology. In this review, we will discuss the current state-of-the-art of MALDI-TOF MS for the rapid microbial identification and detection of resistance to antibiotics and describe emerging approaches, including bottom-up and top-down proteomics as well as ambient MS technologies.
Collapse
Affiliation(s)
- Megan S Gant
- Institut Pasteur, Université Paris Cité, CNRS UAR 2024, Mass Spectrometry for Biology 75015 Paris, France
| | - Julia Chamot-Rooke
- Institut Pasteur, Université Paris Cité, CNRS UAR 2024, Mass Spectrometry for Biology 75015 Paris, France.
| |
Collapse
|
3
|
Rusbjerg-Weberskov CE, Gant MS, Chamot-Rooke J, Nielsen NS, Enghild JJ. Development of a top-down MS assay for specific identification of human periostin isoforms. Front Mol Biosci 2024; 11:1399225. [PMID: 38962283 PMCID: PMC11220192 DOI: 10.3389/fmolb.2024.1399225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Accepted: 05/31/2024] [Indexed: 07/05/2024] Open
Abstract
Periostin is a matricellular protein encoded by the POSTN gene that is alternatively spliced to produce ten different periostin isoforms with molecular weights ranging from 78 to 91 kDa. It is known to promote fibrillogenesis, organize the extracellular matrix, and bind integrin-receptors to induce cell signaling. As well as being a key component of the wound healing process, it is also known to participate in the pathogenesis of different diseases including atopic dermatitis, asthma, and cancer. In both health and disease, the functions of the different periostin isoforms are largely unknown. The ability to precisely determine the isoform profile of a given human sample is fundamental for characterizing their functional significance. Identification of periostin isoforms is most often carried out at the transcriptional level using RT-PCR based approaches, but due to high sequence homogeneity, identification on the protein level has always been challenging. Top-down proteomics, where whole proteins are measured by mass spectrometry, offers a fast and reliable method for isoform identification. Here we present a fully developed top-down mass spectrometry assay for the characterization of periostin splice isoforms at the protein level.
Collapse
Affiliation(s)
| | - Megan S. Gant
- Mass Spectrometry for Biology, Institut Pasteur, Université Paris Cité, CNRS UAR 2024, Paris, France
| | - Julia Chamot-Rooke
- Mass Spectrometry for Biology, Institut Pasteur, Université Paris Cité, CNRS UAR 2024, Paris, France
| | - Nadia Sukusu Nielsen
- Department of Molecular Biology and Genetics, Aarhus University, Aarhus, Denmark
| | - Jan J. Enghild
- Department of Molecular Biology and Genetics, Aarhus University, Aarhus, Denmark
| |
Collapse
|
4
|
Roberts DS, Loo JA, Tsybin YO, Liu X, Wu S, Chamot-Rooke J, Agar JN, Paša-Tolić L, Smith LM, Ge Y. Top-down proteomics. NATURE REVIEWS. METHODS PRIMERS 2024; 4:38. [PMID: 39006170 PMCID: PMC11242913 DOI: 10.1038/s43586-024-00318-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 04/24/2024] [Indexed: 07/16/2024]
Abstract
Proteoforms, which arise from post-translational modifications, genetic polymorphisms and RNA splice variants, play a pivotal role as drivers in biology. Understanding proteoforms is essential to unravel the intricacies of biological systems and bridge the gap between genotypes and phenotypes. By analysing whole proteins without digestion, top-down proteomics (TDP) provides a holistic view of the proteome and can decipher protein function, uncover disease mechanisms and advance precision medicine. This Primer explores TDP, including the underlying principles, recent advances and an outlook on the future. The experimental section discusses instrumentation, sample preparation, intact protein separation, tandem mass spectrometry techniques and data collection. The results section looks at how to decipher raw data, visualize intact protein spectra and unravel data analysis. Additionally, proteoform identification, characterization and quantification are summarized, alongside approaches for statistical analysis. Various applications are described, including the human proteoform project and biomedical, biopharmaceutical and clinical sciences. These are complemented by discussions on measurement reproducibility, limitations and a forward-looking perspective that outlines areas where the field can advance, including potential future applications.
Collapse
Affiliation(s)
- David S Roberts
- Department of Chemistry, Stanford University, Stanford, CA, USA
- Sarafan ChEM-H, Stanford University, Stanford, CA, USA
| | - Joseph A Loo
- Department of Chemistry and Biochemistry, Department of Biological Chemistry, University of California - Los Angeles, Los Angeles, CA, USA
| | | | - Xiaowen Liu
- Deming Department of Medicine, School of Medicine, Tulane University, New Orleans, LA, USA
| | - Si Wu
- Department of Chemistry and Biochemistry, The University of Alabama, Tuscaloosa, AL, USA
| | | | - Jeffrey N Agar
- Departments of Chemistry and Chemical Biology and Pharmaceutical Sciences, Northeastern University, Boston, MA, USA
| | - Ljiljana Paša-Tolić
- Environmental and Molecular Sciences Division, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Lloyd M Smith
- Department of Chemistry, University of Wisconsin, Madison, WI, USA
| | - Ying Ge
- Department of Chemistry, University of Wisconsin, Madison, WI, USA
- Department of Cell and Regenerative Biology, Human Proteomics Program, University of Wisconsin - Madison, Madison, WI, USA
| |
Collapse
|
5
|
Alves G, Ogurtsov AY, Porterfield H, Maity T, Jenkins LM, Sacks DB, Yu YK. Multiplexing the Identification of Microorganisms via Tandem Mass Tag Labeling Augmented by Interference Removal through a Novel Modification of the Expectation Maximization Algorithm. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2024; 35:1138-1155. [PMID: 38740383 PMCID: PMC11157548 DOI: 10.1021/jasms.3c00445] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Revised: 04/12/2024] [Accepted: 04/17/2024] [Indexed: 05/16/2024]
Abstract
Having fast, accurate, and broad spectrum methods for the identification of microorganisms is of paramount importance to public health, research, and safety. Bottom-up mass spectrometer-based proteomics has emerged as an effective tool for the accurate identification of microorganisms from microbial isolates. However, one major hurdle that limits the deployment of this tool for routine clinical diagnosis, and other areas of research such as culturomics, is the instrument time required for the mass spectrometer to analyze a single sample, which can take ∼1 h per sample, when using mass spectrometers that are presently used in most institutes. To address this issue, in this study, we employed, for the first time, tandem mass tags (TMTs) in multiplex identifications of microorganisms from multiple TMT-labeled samples in one MS/MS experiment. A difficulty encountered when using TMT labeling is the presence of interference in the measured intensities of TMT reporter ions. To correct for interference, we employed in the proposed method a modified version of the expectation maximization (EM) algorithm that redistributes the signal from ion interference back to the correct TMT-labeled samples. We have evaluated the sensitivity and specificity of the proposed method using 94 MS/MS experiments (covering a broad range of protein concentration ratios across TMT-labeled channels and experimental parameters), containing a total of 1931 true positive TMT-labeled channels and 317 true negative TMT-labeled channels. The results of the evaluation show that the proposed method has an identification sensitivity of 93-97% and a specificity of 100% at the species level. Furthermore, as a proof of concept, using an in-house-generated data set composed of some of the most common urinary tract pathogens, we demonstrated that by using the proposed method the mass spectrometer time required per sample, using a 1 h LC-MS/MS run, can be reduced to 10 and 6 min when samples are labeled with TMT-6 and TMT-10, respectively. The proposed method can also be used along with Orbitrap mass spectrometers that have faster MS/MS acquisition rates, like the recently released Orbitrap Astral mass spectrometer, to further reduce the mass spectrometer time required per sample.
Collapse
Affiliation(s)
- Gelio Alves
- National
Center for Biotechnology Information, National Library of Medicine,
National Institutes of Health, Bethesda, Maryland 20894, United States
| | - Aleksey Y. Ogurtsov
- National
Center for Biotechnology Information, National Library of Medicine,
National Institutes of Health, Bethesda, Maryland 20894, United States
| | - Harry Porterfield
- Department
of Laboratory Medicine, Clinical Center, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - Tapan Maity
- Laboratory
of Cell Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - Lisa M. Jenkins
- Laboratory
of Cell Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - David B. Sacks
- Department
of Laboratory Medicine, Clinical Center, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - Yi-Kuo Yu
- National
Center for Biotechnology Information, National Library of Medicine,
National Institutes of Health, Bethesda, Maryland 20894, United States
| |
Collapse
|
6
|
Wang X, Liu Y, Yong ZH, Yu XJ, Zhou FD, Zhao MH. Immunoglobulin repertoire sequencing and de novo sequencing - Powerful tools for identifying free light chains from patients with light chain cast nephropathy. Int Immunopharmacol 2024; 135:112302. [PMID: 38772298 DOI: 10.1016/j.intimp.2024.112302] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Revised: 05/02/2024] [Accepted: 05/16/2024] [Indexed: 05/23/2024]
Abstract
In patients with light chain cast nephropathy (LCCN), abundantly produced monoclonal immunoglobulin free light chains (FLCs) play a vital role in pathogenesis. Determining the precise sequences of patient-derived FLCs is therefore highly desirable. Although immunoglobulin repertoire sequencing (5' RACE-seq) has been proven to be sensitive enough to provide full-length V(D)J region (variable, diversity and joining genes) of FLCs using bone marrow samples, an invasive and bone marrow independent method is still in demand. Here a de novo sequencing workflow based on the bottom-up proteomics for patient-derived FLCs was established. PEAKS software was used for the de novo sequencing of peptides that were further assembled into full-length FLC sequences. This de novo protein sequencing method can obtain the full-length amino acid sequences of FLCs, and had been shown to be as reliable as 5' RACE-seq. The two LCCN sequences derived from above the two methods were identical, and they possessed more hydrophobic or nonpolar amino acids compared with the corresponding germline, which may be associated with the pathogenesis.
Collapse
Affiliation(s)
- Xin Wang
- Renal Division, Department of Medicine, Peking University First Hospital, No. 8, Xishiku Street, Xicheng District, Beijing, China; Peking-Tsinghua Center for Life Sciences, Beijing, China; Institute of Nephrology, Peking University, Beijing, China; Renal Pathology Center, Institute of Nephrology, Peking University, Beijing, China; Key Laboratory of Renal Disease, Ministry of Health of China, Beijing, China; Key Laboratory of CKD Prevention and Treatment, Ministry of Education of China, Beijing, China.
| | - Yi Liu
- Key Laboratory of Combinatorial Biosynthesis and Drug Discovery, Ministry of Education, School of Pharmaceutical Sciences, Wuhan University, Wuhan, China
| | - Zi-Hao Yong
- Department of Basic Medicine, Anhui Medical College, Hefei, Anhui, China
| | - Xiao-Juan Yu
- Renal Division, Department of Medicine, Peking University First Hospital, No. 8, Xishiku Street, Xicheng District, Beijing, China; Institute of Nephrology, Peking University, Beijing, China; Renal Pathology Center, Institute of Nephrology, Peking University, Beijing, China; Key Laboratory of Renal Disease, Ministry of Health of China, Beijing, China; Key Laboratory of CKD Prevention and Treatment, Ministry of Education of China, Beijing, China; Research Units of Diagnosis and Treatment of Immune-Mediated Kidney Diseases, Chinese Academy of Medical Sciences, Beijing, China.
| | - Fu-de Zhou
- Renal Division, Department of Medicine, Peking University First Hospital, No. 8, Xishiku Street, Xicheng District, Beijing, China; Institute of Nephrology, Peking University, Beijing, China; Renal Pathology Center, Institute of Nephrology, Peking University, Beijing, China; Key Laboratory of Renal Disease, Ministry of Health of China, Beijing, China; Key Laboratory of CKD Prevention and Treatment, Ministry of Education of China, Beijing, China; Research Units of Diagnosis and Treatment of Immune-Mediated Kidney Diseases, Chinese Academy of Medical Sciences, Beijing, China
| | - Ming-Hui Zhao
- Renal Division, Department of Medicine, Peking University First Hospital, No. 8, Xishiku Street, Xicheng District, Beijing, China; Peking-Tsinghua Center for Life Sciences, Beijing, China; Institute of Nephrology, Peking University, Beijing, China; Renal Pathology Center, Institute of Nephrology, Peking University, Beijing, China; Key Laboratory of Renal Disease, Ministry of Health of China, Beijing, China; Key Laboratory of CKD Prevention and Treatment, Ministry of Education of China, Beijing, China; Research Units of Diagnosis and Treatment of Immune-Mediated Kidney Diseases, Chinese Academy of Medical Sciences, Beijing, China
| |
Collapse
|
7
|
Jeong K, Kaulich PT, Jung W, Kim J, Tholey A, Kohlbacher O. Precursor deconvolution error estimation: The missing puzzle piece in false discovery rate in top-down proteomics. Proteomics 2024; 24:e2300068. [PMID: 37997224 DOI: 10.1002/pmic.202300068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 11/09/2023] [Accepted: 11/13/2023] [Indexed: 11/25/2023]
Abstract
Top-down proteomics (TDP) directly analyzes intact proteins and thus provides more comprehensive qualitative and quantitative proteoform-level information than conventional bottom-up proteomics (BUP) that relies on digested peptides and protein inference. While significant advancements have been made in TDP in sample preparation, separation, instrumentation, and data analysis, reliable and reproducible data analysis still remains one of the major bottlenecks in TDP. A key step for robust data analysis is the establishment of an objective estimation of proteoform-level false discovery rate (FDR) in proteoform identification. The most widely used FDR estimation scheme is based on the target-decoy approach (TDA), which has primarily been established for BUP. We present evidence that the TDA-based FDR estimation may not work at the proteoform-level due to an overlooked factor, namely the erroneous deconvolution of precursor masses, which leads to incorrect FDR estimation. We argue that the conventional TDA-based FDR in proteoform identification is in fact protein-level FDR rather than proteoform-level FDR unless precursor deconvolution error rate is taken into account. To address this issue, we propose a formula to correct for proteoform-level FDR bias by combining TDA-based FDR and precursor deconvolution error rate.
Collapse
Affiliation(s)
- Kyowon Jeong
- Applied Bioinformatics, Computer Science Department, University of Tübingen, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, Tübingen, Germany
| | - Philipp T Kaulich
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
| | - Wonhyeuk Jung
- Department of Cell Biology, Yale School of Medicine, New Haven, Connecticut, USA
| | - Jihyung Kim
- Applied Bioinformatics, Computer Science Department, University of Tübingen, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, Tübingen, Germany
| | - Andreas Tholey
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
| | - Oliver Kohlbacher
- Applied Bioinformatics, Computer Science Department, University of Tübingen, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, Tübingen, Germany
- Translational Bioinformatics, University Hospital Tübingen, Tübingen, Germany
| |
Collapse
|
8
|
Kaulich PT, Cassidy L, Tholey A. Identification of proteoforms by top-down proteomics using two-dimensional low/low pH reversed-phase liquid chromatography-mass spectrometry. Proteomics 2024; 24:e2200542. [PMID: 36815320 DOI: 10.1002/pmic.202200542] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 02/09/2023] [Accepted: 02/13/2023] [Indexed: 02/24/2023]
Abstract
In top-down (TD) proteomics, efficient proteoform separation is crucial to reduce the sample complexity and increase the depth of the analysis. Here, we developed a two-dimensional low pH/low pH reversed-phase liquid chromatography separation scheme for TD proteomics. The first dimension for offline fractionation was performed using a polymeric reversed-phase (PLRP-S) column with trifluoroacetic acid as ion-pairing reagent. The second dimension, a C4 nanocolumn with formic acid as ion-pairing reagent, was coupled online with a high-field asymmetric ion mobility spectrometry (FAIMS) Orbitrap Tribrid mass spectrometer. For both dimensions several parameters were optimized, such as the adaption of the LC gradients in the second dimension according to the elution time (i.e., fraction number) in the first dimension. Avoidance of elevated temperatures and prolonged exposure to acidic conditions minimized cleavage of acid labile aspartate-proline peptide bonds. Furthermore, a concatenation strategy was developed to reduce the total measurement time. We compared our low/low pH with a previously published high pH (C4, ammonium formate)/low pH strategy and found that both separation strategies led to complementary proteoform identifications, mainly below 20 kDa, with a higher number of proteoforms identified by the low/low pH separation. With the optimized separation scheme, more than 4900 proteoforms from 1250 protein groups were identified in Caco-2 cells.
Collapse
Affiliation(s)
- Philipp T Kaulich
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
| | - Liam Cassidy
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
| | - Andreas Tholey
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
| |
Collapse
|
9
|
Tabb DL, Jeong K, Druart K, Gant MS, Brown KA, Nicora C, Zhou M, Couvillion S, Nakayasu E, Williams JE, Peterson HK, McGuire MK, McGuire MA, Metz TO, Chamot-Rooke J. Comparing Top-Down Proteoform Identification: Deconvolution, PrSM Overlap, and PTM Detection. J Proteome Res 2023. [PMID: 37235544 DOI: 10.1021/acs.jproteome.2c00673] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
Generating top-down tandem mass spectra (MS/MS) from complex mixtures of proteoforms benefits from improvements in fractionation, separation, fragmentation, and mass analysis. The algorithms to match MS/MS to sequences have undergone a parallel evolution, with both spectral alignment and match-counting approaches producing high-quality proteoform-spectrum matches (PrSMs). This study assesses state-of-the-art algorithms for top-down identification (ProSight PD, TopPIC, MSPathFinderT, and pTop) in their yield of PrSMs while controlling false discovery rate. We evaluated deconvolution engines (ThermoFisher Xtract, Bruker AutoMSn, Matrix Science Mascot Distiller, TopFD, and FLASHDeconv) in both ThermoFisher Orbitrap-class and Bruker maXis Q-TOF data (PXD033208) to produce consistent precursor charges and mass determinations. Finally, we sought post-translational modifications (PTMs) in proteoforms from bovine milk (PXD031744) and human ovarian tissue. Contemporary identification workflows produce excellent PrSM yields, although approximately half of all identified proteoforms from these four pipelines were specific to only one workflow. Deconvolution algorithms disagree on precursor masses and charges, contributing to identification variability. Detection of PTMs is inconsistent among algorithms. In bovine milk, 18% of PrSMs produced by pTop and TopMG were singly phosphorylated, but this percentage fell to 1% for one algorithm. Applying multiple search engines produces more comprehensive assessments of experiments. Top-down algorithms would benefit from greater interoperability.
Collapse
Affiliation(s)
- David L Tabb
- Université Paris Cité, Institut Pasteur, CNRS UAR 2024, Mass Spectrometry for Biology Unit, Paris 75015, France
| | - Kyowon Jeong
- Applied Bioinformatics, Computer Science Department, University of Tübingen, Tübingen 72076, Germany
| | - Karen Druart
- Université Paris Cité, Institut Pasteur, CNRS UAR 2024, Mass Spectrometry for Biology Unit, Paris 75015, France
| | - Megan S Gant
- Université Paris Cité, Institut Pasteur, CNRS UAR 2024, Mass Spectrometry for Biology Unit, Paris 75015, France
| | - Kyle A Brown
- School of Medicine and Public Health, University of Wisconsin, Madison, Wisconsin 53705, United States
| | - Carrie Nicora
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Mowei Zhou
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Sneha Couvillion
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Ernesto Nakayasu
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Janet E Williams
- Department of Animal, Veterinary, and Food Sciences, University of Idaho, Moscow, Idaho 83844, United States
| | - Haley K Peterson
- Department of Animal, Veterinary, and Food Sciences, University of Idaho, Moscow, Idaho 83844, United States
| | - Michelle K McGuire
- Margaret Ritchie School of Family and Consumer Sciences, University of Idaho, Moscow, Idaho 83844, United States
| | - Mark A McGuire
- Department of Animal, Veterinary, and Food Sciences, University of Idaho, Moscow, Idaho 83844, United States
| | - Thomas O Metz
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Julia Chamot-Rooke
- Université Paris Cité, Institut Pasteur, CNRS UAR 2024, Mass Spectrometry for Biology Unit, Paris 75015, France
| |
Collapse
|
10
|
DiagnoMass: A proteomics hub for pinpointing discriminative spectral clusters. J Proteomics 2023; 277:104853. [PMID: 36804625 DOI: 10.1016/j.jprot.2023.104853] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Accepted: 02/14/2023] [Indexed: 02/18/2023]
Abstract
MOTIVATION There are several well-established paradigms for identifying and pinpointing discriminative peptides/proteins using shotgun proteomic data; examples are peptide-spectrum matching, de novo sequencing, open searches, and even hybrid approaches. Such an arsenal of complementary paradigms can provide deep data coverage, albeit some unidentified discriminative peptides remain. RESULTS We present DiagnoMass, software tool that groups similar spectra into spectral clusters and then shortlists those clusters that are discriminative for biological conditions. DiagnoMass then communicates with proteomic tools to attempt the identification of such clusters. We demonstrate the effectiveness of DiagnoMass by analyzing proteomic data from Escherichia coli, Salmonella, and Shigella, listing many high-quality discriminative spectral clusters that had thus far remained unidentified by widely adopted proteomic tools. DiagnoMass can also classify proteomic profiles. We anticipate the use of DiagnoMass as a vital tool for pinpointing biomarkers. AVAILABILITY DiagnoMass and related documentation, including a usage protocol, are available at http://www.diagnomass.com.
Collapse
|
11
|
Nickerson JL, Baghalabadi V, Rajendran SRCK, Jakubec PJ, Said H, McMillen TS, Dang Z, Doucette AA. Recent advances in top-down proteome sample processing ahead of MS analysis. MASS SPECTROMETRY REVIEWS 2023; 42:457-495. [PMID: 34047392 DOI: 10.1002/mas.21706] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Revised: 04/21/2021] [Accepted: 05/06/2021] [Indexed: 06/12/2023]
Abstract
Top-down proteomics is emerging as a preferred approach to investigate biological systems, with objectives ranging from the detailed assessment of a single protein therapeutic, to the complete characterization of every possible protein including their modifications, which define the human proteoform. Given the controlling influence of protein modifications on their biological function, understanding how gene products manifest or respond to disease is most precisely achieved by characterization at the intact protein level. Top-down mass spectrometry (MS) analysis of proteins entails unique challenges associated with processing whole proteins while maintaining their integrity throughout the processes of extraction, enrichment, purification, and fractionation. Recent advances in each of these critical front-end preparation processes, including minimalistic workflows, have greatly expanded the capacity of MS for top-down proteome analysis. Acknowledging the many contributions in MS technology and sample processing, the present review aims to highlight the diverse strategies that have forged a pathway for top-down proteomics. We comprehensively discuss the evolution of front-end workflows that today facilitate optimal characterization of proteoform-driven biology, including a brief description of the clinical applications that have motivated these impactful contributions.
Collapse
Affiliation(s)
| | - Venus Baghalabadi
- Department of Chemistry, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Subin R C K Rajendran
- Department of Chemistry, Dalhousie University, Halifax, Nova Scotia, Canada
- Verschuren Centre for Sustainability in Energy and the Environment, Sydney, Nova Scotia, Canada
| | - Philip J Jakubec
- Department of Chemistry, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Hammam Said
- Department of Chemistry, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Teresa S McMillen
- Department of Chemistry, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Ziheng Dang
- Department of Chemistry, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Alan A Doucette
- Department of Chemistry, Dalhousie University, Halifax, Nova Scotia, Canada
| |
Collapse
|
12
|
Sternke-Hoffmann R, Pauly T, Norrild RK, Hansen J, Tucholski F, Høie MH, Marcatili P, Dupré M, Duchateau M, Rey M, Malosse C, Metzger S, Boquoi A, Platten F, Egelhaaf SU, Chamot-Rooke J, Fenk R, Nagel-Steger L, Haas R, Buell AK. Widespread amyloidogenicity potential of multiple myeloma patient-derived immunoglobulin light chains. BMC Biol 2023; 21:21. [PMID: 36737754 PMCID: PMC9898917 DOI: 10.1186/s12915-022-01506-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Accepted: 12/15/2022] [Indexed: 02/05/2023] Open
Abstract
BACKGROUND In a range of human disorders such as multiple myeloma (MM), immunoglobulin light chains (IgLCs) can be produced at very high concentrations. This can lead to pathological aggregation and deposition of IgLCs in different tissues, which in turn leads to severe and potentially fatal organ damage. However, IgLCs can also be highly soluble and non-toxic. It is generally thought that the cause for this differential solubility behaviour is solely found within the IgLC amino acid sequences, and a variety of individual sequence-related biophysical properties (e.g. thermal stability, dimerisation) have been proposed in different studies as major determinants of the aggregation in vivo. Here, we investigate biophysical properties underlying IgLC amyloidogenicity. RESULTS We introduce a novel and systematic workflow, Thermodynamic and Aggregation Fingerprinting (ThAgg-Fip), for detailed biophysical characterisation, and apply it to nine different MM patient-derived IgLCs. Our set of pathogenic IgLCs spans the entire range of values in those parameters previously proposed to define in vivo amyloidogenicity; however, none actually forms amyloid in patients. Even more surprisingly, we were able to show that all our IgLCs are able to form amyloid fibrils readily in vitro under the influence of proteolytic cleavage by co-purified cathepsins. CONCLUSIONS We show that (I) in vivo aggregation behaviour is unlikely to be mechanistically linked to any single biophysical or biochemical parameter and (II) amyloidogenic potential is widespread in IgLC sequences and is not confined to those sequences that form amyloid fibrils in patients. Our findings suggest that protein sequence, environmental conditions and presence and action of proteases all determine the ability of light chains to form amyloid fibrils in patients.
Collapse
Affiliation(s)
- Rebecca Sternke-Hoffmann
- grid.411327.20000 0001 2176 9917Institut für Physikalische Biologie, Heinrich-Heine-Universität Düsseldorf, Düsseldorf, Germany ,grid.5991.40000 0001 1090 7501Department of Biology and Chemistry, Paul Scherrer Institute, Villigen, Switzerland
| | - Thomas Pauly
- grid.411327.20000 0001 2176 9917Institut für Physikalische Biologie, Heinrich-Heine-Universität Düsseldorf, Düsseldorf, Germany ,grid.8385.60000 0001 2297 375XForschungszentrum Jülich GmbH, IBI-7, Jülich, Germany
| | - Rasmus K. Norrild
- grid.5170.30000 0001 2181 8870Department of Biotechnology and Biomedicine, Technical University of Denmark, Lyngby, Denmark
| | - Jan Hansen
- grid.411327.20000 0001 2176 9917Condensed Matter Physics Laboratory, Heinrich-Heine-Universität Düsseldorf, Düsseldorf, Germany
| | - Florian Tucholski
- grid.411327.20000 0001 2176 9917Institut für Physikalische Biologie, Heinrich-Heine-Universität Düsseldorf, Düsseldorf, Germany
| | - Magnus Haraldson Høie
- grid.5170.30000 0001 2181 8870Department of Health Technology, Technical University of Denmark, Lyngby, Denmark
| | - Paolo Marcatili
- grid.5170.30000 0001 2181 8870Department of Health Technology, Technical University of Denmark, Lyngby, Denmark
| | - Mathieu Dupré
- grid.428999.70000 0001 2353 6535Mass Spectrometry for Biology Unit, CNRS USR2000, Institut Pasteur, 75015 Paris, France
| | - Magalie Duchateau
- grid.428999.70000 0001 2353 6535Mass Spectrometry for Biology Unit, CNRS USR2000, Institut Pasteur, 75015 Paris, France
| | - Martial Rey
- grid.428999.70000 0001 2353 6535Mass Spectrometry for Biology Unit, CNRS USR2000, Institut Pasteur, 75015 Paris, France
| | - Christian Malosse
- grid.428999.70000 0001 2353 6535Mass Spectrometry for Biology Unit, CNRS USR2000, Institut Pasteur, 75015 Paris, France
| | - Sabine Metzger
- grid.6190.e0000 0000 8580 3777Cologne Biocenter, Cluster of Excellence on Plant Sciences, Mass Spectrometry Platform, University of Cologne, Cologne, Germany
| | - Amelie Boquoi
- grid.411327.20000 0001 2176 9917Department of Hematology, Oncology and Clinical Oncology, Heinrich-Heine Universität Düsseldorf, Düsseldorf, Germany
| | - Florian Platten
- grid.411327.20000 0001 2176 9917Condensed Matter Physics Laboratory, Heinrich-Heine-Universität Düsseldorf, Düsseldorf, Germany ,grid.8385.60000 0001 2297 375XForschungszentrum Jülich GmbH, IBI-4, Jülich, Germany
| | - Stefan U. Egelhaaf
- grid.411327.20000 0001 2176 9917Condensed Matter Physics Laboratory, Heinrich-Heine-Universität Düsseldorf, Düsseldorf, Germany
| | - Julia Chamot-Rooke
- grid.428999.70000 0001 2353 6535Mass Spectrometry for Biology Unit, CNRS USR2000, Institut Pasteur, 75015 Paris, France
| | - Roland Fenk
- grid.411327.20000 0001 2176 9917Department of Hematology, Oncology and Clinical Oncology, Heinrich-Heine Universität Düsseldorf, Düsseldorf, Germany
| | - Luitgard Nagel-Steger
- grid.411327.20000 0001 2176 9917Institut für Physikalische Biologie, Heinrich-Heine-Universität Düsseldorf, Düsseldorf, Germany ,grid.8385.60000 0001 2297 375XForschungszentrum Jülich GmbH, IBI-7, Jülich, Germany
| | - Rainer Haas
- Department of Hematology, Oncology and Clinical Oncology, Heinrich-Heine Universität Düsseldorf, Düsseldorf, Germany.
| | - Alexander K. Buell
- grid.411327.20000 0001 2176 9917Institut für Physikalische Biologie, Heinrich-Heine-Universität Düsseldorf, Düsseldorf, Germany ,grid.5170.30000 0001 2181 8870Department of Biotechnology and Biomedicine, Technical University of Denmark, Lyngby, Denmark
| |
Collapse
|
13
|
Liang Y, Zhang L, Zhang Y. Chromatographic separation of peptides and proteins for characterization of proteomes. Chem Commun (Camb) 2023; 59:270-281. [PMID: 36504223 DOI: 10.1039/d2cc05568f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
Abstract
Characterization of proteomes aims to comprehensively characterize proteins in cells or tissues via two main strategies: (1) bottom-up strategy based on the separation and identification of enzymatic peptides; (2) top-down strategy based on the separation and identification of intact proteins. However, it is challenged by the high complexity of proteomes. Consequently, the improvements in peptide and protein separation technologies for simplifying the sample should be critical. In this feature article, separation columns for peptide and protein separation were introduced, and peptide separation technologies for bottom-up proteomic analysis as well as protein separation technologies for top-down proteomic analysis were summarized. The achievement, recent development, limitation and future trends are discussed. Besides, the outlook on challenges and future directions of chromatographic separation in the field of proteomics was also presented.
Collapse
Affiliation(s)
- Yu Liang
- CAS Key Lab of Separation Sciences for Analytical Chemistry, National Chromatographic Research and Analysis Center, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, China.
| | - Lihua Zhang
- CAS Key Lab of Separation Sciences for Analytical Chemistry, National Chromatographic Research and Analysis Center, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, China.
| | - Yukui Zhang
- CAS Key Lab of Separation Sciences for Analytical Chemistry, National Chromatographic Research and Analysis Center, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, China.
| |
Collapse
|
14
|
Jeong K, Babović M, Gorshkov V, Kim J, Jensen ON, Kohlbacher O. FLASHIda enables intelligent data acquisition for top-down proteomics to boost proteoform identification counts. Nat Commun 2022; 13:4407. [PMID: 35906205 PMCID: PMC9338294 DOI: 10.1038/s41467-022-31922-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Accepted: 07/07/2022] [Indexed: 11/17/2022] Open
Abstract
The detailed analysis and structural characterization of proteoforms by top-down proteomics (TDP) has gained a lot of interest in biomedical research. Data-dependent acquisition (DDA) of intact proteins is non-trivial due to the diversity and complexity of proteoforms. Dedicated acquisition methods thus have the potential to greatly improve TDP. Here, we present FLASHIda, an intelligent online data acquisition algorithm for TDP that ensures the real-time selection of high-quality precursors of diverse proteoforms. FLASHIda combines fast charge deconvolution algorithms and machine learning-based quality assessment for optimal precursor selection. In an analysis of E. coli lysate, FLASHIda increases the number of unique proteoform level identifications from 800 to 1500 or generates a near-identical number of identifications in one third of the instrument time when compared to standard DDA mode. Furthermore, FLASHIda enables sensitive mapping of post-translational modifications and detection of chemical adducts. As a software extension module to the instrument, FLASHIda can be readily adopted for TDP studies of complex samples to enhance proteoform identification rates.
Collapse
Affiliation(s)
- Kyowon Jeong
- Applied Bioinformatics, Computer Science Department, University of Tübingen, Sand 14, 72076, Tübingen, Germany.
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, Sand 14, 72076, Tübingen, Germany.
| | - Maša Babović
- Department of Biochemistry & Molecular Biology and VILLUM Center for Bioanalytical Sciences, University of Southern Denmark, Campusvej 55, DK-5230 Odense M, Odense, Denmark
| | - Vladimir Gorshkov
- Department of Biochemistry & Molecular Biology and VILLUM Center for Bioanalytical Sciences, University of Southern Denmark, Campusvej 55, DK-5230 Odense M, Odense, Denmark
| | - Jihyung Kim
- Applied Bioinformatics, Computer Science Department, University of Tübingen, Sand 14, 72076, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, Sand 14, 72076, Tübingen, Germany
| | - Ole N Jensen
- Department of Biochemistry & Molecular Biology and VILLUM Center for Bioanalytical Sciences, University of Southern Denmark, Campusvej 55, DK-5230 Odense M, Odense, Denmark
| | - Oliver Kohlbacher
- Applied Bioinformatics, Computer Science Department, University of Tübingen, Sand 14, 72076, Tübingen, Germany.
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, Sand 14, 72076, Tübingen, Germany.
- Translational Bioinformatics, University Hospital Tübingen, Hoppe-Seyler-Str. 9, 72076, Tübingen, Germany.
| |
Collapse
|
15
|
Yang M, Hu H, Su P, Thomas PM, Camarillo JM, Greer JB, Early BP, Fellers RT, Kelleher NL, Laskin J. Proteoform-Selective Imaging of Tissues Using Mass Spectrometry. Angew Chem Int Ed Engl 2022; 61:e202200721. [PMID: 35446460 PMCID: PMC9276647 DOI: 10.1002/anie.202200721] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Indexed: 01/28/2023]
Abstract
Unraveling the complexity of biological systems relies on the development of new approaches for spatially resolved proteoform‐specific analysis of the proteome. Herein, we employ nanospray desorption electrospray ionization mass spectrometry imaging (nano‐DESI MSI) for the proteoform‐selective imaging of biological tissues. Nano‐DESI generates multiply charged protein ions, which is advantageous for their structural characterization using tandem mass spectrometry (MS/MS) directly on the tissue. Proof‐of‐concept experiments demonstrate that nano‐DESI MSI combined with on‐tissue top‐down proteomics is ideally suited for the proteoform‐selective imaging of tissue sections. Using rat brain tissue as a model system, we provide the first evidence of differential proteoform expression in different regions of the brain.
Collapse
Affiliation(s)
- Manxi Yang
- Department of ChemistryPurdue University560 Oval DriveWest LafayetteIN 47907USA
| | - Hang Hu
- Department of ChemistryPurdue University560 Oval DriveWest LafayetteIN 47907USA
| | - Pei Su
- Department of ChemistryPurdue University560 Oval DriveWest LafayetteIN 47907USA
- Departments of Chemistry and Molecular BiosciencesNorthwestern University2145 Sheridan RoadEvanstonIL 60208USA
| | - Paul M. Thomas
- Departments of Chemistry and Molecular BiosciencesNorthwestern University2145 Sheridan RoadEvanstonIL 60208USA
| | - Jeannie M. Camarillo
- Departments of Chemistry and Molecular BiosciencesNorthwestern University2145 Sheridan RoadEvanstonIL 60208USA
| | - Joseph B. Greer
- Departments of Chemistry and Molecular BiosciencesNorthwestern University2145 Sheridan RoadEvanstonIL 60208USA
| | - Bryan P. Early
- Departments of Chemistry and Molecular BiosciencesNorthwestern University2145 Sheridan RoadEvanstonIL 60208USA
| | - Ryan T. Fellers
- Departments of Chemistry and Molecular BiosciencesNorthwestern University2145 Sheridan RoadEvanstonIL 60208USA
| | - Neil L. Kelleher
- Departments of Chemistry and Molecular BiosciencesNorthwestern University2145 Sheridan RoadEvanstonIL 60208USA
| | - Julia Laskin
- Department of ChemistryPurdue University560 Oval DriveWest LafayetteIN 47907USA
| |
Collapse
|
16
|
Yang M, Hu H, Su P, Thomas PM, Camarillo JM, Greer JB, Early BP, Fellers RT, Kelleher NL, Laskin J. Proteoform‐Selective Imaging of Tissues Using Mass Spectrometry. Angew Chem Int Ed Engl 2022. [DOI: 10.1002/ange.202200721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Affiliation(s)
- Manxi Yang
- Purdue University Department of Chemistry chemistry 560 Oval Dr. 47906 West Lafayette UNITED STATES
| | - Hang Hu
- Purdue University Chemistry UNITED STATES
| | - Pei Su
- Northwestern University Chemistry and Molecular Biosciences UNITED STATES
| | - Paul M. Thomas
- Northwestern University Chemistry and Molecular Biosciences UNITED STATES
| | | | - Joseph B. Greer
- Northwestern University Chemistry and Molecular Biosciences UNITED STATES
| | - Bryan P. Early
- Northwestern University Chemistry and Molecular Biosciences UNITED STATES
| | - Ryan T. Fellers
- Northwestern University Chemistry and Molecular Biosciences UNITED STATES
| | - Neil L. Kelleher
- Northwestern University Chemistry and Molecular Biosciences UNITED STATES
| | - Julia Laskin
- Purdue University Department of Chemistry 560 Oval Dr. 47907 West Lafayette UNITED STATES
| |
Collapse
|
17
|
Wang C, Liang Y, Zhao B, Liang Z, Zhang L, Zhang Y. Ethane-Bridged Hybrid Monolithic Column with Large Mesopores for Boosting Top-Down Proteomic Analysis. Anal Chem 2022; 94:6172-6179. [PMID: 35412811 DOI: 10.1021/acs.analchem.1c05234] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Top-down proteomics is challenged by the high complexity of biological samples. The coelution of intact proteins results in overlapped mass spectra, and hence, an increased peak capacity for protein separation is needed. Herein, ethane-bridged hybrid monoliths with well-defined large mesopores were successfully prepared based on the sol-gel condensation of 1,2-bis(trimethoxysilyl)ethane and tetramethoxysilane, followed by two-step base etching of the Si-O-Si domain while maintaining the Si-C-C-Si domain in the structure. Relatively homogeneous macropores of 1.1 μm and large mesopores of 24 nm were obtained, permitting fast mass transfer of large molecules and efficient diffusion without obstruction. The use of less hydrophobic C1 ligand further sharpened the peak shape and improved peak capacity. A 120 cm-long capillary column was used for top-down proteomic analysis of E. coli lysates under low backpressure with 16 MPa. High peak capacity of 646 was achieved within 240 min gradient. With MS/MS analysis, 959 proteoforms corresponding to 263 proteins could be unambiguously identified from E. coli lysates in a single run. Furthermore, to illustrate the separation performance for large proteoforms, such monoliths were applied to top-down analysis of the SEC fraction of E. coli lysates with Mw ranging from 30 to 70 kDa. With highly effective separation, 347 large proteoforms with Mw higher than 30 kDa were detected in the single 75 min run. These results showed great potential for top-down proteomic analysis in complex samples.
Collapse
Affiliation(s)
- Chao Wang
- CAS Key Lab of Separation Sciences for Analytical Chemistry, National Chromatographic Research and Analysis Center, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yu Liang
- CAS Key Lab of Separation Sciences for Analytical Chemistry, National Chromatographic Research and Analysis Center, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
| | - Baofeng Zhao
- CAS Key Lab of Separation Sciences for Analytical Chemistry, National Chromatographic Research and Analysis Center, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
| | - Zhen Liang
- CAS Key Lab of Separation Sciences for Analytical Chemistry, National Chromatographic Research and Analysis Center, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
| | - Lihua Zhang
- CAS Key Lab of Separation Sciences for Analytical Chemistry, National Chromatographic Research and Analysis Center, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
| | - Yukui Zhang
- CAS Key Lab of Separation Sciences for Analytical Chemistry, National Chromatographic Research and Analysis Center, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
| |
Collapse
|
18
|
Fagerquist CK, Dodd CE. Top-down proteomic identification of plasmid and host proteins produced by pathogenic Escherichia coli using MALDI-TOF-TOF tandem mass spectrometry. PLoS One 2021; 16:e0260650. [PMID: 34843608 PMCID: PMC8629258 DOI: 10.1371/journal.pone.0260650] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Accepted: 11/12/2021] [Indexed: 12/21/2022] Open
Abstract
Fourteen proteins produced by three pathogenic Escherichia coli strains were identified using antibiotic induction, MALDI-TOF-TOF tandem mass spectrometry (MS/MS) and top-down proteomic analysis using software developed in-house. Host proteins as well as plasmid proteins were identified. Mature, intact protein ions were fragmented by post-source decay (PSD), and prominent fragment ions resulted from the aspartic acid effect fragmentation mechanism wherein polypeptide backbone cleavage (PBC) occurs on the C-terminal side of aspartic acid (D), glutamic acid (E) and asparagine (N) residues. These highly specific MS/MS-PSD fragment ions were compared to b- and y-type fragment ions on the C-terminal side of D-, E- and N-residues of in silico protein sequences derived from whole genome sequencing. Nine proteins were found to be post-translationally modified with either removal of an N-terminal methionine or a signal peptide. The protein sequence truncation algorithm of our software correctly identified all full and truncated protein sequences. Truncated sequences were compared to those predicted by SignalP. Nearly complete concurrence was obtained except for one protein where SignalP mis-identified the cleavage site by one residue. Two proteins had intramolecular disulfide bonds that were inferred by the absence of PBC on the C-terminal side of a D-residue located within the disulfide loop. These results demonstrate the utility of MALDI-TOF-TOF for identification of full and truncated bacterial proteins.
Collapse
Affiliation(s)
- Clifton K. Fagerquist
- Produce Safety & Microbiology, Western Regional Research Center, Agricultural Research Service, U.S. Department of Agriculture, Albany, California, United States of America
| | - Claire E. Dodd
- Produce Safety & Microbiology, Western Regional Research Center, Agricultural Research Service, U.S. Department of Agriculture, Albany, California, United States of America
| |
Collapse
|
19
|
Kaulich PT, Winkels K, Kaulich TB, Treitz C, Cassidy L, Tholey A. MSTopDiff: A Tool for the Visualization of Mass Shifts in Deconvoluted Top-Down Proteomics Data for the Database-Independent Detection of Protein Modifications. J Proteome Res 2021; 21:20-29. [PMID: 34818005 DOI: 10.1021/acs.jproteome.1c00766] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Top-down proteomics analyzes intact proteoforms with all of their post-translational modifications and genetic and RNA splice variants. In addition, modifications introduced either deliberately or inadvertently during sample preparation, that is, via oxidation, alkylation, or labeling reagents, or through the formation of noncovalent adducts (e.g., detergents) further increase the sample complexity. To facilitate the recognition of protein modifications introduced during top-down analysis, we developed MSTopDiff, a software tool with a graphical user interface written in Python, which allows one to detect protein modifications by calculating and visualizing mass differences in top-down data without the prerequisite of a database search. We demonstrate the successful application of MSTopDiff for the detection of artifacts originating from oxidation, formylation, overlabeling during isobaric labeling, and adduct formation with cations or sodium dodecyl sulfate. MSTopDiff offers several modes of data representation using deconvoluted MS1 or MS2 spectra. In addition to artificial modifications, the tool enables the visualization of biological modifications such as phosphorylation and acetylation. MSTopDiff provides an overview of the artificial and biological modifications in top-down proteomics samples, which makes it a valuable tool in quality control of standard workflows and for parameter evaluation during method development.
Collapse
Affiliation(s)
- Philipp T Kaulich
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, 24105 Kiel, Germany
| | - Konrad Winkels
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, 24105 Kiel, Germany
| | - Tobias B Kaulich
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, 24105 Kiel, Germany
| | - Christian Treitz
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, 24105 Kiel, Germany
| | - Liam Cassidy
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, 24105 Kiel, Germany
| | - Andreas Tholey
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, 24105 Kiel, Germany
| |
Collapse
|
20
|
Neil JR, Verma A, Kronewitter SR, McGee WM, Mullen C, Viirtola M, Kotovuori A, Friedrich H, Finell J, Rannisto J, Syka JEP, Stephenson JL. Rapid MRSA detection via tandem mass spectrometry of the intact 80 kDa PBP2a resistance protein. Sci Rep 2021; 11:18309. [PMID: 34526615 PMCID: PMC8443585 DOI: 10.1038/s41598-021-97844-w] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2021] [Accepted: 08/30/2021] [Indexed: 02/08/2023] Open
Abstract
Treatment of antibiotic-resistant infections is dependent on the detection of specific bacterial genes or proteins in clinical assays. Identification of methicillin-resistant Staphylococcus aureus (MRSA) is often accomplished through the detection of penicillin-binding protein 2a (PBP2a). With greater dependence on mass spectrometry (MS)-based bacterial identification, complementary efforts to detect resistance have been hindered by the complexity of those proteins responsible. Initial characterization of PBP2a indicates the presence of glycan modifications. To simplify detection, we demonstrate a proof-of-concept tandem MS approach involving the generation of N-terminal PBP2a peptide-like fragments and detection of unique product ions during top-down proteomic sample analyses. This approach was implemented for two PBP2a variants, PBP2amecA and PBP2amecC, and was accurate across a representative panel of MRSA strains with different genetic backgrounds. Additionally, PBP2amecA was successfully detected from clinical isolates using a five-minute liquid chromatographic separation and implementation of this MS detection strategy. Our results highlight the capability of direct MS-based resistance marker detection and potential advantages for implementing these approaches in clinical diagnostics.
Collapse
|
21
|
Dupré M, Duchateau M, Sternke-Hoffmann R, Boquoi A, Malosse C, Fenk R, Haas R, Buell AK, Rey M, Chamot-Rooke J. De Novo Sequencing of Antibody Light Chain Proteoforms from Patients with Multiple Myeloma. Anal Chem 2021; 93:10627-10634. [PMID: 34292722 DOI: 10.1021/acs.analchem.1c01955] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
In multiple myeloma diseases, monoclonal immunoglobulin light chains (LCs) are abundantly produced, with, as a consequence in some cases, the formation of deposits affecting various organs, such as the kidney, while in other cases remaining soluble up to concentrations of several g·L-1 in plasma. The exact factors crucial for the solubility of LCs are poorly understood, but it can be hypothesized that their amino acid sequence plays an important role. Determining the precise sequences of patient-derived LCs is therefore highly desirable. We establish here a novel de novo sequencing workflow for patient-derived LCs, based on the combination of bottom-up and top-down proteomics without database search. PEAKS is used for the de novo sequencing of peptides that are further assembled into full length LC sequences using ALPS. Top-down proteomics provides the molecular masses of proteoforms and allows the exact determination of the amino acid sequence including all posttranslational modifications. This pipeline is then used for the complete de novo sequencing of LCs extracted from the urine of 10 patients with multiple myeloma. We show that for the bottom-up part, digestions with trypsin and Nepenthes digestive fluid are sufficient to produce overlapping peptides able to generate the best sequence candidates. Top-down proteomics is absolutely required to achieve 100% final sequence coverage and characterize clinical samples containing several LCs. Our work highlights an unexpected range of modifications.
Collapse
Affiliation(s)
- Mathieu Dupré
- Mass Spectrometry for Biology Unit, CNRS USR2000, Institut Pasteur, CNRS, 28 rue du Dr Roux, Paris 75015, France
| | - Magalie Duchateau
- Mass Spectrometry for Biology Unit, CNRS USR2000, Institut Pasteur, CNRS, 28 rue du Dr Roux, Paris 75015, France
| | - Rebecca Sternke-Hoffmann
- Institut für Physikalische Biologie, Heinrich-Heine-Universität Düsseldorf, Universitätsstr. 1, Düsseldorf 40225, Germany
| | - Amelie Boquoi
- Department of Hematology, Oncology and Clinical Oncology, Heinrich-Heine Universität Düsseldorf, Düsseldorf, Germany, Moorenstr. 5, Düsseldorf 40225, Germany
| | - Christian Malosse
- Mass Spectrometry for Biology Unit, CNRS USR2000, Institut Pasteur, CNRS, 28 rue du Dr Roux, Paris 75015, France
| | - Roland Fenk
- Department of Hematology, Oncology and Clinical Oncology, Heinrich-Heine Universität Düsseldorf, Düsseldorf, Germany, Moorenstr. 5, Düsseldorf 40225, Germany
| | - Rainer Haas
- Department of Hematology, Oncology and Clinical Oncology, Heinrich-Heine Universität Düsseldorf, Düsseldorf, Germany, Moorenstr. 5, Düsseldorf 40225, Germany
| | - Alexander K Buell
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Søltofts Plads, Kgs. Lyngby 2800, Denmark
| | - Martial Rey
- Mass Spectrometry for Biology Unit, CNRS USR2000, Institut Pasteur, CNRS, 28 rue du Dr Roux, Paris 75015, France
| | - Julia Chamot-Rooke
- Mass Spectrometry for Biology Unit, CNRS USR2000, Institut Pasteur, CNRS, 28 rue du Dr Roux, Paris 75015, France
| |
Collapse
|
22
|
Havlikova J, May RC, Styles IB, Cooper HJ. Liquid Extraction Surface Analysis Mass Spectrometry of ESKAPE Pathogens. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2021; 32:1345-1351. [PMID: 33647207 PMCID: PMC8176453 DOI: 10.1021/jasms.0c00466] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
The ESKAPE pathogens (Enterococcus faecium, Staphylococcus aureus, Klebsiella pneumoniae, Acinetobacter baumannii, Pseudomonas aeruginosa, and Enterobacter cloacae) represent clinically important bacterial species that are responsible for most hospital-acquired drug-resistant infections; hence, the need for rapid identification is of high importance. Previous work has demonstrated the suitability of liquid extraction surface analysis mass spectrometry (LESA MS) for the direct analysis of colonies of two of the ESKAPE pathogens (Staphylococcus aureus and Pseudomonas aeruginosa) growing on agar. Here, we apply LESA MS to the remaining four ESKAPE species (E. faecium E745, K. pneumoniae KP257, A. baumannii AYE, and E. cloacae S11) as well as E. faecalis V583 (a close relative of E. faecium) and a clinical isolate of A. baumannii AC02 using an optimized solvent sampling system. In each case, top-down LESA MS/MS was employed for protein identification. In total, 24 proteins were identified from 37 MS/MS spectra by searching against protein databases for the individual species. The MS/MS spectra for the identified proteins were subsequently searched against multiple databases from multiple species in an automated data analysis workflow with a view to determining the accuracy of identification of unknowns. Out of 24 proteins, 19 were correctly assigned at the protein and species level, corresponding to an identification success rate of 79%.
Collapse
Affiliation(s)
- Jana Havlikova
- EPSRC
Centre for Doctoral Training in Physical Sciences for Health, University of Birmingham, Edgbaston, Birmingham B15 2TT, United Kingdom
- School
of Biosciences, University of Birmingham, Edgbaston, Birmingham B15 2TT, United Kingdom
| | - Robin C. May
- Institute
of Microbiology and Infection, University
of Birmingham, Edgbaston, Birmingham B15 2TT, United
Kingdom
| | - Iain B. Styles
- EPSRC
Centre for Doctoral Training in Physical Sciences for Health, University of Birmingham, Edgbaston, Birmingham B15 2TT, United Kingdom
- School
of Computer Science, University of Birmingham, Edgbaston, Birmingham B15 2TT, United Kingdom
| | - Helen J. Cooper
- School
of Biosciences, University of Birmingham, Edgbaston, Birmingham B15 2TT, United Kingdom
- Phone: +44 (0)121 414 7527; . (H.J.C.)
| |
Collapse
|
23
|
Borges Lima D, Dupré M, Mariano Santos MD, Carvalho PC, Chamot-Rooke J. DiagnoTop: A Computational Pipeline for Discriminating Bacterial Pathogens without Database Search. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2021; 32:1295-1299. [PMID: 33856212 DOI: 10.1021/jasms.1c00014] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Pathogen identification is crucial to confirm bacterial infections and guide antimicrobial therapy. Although MALDI-TOF mass spectrometry (MS) serves as foundation for tools that enable rapid microbial identification, some bacteria remain challenging to identify. We recently showed that top-down proteomics (TDP) could be used to discriminate closely related enterobacterial pathogens (Escherichia coli, Shigella, and Salmonella) that are indistinguishable with tools rooted in the MALDI-TOF MS approach. Current TDP diagnostic relies on the identification of specific proteoforms for each species through a database search. However, microbial proteomes are often poorly annotated, which complicates the large-scale identification of proteoforms and leads to many unidentified high-quality mass spectra. Here, we describe a new computational pipeline called DiagnoTop that lists discriminative spectral clusters found in TDP data sets that can be used for microbial diagnostics without database search. Applied to our enterobacterial TDP data sets, DiagnoTop could easily shortlist high-quality discriminative spectral clusters, leading to increased diagnostic power. This pipeline opens new perspectives in clinical microbiology and biomarker discovery using TDP.
Collapse
Affiliation(s)
- Diogo Borges Lima
- Mass Spectrometry for Biology Unit, CNRS USR2000, Institut Pasteur, Paris 75015, France
| | - Mathieu Dupré
- Mass Spectrometry for Biology Unit, CNRS USR2000, Institut Pasteur, Paris 75015, France
| | - Marlon Dias Mariano Santos
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute Fiocruz, Paraná, Curitiba CIC 81350-010, Brazil
| | - Paulo Costa Carvalho
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute Fiocruz, Paraná, Curitiba CIC 81350-010, Brazil
| | - Julia Chamot-Rooke
- Mass Spectrometry for Biology Unit, CNRS USR2000, Institut Pasteur, Paris 75015, France
| |
Collapse
|
24
|
Rey M, Dhenin J, Kong Y, Nouchikian L, Filella I, Duchateau M, Dupré M, Pellarin R, Duménil G, Chamot-Rooke J. Advanced In Vivo Cross-Linking Mass Spectrometry Platform to Characterize Proteome-Wide Protein Interactions. Anal Chem 2021; 93:4166-4174. [DOI: 10.1021/acs.analchem.0c04430] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Affiliation(s)
- Martial Rey
- Mass Spectrometry for Biology Unit, Department of Structural Biology and Chemistry, Institut Pasteur, CNRS USR 2000, 28 rue du Docteur Roux, 75015 Paris, France
| | - Jonathan Dhenin
- Mass Spectrometry for Biology Unit, Department of Structural Biology and Chemistry, Institut Pasteur, CNRS USR 2000, 28 rue du Docteur Roux, 75015 Paris, France
| | - Youxin Kong
- Pathogenesis of Vascular Infections, Department of Cell Biology and Infection, Institut Pasteur, INSERM U1225, 28 rue du Docteur Roux, 75015 Paris France
| | - Lucienne Nouchikian
- Mass Spectrometry for Biology Unit, Department of Structural Biology and Chemistry, Institut Pasteur, CNRS USR 2000, 28 rue du Docteur Roux, 75015 Paris, France
| | - Isaac Filella
- Structural Bioinformatics Unit, Department of Structural Biology and Chemistry, Institut Pasteur, CNRS UMR 3528, 28 rue du Docteur Roux, 75015 Paris, France
| | - Magalie Duchateau
- Mass Spectrometry for Biology Unit, Department of Structural Biology and Chemistry, Institut Pasteur, CNRS USR 2000, 28 rue du Docteur Roux, 75015 Paris, France
| | - Mathieu Dupré
- Mass Spectrometry for Biology Unit, Department of Structural Biology and Chemistry, Institut Pasteur, CNRS USR 2000, 28 rue du Docteur Roux, 75015 Paris, France
| | - Riccardo Pellarin
- Structural Bioinformatics Unit, Department of Structural Biology and Chemistry, Institut Pasteur, CNRS UMR 3528, 28 rue du Docteur Roux, 75015 Paris, France
| | - Guillaume Duménil
- Pathogenesis of Vascular Infections, Department of Cell Biology and Infection, Institut Pasteur, INSERM U1225, 28 rue du Docteur Roux, 75015 Paris France
| | - Julia Chamot-Rooke
- Mass Spectrometry for Biology Unit, Department of Structural Biology and Chemistry, Institut Pasteur, CNRS USR 2000, 28 rue du Docteur Roux, 75015 Paris, France
| |
Collapse
|