1
|
Kumar P, Rani S, Dahiya P, Kumar A, Dang AS, Suneja P. Whole genome analysis for plant growth promotion profiling of Pantoea agglomerans CPHN2, a non-rhizobial nodule endophyte. Front Microbiol 2022; 13:998821. [DOI: 10.3389/fmicb.2022.998821] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Accepted: 09/13/2022] [Indexed: 11/09/2022] Open
Abstract
Reduced agricultural production as well as issues like nutrient-depleted soils, eutrophication, and groundwater contamination have drawn attention to the use of endophyte-based bioformulations to restore soil fertility. Pantoea agglomerans CPHN2, a non-rhizobial nodule endophyte isolated from Cicer arietinum, exhibited a variety of plant growth-promoting traits. In this study, we used NextSeq500 technology to analyze whole-genome sequence information of this plant growth-promoting endophytic bacteria. The genome of P. agglomerans CPHN2 has a length of 4,839,532 bp and a G + C content of 55.2%. The whole genome comprises three different genomic fractions, comprising one circular chromosome and two circular plasmids. A comparative analysis between P. agglomerans CPHN2 and 10 genetically similar strains was performed using a bacterial pan-genome pipeline. All the predicted and annotated gene sequences for plant growth promotions (PGPs), such as phosphate solubilization, siderophore synthesis, nitrogen metabolism, and indole-3-acetic acid (IAA) of P. agglomerans CPHN2, were identified. The whole-genome analysis of P. agglomerans CPHN2 provides an insight into the mechanisms underlying PGP by endophytes and its potential applications as a biofertilizer.
Collapse
|
2
|
A comprehensive LFQ benchmark dataset on modern day acquisition strategies in proteomics. Sci Data 2022; 9:126. [PMID: 35354825 PMCID: PMC8967878 DOI: 10.1038/s41597-022-01216-6] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Accepted: 02/23/2022] [Indexed: 12/23/2022] Open
Abstract
In the last decade, a revolution in liquid chromatography-mass spectrometry (LC-MS) based proteomics was unfolded with the introduction of dozens of novel instruments that incorporate additional data dimensions through innovative acquisition methodologies, in turn inspiring specialized data analysis pipelines. Simultaneously, a growing number of proteomics datasets have been made publicly available through data repositories such as ProteomeXchange, Zenodo and Skyline Panorama. However, developing algorithms to mine this data and assessing the performance on different platforms is currently hampered by the lack of a single benchmark experimental design. Therefore, we acquired a hybrid proteome mixture on different instrument platforms and in all currently available families of data acquisition. Here, we present a comprehensive Data-Dependent and Data-Independent Acquisition (DDA/DIA) dataset acquired using several of the most commonly used current day instrumental platforms. The dataset consists of over 700 LC-MS runs, including adequate replicates allowing robust statistics and covering over nearly 10 different data formats, including scanning quadrupole and ion mobility enabled acquisitions. Datasets are available via ProteomeXchange (PXD028735). Measurement(s) | Digital Data Repository | Technology Type(s) | Digital Data Repository |
Collapse
|
3
|
Halder A, Verma A, Biswas D, Srivastava S. Recent advances in mass-spectrometry based proteomics software, tools and databases. DRUG DISCOVERY TODAY. TECHNOLOGIES 2021; 39:69-79. [PMID: 34906327 DOI: 10.1016/j.ddtec.2021.06.007] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Revised: 05/08/2021] [Accepted: 06/21/2021] [Indexed: 01/12/2023]
Abstract
The field of proteomics immensely depends on data generation and data analysis which are thoroughly supported by software and databases. There has been a massive advancement in mass spectrometry-based proteomics over the last 10 years which has compelled the scientific community to upgrade or develop algorithms, tools, and repository databases in the field of proteomics. Several standalone software, and comprehensive databases have aided the establishment of integrated omics pipeline and meta-analysis workflow which has contributed to understand the disease pathobiology, biomarker discovery and predicting new therapeutic modalities. For shotgun proteomics where Data Dependent Acquisition is performed, several user-friendly software are developed that can analyse the pre-processed data to provide mechanistic insights of the disease. Likewise, in Data Independent Acquisition, pipelines are emerged which can accomplish the task from building the spectral library to identify the therapeutic targets. Furthermore, in the age of big data analysis the implications of machine learning and cloud computing are appending robustness, rapidness and in-depth proteomics data analysis. The current review talks about the recent advancement, and development of software, tools, and database in the field of mass-spectrometry based proteomics.
Collapse
Affiliation(s)
- Ankit Halder
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai 400076, India
| | - Ayushi Verma
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai 400076, India
| | - Deeptarup Biswas
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai 400076, India
| | - Sanjeeva Srivastava
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai 400076, India.
| |
Collapse
|
4
|
Dudeja SS, Suneja-Madan P, Paul M, Maheswari R, Kothe E. Bacterial endophytes: Molecular interactions with their hosts. J Basic Microbiol 2021; 61:475-505. [PMID: 33834549 DOI: 10.1002/jobm.202000657] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2020] [Revised: 03/07/2021] [Accepted: 03/16/2021] [Indexed: 01/19/2023]
Abstract
Plant growth promotion has been found associated with plants on the surface (epiphytic), inside (endophytic), or close to the plant roots (rhizospheric). Endophytic bacteria mainly have been researched for their beneficial activities in terms of nutrient availability, plant growth hormones, and control of soil-borne and systemic pathogens. Molecular communications leading to these interactions between plants and endophytic bacteria are now being unrevealed using multidisciplinary approaches with advanced techniques such as metagenomics, metaproteomics, metatranscriptomics, metaproteogenomic, microRNAs, microarray, chips as well as the comparison of complete genome sequences. More than 400 genes in both the genomes of host plant and bacterial endophyte are up- or downregulated for the establishment of endophytism and plant growth-promoting activity. The involvement of more than 20 genes for endophytism, about 50 genes for direct plant growth promotion, about 25 genes for biocontrol activity, and about 10 genes for mitigation of different stresses has been identified in various bacterial endophytes. This review summarizes the progress that has been made in recent years by these modern techniques and approaches.
Collapse
Affiliation(s)
- Surjit S Dudeja
- Department of Bio & Nanotechnology, Guru Jambeshwar University of Science & Technology, Hisar, India
| | - Pooja Suneja-Madan
- Department of Microbiology, Maharishi Dayanand University, Rohtak, India
| | - Minakshi Paul
- Department of Bio & Nanotechnology, Guru Jambeshwar University of Science & Technology, Hisar, India
| | - Rajat Maheswari
- Department of Microbiology, Maharishi Dayanand University, Rohtak, India
| | - Erika Kothe
- Microbial Communication, Institute of Microbiology, Faculty for Biosciences, Friedrich Schiller University of Jena, Jena, Germany
| |
Collapse
|
5
|
The Power of Three in Cannabis Shotgun Proteomics: Proteases, Databases and Search Engines. Proteomes 2020; 8:proteomes8020013. [PMID: 32549361 PMCID: PMC7356525 DOI: 10.3390/proteomes8020013] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2020] [Revised: 06/12/2020] [Accepted: 06/12/2020] [Indexed: 11/29/2022] Open
Abstract
Cannabis research has taken off since the relaxation of legislation, yet proteomics is still lagging. In 2019, we published three proteomics methods aimed at optimizing protein extraction, protein digestion for bottom-up and middle-down proteomics, as well as the analysis of intact proteins for top-down proteomics. The database of Cannabis sativa proteins used in these studies was retrieved from UniProt, the reference repositories for proteins, which is incomplete and therefore underrepresents the genetic diversity of this non-model species. In this fourth study, we remedy this shortcoming by searching larger databases from various sources. We also compare two search engines, the oldest, SEQUEST, and the most popular, Mascot. This shotgun proteomics experiment also utilizes the power of parallel digestions with orthogonal proteases of increasing selectivity, namely chymotrypsin, trypsin/Lys-C and Asp-N. Our results show that the larger the database the greater the list of accessions identified but the longer the duration of the search. Using orthogonal proteases and different search algorithms increases the total number of proteins identified, most of them common despite differing proteases and algorithms, but many of them unique as well.
Collapse
|
6
|
Kiseleva O, Zgoda V, Naryzhny S, Poverennaya E. Empowering Shotgun Mass Spectrometry with 2DE: A HepG2 Study. Int J Mol Sci 2020; 21:E3813. [PMID: 32471280 PMCID: PMC7312985 DOI: 10.3390/ijms21113813] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Revised: 05/21/2020] [Accepted: 05/26/2020] [Indexed: 01/07/2023] Open
Abstract
One of the major goals of the Chromosome-Centric Human Proteome Project (C-HPP) is to catalog and annotate a myriad of heterogeneous proteoforms, produced by ca. 20 thousand genes. To achieve a detailed and personalized understanding into proteomes, we suggest using a customized RNA-seq library of potential proteoforms, which includes aberrant variants specific to certain biological samples. Two-dimensional electrophoresis coupled with high-performance liquid chromatography allowed us to downgrade the difficulty of biological mixing following shotgun mass spectrometry. To benchmark the proposed pipeline, we examined heterogeneity of the HepG2 hepatoblastoma cell line proteome. Data are available via ProteomeXchange with identifier PXD018450.
Collapse
Affiliation(s)
- Olga Kiseleva
- Institute of Biomedical Chemistry, Moscow 119121, Russia; (V.Z.); (S.N.); (E.P.)
| | - Victor Zgoda
- Institute of Biomedical Chemistry, Moscow 119121, Russia; (V.Z.); (S.N.); (E.P.)
| | - Stanislav Naryzhny
- Institute of Biomedical Chemistry, Moscow 119121, Russia; (V.Z.); (S.N.); (E.P.)
- Petersburg Nuclear Physics Institute named by B.P. Konstantinov of NRC “Kurchatov Institute”, Gatchina 188300, Russia
| | | |
Collapse
|
7
|
Kuznetsova KG, Solovyeva EM, Kuzikov AV, Gorshkov MV, Moshkovskii SA. [Modification of cysteine residues for mass spectrometry-based proteomic analysis: facts and artifacts]. BIOMEDIT︠S︡INSKAI︠A︡ KHIMII︠A︡ 2020; 66:18-29. [PMID: 32116223 DOI: 10.18097/pbmc20206601018] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Mass spectrometric proteomic analysis at the sample preparation stage involves the artificial reduction of disulfide bonds in proteins formed between cysteine residues. Such bonds, when preserved in their native state, complicate subsequent enzymatic hydrolysis and interpretation of the research results. To prevent the re-formation of the disulfide bonds, cysteine residues are protected by special groups, most often by alkylation. In this review, we consider the methods used to modify cysteine residues during sample preparation, as well as possible artifacts of this stage. Particularly, adverse reactions of the alkylating agents with other amino acid residues are described. The most common alkylating compound used to protect cysteine residues in mass spectrometric proteomic analysis is iodoacetamide. However, an analysis of the literature in this area indicates that this reagent causes more adverse reactions than other agents used, such as chloroacetamide and acrylamide. The latter can be recommended for wider use. In the review we also discuss the features of the cysteine residue modifications and their influence on the efficiency of the search for post-translational modifications and protein products of single nucleotide substitutions.
Collapse
Affiliation(s)
| | - E M Solovyeva
- Talrose Institute for Energy Problems of Chemical Physics, Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow, Russia; Moscow Institute of Physics and Technology (State University), Dolgoprudny, Russia
| | - A V Kuzikov
- Institute of Biomedical Chemistry, Moscow, Russia; Pirogov Russian National Research Medical University, Moscow, Russia
| | - M V Gorshkov
- Talrose Institute for Energy Problems of Chemical Physics, Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow, Russia
| | - S A Moshkovskii
- Institute of Biomedical Chemistry, Moscow, Russia; Pirogov Russian National Research Medical University, Moscow, Russia
| |
Collapse
|
8
|
Ang MY, Low TY, Lee PY, Wan Mohamad Nazarie WF, Guryev V, Jamal R. Proteogenomics: From next-generation sequencing (NGS) and mass spectrometry-based proteomics to precision medicine. Clin Chim Acta 2019; 498:38-46. [DOI: 10.1016/j.cca.2019.08.010] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2019] [Revised: 08/13/2019] [Accepted: 08/13/2019] [Indexed: 12/14/2022]
|
9
|
Fert-Bober J, Murray CI, Parker SJ, Van Eyk JE. Precision Profiling of the Cardiovascular Post-Translationally Modified Proteome: Where There Is a Will, There Is a Way. Circ Res 2019; 122:1221-1237. [PMID: 29700069 DOI: 10.1161/circresaha.118.310966] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
There is an exponential increase in biological complexity as initial gene transcripts are spliced, translated into amino acid sequence, and post-translationally modified. Each protein can exist as multiple chemical or sequence-specific proteoforms, and each has the potential to be a critical mediator of a physiological or pathophysiological signaling cascade. Here, we provide an overview of how different proteoforms come about in biological systems and how they are most commonly measured using mass spectrometry-based proteomics and bioinformatics. Our goal is to present this information at a level accessible to every scientist interested in mass spectrometry and its application to proteome profiling. We will specifically discuss recent data linking various protein post-translational modifications to cardiovascular disease and conclude with a discussion for enablement and democratization of proteomics across the cardiovascular and scientific community. The aim is to inform and inspire the readership to explore a larger breadth of proteoform, particularity post-translational modifications, related to their particular areas of expertise in cardiovascular physiology.
Collapse
Affiliation(s)
- Justyna Fert-Bober
- From the Advanced Clinical BioSystems Research Institute, Smidt Heart Institute, Department of Medicine, Cedars Sinai Medical Center, Los Angeles, CA
| | - Christopher I Murray
- From the Advanced Clinical BioSystems Research Institute, Smidt Heart Institute, Department of Medicine, Cedars Sinai Medical Center, Los Angeles, CA
| | - Sarah J Parker
- From the Advanced Clinical BioSystems Research Institute, Smidt Heart Institute, Department of Medicine, Cedars Sinai Medical Center, Los Angeles, CA.
| | - Jennifer E Van Eyk
- From the Advanced Clinical BioSystems Research Institute, Smidt Heart Institute, Department of Medicine, Cedars Sinai Medical Center, Los Angeles, CA
| |
Collapse
|
10
|
Allmer J. Towards an Internet of Science. J Integr Bioinform 2019; 16:/j/jib.ahead-of-print/jib-2019-0024/jib-2019-0024.xml. [PMID: 31145694 PMCID: PMC6798852 DOI: 10.1515/jib-2019-0024] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2019] [Accepted: 04/25/2019] [Indexed: 11/15/2022] Open
Abstract
Big data and complex analysis workflows (pipelines) are common issues in data driven science such as bioinformatics. Large amounts of computational tools are available for data analysis. Additionally, many workflow management systems to piece together such tools into data analysis pipelines have been developed. For example, more than 50 computational tools for read mapping are available representing a large amount of duplicated effort. Furthermore, it is unclear whether these tools are correct and only a few have a user base large enough to have encountered and reported most of the potential problems. Bringing together many largely untested tools in a computational pipeline must lead to unpredictable results. Yet, this is the current state. While presently data analysis is performed on personal computers/workstations/clusters, the future will see development and analysis shift to the cloud. None of the workflow management systems is ready for this transition. This presents the opportunity to build a new system, which will overcome current duplications of effort, introduce proper testing, allow for development and analysis in public and private clouds, and include reporting features leading to interactive documents.
Collapse
Affiliation(s)
- Jens Allmer
- Hochschule Ruhr West, University of Applied Sciences, Medical Informatics and Bioinformatics, 45407 Mülheim an der Ruhr, Germany
| |
Collapse
|
11
|
Lisitsa AV, Petushkova NA, Levitsky LI, Zgoda VG, Larina OV, Kisrieva YS, Frankevich VE, Gamidov SI. Comparative Analysis of the Performаnce of Mascot and IdentiPy Algorithms on a Benchmark Dataset Obtained by Tandem Mass Spectrometry Analysis of Testicular Biopsies. Mol Biol 2019. [DOI: 10.1134/s0026893319010096] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
12
|
Understanding Camellia sinensis using Omics Technologies along with Endophytic Bacteria and Environmental Roles on Metabolism: A Review. APPLIED SCIENCES-BASEL 2019. [DOI: 10.3390/app9020281] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Camellia sinensis is the most consumed beverage worldwide. It contains a wide variety of secondary metabolites, such as alkaloids, saponins, tannins, catechins, and polyphenols, generated through a condensation reaction of cinnamic acid with three malonyl-CoA groups. In addition to the metabolic processes occurring within this plant, there are also some plant-associated bacterial endophytes. These bacteria reside in the living tissues of the host plants without causing any harmful effect to them, thereby stimulating secondary metabolite production with a diverse range of biological effects. Omics technologies reveal understanding of the biological phenomena of transcriptomics, proteomics, and metabolomics. In this sense, the present review aims to provide a comprehensive review of various methods used to identify distinct plant compounds, namely transcriptomic, proteomic, and metabolomic analysis. The role of endophytic bacteria in C. sinensis metabolism, and C. sinensis antioxidant and antimicrobial effects, are also carefully highlighted.
Collapse
|
13
|
Big Text advantages and challenges: classification perspective. INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS 2017. [DOI: 10.1007/s41060-017-0087-5] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
|
14
|
Kiseleva O, Poverennaya E, Shargunov A, Lisitsa A. Proteomic Cinderella: Customized analysis of bulky MS/MS data in one night. J Bioinform Comput Biol 2017; 16:1740011. [PMID: 29216772 DOI: 10.1142/s021972001740011x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Proteomic challenges, stirred up by the advent of high-throughput technologies, produce large amount of MS data. Nowadays, the routine manual search does not satisfy the "speed" of modern science any longer. In our work, the necessity of single-thread analysis of bulky data emerged during interpretation of HepG2 proteome profiling results for proteoforms searching. We compared the contribution of each of the eight search engines (X!Tandem, MS-GF[Formula: see text], MS Amanda, MyriMatch, Comet, Tide, Andromeda, and OMSSA) integrated in an open-source graphical user interface SearchGUI ( http://searchgui.googlecode.com ) into total result of proteoforms identification and optimized set of engines working simultaneously. We also compared the results of our search combination with Mascot results using protein kit UPS2, containing 48 human proteins. We selected combination of X!Tandem, MS-GF[Formula: see text] and OMMSA as the most time-efficient and productive combination of search. We added homemade java-script to automatize pipeline from file picking to report generation. These settings resulted in rise of the efficiency of our customized pipeline unobtainable by manual scouting: the analysis of 192 files searched against human proteome (42153 entries) downloaded from UniProt took 11[Formula: see text]h.
Collapse
Affiliation(s)
- Olga Kiseleva
- 1 Department of Bioinformatics, Institute of Biomedical Chemistry, 10/8 Pogodinskaya str., Moscow 119121, Russia
| | - Ekaterina Poverennaya
- 1 Department of Bioinformatics, Institute of Biomedical Chemistry, 10/8 Pogodinskaya str., Moscow 119121, Russia
| | - Alexander Shargunov
- 1 Department of Bioinformatics, Institute of Biomedical Chemistry, 10/8 Pogodinskaya str., Moscow 119121, Russia
| | - Andrey Lisitsa
- 1 Department of Bioinformatics, Institute of Biomedical Chemistry, 10/8 Pogodinskaya str., Moscow 119121, Russia
| |
Collapse
|