1
|
Wu X, Jia W. Multilayer Annotation Strategy AnnoSePS: Disentangling the Intricate Structure of Selenium-Containing Polysaccharides Based on Preferential Fragmentation Patterns. Anal Chem 2024; 96:10696-10704. [PMID: 38904260 DOI: 10.1021/acs.analchem.4c01576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/22/2024]
Abstract
Precision mapping of selenium at structural and position levels poses significant challenges in selenium-containing polysaccharide identification. Due to the absence of reference spectra, database-centric approaches are still limited in the discovery of selenium binding sites and distinction among different isomeric structures. A multilayer annotation strategy, AnnoSePS, is proposed for achieving the identification of seleno-substituent and the unbiased profiling of polysaccharides. Applying Snoop-triggered multiple reaction monitoring (Snoop-MRM) identified multidimensional monosaccharides in selenium-containing polysaccharides. Galactose, galacturonic acid, and glucose were the predominant monosaccharides with a molar ratio of 25.19, 19.45, and 11.72, respectively. Selenium present in seleno-rhamnose was found to substitute the hydroxyl group located at C-1 positions through the formation of a Se-H bond. Ions C6H9O3Se-, C6H7O3Se-, C5H5O3Se-, C4H5O2Se-, C3H5O2Se-, C2H3O2Se-, and CHOSe- were defined as the characteristic fragments of seleno-rhamnose. The agglomerative hierarchical clustering algorithm is applied to group spectra from each run based on the characteristic information. Preferential fragmentation patterns in mass spectrometry are revealed by training a probabilistic model. A list of candidate oligosaccharides is generated by step-by-step browsing through the transition pairs for all reference spectra and applying the transitions (addition, insertion, removal, and substitution) to reference structures. Combining time course analyses revealed the linkage composition of selenium-containing oligosaccharides. Glycosidic linkages were annotated based on a synthesis-driven approach. T-Galactose (16.67 ± 5.23%) and T-Galacturonic acid (11.54 ± 4.66%) were the predominant linkage residues. As the database-independent mapping strategy, AnnoSePS makes it possible to comprehensively interrogate spectral data and dissect the fine structure of selenium-containing polysaccharides.
Collapse
Affiliation(s)
- Xixuan Wu
- School of Food and Biological Engineering, Shaanxi University of Science and Technology, Xi'an 710021, China
| | - Wei Jia
- School of Food and Biological Engineering, Shaanxi University of Science and Technology, Xi'an 710021, China
- Shaanxi Research Institute of Agricultural Products Processing Technology, Xi'an 710021, China
| |
Collapse
|
2
|
Prunier G, Cherkaoui M, Lysiak A, Langella O, Blein-Nicolas M, Lollier V, Benoist E, Jean G, Fertin G, Rogniaux H, Tessier D. Fast alignment of mass spectra in large proteomics datasets, capturing dissimilarities arising from multiple complex modifications of peptides. BMC Bioinformatics 2023; 24:421. [PMID: 37940845 PMCID: PMC10631047 DOI: 10.1186/s12859-023-05555-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Accepted: 10/30/2023] [Indexed: 11/10/2023] Open
Abstract
BACKGROUND In proteomics, the interpretation of mass spectra representing peptides carrying multiple complex modifications remains challenging, as it is difficult to strike a balance between reasonable execution time, a limited number of false positives, and a huge search space allowing any number of modifications without a priori. The scientific community needs new developments in this area to aid in the discovery of novel post-translational modifications that may play important roles in disease. RESULTS To make progress on this issue, we implemented SpecGlobX (SpecGlob eXTended to eXperimental spectra), a standalone Java application that quickly determines the best spectral alignments of a (possibly very large) list of Peptide-to-Spectrum Matches (PSMs) provided by any open modification search method, or generated by the user. As input, SpecGlobX reads a file containing spectra in MGF or mzML format and a semicolon-delimited spreadsheet describing the PSMs. SpecGlobX returns the best alignment for each PSM as output, splitting the mass difference between the spectrum and the peptide into one or more shifts while considering the possibility of non-aligned masses (a phenomenon resulting from many situations including neutral losses). SpecGlobX is fast, able to align one million PSMs in about 1.5 min on a standard desktop. Firstly, we remind the foundations of the algorithm and detail how we adapted SpecGlob (the method we previously developed following the same aim, but limited to the interpretation of perfect simulated spectra) to the interpretation of imperfect experimental spectra. Then, we highlight the interest of SpecGlobX as a complementary tool downstream to three open modification search methods on a large simulated spectra dataset. Finally, we ran SpecGlobX on a proteome-wide dataset downloaded from PRIDE to demonstrate that SpecGlobX functions just as well on simulated and experimental spectra. We then carefully analyzed a limited set of interpretations. CONCLUSIONS SpecGlobX is helpful as a decision support tool, providing keys to interpret peptides carrying complex modifications still poorly considered by current open modification search software. Better alignment of PSMs enhances confidence in the identification of spectra provided by open modification search methods and should improve the interpretation rate of spectra.
Collapse
Affiliation(s)
- Grégoire Prunier
- INRAE, PROBE Research Infrastructure, BIBS Facility, 44300, Nantes, France
- INRAE, UR1268 Biopolymères Interactions Assemblages, 44316, Nantes, France
| | - Mehdi Cherkaoui
- INRAE, PROBE Research Infrastructure, BIBS Facility, 44300, Nantes, France
- INRAE, UR1268 Biopolymères Interactions Assemblages, 44316, Nantes, France
| | - Albane Lysiak
- INRAE, PROBE Research Infrastructure, BIBS Facility, 44300, Nantes, France
- Nantes Université, CNRS, LS2N, UMR 6004, 44000, Nantes, France
| | - Olivier Langella
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE - Le Moulon, PAPPSO, 91190, Gif-Sur-Yvette, France
| | - Mélisande Blein-Nicolas
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE - Le Moulon, PAPPSO, 91190, Gif-Sur-Yvette, France
| | - Virginie Lollier
- INRAE, PROBE Research Infrastructure, BIBS Facility, 44300, Nantes, France
- INRAE, UR1268 Biopolymères Interactions Assemblages, 44316, Nantes, France
| | - Emile Benoist
- Nantes Université, CNRS, LS2N, UMR 6004, 44000, Nantes, France
| | - Géraldine Jean
- Nantes Université, CNRS, LS2N, UMR 6004, 44000, Nantes, France
| | | | - Hélène Rogniaux
- INRAE, PROBE Research Infrastructure, BIBS Facility, 44300, Nantes, France
- INRAE, UR1268 Biopolymères Interactions Assemblages, 44316, Nantes, France
| | - Dominique Tessier
- INRAE, PROBE Research Infrastructure, BIBS Facility, 44300, Nantes, France.
- INRAE, UR1268 Biopolymères Interactions Assemblages, 44316, Nantes, France.
| |
Collapse
|
3
|
The Bridge between Screening and Assessment: Establishment and Application of Online Screening Platform for Food Risk Substances. J FOOD QUALITY 2021. [DOI: 10.1155/2021/2275471] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
In order to improve the risk identification ability of the technical support system of food safety supervision, an online screening platform for food risk substances (hereafter referred to as “platform”) was established. The platform aims at the qualitative analysis of unknown compounds and consists of three parts: a standard spectrum library, screening model, and online comparison module. The standard library contains the standard spectra of 527 food risk substances by high-performance liquid chromatography/high-resolution mass spectrometry. The screening comparison algorithm, the core of the screening model, is obtained through the improvement of the existing spectral library search algorithm. The inspector uploads the original spectrum file through the online comparison module; the online comparison module calls the corresponding script to convert the original spectrum file into a standard spectrum file and then uses the screening and comparison algorithm to achieve online real-time comparison. The comparison results are used to determine whether the sample to be tested contains the food risk substances contained in the standard library, so as to realize the preliminary screening of potential food risk substances. The platform supports the spectrogram data format of mainstream instrument manufacturers. The standard spectrogram database can be coconstructed and shared by cooperative laboratories to effectively enrich the types of food risk substances. Through laboratory comparison, data calibration, and model optimization, the screening accuracy of the platform can reach more than 97%. The platform adopts the Internet online screening method, which greatly facilitates the risk investigation and control of national food safety inspection and testing institutions. At the same time, the construction of the screening platform for food risk substances based on high-performance liquid chromatography/high-resolution mass spectrometry, the Internet, big data, and other technologies will provide a new technical means for food safety risk management and control. Hence, it can build a bridge between the screening of risk substances and illegally added substances, as well as risk assessment, risk management, and control.
Collapse
|
4
|
Lysiak A, Fertin G, Jean G, Tessier D. Evaluation of open search methods based on theoretical mass spectra comparison. BMC Bioinformatics 2021; 22:65. [PMID: 33902435 PMCID: PMC8073971 DOI: 10.1186/s12859-021-03963-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Accepted: 01/08/2021] [Indexed: 11/17/2022] Open
Abstract
Background Mass spectrometry remains the privileged method to characterize proteins. Nevertheless, most of the spectra generated by an experiment remain unidentified after their analysis, mostly because of the modifications they carry. Open Modification Search (OMS) methods offer a promising answer to this problem. However, assessing the quality of OMS identifications remains a difficult task. Methods Aiming at better understanding the relationship between (1) similarity of pairs of spectra provided by OMS methods and (2) relevance of their corresponding peptide sequences, we used a dataset composed of theoretical spectra only, on which we applied two OMS strategies. We also introduced two appropriately defined measures for evaluating the above mentioned spectra/sequence relevance in this context: one is a color classification representing the level of difficulty to retrieve the proper sequence of the peptide that generated the identified spectrum ; the other, called LIPR, is the proportion of common masses, in a given Peptide Spectrum Match (PSM), that represent dissimilar sequences. These two measures were also considered in conjunction with the False Discovery Rate (FDR). Results According to our measures, the strategy that selects the best candidate by taking the mass difference between two spectra into account yields better quality results. Besides, although the FDR remains an interesting indicator in OMS methods (as shown by LIPR), it is questionable: indeed, our color classification shows that a non negligible proportion of relevant spectra/sequence interpretations corresponds to PSMs coming from the decoy database. Conclusions The three above mentioned measures allowed us to clearly determine which of the two studied OMS strategies outperformed the other, both in terms of number of identifications and of accuracy of these identifications. Even though quality evaluation of PSMs in OMS methods remains challenging, the study of theoretical spectra is a favorable framework for going further in this direction.
Collapse
Affiliation(s)
- Albane Lysiak
- CNRS, LS2N, Université de Nantes, 44000, Nantes, France.,UR BIA, INRAE, 44316, Nantes, France
| | | | | | - Dominique Tessier
- BIBS Facility, INRAE, 44316, Nantes, France.,UR BIA, INRAE, 44316, Nantes, France
| |
Collapse
|
5
|
Aggarwal S, Tolani P, Gupta S, Yadav AK. Posttranslational modifications in systems biology. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2021; 127:93-126. [PMID: 34340775 DOI: 10.1016/bs.apcsb.2021.03.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The biological complexity cannot be captured by genes or proteins alone. The protein posttranslational modifications (PTMs) impart functional diversity to the proteome and regulate protein structure, activity, localization and interactions. Their dynamics drive cellular signaling, growth and development while their dysregulation causes many diseases. Mass spectrometry based quantitative profiling of PTMs and bioinformatics analysis tools allow systems level insights into their network architecture. High-resolution profiling of PTM networks will advance disease understanding and precision medicine. It can accelerate the discovery of biomarkers and drug targets. This requires better tools for unbiased, high-throughput and accurate PTM identification, site localization and automated annotation on a systems level.
Collapse
Affiliation(s)
- Suruchi Aggarwal
- Translational Health Science and Technology Institute, NCR Biotech Science Cluster, Faridabad, Haryana, India; Department of Molecular Biology and Biotechnology, Cotton University, Guwahati, Assam, India
| | - Priya Tolani
- Translational Health Science and Technology Institute, NCR Biotech Science Cluster, Faridabad, Haryana, India
| | - Srishti Gupta
- Translational Health Science and Technology Institute, NCR Biotech Science Cluster, Faridabad, Haryana, India; School of Biosciences and Technology, Vellore Institute of Technology, Vellore, India
| | - Amit Kumar Yadav
- Translational Health Science and Technology Institute, NCR Biotech Science Cluster, Faridabad, Haryana, India.
| |
Collapse
|
6
|
Rusconi F. Free Open Source Software for Protein and Peptide Mass Spectrometry- based Science. Curr Protein Pept Sci 2021; 22:134-147. [PMID: 33461461 DOI: 10.2174/1389203722666210118160946] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2020] [Revised: 10/12/2020] [Accepted: 01/04/2021] [Indexed: 12/28/2022]
Abstract
In the field of biology, and specifically in protein and peptide science, the power of mass spectrometry is that it is applicable to a vast spectrum of applications. Mass spectrometry can be applied to identify proteins and peptides in complex mixtures, to identify and locate post-translational modifications, to characterize the structure of proteins and peptides to the most detailed level or to detect protein-ligand non-covalent interactions. Thanks to the Free and Open Source Software (FOSS) movement, scientists have limitless opportunities to deepen their skills in software development to code software that solves mass spectrometric data analysis problems. After the conversion of raw data files into open standard format files, the entire spectrum of data analysis tasks can now be performed integrally on FOSS platforms, like GNU/Linux, and only with FOSS solutions. This review presents a brief history of mass spectrometry open file formats and goes on with the description of FOSS projects that are commonly used in protein and peptide mass spectrometry fields of endeavor: identification projects that involve mostly automated pipelines, like proteomics and peptidomics, and bio-structural characterization projects that most often involve manual scrutiny of the mass data. Projects of the last kind usually involve software that allows the user to delve into the mass data in an interactive graphics-oriented manner. Software projects are thus categorized on the basis of these criteria: software libraries for software developers vs desktop-based graphical user interface, software for the end-user and automated pipeline-based data processing vs interactive graphics-based mass data scrutiny.
Collapse
Affiliation(s)
- Filippo Rusconi
- PAPPSO, Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE - Le Moulon, 91190, Gif-sur-Yvette, France
| |
Collapse
|
7
|
Tariq MU, Haseeb M, Aledhari M, Razzak R, Parizi RM, Saeed F. Methods for Proteogenomics Data Analysis, Challenges, and Scalability Bottlenecks: A Survey. IEEE ACCESS : PRACTICAL INNOVATIONS, OPEN SOLUTIONS 2020; 9:5497-5516. [PMID: 33537181 PMCID: PMC7853650 DOI: 10.1109/access.2020.3047588] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
Big Data Proteogenomics lies at the intersection of high-throughput Mass Spectrometry (MS) based proteomics and Next Generation Sequencing based genomics. The combined and integrated analysis of these two high-throughput technologies can help discover novel proteins using genomic, and transcriptomic data. Due to the biological significance of integrated analysis, the recent past has seen an influx of proteogenomic tools that perform various tasks, including mapping proteins to the genomic data, searching experimental MS spectra against a six-frame translation genome database, and automating the process of annotating genome sequences. To date, most of such tools have not focused on scalability issues that are inherent in proteogenomic data analysis where the size of the database is much larger than a typical protein database. These state-of-the-art tools can take more than half a month to process a small-scale dataset of one million spectra against a genome of 3 GB. In this article, we provide an up-to-date review of tools that can analyze proteogenomic datasets, providing a critical analysis of the techniques' relative merits and potential pitfalls. We also point out potential bottlenecks and recommendations that can be incorporated in the future design of these workflows to ensure scalability with the increasing size of proteogenomic data. Lastly, we make a case of how high-performance computing (HPC) solutions may be the best bet to ensure the scalability of future big data proteogenomic data analysis.
Collapse
Affiliation(s)
- Muhammad Usman Tariq
- School of Computing and Information Sciences, Florida International University, Miami, FL 33199, USA
| | - Muhammad Haseeb
- School of Computing and Information Sciences, Florida International University, Miami, FL 33199, USA
| | - Mohammed Aledhari
- College of Computing and Software Engineering, Kennesaw State University, Marietta, GA 30060, USA
| | - Rehma Razzak
- College of Computing and Software Engineering, Kennesaw State University, Marietta, GA 30060, USA
| | - Reza M Parizi
- College of Computing and Software Engineering, Kennesaw State University, Marietta, GA 30060, USA
| | - Fahad Saeed
- School of Computing and Information Sciences, Florida International University, Miami, FL 33199, USA
| |
Collapse
|
8
|
Qin C, Luo X, Deng C, Shu K, Zhu W, Griss J, Hermjakob H, Bai M, Perez-Riverol Y. Deep learning embedder method and tool for mass spectra similarity search. J Proteomics 2020; 232:104070. [PMID: 33307250 DOI: 10.1016/j.jprot.2020.104070] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Revised: 11/25/2020] [Accepted: 12/01/2020] [Indexed: 12/31/2022]
Abstract
Spectral similarity calculation is widely used in protein identification tools and mass spectra clustering algorithms while comparing theoretical or experimental spectra. The performance of the spectral similarity calculation plays an important role in these tools and algorithms especially in the analysis of large-scale datasets. Recently, deep learning methods have been proposed to improve the performance of clustering algorithms and protein identification by training the algorithms with existing data and the use of multiple spectra and identified peptide features. While the efficiency of these algorithms is still under study in comparison with traditional approaches, their application in proteomics data analysis is becoming more common. Here, we propose the use of deep learning to improve spectral similarity comparison. We assessed the performance of deep learning for spectral similarity, with GLEAMS and a newly trained embedder model (DLEAMSE), which uses high-quality spectra from PRIDE Cluster. Also, we developed a new bioinformatics tool (mslookup - https://github.com/bigbio/DLEAMSE/) that allows users to quickly search for spectra in previously identified mass spectra publish in public repositories and spectral libraries. Finally, we released a human database to enable bioinformaticians and biologists to search for identified spectra in their machines. SIGNIFICANCE STATEMENT: Spectral similarity calculation plays an important role in proteomics data analysis. With deep learning's ability to learn the implicit and effective features from large-scale training datasets, deep learning-based MS/MS spectra embedding models has emerged as a solution to improve mass spectral clustering similarity calculation algorithms. We compare multiple similarity scoring and deep learning methods in terms of accuracy (compute the similarity for a pair of the mass spectrum) and computing-time performance. The benchmark results showed no major differences in accuracy between DLEAMSE and normalized dot product for spectrum similarity calculations. The DLEAMSE GPU implementation is faster than NDP in preprocessing on the GPU server and the similarity calculation of DLEAMSE (Euclidean distance on 32-D vectors) takes about 1/3 of dot product calculations. The deep learning model (DLEAMSE) encoding and embedding steps needed to run once for each spectrum and the embedded 32-D points can be persisted in the repository for future comparison, which is faster for future comparisons and large-scale data. Based on these, we proposed a new tool mslookup that enables the researcher to find spectra previously identified in public data. The tool can be also used to generate in-house databases of previously identified spectra to share with other laboratories and consortiums.
Collapse
Affiliation(s)
- Chunyuan Qin
- Chongqing Key Laboratory on Big Data for Bio Intelligence, Chongqing University of Posts and telecommunications, Chongqing, China
| | - Xiyang Luo
- Chongqing Key Laboratory on Big Data for Bio Intelligence, Chongqing University of Posts and telecommunications, Chongqing, China
| | - Chuan Deng
- Chongqing Key Laboratory on Big Data for Bio Intelligence, Chongqing University of Posts and telecommunications, Chongqing, China
| | - Kunxian Shu
- Chongqing Key Laboratory on Big Data for Bio Intelligence, Chongqing University of Posts and telecommunications, Chongqing, China
| | - Weimin Zhu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Life Omics, Beijing 102206, China
| | - Johannes Griss
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK; Department of Dermatology, Medical University of Vienna, 1090 Vienna, Austria
| | - Henning Hermjakob
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Life Omics, Beijing 102206, China; European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Mingze Bai
- Chongqing Key Laboratory on Big Data for Bio Intelligence, Chongqing University of Posts and telecommunications, Chongqing, China; State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Life Omics, Beijing 102206, China.
| | - Yasset Perez-Riverol
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK.
| |
Collapse
|
9
|
Dong N, Spencer DM, Quan Q, Le Blanc JCY, Feng J, Li M, Siu KWM, Chu IK. rPTMDetermine: A Fully Automated Methodology for Endogenous Tyrosine Nitration Validation, Site-Localization, and Beyond. Anal Chem 2020; 92:10768-10776. [DOI: 10.1021/acs.analchem.0c02148] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Affiliation(s)
- Naiping Dong
- Department of Chemistry, The University of Hong Kong, Pokfulam, Hong Kong, China
| | - Daniel M. Spencer
- Department of Chemistry, The University of Hong Kong, Pokfulam, Hong Kong, China
| | - Quan Quan
- Department of Chemistry, The University of Hong Kong, Pokfulam, Hong Kong, China
| | | | - Jinwen Feng
- Department of Chemistry, The University of Hong Kong, Pokfulam, Hong Kong, China
| | - Mengzhu Li
- Department of Chemistry, The University of Hong Kong, Pokfulam, Hong Kong, China
| | - K. W. Michael Siu
- Department of Chemistry, The University of Hong Kong, Pokfulam, Hong Kong, China
- Department of Chemistry and Centre for Research in Mass Spectrometry, York University, Toronto, Ontario M3J 1P3, Canada
- Department of Chemistry and Biochemistry, University of Windsor, Windsor, Ontario N9B 3P4, Canada
| | - Ivan K. Chu
- Department of Chemistry, The University of Hong Kong, Pokfulam, Hong Kong, China
| |
Collapse
|
10
|
Optimization of TripleTOF spectral simulation and library searching for confident localization of phosphorylation sites. PLoS One 2019; 14:e0225885. [PMID: 31790495 PMCID: PMC6886777 DOI: 10.1371/journal.pone.0225885] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2019] [Accepted: 11/14/2019] [Indexed: 12/31/2022] Open
Abstract
Tandem mass spectrometry (MS/MS) has been used in analysis of proteins and their post-translational modifications. A recently developed data analysis method, which simulates MS/MS spectra of phosphopeptides and performs spectral library searching using SpectraST, facilitates confident localization of phosphorylation sites. However, its performance has been evaluated only on MS/MS spectra acquired using Orbitrap HCD mass spectrometers so far. In this study, we have investigated whether this approach would be applicable to another type of mass spectrometers, and optimized the simulation and search conditions to achieve sensitive and confident site localization. Synthetic phosphopeptides and enriched K562 cell phosphopeptides were analyzed using a TripleTOF 6600 mass spectrometer before and after enzymatic dephosphorylation. Dephosphorylated peptides identified by X!Tandem database searching were subjected to spectral simulation of all possible single phosphorylations using SimPhospho software. Phosphopeptides were identified and localized by SpectraST searching against a library of the simulated spectra. Although no synthetic phosphopeptide was localized at 1% false localization rate under the previous conditions, optimization of the spectral simulation and search conditions for the TripleTOF datasets achieved the localization and improved the sensitivity. Furthermore, the optimized conditions enabled sensitive localization of K562 phosphopeptides at 1% false discovery and localization rates. These results suggest that accurate phosphopeptide simulation of TripleTOF MS/MS spectra is possible and the simulated spectral libraries can be used in SpectraST searching for confident localization of phosphorylation sites.
Collapse
|
11
|
Nunes J, Charneira C, Morello J, Rodrigues J, Pereira SA, Antunes AMM. Mass Spectrometry-Based Methodologies for Targeted and Untargeted Identification of Protein Covalent Adducts (Adductomics): Current Status and Challenges. High Throughput 2019; 8:ht8020009. [PMID: 31018479 PMCID: PMC6631461 DOI: 10.3390/ht8020009] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2019] [Revised: 04/18/2019] [Accepted: 04/20/2019] [Indexed: 12/12/2022] Open
Abstract
Protein covalent adducts formed upon exposure to reactive (mainly electrophilic) chemicals may lead to the development of a wide range of deleterious health outcomes. Therefore, the identification of protein covalent adducts constitutes a huge opportunity for a better understanding of events underlying diseases and for the development of biomarkers which may constitute effective tools for disease diagnosis/prognosis, for the application of personalized medicine approaches and for accurately assessing human exposure to chemical toxicants. The currently available mass spectrometry (MS)-based methodologies, are clearly the most suitable for the analysis of protein covalent modifications, providing accuracy, sensitivity, unbiased identification of the modified residue and conjugates along with quantitative information. However, despite the huge technological advances in MS instrumentation and bioinformatics tools, the identification of low abundant protein covalent adducts is still challenging. This review is aimed at summarizing the MS-based methodologies currently used for the identification of protein covalent adducts and the strategies developed to overcome the analytical challenges, involving not only sample pre-treatment procedures but also distinct MS and data analysis approaches.
Collapse
Affiliation(s)
- João Nunes
- Centro de Química Estrutural, Instituto Superior Técnico, ULisboa, 1049-001 Lisboa, Portugal.
| | - Catarina Charneira
- Centro de Química Estrutural, Instituto Superior Técnico, ULisboa, 1049-001 Lisboa, Portugal.
| | - Judit Morello
- Centro de Química Estrutural, Instituto Superior Técnico, ULisboa, 1049-001 Lisboa, Portugal.
| | - João Rodrigues
- Clarify Analytical, Rua dos Mercadores 128A, 7000-872 Évora, Portugal.
| | - Sofia A Pereira
- CEDOC, Chronic Diseases Research Centre, NOVA Medical School, Faculdade de Ciências Médicas, Universidade NOVA de Lisboa, 1169-006 Lisboa, Portugal.
| | - Alexandra M M Antunes
- Centro de Química Estrutural, Instituto Superior Técnico, ULisboa, 1049-001 Lisboa, Portugal.
| |
Collapse
|
12
|
Bittremieux W, Meysman P, Noble WS, Laukens K. Fast Open Modification Spectral Library Searching through Approximate Nearest Neighbor Indexing. J Proteome Res 2018; 17:3463-3474. [PMID: 30184435 PMCID: PMC6173621 DOI: 10.1021/acs.jproteome.8b00359] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Open modification searching (OMS) is a powerful search strategy that identifies peptides carrying any type of modification by allowing a modified spectrum to match against its unmodified variant by using a very wide precursor mass window. A drawback of this strategy, however, is that it leads to a large increase in search time. Although performing an open search can be done using existing spectral library search engines by simply setting a wide precursor mass window, none of these tools have been optimized for OMS, leading to excessive runtimes and suboptimal identification results. We present the ANN-SoLo tool for fast and accurate open spectral library searching. ANN-SoLo uses approximate nearest neighbor indexing to speed up OMS by selecting only a limited number of the most relevant library spectra to compare to an unknown query spectrum. This approach is combined with a cascade search strategy to maximize the number of identified unmodified and modified spectra while strictly controlling the false discovery rate as well as a shifted dot product score to sensitively match modified spectra to their unmodified counterparts. ANN-SoLo achieves state-of-the-art performance in terms of speed and the number of identifications. On a previously published human cell line data set, ANN-SoLo confidently identifies more spectra than SpectraST or MSFragger and achieves a speedup of an order of magnitude compared with SpectraST. ANN-SoLo is implemented in Python and C++. It is freely available under the Apache 2.0 license at https://github.com/bittremieux/ANN-SoLo .
Collapse
Affiliation(s)
- Wout Bittremieux
- Department of Mathematics and Computer Science , University of Antwerp , 2020 Antwerp , Belgium
- Biomedical Informatics Network Antwerpen (biomina) , 2020 Antwerp , Belgium
- Department of Genome Sciences , University of Washington , Seattle , Washington 98195 , United States
| | - Pieter Meysman
- Department of Mathematics and Computer Science , University of Antwerp , 2020 Antwerp , Belgium
- Biomedical Informatics Network Antwerpen (biomina) , 2020 Antwerp , Belgium
| | - William Stafford Noble
- Department of Genome Sciences , University of Washington , Seattle , Washington 98195 , United States
- Department of Computer Science and Engineering , University of Washington , Seattle , Washington 98195 , United States
| | - Kris Laukens
- Department of Mathematics and Computer Science , University of Antwerp , 2020 Antwerp , Belgium
- Biomedical Informatics Network Antwerpen (biomina) , 2020 Antwerp , Belgium
| |
Collapse
|
13
|
Mylonas R, Beer I, Iseli C, Chong C, Pak HS, Gfeller D, Coukos G, Xenarios I, Müller M, Bassani-Sternberg M. Estimating the Contribution of Proteasomal Spliced Peptides to the HLA-I Ligandome. Mol Cell Proteomics 2018; 17:2347-2357. [PMID: 30171158 PMCID: PMC6283289 DOI: 10.1074/mcp.ra118.000877] [Citation(s) in RCA: 80] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Revised: 08/27/2018] [Indexed: 12/21/2022] Open
Abstract
It has been reported that about 30% of the HLA-I ligands are produced by proteasomal splicing of two noncontiguous fragments of a parental protein. We report that the identification of many of those spliced peptides is ambiguous. With an alternative workflow, based on de novo sequencing and subsequent verification with multiple search tools, we estimate that the upper bound for the proportion of cis-spliced peptides is 2–6%. Nevertheless, the true contribution of spliced peptides to the ligandome may be much smaller. Spliced peptides are short protein fragments spliced together in the proteasome by peptide bond formation. True estimation of the contribution of proteasome-spliced peptides (PSPs) to the global human leukocyte antigen (HLA) ligandome is critical. A recent study suggested that PSPs contribute up to 30% of the HLA ligandome. We performed a thorough reanalysis of the reported results using multiple computational tools and various validation steps and concluded that only a fraction of the proposed PSPs passes the quality filters. To better estimate the actual number of PSPs, we present an alternative workflow. We performed de novo sequencing of the HLA-peptide spectra and discarded all de novo sequences found in the UniProt database. We checked whether the remaining de novo sequences could match spliced peptides from human proteins. The spliced sequences were appended to the UniProt fasta file, which was searched by two search tools at a false discovery rate (FDR) of 1%. We find that 2–6% of the HLA ligandome could be explained as spliced protein fragments. The majority of these potential PSPs have good peptide-spectrum match properties and are predicted to bind the respective HLA molecules. However, it remains to be shown how many of these potential PSPs actually originate from proteasomal splicing events.
Collapse
Affiliation(s)
- Roman Mylonas
- Vital-IT, 1015 Lausanne, Switzerland; Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Ilan Beer
- Adicet Bio Israel, Ltd., Technion City, 32000, Haifa, Israel
| | - Christian Iseli
- Vital-IT, 1015 Lausanne, Switzerland; Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Chloe Chong
- Ludwig Cancer Research Center, University of Lausanne, 1066 Epalinges, Switzerland; Department of Oncology, University Hospital of Lausanne, 1011 Lausanne, Switzerland
| | - Hui-Song Pak
- Ludwig Cancer Research Center, University of Lausanne, 1066 Epalinges, Switzerland; Department of Oncology, University Hospital of Lausanne, 1011 Lausanne, Switzerland
| | - David Gfeller
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland; Ludwig Cancer Research Center, University of Lausanne, 1066 Epalinges, Switzerland
| | - George Coukos
- Ludwig Cancer Research Center, University of Lausanne, 1066 Epalinges, Switzerland; Department of Oncology, University Hospital of Lausanne, 1011 Lausanne, Switzerland
| | - Ioannis Xenarios
- Vital-IT, 1015 Lausanne, Switzerland; Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Markus Müller
- Vital-IT, 1015 Lausanne, Switzerland; Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland.
| | - Michal Bassani-Sternberg
- Vital-IT, 1015 Lausanne, Switzerland; Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland.
| |
Collapse
|
14
|
Willems S, Bouyssié D, Deforce D, Dorfer V, Gorshkov V, Kopczynski D, Laukens K, Locard-Paulet M, Schwämmle V, Uszkoreit J, Valkenborg D, Vaudel M, Bittremieux W. Proceedings of the EuBIC developer's meeting 2018. J Proteomics 2018; 187:25-27. [PMID: 29864591 DOI: 10.1016/j.jprot.2018.05.015] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2018] [Accepted: 05/27/2018] [Indexed: 11/18/2022]
Abstract
The inaugural European Bioinformatics Community (EuBIC) developer's meeting was held from January 9th to January 12th 2018 in Ghent, Belgium. While the meeting kicked off with an interactive keynote session featuring four internationally renowned experts in the field of computational proteomics, its primary focus were the hands-on hackathon sessions which featured six community-proposed projects revolving around three major topics: Here, we present an overview of the scientific program of the EuBIC developer's meeting and provide a starting point for follow-up on the covered projects.
Collapse
Affiliation(s)
- Sander Willems
- Laboratory of Pharmaceutical Biotechnology, Ghent University, Ghent, Belgium
| | - David Bouyssié
- Institute of Pharmacology and Structural Biology, University of Toulouse, CNRS, UPS, Toulouse, France
| | - Dieter Deforce
- Laboratory of Pharmaceutical Biotechnology, Ghent University, Ghent, Belgium
| | - Viktoria Dorfer
- Bioinformatics Research Group, University of Applied Sciences Upper Austria, Hagenberg, Austria
| | - Vladimir Gorshkov
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense M, Denmark
| | - Dominik Kopczynski
- Leibniz-Institut für Analytische Wissenschaften - ISAS - e.V., Dortmund, Germany
| | - Kris Laukens
- Department of Mathematics and Computer Science, University of Antwerp, Antwerp, Belgium
| | - Marie Locard-Paulet
- Institute of Pharmacology and Structural Biology, University of Toulouse, CNRS, UPS, Toulouse, France
| | - Veit Schwämmle
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense M, Denmark
| | - Julian Uszkoreit
- Medizinisches Proteom-Center, Ruhr University Bochum, Bochum, Germany
| | - Dirk Valkenborg
- Interuniversity Institute for Biostatistics and Statistical Bioinformatics, Hasselt University, Hasselt, Belgium; Centre for Proteomics, University of Antwerp, Antwerp, Belgium
| | - Marc Vaudel
- KG Jebsen Center for Diabetes Research, Department of Clinical Science, University of Bergen, Bergen, Norway; Center for Medical Genetics and Molecular Medicine, Haukeland University Hospital, Bergen, Norway
| | - Wout Bittremieux
- Department of Mathematics and Computer Science, University of Antwerp, Antwerp, Belgium; Department of Genome Sciences, University of Washington, Seattle, WA, USA.
| |
Collapse
|
15
|
Zhang Z, Burke M, Mirokhin YA, Tchekhovskoi DV, Markey SP, Yu W, Chaerkady R, Hess S, Stein SE. Reverse and Random Decoy Methods for False Discovery Rate Estimation in High Mass Accuracy Peptide Spectral Library Searches. J Proteome Res 2018; 17:846-857. [DOI: 10.1021/acs.jproteome.7b00614] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Affiliation(s)
- Zheng Zhang
- Mass
Spectrometry Data Center, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, Maryland 20899, United States
| | - Meghan Burke
- Mass
Spectrometry Data Center, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, Maryland 20899, United States
| | - Yuri A. Mirokhin
- Mass
Spectrometry Data Center, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, Maryland 20899, United States
| | - Dmitrii V. Tchekhovskoi
- Mass
Spectrometry Data Center, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, Maryland 20899, United States
| | - Sanford P. Markey
- Mass
Spectrometry Data Center, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, Maryland 20899, United States
| | - Wen Yu
- Research
Bioinformatics, MedImmune LLC, One MedImmune Way, Gaithersburg, Maryland 20878, United States
| | - Raghothama Chaerkady
- Antibody
Discovery and Protein Engineering, Protein Sciences, MedImmune LLC, One MedImmune Way, Gaithersburg, Maryland 20878, United States
| | - Sonja Hess
- Antibody
Discovery and Protein Engineering, Protein Sciences, MedImmune LLC, One MedImmune Way, Gaithersburg, Maryland 20878, United States
| | - Stephen E. Stein
- Mass
Spectrometry Data Center, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, Maryland 20899, United States
| |
Collapse
|
16
|
Horlacher O, Jin C, Alocci D, Mariethoz J, Müller M, Karlsson NG, Lisacek F. Glycoforest 1.0. Anal Chem 2017; 89:10932-10940. [PMID: 28901741 DOI: 10.1021/acs.analchem.7b02754] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Tandem mass spectrometry, when combined with liquid chromatography and applied to complex mixtures, produces large amounts of raw data, which needs to be analyzed to identify molecular structures. This technique is widely used, particularly in glycomics. Due to a lack of high throughput glycan sequencing software, glycan spectra are predominantly sequenced manually. A challenge for writing glycan-sequencing software is that there is no direct template that can be used to infer structures detectable in an organism. To help alleviate this bottleneck, we present Glycoforest 1.0, a partial de novo algorithm for sequencing glycan structures based on MS/MS spectra. Glycoforest was tested on two data sets (human gastric and salmon mucosa O-linked glycomes) for which MS/MS spectra were annotated manually. Glycoforest generated the human validated structure for 92% of test cases. The correct structure was found as the best scoring match for 70% and among the top 3 matches for 83% of test cases. In addition, the Glycoforest algorithm detected glycan structures from MS/MS spectra missing a manual annotation. In total 1532 MS/MS previously unannotated spectra were annotated by Glycoforest. A portion containing 521 spectra was manually checked confirming that Glycoforest annotated an additional 50 MS/MS spectra overlooked during manual annotation.
Collapse
Affiliation(s)
- Oliver Horlacher
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics , Geneva, 1211, Switzerland.,University of Geneva , Geneva, 1211, Switzerland
| | - Chunsheng Jin
- Glyco Inflammatory Group, Department of Medical Biochemistry and Cell Biology, Institute of Biomedicine, Sahlgrenska Academy, University of Gothenburg , Gothenburg, SE405 30, Sweden
| | - Davide Alocci
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics , Geneva, 1211, Switzerland.,University of Geneva , Geneva, 1211, Switzerland
| | - Julien Mariethoz
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics , Geneva, 1211, Switzerland.,University of Geneva , Geneva, 1211, Switzerland
| | - Markus Müller
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics , Geneva, 1211, Switzerland.,University of Geneva , Geneva, 1211, Switzerland
| | - Niclas G Karlsson
- Glyco Inflammatory Group, Department of Medical Biochemistry and Cell Biology, Institute of Biomedicine, Sahlgrenska Academy, University of Gothenburg , Gothenburg, SE405 30, Sweden
| | - Frederique Lisacek
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics , Geneva, 1211, Switzerland.,University of Geneva , Geneva, 1211, Switzerland
| |
Collapse
|
17
|
Dasari R, La Clair JJ, Kornienko A. Irreversible Protein Labeling by Paal-Knorr Conjugation. Chembiochem 2017; 18:1792-1796. [PMID: 28715110 PMCID: PMC5766258 DOI: 10.1002/cbic.201700210] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2017] [Indexed: 01/07/2023]
Abstract
The application of new chemical reactions in a biological context has advanced bioconjugation methods for both fundamental research and commercial arenas. Recent adaptations of reactions such as Huisgen 1,3-dipolar or Diels-Alder cycloadditions have enabled the labeling of specific residues in biomolecules by the attachment of molecules carrying azides, alkynes, or strained alkenes. Although these are fundamental tools, there is a need for the discovery of reactions that can label native proteins. We report herein the adaptation of the Paal-Knorr reaction to label lysine residues in proteins via pyrrole linkages.
Collapse
Affiliation(s)
- Ramesh Dasari
- Department of Chemistry and Biochemistry, Texas State University, San Marcos, TX, 78666, USA
| | - James J La Clair
- Department of Chemistry and Biochemistry, University of California at San Diego, La Jolla, CA, 92093, USA
- Xenobe Research Institute, P. O. Box 3052, San Diego, CA, 92163, USA
| | - Alexander Kornienko
- Department of Chemistry and Biochemistry, Texas State University, San Marcos, TX, 78666, USA
| |
Collapse
|
18
|
Ruggles KV, Krug K, Wang X, Clauser KR, Wang J, Payne SH, Fenyö D, Zhang B, Mani DR. Methods, Tools and Current Perspectives in Proteogenomics. Mol Cell Proteomics 2017; 16:959-981. [PMID: 28456751 DOI: 10.1074/mcp.mr117.000024] [Citation(s) in RCA: 95] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2017] [Indexed: 12/20/2022] Open
Abstract
With combined technological advancements in high-throughput next-generation sequencing and deep mass spectrometry-based proteomics, proteogenomics, i.e. the integrative analysis of proteomic and genomic data, has emerged as a new research field. Early efforts in the field were focused on improving protein identification using sample-specific genomic and transcriptomic sequencing data. More recently, integrative analysis of quantitative measurements from genomic and proteomic studies have identified novel insights into gene expression regulation, cell signaling, and disease. Many methods and tools have been developed or adapted to enable an array of integrative proteogenomic approaches and in this article, we systematically classify published methods and tools into four major categories, (1) Sequence-centric proteogenomics; (2) Analysis of proteogenomic relationships; (3) Integrative modeling of proteogenomic data; and (4) Data sharing and visualization. We provide a comprehensive review of methods and available tools in each category and highlight their typical applications.
Collapse
Affiliation(s)
- Kelly V Ruggles
- From the ‡Department of Medicine, New York University School of Medicine, New York, New York 10016
| | - Karsten Krug
- §The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142
| | - Xiaojing Wang
- ¶Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas 77030.,‖Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030
| | - Karl R Clauser
- §The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142
| | - Jing Wang
- ¶Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas 77030.,‖Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030
| | - Samuel H Payne
- **Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354
| | - David Fenyö
- ‡‡Department of Biochemistry and Molecular Pharmacology, New York University School of Medicine, New York, New York 10016; .,§§Institute for Systems Genetics, New York University School of Medicine, New York, New York 10016
| | - Bing Zhang
- ¶Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas 77030; .,‖Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030
| | - D R Mani
- §The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142;
| |
Collapse
|
19
|
Kohlbacher O, Vitek O, Weintraub ST. Challenges in Large-Scale Computational Mass Spectrometry and Multiomics. J Proteome Res 2016; 15:681-2. [DOI: 10.1021/acs.jproteome.6b00067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Oliver Kohlbacher
- Center for Bioinformatics, Quantitative Biology Center,
Department of Computer Science and Faculty of Medicine, University
of Tübingen and Max Planck Institute for Developmental Biology
| | - Olga Vitek
- Sy and Laurie Sternberg Interdisciplinary Associate
Professor, College of Science College of Computer and Information
Science, Northeastern University
| | - Susan T. Weintraub
- Department of Biochemistry, The University of Texas
Health Science Center at San Antonio
| |
Collapse
|