1
|
Picciani M, Gabriel W, Giurcoiu VG, Shouman O, Hamood F, Lautenbacher L, Jensen CB, Müller J, Kalhor M, Soleymaniniya A, Kuster B, The M, Wilhelm M. Oktoberfest: Open-source spectral library generation and rescoring pipeline based on Prosit. Proteomics 2024; 24:e2300112. [PMID: 37672792 DOI: 10.1002/pmic.202300112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 08/17/2023] [Accepted: 08/18/2023] [Indexed: 09/08/2023]
Abstract
Machine learning (ML) and deep learning (DL) models for peptide property prediction such as Prosit have enabled the creation of high quality in silico reference libraries. These libraries are used in various applications, ranging from data-independent acquisition (DIA) data analysis to data-driven rescoring of search engine results. Here, we present Oktoberfest, an open source Python package of our spectral library generation and rescoring pipeline originally only available online via ProteomicsDB. Oktoberfest is largely search engine agnostic and provides access to online peptide property predictions, promoting the adoption of state-of-the-art ML/DL models in proteomics analysis pipelines. We demonstrate its ability to reproduce and even improve our results from previously published rescoring analyses on two distinct use cases. Oktoberfest is freely available on GitHub (https://github.com/wilhelm-lab/oktoberfest) and can easily be installed locally through the cross-platform PyPI Python package.
Collapse
Affiliation(s)
- Mario Picciani
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Wassim Gabriel
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Victor-George Giurcoiu
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Omar Shouman
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Firas Hamood
- Chair of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Ludwig Lautenbacher
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Cecilia Bang Jensen
- Chair of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Julian Müller
- Chair of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Mostafa Kalhor
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Armin Soleymaniniya
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Bernhard Kuster
- Chair of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Matthew The
- Chair of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Mathias Wilhelm
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| |
Collapse
|
3
|
González-Solares EA, Dariush A, González-Fernández C, Küpcü Yoldaş A, Molaeinezhad A, Al Sa’d M, Smith L, Whitmarsh T, Millar N, Chornay N, Falciatori I, Fatemi A, Goodwin D, Kuett L, Mulvey CM, Páez Ribes M, Qosaj F, Roth A, Vázquez-García I, Watson SS, Windhager J, Aparicio S, Bodenmiller B, Boyden E, Caldas C, Harris O, Shah SP, Tavaré S, Bressan D, Hannon GJ, Walton NA. Imaging and Molecular Annotation of Xenographs and Tumours (IMAXT): High throughput data and analysis infrastructure. Biol Imaging 2023; 3:e11. [PMID: 38487685 PMCID: PMC10936408 DOI: 10.1017/s2633903x23000090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Revised: 12/21/2022] [Accepted: 03/08/2023] [Indexed: 03/17/2024]
Abstract
With the aim of producing a 3D representation of tumors, imaging and molecular annotation of xenografts and tumors (IMAXT) uses a large variety of modalities in order to acquire tumor samples and produce a map of every cell in the tumor and its host environment. With the large volume and variety of data produced in the project, we developed automatic data workflows and analysis pipelines. We introduce a research methodology where scientists connect to a cloud environment to perform analysis close to where data are located, instead of bringing data to their local computers. Here, we present the data and analysis infrastructure, discuss the unique computational challenges and describe the analysis chains developed and deployed to generate molecularly annotated tumor models. Registration is achieved by use of a novel technique involving spherical fiducial marks that are visible in all imaging modalities used within IMAXT. The automatic pipelines are highly optimized and allow to obtain processed datasets several times quicker than current solutions narrowing the gap between data acquisition and scientific exploitation.
Collapse
Affiliation(s)
| | - Ali Dariush
- Institute of Astronomy, University of Cambridge, Cambridge, United Kingdom
- CRUK Cambridge Institute, Li Ka Shing Centre, University of Cambridge, Cambridge, United Kingdom
| | | | | | | | - Mohammad Al Sa’d
- Institute of Astronomy, University of Cambridge, Cambridge, United Kingdom
| | - Leigh Smith
- Institute of Astronomy, University of Cambridge, Cambridge, United Kingdom
| | - Tristan Whitmarsh
- Institute of Astronomy, University of Cambridge, Cambridge, United Kingdom
| | - Neil Millar
- Institute of Astronomy, University of Cambridge, Cambridge, United Kingdom
| | - Nicholas Chornay
- Institute of Astronomy, University of Cambridge, Cambridge, United Kingdom
| | - Ilaria Falciatori
- CRUK Cambridge Institute, Li Ka Shing Centre, University of Cambridge, Cambridge, United Kingdom
| | - Atefeh Fatemi
- CRUK Cambridge Institute, Li Ka Shing Centre, University of Cambridge, Cambridge, United Kingdom
| | - Daniel Goodwin
- McGovern Institute, Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
- McGovern Institute, Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Laura Kuett
- Department of Quantitative Biomedicine, University of Zurich, Zurich, Switzerland
- Institute of Molecular Life Sciences, University of Zurich, Zurich, Switzerland
| | - Claire M. Mulvey
- CRUK Cambridge Institute, Li Ka Shing Centre, University of Cambridge, Cambridge, United Kingdom
| | - Marta Páez Ribes
- CRUK Cambridge Institute, Li Ka Shing Centre, University of Cambridge, Cambridge, United Kingdom
| | - Fatime Qosaj
- CRUK Cambridge Institute, Li Ka Shing Centre, University of Cambridge, Cambridge, United Kingdom
| | - Andrew Roth
- Department of Computer Science, University of British Columbia, Vancouver, BC, Canada
| | - Ignacio Vázquez-García
- Herbert and Florence Irving Institute for Cancer Dynamics, Columbia University, New York, NY, USA
- Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Spencer S. Watson
- Department of Oncology and Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
| | - Jonas Windhager
- Department of Quantitative Biomedicine, University of Zurich, Zurich, Switzerland
- Institute of Molecular Life Sciences, University of Zurich, Zurich, Switzerland
| | - Samuel Aparicio
- Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, BC, Canada
| | - Bernd Bodenmiller
- Department of Quantitative Biomedicine, University of Zurich, Zurich, Switzerland
- Institute of Molecular Life Sciences, University of Zurich, Zurich, Switzerland
| | - Ed Boyden
- McGovern Institute, Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
- McGovern Institute, Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
- Howard Hughes Medical Institute, Department of Physics, Harvard University, Cambridge, MA, USA
- Howard Hughes Medical Institute, Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA, USA
| | - Carlos Caldas
- CRUK Cambridge Institute, Li Ka Shing Centre, University of Cambridge, Cambridge, United Kingdom
- Cambridge Breast Unit, Addenbrooke’s Hospital, Cambridge University Hospital NHS Foundation Trust and NIHR Cambridge Biomedical Research Centre, Cambridge, United Kingdom
| | | | - Sohrab P. Shah
- Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Simon Tavaré
- CRUK Cambridge Institute, Li Ka Shing Centre, University of Cambridge, Cambridge, United Kingdom
- Herbert and Florence Irving Institute for Cancer Dynamics, Columbia University, New York, NY, USA
- New York Genome Center, New York, NY, USA
| | | | - Dario Bressan
- CRUK Cambridge Institute, Li Ka Shing Centre, University of Cambridge, Cambridge, United Kingdom
| | - Gregory J. Hannon
- CRUK Cambridge Institute, Li Ka Shing Centre, University of Cambridge, Cambridge, United Kingdom
| | - Nicholas A. Walton
- Institute of Astronomy, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
6
|
Routier A, Burgos N, Díaz M, Bacci M, Bottani S, El-Rifai O, Fontanella S, Gori P, Guillon J, Guyot A, Hassanaly R, Jacquemont T, Lu P, Marcoux A, Moreau T, Samper-González J, Teichmann M, Thibeau-Sutre E, Vaillant G, Wen J, Wild A, Habert MO, Durrleman S, Colliot O. Clinica: An Open-Source Software Platform for Reproducible Clinical Neuroscience Studies. Front Neuroinform 2021; 15:689675. [PMID: 34483871 PMCID: PMC8415107 DOI: 10.3389/fninf.2021.689675] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Accepted: 07/19/2021] [Indexed: 12/03/2022] Open
Abstract
We present Clinica (www.clinica.run), an open-source software platform designed to make clinical neuroscience studies easier and more reproducible. Clinica aims for researchers to (i) spend less time on data management and processing, (ii) perform reproducible evaluations of their methods, and (iii) easily share data and results within their institution and with external collaborators. The core of Clinica is a set of automatic pipelines for processing and analysis of multimodal neuroimaging data (currently, T1-weighted MRI, diffusion MRI, and PET data), as well as tools for statistics, machine learning, and deep learning. It relies on the brain imaging data structure (BIDS) for the organization of raw neuroimaging datasets and on established tools written by the community to build its pipelines. It also provides converters of public neuroimaging datasets to BIDS (currently ADNI, AIBL, OASIS, and NIFD). Processed data include image-valued scalar fields (e.g., tissue probability maps), meshes, surface-based scalar fields (e.g., cortical thickness maps), or scalar outputs (e.g., regional averages). These data follow the ClinicA Processed Structure (CAPS) format which shares the same philosophy as BIDS. Consistent organization of raw and processed neuroimaging files facilitates the execution of single pipelines and of sequences of pipelines, as well as the integration of processed data into statistics or machine learning frameworks. The target audience of Clinica is neuroscientists or clinicians conducting clinical neuroscience studies involving multimodal imaging, and researchers developing advanced machine learning algorithms applied to neuroimaging data.
Collapse
Affiliation(s)
- Alexandre Routier
- Inria, Aramis Project-Team, Paris, France
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
| | - Ninon Burgos
- Inria, Aramis Project-Team, Paris, France
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
| | - Mauricio Díaz
- Inria, Service d'Expérimentation et de Développement, Paris, France
| | - Michael Bacci
- Inria, Aramis Project-Team, Paris, France
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
| | - Simona Bottani
- Inria, Aramis Project-Team, Paris, France
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
| | - Omar El-Rifai
- Inria, Aramis Project-Team, Paris, France
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
| | - Sabrina Fontanella
- Inria, Aramis Project-Team, Paris, France
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
| | - Pietro Gori
- Inria, Aramis Project-Team, Paris, France
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
| | - Jérémy Guillon
- Inria, Aramis Project-Team, Paris, France
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
| | - Alexis Guyot
- Inria, Aramis Project-Team, Paris, France
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
| | - Ravi Hassanaly
- Inria, Aramis Project-Team, Paris, France
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
| | - Thomas Jacquemont
- Inria, Aramis Project-Team, Paris, France
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
| | - Pascal Lu
- Inria, Aramis Project-Team, Paris, France
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
| | - Arnaud Marcoux
- Inria, Aramis Project-Team, Paris, France
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
| | - Tristan Moreau
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
| | - Jorge Samper-González
- Inria, Aramis Project-Team, Paris, France
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
| | - Marc Teichmann
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
- Department of Neurology, Institute for Memory and Alzheimer's Disease, Pitié-Salpêtrière Hospital, AP-HP, Paris, France
| | - Elina Thibeau-Sutre
- Inria, Aramis Project-Team, Paris, France
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
| | - Ghislain Vaillant
- Inria, Aramis Project-Team, Paris, France
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
| | - Junhao Wen
- Inria, Aramis Project-Team, Paris, France
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
| | - Adam Wild
- Inria, Aramis Project-Team, Paris, France
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
| | - Marie-Odile Habert
- Sorbonne Université, CNRS, INSERM, Laboratoire d'Imagerie Biomédicale (LIB), Paris, France
- AP-HP, Hôpital Pitié-Salpêtrière, Médecine Nucléaire, Paris, France
- Centre d'Acquisition et Traitement des Images, Paris, France
| | - Stanley Durrleman
- Inria, Aramis Project-Team, Paris, France
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
| | - Olivier Colliot
- Inria, Aramis Project-Team, Paris, France
- Sorbonne Université, Paris, France
- Institut du Cerveau – Paris Brain Institute – ICM, Paris, France
- Inserm, Paris, France
- CNRS, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
| |
Collapse
|