Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Wilson SL, Way GP, Bittremieux W, Armache JP, Haendel MA, Hoffman MM. Sharing biological data: why, when, and how. FEBS Lett 2021;595:847-863. [PMID: 33843054 PMCID: PMC10390076 DOI: 10.1002/1873-3468.14067] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Indexed: 12/13/2022]

Cited by Other Article(s)

Ghanegolmohammadi F, Eslami M, Ohya Y. Systematic data analysis pipeline for quantitative morphological cell phenotyping. Comput Struct Biotechnol J 2024;23:2949-2962. [PMID: 39104709 PMCID: PMC11298594 DOI: 10.1016/j.csbj.2024.07.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/06/2024] [Revised: 07/09/2024] [Accepted: 07/10/2024] [Indexed: 08/07/2024] Open

Wu NC, Alton L, Bovo RP, Carey N, Currie SE, Lighton JRB, McKechnie AE, Pottier P, Rossi G, White CR, Levesque DL. Reporting guidelines for terrestrial respirometry: Building openness, transparency of metabolic rate and evaporative water loss data. Comp Biochem Physiol A Mol Integr Physiol 2024;296:111688. [PMID: 38944270 DOI: 10.1016/j.cbpa.2024.111688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/19/2024] [Revised: 06/24/2024] [Accepted: 06/25/2024] [Indexed: 07/01/2024]

Affiliation(s)

Nicholas C Wu Hawkesbury Institute for the Environment, Western Sydney University, New South Wales 2753, Australia.
Lesley Alton Centre for Geometric Biology, School of Biological Sciences, Monash University, Melbourne, VIC 3800, Australia. https://twitter.com/lesley_alton
Rafael P Bovo Department of Evolution, Ecology, and Organismal Biology, University of California Riverside, Riverside, CA, United States. https://twitter.com/bovo_rp
Nicholas Carey Marine Directorate for the Scottish Government, Aberdeen, United Kingdom
Shannon E Currie Institute for Cell and Systems Biology, University of Hamburg, Martin-Luther-King Plz 3, 20146 Hamburg, Germany; School of Biosciences, University of Melbourne, Victoria, Australia. https://twitter.com/batsinthbelfry
John R B Lighton Sable Systems International, North Las Vegas, NV, United States. https://twitter.com/SableSys
Andrew E McKechnie South African Research Chair in Conservation Physiology, South African National Biodiversity Institute, South Africa; DSI-NRF Centre of Excellence at the FitzPatrick Institute, Department of Zoology and Entomology, University of Pretoria, South Africa
Patrice Pottier Evolution & Ecology Research Centre, School of Biological, Earth and Environmental Sciences, The University of New South Wales, Sydney, New South Wales, Australia; Division of Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, Australian Capital Territory, Australia. https://twitter.com/PatriceEcoEvo
Giulia Rossi Department of Biology, McMaster University, Hamilton, Ontario, Canada. https://twitter.com/giuliasrossi
Craig R White Centre for Geometric Biology, School of Biological Sciences, Monash University, Melbourne, VIC 3800, Australia
Danielle L Levesque School of Biology and Ecology, University of Maine, Orono, ME, United States. https://twitter.com/dl_levesque

Aksenova A, Johny A, Adams T, Gribbon P, Jacobs M, Hofmann-Apitius M. Current state of data stewardship tools in life science. Front Big Data 2024;7:1428568. [PMID: 39351001 PMCID: PMC11439729 DOI: 10.3389/fdata.2024.1428568] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/06/2024] [Accepted: 08/23/2024] [Indexed: 10/04/2024] Open

Scorza LC, Zieliński T, Kalita I, Lepore A, El Karoui M, Millar AJ. Daily life in the Open Biologist's second job, as a Data Curator. Wellcome Open Res 2024;9:523. [PMID: 39360219 PMCID: PMC11445645 DOI: 10.12688/wellcomeopenres.22899.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 08/29/2024] [Indexed: 10/04/2024] Open

Abdill RJ, Talarico E, Grieneisen L. A how-to guide for code sharing in biology. PLoS Biol 2024;22:e3002815. [PMID: 39255324 DOI: 10.1371/journal.pbio.3002815] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Revised: 09/20/2024] [Indexed: 09/12/2024] Open

Tiemann JKS, Szczuka M, Bouarroudj L, Oussaren M, Garcia S, Howard RJ, Delemotte L, Lindahl E, Baaden M, Lindorff-Larsen K, Chavent M, Poulain P. MDverse, shedding light on the dark matter of molecular dynamics simulations. eLife 2024;12:RP90061. [PMID: 39212001 PMCID: PMC11364437 DOI: 10.7554/elife.90061] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Indexed: 09/04/2024] Open

Fouad K, Vavrek R, Surles-Zeigler MC, Huie JR, Radabaugh HL, Gurkoff GG, Visser U, Grethe JS, Martone ME, Ferguson AR, Gensel JC, Torres-Espin A. A practical guide to data management and sharing for biomedical laboratory researchers. Exp Neurol 2024;378:114815. [PMID: 38762093 DOI: 10.1016/j.expneurol.2024.114815] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/04/2023] [Revised: 05/13/2024] [Accepted: 05/14/2024] [Indexed: 05/20/2024]

Affiliation(s)

K Fouad Department of Physical Therapy, Faculty of Rehabilitation Medicine, University of Alberta, Edmonton, AB, Canada.
R Vavrek Department of Physical Therapy, Faculty of Rehabilitation Medicine, University of Alberta, Edmonton, AB, Canada
M C Surles-Zeigler Department of Neuroscience, University of California, San Diego, La Jolla, CA, United States
J R Huie Department of Neurosurgery, Brain and Spinal Injury Center, Weill Institutes for Neurosciences, University of California, San Francisco, San Francisco, CA, United States; San Francisco Veterans Affairs Healthcare System, San Francisco, CA, United States
H L Radabaugh Department of Neurosurgery, Brain and Spinal Injury Center, Weill Institutes for Neurosciences, University of California, San Francisco, San Francisco, CA, United States
G G Gurkoff Center for Neuroscience, University of California Davis, Davis, CA, United States; Department of Neurological Surgery, University of California Davis, Davis, CA, United States; Northern California Veterans Affairs Healthcare System, Martinez, CA, United States
U Visser Department of Computer Science, University of Miami, Coral Gables, FL, United States
J S Grethe Department of Neuroscience, University of California, San Diego, La Jolla, CA, United States
M E Martone Department of Neuroscience, University of California, San Diego, La Jolla, CA, United States; San Francisco Veterans Affairs Healthcare System, San Francisco, CA, United States
A R Ferguson Department of Neurosurgery, Brain and Spinal Injury Center, Weill Institutes for Neurosciences, University of California, San Francisco, San Francisco, CA, United States; San Francisco Veterans Affairs Healthcare System, San Francisco, CA, United States
J C Gensel Spinal Cord and Brain Injury Research Center and Department of Physiology, University of Kentucky College of Medicine, Lexington, KY, United States.
A Torres-Espin Department of Physical Therapy, Faculty of Rehabilitation Medicine, University of Alberta, Edmonton, AB, Canada; Department of Neurosurgery, Brain and Spinal Injury Center, Weill Institutes for Neurosciences, University of California, San Francisco, San Francisco, CA, United States; School of Public Health Sciences, University of Waterloo, Waterloo, ON, Canada.

Biriukov D, Vácha R. Pathways to a Shiny Future: Building the Foundation for Computational Physical Chemistry and Biophysics in 2050. ACS Phys Chem Au 2024;4:302-313. [PMID: 39069976 PMCID: PMC11274290 DOI: 10.1021/acsphyschemau.4c00003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/07/2024] [Revised: 03/15/2024] [Accepted: 03/18/2024] [Indexed: 07/30/2024]

Martorelli I, Pooryousefi A, van Thiel H, Sicking FJ, Ramackers GJ, Merckx V, Verbeek FJ. Multiple graphical views for automatically generating SQL for the MycoDiversity DB; making fungal biodiversity studies accessible. Biodivers Data J 2024;12:e119660. [PMID: 38933486 PMCID: PMC11199959 DOI: 10.3897/bdj.12.e119660] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/27/2024] [Accepted: 06/06/2024] [Indexed: 06/28/2024] Open

Tiemann JKS, Szczuka M, Bouarroudj L, Oussaren M, Garcia S, Howard RJ, Delemotte L, Lindahl E, Baaden M, Lindorff-Larsen K, Chavent M, Poulain P. MDverse: Shedding Light on the Dark Matter of Molecular Dynamics Simulations. bioRxiv 2024:2023.05.02.538537. [PMID: 37205542 PMCID: PMC10187166 DOI: 10.1101/2023.05.02.538537] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/21/2023]

Bibik P, Alibai S, Pandini A, Dantu SC. PyCoM: a python library for large-scale analysis of residue-residue coevolution data. Bioinformatics 2024;40:btae166. [PMID: 38532297 PMCID: PMC11009027 DOI: 10.1093/bioinformatics/btae166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/19/2023] [Revised: 02/02/2024] [Accepted: 03/25/2024] [Indexed: 03/28/2024] Open

Emissah H, Ljungquist B, Ascoli GA. Bibliometric analysis of neuroscience publications quantifies the impact of data sharing. Bioinformatics 2023;39:btad746. [PMID: 38070153 PMCID: PMC10733721 DOI: 10.1093/bioinformatics/btad746] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/13/2023] [Revised: 11/01/2023] [Accepted: 12/07/2023] [Indexed: 12/19/2023] Open

Way GP, Sailem H, Shave S, Kasprowicz R, Carragher NO. Evolution and impact of high content imaging. SLAS Discov 2023;28:292-305. [PMID: 37666456 DOI: 10.1016/j.slasd.2023.08.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/03/2023] [Revised: 08/09/2023] [Accepted: 08/29/2023] [Indexed: 09/06/2023]

Emissah H, Ljungquist B, Ascoli GA. Bibliometric analysis of neuroscience publications quantifies the impact of data sharing. bioRxiv 2023:2023.09.12.557386. [PMID: 37745378 PMCID: PMC10515804 DOI: 10.1101/2023.09.12.557386] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 09/26/2023]

Kemmer I, Keppler A, Serrano-Solano B, Rybina A, Özdemir B, Bischof J, El Ghadraoui A, Eriksson JE, Mathur A. Building a FAIR image data ecosystem for microscopy communities. Histochem Cell Biol 2023;160:199-209. [PMID: 37341795 PMCID: PMC10492678 DOI: 10.1007/s00418-023-02203-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Accepted: 04/27/2023] [Indexed: 06/22/2023]

O'Connor LM, O'Connor BA, Lim SB, Zeng J, Lo CH. Integrative multi-omics and systems bioinformatics in translational neuroscience: A data mining perspective. J Pharm Anal 2023;13:836-850. [PMID: 37719197 PMCID: PMC10499660 DOI: 10.1016/j.jpha.2023.06.011] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Received: 11/29/2022] [Revised: 06/20/2023] [Accepted: 06/25/2023] [Indexed: 09/19/2023] Open

Abstract

Bioinformatic analysis of large and complex omics datasets has become increasingly useful in modern day biology by providing a great depth of information, with its application to neuroscience termed neuroinformatics. Data mining of omics datasets has enabled the generation of new hypotheses based on differentially regulated biological molecules associated with disease mechanisms, which can be tested experimentally for improved diagnostic and therapeutic targeting of neurodegenerative diseases. Importantly, integrating multi-omics data using a systems bioinformatics approach will advance the understanding of the layered and interactive network of biological regulation that exchanges systemic knowledge to facilitate the development of a comprehensive human brain profile. In this review, we first summarize data mining studies utilizing datasets from the individual type of omics analysis, including epigenetics/epigenomics, transcriptomics, proteomics, metabolomics, lipidomics, and spatial omics, pertaining to Alzheimer's disease, Parkinson's disease, and multiple sclerosis. We then discuss multi-omics integration approaches, including independent biological integration and unsupervised integration methods, for more intuitive and informative interpretation of the biological data obtained across different omics layers. We further assess studies that integrate multi-omics in data mining which provide convoluted biological insights and offer proof-of-concept proposition towards systems bioinformatics in the reconstruction of brain networks. Finally, we recommend a combination of high dimensional bioinformatics analysis with experimental validation to achieve translational neuroscience applications including biomarker discovery, therapeutic development, and elucidation of disease mechanisms. 
We conclude by providing future perspectives and opportunities in applying integrative multi-omics and systems bioinformatics to achieve precision phenotyping of neurodegenerative diseases and towards personalized medicine.

Danis D, Jacobsen JOB, Wagner AH, Groza T, Beckwith MA, Rekerle L, Carmody LC, Reese J, Hegde H, Ladewig MS, Seitz B, Munoz-Torres M, Harris NL, Rambla J, Baudis M, Mungall CJ, Haendel MA, Robinson PN. Phenopacket-tools: Building and validating GA4GH Phenopackets. PLoS One 2023;18:e0285433. [PMID: 37196000 PMCID: PMC10191354 DOI: 10.1371/journal.pone.0285433] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/05/2023] [Accepted: 04/21/2023] [Indexed: 05/19/2023] Open

Abstract

The Global Alliance for Genomics and Health (GA4GH) is a standards-setting organization that is developing a suite of coordinated standards for genomics. The GA4GH Phenopacket Schema is a standard for sharing disease and phenotype information that characterizes an individual person or biosample. The Phenopacket Schema is flexible and can represent clinical data for any kind of human disease including rare disease, complex disease, and cancer. It also allows consortia or databases to apply additional constraints to ensure uniform data collection for specific goals. We present phenopacket-tools, an open-source Java library and command-line application for construction, conversion, and validation of phenopackets. Phenopacket-tools simplifies construction of phenopackets by providing concise builders, programmatic shortcuts, and predefined building blocks (ontology classes) for concepts such as anatomical organs, age of onset, biospecimen type, and clinical modifiers. Phenopacket-tools can be used to validate the syntax and semantics of phenopackets as well as to assess adherence to additional user-defined requirements. The documentation includes examples showing how to use the Java library and the command-line tool to create and validate phenopackets. We demonstrate how to create, convert, and validate phenopackets using the library or the command-line application. Source code, API documentation, comprehensive user guide and a tutorial can be found at https://github.com/phenopackets/phenopacket-tools. The library can be installed from the public Maven Central artifact repository and the application is available as a standalone archive. The phenopacket-tools library helps developers implement and standardize the collection and exchange of phenotypic and other clinical data for use in phenotype-driven genomic diagnostics, translational research, and precision medicine applications.

Affiliation(s)

Daniel Danis The Jackson Laboratory for Genomic Medicine, Farmington, CT, United States of America
Julius O. B. Jacobsen William Harvey Research Institute, Queen Mary University of London, London, United Kingdom
Alex H. Wagner Departments of Pediatrics and Biomedical Informatics, The Ohio State University College of Medicine, Columbus, OH, United States of America; The Steve and Cindy Rasmussen Institute for Genomic Medicine, Nationwide Children’s Hospital, Columbus, OH, United States of America
Tudor Groza EMBL-EBI, Cambridge, United Kingdom
Martha A. Beckwith The Jackson Laboratory for Genomic Medicine, Farmington, CT, United States of America
Lauren Rekerle The Jackson Laboratory for Genomic Medicine, Farmington, CT, United States of America
Leigh C. Carmody The Jackson Laboratory for Genomic Medicine, Farmington, CT, United States of America
Justin Reese Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, United States of America
Harshad Hegde Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, United States of America
Markus S. Ladewig Department of Ophthalmology, Klinikum Saarbrücken, Saarbrücken, Germany
Berthold Seitz Department of Ophthalmology, Saarland University Medical Center, Homburg/Saar, Germany
Monica Munoz-Torres Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, United States of America
Nomi L. Harris Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, United States of America
Jordi Rambla European Genome-Phenome Archive (EGA) in the Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
Michael Baudis University of Zurich and Swiss Institute of Bioinformatics, Zurich, Switzerland
Christopher J. Mungall Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, United States of America
Melissa A. Haendel Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, United States of America
Peter N. Robinson The Jackson Laboratory for Genomic Medicine, Farmington, CT, United States of America; Institute for Systems Genomics, University of Connecticut, Farmington, CT, United States of America

Tsueng G, Cano MAA, Bento J, Czech C, Kang M, Pache L, Rasmussen LV, Savidge TC, Starren J, Wu Q, Xin J, Yeaman MR, Zhou X, Su AI, Wu C, Brown L, Shabman RS, Hughes LD. Developing a standardized but extendable framework to increase the findability of infectious disease datasets. Sci Data 2023;10:99. [PMID: 36823157 PMCID: PMC9950378 DOI: 10.1038/s41597-023-01968-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 10/12/2022] [Accepted: 01/13/2023] [Indexed: 02/25/2023] Open

Affiliation(s)

Ginger Tsueng Department of Integrative, Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, 92037, USA.
Marco A Alvarado Cano Department of Integrative, Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, 92037, USA
José Bento Department of Computer Science, Boston College, 245 Beacon St, Chestnut Hill, MA, 02467, USA
Candice Czech Department of Integrative, Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, 92037, USA
Mengjia Kang Division of Pulmonary and Critical Care, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA
Lars Pache Infectious and Inflammatory Disease Center, Immunity and Pathogenesis Program, Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA, 92037, USA
Luke V Rasmussen Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, 60611, USA
Tor C Savidge Texas Children's Microbiome Center & Department of Pathology & Immunology, Baylor College of Medicine, Houston, TX, 77030, USA
Justin Starren Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, 60611, USA
Qinglong Wu Texas Children's Microbiome Center & Department of Pathology & Immunology, Baylor College of Medicine, Houston, TX, 77030, USA
Jiwen Xin Department of Integrative, Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, 92037, USA
Michael R Yeaman Department of Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA; Divisions of Molecular Medicine and Infectious Diseases, Harbor-UCLA Medical Center, Torrance, CA, 90502, USA; Lundquist Institute for Infection & Immunity at Harbor-UCLA Medical Center, Torrance, CA, 90502, USA
Xinghua Zhou Department of Integrative, Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, 92037, USA
Andrew I Su Department of Integrative, Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, 92037, USA; Scripps Research Translational Institute, La Jolla, CA, 92037, USA; Department of Molecular Medicine, The Scripps Research Institute, La Jolla, CA, 92037, USA
Chunlei Wu Department of Integrative, Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, 92037, USA; Scripps Research Translational Institute, La Jolla, CA, 92037, USA; Department of Molecular Medicine, The Scripps Research Institute, La Jolla, CA, 92037, USA
Liliana Brown Office of Genomics and Advanced Technologies, National Institute of Allergy and Infectious Diseases, Rockville, MD, 20852, USA
Reed S Shabman Office of Genomics and Advanced Technologies, National Institute of Allergy and Infectious Diseases, Rockville, MD, 20852, USA
Laura D Hughes Department of Integrative, Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, 92037, USA.

Gomes DGE, Pottier P, Crystal-Ornelas R, Hudgins EJ, Foroughirad V, Sánchez-Reyes LL, Turba R, Martinez PA, Moreau D, Bertram MG, Smout CA, Gaynor KM. Why don't we share data and code? Perceived barriers and benefits to public archiving practices. Proc Biol Sci 2022;289:20221113. [PMID: 36416041 PMCID: PMC9682438 DOI: 10.1098/rspb.2022.1113] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Received: 06/08/2022] [Accepted: 11/02/2022] [Indexed: 08/10/2023] Open

Hoyt CT, Balk M, Callahan TJ, Domingo-Fernández D, Haendel MA, Hegde HB, Himmelstein DS, Karis K, Kunze J, Lubiana T, Matentzoglu N, McMurry J, Moxon S, Mungall CJ, Rutz A, Unni DR, Willighagen E, Winston D, Gyori BM. Unifying the identification of biomedical entities with the Bioregistry. Sci Data 2022;9:714. [PMID: 36402838 PMCID: PMC9675740 DOI: 10.1038/s41597-022-01807-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/16/2022] [Accepted: 10/26/2022] [Indexed: 11/21/2022] Open

Bittremieux W, Wang M, Dorrestein PC. The critical role that spectral libraries play in capturing the metabolomics community knowledge. Metabolomics 2022;18:94. [PMID: 36409434 PMCID: PMC10284100 DOI: 10.1007/s11306-022-01947-y] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Received: 08/01/2022] [Accepted: 10/19/2022] [Indexed: 11/22/2022]

Wang LQ, Fernandez-Boyano I, Robinson WP. Genetic variation in placental insufficiency: What have we learned over time? Front Cell Dev Biol 2022;10:1038358. [PMID: 36313546 PMCID: PMC9613937 DOI: 10.3389/fcell.2022.1038358] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 09/06/2022] [Accepted: 10/03/2022] [Indexed: 11/28/2022] Open

Garcia BJ, Urrutia J, Zheng G, Becker D, Corbet C, Maschhoff P, Cristofaro A, Gaffney N, Vaughn M, Saxena U, Chen YP, Gordon DB, Eslami M. A toolkit for enhanced reproducibility of RNASeq analysis for synthetic biologists. Synth Biol (Oxf) 2022;7:ysac012. [PMID: 36035514 PMCID: PMC9408027 DOI: 10.1093/synbio/ysac012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/21/2021] [Revised: 06/17/2022] [Accepted: 08/22/2022] [Indexed: 11/13/2022]

Abstract

Sequencing technologies, in particular RNASeq, have become critical tools in the design, build, test and learn cycle of synthetic biology. They provide a better understanding of synthetic designs, and they help identify ways to improve and select designs. While these data are beneficial to design, their collection and analysis is a complex, multistep process that has implications on both discovery and reproducibility of experiments. Additionally, tool parameters, experimental metadata, normalization of data and standardization of file formats present challenges that are computationally intensive. This calls for high-throughput pipelines expressly designed to handle the combinatorial and longitudinal nature of synthetic biology. In this paper, we present a pipeline to maximize the analytical reproducibility of RNASeq for synthetic biologists. We also explore the impact of reproducibility on the validation of machine learning models. We present the design of a pipeline that combines traditional RNASeq data processing tools with structured metadata tracking to allow for the exploration of the combinatorial design in a high-throughput and reproducible manner. We then demonstrate utility via two different experiments: a control comparison experiment and a machine learning model experiment. The first experiment compares datasets collected from identical biological controls across multiple days for two different organisms. It shows that a reproducible experimental protocol for one organism does not guarantee reproducibility in another. The second experiment quantifies the differences in experimental runs from multiple perspectives. It shows that the lack of reproducibility from these different perspectives can place an upper bound on the validation of machine learning models trained on RNASeq data.

Forero DA, Curioso WH, Patrinos GP. The importance of adherence to international standards for depositing open data in public repositories. BMC Res Notes 2021;14:405. [PMID: 34727971 PMCID: PMC8561348 DOI: 10.1186/s13104-021-05817-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 08/26/2021] [Accepted: 10/22/2021] [Indexed: 12/14/2022] Open

Heil BJ, Hoffman MM, Markowetz F, Lee SI, Greene CS, Hicks SC. Reproducibility standards for machine learning in the life sciences. Nat Methods 2021;18:1132-1135. [PMID: 34462593 PMCID: PMC9131851 DOI: 10.1038/s41592-021-01256-7] [Citation(s) in RCA: 60] [Impact Index Per Article: 20.0] [Indexed: 02/08/2023]

Way GP, Greene CS, Carninci P, Carvalho BS, de Hoon M, Finley SD, Gosline SJC, Lȇ Cao KA, Lee JSH, Marchionni L, Robine N, Sindi SS, Theis FJ, Yang JYH, Carpenter AE, Fertig EJ. A field guide to cultivating computational biology. PLoS Biol 2021;19:e3001419. [PMID: 34618807 PMCID: PMC8525744 DOI: 10.1371/journal.pbio.3001419] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Revised: 10/19/2021] [Indexed: 11/18/2022] Open

Affiliation(s)

Gregory P. Way Imaging Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America; Center for Health AI, University of Colorado School of Medicine, Aurora, Colorado, United States of America
Casey S. Greene Center for Health AI, University of Colorado School of Medicine, Aurora, Colorado, United States of America
Piero Carninci RIKEN Center for Integrative Medical Sciences Yokohama, Kanagawa, Japan; Human Technopole, Milan, Italy
Benilton S. Carvalho Department of Statistics, Institute of Mathematics, Statistics and Scientific Computing, University of Campinas, Campinas, Brazil
Michiel de Hoon RIKEN Center for Integrative Medical Sciences Yokohama, Kanagawa, Japan
Stacey D. Finley Department of Biomedical Engineering, Quantitative and Computational Biology, and Chemical Engineering & Materials Science, University of Southern California, Los Angeles, California, United States of America
Sara J. C. Gosline Pacific Northwest National Laboratory, Seattle, Washington, United States of America
Kim-Anh Lȇ Cao Melbourne Integrative Genomics, School of Mathematics and Statistics, The University of Melbourne, Melbourne, Australia
Jerry S. H. Lee Ellison Institute and Departments of Medicine/Oncology, Chemical Engineering, and Material Sciences, University of Southern California, Los Angeles, California, United States of America
Luigi Marchionni Department of Pathology and Laboratory Medicine, Weill-Cornell Medicine, New York, New York, United States of America
Nicolas Robine Computational Biology Lab, New York Genome Center, New York, New York, United States of America
Suzanne S. Sindi Department of Applied Mathematics, University of California Merced, Merced, California, United States of America
Fabian J. Theis Institute of Computational Biology, Helmholtz Center Munich and Department of Mathematics, Technical University of Munich, Munich, Germany
Jean Y. H. Yang Charles Perkins Centre and School of Mathematics and Statistics, The University of Sydney, Australia
Anne E. Carpenter Imaging Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
Elana J. Fertig Convergence Institute, Departments of Oncology, Biomedical Engineering, and Applied Mathematics and Statistics, Johns Hopkins University, Baltimore, Maryland, United States of America

Fondrie WE, Bittremieux W, Noble WS. ppx: Programmatic Access to Proteomics Data Repositories. J Proteome Res 2021;20:4621-4624. [PMID: 34342226 PMCID: PMC8457024 DOI: 10.1021/acs.jproteome.1c00454] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Indexed: 11/30/2022]