1
|
Savonen C, Wright C, Hoffman A, Humphries E, Cox K, Tan F, Leek J. Motivation, inclusivity, and realism should drive data science education. F1000Res 2024; 12:1240. [PMID: 38764793 PMCID: PMC11101914 DOI: 10.12688/f1000research.134655.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 03/06/2024] [Indexed: 05/21/2024] Open
Abstract
Data science education provides tremendous opportunities but remains inaccessible to many communities. Increasing the accessibility of data science to these communities not only benefits the individuals entering data science, but also increases the field's innovation and potential impact as a whole. Education is the most scalable solution to meet these needs, but many data science educators lack formal training in education. Our group has led education efforts for a variety of audiences: from professional scientists to high school students to lay audiences. These experiences have helped form our teaching philosophy which we have summarized into three main ideals: 1) motivation, 2) inclusivity, and 3) realism. 20 we also aim to iteratively update our teaching approaches and curriculum as we find ways to better reach these ideals. In this manuscript we discuss these ideals as well practical ideas for how to implement these philosophies in the classroom.
Collapse
Affiliation(s)
| | - Carrie Wright
- Fred Hutchinson Cancer Center, Seattle, WA, 98109, USA
| | - Ava Hoffman
- Fred Hutchinson Cancer Center, Seattle, WA, 98109, USA
| | | | - Katherine Cox
- Bloomberg School of Public Health, Johns Hopkins University, Baltimore, Maryland, 21218, USA
| | | | - Jeffrey Leek
- Fred Hutchinson Cancer Center, Seattle, WA, 98109, USA
| |
Collapse
|
2
|
Humphries EM, Wright C, Hoffman AM, Savonen C, Leek JT. What's the best chatbot for me? Researchers put LLMs through their paces. Nature 2023:10.1038/d41586-023-03023-4. [PMID: 37759110 DOI: 10.1038/d41586-023-03023-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/29/2023]
|
3
|
Dang MT, Gonzalez MV, Gaonkar KS, Rathi KS, Young P, Arif S, Zhai L, Alam Z, Devalaraja S, To TKJ, Folkert IW, Raman P, Rokita JL, Martinez D, Taroni JN, Shapiro JA, Greene CS, Savonen C, Mafra F, Hakonarson H, Curran T, Haldar M. Macrophages in SHH subgroup medulloblastoma display dynamic heterogeneity that varies with treatment modality. Cell Rep 2023; 42:112600. [PMID: 37235472 PMCID: PMC10592430 DOI: 10.1016/j.celrep.2023.112600] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/28/2023] Open
|
4
|
Savonen C, Wright C, Hoffman AM, Muschelli J, Cox K, Tan FJ, Leek JT. Open-source Tools for Training Resources - OTTR. J Stat Data Sci Educ 2023; 31:57-65. [PMID: 37207236 PMCID: PMC10193921 DOI: 10.1080/26939169.2022.2118646] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Data science and informatics tools are developing at a blistering rate, but their users often lack the educational background or resources to efficiently apply the methods to their research. Training resources and vignettes that accompany these tools often deprecate because their maintenance is not prioritized by funding, giving teams little time to devote to such endeavors. Our group has developed Open-source Tools for Training Resources (OTTR) to offer greater efficiency and flexibility for creating and maintaining these training resources. OTTR empowers creators to customize their work and allows for a simple workflow to publish using multiple platforms. OTTR allows content creators to publish training material to multiple massive online learner communities using familiar rendering mechanics. OTTR allows the incorporation of pedagogical practices like formative and summative assessments in the form of multiple choice questions and fill in the blank problems that are automatically graded. No local installation of any software is required to begin creating content with OTTR. Thus far, 15 training courses have been created with OTTR repository template. By using the OTTR system, the maintenance workload for updating these courses across platforms has been drastically reduced. For more information about OTTR and how to get started, go to ottrproject.org.
Collapse
Affiliation(s)
- Candace Savonen
- Johns Hopkins Bloomberg School of Public Health, Baltimore, MD
- Fred Hutchinson Cancer Center, Seattle, WA
- Corresponding author:
| | - Carrie Wright
- Johns Hopkins Bloomberg School of Public Health, Baltimore, MD
- Fred Hutchinson Cancer Center, Seattle, WA
| | - Ava M. Hoffman
- Johns Hopkins Bloomberg School of Public Health, Baltimore, MD
- Fred Hutchinson Cancer Center, Seattle, WA
| | - John Muschelli
- Johns Hopkins Bloomberg School of Public Health, Baltimore, MD
| | - Katherine Cox
- Johns Hopkins Bloomberg School of Public Health, Baltimore, MD
| | | | - Jeffrey T. Leek
- Johns Hopkins Bloomberg School of Public Health, Baltimore, MD
- Fred Hutchinson Cancer Center, Seattle, WA
| |
Collapse
|
5
|
Shapiro J, Savonen C, Bethell C, Gaonkar K, Zhu Y, Brown M, Duong N, Rathi K, Noureen N, Zhang B, Ennis B, Spielman S, Farrow B, Van Kuren N, Koganti T, Kannan S, Raman P, Miller D, Jain P, Guo Y, Huang X, Kraya A, Heath A, Koptyra M, Wong J, Mason J, Robbins S, Santi M, Viaene A, Waanders A, Hanson D, Scolaro L, Xie H, Zheng S, Kline C, Lilly J, Storm P, Resnick A, Rokita JL, Greene C, Taroni J. OMIC-14. OPENPBTA: AN OPEN PEDIATRIC BRAIN TUMOR ATLAS. Neuro Oncol 2021. [PMCID: PMC8168082 DOI: 10.1093/neuonc/noab090.161] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Pediatric brain tumors comprise a heterogeneous molecular and histological landscape that challenges most current precision-medicine approaches. While recent large-scale efforts to molecularly characterize distinct histological entities have dramatically advanced the field’s capacity to classify and further define molecular subtypes, developing therapeutic and less toxic molecularly-defined clinical approaches remains a challenge. To define new approaches to meet these challenges and advance scalable, shared biospecimen- and data-resources for pediatric brain tumors, the Children’s Brain Tumor Network and Pacific Pediatric Neuro-Oncology Consortium, in partnership with the Alex’s Lemonade Stand Foundation Childhood Cancer Data Lab, launched OpenPBTA, a global open science Pediatric Brain Tumor Atlas initiative to comprehensively define the molecular landscape of pediatric brain tumors. The initiative contains multi-modal analyses of research- and clinical-trial based DNA and RNA sequences from nearly 1,000 subjects (with 1,256 tumors) along with their longitudinal clinical data. The OpenPBTA’s open science framework for analysis tests the capacity of crowd-sourced collaborative architectures to advance more rapid, iterative and integrated discovery of the underlying mechanisms of disease across pediatric brain and spinal cord tumors. Since the launch of the project, OpenPBTA has collaboratively created reproducible workflows for integrated consensus SNV, CNV, and fusion calling, enabled RNA-Seq-based classification of medulloblastoma subtypes, and more than 25 additional DNA- and RNA-based analyses. The open-science platform and associated datasets and processed results provide a continuously updated, global view of the integrated cross-disease molecular landscape of pediatric brain tumors. Such biospecimen- and clinically-linked scalable data resources provide unprecedented collaborative opportunities for precision-based, personalized therapeutic discovery and drug development with the upcoming further integration of proteomic sample data (N >300) and drug response datasets, additionally diversifying the multimodal discovery potential of crowd-sourced approaches for accelerated impact for children with brain tumors.
Collapse
Affiliation(s)
| | | | | | | | - Yuankun Zhu
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Miguel Brown
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Nhat Duong
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Komal Rathi
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | | | - Bo Zhang
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Brian Ennis
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | | | - Bailey Farrow
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | | | | | | | - Pichai Raman
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Daniel Miller
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Payal Jain
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Yiran Guo
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Xiaoyan Huang
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Adam Kraya
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Allison Heath
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | | | - Jessica Wong
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Jennifer Mason
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | | | - Mariarita Santi
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- University of Pennsylvania, Philadelphia, PA, USA
| | - Angela Viaene
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- University of Pennsylvania, Philadelphia, PA, USA
| | - Angela Waanders
- Ann & Robert H. Lurie Children’s Hospital of Chicago, Chicago, IL, USA
- Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Derek Hanson
- Hackensack Meridian School of Medicine, Nutley, NJ, USA
- Hackensack University Medical Center, Hackensack, NJ, USA
| | - Laura Scolaro
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Hongbo Xie
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | | | - Cassie Kline
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Jena Lilly
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Philip Storm
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Adam Resnick
- Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | | | - Casey Greene
- Alex’s Lemonade Stand Foundation, Philadelphia, PA, USA
- University of Colorado School of Medicine, Aurora, CO, USA
| | - Jaclyn Taroni
- Alex’s Lemonade Stand Foundation, Philadelphia, PA, USA
| |
Collapse
|
6
|
Dang MT, Gonzalez MV, Gaonkar KS, Rathi KS, Young P, Arif S, Zhai L, Alam Z, Devalaraja S, To TKJ, Folkert IW, Raman P, Rokita JL, Martinez D, Taroni JN, Shapiro JA, Greene CS, Savonen C, Mafra F, Hakonarson H, Curran T, Haldar M. Macrophages in SHH subgroup medulloblastoma display dynamic heterogeneity that varies with treatment modality. Cell Rep 2021; 34:108917. [PMID: 33789113 PMCID: PMC10450591 DOI: 10.1016/j.celrep.2021.108917] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2020] [Revised: 01/13/2021] [Accepted: 03/09/2021] [Indexed: 12/21/2022] Open
Abstract
Tumor-associated macrophages (TAMs) play an important role in tumor immunity and comprise of subsets that have distinct phenotype, function, and ontology. Transcriptomic analyses of human medulloblastoma, the most common malignant pediatric brain cancer, showed that medulloblastomas (MBs) with activated sonic hedgehog signaling (SHH-MB) have significantly more TAMs than other MB subtypes. Therefore, we examined MB-associated TAMs by single-cell RNA sequencing of autochthonous murine SHH-MB at steady state and under two distinct treatment modalities: molecular-targeted inhibitor and radiation. Our analyses reveal significant TAM heterogeneity, identify markers of ontologically distinct TAM subsets, and show the impact of brain microenvironment on the differentiation of tumor-infiltrating monocytes. TAM composition undergoes dramatic changes with treatment and differs significantly between molecular-targeted and radiation therapy. We identify an immunosuppressive monocyte-derived TAM subset that emerges with radiation therapy and demonstrate its role in regulating T cell and neutrophil infiltration in MB.
Collapse
Affiliation(s)
- Mai T Dang
- Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA, USA; Abramson Family Cancer Research Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Michael V Gonzalez
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Krutika S Gaonkar
- Center for Data-Driven Discovery in Biomedicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA; Department of Bioinformatics and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA, USA; Division of Neurosurgery, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Komal S Rathi
- Center for Data-Driven Discovery in Biomedicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA; Department of Bioinformatics and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA, USA; Division of Neurosurgery, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Patricia Young
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Sherjeel Arif
- Center for Data-Driven Discovery in Biomedicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA; Department of Bioinformatics and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA, USA; Division of Neurosurgery, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Li Zhai
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Zahidul Alam
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Samir Devalaraja
- Abramson Family Cancer Research Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Institute for Immunology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Tsun Ki Jerrick To
- Abramson Family Cancer Research Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Ian W Folkert
- Abramson Family Cancer Research Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Department of Surgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Pichai Raman
- Center for Data-Driven Discovery in Biomedicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA; Department of Bioinformatics and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA, USA; Division of Neurosurgery, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Jo Lynne Rokita
- Center for Data-Driven Discovery in Biomedicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA; Department of Bioinformatics and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA, USA; Division of Neurosurgery, Children's Hospital of Philadelphia, Philadelphia, PA, USA; Alex's Lemonade Stand Foundation Childhood Cancer Data Lab, Philadelphia, PA, USA
| | - Daniel Martinez
- Pathology Core, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Jaclyn N Taroni
- Alex's Lemonade Stand Foundation Childhood Cancer Data Lab, Philadelphia, PA, USA
| | - Joshua A Shapiro
- Alex's Lemonade Stand Foundation Childhood Cancer Data Lab, Philadelphia, PA, USA
| | - Casey S Greene
- Alex's Lemonade Stand Foundation Childhood Cancer Data Lab, Philadelphia, PA, USA; Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Candace Savonen
- Alex's Lemonade Stand Foundation Childhood Cancer Data Lab, Philadelphia, PA, USA
| | - Fernanda Mafra
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Hakon Hakonarson
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA, USA; Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Tom Curran
- Children's Research Institute at Mercy Children's Hospital, Kansas City, KS, USA
| | - Malay Haldar
- Abramson Family Cancer Research Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Institute for Immunology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
7
|
Kochmanski J, Savonen C, Bernstein AI. A Novel Application of Mixed Effects Models for Reconciling Base-Pair Resolution 5-Methylcytosine and 5-Hydroxymethylcytosine Data in Neuroepigenetics. Front Genet 2019; 10:801. [PMID: 31552098 PMCID: PMC6748167 DOI: 10.3389/fgene.2019.00801] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2019] [Accepted: 07/31/2019] [Indexed: 12/01/2022] Open
Abstract
Epigenetic marks operate at multiple chromosomal levels to regulate gene expression, from direct covalent modification of DNA to three-dimensional chromosomal structure. Research has shown that 5-methylcytosine (5-mC) and its oxidized form, 5-hydroxymethylcytosine (5-hmC), are stable epigenetic marks with distinct genomic distributions and separate regulatory functions. In addition, recent data indicate that 5-hmC plays a critical regulatory role in the mammalian brain, emphasizing the importance of considering this alternative DNA modification in the context of neuroepigenetics. Traditional bisulfite (BS) treatment-based methods to measure the methylome are not able to distinguish between 5-mC and 5-hmC, meaning much of the existing literature does not differentiate these two DNA modifications. Recently developed methods, including Tet-assisted bisulfite treatment and oxidative bisulfite treatment, allow for differentiation of 5-hmC and/or 5-mC levels at base-pair resolution when combined with next-generation sequencing or methylation arrays. Despite these technological advances, there remains a lack of clarity regarding the appropriate statistical methods for integration of 5-mC and 5-hmC data. As a result, it can be difficult to determine the effects of an experimental treatment on 5-mC and 5-hmC dynamics. Here, we propose a statistical approach involving mixed effects to simultaneously model paired 5-mC and 5-hmC data as repeated measures. We tested this approach using publicly available BS/oxidative bisulfite-450K array data and showed that our new approach detected far more CpG probes with paired changes in 5-mC and 5-hmC by Alzheimer’s disease status (n = 14,183 probes) compared with the overlapping differential probes generated from separate models for each epigenetic mark (n = 68). Of note, all 68 of the overlapping probe IDs from the separate models were also significant in our new modeling approach, supporting the sensitivity of our new analysis method. Using the proposed approach, it will be possible to determine the effects of an experimental treatment on both 5-mC and 5-hmC at the base-pair level.
Collapse
Affiliation(s)
- Joseph Kochmanski
- Department of Translational Neuroscience, College of Human Medicine, Michigan State University, Grand Rapids, MI, United States
| | - Candace Savonen
- Department of Translational Neuroscience, College of Human Medicine, Michigan State University, Grand Rapids, MI, United States
| | - Alison I Bernstein
- Department of Translational Neuroscience, College of Human Medicine, Michigan State University, Grand Rapids, MI, United States
| |
Collapse
|