1
|
Goggin SM, Zunder ER. ESCHR: a hyperparameter-randomized ensemble approach for robust clustering across diverse datasets. Genome Biol 2024; 25:242. [PMID: 39285487 PMCID: PMC11406744 DOI: 10.1186/s13059-024-03386-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Accepted: 08/30/2024] [Indexed: 09/19/2024] Open
Abstract
Clustering is widely used for single-cell analysis, but current methods are limited in accuracy, robustness, ease of use, and interpretability. To address these limitations, we developed an ensemble clustering method that outperforms other methods at hard clustering without the need for hyperparameter tuning. It also performs soft clustering to characterize continuum-like regions and quantify clustering uncertainty, demonstrated here by mapping the connectivity and intermediate transitions between MNIST handwritten digits and between hypothalamic tanycyte subpopulations. This hyperparameter-randomized ensemble approach improves the accuracy, robustness, ease of use, and interpretability of single-cell clustering, and may prove useful in other fields as well.
Collapse
Affiliation(s)
- Sarah M Goggin
- Neuroscience Graduate Program, School of Medicine, University of Virginia, Charlottesville, VA, 22902, USA
| | - Eli R Zunder
- Neuroscience Graduate Program, School of Medicine, University of Virginia, Charlottesville, VA, 22902, USA.
- Department of Biomedical Engineering, School of Engineering, University of Virginia, Charlottesville, VA, 22902, USA.
| |
Collapse
|
2
|
Deshpande D, Chhugani K, Ramesh T, Pellegrini M, Shiffman S, Abedalthagafi MS, Alqahtani S, Ye J, Liu XS, Leek JT, Brazma A, Ophoff RA, Rao G, Butte AJ, Moore JH, Katritch V, Mangul S. The evolution of computational research in a data-centric world. Cell 2024; 187:4449-4457. [PMID: 39178828 DOI: 10.1016/j.cell.2024.07.045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Revised: 07/21/2024] [Accepted: 07/24/2024] [Indexed: 08/26/2024]
Abstract
Computational data-centric research techniques play a prevalent and multi-disciplinary role in life science research. In the past, scientists in wet labs generated the data, and computational researchers focused on creating tools for the analysis of those data. Computational researchers are now becoming more independent and taking leadership roles within biomedical projects, leveraging the increased availability of public data. We are now able to generate vast amounts of data, and the challenge has shifted from data generation to data analysis. Here we discuss the pitfalls, challenges, and opportunities facing the field of data-centric research in biology. We discuss the evolving perception of computational data-driven research and its rise as an independent domain in biomedical research while also addressing the significant collaborative opportunities that arise from integrating computational research with experimental and translational biology. Additionally, we discuss the future of data-centric research and its applications across various areas of the biomedical field.
Collapse
Affiliation(s)
- Dhrithi Deshpande
- Titus Department of Clinical Pharmacy, Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, University of Southern California, Los Angeles, CA 90089, USA.
| | - Karishma Chhugani
- Titus Department of Clinical Pharmacy, Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Tejasvene Ramesh
- Department of Pharmacology and Pharmaceutical Sciences, Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Matteo Pellegrini
- Department of Molecular, Cell and Developmental Biology, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Sagiv Shiffman
- Department of Genetics, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem 9190401, Israel
| | - Malak S Abedalthagafi
- Genomics Research Department, King Fahad Medical City, Riyadh, Saudi Arabia; Department of Pathology & Laboratory Medicine, Emory University Hospital, Atlanta, GA, USA
| | - Saleh Alqahtani
- The Liver Transplant Unit, King Faisal Specialist Hospital and Research Center, Riyadh, Saudi Arabia; The Division of Gastroenterology and Hepatology, Johns Hopkins University, Baltimore, MD 21205, USA
| | - Jimmie Ye
- Department of Epidemiology & Biostatistics, Institute for Human Genetics, University of California, San Francisco, 513 Parnassus Avenue S965F, San Francisco, CA 94143, USA
| | - Xiaole Shirley Liu
- GV20 Oncotherapy, One Broadway, 14th Floor, Kendall Square, Cambridge, MA 02142, USA
| | - Jeffrey T Leek
- Biostatistics and Oncology at the Johns Hopkins Bloomberg School of Public Health and Johns Hopkins Data Science Lab, John Hopkins University, 615 N. Wolfe Street, Baltimore, MD 21205, USA
| | - Alvis Brazma
- EMBL European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Roel A Ophoff
- Department of Psychiatry and Human Genetics, Center for Neurobehavioral Genetics, University of California, Los Angeles, Los Angeles, CA, USA
| | - Gauri Rao
- Titus Department of Clinical Pharmacy, Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Atul J Butte
- Bakar Computational Health Sciences Institute, University of California, San Francisco, 490 Illinois Street, San Francisco, CA 94158, USA
| | - Jason H Moore
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, 700 N. San Vicente Boulevard, Pacific Design Center Suite G540, West Hollywood, CA 90068, USA
| | - Vsevolod Katritch
- Department of Quantitative and Computational Biology, USC Dornsife College of Letters, Arts and Sciences, University of Southern California, Los Angeles, CA 90007, USA
| | - Serghei Mangul
- Department of Quantitative and Computational Biology, USC Dornsife College of Letters, Arts and Sciences, University of Southern California, Los Angeles, CA 90007, USA.
| |
Collapse
|
3
|
Jahja E, Papajorgji P, Moskowitz H, Margioukla I, Nasto F, Dedej A, Pina P, Shella M, Collaku M, Kaziu E, Gjoni K. Measuring the perceived wellbeing of hemodialysis patients: A Mind Genomics cartography. PLoS One 2024; 19:e0302526. [PMID: 38739575 PMCID: PMC11090323 DOI: 10.1371/journal.pone.0302526] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Accepted: 04/07/2024] [Indexed: 05/16/2024] Open
Abstract
Chronic Kidney Disease patients under hemodialysis have high morbidity rate, which tends to considerably affect their health-related quality of life. Multiple studies that have made use of different questionnaries report the poor life quality of this patient group. The research in hand implemented the Mind Genomics Approach as a method to asses the health-related quality of life of hemodialysis patients, while relying on conjoint measurements to group individuals with similar patterns of responses to a certain mindset. The study is conducted in 3 clinics with 219 patients. It uncovers three clusters or mindsets: Mindset 1- Feels guardedly optimistic but worried about money, Mindset 2-Feels strongly positive because the state guarantees and the family supports, Mindset 3-Feels positive only about money. Based on the analysis of the collected data, the findings of this study suggest that the quality of life in hemodialysis patients is highly correlated to their financial status. The current study is one of the few first attempts to apply Mind Genomics in medical settings and the first, to our knowledge, in hemodialysis centers. This technology might enable healthcare proffesionals to provide personalized psychological treatment and additional social support to patients, which in turn could improve their clinical outcomes. The study is an example of using technology as a service.
Collapse
Affiliation(s)
- Ermira Jahja
- Faculty of Dental Medicine, Department of Basic Sciences, Western Balkans University, Tirana, Albania
| | | | - Howard Moskowitz
- World Institute of Competitive Excellence, New York, New York, United States of America
| | | | - Fjona Nasto
- Department of Nephrology, American Hospital, Tirana, Albania
- Diavita Dialysis Center, Elbasan, Albania
| | - Arjeta Dedej
- Department of Nephrology, American Hospital, Tirana, Albania
| | - Paola Pina
- Diavita Dialysis Center, Elbasan, Albania
| | | | - Manjola Collaku
- Faculty of Technical Medical Sciences, Department of Medicine, Western Balkans University, Tirana, Albania
| | - Erjona Kaziu
- Department of Nephrology, American Hospital, Tirana, Albania
| | - Kristela Gjoni
- Department of Nephrology, American Hospital, Tirana, Albania
| |
Collapse
|
4
|
Rieckmann A, Nielsen S, Dworzynski P, Amini H, Mogensen SW, Silva IB, Chang AY, Arah OA, Samek W, Rod NH, Ekstrøm CT, Benn CS, Aaby P, Fisker AB. Discovering Subgroups of Children With High Mortality in Urban Guinea-Bissau: Exploratory and Validation Cohort Study. JMIR Public Health Surveill 2024; 10:e48060. [PMID: 38592761 PMCID: PMC11040440 DOI: 10.2196/48060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Revised: 12/22/2023] [Accepted: 01/23/2024] [Indexed: 04/10/2024] Open
Abstract
BACKGROUND The decline in global child mortality is an important public health achievement, yet child mortality remains disproportionally high in many low-income countries like Guinea-Bissau. The persisting high mortality rates necessitate targeted research to identify vulnerable subgroups of children and formulate effective interventions. OBJECTIVE This study aimed to discover subgroups of children at an elevated risk of mortality in the urban setting of Bissau, Guinea-Bissau, West Africa. By identifying these groups, we intend to provide a foundation for developing targeted health interventions and inform public health policy. METHODS We used data from the health and demographic surveillance site, Bandim Health Project, covering 2003 to 2019. We identified baseline variables recorded before children reached the age of 6 weeks. The focus was on determining factors consistently linked with increased mortality up to the age of 3 years. Our multifaceted methodological approach incorporated spatial analysis for visualizing geographical variations in mortality risk, causally adjusted regression analysis to single out specific risk factors, and machine learning techniques for identifying clusters of multifactorial risk factors. To ensure robustness and validity, we divided the data set temporally, assessing the persistence of identified subgroups over different periods. The reassessment of mortality risk used the targeted maximum likelihood estimation (TMLE) method to achieve more robust causal modeling. RESULTS We analyzed data from 21,005 children. The mortality risk (6 weeks to 3 years of age) was 5.2% (95% CI 4.8%-5.6%) for children born between 2003 and 2011, and 2.9% (95% CI 2.5%-3.3%) for children born between 2012 and 2016. Our findings revealed 3 distinct high-risk subgroups with notably higher mortality rates, children residing in a specific urban area (adjusted mortality risk difference of 3.4%, 95% CI 0.3%-6.5%), children born to mothers with no prenatal consultations (adjusted mortality risk difference of 5.8%, 95% CI 2.6%-8.9%), and children from polygamous families born during the dry season (adjusted mortality risk difference of 1.7%, 95% CI 0.4%-2.9%). These subgroups, though small, showed a consistent pattern of higher mortality risk over time. Common social and economic factors were linked to a larger share of the total child deaths. CONCLUSIONS The study's results underscore the need for targeted interventions to address the specific risks faced by these identified high-risk subgroups. These interventions should be designed to work to complement broader public health strategies, creating a comprehensive approach to reducing child mortality. We suggest future research that focuses on developing, testing, and comparing targeted intervention strategies unraveling the proposed hypotheses found in this study. The ultimate aim is to optimize health outcomes for all children in high-mortality settings, leveraging a strategic mix of targeted and general health interventions to address the varied needs of different child subgroups.
Collapse
Affiliation(s)
- Andreas Rieckmann
- Section of Epidemiology, Department of Public Health, University of Copenhagen, Copenhagen, Denmark
| | - Sebastian Nielsen
- Bandim Health Project, INDEPTH Network, Bissau, Guinea-Bissau
- Bandim Health Project, Research unit Odense Patient Data Explorative Network (OPEN), Department of Clinical Research, Odense University Hospital and University of Southern Denmark, Odense, Denmark
| | - Piotr Dworzynski
- Novo Nordisk Foundation Center for Basic Metabolic Research, University of Copenhagen, Copenhagen, Denmark
| | - Heresh Amini
- Department of Environmental Medicine and Climate Science, Icahn School of Medicine at Mount Sinai, New York, NY, United States
- Institute for Climate Change, Environmental Health, and Exposomics, Icahn School of Medicine at Mount Sinai, New York, NY, United States
| | | | - Isaquel Bartolomeu Silva
- Bandim Health Project, INDEPTH Network, Bissau, Guinea-Bissau
- Bandim Health Project, Research unit Odense Patient Data Explorative Network (OPEN), Department of Clinical Research, Odense University Hospital and University of Southern Denmark, Odense, Denmark
| | - Angela Y Chang
- Danish Institute for Advanced Study, University of Southern Denmark, Odense, Denmark
- The Interdisciplinary Centre on Population Dynamics, University of Southern Denmark, Odense, Denmark
| | - Onyebuchi A Arah
- Department of Epidemiology, Fielding School of Public Health, University of California, Los Angeles, Los Angeles, CA, United States
- Department of Statistics and Data Science, College of Letters and Science, University of California, Los Angeles, Los Angeles, CA, United States
- Research Unit for Epidemiology, Department of Public Health, University of Aarhus, Aarhus, Denmark
| | - Wojciech Samek
- Department of Artificial Intelligence, Fraunhofer Heinrich Hertz Institute, Berlin, Germany
- Department of Electrical Engineering and Computer Science, Technical University of Berlin, Berlin, Germany
- Berlin Institute for the Foundations of Learning and Data, Berlin, Germany
| | - Naja Hulvej Rod
- Section of Epidemiology, Department of Public Health, University of Copenhagen, Copenhagen, Denmark
| | - Claus Thorn Ekstrøm
- Section of Biostatistics, Department of Public Health, University of Copenhagen, Copenhagen, Denmark
| | - Christine Stabell Benn
- Bandim Health Project, INDEPTH Network, Bissau, Guinea-Bissau
- Bandim Health Project, Research unit Odense Patient Data Explorative Network (OPEN), Department of Clinical Research, Odense University Hospital and University of Southern Denmark, Odense, Denmark
- Danish Institute for Advanced Study, University of Southern Denmark, Odense, Denmark
| | - Peter Aaby
- Bandim Health Project, INDEPTH Network, Bissau, Guinea-Bissau
- Bandim Health Project, Research unit Odense Patient Data Explorative Network (OPEN), Department of Clinical Research, Odense University Hospital and University of Southern Denmark, Odense, Denmark
| | - Ane Bærent Fisker
- Bandim Health Project, INDEPTH Network, Bissau, Guinea-Bissau
- Bandim Health Project, Research unit Odense Patient Data Explorative Network (OPEN), Department of Clinical Research, Odense University Hospital and University of Southern Denmark, Odense, Denmark
| |
Collapse
|
5
|
Noble D. Editorial for online collection - The gene: An appraisal. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2024; 187:1-4. [PMID: 38176659 DOI: 10.1016/j.pbiomolbio.2024.01.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2024]
Affiliation(s)
- Denis Noble
- Department of Physiology, Anatomy & Genetics, University of Oxford, UK.
| |
Collapse
|
6
|
Gonzalez LE, Wei H, Greco V, Friedlaender LK. The art of observation: bridging science and art to see the unexpected. Development 2024; 151:dev202786. [PMID: 38477686 DOI: 10.1242/dev.202786] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/14/2024]
Abstract
Observation is the heart of research, but it can be challenging to observe deeply and go beyond expected observations. Here, we describe activities designed for scientists to enhance their observational skills by engaging with art. In collaboration with an art gallery at our university, our lab practiced observing representational paintings in a systematic way, separating the act of observation from interpretation. Applying this skill to our microscopy images allowed us to access information in the data that may otherwise have been overlooked. In addition, these activities highlighted the power of collecting observations from multiple observers before generating interpretations, as well as the value of discussing the creative and emotional aspects of data collection and interpretation. We provide concrete examples of how we will incorporate these skills into our research processes, as well as details that other groups can use to engage in similar art-based training activities to enhance their own observational skills.
Collapse
Affiliation(s)
- Lauren E Gonzalez
- Department of Genetics, Yale University School of Medicine, New Haven, CT 06510, USA
- Poorvu Center for Teaching and Learning, Yale University, New Haven, CT 06511, USA
| | - Haoyang Wei
- Department of Genetics, Yale University School of Medicine, New Haven, CT 06510, USA
- Howard Hughes Medical Institute, University of Wisconsin-Madison, Madison, WI 53715, USA
- Regenerative Biology, Morgridge Institute for Research, Madison, WI 53715, USA
| | - Valentina Greco
- Department of Genetics, Yale University School of Medicine, New Haven, CT 06510, USA
| | | |
Collapse
|
7
|
Felsky D, Cannitelli A, Pipitone J. Whole Person Modeling: a transdisciplinary approach to mental health research. DISCOVER MENTAL HEALTH 2023; 3:16. [PMID: 37638348 PMCID: PMC10449734 DOI: 10.1007/s44192-023-00041-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/02/2023] [Accepted: 08/10/2023] [Indexed: 08/29/2023]
Abstract
The growing global burden of mental illness has prompted calls for innovative research strategies. Theoretical models of mental health include complex contributions of biological, psychosocial, experiential, and other environmental influences. Accordingly, neuropsychiatric research has self-organized into largely isolated disciplines working to decode each individual contribution. However, research directly modeling objective biological measurements in combination with cognitive, psychological, demographic, or other environmental measurements is only now beginning to proliferate. This review aims to (1) to describe the landscape of modern mental health research and current movement towards integrative study, (2) to provide a concrete framework for quantitative integrative research, which we call Whole Person Modeling, (3) to explore existing and emerging techniques and methods used in Whole Person Modeling, and (4) to discuss our observations about the scarcity, potential value, and untested aspects of highly transdisciplinary research in general. Whole Person Modeling studies have the potential to provide a better understanding of multilevel phenomena, deliver more accurate diagnostic and prognostic tests to aid in clinical decision making, and test long standing theoretical models of mental illness. Some current barriers to progress include challenges with interdisciplinary communication and collaboration, systemic cultural barriers to transdisciplinary career paths, technical challenges in model specification, bias, and data harmonization, and gaps in transdisciplinary educational programs. We hope to ease anxiety in the field surrounding the often mysterious and intimidating world of transdisciplinary, data-driven mental health research and provide a useful orientation for students or highly specialized researchers who are new to this area.
Collapse
Affiliation(s)
- Daniel Felsky
- Krembil Centre for Neuroinformatics, Centre for Addiction and Mental Health, 250 College Street, Toronto, ON M5T 1R8 Canada
- Department of Psychiatry, Faculty of Medicine, University of Toronto, Toronto, ON Canada
- Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON Canada
- Rotman Research Institute, Baycrest Hospital, Toronto, ON Canada
- Faculty of Medicine, McMaster University, Hamilton, ON Canada
| | - Alyssa Cannitelli
- Krembil Centre for Neuroinformatics, Centre for Addiction and Mental Health, 250 College Street, Toronto, ON M5T 1R8 Canada
- Faculty of Medicine, McMaster University, Hamilton, ON Canada
| | - Jon Pipitone
- Department of Psychiatry, Queen’s University, Kingston, ON Canada
| |
Collapse
|
8
|
West J, Robertson-Tessi M, Anderson ARA. Agent-based methods facilitate integrative science in cancer. Trends Cell Biol 2023; 33:300-311. [PMID: 36404257 PMCID: PMC10918696 DOI: 10.1016/j.tcb.2022.10.006] [Citation(s) in RCA: 21] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Revised: 10/25/2022] [Accepted: 10/31/2022] [Indexed: 11/19/2022]
Abstract
In this opinion, we highlight agent-based modeling as a key tool for exploration of cell-cell and cell-environment interactions that drive cancer progression, therapeutic resistance, and metastasis. These biological phenomena are particularly suited to be captured at the cell-scale resolution possible only within agent-based or individual-based mathematical models. These modeling approaches complement experimental work (in vitro and in vivo systems) through parameterization and data extrapolation but also feed forward to drive new experiments that test model-generated predictions.
Collapse
Affiliation(s)
- Jeffrey West
- Integrated Mathematical Oncology Department, H. Lee Moffitt Cancer Center & Research Institute, Tampa, FL, USA
| | - Mark Robertson-Tessi
- Integrated Mathematical Oncology Department, H. Lee Moffitt Cancer Center & Research Institute, Tampa, FL, USA
| | - Alexander R A Anderson
- Integrated Mathematical Oncology Department, H. Lee Moffitt Cancer Center & Research Institute, Tampa, FL, USA.
| |
Collapse
|
9
|
Yanai I, Lercher MJ. Make science disruptive again. Nat Biotechnol 2023; 41:450-451. [PMID: 36973558 DOI: 10.1038/s41587-023-01736-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2023]
Affiliation(s)
- Itai Yanai
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA.
| | - Martin J Lercher
- Institute for Computer Science & Department of Biology, Heinrich Heine University, Düsseldorf, Germany
| |
Collapse
|
10
|
van den Hurk M, Lau S, Marchetto MC, Mertens J, Stern S, Corti O, Brice A, Winner B, Winkler J, Gage FH, Bardy C. Druggable transcriptomic pathways revealed in Parkinson's patient-derived midbrain neurons. NPJ Parkinsons Dis 2022; 8:134. [PMID: 36258029 PMCID: PMC9579158 DOI: 10.1038/s41531-022-00400-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Accepted: 09/28/2022] [Indexed: 11/06/2022] Open
Abstract
Complex genetic predispositions accelerate the chronic degeneration of midbrain substantia nigra neurons in Parkinson’s disease (PD). Deciphering the human molecular makeup of PD pathophysiology can guide the discovery of therapeutics to slow the disease progression. However, insights from human postmortem brain studies only portray the latter stages of PD, and there is a lack of data surrounding molecular events preceding the neuronal loss in patients. We address this gap by identifying the gene dysregulation of live midbrain neurons reprogrammed in vitro from the skin cells of 42 individuals, including sporadic and familial PD patients and matched healthy controls. To minimize bias resulting from neuronal reprogramming and RNA-seq methods, we developed an analysis pipeline integrating PD transcriptomes from different RNA-seq datasets (unsorted and sorted bulk vs. single-cell and Patch-seq) and reprogramming strategies (induced pluripotency vs. direct conversion). This PD cohort’s transcriptome is enriched for human genes associated with known clinical phenotypes of PD, regulation of locomotion, bradykinesia and rigidity. Dysregulated gene expression emerges strongest in pathways underlying synaptic transmission, metabolism, intracellular trafficking, neural morphogenesis and cellular stress/immune responses. We confirmed a synaptic impairment with patch-clamping and identified pesticides and endoplasmic reticulum stressors as the most significant gene-chemical interactions in PD. Subsequently, we associated the PD transcriptomic profile with candidate pharmaceuticals in a large database and a registry of current clinical trials. This study highlights human transcriptomic pathways that can be targeted therapeutically before the irreversible neuronal loss. Furthermore, it demonstrates the preclinical relevance of unbiased large transcriptomic assays of reprogrammed patient neurons.
Collapse
Affiliation(s)
- Mark van den Hurk
- grid.430453.50000 0004 0565 2606South Australian Health and Medical Research Institute (SAHMRI), Laboratory for Human Neurophysiology and Genetics, Adelaide, SA Australia
| | - Shong Lau
- grid.250671.70000 0001 0662 7144Laboratory of Genetics, Salk Institute for Biological Studies, La Jolla, CA USA
| | - Maria C. Marchetto
- grid.266100.30000 0001 2107 4242Department of Anthropology, University of California San Diego, La Jolla, CA USA
| | - Jerome Mertens
- grid.250671.70000 0001 0662 7144Laboratory of Genetics, Salk Institute for Biological Studies, La Jolla, CA USA ,grid.5771.40000 0001 2151 8122Neural Aging Laboratory, Institute of Molecular Biology, CMBI, Leopold-Franzens-University Innsbruck, Innsbruck, Tyrol Austria
| | - Shani Stern
- grid.250671.70000 0001 0662 7144Laboratory of Genetics, Salk Institute for Biological Studies, La Jolla, CA USA ,grid.18098.380000 0004 1937 0562Sagol Department of Neurobiology, Faculty of Natural Sciences, University of Haifa, Haifa, Israel
| | - Olga Corti
- grid.425274.20000 0004 0620 5939Sorbonne Université, Institut du Cerveau - Paris Brain Institute - ICM, Inserm, CNRS, AP-HP, Hôpital de la Pitié Salpêtrière, DMU BioGeM, Paris, France
| | - Alexis Brice
- grid.425274.20000 0004 0620 5939Sorbonne Université, Institut du Cerveau - Paris Brain Institute - ICM, Inserm, CNRS, AP-HP, Hôpital de la Pitié Salpêtrière, DMU BioGeM, Paris, France
| | - Beate Winner
- grid.411668.c0000 0000 9935 6525Department of Stem Cell Biology, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg (FAU), Erlangen, Germany ,grid.411668.c0000 0000 9935 6525Center of Rare Diseases Erlangen (ZSEER), University Hospital Erlangen, FAU Erlangen-Nürnberg, Erlangen, Germany ,grid.411668.c0000 0000 9935 6525Department of Molecular Neurology, University Hospital Erlangen, FAU Erlangen-Nürnberg, Erlangen, Germany
| | - Jürgen Winkler
- grid.411668.c0000 0000 9935 6525Department of Stem Cell Biology, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg (FAU), Erlangen, Germany ,grid.411668.c0000 0000 9935 6525Center of Rare Diseases Erlangen (ZSEER), University Hospital Erlangen, FAU Erlangen-Nürnberg, Erlangen, Germany ,grid.411668.c0000 0000 9935 6525Department of Molecular Neurology, University Hospital Erlangen, FAU Erlangen-Nürnberg, Erlangen, Germany
| | - Fred H. Gage
- grid.250671.70000 0001 0662 7144Laboratory of Genetics, Salk Institute for Biological Studies, La Jolla, CA USA
| | - Cedric Bardy
- grid.430453.50000 0004 0565 2606South Australian Health and Medical Research Institute (SAHMRI), Laboratory for Human Neurophysiology and Genetics, Adelaide, SA Australia ,grid.1014.40000 0004 0367 2697Flinders Health and Medical Research Institute, Flinders University, Adelaide, SA Australia
| |
Collapse
|
11
|
Abstract
The immune system is highly complex and distributed throughout an organism, with hundreds to thousands of cell states existing in parallel with diverse molecular pathways interacting in a highly dynamic and coordinated fashion. Although the characterization of individual genes and molecules is of the utmost importance for understanding immune-system function, high-throughput, high-resolution omics technologies combined with sophisticated computational modeling and machine-learning approaches are creating opportunities to complement standard immunological methods with new insights into immune-system dynamics. Like systems immunology itself, immunology researchers must take advantage of these technologies and form their own diverse networks, connecting with researchers from other disciplines. This Review is an introduction and 'how-to guide' for immunologists with no particular experience in the field of omics but with the intention to learn about and apply these systems-level approaches, and for immunologists who want to make the most of interdisciplinary networks.
Collapse
|
12
|
Zucco AG, Agius R, Svanberg R, Moestrup KS, Marandi RZ, MacPherson CR, Lundgren J, Ostrowski SR, Niemann CU. Personalized survival probabilities for SARS-CoV-2 positive patients by explainable machine learning. Sci Rep 2022; 12:13879. [PMID: 35974050 PMCID: PMC9380679 DOI: 10.1038/s41598-022-17953-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 08/03/2022] [Indexed: 01/08/2023] Open
Abstract
Interpretable risk assessment of SARS-CoV-2 positive patients can aid clinicians to implement precision medicine. Here we trained a machine learning model to predict mortality within 12 weeks of a first positive SARS-CoV-2 test. By leveraging data on 33,938 confirmed SARS-CoV-2 cases in eastern Denmark, we considered 2723 variables extracted from electronic health records (EHR) including demographics, diagnoses, medications, laboratory test results and vital parameters. A discrete-time framework for survival modelling enabled us to predict personalized survival curves and explain individual risk factors. Performance on the test set was measured with a weighted concordance index of 0.95 and an area under the curve for precision-recall of 0.71. Age, sex, number of medications, previous hospitalizations and lymphocyte counts were identified as top mortality risk factors. Our explainable survival model developed on EHR data also revealed temporal dynamics of the 22 selected risk factors. Upon further validation, this model may allow direct reporting of personalized survival probabilities in routine care.
Collapse
Affiliation(s)
- Adrian G Zucco
- PERSIMUNE Center of Excellence, Rigshospitalet, Copenhagen, Denmark.
| | - Rudi Agius
- Department of Hematology, Rigshospitalet, Copenhagen, Denmark
| | | | | | - Ramtin Z Marandi
- PERSIMUNE Center of Excellence, Rigshospitalet, Copenhagen, Denmark
| | | | - Jens Lundgren
- PERSIMUNE Center of Excellence, Rigshospitalet, Copenhagen, Denmark.,Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark
| | - Sisse R Ostrowski
- Department of Clinical Immunology, Rigshospitalet, Copenhagen, Denmark. .,Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark.
| | - Carsten U Niemann
- Department of Hematology, Rigshospitalet, Copenhagen, Denmark. .,Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
13
|
|
14
|
Lieberman ZE, Billen J, Kamp T, Boudinot BE. The ant abdomen: the skeletomuscular and soft tissue anatomy of
Amblyopone australis
workers (Hymenoptera: Formicidae). J Morphol 2022; 283:693-770. [DOI: 10.1002/jmor.21471] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2021] [Revised: 02/28/2022] [Accepted: 03/09/2022] [Indexed: 11/07/2022]
Affiliation(s)
- Ziv Ellis Lieberman
- Department of Entomology and Nematology University of California Davis, One Shields Ave, Davis, CA, U. S. A. 95616
| | - Johan Billen
- Zoological Institute, University of Leuven, Naamsestraat 59, Box 2466, B‐3000 Leuven Belgium
| | - Thomas Kamp
- Institute for Photon Science and Synchrotron Radiation (IPS), Karlsruhe Institute of Technology (KIT), Hermann‐von‐Helmholtz‐Platz 1, 76344 Eggenstein‐Leopoldshafen Germany
- Laboratory for Applications of Synchrotron Radiation (LAS), Karlsruhe Institute of Technology (KIT), Kaiserstraße 12 Karlsruhe Germany
| | - Brendon Elias Boudinot
- Friedrich‐Schiller‐Universität Jena, Institut für Spezielle Zoologie und Evolutionsforschung, Entomologie Gruppe, Erbertstraße 1 07743 Jena Germany
| |
Collapse
|
15
|
An expert-curated global database of online newspaper articles on spiders and spider bites. Sci Data 2022; 9:109. [PMID: 35347145 PMCID: PMC8960780 DOI: 10.1038/s41597-022-01197-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2021] [Accepted: 02/07/2022] [Indexed: 11/13/2022] Open
Abstract
Mass media plays an important role in the construction and circulation of risk perception associated with animals. Widely feared groups such as spiders frequently end up in the spotlight of traditional and social media. We compiled an expert-curated global database on the online newspaper coverage of human-spider encounters over the past ten years (2010–2020). This database includes information about the location of each human-spider encounter reported in the news article and a quantitative characterisation of the content—location, presence of photographs of spiders and bites, number and type of errors, consultation of experts, and a subjective assessment of sensationalism. In total, we collected 5348 unique news articles from 81 countries in 40 languages. The database refers to 211 identified and unidentified spider species and 2644 unique human-spider encounters (1121 bites and 147 as deadly bites). To facilitate data reuse, we explain the main caveats that need to be made when analysing this database and discuss research ideas and questions that can be explored with it. Measurement(s) | Newspaper articles on human-spider encounters | Technology Type(s) | Manual extraction | Sample Characteristic - Organism | Spiders (Arachnida: Araneae) | Sample Characteristic - Environment | Online | Sample Characteristic - Location | Global |
Collapse
|
16
|
Baillie M, le Cessie S, Schmidt CO, Lusa L, Huebner M. Ten simple rules for initial data analysis. PLoS Comput Biol 2022; 18:e1009819. [PMID: 35202399 PMCID: PMC8870512 DOI: 10.1371/journal.pcbi.1009819] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Affiliation(s)
| | - Saskia le Cessie
- Department of Clinical Epidemiology and Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, Netherlands
| | - Carsten Oliver Schmidt
- Institute for Community Medicine, SHIP-KEF University Medicine of Greifswald, Greifswald, Germany
| | - Lara Lusa
- Department of Mathematics, Faculty of Mathematics, Natural Sciences and Information Technology, University of Primorska, Koper, Slovenia
| | - Marianne Huebner
- Department of Statistics and Probability, Michigan State University, East Lansing, Michigan, United States of America
- * E-mail:
| | | |
Collapse
|
17
|
Yanai I, Lercher M. Improvisational science. Genome Biol 2022; 23:4. [PMID: 34980206 PMCID: PMC8721986 DOI: 10.1186/s13059-021-02575-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Affiliation(s)
- Itai Yanai
- Institute for Computational Medicine, NYU Langone Health, New York, NY, 10016, USA.
| | - Martin Lercher
- Institute for Computer Science & Department of Biology, Heinrich Heine University, 40225, Düsseldorf, Germany.
| |
Collapse
|
18
|
Abstract
Biological development is often described as a dynamic, emergent process. This is evident across a variety of phenomena, from the temporal organization of cell types in the embryo to compounding trends that affect large-scale differentiation. To better understand this, we propose combining quantitative investigations of biological development with theory-building techniques. This provides an alternative to the gene-centric view of development: namely, the view that developmental genes and their expression determine the complexity of the developmental phenotype. Using the model system Caenorhabditis elegans, we examine time-dependent properties of the embryonic phenotype and utilize the unique life-history properties to demonstrate how these emergent properties can be linked together by data analysis and theory-building. We also focus on embryogenetic differentiation processes, and how terminally-differentiated cells contribute to structure and function of the adult phenotype. Examining embryogenetic dynamics from 200 to 400 min post-fertilization provides basic quantitative information on developmental tempo and process. To summarize, theory construction techniques are summarized and proposed as a way to rigorously interpret our data. Our proposed approach to a formal data representation that can provide critical links across life-history, anatomy and function.
Collapse
|
19
|
Affiliation(s)
- Roy S Hessels
- Experimental Psychology, Helmholtz Institute, 8125Utrecht University, Utrecht, The Netherlands
| | - Ignace T C Hooge
- Experimental Psychology, Helmholtz Institute, 8125Utrecht University, Utrecht, The Netherlands
| |
Collapse
|
20
|
Tosa MI, Dziedzic EH, Appel CL, Urbina J, Massey A, Ruprecht J, Eriksson CE, Dolliver JE, Lesmeister DB, Betts MG, Peres CA, Levi T. The Rapid Rise of Next-Generation Natural History. Front Ecol Evol 2021. [DOI: 10.3389/fevo.2021.698131] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Many ecologists have lamented the demise of natural history and have attributed this decline to a misguided view that natural history is outdated and unscientific. Although there is a perception that the focus in ecology and conservation have shifted away from descriptive natural history research and training toward hypothetico-deductive research, we argue that natural history has entered a new phase that we call “next-generation natural history.” This renaissance of natural history is characterized by technological and statistical advances that aid in collecting detailed observations systematically over broad spatial and temporal extents. The technological advances that have increased exponentially in the last decade include electronic sensors such as camera-traps and acoustic recorders, aircraft- and satellite-based remote sensing, animal-borne biologgers, genetics and genomics methods, and community science programs. Advances in statistics and computation have aided in analyzing a growing quantity of observations to reveal patterns in nature. These robust next-generation natural history datasets have transformed the anecdotal perception of natural history observations into systematically collected observations that collectively constitute the foundation for hypothetico-deductive research and can be leveraged and applied to conservation and management. These advances are encouraging scientists to conduct and embrace detailed descriptions of nature that remain a critically important component of the scientific endeavor. Finally, these next-generation natural history observations are engaging scientists and non-scientists alike with new documentations of the wonders of nature. Thus, we celebrate next-generation natural history for encouraging people to experience nature directly.
Collapse
|
21
|
Kitano H. Nobel Turing Challenge: creating the engine for scientific discovery. NPJ Syst Biol Appl 2021; 7:29. [PMID: 34145287 PMCID: PMC8213706 DOI: 10.1038/s41540-021-00189-3] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2021] [Accepted: 06/03/2021] [Indexed: 12/15/2022] Open
Abstract
Scientific discovery has long been one of the central driving forces in our civilization. It uncovered the principles of the world we live in, and enabled us to invent new technologies reshaping our society, cure diseases, explore unknown new frontiers, and hopefully lead us to build a sustainable society. Accelerating the speed of scientific discovery is therefore one of the most important endeavors. This requires an in-depth understanding of not only the subject areas but also the nature of scientific discoveries themselves. In other words, the "science of science" needs to be established, and has to be implemented using artificial intelligence (AI) systems to be practically executable. At the same time, what may be implemented by "AI Scientists" may not resemble the scientific process conducted by human scientist. It may be an alternative form of science that will break the limitation of current scientific practice largely hampered by human cognitive limitation and sociological constraints. It could give rise to a human-AI hybrid form of science that shall bring systems biology and other sciences into the next stage. The Nobel Turing Challenge aims to develop a highly autonomous AI system that can perform top-level science, indistinguishable from the quality of that performed by the best human scientists, where some of the discoveries may be worthy of Nobel Prize level recognition and beyond.
Collapse
Affiliation(s)
- Hiroaki Kitano
- The Systems Biology Institute, Tokyo, Japan; Okinawa Institute of Science and Technology Graduate School, Okinawa, Japan; Sony Computer Science Laboratories, Inc., Tokyo, Japan; Sony AI, Inc., Tokyo, Japan; and The Alan Turing Institute, London, UK.
| |
Collapse
|
22
|
Affiliation(s)
- Itai Yanai
- Institute for Computational Medicine, NYU Langone Health, New York, NY, 10016, USA.
| | - Martin Lercher
- Institute for Computer Science & Department of Biology, Heinrich Heine University, 40225, Düsseldorf, Germany.
| |
Collapse
|
23
|
Abstract
A systematic and reproducible “workflow”—the process that moves a scientific investigation from raw data to coherent research question to insightful contribution—should be a fundamental part of academic data-intensive research practice. In this paper, we elaborate basic principles of a reproducible data analysis workflow by defining 3 phases: the Explore, Refine, and Produce Phases. Each phase is roughly centered around the audience to whom research decisions, methodologies, and results are being immediately communicated. Importantly, each phase can also give rise to a number of research products beyond traditional academic publications. Where relevant, we draw analogies between design principles and established practice in software development. The guidance provided here is not intended to be a strict rulebook; rather, the suggestions for practices and tools to advance reproducible, sound data-intensive analysis may furnish support for both students new to research and current researchers who are new to data-intensive work.
Collapse
|
24
|
|
25
|
Affiliation(s)
- Itai Yanai
- Institute for Computational Medicine, NYU Langone Health, New York, NY, 10016, USA.
| | - Martin Lercher
- Institute for Computer Science & Department of Biology, Heinrich Heine University, 40225, Düsseldorf, Germany.
| |
Collapse
|
26
|
mSphere of Influence: That's Racist-COVID-19, Biological Determinism, and the Limits of Hypotheses. mSphere 2020; 5:5/5/e00945-20. [PMID: 33025909 PMCID: PMC7529441 DOI: 10.1128/msphere.00945-20] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Kishana Taylor works in the field of virology. In this mSphere of Influence article, she reflects on the personal impact of "Racial health disparities and COVID-19 - caution and context" by Merlin Chowkwanyun and Adolph L. Reed, Jr. (N Engl J Med 383:201-203, 2020, https://doi.org/10.1056/NEJMp2012910) and "A hypothesis is a liability" by Itai Yanai and Martin Lercher (Genome Biol 21:231, 2020, https://doi.org/10.1186/s13059-020-02133-w) and how it became part of the mission for Black In Microbiology Week.
Collapse
|