1
|
Zhang Y, Kim M, Prerau M, Mobley D, Rueschman M, Sparks K, Tully M, Purcell S, Redline S. The National Sleep Research Resource: making data findable, accessible, interoperable, reusable and promoting sleep science. Sleep 2024; 47:zsae088. [PMID: 38688470 PMCID: PMC11236948 DOI: 10.1093/sleep/zsae088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Revised: 03/15/2024] [Indexed: 05/02/2024] Open
Abstract
This paper presents a comprehensive overview of the National Sleep Research Resource (NSRR), a National Heart Lung and Blood Institute-supported repository developed to share data from clinical studies focused on the evaluation of sleep disorders. The NSRR addresses challenges presented by the heterogeneity of sleep-related data, leveraging innovative strategies to optimize the quality and accessibility of available datasets. It provides authorized users with secure centralized access to a large quantity of sleep-related data including polysomnography, actigraphy, demographics, patient-reported outcomes, and other data. In developing the NSRR, we have implemented data processing protocols that ensure de-identification and compliance with FAIR (Findable, Accessible, Interoperable, Reusable) principles. Heterogeneity stemming from intrinsic variation in the collection, annotation, definition, and interpretation of data has proven to be one of the primary obstacles to efficient sharing of datasets. Approaches employed by the NSRR to address this heterogeneity include (1) development of standardized sleep terminologies utilizing a compositional coding scheme, (2) specification of comprehensive metadata, (3) harmonization of commonly used variables, and (3) computational tools developed to standardize signal processing. We have also leveraged external resources to engineer a domain-specific approach to data harmonization. We describe the scope of data within the NSRR, its role in promoting sleep and circadian research through data sharing, and harmonization of large datasets and analytical tools. Finally, we identify opportunities for approaches for the field of sleep medicine to further support data standardization and sharing.
Collapse
Affiliation(s)
- Ying Zhang
- Division of Sleep Medicine and Circadian Disorders, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
| | - Matthew Kim
- Division of Endocrinology, Diabetes and Hypertension, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
| | - Michael Prerau
- Division of Sleep Medicine and Circadian Disorders, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
| | - Daniel Mobley
- Division of Sleep Medicine and Circadian Disorders, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
| | - Michael Rueschman
- Division of Sleep Medicine and Circadian Disorders, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
| | - Kathryn Sparks
- Division of Sleep Medicine and Circadian Disorders, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
| | - Meg Tully
- Division of Sleep Medicine and Circadian Disorders, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
| | - Shaun Purcell
- Department of Psychiatry, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
| | - Susan Redline
- Division of Sleep Medicine and Circadian Disorders, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
| |
Collapse
|
2
|
Te TT, Keenan BT, Veatch OJ, Boland MR, Hubbard RA, Pack AI. Identifying clusters of patient comorbidities associated with obstructive sleep apnea using electronic health records. J Clin Sleep Med 2024; 20:521-533. [PMID: 38054454 PMCID: PMC10985292 DOI: 10.5664/jcsm.10930] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Revised: 11/15/2023] [Accepted: 11/16/2023] [Indexed: 12/07/2023]
Abstract
STUDY OBJECTIVES The objectives of this study were to understand the relative comorbidity burden of obstructive sleep apnea (OSA), determine whether these relationships were modified by sex or age, and identify patient subtypes defined by common comorbidities. METHODS Cases with OSA and noncases (controls) were defined using a validated electronic health record (EHR)-based phenotype and matched for age, sex, and time period of follow-up in the EHR. We compared prevalence of the 20 most common comorbidities between matched cases and controls using conditional logistic regression with and without controlling for body mass index. Latent class analysis was used to identify subtypes of OSA cases defined by combinations of these comorbidities. RESULTS In total, 60,586 OSA cases were matched to 60,586 controls (from 1,226,755 total controls). Patients with OSA were more likely to have each of the 20 most common comorbidities compared with controls, with odds ratios ranging from 3.1 to 30.8 in the full matched set and 1.3 to 10.2 after body mass index adjustment. Associations between OSA and these comorbidities were generally stronger in females and patients with younger age at diagnosis. We identified 5 distinct subgroups based on EHR-defined comorbidities: High Comorbidity Burden, Low Comorbidity Burden, Cardiovascular Comorbidities, Inflammatory Conditions and Less Obesity, and Inflammatory Conditions and Obesity. CONCLUSIONS Our study demonstrates the power of leveraging the EHR to understand the relative health burden of OSA, as well as heterogeneity in these relationships based on age and sex. In addition to enrichment for comorbidities, we identified 5 novel OSA subtypes defined by combinations of comorbidities in the EHR, which may be informative for understanding disease outcomes and improving prevention and clinical care. Overall, this study adds more evidence that OSA is heterogeneous and requires personalized management. CITATION Te TT, Keenan BT, Veatch OJ, Boland MR, Hubbard RA, Pack AI. Identifying clusters of patient comorbidities associated with obstructive sleep apnea using electronic health records. J Clin Sleep Med. 2024;20(4):521-533.
Collapse
Affiliation(s)
- Tue T. Te
- Division of Sleep Medicine, Department of Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania
| | - Brendan T. Keenan
- Division of Sleep Medicine, Department of Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania
| | - Olivia J. Veatch
- Department of Psychiatry and Behavioral Sciences, University of Kansas Medical Center, Kansas City, Kansas
| | - Mary Regina Boland
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Rebecca A. Hubbard
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Allan I. Pack
- Division of Sleep Medicine, Department of Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania
| |
Collapse
|
3
|
Mazzotti DR. Multimodal integration of sleep electroencephalogram, brain imaging, and cognitive assessments: approaches using noisy clinical data. Sleep 2024; 47:zsad305. [PMID: 38019853 PMCID: PMC10851849 DOI: 10.1093/sleep/zsad305] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Indexed: 12/01/2023] Open
Affiliation(s)
- Diego R Mazzotti
- Division of Medical Informatics, Department of Internal Medicine, University of Kansas Medical Center, Kansas City, KS, USA
- Division of Pulmonary Critical Care and Sleep Medicine, Department of Internal Medicine, University of Kansas Medical Center, Kansas City, KS, USA
| |
Collapse
|
4
|
Zieliński T, Hodge JJL, Millar AJ. Keep It Simple: Using README Files to Advance Standardization in Chronobiology. Clocks Sleep 2023; 5:499-506. [PMID: 37754351 PMCID: PMC10529918 DOI: 10.3390/clockssleep5030033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 08/25/2023] [Accepted: 08/28/2023] [Indexed: 09/28/2023] Open
Abstract
Standardization plays a crucial role in ensuring the reliability, reproducibility, and interoperability of research data in the biomedical sciences. Metadata standards are one foundation for the FAIR (Findable, Accessible, Interoperable, and Reusable) principles of data management. They facilitate data discovery, understanding, and reuse. However, the adoption of metadata standards in biological research lags in practice. Barriers such as complexity, lack of incentives, technical challenges, resource constraints, and resistance to change hinder widespread adoption. In the field of chronobiology, standardization is essential but faces particular challenges due to the longitudinal nature of experimental data, diverse model organisms, and varied measurement techniques. To address these challenges, we propose an approach that emphasizes simplicity and practicality: the development of README templates tailored for particular data types and species. Through this opinion article, our intention is to initiate a dialogue and commence a community-driven standardization process by engaging potential contributors and collaborators.
Collapse
Affiliation(s)
- Tomasz Zieliński
- Centre for Engineering Biology, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JD, UK;
| | - James J. L. Hodge
- School of Physiology, Pharmacology and Neuroscience, University of Bristol, Bristol BS8 1TD, UK;
| | - Andrew J. Millar
- Centre for Engineering Biology, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JD, UK;
| |
Collapse
|
5
|
Baum L, Johns M, Poikela M, Möller R, Ananthasubramaniam B, Prasser F. Data integration and analysis for circadian medicine. Acta Physiol (Oxf) 2023; 237:e13951. [PMID: 36790321 DOI: 10.1111/apha.13951] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Revised: 02/04/2023] [Accepted: 02/12/2023] [Indexed: 02/16/2023]
Abstract
Data integration, data sharing, and standardized analyses are important enablers for data-driven medical research. Circadian medicine is an emerging field with a particularly high need for coordinated and systematic collaboration between researchers from different disciplines. Datasets in circadian medicine are multimodal, ranging from molecular circadian profiles and clinical parameters to physiological measurements and data obtained from (wearable) sensors or reported by patients. Uniquely, data spanning both the time dimension and the spatial dimension (across tissues) are needed to obtain a holistic view of the circadian system. The study of human rhythms in the context of circadian medicine has to confront the heterogeneity of clock properties within and across subjects and our inability to repeatedly obtain relevant biosamples from one subject. This requires informatics solutions for integrating and visualizing relevant data types at various temporal resolutions ranging from milliseconds and seconds to minutes and several hours. Associated challenges range from a lack of standards that can be used to represent all required data in a common interoperable form, to challenges related to data storage, to the need to perform transformations for integrated visualizations, and to privacy issues. The downstream analysis of circadian rhythms requires specialized approaches for the identification, characterization, and discrimination of rhythms. We conclude that circadian medicine research provides an ideal environment for developing innovative methods to address challenges related to the collection, integration, visualization, and analysis of multimodal multidimensional biomedical data.
Collapse
Affiliation(s)
- Lena Baum
- Berlin Institute of Health at Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Marco Johns
- Berlin Institute of Health at Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Maija Poikela
- Berlin Institute of Health at Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Ralf Möller
- Institute of Information Systems, University of Lübeck, Lübeck, Germany
| | | | - Fabian Prasser
- Berlin Institute of Health at Charité-Universitätsmedizin Berlin, Berlin, Germany
| |
Collapse
|
6
|
Mazzotti DR, Haendel MA, McMurry JA, Smith CJ, Buysse DJ, Roenneberg T, Penzel T, Purcell S, Redline S, Zhang Y, Merikangas KR, Menetski JP, Mullington J, Boudreau E. Sleep and circadian informatics data harmonization: a workshop report from the Sleep Research Society and Sleep Research Network. Sleep 2022; 45:zsac002. [PMID: 35030631 PMCID: PMC9189941 DOI: 10.1093/sleep/zsac002] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Revised: 12/21/2021] [Indexed: 01/16/2023] Open
Abstract
The increasing availability and complexity of sleep and circadian data are equally exciting and challenging. The field is in constant technological development, generating better high-resolution physiological and molecular data than ever before. Yet, the promise of large-scale studies leveraging millions of patients is limited by suboptimal approaches for data sharing and interoperability. As a result, integration of valuable clinical and basic resources is problematic, preventing knowledge discovery and rapid translation of findings into clinical care. To understand the current data landscape in the sleep and circadian domains, the Sleep Research Society (SRS) and the Sleep Research Network (now a task force of the SRS) organized a workshop on informatics and data harmonization, presented at the World Sleep Congress 2019, in Vancouver, Canada. Experts in translational informatics gathered with sleep research experts to discuss opportunities and challenges in defining strategies for data harmonization. The goal of this workshop was to fuel discussion and foster innovative approaches for data integration and development of informatics infrastructure supporting multi-site collaboration. Key recommendations included collecting and storing findable, accessible, interoperable, and reusable data; identifying existing international cohorts and resources supporting research in sleep and circadian biology; and defining the most relevant sleep data elements and associated metadata that could be supported by early integration initiatives. This report introduces foundational concepts with the goal of facilitating engagement between the sleep/circadian and informatics communities and is a call to action for the implementation and adoption of data harmonization strategies in this domain.
Collapse
Affiliation(s)
- Diego R Mazzotti
- Division of Medical Informatics, Department of Internal Medicine, University of Kansas Medical Center, Kansas City, KS, USA
- Division of Pulmonary Critical Care and Sleep Medicine, Department of Internal Medicine, University of Kansas Medical Center, Kansas City, KS, USA
| | - Melissa A Haendel
- Center for Health AI, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Julie A McMurry
- Center for Health AI, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Connor J Smith
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, OR, USA
| | - Daniel J Buysse
- Department of Psychiatry, University of Pittsburgh, Pittsburgh, PA,USA
| | - Till Roenneberg
- Institute and Polyclinic for Occupational-, Social- and Environmental Medicine, LMU Munich, Germany
| | - Thomas Penzel
- Interdisciplinary Center of Sleep Medicine, Charité University Hospital, Berlin, Germany
| | - Shaun Purcell
- Department of Psychiatry, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
| | - Susan Redline
- Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
| | - Ying Zhang
- Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
| | - Kathleen R Merikangas
- Genetic Epidemiology Research Branch, Intramural Research Program, National Institute of Mental Health, Bethesda, MD, USA
| | | | - Janet Mullington
- Department of Neurology, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA
| | - Eilis Boudreau
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, OR, USA
| |
Collapse
|