Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Castro VM, Minnier J, Murphy SN, Kohane I, Churchill SE, Gainer V, Cai T, Hoffnagle AG, Dai Y, Block S, Weill SR, Nadal-Vicens M, Pollastri AR, Rosenquist JN, Goryachev S, Ongur D, Sklar P, Perlis RH, Smoller JW. Validation of electronic health record phenotyping of bipolar disorder cases and controls. Am J Psychiatry 2015;172:363-72. [PMID: 25827034 PMCID: PMC4441333 DOI: 10.1176/appi.ajp.2014.14030423] [Citation(s) in RCA: 86] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Cited by Other Article(s)

Sikström S, Valavičiūtė I, Kuusela I, Evors N. Question-based computational language approach outperforms rating scales in quantifying emotional states. COMMUNICATIONS PSYCHOLOGY 2024;2:45. [PMID: 39242812 PMCID: PMC11332055 DOI: 10.1038/s44271-024-00097-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Accepted: 05/03/2024] [Indexed: 09/09/2024]

Deo AJ, Castro VM, Baker A, Carroll D, Gonzalez-Heydrich J, Henderson DC, Holt DJ, Hook K, Karmacharya R, Roffman JL, Madsen EM, Song E, Adams WG, Camacho L, Gasman S, Gibbs JS, Fortgang RG, Kennedy CJ, Lozinski G, Perez DC, Wilson M, Reis BY, Smoller JW. Validation of an ICD-Code-Based Case Definition for Psychotic Illness Across Three Health Systems. Schizophr Bull 2024:sbae064. [PMID: 38728421 DOI: 10.1093/schbul/sbae064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 05/12/2024]

Affiliation(s)

Anthony J Deo Department of Psychiatry and Behavioral Sciences, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA Department of Psychiatry, Harvard Medical School, Boston, MA, USA Department of Psychiatry, Rutgers-Robert Wood Johnson Medical School, Piscataway, NJ, USA Psychiatric Evaluation of Adolescent and Child Experiences (P.E.A.C.E.) Program, Rutgers University Behavioral Health Care, Piscataway, NJ, USA
Victor M Castro Research Information Science and Computing, Mass General Brigham, Somerville, MA, USA
Ashley Baker Ascend Integrative Medicine LLC, MA, USA
Devon Carroll Department of Psychiatry and Behavioral Sciences, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA College of Nursing, University of Rhode Island, Providence, RI, USA
Joseph Gonzalez-Heydrich Department of Psychiatry and Behavioral Sciences, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA Department of Psychiatry, Harvard Medical School, Boston, MA, USA Tommy Fuss Center for Neuropsychiatric Disease Research, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA Early Psychosis Investigation Center, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
David C Henderson Boston Medical Center, Boston, MA, USA Boston University Chobanian & Avedisian School of Medicine, Boston, MA, USA
Daphne J Holt Department of Psychiatry, Harvard Medical School, Boston, MA, USA Department of Psychiatry, Massachusetts General Hospital, Boston MA, USA
Kimberly Hook Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA, USA
Rakesh Karmacharya Department of Psychiatry, Harvard Medical School, Boston, MA, USA Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA Chemical Biology and Therapeutic Science Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA Schizophrenia and Bipolar Disorder Program, McLean Hospital, Belmont, MA, USA
Joshua L Roffman Department of Psychiatry, Harvard Medical School, Boston, MA, USA Department of Psychiatry, Massachusetts General Hospital, Boston MA, USA
Emily M Madsen Psychiatric & Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA
Eugene Song Psychiatric & Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
William G Adams Boston Medical Center, Boston, MA, USA
Luisa Camacho Boston Medical Center, Boston, MA, USA
Sarah Gasman Boston Medical Center, Boston, MA, USA
Jada S Gibbs Rutgers New Jersey Medical School, Newark, NJ, USA
Rebecca G Fortgang Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA Department of Psychology, Harvard University, Cambridge, MA, USA
Chris J Kennedy Department of Psychiatry, Harvard Medical School, Boston, MA, USA Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA
Galina Lozinski Boston Medical Center, Boston, MA, USA
Daisy C Perez Boston Medical Center, Boston, MA, USA Boston University Chobanian & Avedisian School of Medicine, Boston, MA, USA
Marina Wilson Psychiatric & Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA
Ben Y Reis Predictive Medicine Group, Harvard Medical School, Boston, MA, USA Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, USA
Jordan W Smoller Department of Psychiatry, Harvard Medical School, Boston, MA, USA Psychiatric & Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA Stanley Center for Psychiatric Research, Broad Institute, Cambridge, MA, USA

Deo AJ, Castro VM, Baker A, Carroll D, Gonzalez-Heydrich J, Henderson DC, Holt DJ, Hook K, Karmacharya R, Roffman JL, Madsen EM, Song E, Adams WG, Camacho L, Gasman S, Gibbs JS, Fortgang RG, Kennedy CJ, Lozinski G, Perez DC, Wilson M, Reis BY, Smoller JW. Validation of an ICD-code-based case definition for psychotic illness across three health systems. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.02.28.24303443. [PMID: 38464074 PMCID: PMC10925367 DOI: 10.1101/2024.02.28.24303443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/12/2024]

Affiliation(s)

Anthony J. Deo Department of Psychiatry and Behavioral Sciences, Boston Children’s Hospital, Harvard Medical School, Boston, MA Department of Psychiatry, Harvard Medical School, Boston, MA Department of Psychiatry, Rutgers-Robert Wood Johnson Medical School, Piscataway, NJ Rutgers University Behavioral Health Care, Piscataway, NJ
Victor M. Castro Research Information Science and Computing, Mass General Brigham, Somerville, MA
Ashley Baker Ascend Integrative Medicine LLC / Massachusetts
Devon Carroll Department of Psychiatry and Behavioral Sciences, Boston Children’s Hospital, Harvard Medical School, Boston, MA University of Rhode Island, Providence, RI, USA
Joseph Gonzalez-Heydrich Department of Psychiatry and Behavioral Sciences, Boston Children’s Hospital, Harvard Medical School, Boston, MA Department of Psychiatry, Harvard Medical School, Boston, MA Tommy Fuss Center for Neuropsychiatric Disease Research, Boston Children’s Hospital, Harvard Medical School, Boston, MA Early Psychosis Investigation Center, Boston Children’s Hospital, Harvard Medical School, Boston, MA
David C. Henderson Boston Medical Center, Boston MA Boston University Chobanian & Avedisian School of Medicine, Boston MA
Daphne J. Holt Department of Psychiatry, Harvard Medical School, Boston, MA Department of Psychiatry, Massachusetts General Hospital, Boston MA
Kimberly Hook Harvard T.H. Chan School of Public Health, Boston, MA
Rakesh Karmacharya Department of Psychiatry, Harvard Medical School, Boston, MA Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA Chemical Biology and Therapeutic Science Program, Broad Institute of MIT and Harvard, Cambridge, MA Schizophrenia and Bipolar Disorder Program, McLean Hospital, Belmont, MA
Joshua L. Roffman Department of Psychiatry, Harvard Medical School, Boston, MA Department of Psychiatry, Massachusetts General Hospital, Boston MA
Emily M. Madsen Psychiatric & Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA
Eugene Song Psychiatric & Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
William G. Adams Boston Medical Center, Boston MA Boston University Chobanian & Avedisian School of Medicine, Boston MA
Luisa Camacho Boston Medical Center, Boston MA
Sarah Gasman Boston Medical Center, Boston MA
Jada S. Gibbs Rutgers New Jersey Medical School, Newark, New Jersey 07103
Rebecca G. Fortgang Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA Department of Psychology, Harvard University, Cambridge, MA
Chris J. Kennedy Department of Psychiatry, Harvard Medical School, Boston, MA Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA
Galina Lozinski Boston Medical Center, Boston MA
Daisy C. Perez Boston Medical Center, Boston MA Boston University Chobanian & Avedisian School of Medicine, Boston MA
Marina Wilson Psychiatric & Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA
Ben Y. Reis Predictive Medicine Group, Harvard Medical School, Boston, MA Computational Health Informatics Program, Boston Children’s Hospital, Boston, MA
Jordan W. Smoller Department of Psychiatry, Harvard Medical School, Boston, MA Psychiatric & Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA Stanley Center for Psychiatric Research, Broad Institute, Cambridge, MA

Walsh CG, Ripperger MA, Hu Y, Sheu YH, Lee H, Wilimitis D, Zheutlin AB, Rocha D, Choi KW, Castro VM, Kirchner HL, Chabris CF, Davis LK, Smoller JW. Development and multi-site external validation of a generalizable risk prediction model for bipolar disorder. Transl Psychiatry 2024;14:58. [PMID: 38272862 PMCID: PMC10810911 DOI: 10.1038/s41398-023-02720-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Revised: 11/29/2023] [Accepted: 12/15/2023] [Indexed: 01/27/2024]

Abstract

Bipolar disorder is a leading contributor to disability, premature mortality, and suicide. Early identification of risk for bipolar disorder using generalizable predictive models trained on diverse cohorts around the United States could improve targeted assessment of high-risk individuals, reduce misdiagnosis, and improve the allocation of limited mental health resources. This observational case-control study intended to develop and validate generalizable predictive models of bipolar disorder as part of the multisite, multinational PsycheMERGE Network across diverse and large biobanks with linked electronic health records (EHRs) from three academic medical centers: in the Northeast (Massachusetts General Brigham), the Mid-Atlantic (Geisinger) and the Mid-South (Vanderbilt University Medical Center). Predictive models were developed and validated with multiple algorithms at each study site: random forests, gradient boosting machines, and penalized regression, as well as stacked ensemble learning algorithms combining them. Predictors were limited to widely available EHR-based features agnostic to a common data model including demographics, diagnostic codes, and medications. The main study outcome was bipolar disorder diagnosis as defined by the International Cohort Collection for Bipolar Disorder, 2015. In total, the study included records for 3,529,569 patients including 12,533 cases (0.3%) of bipolar disorder. After internal and external validation, algorithms demonstrated optimal performance in their respective development sites. The stacked ensemble achieved the best combination of overall discrimination (AUC = 0.82-0.87) and calibration performance with positive predictive values above 5% in the highest-risk quantiles at all three study sites. In conclusion, generalizable predictive models of risk for bipolar disorder can be feasibly developed across diverse sites to enable precision medicine. Comparison of a range of machine learning methods indicated that an ensemble approach provides the best performance overall but required local retraining. These models will be disseminated via the PsycheMERGE Network website.

Zhu T, Kou R, Hu Y, Yuan M, Yuan C, Luo L, Zhang W. Dissecting clinical and biological heterogeneity in clinical states of bipolar disorder: a 10-year retrospective study from China. Front Psychiatry 2023;14:1128862. [PMID: 38179244 PMCID: PMC10764613 DOI: 10.3389/fpsyt.2023.1128862] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Accepted: 12/01/2023] [Indexed: 01/06/2024]

Abstract

Objectives

To dissect clinical and biological heterogeneity in clinical states of bipolar disorder (BD), and investigate if neuropsychological symptomatology, comorbidity, vital signs, and blood laboratory indicators are predictors of distinct BD states.

Methods

A retrospective BD cohort was established with data extracted from a Chinese hospital's electronic medical records (EMR) between 2009 and 2018. Subjects were inpatients with a main discharge diagnosis of BD and were assessed for clinical state at hospitalization. We categorized all subjects into manic state, depressive state, and mixed state. Four machine learning classifiers were utilized to classify the subjects. A Shapley additive explanations (SHAP) algorithm was applied to the classifiers to aid in quantifying and visualizing the contributions of each feature that drive patient-specific classifications.

Results

A sample of 3,085 records was included (38.54% as manic, 56.69% as depressive, and 4.77% as mixed state). Mixed state showed more severe suicidal ideation and psychomotor abnormalities, while depressive state showed more common anxiety, sleep, and somatic-related symptoms and more comorbid conditions. Higher levels of body temperature, pulse, and systolic and diastolic blood pressures were present during manic episodes. XGBoost achieved the best AUC of 88.54% in manic/depressive states classification; logistic regression and random forest achieved the best AUCs of 75.5% and 75% in manic/mixed states and depressive/mixed states classifications, respectively. Myocardial enzymes and the non-enzymatic antioxidants uric acid and bilirubin contributed significantly to distinguishing BD clinical states.

Conclusion

The observed novel biological associations with BD clinical states confirm that biological heterogeneity contributes to clinical heterogeneity of BD.

Zhu T, Liu X, Wang J, Kou R, Hu Y, Yuan M, Yuan C, Luo L, Zhang W. Explainable machine-learning algorithms to differentiate bipolar disorder from major depressive disorder using self-reported symptoms, vital signs, and blood-based markers. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2023;240:107723. [PMID: 37480646 DOI: 10.1016/j.cmpb.2023.107723] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Revised: 06/26/2023] [Accepted: 07/15/2023] [Indexed: 07/24/2023]

Abstract

BACKGROUND AND OBJECTIVE

Because of shared genetic risk factors and similar neuropsychological symptoms, bipolar disorder (BD) and major depressive disorder (MDD) are at high risk of misdiagnosis, which is associated with ineffective treatment and worsening of outcomes. We aimed to develop a machine learning (ML)-based diagnostic system, based on electronic medical records (EMR) data, to mimic the clinical reasoning of human physicians to differentiate MDD and BD (especially BD depressive episodes) patients about to be admitted to a hospital and, hence, reduce the misdiagnosis of BD as MDD on admission. In addition, we examined to what extent our ML model could be made interpretable by quantifying and visualizing the features that drive the predictions.

METHODS

By identifying 16,311 patients admitted to a hospital located in western China between 2009 and 2018 with a recorded main diagnosis of MDD or BD, we established three sub-cohorts with different combinations of features for both the MDD-BD cohort and the MDD-BD depressive episodes cohort, respectively. Four different ML algorithms (logistic regression, extreme gradient boosting (XGBoost), random forest, and support vector machine) and four train-test splits were used to train and validate diagnostic models, and explainable methods (SHAP and Break Down) were utilized to analyze the contribution of each of the features at both population-level and individual-level, including feature importance, feature interaction, and feature effect on prediction decision for a specific subject.

RESULTS

The XGBoost algorithm provided the best test performance (AUC: 0.838 (0.810-0.867), PPV: 0.810 and NPV: 0.834) for separating patients with BD from those with MDD. Core predictors included symptoms (mood-up, exciting, bad sleep, loss of interest, talking, mood-down, provoke), along with age, job, myocardial enzyme markers (creatine kinase, hydroxybutyrate dehydrogenase), diabetes-associated marker (glucose), bone function marker (alkaline phosphatase), non-enzymatic antioxidant (uric acid), markers of immune/inflammation (white blood cell count, lymphocyte count, basophil percentage, monocyte count), cardiovascular function marker (low density lipoprotein), renal marker (total protein), liver biochemistry marker (indirect bilirubin), and vital signs like pulse. For separating patients with BD depressive episodes from those with MDD, the test AUC was 0.777 (0.732-0.822), with PPV 0.576 and NPV 0.899. Additional validation in models built with self-reported symptoms removed from the feature set, showed test AUC of 0.701 (0.666-0.736) for differentiating BD and MDD, and AUC of 0.564 (0.515-0.614) for detecting patients in BD depressive episodes from MDD patients. Validation in the datasets without removing the patients with comorbidity showed an AUC of 0.826 (0.806-0.846).

CONCLUSION

The diagnostic system accurately identified patients with BD in various clinical scenarios, and differences in patterns of peripheral markers between BD and MDD could enrich our understanding of their potential underlying pathophysiological mechanisms.

Kirchner HL, Rocha D, Linner RK, Wilimitis D, Walsh CG, Ripperger M, Lee H, Liu Z, Davis L, Hu Y, Chabris CF, Smoller JW. Association Between Psychiatric Polygenic Scores, Healthcare Utilization and Comorbidity Burden. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.09.29.23296345. [PMID: 37808705 PMCID: PMC10557834 DOI: 10.1101/2023.09.29.23296345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/10/2023]

Abstract

Purpose

To estimate the association of psychiatric polygenic scores with healthcare utilization and comorbidity burden.

Methods

Observational cohort study (N = 118,882) of adolescent and adult biobank participants with linked electronic health records (EHRs) from three diverse study sites (Massachusetts General Brigham, Vanderbilt University Medical Center, Geisinger). Polygenic scores (PGS) were derived from the largest available GWAS of major depressive disorder, bipolar disorder, and schizophrenia at the time of analysis. Negative binomial regression models were used to estimate the association between each psychiatric PGS and healthcare utilization and comorbidity burden. Healthcare utilization was measured as frequency of emergency department (ED), inpatient (IP), and outpatient (OP) visits. Comorbidity burden was defined by the Elixhauser Comorbidity Index and the Charlson Comorbidity Index.

Results

Participants had a median follow-up duration of 12 years in the EHR. Individuals in the top decile of polygenic score for major depressive disorder had significantly more ED visits (RR=1.22, 95% CI 1.17, 1.29) compared to those in the lowest decile. Increases were also observed with IP and comorbidity burden. Among those diagnosed with depression and in the highest decile of the PGS, there was an increase in all utilization types (ED: RR=1.56, 95% CI 1.41, 1.72; OP: RR=1.16, 95% CI 1.08, 1.24; IP: RR=1.23, 95% CI 1.12, 1.36) post-diagnosis. No clinically significant results were observed with bipolar and schizophrenia polygenic scores.

Conclusions

Polygenic score for depression is modestly associated with increased healthcare resource utilization and comorbidity burden, in the absence of diagnosis. Following a diagnosis of depression, the PGS was associated with further increases in healthcare utilization. These findings suggest that depression genetic risk is associated with utilization and burden of chronic disease in real-world settings.

Walsh CG, Ripperger MA, Hu Y, Sheu YH, Wilimitis D, Zheutlin AB, Rocha D, Choi KW, Castro VM, Kirchner HL, Chabris CF, Davis LK, Smoller JW. Development and Multi-Site External Validation of a Generalizable Risk Prediction Model for Bipolar Disorder. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.02.21.23286251. [PMID: 36865341 PMCID: PMC9980254 DOI: 10.1101/2023.02.21.23286251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/27/2023]

Abstract

Bipolar disorder is a leading contributor to disability, premature mortality, and suicide. Early identification of risk for bipolar disorder using generalizable predictive models trained on diverse cohorts around the United States could improve targeted assessment of high-risk individuals, reduce misdiagnosis, and improve the allocation of limited mental health resources. This observational case-control study intended to develop and validate generalizable predictive models of bipolar disorder as part of the multisite, multinational PsycheMERGE Consortium across diverse and large biobanks with linked electronic health records (EHRs) from three academic medical centers: in the Northeast (Massachusetts General Brigham), the Mid-Atlantic (Geisinger) and the Mid-South (Vanderbilt University Medical Center). Predictive models were developed and validated with multiple algorithms at each study site: random forests, gradient boosting machines, and penalized regression, as well as stacked ensemble learning algorithms combining them. Predictors were limited to widely available EHR-based features agnostic to a common data model including demographics, diagnostic codes, and medications. The main study outcome was bipolar disorder diagnosis as defined by the International Cohort Collection for Bipolar Disorder, 2015. In total, the study included records for 3,529,569 patients including 12,533 cases (0.3%) of bipolar disorder. After internal and external validation, algorithms demonstrated optimal performance in their respective development sites. The stacked ensemble achieved the best combination of overall discrimination (AUC = 0.82-0.87) and calibration performance with positive predictive values above 5% in the highest-risk quantiles at all three study sites. In conclusion, generalizable predictive models of risk for bipolar disorder can be feasibly developed across diverse sites to enable precision medicine. Comparison of a range of machine learning methods indicated that an ensemble approach provides the best performance overall but required local retraining. These models will be disseminated via the PsycheMERGE Consortium website.

Kishimoto T, Nakamura H, Kano Y, Eguchi Y, Kitazawa M, Liang KC, Kudo K, Sento A, Takamiya A, Horigome T, Yamasaki T, Sunami Y, Kikuchi T, Nakajima K, Tomita M, Bun S, Momota Y, Sawada K, Murakami J, Takahashi H, Mimura M. Understanding psychiatric illness through natural language processing (UNDERPIN): Rationale, design, and methodology. Front Psychiatry 2022;13:954703. [PMID: 36532181 PMCID: PMC9752868 DOI: 10.3389/fpsyt.2022.954703] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Accepted: 11/11/2022] [Indexed: 12/04/2022]

Abstract

Introduction

Psychiatric disorders are diagnosed through observations of psychiatrists according to diagnostic criteria such as the DSM-5. Such observations, however, are mainly based on each psychiatrist's level of experience and often lack objectivity, potentially leading to disagreements among psychiatrists. In contrast, specific linguistic features can be observed in some psychiatric disorders, such as a loosening of associations in schizophrenia. Some studies explored biomarkers, but biomarkers have yet to be used in clinical practice.

Aim

The purposes of this study are to create a large dataset of Japanese speech data labeled with detailed information on psychiatric disorders and neurocognitive disorders, to quantify the linguistic features of those disorders using natural language processing, and, finally, to develop objective and easy-to-use biomarkers for diagnosing these disorders and assessing their severity.

Methods

This study will have a multi-center prospective design. The DSM-5 or ICD-11 criteria for major depressive disorder, bipolar disorder, schizophrenia, and anxiety disorder and for major and minor neurocognitive disorders will be regarded as the inclusion criteria for the psychiatric disorder samples. For the healthy subjects, the absence of a history of psychiatric disorders will be confirmed using the Mini-International Neuropsychiatric Interview (M.I.N.I.). The absence of current cognitive decline will be confirmed using the Mini-Mental State Examination (MMSE). A psychiatrist or psychologist will conduct 30-to-60-min interviews with each participant; these interviews will include free conversation, picture-description task, and story-telling task, all of which will be recorded using a microphone headset. In addition, the severity of disorders will be assessed using clinical rating scales. Data will be collected from each participant at least twice during the study period and up to a maximum of five times at an interval of at least one month.

Discussion

This study is unique in its large sample size and the novelty of its method, and has potential for applications in many fields. We have some challenges regarding inter-rater reliability and the linguistic peculiarities of Japanese. As of September 2022, we have collected a total of >1000 records from >400 participants. To the best of our knowledge, this data sample is one of the largest in this field.

Clinical Trial Registration

Identifier: UMIN000032141.

Affiliation(s)

Taishiro Kishimoto Department of Neuropsychiatry, Keio University School of Medicine, Tokyo, Japan Hills Joint Research Laboratory for Future Preventive Medicine and Wellness, Keio University School of Medicine, Tokyo, Japan
Hironobu Nakamura Department of Psychiatry and Behavioral Sciences, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University, Tokyo, Japan
Yoshinobu Kano Faculty of Informatics, Shizuoka University, Shizuoka, Japan
Yoko Eguchi Department of Neuropsychiatry, Keio University School of Medicine, Tokyo, Japan
Momoko Kitazawa Department of Neuropsychiatry, Keio University School of Medicine, Tokyo, Japan
Kuo-ching Liang Department of Neuropsychiatry, Keio University School of Medicine, Tokyo, Japan
Koki Kudo Department of Neuropsychiatry, Keio University School of Medicine, Tokyo, Japan Department of Neuropsychiatry, St. Marianna University School of Medicine Hospital, Kawasaki, Japan
Ayako Sento Department of Neuropsychiatry, Keio University School of Medicine, Tokyo, Japan
Akihiro Takamiya Department of Neuropsychiatry, Keio University School of Medicine, Tokyo, Japan
Toshiro Horigome Department of Neuropsychiatry, Keio University School of Medicine, Tokyo, Japan
Toshihiko Yamasaki Computer Vision and Media Lab (Yamasaki Lab), Department of Information and Communication Engineering, Graduate School of Information Science and Technology, The University of Tokyo, Tokyo, Japan
Yuki Sunami Keio University School of Medicine, Tokyo, Japan
Toshiaki Kikuchi Department of Neuropsychiatry, Keio University School of Medicine, Tokyo, Japan
Kazuki Nakajima Department of Neuropsychiatry, Keio University School of Medicine, Tokyo, Japan
Masayuki Tomita Department of Psychiatry, Oizumi Hospital, Tokyo, Japan
Shogyoku Bun Department of Neuropsychiatry, Keio University School of Medicine, Tokyo, Japan Department of Psychiatry, Koutokukai Sato Hospital, Yamagata, Japan
Yuki Momota Department of Neuropsychiatry, Keio University School of Medicine, Tokyo, Japan
Kyosuke Sawada Department of Neuropsychiatry, Keio University School of Medicine, Tokyo, Japan
Junichi Murakami Department of Psychiatry, Biwako Hospital, Otsu, Japan
Hidehiko Takahashi Department of Psychiatry and Behavioral Sciences, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University, Tokyo, Japan
Masaru Mimura Department of Neuropsychiatry, Keio University School of Medicine, Tokyo, Japan

Chen ZS, Kulkarni PP, Galatzer-Levy IR, Bigio B, Nasca C, Zhang Y. Modern views of machine learning for precision psychiatry. PATTERNS (NEW YORK, N.Y.) 2022;3:100602. [PMID: 36419447 PMCID: PMC9676543 DOI: 10.1016/j.patter.2022.100602] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

Benson NM, Yang Z, Weiss M, Fung V, Moran LV, Öngür D, Hsu J. Identifying Diagnoses of Schizophrenia Spectrum Disorder in Large Data Sets. Psychiatr Serv 2022;73:1210-1216. [PMID: 35440163 PMCID: PMC9582046 DOI: 10.1176/appi.ps.202100696] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]

Ahuja Y, Wen J, Hong C, Xia Z, Huang S, Cai T. A semi-supervised adaptive Markov Gaussian embedding process (SAMGEP) for prediction of phenotype event times using the electronic health record. Sci Rep 2022;12:17737. [PMID: 36273240 PMCID: PMC9588081 DOI: 10.1038/s41598-022-22585-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2021] [Accepted: 10/17/2022] [Indexed: 01/18/2023] Open

Swerdel JN, Schuemie M, Murray G, Ryan PB. PheValuator 2.0: Methodological improvements for the PheValuator approach to semi-automated phenotype algorithm evaluation. J Biomed Inform 2022;135:104177. [PMID: 35995107 DOI: 10.1016/j.jbi.2022.104177] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2022] [Revised: 08/11/2022] [Accepted: 08/15/2022] [Indexed: 10/31/2022]

Abstract

PURPOSE

Phenotype algorithms are central to performing analyses using observational data. These algorithms translate the clinical idea of a health condition into an executable set of rules allowing for queries of data elements from a database. PheValuator, a software package in the Observational Health Data Sciences and Informatics (OHDSI) tool stack, provides a method to assess the performance characteristics of these algorithms, namely, sensitivity, specificity, and positive and negative predictive value. It uses machine learning to develop predictive models for determining a probabilistic gold standard of subjects for assessment of cases and non-cases of health conditions. PheValuator was developed to complement or even replace the traditional approach of algorithm validation, i.e., by expert assessment of subject records through chart review. Results in our first PheValuator paper suggest a systematic underestimation of the PPV compared to previous results using chart review. In this paper we evaluate modifications made to the method designed to improve its performance.

METHODS

The major changes to PheValuator included allowing all diagnostic conditions, clinical observations, drug prescriptions, and laboratory measurements to be included as predictors within the modeling process whereas in the prior version there were significant restrictions on the included predictors. We also have allowed for the inclusion of the temporal relationships of the predictors in the model. To evaluate the performance of the new method, we compared the results from the new and original methods against results found from the literature using traditional validation of algorithms for 19 phenotypes. We performed these tests using data from five commercial databases.

RESULTS

In the assessment aggregating all phenotype algorithms, the median difference between the PheValuator estimate and the gold standard estimate for PPV was reduced from -21 (IQR -34, -3) in Version 1.0 to 4 (IQR -3, 15) using Version 2.0. We found a median difference in specificity of 3 (IQR 1, 4.25) for Version 1.0 and 3 (IQR 1, 4) for Version 2.0. The median difference between the two versions of PheValuator and the gold standard for estimates of sensitivity was reduced from -39 (-51, -20) to -16 (-34, -6).

CONCLUSION

PheValuator 2.0 produces estimates of the performance characteristics of phenotype algorithms that are significantly closer to estimates from traditional validation through chart review than those of version 1.0. With this tool in researchers' toolkits, methods such as quantitative bias analysis may now be used to improve the reliability and reproducibility of research studies using observational data.
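
The four performance characteristics named in this abstract can be illustrated with a probabilistic gold standard along the following lines. This is a minimal sketch, not PheValuator's actual implementation: it assumes each subject carries a model-predicted probability of being a true case, and that expected confusion-matrix counts are formed by summing those probabilities. The function name `expected_performance` and the toy data are hypothetical.

```python
from typing import Sequence


def expected_performance(probs: Sequence[float], flagged: Sequence[bool]) -> dict:
    """Estimate sensitivity, specificity, PPV, and NPV from case probabilities.

    probs[i]   : assumed probability that subject i is a true case
    flagged[i] : whether the phenotype algorithm selected subject i
    """
    # Expected counts: each subject contributes p to "case" and 1 - p to "non-case".
    tp = sum(p for p, f in zip(probs, flagged) if f)
    fp = sum(1 - p for p, f in zip(probs, flagged) if f)
    fn = sum(p for p, f in zip(probs, flagged) if not f)
    tn = sum(1 - p for p, f in zip(probs, flagged) if not f)
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
    }


# Hypothetical toy cohort: four subjects, two flagged by the algorithm.
metrics = expected_performance([0.9, 0.2, 0.7, 0.1], [True, True, False, False])
```

The sketch does not guard against empty groups; a cohort with no flagged or no unflagged subjects would need explicit handling of the zero denominators.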

Mahmoudi E, Wu W, Najarian C, Aikens J, Bynum J, Vydiswaran VV. Identify Caregiver Availability Using Medical Notes: Rule-Based Natural Language Processing. JMIR Aging 2022;5:e40241. [PMID: 35998328 PMCID: PMC9539648 DOI: 10.2196/40241] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2022] [Revised: 07/28/2022] [Accepted: 08/16/2022] [Indexed: 11/23/2022] Open

Abstract

Background

Identifying caregiver availability, particularly for patients with dementia or those with a disability, is critical to informing the appropriate care planning by the health systems, hospitals, and providers. This information is not readily available, and there is a paucity of pragmatic approaches to automatically identifying caregiver availability and type.

Objective

Our main objective was to use medical notes to assess caregiver availability and type for hospitalized patients with dementia. Our second objective was to identify whether the patient lived at home or resided at an institution.

Methods

In this retrospective cohort study, we used 2016-2019 telephone-encounter medical notes from a single institution to develop a rule-based natural language processing (NLP) algorithm to identify the patient’s caregiver availability and place of residence. Using note-level data, we compared the results of the NLP algorithm with human-conducted chart abstraction for both training (749/976, 77%) and test sets (227/976, 23%) for a total of 223 adults aged 65 years and older diagnosed with dementia. Our outcomes included determining whether the patients (1) reside at home or in an institution, (2) have a formal caregiver, and (3) have an informal caregiver.

Results

Test set results indicated that our NLP algorithm had a high level of accuracy and reliability for identifying whether patients had an informal caregiver (F₁=0.94, accuracy=0.95, sensitivity=0.97, and specificity=0.93), but was relatively less able to identify whether the patient lived at an institution (F₁=0.64, accuracy=0.90, sensitivity=0.51, and specificity=0.98). The most common explanations for NLP misclassifications across all categories were (1) incomplete or misspelled facility names; (2) past, uncertain, or undecided status; (3) uncommon abbreviations; and (4) irregular use of templates.

Conclusions

This innovative work was the first to use medical notes to pragmatically determine caregiver availability. Our NLP algorithm identified whether hospitalized patients with dementia have a formal or informal caregiver and, to a lesser extent, whether they lived at home or in an institutional setting. There is merit in using NLP to identify caregivers. This study serves as a proof of concept. Future work can use other approaches and further identify caregivers and the extent of their availability.
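
A rule-based approach of the kind described in this abstract might look roughly like the sketch below. The keyword lists are hypothetical placeholders, since the abstract does not give the study's actual rules, and `classify_note` is an illustrative name rather than anything from the paper.

```python
import re

# Hypothetical keyword rules; the study's real rule set is not given in the abstract.
RULES = {
    "informal_caregiver": re.compile(
        r"\b(daughter|son|wife|husband|spouse|niece|nephew)\b", re.IGNORECASE
    ),
    "formal_caregiver": re.compile(
        r"\b(home health aide|visiting nurse|paid caregiver)\b", re.IGNORECASE
    ),
    "institution": re.compile(
        r"\b(nursing home|assisted living|skilled nursing facility)\b", re.IGNORECASE
    ),
}


def classify_note(text: str) -> dict:
    """Return one boolean per category: True if any rule keyword appears in the note."""
    return {label: bool(pattern.search(text)) for label, pattern in RULES.items()}


# Hypothetical telephone-encounter note.
note = "Spoke with patient's daughter, who manages medications. Pt lives at home."
labels = classify_note(note)
```

A plain keyword match like this would be prone to the misclassification sources the abstract lists, such as misspelled facility names and mentions of past or uncertain status, which is one reason a real rule set needs negation and context handling.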

An electronic health record (EHR) phenotype algorithm to identify patients with attention deficit hyperactivity disorders (ADHD) and psychiatric comorbidities. J Neurodev Disord 2022;14:37. [PMID: 35690720 PMCID: PMC9188139 DOI: 10.1186/s11689-022-09447-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Accepted: 05/31/2022] [Indexed: 11/10/2022] Open

Abstract

Background

In over half of pediatric cases, ADHD presents with comorbidities, and often, it is unclear whether the symptoms causing impairment are due to the comorbidity or the underlying ADHD. Comorbid conditions increase the likelihood for a more severe and persistent course and complicate treatment decisions. Therefore, it is highly important to establish an algorithm that identifies ADHD and comorbidities in order to improve research on ADHD using biorepository and other electronic record data.

Methods

It is feasible to accurately distinguish between ADHD in isolation from ADHD with comorbidities using an electronic algorithm designed to include other psychiatric disorders. We sought to develop an EHR phenotype algorithm to discriminate cases with ADHD in isolation from cases with ADHD with comorbidities more effectively for efficient future searches in large biorepositories. We developed a multi-source algorithm allowing for a more complete view of the patient’s EHR, leveraging the biobank of the Center for Applied Genomics (CAG) at Children’s Hospital of Philadelphia (CHOP). We mined EHRs from 2009 to 2016 using International Statistical Classification of Diseases and Related Health Problems (ICD) codes, medication history and keywords specific to ADHD, and comorbid psychiatric disorders to facilitate genotype-phenotype correlation efforts. Chart abstractions and behavioral surveys added evidence in support of the psychiatric diagnoses. Most notably, the algorithm did not exclude other psychiatric disorders, as is the case in many previous algorithms. Controls lacked psychiatric and other neurological disorders. Participants enrolled in various CAG studies at CHOP and completed a broad informed consent, including consent for prospective analyses of EHRs. We created and validated an EHR-based algorithm to classify ADHD and comorbid psychiatric status in a pediatric healthcare network to be used in future genetic analyses and discovery-based studies.

Results

In this retrospective case-control study that included data from 51,293 subjects, 5840 ADHD cases were discovered of which 46.1% had ADHD alone and 53.9% had ADHD with psychiatric comorbidities. Our primary study outcome was to examine whether the algorithm could identify and distinguish ADHD exclusive cases from ADHD comorbid cases. The results indicate ICD codes coupled with medication searches revealed the most cases. We discovered ADHD-related keywords did not increase yield. However, we found including ADHD-specific medications increased our number of cases by 21%. Positive predictive values (PPVs) were 95% for ADHD cases and 93% for controls.

Conclusion

We established a new algorithm and demonstrated the feasibility of the electronic algorithm approach to accurately diagnose ADHD and comorbid conditions, verifying the efficiency of our large biorepository for further genetic discovery-based analyses.

Trial registration

ClinicalTrials.gov, NCT02286817. First posted on 10 November 2014. ClinicalTrials.gov, NCT02777931. First posted on 19 May 2016. ClinicalTrials.gov, NCT03006367. First posted on 30 December 2016. ClinicalTrials.gov, NCT02895906. First posted on 12 September 2016.

Supplementary Information

The online version contains supplementary material available at 10.1186/s11689-022-09447-9.

Klann JG, Strasser ZH, Hutch MR, Kennedy CJ, Marwaha JS, Morris M, Samayamuthu MJ, Pfaff AC, Estiri H, South AM, Weber GM, Yuan W, Avillach P, Wagholikar KB, Luo Y, Omenn GS, Visweswaran S, Holmes JH, Xia Z, Brat GA, Murphy SN. Distinguishing Admissions Specifically for COVID-19 From Incidental SARS-CoV-2 Admissions: National Retrospective Electronic Health Record Study. J Med Internet Res 2022;24:e37931. [PMID: 35476727 PMCID: PMC9119395 DOI: 10.2196/37931] [Citation(s) in RCA: 25] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Revised: 04/22/2022] [Accepted: 04/22/2022] [Indexed: 01/16/2023] Open

Abstract

BACKGROUND

Admissions are generally classified as COVID-19 hospitalizations if the patient has a positive SARS-CoV-2 polymerase chain reaction (PCR) test. However, because 35% of SARS-CoV-2 infections are asymptomatic, patients admitted for unrelated indications with an incidentally positive test could be misclassified as COVID-19 hospitalizations. Electronic health record (EHR)-based studies have been unable to distinguish a hospitalization specifically for COVID-19 from an incidental SARS-CoV-2 hospitalization. Although the need to improve classification of COVID-19 versus incidental SARS-CoV-2 is well understood, the magnitude of the problem has only been characterized in small, single-center studies. Furthermore, there have been no peer-reviewed studies evaluating methods for improving classification.

OBJECTIVE

The aims of this study are to, first, quantify the frequency of incidental hospitalizations over the first 15 months of the pandemic in multiple hospital systems in the United States and, second, to apply electronic phenotyping techniques to automatically improve COVID-19 hospitalization classification.

METHODS

From a retrospective EHR-based cohort in 4 US health care systems in Massachusetts, Pennsylvania, and Illinois, a random sample of 1123 SARS-CoV-2 PCR-positive patients hospitalized from March 2020 to August 2021 was manually chart-reviewed and classified as "admitted with COVID-19" (incidental) versus specifically admitted for COVID-19 ("for COVID-19"). EHR-based phenotyping was used to find feature sets to filter out incidental admissions.

RESULTS

EHR-based phenotyped feature sets filtered out incidental admissions, which occurred in an average of 26% of hospitalizations (although this varied widely over time, from 0% to 75%). The top site-specific feature sets had 79%-99% specificity with 62%-75% sensitivity, while the best-performing across-site feature sets had 71%-94% specificity with 69%-81% sensitivity.

CONCLUSIONS

A large proportion of SARS-CoV-2 PCR-positive admissions were incidental. Straightforward EHR-based phenotypes differentiated admissions, which is important to assure accurate public health reporting and research.

Affiliation(s)

Jeffrey G Klann Laboratory of Computer Science, Department of Medicine, Massachusetts General Hospital, Boston, MA, United States
Zachary H Strasser Laboratory of Computer Science, Department of Medicine, Massachusetts General Hospital, Boston, MA, United States
Meghan R Hutch Department of Preventive Medicine, Northwestern University, Chicago, IL, United States
Chris J Kennedy Center for Precision Psychiatry, Massachusetts General Hospital, Boston, MA, United States
Jayson S Marwaha Department of Surgery, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, United States
Michele Morris Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, United States
Malarkodi Jebathilagam Samayamuthu Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, United States
Ashley C Pfaff Department of Surgery, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, United States
Hossein Estiri Laboratory of Computer Science, Department of Medicine, Massachusetts General Hospital, Boston, MA, United States
Andrew M South Section of Nephrology, Department of Pediatrics, Brenner Children's, Wake Forest School of Medicine, Winston Salem, NC, United States
Griffin M Weber see Acknowledgments
William Yuan see Acknowledgments
Paul Avillach see Acknowledgments
Kavishwar B Wagholikar Laboratory of Computer Science, Department of Medicine, Massachusetts General Hospital, Boston, MA, United States
Yuan Luo Department of Preventive Medicine, Northwestern University, Chicago, IL, United States
Gilbert S Omenn Center for Computational Medicine & Bioinformatics, University of Michigan, Ann Arbor, MI, United States
Shyam Visweswaran Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, United States
John H Holmes Department of Biostatistics, Epidemiology, and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
Zongqi Xia Department of Neurology, University of Pittsburgh, Pittsburgh, PA, United States
Gabriel A Brat see Acknowledgments
Shawn N Murphy Department of Neurology, Massachusetts General Hospital, Boston, MA, United States

Harvey D, Lobban F, Rayson P, Warner A, Jones S. Natural Language Processing Methods and Bipolar Disorder: Scoping Review. JMIR Ment Health 2022;9:e35928. [PMID: 35451984 PMCID: PMC9077496 DOI: 10.2196/35928] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 03/15/2022] [Accepted: 03/20/2022] [Indexed: 02/05/2023] Open

Abstract

BACKGROUND

Health researchers are increasingly using natural language processing (NLP) to study various mental health conditions using both social media and electronic health records (EHRs). There is currently no published synthesis that relates specifically to the use of NLP methods for bipolar disorder, and this scoping review was conducted to synthesize valuable insights that have been presented in the literature.

OBJECTIVE

This scoping review explored how NLP methods have been used in research to better understand bipolar disorder and identify opportunities for further use of these methods.

METHODS

A systematic, computerized search of index and free-text terms related to bipolar disorder and NLP was conducted using 5 databases and 1 anthology: MEDLINE, PsycINFO, Academic Search Ultimate, Scopus, Web of Science Core Collection, and the ACL Anthology.

RESULTS

Of 507 identified studies, a total of 35 (6.9%) studies met the inclusion criteria. A narrative synthesis was used to describe the data, and the studies were grouped into four objectives: prediction and classification (n=25), characterization of the language of bipolar disorder (n=13), use of EHRs to measure health outcomes (n=3), and use of EHRs for phenotyping (n=2). Ethical considerations were reported in 60% (21/35) of the studies.

CONCLUSIONS

The current literature demonstrates how language analysis can be used to assist in and improve the provision of care for people living with bipolar disorder. Individuals with bipolar disorder and the medical community could benefit from research that uses NLP to investigate risk-taking, web-based services, social and occupational functioning, and the representation of gender in bipolar disorder populations on the web. Future research that implements NLP methods to study bipolar disorder should be governed by ethical principles, and any decisions regarding the collection and sharing of data sets should ultimately be made on a case-by-case basis, considering the risk to the data participants and whether their privacy can be ensured.

Birnbaum R, Mahjani B, Loos RJF, Sharp AJ. Clinical Characterization of Copy Number Variants Associated With Neurodevelopmental Disorders in a Large-scale Multiancestry Biobank. JAMA Psychiatry 2022;79:250-259. [PMID: 35080590 PMCID: PMC8792794 DOI: 10.1001/jamapsychiatry.2021.4080] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Accepted: 11/30/2021] [Indexed: 01/28/2023]

Abstract

IMPORTANCE

Past studies identified rare copy number variants (CNVs) as risk factors for neurodevelopmental disorders (NDDs), including autism spectrum disorder and schizophrenia. However, the clinical characterization of NDD CNVs is understudied in population cohorts unselected for neuropsychiatric disorders and in cohorts of diverse ancestry.

OBJECTIVE

To identify individuals harboring NDD CNVs in a multiancestry biobank and to query their enrichment for select neuropsychiatric disorders as well as association with multiple medical disorders.

DESIGN, SETTINGS, AND PARTICIPANTS

In a series of phenotypic enrichment and association analyses, NDD CNVs were clinically characterized among 24 877 participants in the BioMe biobank, an electronic health record-linked biobank derived from the Mount Sinai Health System, New York, New York. Participants were recruited into the biobank since September 2007 across diverse ancestry and medical and neuropsychiatric specialties. For the current analyses, electronic health record data were analyzed from May 2004 through May 2019.

MAIN OUTCOMES AND MEASURES

NDD CNVs were identified using a consensus of 2 CNV calling algorithms, based on whole-exome sequencing and genotype array data, followed by novel in-silico clinical assessments.

RESULTS

Of 24 877 participants, 14 586 (58.7%) were female; self-reported ancestry categories included 5965 (24.0%) who were of African ancestry, 7892 (31.7%) who were of European ancestry, and 8536 (34.3%) who were of Hispanic ancestry; and the mean (SD) age was 50.5 (17.3) years. Among 24 877 individuals, the prevalence of 64 NDD CNVs was 2.5% (n = 627), with prevalence varying by locus, corroborating the presence of some relatively highly prevalent NDD CNVs (eg, 15q11.2 deletion/duplication). An aggregate set of NDD CNVs were enriched for congenital disorders (odds ratio, 2.0; 95% CI, 1.1-3.5; P = .01) and major depressive disorder (odds ratio, 1.5; 95% CI, 1.1-2.0; P = .01). In a meta-analysis of medical diagnoses (n = 195 hierarchically clustered diagnostic codes), NDD CNVs were significantly associated with several medical outcomes, including essential hypertension (z score = 3.6; P = 2.8 × 10⁻⁴), kidney failure (z score = 3.3; P = 1.1 × 10⁻³), and obstructive sleep apnea (z score = 3.4; P = 8.1 × 10⁻⁴) and, in another analysis, morbid obesity (z score = 3.8; P = 1.3 × 10⁻⁴). Further, NDD CNVs were associated with increased body mass index in a multiancestry analysis (β = 0.19; 95% CI, 0.10-0.31; P = .003). For 36 common serum tests, there was no association with NDD CNVs.

CONCLUSIONS AND RELEVANCE

Clinical features of individuals harboring NDD CNVs were elucidated in a large-scale, multiancestry biobank, identifying enrichments for congenital disorders and major depressive disorder as well as associations with several medical outcomes, including hypertension, kidney failure, and obesity and obesity-related phenotypes, specifically obstructive sleep apnea and increased body mass index. The association between NDD CNVs and obesity outcomes indicates further potential pleiotropy of NDD CNVs beyond the neurodevelopmental outcomes previously reported. Future clinical genetic investigations may lead to insights into at-risk individuals and therapeutic strategies targeting specific genetic variants. The importance of diverse inclusion within biobanks and of considering the effect of rare genetic variants in a multiancestry context is evident.

Klann JG, Strasser ZH, Hutch MR, Kennedy CJ, Marwaha JS, Morris M, Samayamuthu MJ, Pfaff AC, Estiri H, South AM, Weber GM, Yuan W, Avillach P, Wagholikar KB, Luo Y, Omenn GS, Visweswaran S, Holmes JH, Xia Z, Brat GA, Murphy SN. Distinguishing Admissions Specifically for COVID-19 from Incidental SARS-CoV-2 Admissions: A National EHR Research Consortium Study. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2022:2022.02.10.22270728. [PMID: 35350202 PMCID: PMC8963684 DOI: 10.1101/2022.02.10.22270728] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]

Loebel A, Koblan KS, Tsai J, Deng L, Fava M, Kent J, Hopkins SC. A Randomized, Double-blind, Placebo-controlled Proof-of-Concept Trial to Evaluate the Efficacy and Safety of Non-racemic Amisulpride (SEP-4199) for the Treatment of Bipolar I Depression. J Affect Disord 2022;296:549-558. [PMID: 34614447 DOI: 10.1016/j.jad.2021.09.109] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/24/2021] [Revised: 09/16/2021] [Accepted: 09/29/2021] [Indexed: 12/11/2022]

Crema C, Attardi G, Sartiano D, Redolfi A. Natural language processing in clinical neuroscience and psychiatry: A review. Front Psychiatry 2022;13:946387. [PMID: 36186874 PMCID: PMC9515453 DOI: 10.3389/fpsyt.2022.946387] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Accepted: 08/22/2022] [Indexed: 11/13/2022] Open

Guo A, Stephens KA, Khan YM, Langabeer JR, Foraker RE. Women and ethnoracial minorities with poor cardiovascular health measures associated with a higher risk of developing mood disorder. BMC Med Inform Decis Mak 2021;21:361. [PMID: 34952584 PMCID: PMC8709948 DOI: 10.1186/s12911-021-01674-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2020] [Accepted: 10/29/2021] [Indexed: 11/30/2022] Open

An independently validated, portable algorithm for the rapid identification of COPD patients using electronic health records. Sci Rep 2021;11:19959. [PMID: 34620889 PMCID: PMC8497529 DOI: 10.1038/s41598-021-98719-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Accepted: 08/25/2021] [Indexed: 11/24/2022] Open

Chapman M, Mumtaz S, Rasmussen LV, Karwath A, Gkoutos GV, Gao C, Thayer D, Pacheco JA, Parkinson H, Richesson RL, Jefferson E, Denaxas S, Curcin V. Desiderata for the development of next-generation electronic health record phenotype libraries. Gigascience 2021;10:giab059. [PMID: 34508578 PMCID: PMC8434766 DOI: 10.1093/gigascience/giab059] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Revised: 07/15/2021] [Accepted: 08/18/2021] [Indexed: 11/22/2022] Open

Berchuck SI, Jammal AA, Mukherjee S, Somers TJ, Medeiros FA. Impact of anxiety and depression on progression to glaucoma among glaucoma suspects. Br J Ophthalmol 2021;105:1244-1249. [PMID: 32862132 PMCID: PMC9924953 DOI: 10.1136/bjophthalmol-2020-316617] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2020] [Revised: 07/24/2020] [Accepted: 08/01/2020] [Indexed: 01/12/2023]

Estiri H, Strasser ZH, Murphy SN. High-throughput phenotyping with temporal sequences. J Am Med Inform Assoc 2021;28:772-781. [PMID: 33313899 DOI: 10.1093/jamia/ocaa288] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Accepted: 11/04/2020] [Indexed: 12/15/2022] Open

Abstract

OBJECTIVE

High-throughput electronic phenotyping algorithms can accelerate translational research using data from electronic health record (EHR) systems. The temporal information buried in EHRs is often underutilized in developing computational phenotypic definitions. This study aims to develop a high-throughput phenotyping method, leveraging temporal sequential patterns from EHRs.

MATERIALS AND METHODS

We develop a representation mining algorithm to extract 5 classes of representations from EHR diagnosis and medication records: the aggregated vector of the records (aggregated vector representation), the standard sequential patterns (sequential pattern mining), the transitive sequential patterns (transitive sequential pattern mining), and 2 hybrid classes. Using EHR data on 10 phenotypes from the Mass General Brigham Biobank, we train and validate phenotyping algorithms.

RESULTS

Phenotyping with temporal sequences resulted in a superior classification performance across all 10 phenotypes compared with the standard representations in electronic phenotyping. The high-throughput algorithm's classification performance was superior or similar to the performance of previously published electronic phenotyping algorithms. We characterize and evaluate the top transitive sequences of diagnosis records paired with the records of risk factors, symptoms, complications, medications, or vaccinations.

DISCUSSION

The proposed high-throughput phenotyping approach enables seamless discovery of sequential record combinations that may be difficult to assume from raw EHR data. Transitive sequences offer more accurate characterization of the phenotype, compared with its individual components, and reflect the actual lived experiences of the patients with that particular disease.

CONCLUSION

Sequential data representations provide a precise mechanism for incorporating raw EHR records into downstream machine learning. Our approach starts with user interpretability and works backward to the technology.
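
The idea of mining temporal sequences from diagnosis and medication records can be sketched as follows. This is an illustrative reading of "transitive sequential patterns" as every ordered pair of distinct records in a patient's timeline (not only adjacent ones); it is an assumption, not the authors' published algorithm. The function name, record layout, and toy patients are hypothetical.

```python
from collections import Counter
from itertools import combinations


def transitive_sequences(records):
    """Return the set of ordered (earlier, later) code pairs for one patient.

    records: list of (iso_date, code) tuples, e.g. ("2020-01-05", "hypertension").
    """
    ordered = sorted(records)  # ISO-format dates sort chronologically as strings
    pairs = set()
    for (d1, a), (d2, b) in combinations(ordered, 2):
        # Keep only strictly ordered pairs of different codes.
        if d1 < d2 and a != b:
            pairs.add((a, b))
    return pairs


# Hypothetical toy timelines for two patients.
patients = {
    "p1": [
        ("2020-01-05", "hypertension"),
        ("2020-03-01", "metformin"),
        ("2021-02-10", "type 2 diabetes"),
    ],
    "p2": [
        ("2019-06-01", "hypertension"),
        ("2020-01-01", "type 2 diabetes"),
    ],
}

# Count in how many patients each ordered pair occurs; such counts could
# serve as candidate features for a downstream phenotype classifier.
counts = Counter(
    pair for recs in patients.values() for pair in transitive_sequences(recs)
)
```

Because each pair is counted once per patient, the resulting counts read as the number of patients showing that sequence rather than the number of record occurrences.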

Liu L, Bustamante R, Earles A, Demb J, Messer K, Gupta S. A strategy for validation of variables derived from large-scale electronic health record data. J Biomed Inform 2021;121:103879. [PMID: 34329789 PMCID: PMC9615095 DOI: 10.1016/j.jbi.2021.103879] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2021] [Revised: 07/21/2021] [Accepted: 07/24/2021] [Indexed: 11/16/2022]

Percha B. Modern Clinical Text Mining: A Guide and Review. Annu Rev Biomed Data Sci 2021;4:165-187. [PMID: 34465177 DOI: 10.1146/annurev-biodatasci-030421-030931] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Teneralli RE, Kern DM, Cepeda MS, Gilbert JP, Drevets WC. Exploring real-world evidence to uncover unknown drug benefits and support the discovery of new treatment targets for depressive and bipolar disorders. J Affect Disord 2021;290:324-333. [PMID: 34020207 DOI: 10.1016/j.jad.2021.04.096] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/15/2020] [Revised: 02/19/2021] [Accepted: 04/25/2021] [Indexed: 12/28/2022]

Le Glaz A, Haralambous Y, Kim-Dufor DH, Lenca P, Billot R, Ryan TC, Marsh J, DeVylder J, Walter M, Berrouiguet S, Lemey C. Machine Learning and Natural Language Processing in Mental Health: Systematic Review. J Med Internet Res 2021;23:e15708. [PMID: 33944788 PMCID: PMC8132982 DOI: 10.2196/15708] [Citation(s) in RCA: 94] [Impact Index Per Article: 31.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2019] [Revised: 04/18/2020] [Accepted: 10/02/2020] [Indexed: 01/22/2023] Open

Abstract

BACKGROUND

Machine learning systems are part of the field of artificial intelligence that automatically learn models from data to make better decisions. Natural language processing (NLP), by using corpora and learning approaches, provides good performance in statistical tasks, such as text classification or sentiment mining.

OBJECTIVE

The primary aim of this systematic review was to summarize and characterize, in methodological and technical terms, studies that used machine learning and NLP techniques for mental health. The secondary aim was to consider the potential use of these methods in mental health clinical practice.

METHODS

This systematic review follows the PRISMA (Preferred Reporting Items for Systematic Review and Meta-analysis) guidelines and is registered with PROSPERO (Prospective Register of Systematic Reviews; number CRD42019107376). The search was conducted using 4 medical databases (PubMed, Scopus, ScienceDirect, and PsycINFO) with the following keywords: machine learning, data mining, psychiatry, mental health, and mental disorder. The exclusion criteria were as follows: languages other than English, anonymization process, case studies, conference papers, and reviews. No limitations on publication dates were imposed.

RESULTS

A total of 327 articles were identified, of which 269 (82.3%) were excluded and 58 (17.7%) were included in the review. The results were organized through a qualitative perspective. Although studies had heterogeneous topics and methods, some themes emerged. Population studies could be grouped into 3 categories: patients included in medical databases, patients who came to the emergency room, and social media users. The main objectives were to extract symptoms, classify severity of illness, compare therapy effectiveness, provide psychopathological clues, and challenge the current nosography. Medical records and social media were the 2 major data sources. With regard to the methods used, preprocessing used the standard methods of NLP and unique identifier extraction dedicated to medical texts. Efficient classifiers were preferred rather than transparent functioning classifiers. Python was the most frequently used platform.

CONCLUSIONS

Machine learning and NLP models have been highly topical issues in medicine in recent years and may be considered a new paradigm in medical research. However, these processes tend to confirm clinical hypotheses rather than developing entirely new information, and only one major category of the population (ie, social media users) is an imprecise cohort. Moreover, some language-specific features can improve the performance of NLP methods, and their extension to other languages should be more closely investigated. However, machine learning and NLP techniques provide useful information from unexplored data (ie, patients' daily habits that are usually inaccessible to care providers). Before considering them as an additional tool of mental health care, ethical issues remain and should be discussed in a timely manner. Machine learning and NLP methods may offer multiple perspectives in mental health research but should also be considered as tools to support clinical practice.

Zhao Y, Fu S, Bielinski SJ, Decker PA, Chamberlain AM, Roger VL, Liu H, Larson NB. Natural Language Processing and Machine Learning for Identifying Incident Stroke From Electronic Health Records: Algorithm Development and Validation. J Med Internet Res 2021;23:e22951. [PMID: 33683212 PMCID: PMC7985804 DOI: 10.2196/22951] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 07/27/2020] [Revised: 08/25/2020] [Accepted: 01/20/2021] [Indexed: 11/29/2022]

Abstract

Background

Stroke is an important clinical outcome in cardiovascular research. However, the ascertainment of incident stroke is typically accomplished via time-consuming manual chart abstraction. Current phenotyping efforts using electronic health records for stroke focus on case ascertainment rather than incident disease, which requires knowledge of the temporal sequence of events.

Objective

The aim of this study was to develop a machine learning–based phenotyping algorithm for incident stroke ascertainment based on diagnosis codes, procedure codes, and clinical concepts extracted from clinical notes using natural language processing.

Methods

The algorithm was trained and validated using an existing epidemiology cohort consisting of 4914 patients with atrial fibrillation (AF) with manually curated incident stroke events. Various combinations of feature sets and machine learning classifiers were compared. Using a heuristic rule based on the composition of concepts and codes, we further detected the stroke subtype (ischemic stroke/transient ischemic attack or hemorrhagic stroke) of each identified stroke. The algorithm was further validated using a cohort (n=150) stratified sampled from a population in Olmsted County, Minnesota (N=74,314).

Results

Among the 4914 patients with AF, 740 had validated incident stroke events. The best-performing stroke phenotyping algorithm used clinical concepts, diagnosis codes, and procedure codes as features in a random forest classifier. Among patients with stroke codes in the general population sample, the best-performing model achieved a positive predictive value of 86% (43/50; 95% CI 0.74-0.93) and a negative predictive value of 96% (96/100). For subtype identification, we achieved an accuracy of 83% in the AF cohort and 80% in the general population sample.

Conclusions

We developed and validated a machine learning–based algorithm that performed well for identifying incident stroke and for determining type of stroke. The algorithm also performed well on a sample from a general population, further demonstrating its generalizability and potential for adoption by other institutions.
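
As a rough check on the predictive values reported in the Results above, the following Python sketch recomputes the PPV (43/50) and NPV (96/100) and attaches a Wilson score interval to each. The abstract does not say which interval method the authors used, so the Wilson interval and the helper name `wilson_interval` are assumptions made for illustration. Under that assumption, the PPV interval rounds to the reported 0.74-0.93.

```python
import math


def wilson_interval(successes, total, z=1.96):
    """Wilson score interval for a binomial proportion (z=1.96 is roughly 95%)."""
    if total <= 0:
        raise ValueError("total must be positive")
    p = successes / total
    denom = 1 + z * z / total
    center = (p + z * z / (2 * total)) / denom
    half = z * math.sqrt(p * (1 - p) / total + z * z / (4 * total * total)) / denom
    return center - half, center + half


# Counts taken from the abstract: 43 of 50 flagged patients were true strokes,
# 96 of 100 unflagged patients were true non-strokes.
ppv = 43 / 50
npv = 96 / 100
ppv_low, ppv_high = wilson_interval(43, 50)
npv_low, npv_high = wilson_interval(96, 100)

print(f"PPV {ppv:.2f} (95% CI {ppv_low:.2f}-{ppv_high:.2f})")
print(f"NPV {npv:.2f} (95% CI {npv_low:.2f}-{npv_high:.2f})")
```

The same helper can be applied to any pair of validated counts from a chart-review sample. For very small samples, an exact (Clopper-Pearson) interval may be a more conservative choice.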

Liao KP, Sun J, Cai TA, Link N, Hong C, Huang J, Huffman JE, Gronsbell J, Zhang Y, Ho YL, Castro V, Gainer V, Murphy SN, O'Donnell CJ, Gaziano JM, Cho K, Szolovits P, Kohane IS, Yu S, Cai T. High-throughput multimodal automated phenotyping (MAP) with application to PheWAS. J Am Med Inform Assoc 2021;26:1255-1262. [PMID: 31613361 DOI: 10.1093/jamia/ocz066] [Citation(s) in RCA: 59] [Impact Index Per Article: 19.7] [Received: 12/11/2018] [Revised: 04/08/2019] [Accepted: 04/26/2019] [Indexed: 01/01/2023]

Affiliation(s)

Katherine P Liao Division of Rheumatology, Immunology, and Allergy, Brigham and Women's Hospital, Boston, MA, USA; Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA; Division of Data Sciences, VA Boston Healthcare System, Boston, MA, USA
Jiehuan Sun Division of Data Sciences, VA Boston Healthcare System, Boston, MA, USA; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Tianrun A Cai Division of Rheumatology, Immunology, and Allergy, Brigham and Women's Hospital, Boston, MA, USA; Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA; Division of Data Sciences, VA Boston Healthcare System, Boston, MA, USA
Nicholas Link Division of Data Sciences, VA Boston Healthcare System, Boston, MA, USA
Chuan Hong Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA; Division of Data Sciences, VA Boston Healthcare System, Boston, MA, USA; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Jie Huang Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
Jennifer E Huffman Division of Data Sciences, VA Boston Healthcare System, Boston, MA, USA
Jessica Gronsbell Verily Life Sciences, Cambridge, MA, USA
Yichi Zhang Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA; University of Rhode Island, Kingston, RI, USA
Yuk-Lam Ho Division of Data Sciences, VA Boston Healthcare System, Boston, MA, USA
Victor Castro Partners Healthcare Systems, Summerville, MA, USA
Vivian Gainer Partners Healthcare Systems, Summerville, MA, USA
Shawn N Murphy Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA; Partners Healthcare Systems, Summerville, MA, USA; Massachusetts General Hospital, Boston, MA, USA
Christopher J O'Donnell Division of Rheumatology, Immunology, and Allergy, Brigham and Women's Hospital, Boston, MA, USA; Division of Data Sciences, VA Boston Healthcare System, Boston, MA, USA
J Michael Gaziano Division of Rheumatology, Immunology, and Allergy, Brigham and Women's Hospital, Boston, MA, USA; Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA; Division of Data Sciences, VA Boston Healthcare System, Boston, MA, USA
Kelly Cho Division of Rheumatology, Immunology, and Allergy, Brigham and Women's Hospital, Boston, MA, USA; Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA; Division of Data Sciences, VA Boston Healthcare System, Boston, MA, USA
Peter Szolovits Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA
Isaac S Kohane Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
Sheng Yu Center for Statistical Science, Tsinghua University, Beijing, China; Department of Industrial Engineering, Tsinghua University, Beijing, China; Institute for Data Science, Tsinghua University, Beijing, China
Tianxi Cai Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA; Division of Data Sciences, VA Boston Healthcare System, Boston, MA, USA; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA

Guo Z, Rakshit P, Herman DS, Chen J. Inference for the Case Probability in High-dimensional Logistic Regression. JOURNAL OF MACHINE LEARNING RESEARCH : JMLR 2021;22:254. [PMID: 35935001 PMCID: PMC9354733] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/15/2023]

Hart KL, Pellegrini AM, Forester BP, Berretta S, Murphy SN, Perlis RH, McCoy TH. Distribution of agitation and related symptoms among hospitalized patients using a scalable natural language processing method. Gen Hosp Psychiatry 2021;68:46-51. [PMID: 33310013 PMCID: PMC7855889 DOI: 10.1016/j.genhosppsych.2020.11.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 07/09/2020] [Revised: 11/03/2020] [Accepted: 11/04/2020] [Indexed: 01/29/2023]

Atuegwu NC, Oncken C, Laubenbacher RC, Perez MF, Mortensen EM. Factors Associated with E-Cigarette Use in U.S. Young Adult Never Smokers of Conventional Cigarettes: A Machine Learning Approach. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020;17:ijerph17197271. [PMID: 33027932 PMCID: PMC7579019 DOI: 10.3390/ijerph17197271] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 08/17/2020] [Revised: 09/24/2020] [Accepted: 09/28/2020] [Indexed: 02/08/2023]

Palumbo SA, Adamson KM, Krishnamurthy S, Manoharan S, Beiler D, Seiwell A, Young C, Metpally R, Crist RC, Doyle GA, Ferraro TN, Li M, Berrettini WH, Robishaw JD, Troiani V. Assessment of Probable Opioid Use Disorder Using Electronic Health Record Documentation. JAMA Netw Open 2020;3:e2015909. [PMID: 32886123 PMCID: PMC7489858 DOI: 10.1001/jamanetworkopen.2020.15909] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Indexed: 12/19/2022]

Abstract

IMPORTANCE

Electronic health records are a potentially valuable source of information for identifying patients with opioid use disorder (OUD).

OBJECTIVE

To evaluate whether proxy measures from electronic health record data can be used reliably to identify patients with probable OUD based on Diagnostic and Statistical Manual of Mental Disorders (Fifth Edition) (DSM-5) criteria.

DESIGN, SETTING, AND PARTICIPANTS

This retrospective cross-sectional study analyzed individuals within the Geisinger health system who were prescribed opioids between December 31, 2000, and May 31, 2017, using a mixed-methods approach. The cohort was identified from 16 253 patients enrolled in a contract-based, Geisinger-specific medication monitoring program (GMMP) for opioid use, including patients who maintained or violated contract terms, as well as a demographically matched control group of 16 253 patients who were prescribed opioids but not enrolled in the GMMP. Substance use diagnoses and psychiatric comorbidities were assessed using automated electronic health record summaries. A manual medical record review procedure using DSM-5 criteria for OUD was completed for a subset of patients. The analysis was conducted beginning from June 5, 2017, until May 29, 2020.

MAIN OUTCOMES AND MEASURES

The primary outcome was the prevalence of OUD as defined by proxy measures for DSM-5 criteria for OUD as well as the prevalence of comorbidities among patients prescribed opioids within an integrated health system.

RESULTS

Among the 16 253 patients enrolled in the GMMP (9309 women [57%]; mean [SD] age, 52 [14] years), OUD diagnoses as defined by diagnostic codes were present at a much lower rate than expected (291 [2%]), indicating the necessity for alternative diagnostic strategies. The DSM-5 criteria for OUD can be assessed using manual medical record review; a manual review of 200 patients in the GMMP and 200 control patients identified a larger percentage of patients with probable moderate to severe OUD (GMMP, 145 of 200 [73%]; and control, 27 of 200 [14%]) compared with the prevalence of OUD assessed using diagnostic codes.

CONCLUSIONS AND RELEVANCE

These results suggest that patients with OUD may be identified using information available in the electronic health record, even when diagnostic codes do not reflect this diagnosis. Furthermore, the study demonstrates the utility of coding for DSM-5 criteria from medical records to generate a quantitative DSM-5 score that is associated with OUD severity.

Affiliation(s)

Sarah A. Palumbo Department of Biomedical Science, Schmidt College of Medicine of Florida Atlantic University, Boca Raton
Kayleigh M. Adamson Geisinger Clinic, Geisinger, Danville, Pennsylvania
Sarathbabu Krishnamurthy Department of Molecular and Functional Genomics, Geisinger, Danville, Pennsylvania
Shivani Manoharan Geisinger Clinic, Geisinger, Danville, Pennsylvania
Donielle Beiler Geisinger Clinic, Geisinger, Danville, Pennsylvania
Anthony Seiwell Geisinger Clinic, Geisinger, Danville, Pennsylvania
Colt Young Geisinger Clinic, Geisinger, Danville, Pennsylvania
Raghu Metpally Department of Molecular and Functional Genomics, Geisinger, Danville, Pennsylvania
Richard C. Crist Center for Neurobiology and Behavior, Department of Psychiatry, University of Pennsylvania Perelman School of Medicine, Philadelphia
Glenn A. Doyle Center for Neurobiology and Behavior, Department of Psychiatry, University of Pennsylvania Perelman School of Medicine, Philadelphia
Thomas N. Ferraro Center for Neurobiology and Behavior, Department of Psychiatry, University of Pennsylvania Perelman School of Medicine, Philadelphia; Department of Biomedical Sciences, Cooper Medical School of Rowan University, Camden, New Jersey
Mingyao Li Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia
Wade H. Berrettini Geisinger Clinic, Geisinger, Danville, Pennsylvania; Center for Neurobiology and Behavior, Department of Psychiatry, University of Pennsylvania Perelman School of Medicine, Philadelphia
Janet D. Robishaw Department of Biomedical Science, Schmidt College of Medicine of Florida Atlantic University, Boca Raton
Vanessa Troiani Geisinger Clinic, Geisinger, Danville, Pennsylvania; Department of Imaging Science and Innovation, Geisinger, Danville, Pennsylvania; Neuroscience Institute, Geisinger, Danville, Pennsylvania; Department of Basic Sciences, Geisinger Commonwealth School of Medicine, Scranton, Pennsylvania

Vuijk PJ, Martin J, Braaten EB, Genovese G, Capawana MR, O’Keefe SM, Lee BA, Lind HS, Smoller JW, Faraone SV, Perlis RH, Doyle AE. Translating Discoveries in Attention-Deficit/Hyperactivity Disorder Genomics to an Outpatient Child and Adolescent Psychiatric Cohort. J Am Acad Child Adolesc Psychiatry 2020;59:964-977. [PMID: 31421235 PMCID: PMC7408479 DOI: 10.1016/j.jaac.2019.08.004] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Received: 09/27/2018] [Revised: 05/29/2019] [Accepted: 08/08/2019] [Indexed: 01/10/2023]

Abstract

OBJECTIVE

Genomic discoveries should be investigated in generalizable child psychiatric samples in order to justify and inform studies that will evaluate their use for specific clinical purposes. In youth consecutively referred for neuropsychiatric evaluation, we examined 1) the convergent and discriminant validity of attention-deficit/hyperactivity disorder (ADHD) polygenic risk scores (PRSs) in relation to DSM-based ADHD phenotypes; 2) the association of ADHD PRSs with phenotypes beyond ADHD that share its liability and have implications for outcome; and 3) the extent to which youth with high ADHD PRSs manifest a distinctive clinical profile.

METHOD

Participants were 433 youth, ages 7-18 years, from the Longitudinal Study of Genetic Influences on Cognition. We used logistic/linear regression and mixed effects models to examine associations with ADHD-related polygenic variation from the largest ADHD genome-wide association study to date. We replicated key findings in 5,140 adult patients from a local health system biobank.

RESULTS

Among referred youth, ADHD PRSs were associated with ADHD diagnoses, cross-diagnostic ADHD symptoms and academic impairment (odds ratios ∼1.4; R² values ∼2%-3%), as well as cross-diagnostic variation in aggression and working memory. In adults, ADHD PRSs were associated with ADHD and phenotypes beyond the condition that have public health implications. Finally, youth with a high ADHD polygenic burden showed a more severe clinical profile than youth with a low burden (β coefficients ∼.2).

CONCLUSION

Among child and adolescent outpatients, ADHD polygenic risk was associated with ADHD and related phenotypes as well as clinical severity. These results extend the scientific foundation for studies of ADHD polygenic risk in the clinical setting and highlight directions for further research.

Affiliation(s)

Pieter J. Vuijk Center for Genomic Medicine, Massachusetts General Hospital, Boston
Joanna Martin MRC Centre for Neuropsychiatric Genetics and Genomics, Cardiff University, UK; Stanley Center for Psychiatric Research, Broad Institute, Cambridge, MA
Ellen B. Braaten Massachusetts General Hospital and Harvard Medical School, Massachusetts General Hospital, Boston
Giulio Genovese Stanley Center for Psychiatric Research, Broad Institute, Cambridge, MA
Michael R. Capawana Massachusetts General Hospital and Harvard Medical School, Massachusetts General Hospital, Boston
Sheila M. O’Keefe Massachusetts General Hospital and Harvard Medical School, Massachusetts General Hospital, Boston
B. Andi Lee Center for Genomic Medicine, Massachusetts General Hospital, Boston
Hannah S. Lind Center for Genomic Medicine, Massachusetts General Hospital, Boston
Jordan W. Smoller Center for Genomic Medicine, Massachusetts General Hospital, Boston; Stanley Center for Psychiatric Research, Broad Institute, Cambridge, MA; Massachusetts General Hospital and Harvard Medical School, Massachusetts General Hospital, Boston
Stephen V. Faraone SUNY Upstate Medical University, Syracuse, NY
Roy H. Perlis Center for Genomic Medicine, Massachusetts General Hospital, Boston; Stanley Center for Psychiatric Research, Broad Institute, Cambridge, MA; Massachusetts General Hospital and Harvard Medical School, Massachusetts General Hospital, Boston; Center for Experimental Drugs and Diagnostics, Massachusetts General Hospital, Boston
Alysa E. Doyle Center for Genomic Medicine, Massachusetts General Hospital, Boston; Stanley Center for Psychiatric Research, Broad Institute, Cambridge, MA; Massachusetts General Hospital and Harvard Medical School, Massachusetts General Hospital, Boston. Correspondence to Alysa E. Doyle, PhD, Center for Genomic Medicine, Massachusetts General Hospital, 185 Cambridge Street, CPZN 6240, Boston, MA 02114

Beesley LJ, Fritsche LG, Mukherjee B. An analytic framework for exploring sampling and observation process biases in genome and phenome-wide association studies using electronic health records. Stat Med 2020;39:1965-1979. [PMID: 32198773 DOI: 10.1002/sim.8524] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 05/29/2019] [Revised: 02/14/2020] [Accepted: 02/14/2020] [Indexed: 12/17/2022]

Beesley LJ, Salvatore M, Fritsche LG, Pandit A, Rao A, Brummett C, Willer CJ, Lisabeth LD, Mukherjee B. The emerging landscape of health research based on biobanks linked to electronic health records: Existing resources, statistical challenges, and potential opportunities. Stat Med 2020;39:773-800. [PMID: 31859414 PMCID: PMC7983809 DOI: 10.1002/sim.8445] [Citation(s) in RCA: 52] [Impact Index Per Article: 13.0] [Received: 10/10/2018] [Revised: 09/10/2019] [Accepted: 11/16/2019] [Indexed: 01/03/2023]

Barak-Corren Y, Castro VM, Nock MK, Mandl KD, Madsen EM, Seiger A, Adams WG, Applegate RJ, Bernstam EV, Klann JG, McCarthy EP, Murphy SN, Natter M, Ostasiewski B, Patibandla N, Rosenthal GE, Silva GS, Wei K, Weber GM, Weiler SR, Reis BY, Smoller JW. Validation of an Electronic Health Record-Based Suicide Risk Prediction Modeling Approach Across Multiple Health Care Systems. JAMA Netw Open 2020;3:e201262. [PMID: 32211868 PMCID: PMC11136522 DOI: 10.1001/jamanetworkopen.2020.1262] [Citation(s) in RCA: 40] [Impact Index Per Article: 10.0] [Indexed: 11/14/2022]

Abstract

Importance

Suicide is a leading cause of mortality, with suicide-related deaths increasing in recent years. Automated methods for individualized risk prediction have great potential to address this growing public health threat. To facilitate their adoption, they must first be validated across diverse health care settings.

Objective

To evaluate the generalizability and cross-site performance of a risk prediction method using readily available structured data from electronic health records in predicting incident suicide attempts across multiple, independent, US health care systems.

Design, Setting, and Participants

For this prognostic study, data were extracted from longitudinal electronic health record data comprising International Classification of Diseases, Ninth Revision diagnoses, laboratory test results, procedures codes, and medications for more than 3.7 million patients from 5 independent health care systems participating in the Accessible Research Commons for Health network. Across sites, 6 to 17 years' worth of data were available, up to 2018. Outcomes were defined by International Classification of Diseases, Ninth Revision codes reflecting incident suicide attempts (with positive predictive value >0.70 according to expert clinician medical record review). Models were trained using naive Bayes classifiers in each of the 5 systems. Models were cross-validated in independent data sets at each site, and performance metrics were calculated. Data analysis was performed from November 2017 to August 2019.

Main Outcomes and Measures

The primary outcome was suicide attempt as defined by a previously validated case definition using International Classification of Diseases, Ninth Revision codes. The accuracy and timeliness of the prediction were measured at each site.

Results

Across the 5 health care systems, of the 3 714 105 patients (2 130 454 female [57.2%]) included in the analysis, 39 162 cases (1.1%) were identified. Predictive features varied by site but, as expected, the most common predictors reflected mental health conditions (eg, borderline personality disorder, with odds ratios of 8.1-12.9, and bipolar disorder, with odds ratios of 0.9-9.1) and substance use disorders (eg, drug withdrawal syndrome, with odds ratios of 7.0-12.9). Despite variation in geographical location, demographic characteristics, and population health characteristics, model performance was similar across sites, with areas under the curve ranging from 0.71 (95% CI, 0.70-0.72) to 0.76 (95% CI, 0.75-0.77). Across sites, at a specificity of 90%, the models detected a mean of 38% of cases a mean of 2.1 years in advance.

Conclusions and Relevance

Across 5 diverse health care systems, a computationally efficient approach leveraging the full spectrum of structured electronic health record data was able to detect the risk of suicidal behavior in unselected patients. This approach could facilitate the development of clinical decision support tools that inform risk reduction interventions.
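
The operating point quoted in the Results above (sensitivity at a fixed 90% specificity) can be illustrated with a short Python sketch. The scores and labels below are synthetic toy values, not study data, and `sensitivity_at_specificity` is a hypothetical helper rather than the authors' naive Bayes pipeline. It only shows how a score cutoff is chosen on non-cases and then applied to cases.

```python
def sensitivity_at_specificity(scores, labels, target_specificity=0.90):
    """Pick the lowest score cutoff whose specificity meets the target.

    A patient is flagged when score >= cutoff. Returns a tuple
    (cutoff, sensitivity, specificity). Labels are 1 for cases, 0 for non-cases.
    """
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    positives = [s for s, y in zip(scores, labels) if y == 1]
    if not negatives or not positives:
        raise ValueError("need at least one case and one non-case")

    # Specificity can only rise as the cutoff rises, so the first cutoff that
    # meets the target also gives the highest sensitivity at that target.
    for cutoff in sorted(set(scores)):
        specificity = sum(1 for s in negatives if s < cutoff) / len(negatives)
        if specificity >= target_specificity:
            sensitivity = sum(1 for s in positives if s >= cutoff) / len(positives)
            return cutoff, sensitivity, specificity

    # No observed score reaches the target: flag nobody.
    return float("inf"), 0.0, 1.0


# Synthetic risk scores (illustrative only): 10 non-cases followed by 5 cases.
scores = [0.05, 0.10, 0.15, 0.20, 0.25, 0.30, 0.35, 0.40, 0.45, 0.80,
          0.30, 0.50, 0.60, 0.70, 0.90]
labels = [0] * 10 + [1] * 5

cutoff, sensitivity, specificity = sensitivity_at_specificity(scores, labels)
print(f"cutoff={cutoff:.2f} sensitivity={sensitivity:.2f} specificity={specificity:.2f}")
```

On this toy data the cutoff is 0.50, which flags 4 of the 5 cases (sensitivity 0.80) while 9 of the 10 non-cases stay unflagged (specificity 0.90).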

Affiliation(s)

Yuval Barak-Corren Computational Health Informatics Program, Boston Children's Hospital, Boston, Massachusetts
Victor M Castro Partners Research Information Science and Computing, Boston, Massachusetts
Matthew K Nock Department of Psychology, Harvard University, Cambridge, Massachusetts
Kenneth D Mandl Computational Health Informatics Program, Boston Children's Hospital, Boston, Massachusetts; Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts
Emily M Madsen Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts
Ashley Seiger Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts
William G Adams Department of Pediatrics, Boston Medical Center, Boston University School of Medicine, Boston, Massachusetts
R Joseph Applegate School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston
Elmer V Bernstam School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston; McGovern Medical School, Division of General Internal Medicine, The University of Texas Health Science Center at Houston, Houston
Jeffrey G Klann Partners Research Information Science and Computing, Boston, Massachusetts
Ellen P McCarthy Department of Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts
Shawn N Murphy Partners Research Information Science and Computing, Boston, Massachusetts
Marc Natter Computational Health Informatics Program, Boston Children's Hospital, Boston, Massachusetts
Brian Ostasiewski Clinical and Translational Science Institute, Wake Forest School of Medicine, Winston-Salem, North Carolina
Nandan Patibandla Computational Health Informatics Program, Boston Children's Hospital, Boston, Massachusetts
Gary E Rosenthal Department of Internal Medicine, Wake Forest School of Medicine, Winston-Salem, North Carolina
George S Silva Department of Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts
Kun Wei Clinical and Translational Science Institute, Wake Forest School of Medicine, Winston-Salem, North Carolina
Griffin M Weber Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts; Department of Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts
Sarah R Weiler Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts
Ben Y Reis Computational Health Informatics Program, Boston Children's Hospital, Boston, Massachusetts
Jordan W Smoller Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts

Defining Major Depressive Disorder Cohorts Using the EHR: Multiple Phenotypes Based on ICD-9 Codes and Medication Orders. Neurol Psychiatry Brain Res 2020;36:18-26. [PMID: 32218644 DOI: 10.1016/j.npbr.2020.02.002] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Indexed: 01/26/2023]

Wang J, Deng H, Liu B, Hu A, Liang J, Fan L, Zheng X, Wang T, Lei J. Systematic Evaluation of Research Progress on Natural Language Processing in Medicine Over the Past 20 Years: Bibliometric Study on PubMed. J Med Internet Res 2020;22:e16816. [PMID: 32012074 PMCID: PMC7005695 DOI: 10.2196/16816] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Received: 10/28/2019] [Revised: 12/05/2019] [Accepted: 12/15/2019] [Indexed: 12/15/2022]

Abstract

BACKGROUND

Natural language processing (NLP) is an important traditional field in computer science, but its application in medical research has faced many challenges. With the extensive digitalization of medical information globally and increasing importance of understanding and mining big data in the medical field, NLP is becoming more crucial.

OBJECTIVE

The goal of the research was to perform a systematic review on the use of NLP in medical research with the aim of understanding the global progress on NLP research outcomes, content, methods, and study groups involved.

METHODS

A systematic review was conducted using the PubMed database as a search platform. All published studies on the application of NLP in medicine (except biomedicine) during the 20 years between 1999 and 2018 were retrieved. The data obtained from these published studies were cleaned and structured. Excel (Microsoft Corp) and VOSviewer (Nees Jan van Eck and Ludo Waltman) were used to perform bibliometric analysis of publication trends, author orders, countries, institutions, collaboration relationships, research hot spots, diseases studied, and research methods.

RESULTS

A total of 3498 articles were obtained during initial screening, and 2336 articles were found to meet the study criteria after manual screening. The number of publications increased every year, with a significant growth after 2012 (number of publications ranged from 148 to a maximum of 302 annually). The United States has occupied the leading position since the inception of the field, with the largest number of articles published. The United States contributed to 63.01% (1472/2336) of all publications, followed by France (5.44%, 127/2336) and the United Kingdom (3.51%, 82/2336). The author with the largest number of articles published was Hongfang Liu (70), while Stéphane Meystre (17) and Hua Xu (33) published the largest number of articles as the first and corresponding authors, respectively. Among the first authors' affiliated institutions, Columbia University published the largest number of articles, accounting for 4.54% (106/2336) of the total. Specifically, approximately one-fifth (17.68%, 413/2336) of the articles involved research on specific diseases, and the subject areas primarily focused on mental illness (16.46%, 68/413), breast cancer (5.81%, 24/413), and pneumonia (4.12%, 17/413).

CONCLUSIONS

NLP is in a period of robust development in the medical field, with an average of approximately 100 publications annually. Electronic medical records were the most used research materials, but social media such as Twitter have become important research materials since 2015. Cancer (24.94%, 103/413) was the most common subject area in NLP-assisted medical research on diseases, with breast cancers (23.30%, 24/103) and lung cancers (14.56%, 15/103) accounting for the highest proportions of studies. Columbia University and the talents trained therein were the most active and prolific research forces on NLP in the medical field.

Walsh CG, Chaudhry B, Dua P, Goodman KW, Kaplan B, Kavuluru R, Solomonides A, Subbian V. Stigma, biomarkers, and algorithmic bias: recommendations for precision behavioral health with artificial intelligence. JAMIA Open 2020;3:9-15. [PMID: 32607482 PMCID: PMC7309258 DOI: 10.1093/jamiaopen/ooz054] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Received: 03/08/2019] [Revised: 07/29/2019] [Accepted: 10/30/2019] [Indexed: 12/22/2022]

High-throughput phenotyping with electronic medical record data using a common semi-supervised approach (PheCAP). Nat Protoc 2019;14:3426-3444. [PMID: 31748751 DOI: 10.1038/s41596-019-0227-6] [Citation(s) in RCA: 83] [Impact Index Per Article: 16.6] [Received: 10/19/2018] [Accepted: 07/22/2019] [Indexed: 01/12/2023]

A Review of Automatic Phenotyping Approaches using Electronic Health Records. ELECTRONICS 2019. [DOI: 10.3390/electronics8111235] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Indexed: 12/12/2022]

Zheutlin AB, Dennis J, Karlsson Linnér R, Moscati A, Restrepo N, Straub P, Ruderfer D, Castro VM, Chen CY, Ge T, Huckins LM, Charney A, Kirchner HL, Stahl EA, Chabris CF, Davis LK, Smoller JW. Penetrance and Pleiotropy of Polygenic Risk Scores for Schizophrenia in 106,160 Patients Across Four Health Care Systems. Am J Psychiatry 2019;176:846-855. [PMID: 31416338 PMCID: PMC6961974 DOI: 10.1176/appi.ajp.2019.18091085] [Citation(s) in RCA: 147] [Impact Index Per Article: 29.4] [Indexed: 12/17/2022]

Abstract

OBJECTIVE

Individuals at high risk for schizophrenia may benefit from early intervention, but few validated risk predictors are available. Genetic profiling is one approach to risk stratification that has been extensively validated in research cohorts. The authors sought to test the utility of this approach in clinical settings and to evaluate the broader health consequences of high genetic risk for schizophrenia.

METHODS

The authors used electronic health records for 106,160 patients from four health care systems to evaluate the penetrance and pleiotropy of genetic risk for schizophrenia. Polygenic risk scores (PRSs) for schizophrenia were calculated from summary statistics and tested for association with 1,359 disease categories, including schizophrenia and psychosis, in phenome-wide association studies. Effects were combined through meta-analysis across sites.

RESULTS

PRSs were robustly associated with schizophrenia (odds ratio per standard deviation increase in PRS, 1.55; 95% CI=1.4, 1.7), and patients in the highest risk decile of the PRS distribution had up to 4.6-fold higher odds of schizophrenia compared with those in the bottom decile (95% CI=2.9, 7.3). PRSs were also positively associated with other phenotypes, including anxiety, mood, substance use, neurological, and personality disorders, as well as suicidal behavior, memory loss, and urinary syndromes; they were inversely related to obesity.

CONCLUSIONS

The study demonstrates that an available measure of genetic risk for schizophrenia is robustly associated with schizophrenia in health care settings and has pleiotropic effects on related psychiatric disorders as well as other medical syndromes. The results provide an initial indication of the opportunities and limitations that may arise with the future application of PRS testing in health care systems.
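A minimal Python sketch of the two computational steps named in the methods, under stated assumptions. The abstract does not say how the PRSs were constructed or which meta-analysis model was used, so the sketch assumes common textbook forms: a PRS as a weighted sum of risk-allele dosages, and fixed-effect inverse-variance pooling of per-site log odds ratios. All function names, weights, and numbers are hypothetical illustrations, not values from the study.

```python
import math
from statistics import mean, pstdev

def polygenic_risk_score(dosages, weights):
    """Weighted sum of risk-allele dosages (0, 1, or 2 per variant).

    `weights` stands in for per-variant effect sizes (log odds ratios)
    taken from GWAS summary statistics; the values used below are made up.
    """
    if len(dosages) != len(weights):
        raise ValueError("dosages and weights must have the same length")
    return sum(d * w for d, w in zip(dosages, weights))

def standardize(scores):
    """Rescale scores to mean 0 and SD 1, so effects read 'per SD of PRS'."""
    mu, sd = mean(scores), pstdev(scores)
    return [(s - mu) / sd for s in scores]

def fixed_effect_meta(betas, standard_errors):
    """Inverse-variance weighted pooled log odds ratio and its standard error."""
    inv_var = [1.0 / se ** 2 for se in standard_errors]
    pooled = sum(w * b for w, b in zip(inv_var, betas)) / sum(inv_var)
    pooled_se = math.sqrt(1.0 / sum(inv_var))
    return pooled, pooled_se

# Hypothetical toy data: three variants, four individuals.
weights = [0.10, -0.05, 0.20]
cohort = [[2, 1, 0], [0, 0, 1], [1, 2, 2], [0, 1, 0]]
z_scores = standardize([polygenic_risk_score(d, weights) for d in cohort])

# Hypothetical per-site log odds ratios (per SD of PRS) and standard errors.
pooled_beta, pooled_se = fixed_effect_meta([0.40, 0.45, 0.42, 0.50],
                                           [0.08, 0.10, 0.12, 0.15])
low = math.exp(pooled_beta - 1.96 * pooled_se)
high = math.exp(pooled_beta + 1.96 * pooled_se)
print(f"pooled OR per SD = {math.exp(pooled_beta):.2f} (95% CI {low:.2f}, {high:.2f})")
```

Exponentiating the pooled log odds ratio gives an odds ratio per standard deviation of PRS, the form in which the abstract reports its main result.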

Affiliation(s)

Amanda B Zheutlin, Jessica Dennis, Richard Karlsson Linnér, Arden Moscati, Nicole Restrepo, Peter Straub, Douglas Ruderfer, Victor M Castro, Chia-Yen Chen, Tian Ge, Laura M Huckins, Alexander Charney, H Lester Kirchner, Eli A Stahl, Christopher F Chabris, Lea K Davis, Jordan W Smoller
Psychiatric and Neurodevelopmental Genetics Unit (Zheutlin, Chen, Ge, Smoller) and Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston (Chen); Stanley Center for Psychiatric Research, Broad Institute, Cambridge, Mass. (Zheutlin, Chen, Stahl, Smoller); Division of Genetic Medicine, Department of Medicine (Dennis, Straub, Ruderfer, Davis), Vanderbilt Genetics Institute (Dennis, Straub, Ruderfer, Davis), and Department of Biomedical Informatics (Ruderfer), Vanderbilt University Medical Center, Nashville; Department of Economics, School of Business and Economics, Vrije Universiteit Amsterdam, Amsterdam (Karlsson Linnér); Autism and Developmental Medicine Institute, Geisinger, Lewisburg, Pa. (Karlsson Linnér, Chabris); Charles Bronfman Institute for Personalized Medicine (Moscati), Pamela Sklar Division of Psychiatric Genomics (Huckins, Charney, Stahl), and Department of Genetics and Genomic Sciences (Huckins, Charney, Stahl), Icahn School of Medicine at Mount Sinai, New York; Department of Biomedical and Translational Informatics, Geisinger, Rockville, Md. (Restrepo, Kirchner); Research Information Science and Computing, Partners HealthCare, Somerville, Mass. (Castro)

White JM, Mertz EA, Mullins JM, Even JB, Guy T, Blaga E, Kottek AM, Kumar SV, Bangar S, Vaderhobli R, Brandon R, Santo W, Jenson L, Gansky SA. Developing and Testing Electronic Health Record-Derived Caries Indices. Caries Res 2019;53:650-658. [PMID: 31167186 DOI: 10.1159/000499700] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Received: 09/27/2018] [Accepted: 03/18/2019] [Indexed: 12/15/2022]

Affiliation(s)

Joel M White Department of Preventive and Restorative Dental Sciences, University of California, San Francisco, San Francisco, California, USA.,Center to Address Disparities in Children's Oral Health, University of California, San Francisco, San Francisco, California, USA
Elizabeth A Mertz Department of Preventive and Restorative Dental Sciences, University of California, San Francisco, San Francisco, California, USA.,Center to Address Disparities in Children's Oral Health, University of California, San Francisco, San Francisco, California, USA.,Philip R. Lee Institute for Health Policy Studies, University of California, San Francisco, San Francisco, California, USA
Joanna M Mullins Willamette Dental Group and Skourtes Institute, Hillsboro, Oregon, USA
Joshua B Even Willamette Dental Group and Skourtes Institute, Hillsboro, Oregon, USA
Trey Guy Willamette Dental Group and Skourtes Institute, Hillsboro, Oregon, USA
Elena Blaga Willamette Dental Group and Skourtes Institute, Hillsboro, Oregon, USA
Aubri M Kottek Center to Address Disparities in Children's Oral Health, University of California, San Francisco, San Francisco, California, USA.,Philip R. Lee Institute for Health Policy Studies, University of California, San Francisco, San Francisco, California, USA
Shwetha V Kumar School of Dentistry, The University of Texas Health Science Center at Houston, Houston, Texas, USA
Suhasini Bangar School of Dentistry, The University of Texas Health Science Center at Houston, Houston, Texas, USA
Ram Vaderhobli Department of Preventive and Restorative Dental Sciences, University of California, San Francisco, San Francisco, California, USA
Ryan Brandon Willamette Dental Group and Skourtes Institute, Hillsboro, Oregon, USA
William Santo Department of Preventive and Restorative Dental Sciences, University of California, San Francisco, San Francisco, California, USA.,Center to Address Disparities in Children's Oral Health, University of California, San Francisco, San Francisco, California, USA
Larry Jenson Department of Preventive and Restorative Dental Sciences, University of California, San Francisco, San Francisco, California, USA
Stuart A Gansky Department of Preventive and Restorative Dental Sciences, University of California, San Francisco, San Francisco, California, USA.,Center to Address Disparities in Children's Oral Health, University of California, San Francisco, San Francisco, California, USA.,Philip R. Lee Institute for Health Policy Studies, University of California, San Francisco, San Francisco, California, USA.,Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, California, USA

Williams K, Shorser-Gentile L, Sarvode Mothi S, Berman N, Pasternack M, Geller D, Walter J. Immunoglobulin A Dysgammaglobulinemia Is Associated with Pediatric-Onset Obsessive-Compulsive Disorder. J Child Adolesc Psychopharmacol 2019;29:268-275. [PMID: 30892924 PMCID: PMC7227412 DOI: 10.1089/cap.2018.0043] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Indexed: 01/04/2023]

Abstract

Background: Inflammation and immune dysregulation have been implicated in the pathogenesis of pediatric-onset obsessive-compulsive disorder (OCD) and tic disorders such as Tourette syndrome (TS). Though few replicated studies have identified markers of immune dysfunction in this population, preliminary studies suggest that serum immunoglobulin A (IgA) concentrations may be abnormal in children with these disorders. Methods: This observational retrospective cohort study, conducted using electronic health records (EHRs), identified 206 children with pediatric-onset OCD and 1024 adults diagnosed with OCD who also had testing for serum levels of IgA. IgA deficiency and serum IgA levels in pediatric OCD were compared with IgA levels from children diagnosed with autism spectrum disorders (ASD; n = 524), tic disorders (n = 157), attention-deficit/hyperactivity disorder (ADHD; n = 534), anxiety disorders (n = 1206), and celiac disease, a condition associated with IgA deficiency (n = 624). Results: Compared with ASD and anxiety disorder cohorts, the pediatric OCD cohort displayed a significantly higher likelihood of IgA deficiency (OR = 1.93; 95% CI = 1.18-3.16, and OR = 1.98; 95% CI = 1.28-3.06, respectively), though no difference was observed between pediatric OCD and TS cohorts. Furthermore, the pediatric OCD cohort displayed similar rates of IgA deficiency and serum IgA levels when compared with the celiac disease cohort. The pediatric OCD cohort also displayed the highest percentage of IgA deficiency (15%) when compared with TS (14%), celiac disease (14%), ADHD (13%), ASD (8%), and anxiety disorder (8%) cohorts. When segregated by sex, boys with OCD displayed a significantly higher likelihood of IgA deficiency when compared with all comparison cohorts except for celiac disease and tic disorders; no significant difference in IgA deficiency was observed between female cohorts. Pediatric OCD subjects also displayed significantly lower adjusted serum IgA levels than the ASD and anxiety disorder cohorts. Adults with OCD were also significantly less likely than children with OCD to display IgA deficiency (OR = 2.71; 95% CI = 1.71-4.28). When compared with children with celiac disease, no significant difference in IgA levels or rates of IgA deficiency was observed in the pediatric OCD cohort. Conclusions: We provide further evidence of IgA abnormalities in pediatric-onset OCD. These results require further investigation to determine if these abnormalities impact the clinical course of OCD in children.

Edgcomb JB, Zima B. Machine Learning, Natural Language Processing, and the Electronic Health Record: Innovations in Mental Health Services Research. Psychiatr Serv 2019;70:346-349. [PMID: 30784377 DOI: 10.1176/appi.ps.201800401] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Indexed: 11/30/2022]

Dennis J, Yengo-Kahn AM, Kirby P, Solomon GS, Cox NJ, Zuckerman SL. Diagnostic Algorithms to Study Post-Concussion Syndrome Using Electronic Health Records: Validating a Method to Capture an Important Patient Population. J Neurotrauma 2019;36:2167-2177. [PMID: 30773988 DOI: 10.1089/neu.2018.5916] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Indexed: 02/04/2023]

Abstract

Post-concussion syndrome (PCS) is characterized by persistent cognitive, somatic, and emotional symptoms after a mild traumatic brain injury (mTBI). Genetic and other biological variables may contribute to PCS etiology, and the emergence of biobanks linked to electronic health records (EHRs) offers new opportunities for research on PCS. We sought to validate the EHR data of PCS patients by comparing two diagnostic algorithms deployed in the Vanderbilt University Medical Center de-identified database of 2.8 million patient EHRs. The algorithms identified individuals with PCS by: 1) natural language processing (NLP) of narrative text in the EHR combined with structured demographic, diagnostic, and encounter data; or 2) coded billing and procedure data. The predictive value of each algorithm was assessed, and cases and controls identified by each approach were compared on demographic and medical characteristics. The NLP algorithm identified 507 cases and 10,857 controls. The negative predictive value in controls was 78% and the positive predictive value (PPV) in cases was 82%. Conversely, the coded algorithm identified 1142 patients with two or more PCS billing codes and had a PPV of 76%. Comparisons of PCS controls to both case groups recovered known epidemiology of PCS: cases were more likely than controls to be female and to have pre-morbid diagnoses of anxiety, migraine, and post-traumatic stress disorder. In contrast, controls and cases were equally likely to have attention deficit hyperactive disorder and learning disabilities, in accordance with the findings of recent systematic reviews of PCS risk factors. We conclude that EHRs are a valuable research tool for PCS. Ascertainment based on coded data alone had a predictive value comparable to an NLP algorithm, recovered known PCS risk factors, and maximized the number of included patients.
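The predictive values reported in this abstract are simple proportions from chart review: the share of algorithm-flagged cases confirmed as true cases (PPV) and the share of algorithm-flagged controls confirmed as true non-cases (NPV). A minimal Python sketch of the two definitions follows. The abstract gives only the percentages, so the review counts below are hypothetical, chosen purely so that the arithmetic lands on 82% and 78%; they are not the study's data.

```python
def positive_predictive_value(true_pos: int, false_pos: int) -> float:
    """Fraction of algorithm-identified cases confirmed on review."""
    return true_pos / (true_pos + false_pos)

def negative_predictive_value(true_neg: int, false_neg: int) -> float:
    """Fraction of algorithm-identified controls confirmed as non-cases."""
    return true_neg / (true_neg + false_neg)

# Hypothetical chart-review counts (NOT from the study): 50 reviewed
# cases and 50 reviewed controls.
ppv = positive_predictive_value(true_pos=41, false_pos=9)
npv = negative_predictive_value(true_neg=39, false_neg=11)

print(f"PPV = {ppv:.0%}, NPV = {npv:.0%}")
```

Both values depend on how common the condition is in the reviewed sample, so they describe an algorithm's performance in a particular database and do not automatically carry over to another one.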
