Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Xu H, Jiang M, Oetjens M, Bowton EA, Ramirez AH, Jeff JM, Basford MA, Pulley JM, Cowan JD, Wang X, Ritchie MD, Masys DR, Roden DM, Crawford DC, Denny JC. Facilitating pharmacogenetic studies using electronic health records and natural-language processing: a case study of warfarin. J Am Med Inform Assoc 2011;18:387-91. [PMID: 21672908 DOI: 10.1136/amiajnl-2011-000208] [Citation(s) in RCA: 58] [Impact Index Per Article: 4.5] [Indexed: 12/25/2022]

Cited by Other Article(s)

Jafari E, Blackman MH, Karnes JH, Van Driest SL, Crawford DC, Choi L, McDonough CW. Using electronic health records for clinical pharmacology research: Challenges and considerations. Clin Transl Sci 2024;17:e13871. [PMID: 38943244 PMCID: PMC11213823 DOI: 10.1111/cts.13871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/19/2024] [Revised: 05/21/2024] [Accepted: 05/24/2024] [Indexed: 07/01/2024]

Shuey MM, Lee KM, Keaton J, Khankari NK, Breeyear JH, Walker VM, Miller DR, Heberer KR, Reaven PD, Clarke SL, Lee J, Lynch JA, Vujkovic M, Edwards TL. A genetically supported drug repurposing pipeline for diabetes treatment using electronic health records. EBioMedicine 2023;94:104674. [PMID: 37399599 PMCID: PMC10328805 DOI: 10.1016/j.ebiom.2023.104674] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/16/2022] [Revised: 06/06/2023] [Accepted: 06/07/2023] [Indexed: 07/05/2023]

Abstract

BACKGROUND

The identification of new uses for existing drug therapies has the potential to identify treatments for comorbid conditions that have the added benefit of glycemic control while also providing a rapid, low-cost approach to drug (re)discovery.

METHODS

We developed and tested a genetically-informed drug-repurposing pipeline for diabetes management. This approach mapped genetically-predicted gene expression signals from the largest genome-wide association study for type 2 diabetes mellitus to drug targets using publicly available databases to identify drug-gene pairs. These drug-gene pairs were then validated using a two-step approach: 1) a self-controlled case-series (SCCS) using electronic health records from a discovery and replication population, and 2) Mendelian randomization (MR).

FINDINGS

After filtering on sample size, 20 candidate drug-gene pairs were validated and various medications demonstrated evidence of glycemic regulation including two anti-hypertensive classes: angiotensin-converting enzyme inhibitors as well as calcium channel blockers (CCBs). The CCBs demonstrated the strongest evidence of glycemic reduction in both validation approaches (SCCS HbA1c and glucose reduction: -0.11%, p = 0.01 and -0.85 mg/dL, p = 0.02, respectively; MR: OR = 0.84, 95% CI = 0.81, 0.87, p = 5.0 × 10^-25).

INTERPRETATION

Our results support CCBs as a strong candidate medication for blood glucose reduction in addition to cardiovascular disease reduction. Further, these results support the adaptation of this approach for use in future drug-repurposing efforts for other conditions.

FUNDING

National Institutes of Health, Medical Research Council Integrative Epidemiology Unit at the University of Bristol, UK Medical Research Council, American Heart Association, and Department of Veterans Affairs (VA) Informatics and Computing Infrastructure and VA Cooperative Studies Program.

Affiliation(s)

Megan M Shuey: Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA; Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
Kyung Min Lee: VA Informatics and Computer Infrastructure, VA Salt Lake City Health Care System, Salt Lake City, UT, USA
Jacob Keaton: Medical Genomics and Metabolic Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA; Division of Epidemiology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
Nikhil K Khankari: Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA; Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
Joseph H Breeyear: Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA; Nashville VA Medical Center, Nashville, TN, USA
Venexia M Walker: Medical Research Council, Integrative Epidemiology Unit, University of Bristol, Bristol, UK; Bristol Medical School, UK; Population Health Sciences, University of Bristol, Bristol, UK; Department of Surgery, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
Donald R Miller: Center for Healthcare Organization and Implementation Research, Bedford VA Healthcare System, Bedford, MA, USA; Center for Population Health, Department of Biomedical and Nutritional Sciences, University of Massachusetts, Lowell, MA, USA
Kent R Heberer: VA Palo Alto Health Care System, Palo Alto, CA, USA; Departments of Medicine and Endocrinology, Stanford University School of Medicine, Stanford, CA, USA
Peter D Reaven: Phoenix VA Health Care System, Phoenix, AZ, USA; College of Medicine, University of Arizona, Phoenix, AZ, USA
Shoa L Clarke: Departments of Medicine and Pediatrics, Stanford University School of Medicine, Stanford, CA, USA
Jennifer Lee: VA Palo Alto Health Care System, Palo Alto, CA, USA
Julie A Lynch: VA Informatics and Computer Infrastructure, VA Salt Lake City Health Care System, Salt Lake City, UT, USA; School of Medicine, University of Utah, Salt Lake City, UT, USA
Marijana Vujkovic: Corporal Michael J. Crescenz VA Medical Center, Philadelphia, PA, USA; Department of Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA; Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
Todd L Edwards: Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA; Division of Epidemiology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA; Nashville VA Medical Center, Nashville, TN, USA

Artificial intelligence in sepsis early prediction and diagnosis using unstructured data in healthcare. Nat Commun 2021;12:711. [PMID: 33514699 PMCID: PMC7846756 DOI: 10.1038/s41467-021-20910-4] [Citation(s) in RCA: 98] [Impact Index Per Article: 32.7] [Received: 10/29/2019] [Accepted: 12/28/2020] [Indexed: 12/20/2022]

Fu S, Chen D, He H, Liu S, Moon S, Peterson KJ, Shen F, Wang L, Wang Y, Wen A, Zhao Y, Sohn S, Liu H. Clinical concept extraction: A methodology review. J Biomed Inform 2020;109:103526. [PMID: 32768446 PMCID: PMC7746475 DOI: 10.1016/j.jbi.2020.103526] [Citation(s) in RCA: 60] [Impact Index Per Article: 15.0] [Received: 03/02/2020] [Revised: 07/30/2020] [Accepted: 08/02/2020] [Indexed: 01/11/2023]

Liu S, Nie W, Gao D, Yang H, Yan J, Hao T. Clinical quantitative information recognition and entity-quantity association from Chinese electronic medical records. INT J MACH LEARN CYB 2020. [DOI: 10.1007/s13042-020-01160-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 01/03/2023]

Shuey M, Perkins B, Nian H, Yu C, Luther JM, Brown N. Retrospective cohort study to characterise the blood pressure response to spironolactone in patients with apparent therapy-resistant hypertension using electronic medical record data. BMJ Open 2020;10:e033100. [PMID: 32461291 PMCID: PMC7259833 DOI: 10.1136/bmjopen-2019-033100] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Indexed: 01/23/2023]

Abstract

OBJECTIVE

Identify blood pressure (BP) response to spironolactone in patients with apparent therapy-resistant hypertension (aTRH) using electronic medical records (EMRs) in order to estimate response in a real-world clinical setting.

DESIGN

Developed an algorithm to determine BP and electrolyte response to spironolactone for use in a retrospective cohort study.

SETTING

An academic medical centre in Nashville, Tennessee.

POPULATION

Patients with aTRH prescribed spironolactone.

MAIN OUTCOME MEASURES

Baseline BP and BP response, determined as the change in mean systolic BP (SBP) and diastolic BP (DBP) following spironolactone initiation. Additional response measures were serum sodium, potassium and creatinine, estimated glomerular filtration rate, haemoglobin A1c (HbA1c), glucose, high-density lipoprotein, low-density lipoprotein and triglycerides. Demographic characteristics included race, age, gender, body mass index (BMI), diabetes mellitus, chronic kidney disease stage 3, ischaemic heart disease and smoking.

RESULTS

The mean decreases in SBP and DBP were 8.1 and 3.4 mm Hg, consistent with clinical trial data. Using a mean decrease in SBP of 5 mm Hg or in DBP of 2 mm Hg to define 'responders', 30.3% of patients did not respond. In univariable analyses, responders had higher BMI, baseline SBP, DBP, sodium and HbA1c, and lower creatinine. In multivariable analysis, responders were older and had significantly higher BMI and baseline SBP and DBP, and lower potassium. Increases in potassium and creatinine following spironolactone were larger in responders. When BP was evaluated as a continuous variable, decreases in SBP and DBP correlated with baseline BP, decrease in sodium and increases in potassium and creatinine following spironolactone. The decrease in SBP was associated with decreasing glucose in European Americans.

CONCLUSIONS

We developed an algorithm to assess BP response to a commonly prescribed medication for aTRH using EMRs. Electrolyte changes associated with the BP response to spironolactone are consistent with its mechanism of action of blocking the mineralocorticoid receptor and decreasing epithelial sodium channel activity.

Fu S, Leung LY, Raulli AO, Kallmes DF, Kinsman KA, Nelson KB, Clark MS, Luetmer PH, Kingsbury PR, Kent DM, Liu H. Assessment of the impact of EHR heterogeneity for clinical research through a case study of silent brain infarction. BMC Med Inform Decis Mak 2020;20:60. [PMID: 32228556 PMCID: PMC7106829 DOI: 10.1186/s12911-020-1072-9] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Received: 04/08/2019] [Accepted: 03/12/2020] [Indexed: 01/14/2023]

Abstract

Background

The rapid adoption of electronic health records (EHRs) holds great promise for advancing medicine through practice-based knowledge discovery. However, the validity of EHR-based clinical research is questionable due to poor research reproducibility caused by the heterogeneity and complexity of healthcare institutions and EHR systems, the cross-disciplinary nature of the research team, and the lack of standard processes and best practices for conducting EHR-based clinical research.

Method

We developed a data abstraction framework to standardize the process for multi-site EHR-based clinical studies aiming to enhance research reproducibility. The framework was implemented for a multi-site EHR-based research project, the ESPRESSO project, with the goal to identify individuals with silent brain infarctions (SBI) at Tufts Medical Center (TMC) and Mayo Clinic. The heterogeneity of healthcare institutions, EHR systems, documentation, and process variation in case identification was assessed quantitatively and qualitatively.

Result

We discovered significant variation in the patient populations, neuroimaging reporting, EHR systems, and abstraction processes across the two sites. The prevalence of SBI for patients over age 50 at TMC and Mayo is 7.4% and 12.5%, respectively. Neuroimaging reporting also varies: TMC’s reports are lengthy, standardized, and descriptive, while Mayo’s reports are short and definitive with more textual variation. Furthermore, differences in the EHR system, technology infrastructure, and data collection process were identified.

Conclusion

The implementation of the framework identified the institutional and process variations and the heterogeneity of EHRs across the sites participating in the case study. The experiment demonstrates the necessity to have a standardized process for data abstraction when conducting EHR-based clinical studies.

Krzhizhanovskaya VV, Závodszky G, Lees MH, Dongarra JJ, Sloot PMA, Brissos S, Teixeira J. Applicability of Machine Learning Methods to Multi-label Medical Text Classification. LECTURE NOTES IN COMPUTER SCIENCE 2020. [PMCID: PMC7303696 DOI: 10.1007/978-3-030-50423-6_38] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Indexed: 11/25/2022]

Sinnott JA, Cai F, Yu S, Hejblum BP, Hong C, Kohane IS, Liao KP. PheProb: probabilistic phenotyping using diagnosis codes to improve power for genetic association studies. J Am Med Inform Assoc 2019;25:1359-1365. [PMID: 29788308 DOI: 10.1093/jamia/ocy056] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Received: 12/09/2017] [Accepted: 04/23/2018] [Indexed: 12/24/2022]

Oni-Orisan A, Hoffmann TJ, Ranatunga D, Medina MW, Jorgenson E, Schaefer C, Krauss RM, Iribarren C, Risch N. Characterization of Statin Low-Density Lipoprotein Cholesterol Dose-Response Using Electronic Health Records in a Large Population-Based Cohort. CIRCULATION-GENOMIC AND PRECISION MEDICINE 2019;11:e002043. [PMID: 30354326 DOI: 10.1161/circgen.117.002043] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Indexed: 12/30/2022]

Abstract

BACKGROUND

Low-density lipoprotein cholesterol (LDL-C) response to statin therapy has not been fully elucidated in real-world populations. The primary objective of this study was to characterize statin LDL-C dose-response and its heritability in a large, multiethnic population of statin users.

METHODS

We determined the effect of statin dosing on lipid measures utilizing electronic health records in 33 139 statin users from the Kaiser Permanente GERA cohort (Genetic Epidemiology Research on Adult Health and Aging). The relationship between statin defined daily dose and lipid parameter response (percent change) was determined.

RESULTS

Defined daily dose and LDL-C response were associated in a log-linear relationship (β, -6.17; SE, 0.09; P<10^-300), which remained significant after adjusting for prespecified covariates (adjusted β, -5.59; SE, 0.12; P<10^-300). Statin type, sex, age, smoking status, diabetes mellitus, and East Asian race/ethnicity were significant independent predictors of statin-induced changes in LDL-C. Based on a variance-component method within the subset of statin users who had at least 1 first-degree relative who was also a statin user (n=1036), heritability of statin LDL-C response was estimated at 11.7% (SE, 8.6%; P=0.087).

CONCLUSIONS

Using electronic health record data, we observed a statin LDL-C dose-response consistent with the rule of 6% from prior clinical trial data. Clinical and demographic predictors of statin LDL-C response exhibited highly significant but modest effects. Finally, statin-induced changes in LDL-C were not found to be strongly inherited. Ultimately, these findings demonstrate (1) the utility of electronic health records as a reliable source to generate robust phenotypes for pharmacogenomic research and (2) the potential role of statin precision medicine in lipid management.

Bottinor WJ, Shuey MM, Manouchehri A, Farber-Eger EH, Xu M, Nair D, Salem JE, Wang TJ, Brittain EL. Renin-Angiotensin-Aldosterone System Modulates Blood Pressure Response During Vascular Endothelial Growth Factor Receptor Inhibition. JACC: CARDIOONCOLOGY 2019;1:14-23. [PMID: 32984850 PMCID: PMC7513950 DOI: 10.1016/j.jaccao.2019.07.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Indexed: 01/10/2023]

Abstract

Objectives

This study postulated that antihypertensive therapy with renin-angiotensin-aldosterone system (RAAS) inhibition may mitigate vascular endothelial growth factor inhibitor (VEGFi)–mediated increases in blood pressure more effectively than other antihypertensive medications in patients receiving VEGFi therapy.

Background

VEGFi therapy is commonly used in the treatment of cancer. One common side effect of VEGFi therapy is elevated blood pressure. Evidence suggests that the RAAS may be involved in VEGFi-mediated increases in blood pressure.

Methods

This retrospective cohort analysis was performed using a de-identified version of the electronic health record at Vanderbilt University Medical Center in Nashville, Tennessee. Subjects with cancer who were exposed to VEGFi therapy were identified, and blood pressure and medication data were extracted. Changes in mean systolic and diastolic blood pressure in response to VEGFi therapy in patients receiving RAAS inhibitor (RAASi) therapy before VEGFi initiation were compared with changes in mean systolic and diastolic blood pressure in patients not receiving RAASi therapy before VEGFi initiation.

Results

Mean systolic and diastolic blood pressure rose in both groups after VEGFi use; however, patients who had RAASi therapy before VEGFi initiation had a significantly lower increase in systolic blood pressure as compared with patients with no RAASi therapy (2.46 mm Hg [95% confidence interval: 0.7 to 4.2] compared with 4.56 mm Hg [95% confidence interval: 3.5 to 5.6], respectively; p = 0.034).

Conclusions

In a real-world clinical population, RAASi therapy before VEGFi initiation may ameliorate VEGFi-mediated increases in blood pressure. Randomized clinical trials are needed to further our understanding of the role of RAASi therapy in VEGFi-mediated increases in blood pressure.

Fu S, Leung LY, Wang Y, Raulli AO, Kallmes DF, Kinsman KA, Nelson KB, Clark MS, Luetmer PH, Kingsbury PR, Kent DM, Liu H. Natural Language Processing for the Identification of Silent Brain Infarcts From Neuroimaging Reports. JMIR Med Inform 2019;7:e12109. [PMID: 31066686 PMCID: PMC6524454 DOI: 10.2196/12109] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Received: 09/05/2018] [Revised: 02/26/2019] [Accepted: 03/30/2019] [Indexed: 01/25/2023]

Abstract

Background

Silent brain infarction (SBI) is defined as the presence of 1 or more brain lesions, presumed to be because of vascular occlusion, found by neuroimaging (magnetic resonance imaging or computed tomography) in patients without clinical manifestations of stroke. It is more common than stroke and can be detected in 20% of healthy elderly people. Early detection of SBI may mitigate the risk of stroke by offering preventative treatment plans. Natural language processing (NLP) techniques offer an opportunity to systematically identify SBI cases from electronic health records (EHRs) by extracting, normalizing, and classifying SBI-related incidental findings interpreted by radiologists from neuroimaging reports.

Objective

This study aimed to develop NLP systems to determine individuals with incidentally discovered SBIs from neuroimaging reports at 2 sites: Mayo Clinic and Tufts Medical Center.

Methods

Both rule-based and machine learning approaches were adopted in developing the NLP system. The rule-based system was implemented using the open source NLP pipeline MedTagger, developed by Mayo Clinic. Features for rule-based systems, including significant words and patterns related to SBI, were generated using pointwise mutual information. The machine learning models adopted were convolutional neural network (CNN), random forest, support vector machine, and logistic regression. The performance of the NLP algorithm was compared with a manually created gold standard. The gold standard dataset includes 1000 radiology reports randomly retrieved from the 2 study sites (Mayo and Tufts) corresponding to patients with no prior or current diagnosis of stroke or dementia. Of the 1000 reports, 400 were randomly sampled and double read to determine interannotator agreement. The gold standard dataset was split equally into 3 subsets for training, development, and testing.

Results

Among the 400 reports selected to determine interannotator agreement, 5 reports were removed due to invalid scan types. The interannotator agreements across Mayo and Tufts neuroimaging reports were 0.87 and 0.91, respectively. The rule-based system yielded the best performance of predicting SBI with an accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) of 0.991, 0.925, 1.000, 1.000, and 0.990, respectively. The CNN achieved the best score on predicting white matter disease (WMD) with an accuracy, sensitivity, specificity, PPV, and NPV of 0.994, 0.994, 0.994, 0.994, and 0.994, respectively.

Conclusions

We adopted a standardized data abstraction and modeling process to develop NLP techniques (rule-based and machine learning) to detect incidental SBIs and WMDs from annotated neuroimaging reports. Validation statistics suggested a high feasibility of detecting SBIs and WMDs from EHRs using NLP.
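
The accuracy, sensitivity, specificity, PPV, and NPV quoted in the Results above are all ratios over a 2x2 confusion matrix. The Python sketch below shows the arithmetic only. The function name `diagnostic_metrics` and the counts are hypothetical: the abstract does not report raw counts, and the numbers here were chosen solely so that the rounded outputs agree with the figures reported for the rule-based SBI system.

```python
def diagnostic_metrics(tp, fp, tn, fn):
    """Return standard validation statistics from confusion-matrix counts."""
    total = tp + fp + tn + fn
    return {
        "accuracy": (tp + tn) / total,
        "sensitivity": tp / (tp + fn),   # true positive rate
        "specificity": tn / (tn + fp),   # true negative rate
        "ppv": tp / (tp + fp),           # positive predictive value
        "npv": tn / (tn + fn),           # negative predictive value
    }


# Hypothetical counts (not taken from the paper): 37 true positives,
# 0 false positives, 300 true negatives, 3 false negatives.
metrics = diagnostic_metrics(tp=37, fp=0, tn=300, fn=3)
rounded = {name: round(value, 3) for name, value in metrics.items()}
print(rounded)
```

With no false positives, specificity and PPV are both exactly 1, which is why the two can be perfect while sensitivity stays below 1.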

Weissenkampen JD, Jiang Y, Eckert S, Jiang B, Li B, Liu DJ. Methods for the Analysis and Interpretation for Rare Variants Associated with Complex Traits. CURRENT PROTOCOLS IN HUMAN GENETICS 2019;101:e83. [PMID: 30849219 PMCID: PMC6455968 DOI: 10.1002/cphg.83] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Indexed: 12/19/2022]

Chan A, Chien I, Moseley E, Salman S, Kaminer Bourland S, Lamas D, Walling AM, Tulsky JA, Lindvall C. Deep learning algorithms to identify documentation of serious illness conversations during intensive care unit admissions. Palliat Med 2019;33:187-196. [PMID: 30427267 DOI: 10.1177/0269216318810421] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Indexed: 11/15/2022]

Abstract

Background

Timely documentation of care preferences is an endorsed quality indicator for seriously ill patients admitted to intensive care units. Clinicians document their conversations about these preferences as unstructured free text in clinical notes from electronic health records.

Aim

To apply deep learning algorithms for automated identification of serious illness conversations documented in physician notes during intensive care unit admissions.

Design

Using a retrospective dataset of physician notes, clinicians annotated all text documenting patient care preferences (goals of care or code status limitations), communication with family, and full code status. Clinician-coded text was used to train algorithms to identify documentation and to validate algorithms. The validated algorithms were deployed to assess the percentage of intensive care unit admissions of patients aged ⩾75 that had care preferences documented within the first 48 h.

Setting/participants

Patients admitted to one of five intensive care units.

Results

Algorithm performance was calculated by comparing machine-identified documentation to clinician-coded documentation. For detecting care preference documentation at the note level, the algorithm had F1-score of 0.92 (95% confidence interval, 0.89 to 0.95), sensitivity of 93.5% (95% confidence interval, 90.0% to 98.0%), and specificity of 91.0% (95% confidence interval, 86.4% to 95.3%). Applied to 1350 admissions of patients aged ⩾75, we found that 64.7% of patient intensive care unit admissions had care preferences documented within the first 48 h.

Conclusion

Deep learning algorithms identified patient care preference documentation with sensitivity and specificity approaching that of clinicians and computed in a tiny fraction of time. Future research should determine the generalizability of these methods in multiple healthcare systems.

Dietrich G, Krebs J, Liman L, Fette G, Ertl M, Kaspar M, Störk S, Puppe F. Replicating medication trend studies using ad hoc information extraction in a clinical data warehouse. BMC Med Inform Decis Mak 2019;19:15. [PMID: 30658633 PMCID: PMC6339317 DOI: 10.1186/s12911-018-0729-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Received: 07/27/2018] [Accepted: 12/21/2018] [Indexed: 11/16/2022]

Abstract

Background

Medication trend studies show the changes of medication over the years and may be replicated using a clinical Data Warehouse (CDW). Even nowadays, a lot of the patient information, like medication data, in the EHR is stored in the format of free text. As the conventional approach of information extraction (IE) demands a high developmental effort, we used ad hoc IE instead. This technique queries information and extracts it on the fly from texts contained in the CDW.

Methods

We present a generalizable approach of ad hoc IE for pharmacotherapy (medications and their daily dosage) presented in hospital discharge letters. We added import and query features to the CDW system, like error tolerant queries to deal with misspellings and proximity search for the extraction of the daily dosage. During the data integration process in the CDW, negated, historical and non-patient context data are filtered. For the replication studies, we used a drug list grouped by ATC (Anatomical Therapeutic Chemical Classification System) codes as input for queries to the CDW.

Results

We achieve an F1 score of 0.983 (precision 0.997, recall 0.970) for extracting medication from discharge letters and an F1 score of 0.974 (precision 0.977, recall 0.972) for extracting the dosage. We replicated three published medical trend studies for hypertension, atrial fibrillation and chronic kidney disease. Overall, 93% of the main findings could be replicated, 68% of sub-findings, and 75% of all findings. One study could be completely replicated with all main and sub-findings.

Conclusion

A novel approach for ad hoc IE is presented. It is very suitable for basic medical texts like discharge letters and finding reports. Ad hoc IE is by definition more limited than conventional IE and does not claim to replace it, but it substantially exceeds the search capabilities of many CDWs and it is convenient to conduct replication studies fast and with high quality.
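
The F1 scores in the Results above can be checked from the stated precision and recall, since F1 is their harmonic mean. A minimal Python sketch (the helper name `f1_score` is ours, not from the paper):

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# Precision/recall pairs as stated in the abstract.
medication_f1 = round(f1_score(0.997, 0.970), 3)
dosage_f1 = round(f1_score(0.977, 0.972), 3)
print(medication_f1, dosage_f1)
```

Both values round to the figures given in the abstract (0.983 for medication extraction and 0.974 for dosage extraction).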

Wang Y, Sohn S, Liu S, Shen F, Wang L, Atkinson EJ, Amin S, Liu H. A clinical text classification paradigm using weak supervision and deep representation. BMC Med Inform Decis Mak 2019;19:1. [PMID: 30616584 PMCID: PMC6322223 DOI: 10.1186/s12911-018-0723-6] [Citation(s) in RCA: 138] [Impact Index Per Article: 27.6] [Received: 05/23/2018] [Accepted: 12/10/2018] [Indexed: 01/02/2023]

Abstract

BACKGROUND

Automatic clinical text classification is a natural language processing (NLP) technology that unlocks information embedded in clinical narratives. Machine learning approaches have been shown to be effective for clinical text classification tasks. However, a successful machine learning model usually requires extensive human efforts to create labeled training data and conduct feature engineering. In this study, we propose a clinical text classification paradigm using weak supervision and deep representation to reduce these human efforts.

METHODS

We develop a rule-based NLP algorithm to automatically generate labels for the training data, and then use the pre-trained word embeddings as deep representation features for training machine learning models. Since machine learning is trained on labels generated by the automatic NLP algorithm, this training process is called weak supervision. We evaluate the paradigm's effectiveness on two institutional case studies at Mayo Clinic: smoking status classification and proximal femur (hip) fracture classification, and one case study using a public dataset: the i2b2 2006 smoking status classification shared task. We test four widely used machine learning models, namely, Support Vector Machine (SVM), Random Forest (RF), Multilayer Perceptron Neural Networks (MLPNN), and Convolutional Neural Networks (CNN), using this paradigm. Precision, recall, and F1 score are used as metrics to evaluate performance.

RESULTS

CNN achieves the best performance in both institutional tasks (F1 score: 0.92 for Mayo Clinic smoking status classification and 0.97 for fracture classification). We show that word embeddings significantly outperform tf-idf and topic modeling features in the paradigm, and that CNN captures additional patterns from the weak supervision compared to the rule-based NLP algorithms. We also observe two drawbacks of the proposed paradigm: CNN is more sensitive to the size of training data, and the paradigm might not be effective for complex multiclass classification tasks.

CONCLUSION

The proposed clinical text classification paradigm could reduce the human effort of creating labeled training data and engineering features when applying machine learning to clinical text classification, by leveraging weak supervision and deep representation. The experiments validated the effectiveness of the paradigm on two institutional and one shared clinical text classification tasks.
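
The weak-supervision idea described in the Methods above (a rule labels the training notes, then a statistical model is trained on those labels) can be sketched in a few lines of Python. Everything here is a hypothetical illustration, not the authors' pipeline: the keyword rule, the toy notes, and the function names are invented, and a bag-of-words naive Bayes classifier stands in for the word embeddings and CNN used in the paper.

```python
import math
from collections import Counter, defaultdict


def rule_label(note):
    """Hypothetical keyword rule standing in for the rule-based NLP labeler."""
    text = note.lower()
    if "never smoked" in text or "denies smoking" in text:
        return "non-smoker"
    if "smoker" in text or "smokes" in text:
        return "smoker"
    return None  # the rule abstains


def train_naive_bayes(notes, labels):
    """Count tokens per class; these counts are the whole model."""
    word_counts = defaultdict(Counter)
    class_counts = Counter(labels)
    vocab = set()
    for note, label in zip(notes, labels):
        tokens = note.lower().split()
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return word_counts, class_counts, vocab


def predict(model, note):
    """Pick the class with the highest Laplace-smoothed log-probability."""
    word_counts, class_counts, vocab = model
    total = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label in sorted(class_counts):
        score = math.log(class_counts[label] / total)
        denom = sum(word_counts[label].values()) + len(vocab)
        for token in note.lower().split():
            if token in vocab:  # ignore tokens never seen in training
                score += math.log((word_counts[label][token] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy, invented notes; no manual labels are used anywhere.
unlabeled_notes = [
    "current smoker one pack per day tobacco use",
    "patient smokes cigarettes daily",
    "never smoked no tobacco history",
    "denies smoking and denies tobacco",
    "follow up for hip pain",
]
weak = [(n, rule_label(n)) for n in unlabeled_notes]
train = [(n, y) for n, y in weak if y is not None]  # drop abstentions
model = train_naive_bayes([n for n, _ in train], [y for _, y in train])

# The rule abstains on this note, but the trained model still classifies it.
prediction = predict(model, "cigarettes daily")
print(rule_label("cigarettes daily"), prediction)
```

The last two lines illustrate the point made in the Results: a model trained on rule-generated labels can pick up patterns (here, co-occurring words) that the rule itself does not encode.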

Smoller JW. The use of electronic health records for psychiatric phenotyping and genomics. Am J Med Genet B Neuropsychiatr Genet 2018;177:601-612. [PMID: 28557243 PMCID: PMC6440216 DOI: 10.1002/ajmg.b.32548] [Citation(s) in RCA: 58] [Impact Index Per Article: 9.7] [Received: 03/19/2017] [Accepted: 04/20/2017] [Indexed: 12/22/2022]

Wong A, Plasek JM, Montecalvo SP, Zhou L. Natural Language Processing and Its Implications for the Future of Medication Safety: A Narrative Review of Recent Advances and Challenges. Pharmacotherapy 2018;38:822-841. [PMID: 29884988 DOI: 10.1002/phar.2151] [Citation(s) in RCA: 46] [Impact Index Per Article: 7.7] [Indexed: 01/12/2023]

Abstract

The safety of medication use has been a priority in the United States since the late 1930s. Recently, it has gained prominence due to the increasing amount of data suggesting that a large amount of patient harm is preventable and can be mitigated with effective risk strategies that have not been sufficiently adopted. Adverse events from medications are part of clinical practice, but the ability to identify a patient's risk and to minimize that risk must be a priority. The ability to identify adverse events has been a challenge due to limitations of available data sources, which are often free text. The use of natural language processing (NLP) may help to address these limitations. NLP is the artificial intelligence domain of computer science that uses computers to manipulate unstructured data (i.e., narrative text or speech data) in the context of a specific task. In this narrative review, we illustrate the fundamentals of NLP and discuss NLP's application to medication safety in four data sources: electronic health records, Internet-based data, published literature, and reporting systems. Given the magnitude of available data from these sources, a growing area is the use of computer algorithms to help automatically detect associations between medications and adverse effects. The main benefit of NLP is in the time savings associated with automation of various medication safety tasks such as the medication reconciliation process facilitated by computers, as well as the potential for near-real-time identification of adverse events for postmarketing surveillance such as those posted on social media that would otherwise go unanalyzed. NLP is limited by a lack of data sharing between health care organizations due to insufficient interoperability capabilities, inhibiting large-scale adverse event monitoring across populations. 
We anticipate that future work in this area will focus on the integration of data sources from different domains to improve the ability to identify potential adverse events more quickly and to improve clinical decision support with regard to a patient's estimated risk for specific adverse events at the time of medication prescription or review.

Shuey MM, Gandelman JS, Chung CP, Nian H, Yu C, Denny JC, Brown NJ. Characteristics and treatment of African-American and European-American patients with resistant hypertension identified using the electronic health record in an academic health centre: a case-control study. BMJ Open 2018;8:e021640. [PMID: 29950471 PMCID: PMC6020960 DOI: 10.1136/bmjopen-2018-021640] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Indexed: 12/15/2022] Open

Abstract

OBJECTIVE

To identify patients with hypertension with resistant and controlled blood pressure (BP) using electronic health records (EHRs) in order to elucidate practices in the real-world clinical treatment of hypertension and to enable future genetic studies.

DESIGN

Using EHRs, we developed and validated algorithms to identify patients with resistant and controlled hypertension.

SETTING

An academic medical centre in Nashville, Tennessee.

POPULATION

European-American (EA) and African-American (AA) patients with hypertension.

MAIN OUTCOME MEASURES

Demographic characteristics: race, age, gender, body mass index, outpatient BPs and the history of diabetes mellitus, chronic kidney disease stage 3, ischaemic heart disease, transient ischaemic attack, atrial fibrillation and sleep apnoea.

MEDICATION TREATMENT

All antihypertensive medication classes prescribed to a patient at the time of classification and ever prescribed following classification.

RESULTS

The algorithms had performance metrics exceeding 92%. The prevalence of resistant hypertension in the total hypertensive population was 7.3% in EA and 10.5% in AA. At diagnosis, AA were younger, heavier, more often female and had a higher incidence of type 2 diabetes and higher BPs than EA. AA with resistant hypertension were more likely to be treated with vasodilators, dihydropyridine calcium channel blockers and alpha-2 agonists while EA were more likely to be treated with angiotensin receptor blockers, renin inhibitors and beta blockers. Mineralocorticoid receptor antagonist use was increased in patients treated with more than four antihypertensive medications compared with patients treated with three (12.4% vs 2.6% in EA, p<0.001; 12.3% vs 2.8% in AA, p<0.001). The number of patients treated with a mineralocorticoid receptor antagonist increased to 37.4% in EA and 41.2% in AA over a mean follow-up period of 7.4 and 8.7 years, respectively.

CONCLUSIONS

Clinical treatment of resistant hypertension differs in EA and AA patients. These results demonstrate the feasibility of identifying resistant hypertension using an EHR.

Wang Y, Wang L, Rastegar-Mojarad M, Moon S, Shen F, Afzal N, Liu S, Zeng Y, Mehrabi S, Sohn S, Liu H. Clinical information extraction applications: A literature review. J Biomed Inform 2018;77:34-49. [PMID: 29162496 PMCID: PMC5771858 DOI: 10.1016/j.jbi.2017.11.011] [Citation(s) in RCA: 316] [Impact Index Per Article: 52.7] [Received: 06/30/2017] [Revised: 11/01/2017] [Accepted: 11/17/2017] [Indexed: 12/24/2022]

Kennell TI, Willig JH, Cimino JJ. Clinical Informatics Researcher's Desiderata for the Data Content of the Next Generation Electronic Health Record. Appl Clin Inform 2017;8:1159-1172. [PMID: 29270955 DOI: 10.4338/aci-2017-06-r-0101] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Indexed: 01/29/2023] Open

Abstract

OBJECTIVE

Clinical informatics researchers depend on the availability of high-quality data from the electronic health record (EHR) to design and implement new methods and systems for clinical practice and research. However, these data are frequently unavailable or present in a format that requires substantial revision. This article reports the results of a review of informatics literature published from 2010 to 2016 that addresses these issues by identifying categories of data content that might be included or revised in the EHR.

MATERIALS AND METHODS

We used an iterative review process on 1,215 biomedical informatics research articles. We placed them into generic categories, reviewed and refined the categories, and then assigned additional articles, for a total of three iterations.

RESULTS

Our process identified eight categories of data content issues: Adverse Events, Clinician Cognitive Processes, Data Standards Creation and Data Communication, Genomics, Medication List Data Capture, Patient Preferences, Patient-reported Data, and Phenotyping.

DISCUSSION

These categories summarize discussions in biomedical informatics literature that concern data content issues restricting clinical informatics research. These barriers to research result from data that are either absent from the EHR or are inadequate (e.g., in narrative text form) for the downstream applications of the data. In light of these categories, we discuss changes to EHR data storage that should be considered in the redesign of EHRs, to promote continued innovation in clinical informatics.

CONCLUSION

Based on published literature of clinical informaticians' reuse of EHR data, we characterize eight types of data content that, if included in the next generation of EHRs, would find immediate application in advanced informatics tools and techniques.

Sanchez Bocanegra CL, Sevillano Ramos JL, Rizo C, Civit A, Fernandez-Luque L. HealthRecSys: A semantic content-based recommender system to complement health videos. BMC Med Inform Decis Mak 2017;17:63. [PMID: 28506225 PMCID: PMC5433022 DOI: 10.1186/s12911-017-0431-7] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Received: 11/11/2016] [Accepted: 03/24/2017] [Indexed: 11/17/2022] Open

Abstract

BACKGROUND

The Internet, and its popularity, continues to grow at an unprecedented pace. Watching videos online is very popular; it is estimated that 500 h of video are uploaded onto YouTube, a video-sharing service, every minute and that, by 2019, video formats will comprise more than 80% of Internet traffic. Health-related videos are very popular on YouTube, but their quality is always a matter of concern. One approach to enhancing the quality of online videos is to provide additional educational health content, such as websites, to support health consumers. This study investigates the feasibility of building a content-based recommender system that links health consumers to reputable health educational websites from MedlinePlus for a given health video from YouTube.

METHODS

The dataset for this study includes a collection of health-related videos and their available metadata. Semantic technologies (such as SNOMED-CT and Bio-ontology) were used to recommend health websites from MedlinePlus. A total of 26 health professionals participated in evaluating 253 recommended links for a total of 53 videos about general health, hypertension, or diabetes. The relevance of the recommended health websites from MedlinePlus to the videos was measured using information retrieval metrics such as the normalized discounted cumulative gain and precision at K.
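The two retrieval metrics named above can be sketched as follows. This is a minimal illustration using one common formulation of discounted cumulative gain (graded relevance divided by log2 of rank plus one); the abstract does not state which variant the authors used, and the ratings in the example are invented.

```python
import math

def precision_at_k(relevance, k):
    """Fraction of the top-k recommended links rated relevant (rating > 0)."""
    top = relevance[:k]
    return sum(1 for r in top if r > 0) / k

def dcg_at_k(relevance, k):
    """Discounted cumulative gain: each rating is discounted by log2(rank + 1)."""
    return sum(
        rel / math.log2(rank + 1)
        for rank, rel in enumerate(relevance[:k], start=1)
    )

def ndcg_at_k(relevance, k):
    """DCG normalized by the DCG of the ideal (descending) ordering."""
    ideal = dcg_at_k(sorted(relevance, reverse=True), k)
    return dcg_at_k(relevance, k) / ideal if ideal > 0 else 0.0

# Invented ratings for three recommended links, in ranked order
# (2 = highly relevant, 1 = somewhat relevant, 0 = not relevant).
ratings = [2, 0, 1]
print(precision_at_k(ratings, 3), round(ndcg_at_k(ratings, 3), 3))
```

Precision at K treats relevance as binary, whereas normalized discounted cumulative gain also rewards placing the most relevant links near the top of the list.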

RESULTS

The majority of websites recommended by our system for health videos were relevant, based on ratings by health professionals. The normalized discounted cumulative gain was between 46% and 90% for the different topics.

CONCLUSIONS

Our study demonstrates the feasibility of using a semantic content-based recommender system to enrich YouTube health videos. Evaluation with end-users, in addition to healthcare professionals, will be required to identify the acceptance of these recommendations in a nonsimulated information-seeking context.

Teixeira PL, Wei WQ, Cronin RM, Mo H, VanHouten JP, Carroll RJ, LaRose E, Bastarache LA, Rosenbloom ST, Edwards TL, Roden DM, Lasko TA, Dart RA, Nikolai AM, Peissig PL, Denny JC. Evaluating electronic health record data sources and algorithmic approaches to identify hypertensive individuals. J Am Med Inform Assoc 2016;24:162-171. [PMID: 27497800 DOI: 10.1093/jamia/ocw071] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.8] [Received: 11/01/2015] [Revised: 04/03/2016] [Accepted: 04/07/2016] [Indexed: 12/11/2022] Open

Abstract

OBJECTIVE

Phenotyping algorithms applied to electronic health record (EHR) data enable investigators to identify large cohorts for clinical and genomic research. Algorithm development is often iterative, depends on fallible investigator intuition, and is time- and labor-intensive. We developed and evaluated 4 types of phenotyping algorithms and categories of EHR information to identify hypertensive individuals and controls and provide a portable module for implementation at other sites.

MATERIALS AND METHODS

We reviewed the EHRs of 631 individuals followed at Vanderbilt for hypertension status. We developed features and phenotyping algorithms of increasing complexity. Input categories included International Classification of Diseases, Ninth Revision (ICD9) codes, medications, vital signs, narrative-text search results, and Unified Medical Language System (UMLS) concepts extracted using natural language processing (NLP). We developed a module and tested portability by replicating 10 of the best-performing algorithms at the Marshfield Clinic.

RESULTS

Random forests using billing codes, medications, vitals, and concepts had the best performance with a median area under the receiver operator characteristic curve (AUC) of 0.976. Normalized sums of all 4 categories also performed well (0.959 AUC). The best non-NLP algorithm combined normalized ICD9 codes, medications, and blood pressure readings with a median AUC of 0.948. Blood pressure cutoffs or ICD9 code counts alone had AUCs of 0.854 and 0.908, respectively. Marshfield Clinic results were similar.

CONCLUSION

This work shows that billing codes or blood pressure readings alone yield good hypertension classification performance. However, even simple combinations of input categories improve performance. The most complex algorithms classified hypertension with excellent recall and precision.

Affiliation(s)

Pedro L Teixeira Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN, USA
Wei-Qi Wei Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN, USA
Robert M Cronin Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN, USA
Huan Mo Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN, USA
Jacob P VanHouten Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN, USA; Department of Biostatistics, Vanderbilt University School of Medicine, Nashville, TN, USA
Robert J Carroll Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN, USA
Eric LaRose Biomedical Informatics Research Center, Marshfield Clinic Research Foundation, 1000 N Oak Ave - ML8, Marshfield, WI 54449, USA
Lisa A Bastarache Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN, USA
S Trent Rosenbloom Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN, USA; Department of Medicine, Vanderbilt University School of Medicine, Nashville, TN, USA
Todd L Edwards Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN, USA
Dan M Roden Department of Medicine, Vanderbilt University School of Medicine, Nashville, TN, USA; Department of Pharmacology, Vanderbilt University School of Medicine, Nashville, TN, USA
Thomas A Lasko Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN, USA
Richard A Dart Center for Human Genetics, Marshfield Clinic Research Foundation, 1000 N Oak Ave-MLR, Marshfield, WI 54449, USA
Anne M Nikolai Biomedical Informatics Research Center, Marshfield Clinic Research Foundation, 1000 N Oak Ave - ML8, Marshfield, WI 54449, USA
Peggy L Peissig Biomedical Informatics Research Center, Marshfield Clinic Research Foundation, 1000 N Oak Ave - ML8, Marshfield, WI 54449, USA
Joshua C Denny Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN, USA; Department of Medicine, Vanderbilt University School of Medicine, Nashville, TN, USA

Perera G, Broadbent M, Callard F, Chang CK, Downs J, Dutta R, Fernandes A, Hayes RD, Henderson M, Jackson R, Jewell A, Kadra G, Little R, Pritchard M, Shetty H, Tulloch A, Stewart R. Cohort profile of the South London and Maudsley NHS Foundation Trust Biomedical Research Centre (SLaM BRC) Case Register: current status and recent enhancement of an Electronic Mental Health Record-derived data resource. BMJ Open 2016;6:e008721. [PMID: 26932138 PMCID: PMC4785292 DOI: 10.1136/bmjopen-2015-008721] [Citation(s) in RCA: 316] [Impact Index Per Article: 39.5] [Indexed: 01/30/2023] Open

Abstract

PURPOSE

The South London and Maudsley National Health Service (NHS) Foundation Trust Biomedical Research Centre (SLaM BRC) Case Register and its Clinical Record Interactive Search (CRIS) application were developed in 2008, generating a research repository of real-time, anonymised, structured and open-text data derived from the electronic health record system used by SLaM, a large mental healthcare provider in southeast London. In this paper, we update this register's descriptive data, and describe the substantial expansion and extension of the data resource since its original development.

PARTICIPANTS

Descriptive data were generated from the SLaM BRC Case Register on 31 December 2014. Currently, there are over 250,000 patient records accessed through CRIS.

FINDINGS TO DATE

Since 2008, the most significant developments in the SLaM BRC Case Register have been the introduction of natural language processing to extract structured data from open-text fields, linkages to external sources of data, and the addition of a parallel relational database (Structured Query Language) output. Natural language processing applications to date have brought in new and hitherto inaccessible data on cognitive function, education, social care receipt, smoking, diagnostic statements and pharmacotherapy. In addition, through external data linkages, large volumes of supplementary information have been accessed on mortality, hospital attendances and cancer registrations.

FUTURE PLANS

Coupled with robust data security and governance structures, electronic health records provide potentially transformative information on mental disorders and outcomes in routine clinical care. The SLaM BRC Case Register continues to grow as a database, with approximately 20,000 new cases added each year, in addition to extension of follow-up for existing cases. Data linkages and natural language processing present important opportunities to enhance this type of research resource further, achieving both volume and depth of data. However, research projects still need to be carefully tailored, so that they take into account the nature and quality of the source information.

Laper SM, Restrepo NA, Crawford DC. The challenges in using electronic health records for pharmacogenomics and precision medicine research. Pacific Symposium on Biocomputing 2016;21:369-80. [PMID: 26776201 PMCID: PMC4720980] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/05/2023]

Warner JL, Levy MA, Neuss MN. ReCAP: Feasibility and Accuracy of Extracting Cancer Stage Information From Narrative Electronic Health Record Data. J Oncol Pract 2015;12:157-8; e169-7. [PMID: 26306621 DOI: 10.1200/jop.2015.004622] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Indexed: 11/20/2022] Open

Pereira NL, Sargent DJ, Farkouh ME, Rihal CS. Genotype-based clinical trials in cardiovascular disease. Nat Rev Cardiol 2015;12:475-87. [PMID: 25940926 PMCID: PMC4687401 DOI: 10.1038/nrcardio.2015.64] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Indexed: 01/11/2023]

Crawford DC, Goodloe R, Farber-Eger E, Boston J, Pendergrass SA, Haines JL, Ritchie MD, Bush WS. Leveraging Epidemiologic and Clinical Collections for Genomic Studies of Complex Traits. Hum Hered 2015. [PMID: 26201699 DOI: 10.1159/000381805] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Indexed: 12/23/2022] Open

Li L. The potential of translational bioinformatics approaches for pharmacology research. Br J Clin Pharmacol 2015;80:862-7. [PMID: 25753093 DOI: 10.1111/bcp.12622] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Received: 10/07/2014] [Revised: 02/11/2015] [Accepted: 02/15/2015] [Indexed: 12/17/2022] Open

The effects of electronic medical record phenotyping details on genetic association studies: HDL-C as a case study. BioData Min 2015;8:15. [PMID: 25969697 PMCID: PMC4428098 DOI: 10.1186/s13040-015-0048-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Received: 05/22/2014] [Accepted: 04/28/2015] [Indexed: 02/01/2023] Open

Castro VM, Minnier J, Murphy SN, Kohane I, Churchill SE, Gainer V, Cai T, Hoffnagle AG, Dai Y, Block S, Weill SR, Nadal-Vicens M, Pollastri AR, Rosenquist JN, Goryachev S, Ongur D, Sklar P, Perlis RH, Smoller JW. Validation of electronic health record phenotyping of bipolar disorder cases and controls. Am J Psychiatry 2015;172:363-72. [PMID: 25827034 PMCID: PMC4441333 DOI: 10.1176/appi.ajp.2014.14030423] [Citation(s) in RCA: 86] [Impact Index Per Article: 9.6] [Indexed: 11/30/2022]

Affiliation(s)

Victor M. Castro Partners Research Information Systems and Computing, Oregon Health & Science University, Portland, OR
Jessica Minnier Department of Public Health & Preventive Medicine, Oregon Health & Science University, Portland, OR
Shawn N. Murphy Partners Research Information Systems and Computing, Oregon Health & Science University, Portland, OR; Laboratory of Computer Science and Department of Neurology, Massachusetts General Hospital, Boston, MA
Isaac Kohane Center for Biomedical Informatics, Harvard Medical School, Boston, MA
Susanne E. Churchill Center for Biomedical Informatics, Harvard Medical School, Boston, MA
Vivian Gainer Partners Research Information Systems and Computing, Oregon Health & Science University, Portland, OR
Tianxi Cai Department of Biostatistics, Harvard School of Public Health, Boston, MA
Alison G. Hoffnagle Psychiatric and Neurodevelopmental Genetics Unit, Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA
Yael Dai Psychiatric and Neurodevelopmental Genetics Unit, Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA
Stefanie Block Psychiatric and Neurodevelopmental Genetics Unit, Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA
Sydney R. Weill Psychiatric and Neurodevelopmental Genetics Unit, Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA
Mireya Nadal-Vicens Center for Anxiety and Traumatic Stress Disorders, Massachusetts General Hospital, Boston, MA; Department of Psychiatry, Massachusetts General Hospital, Boston, MA
Alisha R. Pollastri Psychiatric and Neurodevelopmental Genetics Unit, Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA; Department of Psychiatry, Massachusetts General Hospital, Boston, MA
J. Niels Rosenquist Psychiatric and Neurodevelopmental Genetics Unit, Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA; Department of Psychiatry, Massachusetts General Hospital, Boston, MA
Sergey Goryachev Partners Research Information Systems and Computing, Oregon Health & Science University, Portland, OR
Dost Ongur McLean Hospital, Belmont, MA
Pamela Sklar Division of Psychiatric Genomics, Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY
Roy H. Perlis Psychiatric and Neurodevelopmental Genetics Unit, Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA; Department of Psychiatry, Massachusetts General Hospital, Boston, MA; Center for Experimental Drugs and Diagnostics, Massachusetts General Hospital, Boston, MA
Jordan W. Smoller Psychiatric and Neurodevelopmental Genetics Unit, Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA; Department of Psychiatry, Massachusetts General Hospital, Boston, MA

Lin C, Karlson EW, Dligach D, Ramirez MP, Miller TA, Mo H, Braggs NS, Cagan A, Gainer V, Denny JC, Savova GK. Automatic identification of methotrexate-induced liver toxicity in patients with rheumatoid arthritis from the electronic medical record. J Am Med Inform Assoc 2015;22:e151-61. [PMID: 25344930 PMCID: PMC5901122 DOI: 10.1136/amiajnl-2014-002642] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.2] [Received: 01/09/2014] [Revised: 08/14/2014] [Accepted: 08/22/2014] [Indexed: 12/22/2022] Open

Pradhan S, Elhadad N, South BR, Martinez D, Christensen L, Vogel A, Suominen H, Chapman WW, Savova G. Evaluating the state of the art in disorder recognition and normalization of the clinical narrative. J Am Med Inform Assoc 2015;22:143-54. [PMID: 25147248 PMCID: PMC4433360 DOI: 10.1136/amiajnl-2013-002544] [Citation(s) in RCA: 77] [Impact Index Per Article: 8.6] [Received: 12/16/2013] [Revised: 07/16/2014] [Accepted: 07/21/2014] [Indexed: 11/04/2022] Open

Denny JC. Surveying Recent Themes in Translational Bioinformatics: Big Data in EHRs, Omics for Drugs, and Personal Genomics. Yearb Med Inform 2014;9:199-205. [PMID: 25123743 PMCID: PMC4287076 DOI: 10.15265/iy-2014-0015] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Indexed: 01/23/2023] Open

Ross MK, Wei W, Ohno-Machado L. "Big data" and the electronic health record. Yearb Med Inform 2014;9:97-104. [PMID: 25123728 DOI: 10.15265/iy-2014-0003] [Citation(s) in RCA: 76] [Impact Index Per Article: 7.6] [Indexed: 11/24/2022] Open

Xu H, Aldrich MC, Chen Q, Liu H, Peterson NB, Dai Q, Levy M, Shah A, Han X, Ruan X, Jiang M, Li Y, Julien JS, Warner J, Friedman C, Roden DM, Denny JC. Validating drug repurposing signals using electronic health records: a case study of metformin associated with reduced cancer mortality. J Am Med Inform Assoc 2014;22:179-91. [PMID: 25053577 PMCID: PMC4433365 DOI: 10.1136/amiajnl-2014-002649] [Citation(s) in RCA: 141] [Impact Index Per Article: 14.1] [Indexed: 01/09/2023] Open

Abstract

Objectives Drug repurposing, which finds new indications for existing drugs, has received great attention recently. The goal of our work is to assess the feasibility of using electronic health records (EHRs) and automated informatics methods to efficiently validate a recent drug repurposing association of metformin with reduced cancer mortality.

Methods By linking two large EHRs from Vanderbilt University Medical Center and Mayo Clinic to their tumor registries, we constructed a cohort including 32 415 adults with a cancer diagnosis at Vanderbilt and 79 258 cancer patients at Mayo from 1995 to 2010. Using automated informatics methods, we further identified type 2 diabetes patients within the cancer cohort and determined their drug exposure information, as well as other covariates such as smoking status. We then estimated HRs for all-cause mortality and their associated 95% CIs using stratified Cox proportional hazard models. HRs were estimated according to metformin exposure, adjusted for age at diagnosis, sex, race, body mass index, tobacco use, insulin use, cancer type, and non-cancer Charlson comorbidity index.

Results Among all Vanderbilt cancer patients, metformin was associated with a 22% decrease in overall mortality compared to other oral hypoglycemic medications (HR 0.78; 95% CI 0.69 to 0.88) and with a 39% decrease compared to type 2 diabetes patients on insulin only (HR 0.61; 95% CI 0.50 to 0.73). Diabetic patients on metformin also had a 23% improved survival compared with non-diabetic patients (HR 0.77; 95% CI 0.71 to 0.85). These associations were replicated using the Mayo Clinic EHR data. Many site-specific cancers including breast, colorectal, lung, and prostate demonstrated reduced mortality with metformin use in at least one EHR.
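The percentages quoted above follow from the hazard ratios by simple arithmetic: a hazard ratio below 1 corresponds to a relative reduction of (1 − HR) × 100%. The short sketch below reproduces the three reported figures; the function name is a hypothetical helper, and the calculation is only the point-estimate conversion, not the Cox modelling itself.

```python
def percent_reduction(hazard_ratio: float) -> int:
    """Convert a hazard ratio below 1 into a rounded percent reduction in hazard."""
    return round((1 - hazard_ratio) * 100)

# Hazard ratios as reported in the abstract.
reported = {
    "vs other oral hypoglycemics": 0.78,
    "vs insulin only": 0.61,
    "vs non-diabetic patients": 0.77,
}
reductions = {name: percent_reduction(hr) for name, hr in reported.items()}
print(reductions)
```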

Conclusions EHR data suggested that the use of metformin was associated with decreased mortality after a cancer diagnosis compared with diabetic and non-diabetic cancer patients not on metformin, indicating its potential as a chemotherapeutic regimen. This study serves as a model for robust and inexpensive validation studies for drug repurposing signals using EHR data.

Affiliation(s)

Hua Xu The University of Texas School of Biomedical Informatics at Houston, Houston, Texas, USA
Melinda C Aldrich Department of Thoracic Surgery, Vanderbilt University School of Medicine, Nashville, Tennessee, USA; Division of Epidemiology, Vanderbilt University School of Medicine, Nashville, Tennessee, USA
Qingxia Chen Department of Biostatistics, Vanderbilt University School of Medicine, Nashville, Tennessee, USA; Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, USA
Hongfang Liu Division of Biomedical Statistics and Informatics, Mayo Clinic, Rochester, Minnesota, USA
Neeraja B Peterson Department of Medicine, Vanderbilt University School of Medicine, Nashville, Tennessee, USA
Qi Dai Division of Epidemiology, Vanderbilt University School of Medicine, Nashville, Tennessee, USA
Mia Levy Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, USA; Department of Medicine, Vanderbilt University School of Medicine, Nashville, Tennessee, USA
Anushi Shah Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, USA
Xue Han Department of Biostatistics, Vanderbilt University School of Medicine, Nashville, Tennessee, USA
Xiaoyang Ruan Division of Biomedical Statistics and Informatics, Mayo Clinic, Rochester, Minnesota, USA
Min Jiang The University of Texas School of Biomedical Informatics at Houston, Houston, Texas, USA
Ying Li Department of Biomedical Informatics, Columbia University, New York, New York, USA
Jamii St Julien Department of Thoracic Surgery, Vanderbilt University School of Medicine, Nashville, Tennessee, USA
Jeremy Warner Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, USA; Department of Medicine, Vanderbilt University School of Medicine, Nashville, Tennessee, USA
Carol Friedman Department of Biomedical Informatics, Columbia University, New York, New York, USA
Dan M Roden Department of Medicine, Vanderbilt University School of Medicine, Nashville, Tennessee, USA; Department of Pharmacology, Vanderbilt University School of Medicine, Nashville, Tennessee, USA
Joshua C Denny Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, USA; Department of Medicine, Vanderbilt University School of Medicine, Nashville, Tennessee, USA

Rosenbloom ST, Harris P, Pulley J, Basford M, Grant J, DuBuisson A, Rothman RL. The Mid-South clinical Data Research Network. J Am Med Inform Assoc 2014;21:627-32. [PMID: 24821742 PMCID: PMC4078290 DOI: 10.1136/amiajnl-2014-002745] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open

Wei WQ, Feng Q, Weeke P, Bush W, Waitara MS, Iwuchukwu OF, Roden DM, Wilke RA, Stein CM, Denny JC. Creation and Validation of an EMR-based Algorithm for Identifying Major Adverse Cardiac Events while on Statins. AMIA Jt Summits Transl Sci Proc 2014;2014:112-9. [PMID: 25717410 PMCID: PMC4333709] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Masanz J, Pakhomov SV, Xu H, Wu ST, Chute CG, Liu H. Open Source Clinical NLP - More than Any Single System. AMIA Jt Summits Transl Sci Proc 2014;2014:76-82. [PMID: 25954581 PMCID: PMC4419764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]

Jiang M, Wu Y, Shah A, Priyanka P, Denny JC, Xu H. Extracting and standardizing medication information in clinical text - the MedEx-UIMA system. AMIA Jt Summits Transl Sci Proc 2014;2014:37-42. [PMID: 25954575 PMCID: PMC4419757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Bui DDA, Zeng-Treitler Q. Learning regular expressions for clinical text classification. J Am Med Inform Assoc 2014;21:850-7. [PMID: 24578357 DOI: 10.1136/amiajnl-2013-002411] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open

Secondary use of clinical data: the Vanderbilt approach. J Biomed Inform 2014;52:28-35. [PMID: 24534443 DOI: 10.1016/j.jbi.2014.02.003] [Citation(s) in RCA: 174] [Impact Index Per Article: 17.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2013] [Revised: 12/21/2013] [Accepted: 02/04/2014] [Indexed: 01/04/2023]

Pathak J, Kho AN, Denny JC. Electronic health records-driven phenotyping: challenges, recent advances, and perspectives. J Am Med Inform Assoc 2014;20:e206-11. [PMID: 24302669 DOI: 10.1136/amiajnl-2013-002428] [Citation(s) in RCA: 165] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022] Open

Oetjens M, Bush WS, Birdwell KA, Dilks HH, Bowton EA, Denny JC, Wilke RA, Roden DM, Crawford DC. Utilization of an EMR-biorepository to identify the genetic predictors of calcineurin-inhibitor toxicity in heart transplant recipients. Pac Symp Biocomput 2014:253-64. [PMID: 24297552 PMCID: PMC3923429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]

Abstract

Calcineurin-inhibitors (CI) are immunosuppressive agents prescribed to patients after solid organ transplant to prevent rejection. Although these drugs have been transformative for allograft survival, long-term use is complicated by side effects including nephrotoxicity. Given the narrow therapeutic index of CI, therapeutic drug monitoring is used to prevent acute rejection from underdosing and acute toxicity from overdosing, but drug monitoring does not alleviate long-term side effects. Patients on calcineurin-inhibitors for long periods almost universally experience declines in renal function, and a subpopulation of transplant recipients ultimately develop chronic kidney disease that may progress to end stage renal disease attributable to calcineurin inhibitor toxicity (CNIT). Pharmacogenomics has the potential to identify patients who are at high risk for developing advanced chronic kidney disease caused by CNIT and to provide them with existing alternate immunosuppressive therapy. In this study we utilized BioVU, Vanderbilt University Medical Center's DNA biorepository linked to de-identified electronic medical records, to identify a cohort of 115 heart transplant recipients prescribed calcineurin-inhibitors and to identify genetic risk factors for CNIT. We identified 37 cases of nephrotoxicity in our cohort, defining nephrotoxicity as a monthly median estimated glomerular filtration rate (eGFR) <30 mL/min/1.73 m² at least six months post-transplant for at least three consecutive months. All heart transplant patients were genotyped on the Illumina ADME Core Panel, a pharmacogenomic genotyping platform that assays 184 variants across 34 genes. In Cox regression analysis adjusting for age at transplant, pre-transplant chronic kidney disease, pre-transplant diabetes, and the three most significant principal components (PCs), we did not identify any markers that met our multiple-testing threshold.
As a secondary analysis we also modeled post-transplant eGFR directly with linear mixed models adjusted for age at transplant, cyclosporine use, median BMI, and the three most significant principal components. While no SNPs met our threshold for significance, a SNP previously identified in genetic studies of the dosing of tacrolimus, CYP3A5 rs776746, replicated in an adjusted analysis at an uncorrected p-value of 0.02 (coeff (S.E.) = 14.60 (6.41)). While larger independent studies will be required to further validate this finding, this study underscores the EMR's usefulness as a resource for longitudinal pharmacogenetic study designs.

SLCO1B1 genetic variant associated with statin-induced myopathy: a proof-of-concept study using the clinical practice research datalink. Clin Pharmacol Ther 2013;94:695-701. [PMID: 23942138 PMCID: PMC3831180 DOI: 10.1038/clpt.2013.161] [Citation(s) in RCA: 113] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2013] [Accepted: 08/02/2013] [Indexed: 01/14/2023]

Bartlett G, Antoun J, Zgheib NK. Theranostics in primary care: pharmacogenomics tests and beyond. Expert Rev Mol Diagn 2013;12:841-55. [PMID: 23249202 DOI: 10.1586/erm.12.115] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]

Wu Y, Lei J, Wei WQ, Tang B, Denny JC, Rosenbloom ST, Miller RA, Giuse DA, Zheng K, Xu H. Analyzing differences between Chinese and English clinical text: a cross-institution comparison of discharge summaries in two languages. Stud Health Technol Inform 2013;192:662-6. [PMID: 23920639 PMCID: PMC4957806] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Ronquillo JG. How the electronic health record will change the future of health care. Yale J Biol Med 2012;85:379-86. [PMID: 23012585 PMCID: PMC3447201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

Roden DM, Xu H, Denny JC, Wilke RA. Electronic medical records as a tool in clinical pharmacology: opportunities and challenges. Clin Pharmacol Ther 2012;91:1083-86. [PMID: 22534870 DOI: 10.1038/clpt.2012.42] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Samwald M, Coulet A, Huerga I, Powers RL, Luciano JS, Freimuth RR, Whipple F, Pichler E, Prud'hommeaux E, Dumontier M, Marshall MS. Semantically enabling pharmacogenomic data for the realization of personalized medicine. Pharmacogenomics 2012;13:201-12. [PMID: 22256869 DOI: 10.2217/pgs.11.179] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open