Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Luo J, Wu M, Gopukumar D, Zhao Y. Big Data Application in Biomedical Research and Health Care: A Literature Review. Biomed Inform Insights 2016;8:1-10. [PMID: 26843812 PMCID: PMC4720168 DOI: 10.4137/bii.s31559] [Citation(s) in RCA: 153] [Impact Index Per Article: 19.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 08/27/2015] [Revised: 12/06/2015] [Accepted: 12/06/2015] [Indexed: 01/01/2023]

For:	Luo J, Wu M, Gopukumar D, Zhao Y. Big Data Application in Biomedical Research and Health Care: A Literature Review. Biomed Inform Insights 2016;8:1-10. [PMID: 26843812 PMCID: PMC4720168 DOI: 10.4137/bii.s31559] [Citation(s) in RCA: 153] [Impact Index Per Article: 19.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 08/27/2015] [Revised: 12/06/2015] [Accepted: 12/06/2015] [Indexed: 01/01/2023]

Number

Cited by Other Article(s)

Hu J, Zhao C, Shi C, Zhao Z, Ren Z. Speech-based recognition and estimating severity of PTSD using machine learning. J Affect Disord 2024;362:859-868. [PMID: 39009320 DOI: 10.1016/j.jad.2024.07.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Revised: 05/31/2024] [Accepted: 07/11/2024] [Indexed: 07/17/2024]

Razavi M, Ziyadidegan S, Mahmoudzadeh A, Kazeminasab S, Baharlouei E, Janfaza V, Jahromi R, Sasangohar F. Machine Learning, Deep Learning, and Data Preprocessing Techniques for Detecting, Predicting, and Monitoring Stress and Stress-Related Mental Disorders: Scoping Review. JMIR Ment Health 2024;11:e53714. [PMID: 39167782 PMCID: PMC11375388 DOI: 10.2196/53714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Revised: 05/01/2024] [Accepted: 05/17/2024] [Indexed: 08/23/2024] Open

Abstract

BACKGROUND

Mental stress and its consequent mental health disorders (MDs) constitute a significant public health issue. With the advent of machine learning (ML), there is potential to harness computational techniques for better understanding and addressing mental stress and MDs. This comprehensive review seeks to elucidate the current ML methodologies used in this domain to pave the way for enhanced detection, prediction, and analysis of mental stress and its subsequent MDs.

OBJECTIVE

This review aims to investigate the scope of ML methodologies used in the detection, prediction, and analysis of mental stress and its consequent MDs.

METHODS

Using a rigorous scoping review process with PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) guidelines, this investigation delves into the latest ML algorithms, preprocessing techniques, and data types used in the context of stress and stress-related MDs.

RESULTS

A total of 98 peer-reviewed publications were examined for this review. The findings highlight that support vector machine, neural network, and random forest models consistently exhibited superior accuracy and robustness among all ML algorithms examined. Physiological parameters such as heart rate measurements and skin response are prevalently used as stress predictors due to their rich explanatory information concerning stress and stress-related MDs, as well as the relative ease of data acquisition. The application of dimensionality reduction techniques, including mappings, feature selection, filtering, and noise reduction, is frequently observed as a crucial step preceding the training of ML algorithms.

CONCLUSIONS

The synthesis of this review identified significant research gaps and outlines future directions for the field. These encompass areas such as model interpretability, model personalization, the incorporation of naturalistic settings, and real-time processing capabilities for the detection and prediction of stress and stress-related MDs.

Collapse

Ying G, Perez-Lao A, Adrien T, Maraganore D, Marra D, Smith G. TICS-M scores in an oldest-old normative cohort identified by computable phenotype. Clin Neuropsychol 2024:1-12. [PMID: 38997666 DOI: 10.1080/13854046.2024.2374894] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2024] [Accepted: 06/27/2024] [Indexed: 07/14/2024]

Maier A, Hartung M, Abovsky M, Adamowicz K, Bader G, Baier S, Blumenthal D, Chen J, Elkjaer M, Garcia-Hernandez C, Helmy M, Hoffmann M, Jurisica I, Kotlyar M, Lazareva O, Levi H, List M, Lobentanzer S, Loscalzo J, Malod-Dognin N, Manz Q, Matschinske J, Mee M, Oubounyt M, Pastrello C, Pico A, Pillich R, Poschenrieder J, Pratt D, Pržulj N, Sadegh S, Saez-Rodriguez J, Sarkar S, Shaked G, Shamir R, Trummer N, Turhan U, Wang RS, Zolotareva O, Baumbach J. Drugst.One - a plug-and-play solution for online systems medicine and network-based drug repurposing. Nucleic Acids Res 2024;52:W481-W488. [PMID: 38783119 PMCID: PMC11223884 DOI: 10.1093/nar/gkae388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 04/08/2024] [Accepted: 04/29/2024] [Indexed: 05/25/2024] Open

Affiliation(s)

Andreas Maier Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany
Michael Hartung Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany
Mark Abovsky Division of Orthopaedic Surgery, Schroeder Arthritis Institute, Toronto, Canada Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, Toronto, ON M5T 0S8, Canada
Klaudia Adamowicz Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany
Gary D Bader Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada The Donnelly Centre, University of Toronto, Toronto, ON, Canada Department of Computer Science, University of Toronto, Toronto, ON, Canada Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada The Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, ON, Canada
Sylvie Baier Data Science in Systems Biology, TUM School of Life Sciences, Technical University of Munich, Munich, Germany
David B Blumenthal Department Artificial Intelligence in Biomedical Engineering (AIBE), Friedrich-Alexander University Erlangen-Nürnberg (FAU), 91052 Erlangen, Germany
Jing Chen Department of Medicine, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA
Maria L Elkjaer Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany Department of Neurology, Odense University Hospital, Odense, Denmark Institute of Clinical Research, University of Southern Denmark, Odense, Denmark Institute of Molecular Medicine, University of Southern Denmark, Odense, Denmark
Carlos Garcia-Hernandez Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
Mohamed Helmy Vaccine and Infectious Disease Organization (VIDO), University of Saskatchewan, Canada School of Public Health, University of Saskatchewan, Canada Department of Computer Science, University of Saskatchewan, Canada Department of Computer Science, Lakehead University, Canada Department of Computer Science, Idaho State University, USA Bioinformatics Institute (BII), A*STAR, Singapore
Markus Hoffmann Data Science in Systems Biology, TUM School of Life Sciences, Technical University of Munich, Munich, Germany Institute for Advanced Study, Technical University of Munich, Germany National Institute of Diabetes, Digestive, and Kidney Diseases, Bethesda, MD 20892, USA
Igor Jurisica Division of Orthopaedic Surgery, Schroeder Arthritis Institute, Toronto, Canada Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, Toronto, ON M5T 0S8, Canada Departments of Medical Biophysics and Computer Science, University of Toronto, Toronto, Canada Institute of Neuroimmunology, Slovak Academy of Sciences, Bratislava, Slovakia
Max Kotlyar Division of Orthopaedic Surgery, Schroeder Arthritis Institute, Toronto, Canada Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, Toronto, ON M5T 0S8, Canada
Olga Lazareva Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany Junior Clinical Cooperation Unit Multiparametric methods for early detection of prostate cancer, German Cancer Research Center (DKFZ), Heidelberg, Germany European Molecular Biology Laboratory, Genome Biology Unit, 69117 Heidelberg, Germany
Hagai Levi Blavatnik School of Computer Science, Tel-Aviv University, Tel-Aviv, Israel
Markus List Data Science in Systems Biology, TUM School of Life Sciences, Technical University of Munich, Munich, Germany
Sebastian Lobentanzer Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
Joseph Loscalzo Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02115, USA
Noel Malod-Dognin Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
Quirin Manz Data Science in Systems Biology, TUM School of Life Sciences, Technical University of Munich, Munich, Germany
Julian Matschinske Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany Data Science in Systems Biology, TUM School of Life Sciences, Technical University of Munich, Munich, Germany
Miles Mee Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada The Donnelly Centre, University of Toronto, Toronto, ON, Canada Department of Computer Science, University of Toronto, Toronto, ON, Canada
Mhaned Oubounyt Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany
Chiara Pastrello Division of Orthopaedic Surgery, Schroeder Arthritis Institute, Toronto, Canada Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, Toronto, ON M5T 0S8, Canada
Alexander R Pico Institute of Data Science and Biotechnology, Gladstone Institutes, 1650 Owens Street, San Francisco, 94158 California, USA
Rudolf T Pillich Department of Medicine, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA
Julian M Poschenrieder Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany Data Science in Systems Biology, TUM School of Life Sciences, Technical University of Munich, Munich, Germany
Dexter Pratt Department of Medicine, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA
Nataša Pržulj Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain Department of Computer Science, University College London, London WC1E 6BT, UK ICREA, Pg. Lluís Companys 23, 08010 Barcelona, Spain
Sepideh Sadegh Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany Data Science in Systems Biology, TUM School of Life Sciences, Technical University of Munich, Munich, Germany Department of Clinical Genetics, Odense University Hospital, Odense, Denmark Clinical Genome Center, Department of Clinical Research, University of Southern Denmark, Odense, Denmark
Julio Saez-Rodriguez Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
Suryadipto Sarkar Department Artificial Intelligence in Biomedical Engineering (AIBE), Friedrich-Alexander University Erlangen-Nürnberg (FAU), 91052 Erlangen, Germany
Gideon Shaked Blavatnik School of Computer Science, Tel-Aviv University, Tel-Aviv, Israel
Ron Shamir Blavatnik School of Computer Science, Tel-Aviv University, Tel-Aviv, Israel
Nico Trummer Data Science in Systems Biology, TUM School of Life Sciences, Technical University of Munich, Munich, Germany
Ugur Turhan Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany
Rui-Sheng Wang Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02115, USA
Olga Zolotareva Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany Data Science in Systems Biology, TUM School of Life Sciences, Technical University of Munich, Munich, Germany
Jan Baumbach Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany Computational Biomedicine Lab, Department of Mathematics and Computer Science, University of Southern Denmark, Odense, Denmark

Collapse

Farag N, Noë A, Patrinos D, Zawati MH. Mapping the Apps: Ethical and Legal Issues with Crowdsourced Smartphone Data using mHealth Applications. Asian Bioeth Rev 2024;16:437-470. [PMID: 39022376 PMCID: PMC11250705 DOI: 10.1007/s41649-024-00296-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 04/03/2024] [Accepted: 04/14/2024] [Indexed: 07/20/2024] Open

Abstract

More than 5 billion people in the world own a smartphone. More than half of these have been used to collect and process health-related data. As such, the existing volume of potentially exploitable health data is unprecedentedly large and growing rapidly. Mobile health applications (apps) on smartphones are some of the worst offenders and are increasingly being used for gathering and exchanging significant amounts of personal health data from the public. This data is often utilized for health research purposes and for algorithm training. While there are advantages to utilizing this data for expanding health knowledge, there are associated risks for the users of these apps, such as privacy concerns and the protection of their data. Consequently, gaining a deeper comprehension of how apps collect and crowdsource data is crucial. To explore how apps are crowdsourcing data and to identify potential ethical, legal, and social issues (ELSI), we conducted an examination of the Apple App Store and the Google Play Store in North America and Europe to identify apps that could potentially gather health data through crowdsourcing. Subsequently, we analyzed their privacy policies, terms of use, and other related documentation to gain insights into the utilization of users' data and the possibility of repurposing it for research or algorithm training purposes. More specifically, we reviewed privacy policies to identify clauses pertaining to the following key categories: research, data sharing, privacy/confidentiality, commercialization, and return of findings. Based on the results of these app search, we developed an App Atlas that presents apps which crowdsource data for research or algorithm training. We identified 46 apps available in the European and Canadian markets that either openly crowdsource health data for research or algorithm training or retain the legal or technical capability to do so. This app search showed an overall lack of consistency and transparency in privacy policies that poses challenges to user comprehensibility, trust, and informed consent. A significant proportion of applications presented contradictions or exhibited considerable ambiguity. For instance, the vast majority of privacy policies in the App Atlas contain ambiguous or contradictory language regarding the sharing of users' data with third parties. This raises a number of ethico-legal concerns which will require further academic and policy attention to ensure a balance between protecting individual interests and maximizing the scientific utility of crowdsourced data. This article represents a key first step in better understanding these concerns and bringing attention to this important issue.

Supplementary Information

The online version contains supplementary material available at 10.1007/s41649-024-00296-3.

Collapse

Lyu C, Joehanes R, Huan T, Levy D, Li Y, Wang M, Liu X, Liu C, Ma J. Enhancing selection of alcohol consumption-associated genes by random forest. Br J Nutr 2024;131:2058-2067. [PMID: 38606596 PMCID: PMC11216877 DOI: 10.1017/s0007114524000795] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/13/2024]

Rubinic I, Kurtov M, Rubinic I, Likic R, Dargan PI, Wood DM. Artificial intelligence in clinical pharmacology: A case study and scoping review of large language models and bioweapon potential. Br J Clin Pharmacol 2024;90:620-628. [PMID: 37658550 DOI: 10.1111/bcp.15899] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2023] [Revised: 08/23/2023] [Accepted: 08/24/2023] [Indexed: 09/03/2023] Open

Bhuvaneshwar K, Gusev Y. Translational bioinformatics and data science for biomarker discovery in mental health: an analytical review. Brief Bioinform 2024;25:bbae098. [PMID: 38493340 PMCID: PMC10944574 DOI: 10.1093/bib/bbae098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 01/23/2024] [Accepted: 02/18/2024] [Indexed: 03/18/2024] Open

Bernier A, Knoppers BM, Bermudez P, Beauvais MJS, Thorogood A. Open Data governance at the Canadian Open Neuroscience Platform (CONP): From the Walled Garden to the Arboretum. Gigascience 2024;13:giad114. [PMID: 38217404 PMCID: PMC10787360 DOI: 10.1093/gigascience/giad114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Revised: 11/14/2023] [Accepted: 12/10/2023] [Indexed: 01/15/2024] Open

Yang X, Huang K, Yang D, Zhao W, Zhou X. Biomedical Big Data Technologies, Applications, and Challenges for Precision Medicine: A Review. GLOBAL CHALLENGES (HOBOKEN, NJ) 2024;8:2300163. [PMID: 38223896 PMCID: PMC10784210 DOI: 10.1002/gch2.202300163] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/02/2023] [Revised: 09/20/2023] [Indexed: 01/16/2024]

Alizadeh M, Sampaio Moura N, Schledwitz A, Patil SA, Ravel J, Raufman JP. Gastroenterology Fellowship and Postdoctoral Training in Omics and Statistics-Part I: Why Is It Needed? Dig Dis Sci 2024;69:18-21. [PMID: 37919514 PMCID: PMC10878129 DOI: 10.1007/s10620-023-08136-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/13/2023] [Accepted: 09/27/2023] [Indexed: 11/04/2023]

Bergman DR, Norton KA, Jain HV, Jackson T. Connecting Agent-Based Models with High-Dimensional Parameter Spaces to Multidimensional Data Using SMoRe ParS: A Surrogate Modeling Approach. Bull Math Biol 2023;86:11. [PMID: 38159216 PMCID: PMC10757706 DOI: 10.1007/s11538-023-01240-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Accepted: 11/22/2023] [Indexed: 01/03/2024]

Abstract

Across a broad range of disciplines, agent-based models (ABMs) are increasingly utilized for replicating, predicting, and understanding complex systems and their emergent behavior. In the biological and biomedical sciences, researchers employ ABMs to elucidate complex cellular and molecular interactions across multiple scales under varying conditions. Data generated at these multiple scales, however, presents a computational challenge for robust analysis with ABMs. Indeed, calibrating ABMs remains an open topic of research due to their own high-dimensional parameter spaces. In response to these challenges, we extend and validate our novel methodology, Surrogate Modeling for Reconstructing Parameter Surfaces (SMoRe ParS), arriving at a computationally efficient framework for connecting high dimensional ABM parameter spaces with multidimensional data. Specifically, we modify SMoRe ParS to initially confine high dimensional ABM parameter spaces using unidimensional data, namely, single time-course information of in vitro cancer cell growth assays. Subsequently, we broaden the scope of our approach to encompass more complex ABMs and constrain parameter spaces using multidimensional data. We explore this extension with in vitro cancer cell inhibition assays involving the chemotherapeutic agent oxaliplatin. For each scenario, we validate and evaluate the effectiveness of our approach by comparing how well ABM simulations match the experimental data when using SMoRe ParS-inferred parameters versus parameters inferred by a commonly used direct method. In so doing, we show that our approach of using an explicitly formulated surrogate model as an interlocutor between the ABM and the experimental data effectively calibrates the ABM parameter space to multidimensional data. Our method thus provides a robust and scalable strategy for leveraging multidimensional data to inform multiscale ABMs and explore the uncertainty in their parameters.

Collapse

Suarjana IWG, Sudirham, Salam I, Aditama MHR. Artificial intelligence in public health: the potential and ethical considerations of artificial intelligence in public health. J Public Health (Oxf) 2023;45:e834-e835. [PMID: 37477239 DOI: 10.1093/pubmed/fdad116] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Accepted: 06/27/2023] [Indexed: 07/22/2023] Open

Jönsson H, Ahlström H, Kullberg J. Spatial mapping of tumor heterogeneity in whole-body PET-CT: a feasibility study. Biomed Eng Online 2023;22:110. [PMID: 38007471 PMCID: PMC10675915 DOI: 10.1186/s12938-023-01173-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 11/17/2023] [Indexed: 11/27/2023] Open

Abstract

BACKGROUND

Tumor heterogeneity is recognized as a predictor of treatment response and patient outcome. Quantification of tumor heterogeneity across all scales may therefore provide critical insight that ultimately improves cancer management.

METHODS

An image registration-based framework for the study of tumor heterogeneity in whole-body images was evaluated on a dataset of 490 FDG-PET-CT images of lung cancer, lymphoma, and melanoma patients. Voxel-, lesion- and subject-level features were extracted from the subjects' segmented lesion masks and mapped to female and male template spaces for voxel-wise analysis. Resulting lesion feature maps of the three subsets of cancer patients were studied visually and quantitatively. Lesion volumes and lesion distances in subject spaces were compared with resulting properties in template space. The strength of the association between subject and template space for these properties was evaluated with Pearson's correlation coefficient.

RESULTS

Spatial heterogeneity in terms of lesion frequency distribution in the body, metabolic activity, and lesion volume was seen between the three subsets of cancer patients. Lesion feature maps showed anatomical locations with low versus high mean feature value among lesions sampled in space and also highlighted sites with high variation between lesions in each cancer subset. Spatial properties of the lesion masks in subject space correlated strongly with the same properties measured in template space (lesion volume, R = 0.986, p < 0.001; total metabolic volume, R = 0.988, p < 0.001; maximum within-patient lesion distance, R = 0.997, p < 0.001). Lesion volume and total metabolic volume increased on average from subject to template space (lesion volume, 3.1 ± 52 ml; total metabolic volume, 53.9 ± 229 ml). Pair-wise lesion distance decreased on average by 0.1 ± 1.6 cm and maximum within-patient lesion distance increased on average by 0.5 ± 2.1 cm from subject to template space.

CONCLUSIONS

Spatial tumor heterogeneity between subsets of interest in cancer cohorts can successfully be explored in whole-body PET-CT images within the proposed framework. Whole-body studies are, however, especially prone to suffer from regional variation in lesion frequency, and thus statistical power, due to the non-uniform distribution of lesions across a large field of view.

Collapse

Zass L, Johnston K, Benkahla A, Chaouch M, Kumuthini J, Radouani F, Mwita LA, Alsayed N, Allie T, Sathan D, Masamu U, Seuneu Tchamga MS, Tamuhla T, Samtal C, Nembaware V, Gill Z, Ahmed S, Hamdi Y, Fadlelmola F, Tiffin N, Mulder N. Developing Clinical Phenotype Data Collection Standards for Research in Africa. Glob Health Epidemiol Genom 2023;2023:6693323. [PMID: 37766808 PMCID: PMC10522421 DOI: 10.1155/2023/6693323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Revised: 06/30/2023] [Accepted: 07/21/2023] [Indexed: 09/29/2023] Open

Affiliation(s)

Lyndon Zass Computational Biology Division, Department of Integrative Biomedical Sciences, IDM, University of Cape Town, Cape Town, South Africa
Katherine Johnston Computational Biology Division, Department of Integrative Biomedical Sciences, IDM, University of Cape Town, Cape Town, South Africa
Alia Benkahla Laboratory of BioInformatics, BioMathematics and BioStatistics LR16IPT09, Institut Pasteur de Tunis, Tunis, Tunisia
Melek Chaouch Laboratory of BioInformatics, BioMathematics and BioStatistics LR16IPT09, Institut Pasteur de Tunis, Tunis, Tunisia
Judit Kumuthini South African National Bioinformatics Institute (SANBI), Life Sciences Building, University of Western Cape, Bellville, Cape Town, South Africa
Fouzia Radouani Chlamydiae & Mycoplasmas Laboratory Research Department, Institut Pasteur du Maroc, 20360 Casablanca, Morocco
Liberata Alexander Mwita Muhimbili Sickle Cell Program, Department of Hematology and Blood Transfusion, Muhimbili University of Health and Allied Sciences, Dar-es-Salaam, Tanzania
Nihad Alsayed Kush Centre for Genomics & Biomedical Informatics, Biotechnology Perspectives Organization, Khartoum 11111, Sudan
Taryn Allie Computational Biology Division, Department of Integrative Biomedical Sciences, IDM, University of Cape Town, Cape Town, South Africa
Dassen Sathan Software Information Systems Department, FOICDT, University of Mauritius, Reduit, Mauritius
Upendo Masamu Muhimbili Sickle Cell Program, Department of Hematology and Blood Transfusion, Muhimbili University of Health and Allied Sciences, Dar-es-Salaam, Tanzania
Milaine Sergine Seuneu Tchamga Department of Mathematics and Physics, Cape Peninsula University of Technology, Bellville, Cape Town, South Africa
Tsaone Tamuhla Computational Biology Division, Department of Integrative Biomedical Sciences, IDM, University of Cape Town, Cape Town, South Africa
Chaimae Samtal Laboratory of Biotechnology, Environment, Agri-Food and Health, Faculty of Sciences Dhar El Mahraz-Sidi Mohammed Ben Abdellah University, Fez 30000, Morocco
Victoria Nembaware Division of Human Genetics, Department of Pathology, Faculty of Health Sciences, University of Cape Town, Cape Town, South Africa
Zoe Gill Computational Biology Division, Department of Integrative Biomedical Sciences, IDM, University of Cape Town, Cape Town, South Africa Department of Molecular Biology, Johannes Gutenberg University, Mainz, Germany
Samah Ahmed Kush Centre for Genomics & Biomedical Informatics, Biotechnology Perspectives Organization, Khartoum 11111, Sudan
Yosr Hamdi Laboratory of Biomedical Genomics and Oncogenetics, Institut Pasteur de Tunis, University of Tunis El Manar, Tunis, Tunisia Laboratory of Human and Experimental Pathology, Institut Pasteur de Tunis, Tunis, Tunisia
Faisal Fadlelmola Kush Centre for Genomics & Biomedical Informatics, Biotechnology Perspectives Organization, Khartoum 11111, Sudan
Nicki Tiffin Computational Biology Division, Department of Integrative Biomedical Sciences, IDM, University of Cape Town, Cape Town, South Africa South African National Bioinformatics Institute (SANBI), Life Sciences Building, University of Western Cape, Bellville, Cape Town, South Africa Wellcome Centre for Infectious Disease Research in Africa, Institute of Infectious Diseases and Molecular Medicine, Faculty of Cape Town, University of Cape Town, Cape Town, South Africa
Nicola Mulder Computational Biology Division, Department of Integrative Biomedical Sciences, IDM, University of Cape Town, Cape Town, South Africa Wellcome Centre for Infectious Disease Research in Africa, Institute of Infectious Diseases and Molecular Medicine, Faculty of Cape Town, University of Cape Town, Cape Town, South Africa

Collapse

Sinha K, Ghosh N, Sil PC. A Review on the Recent Applications of Deep Learning in Predictive Drug Toxicological Studies. Chem Res Toxicol 2023;36:1174-1205. [PMID: 37561655 DOI: 10.1021/acs.chemrestox.2c00375] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/12/2023]

Abstract

Drug toxicity prediction is an important step in ensuring patient safety during drug design studies. While traditional preclinical studies have historically relied on animal models to evaluate toxicity, recent advances in deep-learning approaches have shown great promise in advancing drug safety science and reducing animal use in preclinical studies. However, deep-learning-based approaches also face challenges in handling large biological data sets, model interpretability, and regulatory acceptance. In this review, we provide an overview of recent developments in deep-learning-based approaches for predicting drug toxicity, highlighting their potential advantages over traditional methods and the need to address their limitations. Deep-learning models have demonstrated excellent performance in predicting toxicity outcomes from various data sources such as chemical structures, genomic data, and high-throughput screening assays. The potential of deep learning for automated feature engineering is also discussed. This review emphasizes the need to address ethical concerns related to the use of deep learning in drug toxicity studies, including the reduction of animal use and ensuring regulatory acceptance. Furthermore, emerging applications of deep learning in drug toxicity prediction, such as predicting drug-drug interactions and toxicity in rare subpopulations, are highlighted. The integration of deep-learning-based approaches with traditional methods is discussed as a way to develop more reliable and efficient predictive models for drug safety assessment, paving the way for safer and more effective drug discovery and development. Overall, this review highlights the critical role of deep learning in predictive toxicology and drug safety evaluation, emphasizing the need for continued research and development in this rapidly evolving field. By addressing the limitations of traditional methods, leveraging the potential of deep learning for automated feature engineering, and addressing ethical concerns, deep-learning-based approaches have the potential to revolutionize drug toxicity prediction and improve patient safety in drug discovery and development.

Collapse

Cunha FF, Blüml V, Zopf LM, Walter A, Wagner M, Weninger WJ, Thomaz LA, Tavora LMN, da Silva Cruz LA, Faria SMM. Lossy Image Compression in a Preclinical Multimodal Imaging Study. J Digit Imaging 2023;36:1826-1850. [PMID: 37038039 PMCID: PMC10406799 DOI: 10.1007/s10278-023-00800-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2022] [Revised: 02/20/2023] [Accepted: 02/21/2023] [Indexed: 04/12/2023] Open

Smith G, Miller A, Marra DE, Wu Y, Bian J, Maraganore DM, Anton S. Evaluation of a Computable Phenotype for Successful Cognitive Aging. Mayo Clin Proc Innov Qual Outcomes 2023;7:212-221. [PMID: 37304063 PMCID: PMC10250575 DOI: 10.1016/j.mayocpiqo.2023.04.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023] Open

Abstract

Objective

To establish, apply, and evaluate a computable phenotype for the recruitment of individuals with successful cognitive aging.

Participants and Methods

Interviews with 10 aging experts identified electronic health record (EHR)-available variables representing successful aging among individuals aged 85 years and older. On the basis of the identified variables, we developed a rule-based computable phenotype algorithm composed of 17 eligibility criteria. Starting September 1, 2019, we applied the computable phenotype algorithm to all living persons aged 85 years and older at the University of Florida Health, which identified 24,024 individuals. This sample was comprised of 13,841 (58%) women, 13,906 (58%) Whites, and 16,557 (69%) non-Hispanics. A priori permission to be contacted for research had been obtained for 11,898 individuals, of whom 470 responded to study announcements and 333 consented to evaluation. Then, we contacted those who consented to evaluate whether their cognitive and functional status clinically met out successful cognitive aging criteria of a modified Telephone Interview for Cognitive Status score of more than 27 and Geriatric Depression Scale of less than 6. The study was completed on December 31, 2022.

Results

Of the 45% of living persons aged 85 years and older included in the University of Florida Health EHR database identified by the computable phenotype as successfully aged, approximately 4% of these responded to study announcements and 333 consented, of which 218 (65%) met successful cognitive aging criteria through direct evaluation.

Conclusion

The study evaluated a computable phenotype algorithm for the recruitment of individuals for a successful aging study using large-scale EHRs. Our study provides proof of concept of using big data and informatics as aids for the recruitment of individuals for prospective cohort studies.

Collapse

Maier A, Hartung M, Abovsky M, Adamowicz K, Bader GD, Baier S, Blumenthal DB, Chen J, Elkjaer ML, Garcia-Hernandez C, Helmy M, Hoffmann M, Jurisica I, Kotlyar M, Lazareva O, Levi H, List M, Lobentanzer S, Loscalzo J, Malod-Dognin N, Manz Q, Matschinske J, Mee M, Oubounyt M, Pico AR, Pillich RT, Poschenrieder JM, Pratt D, Pržulj N, Sadegh S, Saez-Rodriguez J, Sarkar S, Shaked G, Shamir R, Trummer N, Turhan U, Wang R, Zolotareva O, Baumbach J. Drugst.One - A plug-and-play solution for online systems medicine and network-based drug repurposing. ARXIV 2023:arXiv:2305.15453v2. [PMID: 37332567 PMCID: PMC10274948] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 06/20/2023]

Affiliation(s)

Andreas Maier Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany
Michael Hartung Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany
Mark Abovsky Division of Orthopaedic Surgery, Schroeder Arthritis Institute, and Data Science Discovery Centre, Osteoarthritis Research Program, Krembil Research Institute, UHN, Toronto, Canada Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, University Health Network, 60 Leonard Avenue, 5KD-407, Toronto, ON, M5T 0S8, Canada
Klaudia Adamowicz Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany
Gary D Bader Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada The Donnelly Centre, University of Toronto, Toronto, ON, Canada Department of Computer Science, University of Toronto, Toronto, ON, Canada Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada The Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, ON, Canada
Sylvie Baier Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, Munich, Germany
David B Blumenthal Department Artificial Intelligence in Biomedical Engineering (AIBE), Friedrich-Alexander University Erlangen-Nürnberg (FAU), 91052 Erlangen, Germany
Jing Chen Department of Medicine, University of California San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA
Maria L Elkjaer Department of Neurology, Odense University Hospital, Odense, Denmark Institute of Clinical Research, University of Southern Denmark, Odense, Denmark Institute of Molecular Medicine, University of Southern Denmark, Odense, Denmark
Carlos Garcia-Hernandez Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
Mohamed Helmy Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada The Donnelly Centre, University of Toronto, Toronto, ON, Canada Department of Computer Science, University of Toronto, Toronto, ON, Canada
Markus Hoffmann Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, Munich, Germany Institute for Advanced Study (Lichtenbergstrasse 2a, D-85748 Garching, Germany), Technical University of Munich, Germany National Institute of Diabetes, Digestive, and Kidney Diseases, Bethesda, MD 20892, United States of America
Igor Jurisica Division of Orthopaedic Surgery, Schroeder Arthritis Institute, and Data Science Discovery Centre, Osteoarthritis Research Program, Krembil Research Institute, UHN, Toronto, Canada Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, University Health Network, 60 Leonard Avenue, 5KD-407, Toronto, ON, M5T 0S8, Canada Departments of Medical Biophysics and Computer Science, University of Toronto, Toronto, Canada Institute of Neuroimmunology, Slovak Academy of Sciences, Bratislava, Slovakia
Max Kotlyar Division of Orthopaedic Surgery, Schroeder Arthritis Institute, and Data Science Discovery Centre, Osteoarthritis Research Program, Krembil Research Institute, UHN, Toronto, Canada Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, University Health Network, 60 Leonard Avenue, 5KD-407, Toronto, ON, M5T 0S8, Canada
Olga Lazareva Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany Junior Clinical Cooperation Unit Multiparametric methods for early detection of prostate cancer, German Cancer Research Center (DKFZ), Heidelberg, Germany European Molecular Biology Laboratory, Genome Biology Unit, 69117 Heidelberg, Germany
Hagai Levi Blavatnik School of Computer Science, Tel-Aviv University, Tel-Aviv, Israel
Markus List Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, Munich, Germany
Sebastian Lobentanzer Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
Joseph Loscalzo Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA
Noel Malod-Dognin Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
Quirin Manz Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, Munich, Germany
Julian Matschinske Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, Munich, Germany
Miles Mee Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada The Donnelly Centre, University of Toronto, Toronto, ON, Canada Department of Computer Science, University of Toronto, Toronto, ON, Canada
Mhaned Oubounyt Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany
Alexander R Pico Institute of Data Science and Biotechnology, Gladstone Institutes, 1650 Owens Street, San Francisco, 94158, California, USA
Rudolf T Pillich Department of Medicine, University of California San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA
Julian M Poschenrieder Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, Munich, Germany
Dexter Pratt Department of Medicine, University of California San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA
Nataša Pržulj Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain Department of Computer Science, University College London, London WC1E 6BT, UK ICREA, Pg. Lluís Companys 23, 08010 Barcelona, Spain
Sepideh Sadegh Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, Munich, Germany Department of Clinical Genetics, Odense University Hospital, Odense, Denmark
Julio Saez-Rodriguez Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
Suryadipto Sarkar Department Artificial Intelligence in Biomedical Engineering (AIBE), Friedrich-Alexander University Erlangen-Nürnberg (FAU), 91052 Erlangen, Germany
Gideon Shaked Blavatnik School of Computer Science, Tel-Aviv University, Tel-Aviv, Israel
Ron Shamir Blavatnik School of Computer Science, Tel-Aviv University, Tel-Aviv, Israel
Nico Trummer Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, Munich, Germany
Ugur Turhan Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany
Ruisheng Wang Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA
Olga Zolotareva Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, Munich, Germany
Jan Baumbach Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany Computational Biomedicine Lab, Department of Mathematics and Computer Science, University of Southern Denmark, Odense, Denmark

Collapse

Wang DC, Xu WD, Wang SN, Wang X, Leng W, Fu L, Liu XY, Qin Z, Huang AF. Lupus nephritis or not? A simple and clinically friendly machine learning pipeline to help diagnosis of lupus nephritis. Inflamm Res 2023:10.1007/s00011-023-01755-7. [PMID: 37300586 DOI: 10.1007/s00011-023-01755-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Revised: 05/17/2023] [Accepted: 05/30/2023] [Indexed: 06/12/2023] Open

Yuan Q, Zhao WL, Qin B. Big data and variceal rebleeding prediction in cirrhosis patients. Artif Intell Gastroenterol 2023;4:1-9. [DOI: 10.35712/aig.v4.i1.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/08/2023] [Revised: 02/03/2023] [Accepted: 03/10/2023] [Indexed: 06/08/2023] Open

Shi Y, Lin J, Zhu J, Gao J, Liu L, Yin M, Yu C, Liu X, Wang Y, Xu C. Predicting the Recurrence of Common Bile Duct Stones After ERCP Treatment with Automated Machine Learning Algorithms. Dig Dis Sci 2023:10.1007/s10620-023-07949-7. [PMID: 37160541 DOI: 10.1007/s10620-023-07949-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Accepted: 09/26/2022] [Indexed: 05/11/2023]

Abstract

BACKGROUND

Recurrence of common bile duct stones (CBDs) commonly happens after endoscopic retrograde cholangiopancreatography (ERCP). The clinical prediction models for the recurrence of CBDs after ERCP are lacking.

AIMS

We aim to develop high-performance prediction models for the recurrence of CBDS after ERCP treatment using automated machine learning (AutoML) and to assess the AutoML models versus the traditional regression models.

METHODS

473 patients with CBDs undergoing ERCP were recruited in the single-center retrospective cohort study. Samples were divided into Training Set (65%) and Validation Set (35%) randomly. Three modeling approaches, including fully automated machine learning (Fully automated), semi-automated machine learning (Semi-automated), and traditional regression were applied to fit prediction models. Models' discrimination, calibration, and clinical benefits were examined. The Shapley additive explanations (SHAP), partial dependence plot (PDP), and SHAP local explanation (SHAPLE) were proposed for the interpretation of the best model.

RESULTS

The area under roc curve (AUROC) of semi-automated gradient boost machine (GBM) model was 0.749 in Validation Set, better than the other fully/semi-automated models and the traditional regression models (highest AUROC = 0.736). The calibration and clinical application of AutoML models were adequate. Through the SHAP-PDP-SHAPLE pipeline, the roles of key variables of the semi-automated GBM model were visualized. Lastly, the best model was deployed online for clinical practitioners.

CONCLUSION

The GBM model based on semi-AutoML is an optimal model to predict the recurrence of CBDs after ERCP treatment. In comparison with traditional regressions, AutoML algorithms present significant strengths in modeling, which show promise in future clinical practices.

Collapse

Peng M, Southern DA, Ocampo W, Kaufman J, Hogan DB, Conly J, Baylis BW, Stelfox HT, Ho C, Ghali WA. Exploring data reduction strategies in the analysis of continuous pressure imaging technology. BMC Med Res Methodol 2023;23:56. [PMID: 36859239 PMCID: PMC9976437 DOI: 10.1186/s12874-023-01875-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2021] [Accepted: 02/21/2023] [Indexed: 03/03/2023] Open

Abstract

BACKGROUND

Science is becoming increasingly data intensive as digital innovations bring new capacity for continuous data generation and storage. This progress also brings challenges, as many scientific initiatives are challenged by the shear volumes of data produced. Here we present a case study of a data intensive randomized clinical trial assessing the utility of continuous pressure imaging (CPI) for reducing pressure injuries.

OBJECTIVE

To explore an approach to reducing the amount of CPI data required for analyses to a manageable size without loss of critical information using a nested subset of pressure data.

METHODS

Data from four enrolled study participants excluded from the analytical phase of the study were used to develop an approach to data reduction. A two-step data strategy was used. First, raw data were sampled at different frequencies (5, 30, 60, 120, and 240 s) to identify optimal measurement frequency. Second, similarity between adjacent frames was evaluated using correlation coefficients to identify position changes of enrolled study participants. Data strategy performance was evaluated through visual inspection using heat maps and time series plots.

RESULTS

A sampling frequency of every 60 s provided reasonable representation of changes in interface pressure over time. This approach translated to using only 1.7% of the collected data in analyses. In the second step it was found that 160 frames within 24 h represented the pressure states of study participants. In total, only 480 frames from the 72 h of collected data would be needed for analyses without loss of information. Only ~ 0.2% of the raw data collected would be required for assessment of the primary trial outcome.

CONCLUSIONS

Data reduction is an important component of big data analytics. Our two-step strategy markedly reduced the amount of data required for analyses without loss of information. This data reduction strategy, if validated, could be used in other CPI and other settings where large amounts of both temporal and spatial data must be analysed.

Collapse

Affiliation(s)

Mingkai Peng Libin Cardiovascular Institute of Alberta, University of Calgary, Calgary, AB, Canada
Danielle A Southern O'Brien Institute for Public Health, University of Calgary, Calgary, AB, Canada
Wrechelle Ocampo W21C Research and Innovation Centre, Cumming School of Medicine, GD01 Teaching Research & Wellness Building, University of Calgary, 3280 Hospital Drive, Calgary, NW, Canada
Jaime Kaufman W21C Research and Innovation Centre, Cumming School of Medicine, GD01 Teaching Research & Wellness Building, University of Calgary, 3280 Hospital Drive, Calgary, NW, Canada
David B Hogan O'Brien Institute for Public Health, University of Calgary, Calgary, AB, Canada.,W21C Research and Innovation Centre, Cumming School of Medicine, GD01 Teaching Research & Wellness Building, University of Calgary, 3280 Hospital Drive, Calgary, NW, Canada.,Department of Medicine, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.,Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
John Conly O'Brien Institute for Public Health, University of Calgary, Calgary, AB, Canada.,W21C Research and Innovation Centre, Cumming School of Medicine, GD01 Teaching Research & Wellness Building, University of Calgary, 3280 Hospital Drive, Calgary, NW, Canada.,Department of Medicine, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.,Infection Prevention and Control, Alberta Health Services, Calgary, AB, Canada.,Snyder Institute for Chronic Diseases, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.,Foothills Medical Centre, Special Services Building, Ground Floor, AGW5, Calgary, AB, T2N 2T9, Canada
Barry W Baylis O'Brien Institute for Public Health, University of Calgary, Calgary, AB, Canada.,W21C Research and Innovation Centre, Cumming School of Medicine, GD01 Teaching Research & Wellness Building, University of Calgary, 3280 Hospital Drive, Calgary, NW, Canada.,Department of Medicine, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.,Foothills Medical Centre, Special Services Building, Ground Floor, AGW5, Calgary, AB, T2N 2T9, Canada
Henry T Stelfox O'Brien Institute for Public Health, University of Calgary, Calgary, AB, Canada.,Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.,Department of Critical Care Medicine, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.,Alberta Health Services, Alberta, Canada
Chester Ho Department of Medicine, Division of Physical Medicine & Rehabilitation, University of Alberta, Edmonton, AB, Canada
William A Ghali O'Brien Institute for Public Health, University of Calgary, Calgary, AB, Canada. .,W21C Research and Innovation Centre, Cumming School of Medicine, GD01 Teaching Research & Wellness Building, University of Calgary, 3280 Hospital Drive, Calgary, NW, Canada. .,Department of Medicine, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada. .,Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada. .,Division of General Internal Medicine, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.

Collapse

Rajput D, Wang WJ, Chen CC. Evaluation of a decided sample size in machine learning applications. BMC Bioinformatics 2023;24:48. [PMID: 36788550 PMCID: PMC9926644 DOI: 10.1186/s12859-023-05156-9] [Citation(s) in RCA: 51] [Impact Index Per Article: 51.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Accepted: 01/23/2023] [Indexed: 02/16/2023] Open

Abstract

BACKGROUND

An appropriate sample size is essential for obtaining a precise and reliable outcome of a study. In machine learning (ML), studies with inadequate samples suffer from overfitting of data and have a lower probability of producing true effects, while the increment in sample size increases the accuracy of prediction but may not cause a significant change after a certain sample size. Existing statistical approaches using standardized mean difference, effect size, and statistical power for determining sample size are potentially biased due to miscalculations or lack of experimental details. This study aims to design criteria for evaluating sample size in ML studies. We examined the average and grand effect sizes and the performance of five ML methods using simulated datasets and three real datasets to derive the criteria for sample size. We systematically increase the sample size, starting from 16, by randomly sampling and examine the impact of sample size on classifiers' performance and both effect sizes. Tenfold cross-validation was used to quantify the accuracy.

RESULTS

The results demonstrate that the effect sizes and the classification accuracies increase while the variances in effect sizes shrink with the increment of samples when the datasets have a good discriminative power between two classes. By contrast, indeterminate datasets had poor effect sizes and classification accuracies, which did not improve by increasing sample size in both simulated and real datasets. A good dataset exhibited a significant difference in average and grand effect sizes. We derived two criteria based on the above findings to assess a decided sample size by combining the effect size and the ML accuracy. The sample size is considered suitable when it has appropriate effect sizes (≥ 0.5) and ML accuracy (≥ 80%). After an appropriate sample size, the increment in samples will not benefit as it will not significantly change the effect size and accuracy, thereby resulting in a good cost-benefit ratio.

CONCLUSION

We believe that these practical criteria can be used as a reference for both the authors and editors to evaluate whether the selected sample size is adequate for a study.

Collapse

Pham TD, Ravi V, Fan C, Luo B, Sun XF. Tensor Decomposition of Largest Convolutional Eigenvalues Reveals Pathologic Predictive Power of RhoB in Rectal Cancer Biopsy. THE AMERICAN JOURNAL OF PATHOLOGY 2023;193:579-590. [PMID: 36740183 DOI: 10.1016/j.ajpath.2023.01.007] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 12/29/2022] [Accepted: 01/06/2023] [Indexed: 02/05/2023]

Moradi H, Al-Hourani A, Concilia G, Khoshmanesh F, Nezami FR, Needham S, Baratchi S, Khoshmanesh K. Recent developments in modeling, imaging, and monitoring of cardiovascular diseases using machine learning. Biophys Rev 2023;15:19-33. [PMID: 36909958 PMCID: PMC9995635 DOI: 10.1007/s12551-022-01040-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Accepted: 12/21/2022] [Indexed: 01/12/2023] Open

Data harnessing to nurture the human mind for a tailored approach to the child. Pediatr Res 2023;93:357-365. [PMID: 36180585 DOI: 10.1038/s41390-022-02320-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 07/06/2022] [Accepted: 09/12/2022] [Indexed: 11/08/2022]

Improving child health through Big Data and data science. Pediatr Res 2023;93:342-349. [PMID: 35974162 PMCID: PMC9380977 DOI: 10.1038/s41390-022-02264-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Revised: 06/10/2022] [Accepted: 06/28/2022] [Indexed: 12/04/2022]

Abstract

Child health is defined by a complex, dynamic network of genetic, cultural, nutritional, infectious, and environmental determinants at distinct, developmentally determined epochs from preconception to adolescence. This network shapes the future of children, susceptibilities to adult diseases, and individual child health outcomes. Evolution selects characteristics during fetal life, infancy, childhood, and adolescence that adapt to predictable and unpredictable exposures/stresses by creating alternative developmental phenotype trajectories. While child health has improved in the United States and globally over the past 30 years, continued improvement requires access to data that fully represent the complexity of these interactions and to new analytic methods. Big Data and innovative data science methods provide tools to integrate multiple data dimensions for description of best clinical, predictive, and preventive practices, for reducing racial disparities in child health outcomes, for inclusion of patient and family input in medical assessments, and for defining individual disease risk, mechanisms, and therapies. However, leveraging these resources will require new strategies that intentionally address institutional, ethical, regulatory, cultural, technical, and systemic barriers as well as developing partnerships with children and families from diverse backgrounds that acknowledge historical sources of mistrust. We highlight existing pediatric Big Data initiatives and identify areas of future research. IMPACT: Big Data and data science can improve child health. This review highlights the importance for child health of child-specific and life course-based Big Data and data science strategies. This review provides recommendations for future pediatric-specific Big Data and data science research.

Collapse

Taipalus T, Isomöttönen V, Erkkilä H, Äyrämö S. Data Analytics in Healthcare: A Tertiary Study. SN COMPUTER SCIENCE 2022;4:87. [PMID: 36532635 PMCID: PMC9734338 DOI: 10.1007/s42979-022-01507-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 11/14/2022] [Indexed: 12/13/2022]

Prediction of COVID-19 diagnosis based on openEHR artefacts. Sci Rep 2022;12:12549. [PMID: 35869091 PMCID: PMC9306245 DOI: 10.1038/s41598-022-15968-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Accepted: 07/01/2022] [Indexed: 11/08/2022] Open

Li G, Togo R, Ogawa T, Haseyama M. Compressed gastric image generation based on soft-label dataset distillation for medical data sharing. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2022;227:107189. [PMID: 36323177 DOI: 10.1016/j.cmpb.2022.107189] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Revised: 07/07/2022] [Accepted: 10/17/2022] [Indexed: 06/16/2023]

Diakou I, Papakonstantinou E, Papageorgiou L, Pierouli K, Dragoumani K, Spandidos DA, Bacopoulou F, Chrousos GP, Goulielmos GΝ, Eliopoulos E, Vlachakis D. Multiple sclerosis and computational biology (Review). Biomed Rep 2022;17:96. [PMID: 36382258 PMCID: PMC9634047 DOI: 10.3892/br.2022.1579] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2022] [Accepted: 09/27/2022] [Indexed: 12/02/2022] Open

Affiliation(s)

Io Diakou Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
Eleni Papakonstantinou Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
Louis Papageorgiou Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
Katerina Pierouli Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
Konstantina Dragoumani Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
Demetrios A. Spandidos Laboratory of Clinical Virology, School of Medicine, University of Crete, 71003 Heraklion, Greece
Flora Bacopoulou University Research Institute of Maternal and Child Health and Precision Medicine, and UNESCO Chair on Adolescent Health Care, National and Kapodistrian University of Athens, ‘Aghia Sophia’ Children's Hospital, 11527 Athens, Greece
George P. Chrousos University Research Institute of Maternal and Child Health and Precision Medicine, and UNESCO Chair on Adolescent Health Care, National and Kapodistrian University of Athens, ‘Aghia Sophia’ Children's Hospital, 11527 Athens, Greece
Georges Ν. Goulielmos Section of Molecular Pathology and Human Genetics, Department of Internal Medicine, School of Medicine, University of Crete, 71003 Heraklion, Greece
Elias Eliopoulos Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
Dimitrios Vlachakis Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, 11855 Athens, Greece University Research Institute of Maternal and Child Health and Precision Medicine, and UNESCO Chair on Adolescent Health Care, National and Kapodistrian University of Athens, ‘Aghia Sophia’ Children's Hospital, 11527 Athens, Greece Division of Endocrinology and Metabolism, Center of Clinical, Experimental Surgery and Translational Research, Biomedical Research Foundation of The Academy of Athens, 11527 Athens, Greece

Collapse

Kundu A, Fu R, Grace D, Logie CH, Abramovich A, Baskerville B, Yager C, Schwartz R, Mitsakakis N, Planinac L, Chaiton M. Correlates of wanting to seek help for mental health and substance use concerns by sexual and gender minority young adults during the COVID-19 pandemic: A machine learning analysis. PLoS One 2022;17:e0277438. [PMID: 36383536 PMCID: PMC9668172 DOI: 10.1371/journal.pone.0277438] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 10/26/2022] [Indexed: 11/17/2022] Open

Affiliation(s)

Anasua Kundu Institute of Medical Science, University of Toronto, Toronto, Canada Centre for Addiction and Mental Health, Toronto, Canada Ontario Tobacco Research Unit, University of Toronto, Toronto, Canada
Rui Fu Department of Otolaryngology—Head and Neck Surgery, Sunnybrook Research Institute, University of Toronto, Toronto, Canada Dalla Lana School of Public Health, University of Toronto, Toronto, Canada
Daniel Grace Dalla Lana School of Public Health, University of Toronto, Toronto, Canada
Carmen H. Logie Factor-Inwentash Faculty of Social Work, University of Toronto, Toronto, Canada United Nations University Institute for Water, Environment & Health, Hamilton, Canada
Alex Abramovich Centre for Addiction and Mental Health, Toronto, Canada Dalla Lana School of Public Health, University of Toronto, Toronto, Canada Department of Psychiatry, University of Toronto, Toronto, Canada
Bruce Baskerville Canadian Institutes of Health Research, Ottawa, Canada School of Pharmacy, Faculty of Science, University of Waterloo, Kitchener, Canada
Christina Yager Centre for Addiction and Mental Health, Toronto, Canada
Robert Schwartz Centre for Addiction and Mental Health, Toronto, Canada Ontario Tobacco Research Unit, University of Toronto, Toronto, Canada Dalla Lana School of Public Health, University of Toronto, Toronto, Canada
Nicholas Mitsakakis Dalla Lana School of Public Health, University of Toronto, Toronto, Canada Children’s Hospital of Eastern Ontario Research Institute, Ottawa, Canada
Lynn Planinac Ontario Tobacco Research Unit, University of Toronto, Toronto, Canada
Michael Chaiton Institute of Medical Science, University of Toronto, Toronto, Canada Centre for Addiction and Mental Health, Toronto, Canada Ontario Tobacco Research Unit, University of Toronto, Toronto, Canada Dalla Lana School of Public Health, University of Toronto, Toronto, Canada

Collapse

Datta A, Nicolaï B, Vitrac O, Verboven P, Erdogdu F, Marra F, Sarghini F, Koh C. Computer-aided food engineering. NATURE FOOD 2022;3:894-904. [PMID: 37118206 DOI: 10.1038/s43016-022-00617-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/18/2021] [Accepted: 09/09/2022] [Indexed: 04/30/2023]

Jeong JC, Hands I, Kolesar JM, Rao M, Davis B, Dobyns Y, Hurt-Mueller J, Levens J, Gregory J, Williams J, Witt L, Kim EM, Burton C, Elbiheary AA, Chang M, Durbin EB. Local data commons: the sleeping beauty in the community of data commons. BMC Bioinformatics 2022;23:386. [PMID: 36151511 PMCID: PMC9502580 DOI: 10.1186/s12859-022-04922-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2022] [Accepted: 09/12/2022] [Indexed: 12/03/2022] Open

Abstract

Background

Public Data Commons (PDC) have been highlighted in the scientific literature for their capacity to collect and harmonize big data. On the other hand, local data commons (LDC), located within an institution or organization, have been underrepresented in the scientific literature, even though they are a critical part of research infrastructure. Being closest to the sources of data, LDCs provide the ability to collect and maintain the most up-to-date, high-quality data within an organization, closest to the sources of the data. As a data provider, LDCs have many challenges in both collecting and standardizing data, moreover, as a consumer of PDC, they face problems of data harmonization stemming from the monolithic harmonization pipeline designs commonly adapted by many PDCs. Unfortunately, existing guidelines and resources for building and maintaining data commons exclusively focus on PDC and provide very little information on LDC.

Results

This article focuses on four important observations. First, there are three different types of LDC service models that are defined based on their roles and requirements. These can be used as guidelines for building new LDC or enhancing the services of existing LDC. Second, the seven core services of LDC are discussed, including cohort identification and facilitation of genomic sequencing, the management of molecular reports and associated infrastructure, quality control, data harmonization, data integration, data sharing, and data access control. Third, instead of commonly developed monolithic systems, we propose a new data sharing method for data harmonization that combines both divide-and-conquer and bottom-up approaches. Finally, an end-to-end LDC implementation is introduced with real-world examples.

Conclusions

Although LDCs are an optimal place to identify and address data quality issues, they have traditionally been relegated to the role of passive data provider for much larger PDC. Indeed, many LDCs limit their functions to only conducting routine data storage and transmission tasks due to a lack of information on how to design, develop, and improve their services using limited resources. We hope that this work will be the first small step in raising awareness among the LDCs of their expanded utility and to publicize to a wider audience the importance of LDC.

Collapse

Affiliation(s)

Jong Cheol Jeong Division of Biomedical Informatics, College of Medicine, University of Kentucky, Lexington, KY, USA. .,Cancer Research Informatics Shared Resource Facility, Markey Cancer Center, Lexington, KY, USA.
Isaac Hands Cancer Research Informatics Shared Resource Facility, Markey Cancer Center, Lexington, KY, USA.,Kentucky Cancer Registry, Lexington, KY, USA
Jill M Kolesar Department of Pharmacy Practice and Science, College of Pharmacy, University of Kentucky, Lexington, KY, USA
Mahadev Rao Department of Pharmacy Practice, Center for Translational Research, Manipal College of Pharmaceutical Sciences, Manipal Academy of Higher Education, Manipal, Karnataka, India
Bront Davis Cancer Research Informatics Shared Resource Facility, Markey Cancer Center, Lexington, KY, USA.,Kentucky Cancer Registry, Lexington, KY, USA
York Dobyns Cancer Research Informatics Shared Resource Facility, Markey Cancer Center, Lexington, KY, USA.,Kentucky Cancer Registry, Lexington, KY, USA
Joseph Hurt-Mueller Cancer Research Informatics Shared Resource Facility, Markey Cancer Center, Lexington, KY, USA.,Kentucky Cancer Registry, Lexington, KY, USA
Justin Levens Cancer Research Informatics Shared Resource Facility, Markey Cancer Center, Lexington, KY, USA.,Kentucky Cancer Registry, Lexington, KY, USA
Jenny Gregory Cancer Research Informatics Shared Resource Facility, Markey Cancer Center, Lexington, KY, USA.,Kentucky Cancer Registry, Lexington, KY, USA
John Williams Cancer Research Informatics Shared Resource Facility, Markey Cancer Center, Lexington, KY, USA.,Kentucky Cancer Registry, Lexington, KY, USA
Lisa Witt Cancer Research Informatics Shared Resource Facility, Markey Cancer Center, Lexington, KY, USA.,Kentucky Cancer Registry, Lexington, KY, USA
Eun Mi Kim Department of Computer Science, Eastern Kentucky University, Richmond, KY, USA
Carlee Burton Cancer Research Informatics Shared Resource Facility, Markey Cancer Center, Lexington, KY, USA
Amir A Elbiheary Cancer Research Informatics Shared Resource Facility, Markey Cancer Center, Lexington, KY, USA
Mingguang Chang Cancer Research Informatics Shared Resource Facility, Markey Cancer Center, Lexington, KY, USA
Eric B Durbin Division of Biomedical Informatics, College of Medicine, University of Kentucky, Lexington, KY, USA. .,Cancer Research Informatics Shared Resource Facility, Markey Cancer Center, Lexington, KY, USA. .,Kentucky Cancer Registry, Lexington, KY, USA.

Collapse

A Deep Learning Model Incorporating Knowledge Representation Vectors and Its Application in Diabetes Prediction. DISEASE MARKERS 2022;2022:7593750. [PMID: 35990251 PMCID: PMC9391170 DOI: 10.1155/2022/7593750] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 07/24/2022] [Accepted: 07/30/2022] [Indexed: 01/09/2023]

Interactive exploration of a global clinical network from a large breast cancer cohort. NPJ Digit Med 2022;5:113. [PMID: 35948579 PMCID: PMC9365762 DOI: 10.1038/s41746-022-00647-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Accepted: 06/27/2022] [Indexed: 11/08/2022] Open

Kim T, Choi H, Lee SM. Parametric and non-parametric estimation of reference intervals for routine laboratory tests: an analysis of health check-up data for 260 889 young men in the South Korean military. BMJ Open 2022;12:e062617. [PMID: 35879016 PMCID: PMC9328105 DOI: 10.1136/bmjopen-2022-062617] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Accepted: 07/04/2022] [Indexed: 11/03/2022] Open

Abstract

OBJECTIVES

Determination of reference intervals (RIs) using big data faces several obstacles due to heterogeneity in analysers, period and ethnicity. The present study aimed to establish the RIs for routine common blood count (CBC) and biochemistry laboratory tests in homogeneous, healthy, male Korean soldiers in their 20s using a large health check-up data set, comparing parametric and non-parametric estimation.

DESIGN

A multicentre, cross-sectional study.

SETTING

Seven armed forces hospitals in South Korea.

PARTICIPANTS

A total of 609 649 men underwent health examination when promoted to corporal between January 2015 and September 2021. 260 889 eligible individuals aged 20-25 were included in the analysis.

MAIN OUTCOMES AND MEASURES

The RIs were established by parametric and non-parametric methods. In the parametric approach, maximum likelihood estimation was applied to measure the Box-Cox transformation parameter and the values at the 2.5th and 97.5th percentiles were recalculated. The non-parametric approach adopted the Tukey's exclusion test and the values at the 2.5th and 97.5th percentiles were obtained. Classification by body mass index was also performed.

RESULTS

The obtained RIs for haematology parameters were comparable between devices. If the values followed a Gaussian distribution, parametric and non-parametric methods were well matched for haematology and biochemical markers. When the values were right-skewed, the upper limits were higher with parametric than with non-parametric methods. Participants with obesity showed higher RIs for CBC, some liver function tests and some lipid profiles than participants without obesity.

CONCLUSIONS

Using data from healthy, male Korean soldiers in their 20s, we proposed the RIs for CBC and biochemical parameters, comparing parametric and non-parametric estimation. As such approaches based on large data sets become more prevalent, further studies are needed to discriminate eligible individuals and determine RIs in an extrapolated sample.

Collapse

Comparing Worldwide, National, and Independent Notifications about Adverse Drug Reactions Due to COVID-19 Vaccines. INFORMATION 2022. [DOI: 10.3390/info13070329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open

Grosman L, Muller A, Dag I, Goldgeier H, Harush O, Herzlinger G, Nebenhaus K, Valetta F, Yashuv T, Dick N. Artifact3-D: New software for accurate, objective and efficient 3D analysis and documentation of archaeological artifacts. PLoS One 2022;17:e0268401. [PMID: 35709137 PMCID: PMC9202890 DOI: 10.1371/journal.pone.0268401] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Accepted: 04/20/2022] [Indexed: 11/19/2022] Open

Go S, Wang Q, Wang B, Jiang Y, Bajalovic N, Loke DK. Continual Learning Electrical Conduction in Resistive‐Switching‐Memory Materials. ADVANCED THEORY AND SIMULATIONS 2022. [DOI: 10.1002/adts.202200226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Zenker S, Strech D, Ihrig K, Jahns R, Müller G, Schickhardt C, Schmidt G, Speer R, Winkler E, von Kielmansegg SG, Drepper J. Data protection-compliant broad consent for secondary use of health care data and human biosamples for (bio)medical research: Towards a new German national standard. J Biomed Inform 2022;131:104096. [PMID: 35643273 DOI: 10.1016/j.jbi.2022.104096] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2021] [Revised: 04/05/2022] [Accepted: 05/20/2022] [Indexed: 01/10/2023]

Abstract

BACKGROUND

The secondary use of deidentified but not anonymized patient data is a promising approach for enabling precision medicine and learning health care systems. In most national jurisdictions (e.g., in Europe), this type of secondary use requires patient consent. While various ethical, legal, and technical analyses have stressed the opportunities and challenges for different types of consent over the past decade, no country has yet established a national consent standard accepted by the relevant authorities.

METHODS

A working group of the national Medical Informatics Initiative in Germany conducted a requirements analysis and developed a GDPR-compliant broad consent standard. The development included consensus procedures within the Medical Informatics Initiative, a documented consultation process with all relevant stakeholder groups and authorities, and the ultimate submission for approval via the national data protection authorities.

RESULTS

This paper presents the broad consent text together with a guidance document on mandatory safeguards for broad consent implementation. The mandatory safeguards comprise i) independent review of individual research projects, ii) organizational measures to protect patients from involuntary disclosure of protected information, and iii) comprehensive information for patients and public transparency. This paper further describes the key issues discussed with the relevant authorities, especially the position on additional or alternative consent approaches such as dynamic consent.

DISCUSSION

Both the resulting broad consent text and the national consensus process are relevant for similar activities internationally. A key challenge of aligning consent documents with the various stakeholders was explaining and justifying the decision to use broad consent and the decision against using alternative models such as dynamic consent. Public transparency for all secondary use projects and their results emerged as a key factor in this justification. While currently largely limited to academic medicine in Germany, the first steps for extending this broad consent approach to wider areas of application, including smaller institutions and medical practices, are currently under consideration.

Collapse

Affiliation(s)

Sven Zenker Staff Unit for Scientific & Medical Technology Development & Coordination (MWTek), Commercial Directorate, Institute for Medical Biometry, Informatics & Epidemiology, Department of Anesthesiology and Intensive Care Medicine, University Hospital Bonn, Venusbergcampus 1, 53127 Bonn, Germany.
Daniel Strech QUEST Center, Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Charitéplatz 1, 10117 Berlin, Germany
Kristina Ihrig Department of Medicine, Hematology/Oncology, Goethe University, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany; German Cancer Consortium (DKTK), Partner Site Frankfurt/Mainz, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 280, 69120 Heidelberg, Germany
Roland Jahns Interdisciplinary Bank of Biomaterials and Data Würzburg (ibdw), University and University Hospital of Würzburg, Building A8/A9, Straubmühlweg 2a, 97078 Würzburg, Germany
Gabriele Müller Center for Evidence-Based Healthcare, University Hospital Carl Gustav Carus and Carl Gustav Carus Faculty of Medicine, Technische Universität Dresden, Fetscherstr. 74, 01307 Dresden, Germany
Christoph Schickhardt Section of Translational Medical Ethics, National Center for Tumor Diseases, German Cancer Research Center, Im Neuenheimer Feld 460, 69120 Heidelberg, Germany
Georg Schmidt Department of Internal Medicine 1, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany, German Centre for Cardiovascular Research partner site Munich Heart Alliance, Munich, Germany
Ronald Speer LIFE - Leipzig Research Center for Civilization Diseases, Medical Faculty, Leipzig University, Philipp-Rosenthal-Straße 27, 04103 Leipzig, Germany
Eva Winkler Section for Translational Medical Ethics, Dept Medical Oncology, National Center for Tumor Diseases, Heidelberg University Hospital, INF 460, 69121 Heidelberg
Sebastian Graf von Kielmansegg Chair of Public Law and Medical Law, Kiel University, Leibnizstraße 2, 24118 Kiel, Germany
Johannes Drepper TMF - Technology, Methods, and Infrastructure for Networked Medical Research, Charlottenstrasse 42, 10117 Berlin, Germany

Collapse

Zadeh FA, Ardalani MV, Salehi AR, Jalali Farahani R, Hashemi M, Mohammed AH. An Analysis of New Feature Extraction Methods Based on Machine Learning Methods for Classification Radiological Images. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022;2022:3035426. [PMID: 35634075 PMCID: PMC9131703 DOI: 10.1155/2022/3035426] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/01/2022] [Revised: 02/02/2022] [Accepted: 03/08/2022] [Indexed: 12/02/2022]

Wan S, Zhao X, Niu Z, Dong L, Wu Y, Gu S, Feng Y, Hua X. Influence of ambient air pollution on successful pregnancy with frozen embryo transfer: A machine learning prediction model. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2022;236:113444. [PMID: 35367879 DOI: 10.1016/j.ecoenv.2022.113444] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Revised: 03/18/2022] [Accepted: 03/19/2022] [Indexed: 06/14/2023]

Development of Elderly Life Quality Database in Thailand with a Correlation Feature Analysis. SUSTAINABILITY 2022. [DOI: 10.3390/su14084468] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/04/2022]

Armenta-Medina D, Brambila-Tapia AJL, Miranda-Jiménez S, Rodea-Montero ER. A Web Application for Biomedical Text Mining of Scientific Literature Associated with Coronavirus-Related Syndromes: Coronavirus Finder. Diagnostics (Basel) 2022;12:887. [PMID: 35453935 PMCID: PMC9028729 DOI: 10.3390/diagnostics12040887] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Revised: 02/10/2022] [Accepted: 02/11/2022] [Indexed: 12/10/2022] Open

Shi X, Nikolic G, Fischaber S, Black M, Rankin D, Epelde G, Beristain A, Alvarez R, Arrue M, Pita Costa J, Grobelnik M, Stopar L, Pajula J, Umer A, Poliwoda P, Wallace J, Carlin P, Pääkkönen J, De Moor B. System Architecture of a European Platform for Health Policy Decision Making: MIDAS. Front Public Health 2022;10:838438. [PMID: 35433572 PMCID: PMC9008448 DOI: 10.3389/fpubh.2022.838438] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Accepted: 01/13/2022] [Indexed: 12/01/2022] Open

Abstract

Background

Healthcare data is a rich yet underutilized resource due to its disconnected, heterogeneous nature. A means of connecting healthcare data and integrating it with additional open and social data in a secure way can support the monumental challenge policy-makers face in safely accessing all relevant data to assist in managing the health and wellbeing of all. The goal of this study was to develop a novel health data platform within the MIDAS (Meaningful Integration of Data Analytics and Services) project, that harnesses the potential of latent healthcare data in combination with open and social data to support evidence-based health policy decision-making in a privacy-preserving manner.

Methods

The MIDAS platform was developed in an iterative and collaborative way with close involvement of academia, industry, healthcare staff and policy-makers, to solve tasks including data storage, data harmonization, data analytics and visualizations, and open and social data analytics. The platform has been piloted and tested by health departments in four European countries, each focusing on different region-specific health challenges and related data sources.

Results

A novel health data platform solving the needs of Public Health decision-makers was successfully implemented within the four pilot regions connecting heterogeneous healthcare datasets and open datasets and turning large amounts of previously isolated data into actionable information allowing for evidence-based health policy-making and risk stratification through the application and visualization of advanced analytics.

Conclusions

The MIDAS platform delivers a secure, effective and integrated solution to deal with health data, providing support for health policy decision-making, planning of public health activities and the implementation of the Health in All Policies approach. The platform has proven transferable, sustainable and scalable across policies, data and regions.

Collapse

Affiliation(s)

Xi Shi Department of Electrical Engineering (ESAT), Stadius Center for Dynamical Systems, Signal Processing and Data Analytics, KU Leuven, Leuven, Belgium Vlerick Business School, Leuven, Belgium *Correspondence: Xi Shi
Gorana Nikolic Department of Electrical Engineering (ESAT), Stadius Center for Dynamical Systems, Signal Processing and Data Analytics, KU Leuven, Leuven, Belgium
Scott Fischaber Analytics Engines, Belfast, United Kingdom
Michaela Black School of Computing, Engineering and Intelligent Systems, Ulster University, Londonderry, United Kingdom
Debbie Rankin School of Computing, Engineering and Intelligent Systems, Ulster University, Londonderry, United Kingdom
Gorka Epelde Vicomtech Foundation, Basque Research and Technology Alliance (BRTA), Donostia-San Sebastián, Spain EHealth Group, Biodonostia Health Research Institute, Donostia-San Sebastián, Spain
Andoni Beristain Vicomtech Foundation, Basque Research and Technology Alliance (BRTA), Donostia-San Sebastián, Spain EHealth Group, Biodonostia Health Research Institute, Donostia-San Sebastián, Spain
Roberto Alvarez Vicomtech Foundation, Basque Research and Technology Alliance (BRTA), Donostia-San Sebastián, Spain EHealth Group, Biodonostia Health Research Institute, Donostia-San Sebastián, Spain
Monica Arrue Vicomtech Foundation, Basque Research and Technology Alliance (BRTA), Donostia-San Sebastián, Spain EHealth Group, Biodonostia Health Research Institute, Donostia-San Sebastián, Spain
Joao Pita Costa Quintelligence, Ljubljana, Slovenia AI Lab, Institute Jozef Stefan, Ljubljana, Slovenia
Marko Grobelnik Quintelligence, Ljubljana, Slovenia AI Lab, Institute Jozef Stefan, Ljubljana, Slovenia
Luka Stopar Quintelligence, Ljubljana, Slovenia AI Lab, Institute Jozef Stefan, Ljubljana, Slovenia
Juha Pajula Data-Driven Solutions, Smart Health, VTT Technical Research Centre of Finland, Tampere, Finland
Adil Umer Data-Driven Solutions, Smart Health, VTT Technical Research Centre of Finland, Tampere, Finland
Peter Poliwoda IBM Ireland Lab, Innovation Exchange, International Business Machines Corporation, Dublin, Ireland
Jonathan Wallace School of Computing, Ulster University, Jordanstown, United Kingdom
Paul Carlin Faculty of Wellbeing, Education and Language Studies, Open University, Belfast, United Kingdom
Jarmo Pääkkönen Centre for Health and Technology, University of Oulu, Oulu, Finland
Bart De Moor Department of Electrical Engineering (ESAT), Stadius Center for Dynamical Systems, Signal Processing and Data Analytics, KU Leuven, Leuven, Belgium

Collapse

Valenzuela W, Balsiger F, Wiest R, Scheidegger O. Medical-Blocks: A Platform for Exploration, Management, Analysis, and Sharing of Data in Biomedical Research. JMIR Form Res 2022;6:e32287. [PMID: 35232718 PMCID: PMC9039815 DOI: 10.2196/32287] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Revised: 02/04/2022] [Accepted: 02/28/2022] [Indexed: 02/07/2023] Open

Abstract

BACKGROUND

Biomedical research requires healthcare institutions to provide sensitive clinical data to leverage data science and artificial intelligence technologies. However, providing healthcare data to researchers simple and secure, proves to be challenging for healthcare institutions.

OBJECTIVE

We describe and introduce Medical-Blocks, a platform for data exploration, data management, data analysis, and data sharing in biomedical research.

METHODS

The specification requirements for Medical-Blocks included: i) Connection to data sources of healthcare institutions with an interface for data exploration, ii) management of data in an internal file storage system, iii) data analysis through visualization and classification of data, and iv) data sharing via a file hosting service for collaboration. Medical-Blocks should be simple to use via a web-based user interface and extensible with new functionalities by a modular design via microservices ("blocks"). The scalability of the platform should be ensured by containerization. Security and legal regulations were considered during the development.

RESULTS

Medical-Blocks is a web application that runs in the cloud or as a local instance at a healthcare institution. Local instances of Medical-Blocks access data sources such as electronic health records and picture archiving and communications system (PACS) at healthcare institutions. Researchers and clinicians can explore, manage, and analyze the available data through Medical-Blocks. The data analysis involves classification of data for metadata extraction and the formation of cohorts. In collaborations, metadata (e.g., number of patients per cohort) and/or the data itself can be shared through Medical-Blocks locally or via a cloud instance to other researchers and clinicians.

CONCLUSIONS

Medical-Blocks facilitates biomedical research by providing a centralized platform to interact with medical data in collaborative research projects. The access to and management of medical data is simplified. Data can be swiftly analyzed to form cohorts for research and be shared among researchers. The modularity of Medical-Blocks makes the platform feasible for biomedical research where heterogenous medical data is needed.

CLINICALTRIAL

Collapse

John Cremin C, Dash S, Huang X. Big Data: Historic Advances and Emerging Trends in Biomedical Research. CURRENT RESEARCH IN BIOTECHNOLOGY 2022. [DOI: 10.1016/j.crbiot.2022.02.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022] Open

Ahne A, Fagherazzi G, Tannier X, Czernichow T, Orchard F. Improving Diabetes-Related Biomedical Literature Exploration in the Clinical Decision-making Process via Interactive Classification and Topic Discovery: Methodology Development Study. J Med Internet Res 2022;24:e27434. [PMID: 35040795 PMCID: PMC8808347 DOI: 10.2196/27434] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2021] [Revised: 04/06/2021] [Accepted: 11/10/2021] [Indexed: 11/30/2022] Open

Abstract

BACKGROUND

The amount of available textual health data such as scientific and biomedical literature is constantly growing and becoming more and more challenging for health professionals to properly summarize those data and practice evidence-based clinical decision making. Moreover, the exploration of unstructured health text data is challenging for professionals without computer science knowledge due to limited time, resources, and skills. Current tools to explore text data lack ease of use, require high computational efforts, and incorporate domain knowledge and focus on topics of interest with difficulty.

OBJECTIVE

We developed a methodology able to explore and target topics of interest via an interactive user interface for health professionals with limited computer science knowledge. We aim to reach near state-of-the-art performance while reducing memory consumption, increasing scalability, and minimizing user interaction effort to improve the clinical decision-making process. The performance was evaluated on diabetes-related abstracts from PubMed.

METHODS

The methodology consists of 4 parts: (1) a novel interpretable hierarchical clustering of documents where each node is defined by headwords (words that best represent the documents in the node), (2) an efficient classification system to target topics, (3) minimized user interaction effort through active learning, and (4) a visual user interface. We evaluated our approach on 50,911 diabetes-related abstracts providing a hierarchical Medical Subject Headings (MeSH) structure, a unique identifier for a topic. Hierarchical clustering performance was compared against the implementation in the machine learning library scikit-learn. On a subset of 2000 randomly chosen diabetes abstracts, our active learning strategy was compared against 3 other strategies: random selection of training instances, uncertainty sampling that chooses instances about which the model is most uncertain, and an expected gradient length strategy based on convolutional neural networks (CNNs).

RESULTS

For the hierarchical clustering performance, we achieved an F1 score of 0.73 compared to 0.76 achieved by scikit-learn. Concerning active learning performance, after 200 chosen training samples based on these strategies, the weighted F1 score of all MeSH codes resulted in a satisfying 0.62 F1 score using our approach, 0.61 using the uncertainty strategy, 0.63 using the CNN, and 0.45 using the random strategy. Moreover, our methodology showed a constant low memory use with increased number of documents.

CONCLUSIONS

We proposed an easy-to-use tool for health professionals with limited computer science knowledge who combine their domain knowledge with topic exploration and target specific topics of interest while improving transparency. Furthermore, our approach is memory efficient and highly parallelizable, making it interesting for large Big Data sets. This approach can be used by health professionals to gain deep insights into biomedical literature to ultimately improve the evidence-based clinical decision making process.

Collapse