Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Total Articles: 155 (from Reference Citation Analysis)

Article PDFs (36)

Cited by > 0 (105)

Searched Name: Generalizability

Iacovino ML, Celant S, Tomassini L, Arenare L, Caglio A, Canciello A, Salerno F, Olimpieri PP, Di Segni S, Sferrazza A, Piccirillo MC, Beretta GD, Pinto C, Blasi L, Cinieri S, Cavanna L, Di Maio M, Russo P, Perrone F. Comparison of baseline patient characteristics in Italian oncology drug monitoring registries and clinical trials: a real-world cross-sectional study. Lancet Reg Health Eur 2024;41:100912. [PMID: 38665620 PMCID: PMC11041834 DOI: 10.1016/j.lanepe.2024.100912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/30/2024] [Revised: 04/05/2024] [Accepted: 04/08/2024] [Indexed: 04/28/2024]

Abstract

Background

Generalizability of registrative clinical trials to real-world clinical practice is influenced by comparability of patients in the two settings. We compared characteristics of cancer patients in registrative trials with real-world clinical practice in Italy.

Methods

Data on age, sex and performance status (PS) were derived from web-based monitoring registries developed by the Italian Medicines Agency (AIFA) and from the corresponding registrative trials reported in the European Public Assessment Reports (EPAR) of the European Medicines Agency (EMA). Weighted means were calculated in registries and trials and differences were described. Multivariate analysis was performed using Principal Component Analysis and Cluster Analysis.
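The weighted-mean comparison described in the Methods can be illustrated with a minimal sketch. The abstract does not state which weights were used; weighting each registry or trial by its number of patients is an assumption here, and every number below is hypothetical rather than taken from the study.

```python
def weighted_mean(values, weights):
    """Return the weighted mean of `values` given non-negative `weights`."""
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive number")
    return sum(v * w for v, w in zip(values, weights)) / total


# Hypothetical (median age, number of patients) pairs; NOT data from the study.
registries = [(68.0, 1200), (71.0, 800)]
trials = [(62.0, 400), (65.0, 600)]

# Assumption: each source is weighted by its patient count.
registry_age = weighted_mean([age for age, _ in registries],
                             [n for _, n in registries])
trial_age = weighted_mean([age for age, _ in trials],
                          [n for _, n in trials])

# Positive values mean registry patients are older on average.
difference = registry_age - trial_age
```

With patient-count weights, large registries or trials dominate the summary, which is one plausible reason to prefer weighted over simple means when sources differ widely in size.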

Findings

From January 2013 to April 2023, 419,461 unique pairs of patients and therapeutic indications were recorded in 129 AIFA registries. Within 140 related trials, 87,452 patients had been enrolled. Median age and rate of elderly (≥65 years old) patients were higher in monitoring registries than in clinical trials [mean difference of median age 5.3 years, p < 0.001; mean difference of elderly rate 17.17% (95% CI 1.06, 1.48)]. Overall, rate of female patients was not different between registries and trials [mean difference -0.55% (95% CI -1.06, -0.05)]. Mean rate of patients with deteriorated PS was low both in trials (3.1%) and in registries (4.3%), with a mean difference of 1.27% (95% CI 1.06, 1.48). Two clusters were identified with multivariate analysis: one including more registries (higher median age and elderly rate, lower female rate, higher rate of deteriorated patients), the other including more trials (lower median age and elderly rate, higher female rate, lower rate of deteriorated patients).

Interpretation

This study supports the view that cancer patients enrolled in trials only partially represent those who have been treated in clinical practice in Italy. Inclusiveness of registrative trials should be increased to ensure generalizability of results to the real-world population.

Funding

Partially supported by the Italian Ministry of Health.

Steingrimsson JA, Barker DH, Bie R, Dahabreh IJ. Systematically missing data in causally interpretable meta-analysis. Biostatistics 2024;25:289-305. [PMID: 36977366 PMCID: PMC11017122 DOI: 10.1093/biostatistics/kxad006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/02/2022] [Revised: 02/15/2023] [Accepted: 03/13/2023] [Indexed: 03/30/2023] Open

Keyes KM, Pakserian D, Rudolph KE, Salum G, Stuart EA. Population Neuroscience: Understanding Concepts of Generalizability and Transportability and Their Application to Improving the Public's Health. Curr Top Behav Neurosci 2024. [PMID: 38589636 DOI: 10.1007/7854_2024_465] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/10/2024]

Rossi L, Fiorentino MC, Mancini A, Paolanti M, Rosati R, Zingaretti P. Generalizability and robustness evaluation of attribute-based zero-shot learning. Neural Netw 2024;175:106278. [PMID: 38581809 DOI: 10.1016/j.neunet.2024.106278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/06/2023] [Revised: 02/15/2024] [Accepted: 03/26/2024] [Indexed: 04/08/2024]

Samadi ME, Guzman-Maldonado J, Nikulina K, Mirzaieazar H, Sharafutdinov K, Fritsch SJ, Schuppert A. A hybrid modeling framework for generalizable and interpretable predictions of ICU mortality across multiple hospitals. Sci Rep 2024;14:5725. [PMID: 38459085 PMCID: PMC10923850 DOI: 10.1038/s41598-024-55577-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/05/2023] [Accepted: 02/26/2024] [Indexed: 03/10/2024] Open

Tervo-Clemmens B, Karim ZA, Khan SZ, Ravindranath O, Somerville LH, Schuster RM, Gilman JM, Evins AE. The Developmental Timing but Not Magnitude of Adolescent Risk-Taking Propensity Is Consistent Across Social, Environmental, and Psychological Factors. J Adolesc Health 2024;74:613-616. [PMID: 38085210 DOI: 10.1016/j.jadohealth.2023.11.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/14/2023] [Revised: 10/03/2023] [Accepted: 11/02/2023] [Indexed: 02/05/2024]

Davidashvilly S, Cardei M, Hssayeni M, Chi C, Ghoraani B. Deep neural networks for wearable sensor-based activity recognition in Parkinson's disease: investigating generalizability and model complexity. Biomed Eng Online 2024;23:17. [PMID: 38336781 PMCID: PMC10858599 DOI: 10.1186/s12938-024-01214-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/24/2023] [Accepted: 01/25/2024] [Indexed: 02/12/2024] Open

Abstract

BACKGROUND

The research gap addressed in this study is the applicability of deep neural network (NN) models on wearable sensor data to recognize different activities performed by patients with Parkinson's Disease (PwPD) and the generalizability of these models to PwPD using labeled healthy data.

METHODS

The experiments were carried out utilizing three datasets containing wearable motion sensor readings on common activities of daily living. The collected readings were from two accelerometer sensors. PAMAP2 and MHEALTH are publicly available datasets collected from 10 and 9 healthy, young subjects, respectively. A private dataset of a similar nature collected from 14 PwPD was utilized as well. Deep NN models were implemented with varying levels of complexity to investigate the impact of data augmentation, manual axis reorientation, model complexity, and domain adaptation on activity recognition performance.

RESULTS

A moderately complex model trained on the augmented PAMAP2 dataset and adapted to the Parkinson domain using domain adaptation achieved the best activity recognition performance with an accuracy of 73.02%, which was significantly higher than the accuracy of 63% reported in previous studies. The model's F1 score of 49.79% significantly improved compared to the best cross-testing of 33.66% F1 score with only data augmentation and 2.88% F1 score without data augmentation or domain adaptation.

CONCLUSION

These findings suggest that deep NN models trained on healthy data have the potential to recognize activities performed by PwPD accurately and that data augmentation and domain adaptation can improve the generalizability of models in the healthy-to-PwPD transfer scenario. The simple and moderately complex architectures tested in this study could generalize better to the PwPD domain when trained on a healthy dataset than the most complex architectures used. The findings of this study could contribute to the development of accurate wearable-based activity monitoring solutions for PwPD, improving clinical decision-making and patient outcomes based on patient activity levels.

Tian W, Zhang Z, Bouffard D, Wu H, Xin K, Gu X, Liao Z. Enhancing interpretability and generalizability of deep learning-based emulator in three-dimensional lake hydrodynamics using Koopman operator and transfer learning: Demonstrated on the example of lake Zurich. Water Res 2024;249:120996. [PMID: 38103441 DOI: 10.1016/j.watres.2023.120996] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/02/2023] [Revised: 11/02/2023] [Accepted: 12/07/2023] [Indexed: 12/19/2023]

Ingrasciotta Y, Spini A, L'Abbate L, Fiore ES, Carollo M, Ientile V, Isgrò V, Cavazzana A, Biasi V, Rossi P, Ejlli L, Belleudi V, Poggi F, Sapigni E, Puccini A, Ancona D, Stella P, Pollina Addario S, Allotta A, Leoni O, Zanforlini M, Tuccori M, Gini R, Trifirò G. Comparing clinical trial population representativeness to real-world users of 17 biologics approved for immune-mediated inflammatory diseases: An external validity analysis of 66,639 biologic users from the Italian VALORE project. Pharmacol Res 2024;200:107074. [PMID: 38232909 DOI: 10.1016/j.phrs.2024.107074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/11/2023] [Revised: 01/11/2024] [Accepted: 01/11/2024] [Indexed: 01/19/2024]

Affiliation(s)

Ylenia Ingrasciotta: University of Verona, Department of Diagnostics and Public Health, Verona, Italy
Andrea Spini: University of Verona, Department of Diagnostics and Public Health, Verona, Italy
Luca L'Abbate: University of Messina, Department of Biomedical and Dental Sciences and Morphofunctional Imaging, Messina, Italy
Elena Sofia Fiore: University of Verona, Department of Diagnostics and Public Health, Verona, Italy
Massimo Carollo: University of Verona, Department of Diagnostics and Public Health, Verona, Italy
Valentina Ientile: University of Verona, Department of Diagnostics and Public Health, Verona, Italy
Valentina Isgrò: University of Verona, Department of Diagnostics and Public Health, Verona, Italy
Anna Cavazzana: Azienda Zero, Regione Veneto, Italy
Valeria Biasi: Azienda Zero, Regione Veneto, Italy
Paola Rossi: Direzione Centrale Salute Regione Friuli-Venezia Giulia, Trieste, Italy
Lucian Ejlli: Direzione Centrale Salute Regione Friuli-Venezia Giulia, Trieste, Italy
Valeria Belleudi: Lazio Regional Health Service, Department of Epidemiology, Rome, Italy
Francesca Poggi: Lazio Regional Health Service, Department of Epidemiology, Rome, Italy
Ester Sapigni: Emilia-Romagna Health Department, Hospital Assistance Service, Drug and Medical Device Area, Bologna, Italy
Aurora Puccini: Emilia-Romagna Health Department, Hospital Assistance Service, Drug and Medical Device Area, Bologna, Italy
Domenica Ancona: Apulian Regional Health Department, Bari, Italy
Paolo Stella: Apulian Regional Health Department, Bari, Italy
Sebastiano Pollina Addario: Epidemiologic Observatory of the Sicily Regional Health Service, Palermo, Italy
Alessandra Allotta: Epidemiologic Observatory of the Sicily Regional Health Service, Palermo, Italy
Olivia Leoni: Lombardy Regional Centre of Pharmacovigilance and Regional Epidemiologic Observatory, Milan, Italy
Martina Zanforlini: Azienda Regionale per l'Innovazione e gli Acquisti, S.p.A, Milan, Italy
Marco Tuccori: University Hospital of Pisa, Unit of Adverse Drug Reaction Monitoring, Italy
Rosa Gini: Agenzia Regionale di Sanità Toscana, Florence, Italy
Gianluca Trifirò: University of Verona, Department of Diagnostics and Public Health, Verona, Italy

Mansolf M, Blackwell CK, Cella D, Lai JS. Assessing the interchangeability of linked scores in multivariable statistical analyses. Qual Life Res 2024:10.1007/s11136-023-03592-x. [PMID: 38294666 DOI: 10.1007/s11136-023-03592-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 12/18/2023] [Indexed: 02/01/2024]

Ghaderi H, Foreman B, Reddy CK, Subbian V. Discovery of Generalizable TBI Phenotypes Using Multivariate Time-Series Clustering. ArXiv 2024:arXiv:2401.08002v1. [PMID: 38313201 PMCID: PMC10836078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/06/2024]

Luo Y, Chen W, Zhan L, Qiu J, Jia T. Multi-feature concatenation and multi-classifier stacking: An interpretable and generalizable machine learning method for MDD discrimination with rsfMRI. Neuroimage 2024;285:120497. [PMID: 38142755 DOI: 10.1016/j.neuroimage.2023.120497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/13/2023] [Revised: 11/21/2023] [Accepted: 12/11/2023] [Indexed: 12/26/2023] Open

Elliott MR, Carroll O, Grieve R, Carpenter J. Improving transportability of randomized controlled trial inference using robust prediction methods. Stat Methods Med Res 2023;32:2365-2385. [PMID: 37936293 DOI: 10.1177/09622802231210944] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/09/2023]

O'Hare KJM, Linscott RJ. Measurement invariance of brief forms of the Schizotypal Personality Questionnaire across convenience versus random samples. Schizophr Res 2023;262:76-83. [PMID: 37931562 DOI: 10.1016/j.schres.2023.10.033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/09/2022] [Revised: 09/25/2023] [Accepted: 10/28/2023] [Indexed: 11/08/2023]

Sudarshan NJ, Bowden SC. Common Factor Structure of the Ten Subtest Wechsler Adult Intelligence Scale-Fourth Edition in a Clinical Sample and 15 Subtest Version in the Standardization Sample. Arch Clin Neuropsychol 2023;38:1646-1658. [PMID: 37222085 PMCID: PMC10681435 DOI: 10.1093/arclin/acad035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 04/16/2023] [Indexed: 05/25/2023] Open

Zsidai B, Hilkert AS, Kaarre J, Narup E, Senorski EH, Grassi A, Ley C, Longo UG, Herbst E, Hirschmann MT, Kopf S, Seil R, Tischer T, Samuelsson K, Feldt R. A practical guide to the implementation of AI in orthopaedic research - part 1: opportunities in clinical application and overcoming existing challenges. J Exp Orthop 2023;10:117. [PMID: 37968370 PMCID: PMC10651597 DOI: 10.1186/s40634-023-00683-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/24/2023] [Accepted: 10/21/2023] [Indexed: 11/17/2023] Open

Affiliation(s)

Bálint Zsidai: Sahlgrenska Sports Medicine Center, Gothenburg, Sweden; Department of Orthopaedics, Institute of Clinical Sciences, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
Ann-Sophie Hilkert: Department of Computer Science and Engineering, Chalmers University of Technology, Gothenburg, Sweden; Medfield Diagnostics AB, Gothenburg, Sweden
Janina Kaarre: Sahlgrenska Sports Medicine Center, Gothenburg, Sweden; Department of Orthopaedics, Institute of Clinical Sciences, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden; Department of Orthopaedic Surgery, UPMC Freddie Fu Sports Medicine Center, University of Pittsburgh, Pittsburgh, USA
Eric Narup: Sahlgrenska Sports Medicine Center, Gothenburg, Sweden; Department of Orthopaedics, Institute of Clinical Sciences, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
Eric Hamrin Senorski: Sahlgrenska Sports Medicine Center, Gothenburg, Sweden; Department of Health and Rehabilitation, Institute of Neuroscience and Physiology, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden; Sportrehab Sports Medicine Clinic, Gothenburg, Sweden
Alberto Grassi: Department of Orthopaedics, Institute of Clinical Sciences, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden; IIa Clinica Ortopedica E Traumatologica, IRCCS Istituto Ortopedico Rizzoli, Bologna, Italy
Christophe Ley: Department of Mathematics, University of Luxembourg, Esch-Sur-Alzette, Luxembourg
Umile Giuseppe Longo: Department of Orthopaedic and Trauma Surgery, Campus Bio-Medico University, Rome, Italy
Elmar Herbst: Department of Trauma, Hand and Reconstructive Surgery, University Hospital Münster, Münster, Germany
Michael T Hirschmann: Department of Orthopedic Surgery and Traumatology, Head Knee Surgery and DKF Head of Research, Kantonsspital Baselland, 4101, Bruderholz, Switzerland
Sebastian Kopf: Center of Orthopaedics and Traumatology, University Hospital Brandenburg a.d.H., Brandenburg Medical School Theodor Fontane, 14770, Brandenburg a.d.H., Germany; Faculty of Health Sciences Brandenburg, Brandenburg Medical School Theodor Fontane, 14770, Brandenburg a.d.H., Germany
Romain Seil: Department of Orthopaedic Surgery, Centre Hospitalier Luxembourg and Luxembourg Institute of Health, Luxembourg, Luxembourg
Thomas Tischer: Clinic for Orthopaedics and Trauma Surgery, Malteser Waldkrankenhaus St. Marien, Erlangen, Germany
Kristian Samuelsson: Sahlgrenska Sports Medicine Center, Gothenburg, Sweden; Department of Orthopaedics, Institute of Clinical Sciences, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden; Department of Orthopaedics, Sahlgrenska University Hospital, Mölndal, Sweden
Robert Feldt: Department of Orthopaedics, Institute of Clinical Sciences, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden

Rajaraman S, Yang F, Zamzmi G, Xue Z, Antani S. Can Deep Adult Lung Segmentation Models Generalize to the Pediatric Population? Expert Syst Appl 2023;229:120531. [PMID: 37397242 PMCID: PMC10310063 DOI: 10.1016/j.eswa.2023.120531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 07/04/2023]

Nilsson A, Björk J, Strömberg U, Bonander C. Can non-participants in a follow-up be used to draw conclusions about incidences and prevalences in the full population invited at baseline? An investigation based on the Swedish MDC cohort. BMC Med Res Methodol 2023;23:228. [PMID: 37821822 PMCID: PMC10568880 DOI: 10.1186/s12874-023-02053-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/12/2023] [Accepted: 10/01/2023] [Indexed: 10/13/2023] Open

Abstract

BACKGROUND

Participants in epidemiological cohorts may not be representative of the full invited population, limiting the generalizability of prevalence and incidence estimates. We propose that this problem can be remedied by exploiting data on baseline participants who refused to participate in a re-examination, as such participants may be more similar to baseline non-participants than are baseline participants who agree to participate in the re-examination.

METHODS

We compared background characteristics, mortality, and disease incidences across the full population invited to the Malmö Diet and Cancer (MDC) study, the baseline participants, the baseline non-participants, the baseline participants who participated in a re-examination, and the baseline participants who did not participate in the re-examination. We then considered two models for estimating characteristics and outcomes in the full population: one ("the substitution model") assuming that the baseline non-participants were similar to the baseline participants who refused to participate in the re-examination, and one ("the extrapolation model") assuming that differences between the full group of baseline participants and the baseline participants who participated in the re-examination could be extended to infer results in the full population. Finally, we compared prevalences of baseline risk factors including smoking, risky drinking, overweight, and obesity across baseline participants, baseline participants who participated in the re-examination, and baseline participants who did not participate in the re-examination, and used the above models to estimate the prevalences of these factors in the full invited population.
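The substitution model described above can be sketched in a few lines. This is one possible reading of the abstract, not the authors' implementation: the function name, its parameters, and the numbers are hypothetical. The extrapolation model is not sketched because the abstract does not give its exact form.

```python
def substitution_estimate(n_invited, n_participants,
                          prev_participants, prev_dropouts):
    """Estimate prevalence in the full invited population.

    Assumption (substitution model as read from the abstract): baseline
    non-participants have the same prevalence as baseline participants
    who did not take part in the re-examination ("dropouts").
    """
    if not 0 < n_participants <= n_invited:
        raise ValueError("need 0 < n_participants <= n_invited")
    n_non_participants = n_invited - n_participants
    # Observed cases among participants plus imputed cases among
    # non-participants, divided by everyone who was invited.
    expected_cases = (n_participants * prev_participants
                      + n_non_participants * prev_dropouts)
    return expected_cases / n_invited


# Hypothetical numbers; NOT data from the MDC study.
estimate = substitution_estimate(
    n_invited=1000,
    n_participants=400,
    prev_participants=0.25,  # e.g. smoking prevalence among all baseline participants
    prev_dropouts=0.35,      # prevalence among those who skipped the re-examination
)
```

When the dropouts have a higher prevalence than baseline participants overall, the estimate for the invited population rises above the naive participant-only figure, which matches the direction of the result reported for smoking.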

RESULTS

Compared to baseline non-participants, baseline participants were less likely to be immigrants, had higher socioeconomic status, and lower mortality and disease incidences. Baseline participants not participating in the re-examination generally resembled the full population. The extrapolation model often generated characteristics and incidences even more similar to the full population. The prevalences of risk factors, particularly smoking, were estimated to be substantially higher in the full population than among the baseline participants.

CONCLUSIONS

Participants in epidemiological cohorts such as the MDC study are unlikely to be representative of the full invited population. Exploiting data on baseline participants who did not participate in a re-examination can be a simple and useful way to improve the generalizability of prevalence and incidence estimates.

Buckley PR, Murry VM, Gust CJ, Ladika A, Pampel FC. Racial and Ethnic Representation in Preventive Intervention Research: a Methodological Study. Prev Sci 2023;24:1261-1274. [PMID: 37386352 DOI: 10.1007/s11121-023-01564-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 06/07/2023] [Indexed: 07/01/2023]

Abstract

Individuals who are Asian or Asian American, Black or African American, Native American or American Indian or Alaska Native, Native Hawaiian or Pacific Islander, and Hispanic or Latino (i.e., presently considered racial ethnic minoritized groups in the USA) lacked equal access to resources for mitigating risk during COVID-19, which highlighted public health disparities and exacerbated inequities rooted in structural racism that have contributed to many injustices, such as failing public school systems and unsafe neighborhoods. Minoritized groups are also vulnerable to climate change wherein the most severe harms disproportionately fall upon underserved communities. While systemic changes are needed to address these pervasive syndemic conditions, immediate efforts involve examining strategies to promote equitable health and well-being, which served as the impetus for this study. We conducted a descriptive analysis on the prevalence of culturally tailored interventions and reporting of sample characteristics among 885 programs with evaluations published from 2010 to 2021 and recorded in the Blueprints for Healthy Youth Development registry. Inferential analyses also examined (1) reporting time trends and (2) the relationship between study quality (i.e., strong methods, beneficial effects) and culturally tailored programs and racial ethnic enrollment. Two percent of programs were developed for Black or African American youth, and 4% targeted Hispanic or Latino populations. For the 77% of studies that reported race, most enrollees were White (35%) followed by Black or African American (28%), and 31% collapsed across race or categorized race with ethnicity. In the 64% of studies that reported ethnicity, 32% of enrollees were Hispanic or Latino. Reporting has not improved, and there was no relationship between high-quality studies and programs developed for racial ethnic youth, or samples with high proportions of racial ethnic enrollees. Research gaps on racial ethnic groups call for clear reporting and better representation to reduce disparities and improve the utility of interventions.

Nilsson A, Strömberg U, Björk J, Forsberg A, Fritzell K, Kemp Gudmundsdottir KR, Engdahl J, Bonander C. Examining the continuum of resistance model in two population-based screening studies in Sweden. Prev Med Rep 2023;35:102317. [PMID: 37519442 PMCID: PMC10372382 DOI: 10.1016/j.pmedr.2023.102317] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/18/2023] [Revised: 06/20/2023] [Accepted: 07/08/2023] [Indexed: 08/01/2023] Open

Lin C, Bulls LS, Tepfer LJ, Vyas AD, Thornton MA. Advancing Naturalistic Affective Science with Deep Learning. Affect Sci 2023;4:550-562. [PMID: 37744976 PMCID: PMC10514024 DOI: 10.1007/s42761-023-00215-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 01/11/2023] [Accepted: 08/03/2023] [Indexed: 09/26/2023]

Abstract

People express their own emotions and perceive others' emotions via a variety of channels, including facial movements, body gestures, vocal prosody, and language. Studying these channels of affective behavior offers insight into both the experience and perception of emotion. Prior research has predominantly focused on studying individual channels of affective behavior in isolation using tightly controlled, non-naturalistic experiments. This approach limits our understanding of emotion in more naturalistic contexts where different channels of information tend to interact. Traditional methods struggle to address this limitation: manually annotating behavior is time-consuming, making it infeasible to do at large scale; manually selecting and manipulating stimuli based on hypotheses may neglect unanticipated features, potentially generating biased conclusions; and common linear modeling approaches cannot fully capture the complex, nonlinear, and interactive nature of real-life affective processes. In this methodology review, we describe how deep learning can be applied to address these challenges to advance a more naturalistic affective science. First, we describe current practices in affective research and explain why existing methods face challenges in revealing a more naturalistic understanding of emotion. Second, we introduce deep learning approaches and explain how they can be applied to tackle three main challenges: quantifying naturalistic behaviors, selecting and manipulating naturalistic stimuli, and modeling naturalistic affective processes. Finally, we describe the limitations of these deep learning methods, and how these limitations might be avoided or mitigated. By detailing the promise and the peril of deep learning, this review aims to pave the way for a more naturalistic affective science.

Magoc T, Allen KS, McDonnell C, Russo JP, Cummins J, Vest JR, Harle CA. Generalizability and portability of natural language processing system to extract individual social risk factors. Int J Med Inform 2023;177:105115. [PMID: 37302362 DOI: 10.1016/j.ijmedinf.2023.105115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/08/2023] [Revised: 05/15/2023] [Accepted: 05/30/2023] [Indexed: 06/13/2023]

Ong SWX, Tong SYC, Daneman N. Are we enrolling the right patients? A scoping review of external validity and generalizability of clinical trials in bloodstream infections. Clin Microbiol Infect 2023:S1198-743X(23)00402-0. [PMID: 37633330 DOI: 10.1016/j.cmi.2023.08.019] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 06/19/2023] [Revised: 08/15/2023] [Accepted: 08/20/2023] [Indexed: 08/28/2023]

Abstract

BACKGROUND

Having a representative population in randomized clinical trials (RCTs) improves external validity and generalizability of trial results. There are limited data examining differences between RCT-enrolled and real-world populations in bloodstream infections (BSI).

OBJECTIVES

We conducted a scoping review aiming to review studies assessing generalizability of BSI RCT populations, to identify sub-groups that have been systematically under-represented and to explore approaches to improve external validity of future RCTs.

SOURCES

MEDLINE, Embase, and Cochrane Library databases were searched for terms related to external validity or generalizability, BSI, and clinical trials in papers published up to 1 August 2023. Studies comparing enrolled versus nonenrolled patients, or papers discussing external validity or generalizability in the context of BSI RCTs were included.

CONTENT

Sixteen papers were included in the final review. Five compared RCT-enrolled and nonenrolled participants from the same source population. There were significant differences between the two groups in all studies, with nonenrolled patients having a greater comorbidity burden and consistently worse outcomes including mortality. We identified several barriers to improving generalizability of RCT populations and outlined potential approaches to reduce these barriers, such as alternative/simplified consent processes, streamlining eligibility criteria and follow-up procedures, quota-based sampling techniques, and ensuring diversity in site and study team selection.

IMPLICATIONS

Study cohorts in BSI RCTs are not representative of the general BSI patient population. As we increasingly adopt large pragmatic trials in infectious diseases, it is important to recognize the importance of maximizing generalizability to ensure that our research findings are of direct relevance to our patients.

Zhou S, Wang N, Wang L, Sun J, Blaes A, Liu H, Zhang R. A cross-institutional evaluation on breast cancer phenotyping NLP algorithms on electronic health records. Comput Struct Biotechnol J 2023;22:32-40. [PMID: 37680211 PMCID: PMC10480628 DOI: 10.1016/j.csbj.2023.08.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/16/2023] [Revised: 08/15/2023] [Accepted: 08/21/2023] [Indexed: 09/09/2023] Open

Abstract

Objective

Transformer-based language models are prevalent in the clinical domain because of their excellent performance on clinical NLP tasks, but their generalizability is usually ignored during model development. This study evaluated the generalizability of CancerBERT, a Transformer-based clinical NLP model, along with classic machine learning models, i.e., conditional random field (CRF) and bi-directional long short-term memory CRF (BiLSTM-CRF), across different clinical institutes through a breast cancer phenotype extraction task.

Materials and methods

Two clinical corpora of breast cancer patients were collected from the electronic health records of the University of Minnesota (UMN) and Mayo Clinic (MC), and annotated following the same guideline. We developed three types of NLP models (i.e., CRF, BiLSTM-CRF and CancerBERT) to extract cancer phenotypes from clinical texts. We evaluated the generalizability of the models on different test sets with different learning strategies (model transfer vs locally trained). The entity coverage scores were assessed for their association with model performance.

Results

We manually annotated 200 and 161 clinical documents at UMN and MC, respectively. The corpora of the two institutes were found to have higher similarity between the target entities than between the overall corpora. The CancerBERT models obtained the best performance on the independent test sets from the two clinical institutes and on the permutation test set. The CancerBERT model developed in one institute and further fine-tuned in another institute achieved reasonable performance compared to the model developed on local data (micro-F1: 0.925 vs 0.932).
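The micro-F1 figures quoted above pool true positives, false positives and false negatives over all documents before computing precision and recall. A minimal sketch of that calculation follows; the representation of an entity as a `(start, end, label)` tuple and the function name `micro_f1` are illustrative assumptions, not details taken from the study.

```python
def micro_f1(gold, predicted):
    """Micro-averaged F1 for entity extraction.

    gold, predicted: lists with one set per document; each set holds
    (start, end, label) tuples. This tuple layout is a hypothetical
    convention chosen for the example.
    """
    tp = fp = fn = 0
    for gold_entities, predicted_entities in zip(gold, predicted):
        tp += len(gold_entities & predicted_entities)   # exact matches
        fp += len(predicted_entities - gold_entities)   # spurious predictions
        fn += len(gold_entities - predicted_entities)   # missed entities
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy data, invented for illustration only.
gold = [{(0, 5, "SIZE"), (10, 15, "GRADE")}, {(3, 8, "STAGE")}]
predicted = [{(0, 5, "SIZE")}, {(3, 8, "STAGE"), (20, 25, "GRADE")}]
score = micro_f1(gold, predicted)
```

Because counts are pooled before averaging, frequent entity types dominate the score, which is one reason a micro-averaged number can hide weak performance on rare labels.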

Conclusions

The results indicate that the CancerBERT model has superior learning ability and generalizability among the three types of clinical NLP models for our named entity recognition task. It has the advantage of recognizing complex entities, e.g., entities with different labels.

Nakayama LF, Zago Ribeiro L, de Oliveira JAE, de Matos JCRG, Mitchell WG, Malerbi FK, Celi LA, Regatieri CVS. Fairness and generalizability of OCT normative databases: a comparative analysis. Int J Retina Vitreous 2023;9:48. [PMID: 37605208 PMCID: PMC10440930 DOI: 10.1186/s40942-023-00459-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/13/2023] [Accepted: 03/26/2023] [Indexed: 08/23/2023] Open

Abstract

PURPOSE

In supervised machine learning algorithms, labels and reports are important in model development. To provide a normality assessment, OCT devices have a built-in normative database that yields a color-coded scale by comparing each measurement against the database. This article aims to evaluate and compare the normative databases of different OCT machines, analyzing patient demographics, inclusion and exclusion criteria, diversity index, and statistical approach to assess their fairness and generalizability.

METHODS

Data were retrieved from the FDA approval documentation and equipment manuals of Cirrus, Avanti, Spectralis, and Triton. The following variables were compared: number of eyes and patients, inclusion and exclusion criteria, statistical approach, sex, race and ethnicity, age, participant country, and diversity index.

RESULTS

Avanti OCT has the largest normative database (640 eyes). In every database, the inclusion and exclusion criteria were similar, including adult patients and excluding pathological eyes. Spectralis has the proportionally largest White representation (79.7%), Cirrus the largest Asian representation (24%), and Triton the largest Black representation (22%). In all databases, the statistical analysis applied was regression models. The sex diversity index is similar in all datasets and comparable to that of the ten most populous countries. The Avanti dataset has the highest diversity index in terms of race, followed by Cirrus, Triton, and Spectralis.

CONCLUSION

In all analyzed databases, the data framework is static, with limited upgrade options and lacking normative databases for new modules. As a result, caution in OCT normality interpretation is warranted. To address these limitations, there is a need for more diverse, representative, and open-access datasets that take into account patient demographics, especially considering the development of supervised Machine Learning algorithms in healthcare.

Tik N, Gal S, Madar A, Ben-David T, Bernstein-Eliav M, Tavor I. Generalizing prediction of task-evoked brain activity across datasets and populations. Neuroimage 2023;276:120213. [PMID: 37268097 DOI: 10.1016/j.neuroimage.2023.120213] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/27/2023] [Revised: 05/28/2023] [Accepted: 05/30/2023] [Indexed: 06/04/2023] Open

Abstract

Predictions of task-based functional magnetic resonance imaging (fMRI) from task-free resting-state (rs) fMRI have gained popularity over the past decade. This method holds great promise for studying individual variability in brain function without the need to perform highly demanding tasks. However, in order to be broadly used, prediction models must prove to generalize beyond the dataset they were trained on. In this work, we test the generalizability of prediction of task-fMRI from rs-fMRI across sites, MRI vendors and age groups. Moreover, we investigate the data requirements for successful prediction. We use the Human Connectome Project (HCP) dataset to explore how different combinations of training sample sizes and number of fMRI datapoints affect prediction success in various cognitive tasks. We then apply models trained on HCP data to predict brain activations in data from a different site, a different MRI vendor (Philips vs. Siemens scanners) and a different age group (children from the HCP-development project). We demonstrate that, depending on the task, a training set of approximately 20 participants with 100 fMRI timepoints each yields the largest gain in model performance. Nevertheless, further increasing sample size and number of timepoints results in significantly improved predictions, until reaching approximately 450-600 training participants and 800-1000 timepoints. Overall, the number of fMRI timepoints influences prediction success more than the sample size. We further show that models trained on adequate amounts of data successfully generalize across sites, vendors and age groups and provide predictions that are both accurate and individual-specific. These findings suggest that large-scale publicly available datasets may be utilized to study brain function in smaller, unique samples.

Missiou A, Ntalaouti E, Lionis C, Evangelou E, Tatsioni A. Underreporting contextual factors preclude the applicability appraisal in primary care randomized controlled trials. J Clin Epidemiol 2023;160:24-32. [PMID: 37311513 DOI: 10.1016/j.jclinepi.2023.06.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/24/2022] [Revised: 05/21/2023] [Accepted: 06/06/2023] [Indexed: 06/15/2023]

Steingrimsson JA. Extending prediction models for use in a new target population with failure time outcomes. Biostatistics 2023;24:728-742. [PMID: 35389429 DOI: 10.1093/biostatistics/kxac011] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 12/13/2021] [Revised: 03/14/2022] [Accepted: 03/21/2022] [Indexed: 07/20/2023] Open

Hong H, Liu L, Mojtabai R, Stuart EA. Calibrated meta-analysis to estimate the efficacy of mental health treatments in target populations: an application to paliperidone trials for treatment of schizophrenia. BMC Med Res Methodol 2023;23:150. [PMID: 37365521 DOI: 10.1186/s12874-023-01958-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/05/2022] [Accepted: 05/25/2023] [Indexed: 06/28/2023] Open

Abstract

BACKGROUND

Meta-analyses can be a powerful tool, but they need to be calibrated for the potential unrepresentativeness of the included trials relative to a target population. Estimating target population average treatment effects (TATE) in meta-analyses is important to understand how treatments perform in well-defined target populations. This study estimated TATE of paliperidone palmitate in patients with schizophrenia using meta-analysis with individual patient trial data and target population data.

METHODS

We conducted a meta-analysis with data from four randomized clinical trials and target population data from the Clinical Antipsychotic Trials of Intervention Effectiveness (CATIE) study. Efficacy was measured using the Positive and Negative Syndrome Scale (PANSS). Weights to equate the trial participants and target population were calculated by comparing baseline characteristics between the trials and CATIE. A calibrated weighted meta-analysis with random effects was performed to estimate the TATE of paliperidone compared to placebo.
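To make the idea of weights that equate trial participants with a target population concrete, here is a deliberately simplified sketch that weights on a single categorical baseline characteristic. The abstract does not describe the weighting model used in the study, so the stratum-ratio approach, the function names and the toy numbers below are all assumptions made for illustration.

```python
from collections import Counter


def calibration_weights(trial_strata, target_strata):
    """Weight each trial participant by (target share) / (trial share) of
    the stratum they belong to, so the weighted trial sample matches the
    target population on that one characteristic."""
    trial_counts = Counter(trial_strata)
    target_counts = Counter(target_strata)
    n_trial = len(trial_strata)
    n_target = len(target_strata)
    weights = []
    for stratum in trial_strata:
        target_share = target_counts[stratum] / n_target
        trial_share = trial_counts[stratum] / n_trial
        weights.append(target_share / trial_share)
    return weights


def weighted_mean(values, weights):
    """Weighted average of per-participant values."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


# Toy data, invented for illustration: the trial over-represents one stratum.
trial_strata = ["younger", "younger", "younger", "older"]
target_strata = ["younger", "younger", "older", "older"]
improvement = [10.0, 10.0, 10.0, 2.0]   # hypothetical per-participant outcome

weights = calibration_weights(trial_strata, target_strata)
unweighted = sum(improvement) / len(improvement)
calibrated = weighted_mean(improvement, weights)
```

With several baseline characteristics, the same ratio is usually obtained from a model of sample membership rather than from raw stratum counts; the single-stratum version only shows why the calibrated estimate can differ from the unweighted one.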

RESULTS

A total of 1,738 patients were included in the meta-analysis along with 1,458 patients in CATIE. After weighting, the covariate distributions of the trial participants and target population were similar. Compared to placebo, paliperidone palmitate was associated with a significant reduction of the PANSS total score under both unweighted (mean difference 9.07 [4.43, 13.71]) and calibrated weighted (mean difference 6.15 [2.22, 10.08]) meta-analysis.

CONCLUSIONS

The effect of paliperidone palmitate compared with placebo is slightly smaller in the target population than that estimated directly from the unweighted meta-analysis. Representativeness of samples of trials included in a meta-analysis to a target population should be assessed and incorporated properly to obtain the most reliable evidence of treatment effects in target populations.

Huo T, Glueck DH, Shenkman EA, Muller KE. Stratified split sampling of electronic health records. BMC Med Res Methodol 2023;23:128. [PMID: 37231360 DOI: 10.1186/s12874-023-01938-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/18/2022] [Accepted: 05/04/2023] [Indexed: 05/27/2023] Open

Abstract

Although superficially similar to data from clinical research, data extracted from electronic health records may require fundamentally different approaches for model building and analysis. Because electronic health record data is designed for clinical, rather than scientific use, researchers must first provide clear definitions of outcome and predictor variables. Yet an iterative process of defining outcomes and predictors, assessing association, and then repeating the process may increase Type I error rates, and thus decrease the chance of replicability, defined by the National Academy of Sciences as the chance of "obtaining consistent results across studies aimed at answering the same scientific question, each of which has obtained its own data."[1] In addition, failure to account for subgroups may mask heterogeneous associations between predictor and outcome by subgroups, and decrease the generalizability of the findings. To increase chances of replicability and generalizability, we recommend using a stratified split sample approach for studies using electronic health records. A split sample approach divides the data randomly into an exploratory set for iterative variable definition, iterative analyses of association, and consideration of subgroups. The confirmatory set is used only to replicate results found in the first set. The addition of the word 'stratified' indicates that rare subgroups are oversampled randomly by including them in the exploratory sample at higher rates than appear in the population. The stratified sampling provides a sufficient sample size for assessing heterogeneity of association by testing for effect modification by group membership. An electronic health record study of the associations between socio-demographic factors and uptake of hepatic cancer screening, and potential heterogeneity of association in subgroups defined by gender, self-identified race and ethnicity, census-tract level poverty and insurance type illustrates the recommended approach.
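One way to read the stratified split sample described above is as a random split in which each subgroup has its own probability of landing in the exploratory set. The sketch below follows that reading; the function name `stratified_split`, the per-group rates and the toy records are assumptions for illustration, not the authors' procedure.

```python
import random
from collections import defaultdict


def stratified_split(records, group_of, exploratory_rate, default_rate=0.5, seed=0):
    """Randomly split records into exploratory and confirmatory sets.

    exploratory_rate maps a subgroup label to the fraction of that subgroup
    placed in the exploratory set; giving rare subgroups a higher rate
    oversamples them there. Groups not listed use default_rate.
    """
    rng = random.Random(seed)
    by_group = defaultdict(list)
    for record in records:
        by_group[group_of(record)].append(record)

    exploratory, confirmatory = [], []
    for group, members in by_group.items():
        rng.shuffle(members)                      # random assignment within group
        rate = exploratory_rate.get(group, default_rate)
        n_exploratory = round(rate * len(members))
        exploratory.extend(members[:n_exploratory])
        confirmatory.extend(members[n_exploratory:])
    return exploratory, confirmatory


# Toy records, invented for illustration: (subgroup, patient index).
records = [("common", i) for i in range(100)] + [("rare", i) for i in range(10)]
exploratory, confirmatory = stratified_split(
    records, group_of=lambda r: r[0], exploratory_rate={"rare": 0.8}
)
```

The confirmatory set should then be touched only once, to replicate whatever was found in the exploratory set.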

Malik HB, Norman JB. Best Practices and Methodological Strategies for Addressing Generalizability in Neuropsychological Assessment. J Pediatr Neuropsychol 2023;9:47-63. [PMID: 37250805 PMCID: PMC10182845 DOI: 10.1007/s40817-023-00145-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/05/2022] [Revised: 04/11/2023] [Accepted: 04/15/2023] [Indexed: 05/31/2023]

Zhang J, Ma X, Zhang J, Sun D, Zhou X, Mi C, Wen H. Insights into geospatial heterogeneity of landslide susceptibility based on the SHAP-XGBoost model. J Environ Manage 2023;332:117357. [PMID: 36731409 DOI: 10.1016/j.jenvman.2023.117357] [Citation(s) in RCA: 17] [Impact Index Per Article: 17.0] [Received: 08/19/2022] [Revised: 01/05/2023] [Accepted: 01/22/2023] [Indexed: 06/18/2023]

Abstract

The spatial heterogeneity of landslide influencing factors is the main reason for the poor generalizability of the susceptibility evaluation model. This study aimed to construct a comprehensive explanatory framework for landslide susceptibility evaluation models based on the SHAP (SHapley Additive explanation)-XGBoost (eXtreme Gradient Boosting) algorithm, analyze the regional characteristics and spatial heterogeneity of landslide influencing factors, and discuss the heterogeneity of the generalizability of the models under different landscapes. First, we selected different regions within a typical mountainous and hilly area and constructed a geospatial database containing 12 landslide influencing factors such as elevation, annual average rainfall, slope, lithology, and NDVI through field surveys, satellite images, and a literature review. Subsequently, the landslide susceptibility evaluation model was constructed based on the XGBoost algorithm and spatial database, and the prediction results of the landslide susceptibility evaluation model were explained based on regional topography, geology, and hydrology using the SHAP algorithm. Finally, the model was generalized and applied to regions with both similar and very different topography, geology, meteorology, and vegetation, to explore the spatial heterogeneity of the generalizability of the model. The following conclusions were drawn: the spatial distribution of landslides is heterogeneous and complex, and the contribution of each influencing factor to the occurrence of landslides has obvious regional characteristics and spatial heterogeneity. The generalizability of the landslide susceptibility evaluation model is spatially heterogeneous, and the model generalizes better to regions with similar regional characteristics. Further explanation of the XGBoost landslide susceptibility evaluation model using the SHAP method allows quantitative analysis of the differences in how much various factors contribute to disasters due to spatial heterogeneity, from the perspective of global and local evaluation units. In summary, the integrated explanatory framework based on the SHAP-XGBoost model can quantify the contribution of influencing factors to landslide occurrence at both global and local levels, which is conducive to the construction and improvement of the influencing factor system of landslide susceptibility in different regions. It can also provide a reference for predicting potential landslide hazard-prone areas and for Explainable Artificial Intelligence (XAI) research.

Yang Y, Sánchez-Tójar A, O'Dea RE, Noble DWA, Koricheva J, Jennions MD, Parker TH, Lagisz M, Nakagawa S. Publication bias impacts on effect size, statistical power, and magnitude (Type M) and sign (Type S) errors in ecology and evolutionary biology. BMC Biol 2023;21:71. [PMID: 37013585 PMCID: PMC10071700 DOI: 10.1186/s12915-022-01485-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Received: 09/21/2021] [Accepted: 11/29/2022] [Indexed: 04/05/2023] Open

Abstract

Collaborative efforts to directly replicate empirical studies in the medical and social sciences have revealed alarmingly low rates of replicability, a phenomenon dubbed the 'replication crisis'. Poor replicability has spurred cultural changes targeted at improving reliability in these disciplines. Given the absence of equivalent replication projects in ecology and evolutionary biology, two inter-related indicators offer the opportunity to retrospectively assess replicability: publication bias and statistical power. This registered report assesses the prevalence and severity of small-study (i.e., smaller studies reporting larger effect sizes) and decline effects (i.e., effect sizes decreasing over time) across ecology and evolutionary biology using 87 meta-analyses comprising 4,250 primary studies and 17,638 effect sizes. Further, we estimate how publication bias might distort the estimation of effect sizes, statistical power, and errors in magnitude (Type M or exaggeration ratio) and sign (Type S). We show strong evidence for the pervasiveness of both small-study and decline effects in ecology and evolution. There was widespread prevalence of publication bias that resulted in meta-analytic means being over-estimated by (at least) 0.12 standard deviations. The prevalence of publication bias distorted confidence in meta-analytic results, with 66% of initially statistically significant meta-analytic means becoming non-significant after correcting for publication bias. Ecological and evolutionary studies consistently had low statistical power (15%) with a 4-fold exaggeration of effects on average (Type M error rates = 4.4). Notably, publication bias reduced power from 23% to 15% and increased Type M error rates from 2.7 to 4.4 because it creates a non-random sample of effect size evidence. The sign errors of effect sizes (Type S error) increased from 5% to 8% because of publication bias. Our research provides clear evidence that many published ecological and evolutionary findings are inflated. Our results highlight the importance of designing high-power empirical studies (e.g., via collaborative team science), promoting and encouraging replication studies, testing and correcting for publication bias in meta-analyses, and adopting open and transparent research practices, such as (pre)registration, data- and code-sharing, and transparent reporting.
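The link between low power, the exaggeration ratio (Type M) and sign errors (Type S) can be illustrated with a small simulation: draw many estimates around a known true effect, keep only the statistically significant ones, and look at how far and in which direction they miss. This is a generic sketch under a normal sampling assumption, not the estimation procedure used in the registered report; the function name and the example effect size and standard error are hypothetical.

```python
import random


def type_m_and_s(true_effect, std_error, n_sims=20000, z_crit=1.96, seed=0):
    """Simulate power, Type M (exaggeration ratio) and Type S (wrong-sign rate).

    Assumes estimates are normally distributed around a nonzero true_effect
    with the given standard error, and that 'significant' means
    |estimate| > z_crit * std_error.
    """
    rng = random.Random(seed)
    significant = []
    for _ in range(n_sims):
        estimate = rng.gauss(true_effect, std_error)
        if abs(estimate) > z_crit * std_error:
            significant.append(estimate)

    power = len(significant) / n_sims
    if not significant:
        return power, float("nan"), float("nan")

    # Type M: average magnitude of significant estimates relative to the truth.
    mean_abs = sum(abs(e) for e in significant) / len(significant)
    type_m = mean_abs / abs(true_effect)
    # Type S: share of significant estimates whose sign is wrong.
    type_s = sum(1 for e in significant if e * true_effect < 0) / len(significant)
    return power, type_m, type_s


# Hypothetical low-power setting: true effect equal to one standard error.
power, type_m, type_s = type_m_and_s(true_effect=0.2, std_error=0.2)
```

In a setting like this, the estimates that cross the significance threshold are on average well above the true effect, which is the mechanism behind the inflation the abstract describes.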

Okada G, Yoshioka T, Yamashita A, Itai E, Yokoyama S, Kamishikiryo T, Shinzato H, Masuda Y, Mitsuyama Y, Kan S, Kurata A, Takamura M, Yoshino A, Mantani A, Yamamoto O, Yokota N, Tamura T, Jitsuiki H, Kawato M, Yamashita O, Sakai Y, Okamoto Y. Verification of the brain network marker of major depressive disorder: Test-retest reliability and anterograde generalization performance for newly acquired data. J Affect Disord 2023;326:262-6. [PMID: 36717028 DOI: 10.1016/j.jad.2023.01.087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/08/2022] [Revised: 12/23/2022] [Accepted: 01/04/2023] [Indexed: 02/01/2023]

Basnight-Brown D, Janssen SMJ, Thomas AK. Exploration of human cognitive universals and human cognitive diversity. Mem Cognit 2023;51:505-8. [PMID: 36859524 DOI: 10.3758/s13421-023-01410-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 02/21/2023] [Indexed: 03/03/2023]

Seaborn K, Barbareschi G, Chandra S. Not Only WEIRD but "Uncanny"? A Systematic Review of Diversity in Human-Robot Interaction Research. Int J Soc Robot 2023:1-30. [PMID: 37359427 PMCID: PMC9993363 DOI: 10.1007/s12369-023-00968-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 01/19/2023] [Indexed: 03/29/2023]

Cook RR, Foot C, Arah OA, Humphreys K, Rudolph KE, Luo SX, Tsui JI, Levander XA, Korthuis PT. Estimating the impact of stimulant use on initiation of buprenorphine and extended-release naltrexone in two clinical trials and real-world populations. Addict Sci Clin Pract 2023;18:11. [PMID: 36788634 PMCID: PMC9930351 DOI: 10.1186/s13722-023-00364-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/29/2022] [Accepted: 02/01/2023] [Indexed: 02/16/2023] Open

Abstract

BACKGROUND

Co-use of stimulants and opioids is rapidly increasing. Randomized clinical trials (RCTs) have established the efficacy of medications for opioid use disorder (MOUD), but stimulant use may decrease the likelihood of initiating MOUD treatment. Furthermore, trial participants may not represent "real-world" populations who would benefit from treatment.

METHODS

We conducted a two-stage analysis. First, associations between stimulant use (time-varying urine drug screens for cocaine, methamphetamine, or amphetamines) and initiation of buprenorphine or extended-release naltrexone (XR-NTX) were estimated across two RCTs (CTN-0051 X:BOT and CTN-0067 CHOICES) using adjusted Cox regression models. Second, results were generalized to three target populations who would benefit from MOUD: housed adults identifying the need for OUD treatment, as characterized by the National Survey on Drug Use and Health (NSDUH); adults entering OUD treatment, as characterized by the Treatment Episodes Dataset (TEDS); and adults living in rural regions of the U.S. with high rates of injection drug use, as characterized by the Rural Opioids Initiative (ROI). Generalizability analyses adjusted for differences in demographic characteristics, substance use, housing status, and depression between RCT and target populations using inverse probability of selection weighting.

RESULTS

Analyses included 673 clinical trial participants, 139 NSDUH respondents (weighted to represent 661,650 people), 71,751 TEDS treatment episodes, and 1,933 ROI participants. The majority were aged 30-49 years, male, and non-Hispanic White. In RCTs, stimulant use reduced the likelihood of MOUD initiation by 32% (adjusted HR [aHR] = 0.68, 95% CI 0.49-0.94, p = 0.019). Stimulant use associations were slightly attenuated and non-significant among housed adults needing treatment (25% reduction, aHR = 0.75, 0.48-1.18, p = 0.215) and adults entering OUD treatment (28% reduction, aHR = 0.72, 0.51-1.01, p = 0.061). The association was more pronounced, but still non-significant among rural people injecting drugs (39% reduction, aHR = 0.61, 0.35-1.06, p = 0.081). Stimulant use had a larger negative impact on XR-NTX initiation compared to buprenorphine, especially in the rural population (76% reduction, aHR = 0.24, 0.08-0.69, p = 0.008).

CONCLUSIONS

Stimulant use is a barrier to buprenorphine or XR-NTX initiation in clinical trials and real-world populations that would benefit from OUD treatment. Interventions to address stimulant use among patients with OUD are urgently needed, especially among rural people injecting drugs, who already suffer from limited access to MOUD.

Robertson SE, Steingrimsson JA, Dahabreh IJ. Regression-based estimation of heterogeneous treatment effects when extending inferences from a randomized trial to a target population. Eur J Epidemiol 2023;38:123-133. [PMID: 36626100 PMCID: PMC10986821 DOI: 10.1007/s10654-022-00901-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 10/04/2021] [Accepted: 07/11/2022] [Indexed: 01/11/2023]

Gard AM, Hyde LW, Heeringa SG, West BT, Mitchell C. Why weight? Analytic approaches for large-scale population neuroscience data. Dev Cogn Neurosci 2023;59:101196. [PMID: 36630774 PMCID: PMC9843279 DOI: 10.1016/j.dcn.2023.101196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/06/2022] [Revised: 12/30/2022] [Accepted: 01/05/2023] [Indexed: 01/09/2023] Open

Jujjavarapu C, Suri P, Pejaver V, Friedly J, Gold LS, Meier E, Cohen T, Mooney SD, Heagerty PJ, Jarvik JG. Predicting decompression surgery by applying multimodal deep learning to patients' structured and unstructured health data. BMC Med Inform Decis Mak 2023;23:2. [PMID: 36609379 PMCID: PMC9824905 DOI: 10.1186/s12911-022-02096-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Received: 05/30/2022] [Accepted: 12/29/2022] [Indexed: 01/08/2023] Open

Abstract

BACKGROUND

Low back pain (LBP) is a common condition made up of a variety of anatomic and clinical subtypes. Lumbar disc herniation (LDH) and lumbar spinal stenosis (LSS) are two subtypes highly associated with LBP. Patients with LDH/LSS often start with non-surgical treatments and, if those are not effective, go on to have decompression surgery. However, recommending surgery is complicated because the outcome may depend on the patient's health characteristics. We developed a deep learning (DL) model to predict decompression surgery for patients with LDH/LSS.

MATERIALS AND METHODS

We used datasets of 8387 and 8620 patients from a prospective study that collected data from four healthcare systems to predict early (within 2 months) and late surgery (within 12 months after a 2 month gap), respectively. We developed a DL model to use patients' demographics, diagnosis and procedure codes, drug names, and diagnostic imaging reports to predict surgery. For each prediction task, we evaluated the model's performance using classical and generalizability evaluation. For classical evaluation, we split the data into training (80%) and testing (20%). For generalizability evaluation, we split the data based on the healthcare system. We used the area under the curve (AUC) to assess performance for each evaluation. We compared results to a benchmark model (i.e. LASSO logistic regression).
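The two evaluation schemes above differ only in how the data are partitioned. The sketch below contrasts a random 80/20 split with a split that holds out one healthcare system at a time; the abstract only says the data were split "based on the healthcare system", so the leave-one-system-out reading, the function names and the record layout are assumptions for illustration.

```python
import random


def classical_split(records, test_fraction=0.2, seed=0):
    """Random train/test split that ignores which system a record came from."""
    rng = random.Random(seed)
    shuffled = list(records)
    rng.shuffle(shuffled)
    n_test = round(test_fraction * len(shuffled))
    return shuffled[n_test:], shuffled[:n_test]   # (train, test)


def leave_one_system_out(records, system_of):
    """Yield (held_out_system, train, test) with one whole system as the test set."""
    systems = sorted({system_of(r) for r in records})
    for held_out in systems:
        train = [r for r in records if system_of(r) != held_out]
        test = [r for r in records if system_of(r) == held_out]
        yield held_out, train, test


# Toy records, invented for illustration: four hypothetical systems A-D.
records = [{"id": i, "system": "ABCD"[i % 4]} for i in range(20)]

train, test = classical_split(records)
folds = list(leave_one_system_out(records, system_of=lambda r: r["system"]))
```

Under the random split, every system contributes to both training and testing, so the test score says little about how the model behaves at a site it has never seen; the system-wise split is what probes that.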

RESULTS

For classical performance, the DL model outperformed the benchmark model for early surgery with an AUC of 0.725 compared to 0.597. For late surgery, the DL model outperformed the benchmark model with an AUC of 0.655 compared to 0.635. For generalizability performance, the DL model outperformed the benchmark model for early surgery. For late surgery, the benchmark model outperformed the DL model.

CONCLUSIONS

For early surgery, the DL model was preferred for classical and generalizability evaluation. However, for late surgery, the benchmark and DL model had comparable performance. Depending on the prediction task, the balance of performance may shift between DL and a conventional ML method. As a result, thorough assessment is needed to quantify the value of DL, a relatively computationally expensive, time-consuming and less interpretable method.

Affiliation(s)

Chethan Jujjavarapu Department of Biomedical Informatics and Medical Education, School of Medicine, University of Washington, Box 358047, Seattle, WA, 98195, USA
Pradeep Suri Clinical Learning, Evidence and Research Center, University of Washington, 4333 Brooklyn Ave NE, Seattle, WA, 98105, USA Department of Rehabilitation Medicine, University of Washington, 1959 NE Pacific St, Seattle, WA, 98195, USA
Vikas Pejaver Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
Janna Friedly Clinical Learning, Evidence and Research Center, University of Washington, 4333 Brooklyn Ave NE, Seattle, WA, 98105, USA Department of Rehabilitation Medicine, University of Washington, 1959 NE Pacific St, Seattle, WA, 98195, USA
Laura S Gold Clinical Learning, Evidence and Research Center, University of Washington, 4333 Brooklyn Ave NE, Seattle, WA, 98105, USA Department of Radiology, University of Washington, 1959 NE Pacific Street, Seattle, WA, 98195, USA
Eric Meier Clinical Learning, Evidence and Research Center, University of Washington, 4333 Brooklyn Ave NE, Seattle, WA, 98105, USA Department of Biostatistics, University of Washington, Box 357232, Seattle, WA, 98195-7232, USA Center for Biomedical Statistics, University of Washington, Seattle, WA, USA
Trevor Cohen Department of Biomedical Informatics and Medical Education, School of Medicine, University of Washington, Box 358047, Seattle, WA, 98195, USA
Sean D Mooney Department of Biomedical Informatics and Medical Education, School of Medicine, University of Washington, Box 358047, Seattle, WA, 98195, USA
Patrick J Heagerty Department of Biostatistics, University of Washington, Box 357232, Seattle, WA, 98195-7232, USA Center for Biomedical Statistics, University of Washington, Seattle, WA, USA
Jeffrey G Jarvik Clinical Learning, Evidence and Research Center, University of Washington, 4333 Brooklyn Ave NE, Seattle, WA, 98105, USA Department of Radiology, University of Washington, 1959 NE Pacific Street, Seattle, WA, 98195, USA Department of Neurological Surgery, University of Washington, 1959 NE Pacific Street, Seattle, WA, 98195, USA Department of Health Services, University of Washington, Box 357660, Seattle, WA, 98195-7660, USA

Wang P, Giovannucci EL. Are exposure-disease relationships assessed in cohorts of health professionals generalizable?: a comparative analysis based on WCRF/AICR systematic literature reviews. Cancer Causes Control 2023;34:39-45. [PMID: 36197566 DOI: 10.1007/s10552-022-01633-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 02/17/2022] [Accepted: 09/15/2022] [Indexed: 01/10/2023]

Jalusic KO, Ellenberger D, Stahmann A, Berger K. Adverse events in MS patients fulfilling or not inclusion criteria of the respective clinical trial - The problem of generalizability. Mult Scler Relat Disord 2023;69:104422. [PMID: 36455503 DOI: 10.1016/j.msard.2022.104422] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/16/2021] [Revised: 11/16/2022] [Accepted: 11/18/2022] [Indexed: 11/21/2022]

Krantz MF, Hjorthøj C, Ellersgaard D, Hemager N, Christiani C, Spang KS, Burton BK, Gregersen M, Søndergaard A, Greve A, Ohland J, Mortensen PB, Plessen KJ, Bliksted V, Jepsen JRM, Thorup AAE, Mors O, Nordentoft M. Examining selection bias in a population-based cohort study of 522 children with familial high risk of schizophrenia or bipolar disorder, and controls: The Danish High Risk and Resilience Study VIA 7. Soc Psychiatry Psychiatr Epidemiol 2023;58:113-140. [PMID: 36087138 DOI: 10.1007/s00127-022-02338-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/29/2021] [Accepted: 07/08/2022] [Indexed: 01/20/2023]

Abstract

PURPOSE

Knowledge about representativity of familial high-risk studies of schizophrenia and bipolar disorder is essential to generalize study conclusions. The Danish High Risk and Resilience Study (VIA 7), a population-based case-control familial high-risk study, creates a unique opportunity for combining assessment and register data to examine cohort representativity.

METHODS

Through national registers, we identified the population of 11,959 children of parents with schizophrenia (FHR-SZ) or bipolar disorder (FHR-BP) and controls from which the 522 children participating in The VIA 7 Study (202 FHR-SZ, 120 FHR-BP and 200 controls) were selected. Socio-economic and health data were obtained to compare high-risk groups and controls, and participants versus non-participants. Selection bias impact on results was analyzed through inverse probability weights.

RESULTS

In the total sample of 11,959 children, FHR-SZ and FHR-BP children had more socio-economic and health disadvantages than controls (p < 0.001 for most). VIA 7 non-participants had a poorer function, e.g. more paternal somatic and mental illness (p = 0.02 and p = 0.04 for FHR-SZ), notifications of concern (FHR-BP and PBC p < 0.001), placements out of home (p = 0.03 for FHR-SZ), and lower level of education (p ≤ 0.01 for maternal FHR-SZ and FHR-BP, p = 0.001 for paternal FHR-BP). Inverse probability weighted analyses of results generated from the VIA Study showed minor changes in study findings after adjustment for the found selection bias.

CONCLUSIONS

Familial high-risk families have multiple socio-economic and health disadvantages. In The VIA 7 Study, although comparable regarding mental illness severity after their child's birth, socioeconomic and health disadvantages are more profound amongst non-participants than amongst participants.
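As a rough illustration of the inverse probability weighting mentioned in the Methods, the sketch below reweights participants by the inverse of their estimated probability of participating. It is not the authors' analysis: the strata, counts, and outcome scores are hypothetical, and participation probabilities are estimated as simple stratum proportions rather than from a fitted model.

```python
from collections import defaultdict

def participation_weights(population):
    """Return 1 / P(participation | stratum) for each stratum.

    `population` is a list of (stratum, participated) pairs covering
    everyone in the source population, participants or not.
    """
    totals = defaultdict(int)
    joined = defaultdict(int)
    for stratum, participated in population:
        totals[stratum] += 1
        joined[stratum] += int(participated)
    # Strata with no participants cannot be reweighted, so they are skipped.
    return {s: totals[s] / joined[s] for s in totals if joined[s] > 0}

def weighted_mean(participants, weights):
    """Inverse-probability-weighted mean of an outcome among participants.

    `participants` is a list of (stratum, outcome) pairs.
    """
    numerator = sum(weights[s] * y for s, y in participants)
    denominator = sum(weights[s] for s, _ in participants)
    return numerator / denominator

# Hypothetical source population: the low-education stratum participates less.
population = (
    [("low_edu", True)] * 10 + [("low_edu", False)] * 30
    + [("high_edu", True)] * 30 + [("high_edu", False)] * 10
)
weights = participation_weights(population)

# Hypothetical outcome scores, observed only among participants.
participants = [("low_edu", 2.0)] * 10 + [("high_edu", 1.0)] * 30
naive = sum(y for _, y in participants) / len(participants)
adjusted = weighted_mean(participants, weights)
print(f"naive mean: {naive:.2f}, weighted mean: {adjusted:.2f}")
```

In this toy setup the under-represented stratum receives the larger weight, so the weighted mean moves toward what the full source population would be expected to show. A small shift between the two means would correspond to the "minor changes" pattern reported in the Results.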

Affiliation(s)

Mette Falkenberg Krantz CORE- Copenhagen Research Center for Mental Health, Mental Health Center Copenhagen, The Danish High Risk and Resilience Study VIA 7 and VIA 11, Capital Region of Denmark, Copenhagen University Hospital, Gentofte Hospitalsvej 15, opg. 15, 1. Sal., 2900, Hellerup, Denmark.,Faculty of Health and Medical Sciences, Institute of Clinical Medicine, University of Copenhagen, Blegdamsvej 3B, 2200, Copenhagen, Denmark.,iPSYCH -The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Fuglesangs Allé 26, Aarhus N, 8210, Arhus, Denmark
Carsten Hjorthøj CORE- Copenhagen Research Center for Mental Health, Mental Health Center Copenhagen, The Danish High Risk and Resilience Study VIA 7 and VIA 11, Capital Region of Denmark, Copenhagen University Hospital, Gentofte Hospitalsvej 15, opg. 15, 1. Sal., 2900, Hellerup, Denmark.,iPSYCH -The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Fuglesangs Allé 26, Aarhus N, 8210, Arhus, Denmark.,Department of Public Health, Section of Epidemiology, University of Copenhagen, Copenhagen, Denmark
Ditte Ellersgaard CORE- Copenhagen Research Center for Mental Health, Mental Health Center Copenhagen, The Danish High Risk and Resilience Study VIA 7 and VIA 11, Capital Region of Denmark, Copenhagen University Hospital, Gentofte Hospitalsvej 15, opg. 15, 1. Sal., 2900, Hellerup, Denmark.,iPSYCH -The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Fuglesangs Allé 26, Aarhus N, 8210, Arhus, Denmark
Nicoline Hemager CORE- Copenhagen Research Center for Mental Health, Mental Health Center Copenhagen, The Danish High Risk and Resilience Study VIA 7 and VIA 11, Capital Region of Denmark, Copenhagen University Hospital, Gentofte Hospitalsvej 15, opg. 15, 1. Sal., 2900, Hellerup, Denmark.,Faculty of Health and Medical Sciences, Institute of Clinical Medicine, University of Copenhagen, Blegdamsvej 3B, 2200, Copenhagen, Denmark.,iPSYCH -The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Fuglesangs Allé 26, Aarhus N, 8210, Arhus, Denmark
Camilla Christiani CORE- Copenhagen Research Center for Mental Health, Mental Health Center Copenhagen, The Danish High Risk and Resilience Study VIA 7 and VIA 11, Capital Region of Denmark, Copenhagen University Hospital, Gentofte Hospitalsvej 15, opg. 15, 1. Sal., 2900, Hellerup, Denmark.,Faculty of Health and Medical Sciences, Institute of Clinical Medicine, University of Copenhagen, Blegdamsvej 3B, 2200, Copenhagen, Denmark.,iPSYCH -The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Fuglesangs Allé 26, Aarhus N, 8210, Arhus, Denmark
Katrine Søborg Spang Research Unit at Child and Adolescent Mental Health Center Copenhagen, Gentofte Hospitalsvej 3A, opg. 3A, 1. sal, 2900, Hellerup, Denmark.,iPSYCH -The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Fuglesangs Allé 26, Aarhus N, 8210, Arhus, Denmark
Birgitte Klee Burton Research Unit at Child and Adolescent Mental Health Center Copenhagen, Gentofte Hospitalsvej 3A, opg. 3A, 1. sal, 2900, Hellerup, Denmark
Maja Gregersen CORE- Copenhagen Research Center for Mental Health, Mental Health Center Copenhagen, The Danish High Risk and Resilience Study VIA 7 and VIA 11, Capital Region of Denmark, Copenhagen University Hospital, Gentofte Hospitalsvej 15, opg. 15, 1. Sal., 2900, Hellerup, Denmark.,Faculty of Health and Medical Sciences, Institute of Clinical Medicine, University of Copenhagen, Blegdamsvej 3B, 2200, Copenhagen, Denmark.,iPSYCH -The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Fuglesangs Allé 26, Aarhus N, 8210, Arhus, Denmark
Anne Søndergaard CORE- Copenhagen Research Center for Mental Health, Mental Health Center Copenhagen, The Danish High Risk and Resilience Study VIA 7 and VIA 11, Capital Region of Denmark, Copenhagen University Hospital, Gentofte Hospitalsvej 15, opg. 15, 1. Sal., 2900, Hellerup, Denmark.,Faculty of Health and Medical Sciences, Institute of Clinical Medicine, University of Copenhagen, Blegdamsvej 3B, 2200, Copenhagen, Denmark.,iPSYCH -The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Fuglesangs Allé 26, Aarhus N, 8210, Arhus, Denmark
Aja Greve iPSYCH -The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Fuglesangs Allé 26, Aarhus N, 8210, Arhus, Denmark.,The Psychosis Research Unit, Aarhus University Hospital, Psychiatry, Palle Juul-Jensens Boulevard 175, Aarhus N, 8200, Arhus, Denmark.,Department of Clinical Medicine, Faculty of Health and Medical Services, Aarhus University, Palle Juul-Jensens Boulevard 82, Aarhus N, 8200, Arhus, Denmark
Jessica Ohland CORE- Copenhagen Research Center for Mental Health, Mental Health Center Copenhagen, The Danish High Risk and Resilience Study VIA 7 and VIA 11, Capital Region of Denmark, Copenhagen University Hospital, Gentofte Hospitalsvej 15, opg. 15, 1. Sal., 2900, Hellerup, Denmark.,iPSYCH -The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Fuglesangs Allé 26, Aarhus N, 8210, Arhus, Denmark
Preben Bo Mortensen iPSYCH -The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Fuglesangs Allé 26, Aarhus N, 8210, Arhus, Denmark.,Department of Economics and Business Economics, National Centre for Register-Based Research, Aarhus University, Fuglesangs Allé 26, Bygning R2640-R2641, Aarhus V, 8210, Arhus, Denmark
Kerstin Jessica Plessen CORE- Copenhagen Research Center for Mental Health, Mental Health Center Copenhagen, The Danish High Risk and Resilience Study VIA 7 and VIA 11, Capital Region of Denmark, Copenhagen University Hospital, Gentofte Hospitalsvej 15, opg. 15, 1. Sal., 2900, Hellerup, Denmark.,iPSYCH -The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Fuglesangs Allé 26, Aarhus N, 8210, Arhus, Denmark.,Division of Child and Adolescent Psychiatry, Department of Psychiatry, Lausanne University Hospital, Avenue d'Echallens 9, 1004, Lausanne, Switzerland
Vibeke Bliksted iPSYCH -The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Fuglesangs Allé 26, Aarhus N, 8210, Arhus, Denmark.,The Psychosis Research Unit, Aarhus University Hospital, Psychiatry, Palle Juul-Jensens Boulevard 175, Aarhus N, 8200, Arhus, Denmark.,Department of Clinical Medicine, Faculty of Health and Medical Services, Aarhus University, Palle Juul-Jensens Boulevard 82, Aarhus N, 8200, Arhus, Denmark
Jens Richardt Møllegaard Jepsen CORE- Copenhagen Research Center for Mental Health, Mental Health Center Copenhagen, The Danish High Risk and Resilience Study VIA 7 and VIA 11, Capital Region of Denmark, Copenhagen University Hospital, Gentofte Hospitalsvej 15, opg. 15, 1. Sal., 2900, Hellerup, Denmark.,Research Unit at Child and Adolescent Mental Health Center Copenhagen, Gentofte Hospitalsvej 3A, opg. 3A, 1. sal, 2900, Hellerup, Denmark.,iPSYCH -The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Fuglesangs Allé 26, Aarhus N, 8210, Arhus, Denmark.,Mental Health Services, Capital Region of Denmark, Center for Clinical Intervention and Neuropsychiatric Schizophrenia Research, Mental Health Center Glostrup, Nordstjernevej 41, 2600, Glostrup, Denmark
Anne A E Thorup CORE- Copenhagen Research Center for Mental Health, Mental Health Center Copenhagen, The Danish High Risk and Resilience Study VIA 7 and VIA 11, Capital Region of Denmark, Copenhagen University Hospital, Gentofte Hospitalsvej 15, opg. 15, 1. Sal., 2900, Hellerup, Denmark.,Faculty of Health and Medical Sciences, Institute of Clinical Medicine, University of Copenhagen, Blegdamsvej 3B, 2200, Copenhagen, Denmark.,Research Unit at Child and Adolescent Mental Health Center Copenhagen, Gentofte Hospitalsvej 3A, opg. 3A, 1. sal, 2900, Hellerup, Denmark.,iPSYCH -The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Fuglesangs Allé 26, Aarhus N, 8210, Arhus, Denmark
Ole Mors iPSYCH -The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Fuglesangs Allé 26, Aarhus N, 8210, Arhus, Denmark.,The Psychosis Research Unit, Aarhus University Hospital, Psychiatry, Palle Juul-Jensens Boulevard 175, Aarhus N, 8200, Arhus, Denmark.,Department of Clinical Medicine, Faculty of Health and Medical Services, Aarhus University, Palle Juul-Jensens Boulevard 82, Aarhus N, 8200, Arhus, Denmark
Merete Nordentoft CORE- Copenhagen Research Center for Mental Health, Mental Health Center Copenhagen, The Danish High Risk and Resilience Study VIA 7 and VIA 11, Capital Region of Denmark, Copenhagen University Hospital, Gentofte Hospitalsvej 15, opg. 15, 1. Sal., 2900, Hellerup, Denmark.,Faculty of Health and Medical Sciences, Institute of Clinical Medicine, University of Copenhagen, Blegdamsvej 3B, 2200, Copenhagen, Denmark.,iPSYCH -The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Fuglesangs Allé 26, Aarhus N, 8210, Arhus, Denmark

Khan KS, Bueno Cavanillas A, Zamora J. [Systematic reviews in five steps: V. Interpreting the findings]. Semergen 2023;49:101854. [PMID: 36410229 DOI: 10.1016/j.semerg.2022.101854] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/05/2022] [Revised: 09/10/2022] [Accepted: 09/17/2022] [Indexed: 11/19/2022]

Müller L, Kloeckner R, Mildenberger P, Pinto Dos Santos D. [Validation and implementation of artificial intelligence in radiology : Quo vadis in 2022?]. Radiologie (Heidelb) 2022;63:381-386. [PMID: 36510007 DOI: 10.1007/s00117-022-01097-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 11/17/2022] [Indexed: 12/14/2022]

Fink DS, Stohl M, Mannes ZL, Shmulewitz D, Wall M, Gutkind S, Olfson M, Gradus J, Keyhani S, Maynard C, Keyes KM, Sherman S, Martins S, Saxon AJ, Hasin DS. Comparing mental and physical health of U.S. veterans by VA healthcare use: implications for generalizability of research in the VA electronic health records. BMC Health Serv Res 2022;22:1500. [PMID: 36494829 PMCID: PMC9733218 DOI: 10.1186/s12913-022-08899-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Received: 07/13/2022] [Accepted: 11/28/2022] [Indexed: 12/13/2022]

Affiliation(s)

David S. Fink New York State Psychiatric Institute, New York, NY USA
Malka Stohl New York State Psychiatric Institute, New York, NY USA
Zachary L. Mannes Columbia University Mailman School of Public Health, New York, NY USA
Dvora Shmulewitz New York State Psychiatric Institute, New York, NY USA; Columbia University Mailman School of Public Health, New York, NY USA
Melanie Wall New York State Psychiatric Institute, New York, NY USA; Columbia University Mailman School of Public Health, New York, NY USA
Sarah Gutkind Columbia University Mailman School of Public Health, New York, NY USA
Mark Olfson New York State Psychiatric Institute, New York, NY USA; Columbia University Mailman School of Public Health, New York, NY USA
Jaimie Gradus Boston University School of Public Health, Boston, MA USA
Salomeh Keyhani Veteran Affairs, San Francisco, VA USA; University of California, San Francisco, CA USA
Charles Maynard Veteran Affairs, Puget Sound Health Care System, Seattle, WA USA; University of Washington, Seattle, WA USA
Katherine M. Keyes Columbia University Mailman School of Public Health, New York, NY USA
Scott Sherman New York University, New York, NY USA
Silvia Martins Columbia University Mailman School of Public Health, New York, NY USA
Andrew J. Saxon Veteran Affairs, Puget Sound Health Care System, Seattle, WA USA; University of Washington, Seattle, WA USA
Deborah S. Hasin New York State Psychiatric Institute, New York, NY USA; Columbia University Mailman School of Public Health, New York, NY USA; Department of Psychiatry, Columbia University Medical Center, 1051 Riverside Dr., Unit 123, New York, NY 10032 USA

Nadeem SA, Comellas AP, Hoffman EA, Saha PK. Airway Detection in COPD at Low-Dose CT Using Deep Learning and Multiparametric Freeze and Grow. Radiol Cardiothorac Imaging 2022;4:e210311. [PMID: 36601453 PMCID: PMC9806731 DOI: 10.1148/ryct.210311] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 12/12/2021] [Revised: 09/27/2022] [Accepted: 10/27/2022] [Indexed: 06/17/2023]

Abstract

PURPOSE

To present and validate a fully automated airway detection method at low-dose CT in patients with chronic obstructive pulmonary disease (COPD).

MATERIALS AND METHODS

In this retrospective study, deep learning (DL) and freeze-and-grow (FG) methods were optimized and applied to automatically detect airways at low-dose CT. Four data sets were used: two data sets consisting of matching standard- and low-dose CT scans from the Genetic Epidemiology of COPD (COPDGene) phase II (2014-2017) cohort (n = 2 × 236; mean age ± SD, 70 years ± 9; 123 women); one data set consisting of low-dose CT scans from the COPDGene phase III (2018-2020) cohort (n = 335; mean age ± SD, 73 years ± 8; 173 women); and one data set consisting of low-dose, anonymized CT scans from the 2003 Dutch-Belgian Randomized Lung Cancer Screening trial (n = 55) acquired by using different CT scanners. Performance measures for different methods were computed and compared by using the Wilcoxon signed rank test.

RESULTS

At low-dose CT, 56 294 of 62 480 (90.1%) airways of the reference total airway count (TAC) and 32 109 of 37 864 (84.8%) airways of the peripheral TAC (TAC_p), detected at standard-dose CT, were detected. Significant losses (P < .001) of 14 526 of 76 453 (19.0%) airways and 884 of 6908 (12.8%) airways in the TAC and 12 256 of 43 462 (28.2%) airways and 699 of 3882 (18.0%) airways in the TAC_p were observed, respectively, for the multiprotocol and multiscanner data without retraining. When using the automated low-dose CT method, TAC values of 347, 342, 323, and 266 and TAC_p values of 205, 202, 289, and 141 were observed for those who have never smoked and participants at Global Initiative for Chronic Obstructive Lung Disease stages 0, 1, and 2, respectively, which were superior to the respective values previously reported for matching groups when using a semiautomated method at standard-dose CT.

CONCLUSION

A low-cost, automated CT-based airway detection method was suitable for investigation of airway phenotypes at low-dose CT.

Keywords: Airway, Airway Count, Airway Detection, Chronic Obstructive Pulmonary Disease, CT, Deep Learning, Generalizability, Low-Dose CT, Segmentation, Thorax, Lung

Clinical trial registration no. NCT00608764

Supplemental material is available for this article. © RSNA, 2022.

Patel AU, Mohanty SK, Parwani AV. Applications of Digital and Computational Pathology and Artificial Intelligence in Genitourinary Pathology Diagnostics. Surg Pathol Clin 2022;15:759-785. [PMID: 36344188 DOI: 10.1016/j.path.2022.08.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/16/2023]

van Klaveren D, Zanos TP, Nelson J, Levy TJ, Park JG, Retel Helmrich IRA, Rietjens JAC, Basile MJ, Hajizadeh N, Lingsma HF, Kent DM. Prognostic models for COVID-19 needed updating to warrant transportability over time and space. BMC Med 2022;20:456. [PMID: 36424619 PMCID: PMC9686462 DOI: 10.1186/s12916-022-02651-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/22/2022] [Accepted: 11/04/2022] [Indexed: 11/25/2022]

Abstract

BACKGROUND

Supporting decisions for patients who present to the emergency department (ED) with COVID-19 requires accurate prognostication. We aimed to evaluate prognostic models for predicting outcomes in hospitalized patients with COVID-19, in different locations and across time.

METHODS

We included patients who presented to the ED with suspected COVID-19 and were admitted to 12 hospitals in the New York City (NYC) area and 4 large Dutch hospitals. We used second-wave patients who presented between September and December 2020 (2137 and 3252 in NYC and the Netherlands, respectively) to evaluate models that were developed on first-wave patients who presented between March and August 2020 (12,163 and 5831). We evaluated two prognostic models for in-hospital death: The Northwell COVID-19 Survival (NOCOS) model was developed on NYC data and the COVID Outcome Prediction in the Emergency Department (COPE) model was developed on Dutch data. These models were validated on subsequent second-wave data at the same site (temporal validation) and at the other site (geographic validation). We assessed model performance by the Area Under the receiver operating characteristic Curve (AUC), by the E-statistic, and by net benefit.

RESULTS

Twenty-eight-day mortality was considerably higher in the NYC first-wave data (21.0%), compared to the second-wave (10.1%) and the Dutch data (first wave 10.8%; second wave 10.0%). COPE discriminated well at temporal validation (AUC 0.82), with excellent calibration (E-statistic 0.8%). At geographic validation, discrimination was satisfactory (AUC 0.78), but with moderate over-prediction of mortality risk, particularly in higher-risk patients (E-statistic 2.9%). While discrimination was adequate when NOCOS was tested on second-wave NYC data (AUC 0.77), NOCOS systematically overestimated the mortality risk (E-statistic 5.1%). Discrimination in the Dutch data was good (AUC 0.81), but with over-prediction of risk, particularly in lower-risk patients (E-statistic 4.0%). Recalibration of COPE and NOCOS led to limited net benefit improvement in Dutch data, but to substantial net benefit improvement in NYC data.

CONCLUSIONS

NOCOS performed moderately worse than COPE, probably reflecting unique aspects of the early pandemic in NYC. Frequent updating of prognostic models is likely to be required for transportability over time and space during a dynamic pandemic.
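To make the distinction between discrimination and calibration in this abstract concrete, the sketch below computes an AUC and a crude calibration summary on hypothetical validation data. The calibration summary here is the mean predicted risk minus the observed event rate, a simpler stand-in chosen for illustration; it is not the E-statistic used in the study, and the data and function names are assumptions.

```python
def auc(y_true, y_score):
    """Area under the ROC curve via pairwise comparison.

    Equals the probability that a randomly chosen patient who died
    has a higher predicted risk than one who survived (ties count half).
    """
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def mean_overprediction(y_true, y_score):
    """Mean predicted risk minus observed event rate.

    Positive values indicate that risk is overestimated on average.
    """
    return sum(y_score) / len(y_score) - sum(y_true) / len(y_true)

# Hypothetical validation cohort: 1 = died in hospital, 0 = survived.
y_true = [0, 0, 0, 0, 1, 0, 1, 1]
y_score = [0.10, 0.20, 0.30, 0.40, 0.50, 0.60, 0.70, 0.90]

discrimination = auc(y_true, y_score)
calibration_gap = mean_overprediction(y_true, y_score)
print(f"AUC: {discrimination:.3f}, mean over-prediction: {calibration_gap:.4f}")
```

A model can rank patients well (high AUC) while still overestimating risk on average (positive gap), which is the pattern the Results describe for NOCOS on second-wave data. Recalibration targets the second quantity and leaves the ranking largely unchanged.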

Affiliation(s)

David van Klaveren Department of Public Health, Erasmus MC University Medical Center Rotterdam, Dr. Molewaterplein 50, 3015 GE, Rotterdam, The Netherlands.,Predictive Analytics and Comparative Effectiveness Center, Institute for Clinical Research and Health Policy Studies, Tufts Medical Center, Boston, USA
Theodoros P Zanos Institute of Bioelectronic Medicine, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, USA
Jason Nelson Predictive Analytics and Comparative Effectiveness Center, Institute for Clinical Research and Health Policy Studies, Tufts Medical Center, Boston, USA
Todd J Levy Institute of Bioelectronic Medicine, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, USA
Jinny G Park Predictive Analytics and Comparative Effectiveness Center, Institute for Clinical Research and Health Policy Studies, Tufts Medical Center, Boston, USA
Isabel R A Retel Helmrich Department of Public Health, Erasmus MC University Medical Center Rotterdam, Dr. Molewaterplein 50, 3015 GE, Rotterdam, The Netherlands
Judith A C Rietjens Department of Public Health, Erasmus MC University Medical Center Rotterdam, Dr. Molewaterplein 50, 3015 GE, Rotterdam, The Netherlands
Melissa J Basile Division of Pulmonary Critical Care and Sleep Medicine, Department of Medicine, Donald and Barbara Zucker School of Medicine at Hofstra/Northwell Health, Hempstead, NY, USA
Negin Hajizadeh Division of Pulmonary Critical Care and Sleep Medicine, Department of Medicine, Donald and Barbara Zucker School of Medicine at Hofstra/Northwell Health, Hempstead, NY, USA
Hester F Lingsma Department of Public Health, Erasmus MC University Medical Center Rotterdam, Dr. Molewaterplein 50, 3015 GE, Rotterdam, The Netherlands
David M Kent Predictive Analytics and Comparative Effectiveness Center, Institute for Clinical Research and Health Policy Studies, Tufts Medical Center, Boston, USA

Maleki F, Ovens K, Gupta R, Reinhold C, Spatz A, Forghani R. Generalizability of Machine Learning Models: Quantitative Evaluation of Three Methodological Pitfalls. Radiol Artif Intell 2022;5:e220028. [PMID: 36721408 PMCID: PMC9885377 DOI: 10.1148/ryai.220028] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Received: 02/13/2022] [Revised: 10/10/2022] [Accepted: 10/24/2022] [Indexed: 11/17/2022]

Abstract

Purpose

To investigate the impact of the following three methodological pitfalls on model generalizability: (a) violation of the independence assumption, (b) model evaluation with an inappropriate performance indicator or baseline for comparison, and (c) batch effect.

Materials and Methods

The authors used retrospective CT, histopathologic analysis, and radiography datasets to develop machine learning models with and without the three methodological pitfalls to quantitatively illustrate their effect on model performance and generalizability. F1 score was used to measure performance, and differences in performance between models developed with and without errors were assessed using the Wilcoxon rank sum test when applicable.

Results

Violation of the independence assumption by applying oversampling, feature selection, and data augmentation before splitting data into training, validation, and test sets seemingly improved model F1 scores by 71.2% for predicting local recurrence and 5.0% for predicting 3-year overall survival in head and neck cancer and by 46.0% for distinguishing histopathologic patterns in lung cancer. Randomly distributing data points for a patient across datasets superficially improved the F1 score by 21.8%. High model performance metrics did not indicate high-quality lung segmentation. In the presence of a batch effect, a model built for pneumonia detection had an F1 score of 98.7% but correctly classified only 3.86% of samples from a new dataset of healthy patients.

Conclusion

Machine learning models developed with these methodological pitfalls, which are undetectable during internal evaluation, produce inaccurate predictions; thus, understanding and avoiding these pitfalls is necessary for developing generalizable models.

Keywords: Random Forest, Diagnosis, Prognosis, Convolutional Neural Network (CNN), Medical Image Analysis, Generalizability, Machine Learning, Deep Learning, Model Evaluation

Supplemental material is available for this article. Published under a CC BY 4.0 license.
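One of the pitfalls described above, distributing a patient's data points across training and test sets, can be avoided by splitting on patient identifiers instead of on individual records. The sketch below shows one possible way to do this, together with a plain F1 computation. The record layout, function names, and split fraction are hypothetical, and this is not the authors' code.

```python
import random

def split_by_patient(records, test_fraction=0.25, seed=0):
    """Split records so that no patient appears in both train and test.

    `records` is a list of (patient_id, label) pairs; a patient may
    contribute several records (for example, several image slices).
    """
    patient_ids = sorted({pid for pid, _ in records})
    rng = random.Random(seed)
    rng.shuffle(patient_ids)
    n_test = max(1, round(len(patient_ids) * test_fraction))
    test_ids = set(patient_ids[:n_test])
    train = [r for r in records if r[0] not in test_ids]
    test = [r for r in records if r[0] in test_ids]
    return train, test

def f1_score(y_true, y_pred):
    """F1 for the positive class: harmonic mean of precision and recall."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

# Hypothetical dataset: 8 patients with 3 records each.
records = [(f"patient_{i}", i % 2) for i in range(8) for _ in range(3)]
train, test = split_by_patient(records)

train_ids = {pid for pid, _ in train}
test_ids = {pid for pid, _ in test}
print(len(train), len(test), train_ids.isdisjoint(test_ids))
```

The same ordering applies to the other independence violations listed in the Results: oversampling, feature selection, and data augmentation would be fitted on the training portion only, after a split like this one has been made.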
