Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Yu B, Kumbier K. Veridical data science. Proc Natl Acad Sci U S A 2020;117:3920-9. [PMID: 32054788 DOI: 10.1073/pnas.1901326117] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open

For:	Yu B, Kumbier K. Veridical data science. Proc Natl Acad Sci U S A 2020;117:3920-9. [PMID: 32054788 DOI: 10.1073/pnas.1901326117] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open

Number

Cited by Other Article(s)

Li J, Ionides EL, King AA, Pascual M, Ning N. Inference on spatiotemporal dynamics for coupled biological populations. J R Soc Interface 2024;21:20240217. [PMID: 38981516 DOI: 10.1098/rsif.2024.0217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2024] [Accepted: 06/07/2024] [Indexed: 07/11/2024] Open

Mandros P, Gallagher I, Fanfani V, Chen C, Fischer J, Ismail A, Hsu L, Saha E, DeConti DK, Quackenbush J. node2vec2rank: Large Scale and Stable Graph Differential Analysis via Multi-Layer Node Embeddings and Ranking. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.16.599201. [PMID: 38948759 PMCID: PMC11212899 DOI: 10.1101/2024.06.16.599201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/02/2024]

Haab B, Qian L, Staal B, Jain M, Fahrmann J, Worthington C, Prosser D, Velokokhatnaya L, Lopez C, Tang R, Hurd MW, Natarajan G, Kumar S, Smith L, Hanash S, Batra SK, Maitra A, Lokshin A, Huang Y, Brand RE. A Rigorous Multi-Laboratory Study of Known PDAC Biomarkers Identifies Increased Sensitivity and Specificity Over CA19-9 Alone. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.22.595399. [PMID: 38826212 PMCID: PMC11142185 DOI: 10.1101/2024.05.22.595399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2024]

Wang Q, Tang TM, Youlton N, Weldy CS, Kenney AM, Ronen O, Weston Hughes J, Chin ET, Sutton SC, Agarwal A, Li X, Behr M, Kumbier K, Moravec CS, Wilson Tang WH, Margulies KB, Cappola TP, Butte AJ, Arnaout R, Brown JB, Priest JR, Parikh VN, Yu B, Ashley EA. Epistasis regulates genetic control of cardiac hypertrophy. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2023.11.06.23297858. [PMID: 37987017 PMCID: PMC10659487 DOI: 10.1101/2023.11.06.23297858] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2023]

Behr M, Kumbier K, Cordova-Palomera A, Aguirre M, Ronen O, Ye C, Ashley E, Butte AJ, Arnaout R, Brown B, Priest J, Yu B. Learning epistatic polygenic phenotypes with Boolean interactions. PLoS One 2024;19:e0298906. [PMID: 38625909 PMCID: PMC11020961 DOI: 10.1371/journal.pone.0298906] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Accepted: 01/31/2024] [Indexed: 04/18/2024] Open

Abstract

Detecting epistatic drivers of human phenotypes is a considerable challenge. Traditional approaches use regression to sequentially test multiplicative interaction terms involving pairs of genetic variants. For higher-order interactions and genome-wide large-scale data, this strategy is computationally intractable. Moreover, multiplicative terms used in regression modeling may not capture the form of biological interactions. Building on the Predictability, Computability, Stability (PCS) framework, we introduce the epiTree pipeline to extract higher-order interactions from genomic data using tree-based models. The epiTree pipeline first selects a set of variants derived from tissue-specific estimates of gene expression. Next, it uses iterative random forests (iRF) to search training data for candidate Boolean interactions (pairwise and higher-order). We derive significance tests for interactions, based on a stabilized likelihood ratio test, by simulating Boolean tree-structured null (no epistasis) and alternative (epistasis) distributions on hold-out test data. Finally, our pipeline computes PCS epistasis p-values that probabilisticly quantify improvement in prediction accuracy via bootstrap sampling on the test set. We validate the epiTree pipeline in two case studies using data from the UK Biobank: predicting red hair and multiple sclerosis (MS). In the case of predicting red hair, epiTree recovers known epistatic interactions surrounding MC1R and novel interactions, representing non-linearities not captured by logistic regression models. In the case of predicting MS, a more complex phenotype than red hair, epiTree rankings prioritize novel interactions surrounding HLA-DRB1, a variant previously associated with MS in several populations. Taken together, these results highlight the potential for epiTree rankings to help reduce the design space for follow up experiments.

Collapse

Affiliation(s)

Merle Behr Faculty of Informatics and Data Science, University of Regensburg, Regensburg, Germany
Karl Kumbier Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, United States of America
Aldo Cordova-Palomera Department of Pediatrics, Stanford Medicine, Stanford, CA, United States of America
Matthew Aguirre Department of Pediatrics, Stanford Medicine, Stanford, CA, United States of America Department of Biomedical Data Science, Stanford Medicine, Stanford, CA, United States of America
Omer Ronen Department of Statistics, University of California at Berkeley, Berkeley, CA, United States of America
Chengzhong Ye Department of Statistics, University of California at Berkeley, Berkeley, CA, United States of America
Euan Ashley Division of Cardiovascular Medicine, Stanford Medicine, Stanford, CA, United States of America
Atul J. Butte Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA, United States of America
Rima Arnaout Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA, United States of America Division of Cardiology, Department of Medicine, University of California, San Francisco, San Francisco, CA, United States of America
Ben Brown Department of Statistics, University of California at Berkeley, Berkeley, CA, United States of America Biosciences Area, Lawrence Berkeley National Laboratory, Berkeley, CA, United States of America
James Priest Department of Pediatrics, Stanford Medicine, Stanford, CA, United States of America
Bin Yu Department of Statistics, University of California at Berkeley, Berkeley, CA, United States of America Department of Electrical Engineering and Computer Sciences and Center for Computational Biology, University of California at Berkeley, Berkeley, CA, United States of America

Collapse

Lasko TA, Strobl EV, Stead WW. Why do probabilistic clinical models fail to transport between sites. NPJ Digit Med 2024;7:53. [PMID: 38429353 PMCID: PMC10907678 DOI: 10.1038/s41746-024-01037-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 02/14/2024] [Indexed: 03/03/2024] Open

Cheng F, Wang F, Tang J, Zhou Y, Fu Z, Zhang P, Haines JL, Leverenz JB, Gan L, Hu J, Rosen-Zvi M, Pieper AA, Cummings J. Artificial intelligence and open science in discovery of disease-modifying medicines for Alzheimer's disease. Cell Rep Med 2024;5:101379. [PMID: 38382465 PMCID: PMC10897520 DOI: 10.1016/j.xcrm.2023.101379] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Revised: 08/15/2023] [Accepted: 12/19/2023] [Indexed: 02/23/2024]

Affiliation(s)

Feixiong Cheng Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA; Cleveland Clinic Genome Center, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA; Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH 44195, USA.
Fei Wang Department of Population Health Sciences, Weill Cornell Medical College, Cornell University, New York, NY 10065, USA
Jian Tang Mila-Quebec Institute for Learning Algorithms and CIFAR AI Research Chair, HEC Montreal, Montréal, QC H3T 2A7, Canada
Yadi Zhou Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
Zhimin Fu Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA; College of Pharmacy, Northeast Ohio Medical University, Rootstown, OH 44272, USA
Pengyue Zhang Department of Biostatistics and Health Data Science, Indiana University, Indianapolis, IN 46037, USA
Jonathan L Haines Cleveland Institute for Computational Biology, and Department of Population & Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH 44106, USA
James B Leverenz Lou Ruvo Center for Brain Health, Neurological Institute, Cleveland Clinic, Cleveland, OH 44195, USA
Li Gan Helen and Robert Appel Alzheimer's Disease Research Institute, Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY 10021, USA
Jianying Hu IBM Research, Yorktown Heights, New York, NY 10598, USA
Michal Rosen-Zvi AI for Accelerated Healthcare and Life Sciences Discovery, IBM Research Labs, Haifa 3498825, Israel; Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem 9190500, Israel
Andrew A Pieper Brain Health Medicines Center, Harrington Discovery Institute, University Hospitals Cleveland Medical Center, Cleveland, OH, 44106, USA; Department of Psychiatry, Case Western Reserve University, Cleveland, OH 44106, USA; Geriatric Psychiatry, GRECC, Louis Stokes Cleveland VA Medical Center, Cleveland, OH 44106, USA; Institute for Transformative Molecular Medicine, School of Medicine, Case Western Reserve University, Cleveland OH 44106, USA; Department of Pathology, Case Western Reserve University, School of Medicine, Cleveland, OH, 44106, USA; Department of Neurosciences, Case Western Reserve University, School of Medicine, Cleveland, OH 44106, USA
Jeffrey Cummings Chambers-Grundy Center for Transformative Neuroscience, Department of Brain Health, School of Integrated Health Sciences, UNLV, Las Vegas, NV 89154, USA

Collapse

Tognolini M, Lodola A, Giorgio C. Drug discovery: In silico dry data can bypass biological wet data? Br J Pharmacol 2024;181:340-344. [PMID: 37872106 DOI: 10.1111/bph.16266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 09/27/2023] [Accepted: 10/10/2023] [Indexed: 10/25/2023] Open

Zhang H, Liu S, Wang Y, Huang H, Sun L, Yuan Y, Cheng L, Liu X, Ning K. Deep learning enhanced the diagnostic merit of serum glycome for multiple cancers. iScience 2024;27:108715. [PMID: 38226168 PMCID: PMC10788220 DOI: 10.1016/j.isci.2023.108715] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Revised: 10/24/2023] [Accepted: 12/11/2023] [Indexed: 01/17/2024] Open

Affiliation(s)

Haobo Zhang Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center of AI Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, China
Si Liu Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center of AI Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, China Department of Epidemiology and Health Statistics, School of Public Health, Fujian Medical University, Fuzhou, Fujian, China
Yi Wang Department of Laboratory Medicine, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China
Hanhui Huang Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center of AI Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, China
Lukang Sun Department of Laboratory Medicine, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China
Youyuan Yuan Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center of AI Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, China
Liming Cheng Department of Laboratory Medicine, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China
Xin Liu Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center of AI Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, China
Kang Ning Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center of AI Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, China

Collapse

Hu ZT, Yu Y, Chen R, Yeh SJ, Chen B, Huang H. Large-Scale Information Retrieval and Correction of Noisy Pharmacogenomic Datasets through Residual Thresholded Deep Matrix Factorization. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.07.570723. [PMID: 38106027 PMCID: PMC10723412 DOI: 10.1101/2023.12.07.570723] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]

Frostig T, Benjamini Y, Kehat O, Weiss-Meilik A, Mandel D, Peleg B, Strauss Z, Mitelpunkt A. Developing a length of stay prediction model for newborns, achieving better accuracy with greater usability. Int J Med Inform 2023;180:105267. [PMID: 37918217 DOI: 10.1016/j.ijmedinf.2023.105267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Revised: 10/13/2023] [Accepted: 10/20/2023] [Indexed: 11/04/2023]

Abstract

BACKGROUND

One in ten newborn children is born prematurely. The elongated length of stay (LOS) of these children in the Neonatal Intensive Care Unit (NICU) has important implications on hospital occupancy figures, healthcare and management costs, as well as the psychology of parents. In order to allow accurate planning and resource allocation, this study aims to create a generalizable and robust model to predict the NICU LOS of preterm newborns.

METHODS

Data were collected from a large tertiary center NICU between 2011 and 2018 and relates to 5,362 newborns. The selected model was externally validated using a data set of 8,768 newborns from another tertiary center NICU. This report compares several models, such as Random Forest (RF), quantile RF, and other feature selection methods, including LASSO and AIC step-forward selection. In addition, a novel step-forward selection based on False Discovery Rate (FDR) for quantile regression is presented and evaluated.

RESULTS

A high-orderquantile regression model for predicting preterm newborns' LOS that uses only four features available at birth had more attractive properties than other richer ones. The model achieved a Mean Absolute Error (MAE) of 6.26 days on the internal validation set (average LOS 27.04) and an MAE of 6.04 days on the external validation set (average LOS 29.32). The suggested model surpassed the accuracy obtained by models in the literature. It is shown empirically that the FDR-based selection has better properties than the AIC-based step-forward selection approach.

CONCLUSION

This paper demonstrates a process to create a predictive model for NICU LOS in preterm newborns, where each step is reasoned. We obtain a simple and robust model for NICU LOS prediction, which achieves far better results than the current model used for financing NICUs. Utilizing this model, we have created an easy-to-use online web application to ease parents' worries and to assist NICU management: https://tzviel.shinyapps.io/calcuLOS.

Collapse

Hunter DJ, Holmes C. Where Medical Statistics Meets Artificial Intelligence. N Engl J Med 2023;389:1211-1219. [PMID: 37754286 DOI: 10.1056/nejmra2212850] [Citation(s) in RCA: 18] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 09/28/2023]

Irajizad E, Kenney A, Tang T, Vykoukal J, Wu R, Murage E, Dennison JB, Sans M, Long JP, Loftus M, Chabot JA, Kluger MD, Kastrinos F, Brais L, Babic A, Jajoo K, Lee LS, Clancy TE, Ng K, Bullock A, Genkinger JM, Maitra A, Do KA, Yu B, Wolpin BM, Hanash S, Fahrmann JF. A blood-based metabolomic signature predictive of risk for pancreatic cancer. Cell Rep Med 2023;4:101194. [PMID: 37729870 PMCID: PMC10518621 DOI: 10.1016/j.xcrm.2023.101194] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 12/20/2022] [Accepted: 08/21/2023] [Indexed: 09/22/2023]

Affiliation(s)

Ehsan Irajizad Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX, USA; Department of Clinical Cancer Prevention, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
Ana Kenney Department of Statistics, University of California, Berkeley, Berkeley, CA, USA
Tiffany Tang Department of Statistics, University of California, Berkeley, Berkeley, CA, USA
Jody Vykoukal Department of Clinical Cancer Prevention, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
Ranran Wu Department of Clinical Cancer Prevention, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
Eunice Murage Department of Clinical Cancer Prevention, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
Jennifer B Dennison Department of Clinical Cancer Prevention, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
Marta Sans Division of Gastroenterology, Hepatology and Endoscopy, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
James P Long Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
Maureen Loftus Dana-Farber Brigham and Women's Cancer Center, Division of Gastrointestinal Oncology, Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
John A Chabot Division of Digestive and Liver Diseases, Columbia University Irving Medical Cancer and the Vagelos College of Physicians and Surgeons, New York, NY, USA
Michael D Kluger Division of Digestive and Liver Diseases, Columbia University Irving Medical Cancer and the Vagelos College of Physicians and Surgeons, New York, NY, USA
Fay Kastrinos Division of Digestive and Liver Diseases, Columbia University Irving Medical Cancer and the Vagelos College of Physicians and Surgeons, New York, NY, USA; Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY, USA
Lauren Brais Dana-Farber Brigham and Women's Cancer Center, Division of Gastrointestinal Oncology, Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
Ana Babic Dana-Farber Brigham and Women's Cancer Center, Division of Gastrointestinal Oncology, Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
Kunal Jajoo Division of Gastroenterology, Hepatology and Endoscopy, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
Linda S Lee Division of Gastroenterology, Hepatology and Endoscopy, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
Thomas E Clancy Dana-Farber Brigham and Women's Cancer Center, Division of Surgical Oncology, Department of Surgery, Brigham and Women's Hospital, Harvard Medical School, Boston, MA USA
Kimmie Ng Dana-Farber Brigham and Women's Cancer Center, Division of Gastrointestinal Oncology, Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
Andrea Bullock Division of Hematology/Oncology, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA
Jeanine M Genkinger Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY, USA; Department of Epidemiology, Columbia Mailman School of Public Health, New York, NY, USA
Anirban Maitra Department of Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
Kim-Anh Do Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
Bin Yu Department of Statistics, University of California, Berkeley, Berkeley, CA, USA
Brian M Wolpin Dana-Farber Brigham and Women's Cancer Center, Division of Gastrointestinal Oncology, Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
Sam Hanash Department of Clinical Cancer Prevention, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.
Johannes F Fahrmann Department of Clinical Cancer Prevention, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.

Collapse

Landeros A, Xu J, Lange K. MM optimization: Proximal distance algorithms, path following, and trust regions. Proc Natl Acad Sci U S A 2023;120:e2303168120. [PMID: 37339185 PMCID: PMC10319036 DOI: 10.1073/pnas.2303168120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 05/09/2023] [Indexed: 06/22/2023] Open

Aw A, Jin LC, Ioannidis N, Song YS. The Impact of Stability Considerations on Genetic Fine-Mapping. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.11.536456. [PMID: 37090514 PMCID: PMC10120703 DOI: 10.1101/2023.04.11.536456] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/25/2023]

Sadybekov AV, Katritch V. Computational approaches streamlining drug discovery. Nature 2023;616:673-685. [PMID: 37100941 DOI: 10.1038/s41586-023-05905-z] [Citation(s) in RCA: 135] [Impact Index Per Article: 135.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2022] [Accepted: 03/01/2023] [Indexed: 04/28/2023]

Broderick T, Gelman A, Meager R, Smith AL, Zheng T. Toward a taxonomy of trust for probabilistic machine learning. SCIENCE ADVANCES 2023;9:eabn3999. [PMID: 36791188 PMCID: PMC9931201 DOI: 10.1126/sciadv.abn3999] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Accepted: 01/13/2023] [Indexed: 06/18/2023]

Marmolejo‐Ramos F, Tejo M, Brabec M, Kuzilek J, Joksimovic S, Kovanovic V, González J, Kneib T, Bühlmann P, Kook L, Briseño‐Sánchez G, Ospina R. Distributional regression modeling via generalized additive models for location, scale, and shape: An overview through a data set from learning analytics. WILEY INTERDISCIPLINARY REVIEWS. DATA MINING AND KNOWLEDGE DISCOVERY 2023;13:e1479. [PMID: 37502671 PMCID: PMC10369920 DOI: 10.1002/widm.1479] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Revised: 06/11/2022] [Accepted: 10/05/2022] [Indexed: 07/29/2023]

De Paolis Kaluza MC, Jain S, Radivojac P. An Approach to Identifying and Quantifying Bias in Biomedical Data. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2023;28:311-322. [PMID: 36540987 PMCID: PMC9782737] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]

Marmolejo-Ramos F, Ospina R, García-Ceja E, Correa JC. Ingredients for Responsible Machine Learning: A Commented Review of The Hitchhiker’s Guide to Responsible Machine Learning. JOURNAL OF STATISTICAL THEORY AND APPLICATIONS 2022;21:175-185. [PMID: 36160758 PMCID: PMC9483296 DOI: 10.1007/s44199-022-00048-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Accepted: 09/02/2022] [Indexed: 11/25/2022] Open

Kornblith AE, Singh C, Devlin G, Addo N, Streck CJ, Holmes JF, Kuppermann N, Grupp-Phelan J, Fineman J, Butte AJ, Yu B. Predictability and stability testing to assess clinical decision instrument performance for children after blunt torso trauma. PLOS DIGITAL HEALTH 2022;1:e0000076. [PMID: 36812570 PMCID: PMC9931266 DOI: 10.1371/journal.pdig.0000076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Accepted: 06/14/2022] [Indexed: 11/18/2022]

Abstract

OBJECTIVE

The Pediatric Emergency Care Applied Research Network (PECARN) has developed a clinical-decision instrument (CDI) to identify children at very low risk of intra-abdominal injury. However, the CDI has not been externally validated. We sought to vet the PECARN CDI with the Predictability Computability Stability (PCS) data science framework, potentially increasing its chance of a successful external validation.

MATERIALS & METHODS

We performed a secondary analysis of two prospectively collected datasets: PECARN (12,044 children from 20 emergency departments) and an independent external validation dataset from the Pediatric Surgical Research Collaborative (PedSRC; 2,188 children from 14 emergency departments). We used PCS to reanalyze the original PECARN CDI along with new interpretable PCS CDIs developed using the PECARN dataset. External validation was then measured on the PedSRC dataset.

RESULTS

Three predictor variables (abdominal wall trauma, Glasgow Coma Scale Score <14, and abdominal tenderness) were found to be stable. A CDI using only these three variables would achieve lower sensitivity than the original PECARN CDI with seven variables on internal PECARN validation but achieve the same performance on external PedSRC validation (sensitivity 96.8% and specificity 44%). Using only these variables, we developed a PCS CDI which had a lower sensitivity than the original PECARN CDI on internal PECARN validation but performed the same on external PedSRC validation (sensitivity 96.8% and specificity 44%).

CONCLUSION

The PCS data science framework vetted the PECARN CDI and its constituent predictor variables prior to external validation. We found that the 3 stable predictor variables represented all of the PECARN CDI's predictive performance on independent external validation. The PCS framework offers a less resource-intensive method than prospective validation to vet CDIs before external validation. We also found that the PECARN CDI will generalize well to new populations and should be prospectively externally validated. The PCS framework offers a potential strategy to increase the chance of a successful (costly) prospective validation.

Collapse

Affiliation(s)

Aaron E. Kornblith Department of Emergency Medicine, University of California, San Francisco, San Francisco, United States of America Department of Pediatrics, University of California, San Francisco, San Francisco, United States of America
Chandan Singh Department of Electrical Engineering & Computer Science, University of California, Berkeley, Berkeley, United States of America
Gabriel Devlin Department of Pediatrics, University of California, San Francisco, San Francisco, United States of America
Newton Addo Department of Emergency Medicine, University of California, San Francisco, San Francisco, United States of America
Christian J. Streck Department of Surgery, Medical University of South Carolina, Children’s Hospital, Charleston, United States of America
James F. Holmes Department of Emergency Medicine, University of California, Davis, Davis, United States of America
Nathan Kuppermann Department of Emergency Medicine, University of California, Davis, Davis, United States of America Department of Pediatrics, University of California, Davis, Davis, United States of America
Jacqueline Grupp-Phelan Department of Emergency Medicine, University of California, San Francisco, San Francisco, United States of America Department of Pediatrics, University of California, San Francisco, San Francisco, United States of America
Jeffrey Fineman Department of Pediatrics, University of California, San Francisco, San Francisco, United States of America
Atul J. Butte Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, United States of America
Bin Yu Department of Electrical Engineering & Computer Science, University of California, Berkeley, Berkeley, United States of America Departments of Statistics, University of California, Berkeley, Berkeley, United States of America * E-mail:

Collapse

Trella AL, Zhang KW, Nahum-Shani I, Shetty V, Doshi-Velez F, Murphy SA. Designing Reinforcement Learning Algorithms for Digital Interventions: Pre-Implementation Guidelines. ALGORITHMS 2022;15:255. [PMID: 36713810 PMCID: PMC9881427 DOI: 10.3390/a15080255] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]

Lu JH, Callahan A, Patel BS, Morse KE, Dash D, Pfeffer MA, Shah NH. Assessment of Adherence to Reporting Guidelines by Commonly Used Clinical Prediction Models From a Single Vendor: A Systematic Review. JAMA Netw Open 2022;5:e2227779. [PMID: 35984654 PMCID: PMC9391954 DOI: 10.1001/jamanetworkopen.2022.27779] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open

Abstract

IMPORTANCE

Various model reporting guidelines have been proposed to ensure clinical prediction models are reliable and fair. However, no consensus exists about which model details are essential to report, and commonalities and differences among reporting guidelines have not been characterized. Furthermore, how well documentation of deployed models adheres to these guidelines has not been studied.

OBJECTIVES

To assess information requested by model reporting guidelines and whether the documentation for commonly used machine learning models developed by a single vendor provides the information requested.

EVIDENCE REVIEW

MEDLINE was queried using machine learning model card and reporting machine learning from November 4 to December 6, 2020. References were reviewed to find additional publications, and publications without specific reporting recommendations were excluded. Similar elements requested for reporting were merged into representative items. Four independent reviewers and 1 adjudicator assessed how often documentation for the most commonly used models developed by a single vendor reported the items.

FINDINGS

From 15 model reporting guidelines, 220 unique items were identified that represented the collective reporting requirements. Although 12 items were commonly requested (requested by 10 or more guidelines), 77 items were requested by just 1 guideline. Documentation for 12 commonly used models from a single vendor reported a median of 39% (IQR, 37%-43%; range, 31%-47%) of items from the collective reporting requirements. Many of the commonly requested items had 100% reporting rates, including items concerning outcome definition, area under the receiver operating characteristics curve, internal validation, and intended clinical use. Several items reported half the time or less related to reliability, such as external validation, uncertainty measures, and strategy for handling missing data. Other frequently unreported items related to fairness (summary statistics and subgroup analyses, including for race and ethnicity or sex).

CONCLUSIONS AND RELEVANCE

These findings suggest that consistent reporting recommendations for clinical predictive models are needed for model developers to share necessary information for model deployment. The many published guidelines would, collectively, require reporting more than 200 items. Model documentation from 1 vendor reported the most commonly requested items from model reporting guidelines. However, areas for improvement were identified in reporting items related to model reliability and fairness. This analysis led to feedback to the vendor, which motivated updates to the documentation for future users.

Collapse

Provable Boolean interaction recovery from tree ensemble obtained via random forests. Proc Natl Acad Sci U S A 2022;119:e2118636119. [PMID: 35609192 PMCID: PMC9295780 DOI: 10.1073/pnas.2118636119] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Nicholson G, Blangiardo M, Briers M, Diggle PJ, Fjelde TE, Ge H, Goudie RJB, Jersakova R, King RE, Lehmann BCL, Mallon AM, Padellini T, Teh YW, Holmes C, Richardson S. Interoperability of statistical models in pandemic preparedness: principles and reality. Stat Sci 2022;37:183-206. [PMID: 35664221 PMCID: PMC7612804 DOI: 10.1214/22-sts854] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Navigating the pitfalls of applying machine learning in genomics. Nat Rev Genet 2022;23:169-181. [PMID: 34837041 DOI: 10.1038/s41576-021-00434-9] [Citation(s) in RCA: 66] [Impact Index Per Article: 33.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/28/2021] [Indexed: 11/08/2022]

A New Method to Compare the Interpretability of Rule-Based Algorithms. AI 2021. [DOI: 10.3390/ai2040037] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open

Pfister N, Williams EG, Peters J, Aebersold R, Bühlmann P. Stabilizing variable selection and regression. Ann Appl Stat 2021. [DOI: 10.1214/21-aoas1487] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Wu Y, Di B, Luo Y, Grieneisen ML, Zeng W, Zhang S, Deng X, Tang Y, Shi G, Yang F, Zhan Y. A robust approach to deriving long-term daily surface NO₂ levels across China: Correction to substantial estimation bias in back-extrapolation. ENVIRONMENT INTERNATIONAL 2021;154:106576. [PMID: 33901976 DOI: 10.1016/j.envint.2021.106576] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/05/2020] [Revised: 04/09/2021] [Accepted: 04/09/2021] [Indexed: 06/12/2023]

Abstract

BACKGROUND

Long-term surface NO₂ data are essential for retrospective policy evaluation and chronic human exposure assessment. In the absence of NO₂ observations for Mainland China before 2013, training a model with 2013-2018 data to make predictions for 2005-2012 (back-extrapolation) could cause substantial estimation bias due to concept drift.

OBJECTIVE

This study aims to correct the estimation bias in order to reconstruct the spatiotemporal distribution of daily surface NO₂ levels across China during 2005-2018.

METHODS

On the basis of ground- and satellite-based data, we proposed the robust back-extrapolation with a random forest (RBE-RF) to simulate the surface NO₂ through intermediate modeling of the scaling factors. For comparison purposes, we also employed a random forest (Base-RF), as a representative of the commonly used approach, to directly model the surface NO₂ levels.

RESULTS

The validation against Taiwan's NO₂ observations during 2005-2012 showed that RBE-RF adequately corrected the substantial underestimation by Base-RF. The RMSE decreased from 10.1 to 8.2 µg/m³, 7.1 to 4.3 µg/m³, and 6.1 to 2.9 µg/m³ in predicting daily, monthly, and annual levels, respectively. For North China with the most severe pollution, the population-weighted NO₂ ([NO₂]_pw) during 2005-2012 was estimated as 40.2 and 50.9 µg/m³ by Base-RF and RBE-RF, respectively, i.e., 21.0% difference. While both models predicted that the national annual [NO₂]_pw increased during 2005-2011 and then decreased, the interannual trends were underestimated by >50.2% by Base-RF relative to RBE-RF. During 2005-2018, the nationwide population that lived in the areas with NO₂ > 40 µg/m³ were estimated as 259 and 460 million by Base-RF and RBE-RF, respectively.

CONCLUSION

With RBE-RF, we corrected the estimation bias in back-extrapolation and obtained a full-coverage dataset of daily surface NO₂ across China during 2005-2018, which is valuable for environmental management and epidemiological research.

Collapse

Affiliation(s)

Yangyang Wu Department of Environmental Science and Engineering, Sichuan University, Chengdu, Sichuan 610065, China
Baofeng Di Department of Environmental Science and Engineering, Sichuan University, Chengdu, Sichuan 610065, China; Institute for Disaster Management and Reconstruction, Sichuan University, Chengdu, Sichuan 610200, China
Yuzhou Luo Department of Land, Air, and Water Resources, University of California, Davis, CA 95616, United States
Michael L Grieneisen Department of Land, Air, and Water Resources, University of California, Davis, CA 95616, United States
Wen Zeng Department of Environmental Science and Engineering, Sichuan University, Chengdu, Sichuan 610065, China
Shifu Zhang Department of Environmental Science and Engineering, Sichuan University, Chengdu, Sichuan 610065, China
Xunfei Deng Institute of Digital Agriculture, Zhejiang Academy of Agricultural Sciences, Hangzhou, Zhejiang 310021, China
Yulei Tang Department of Environmental Science and Engineering, Sichuan University, Chengdu, Sichuan 610065, China; Natural Resources Comprehensive Survey Command Center, China Geological Survey, Beijing 100055, China
Guangming Shi Department of Environmental Science and Engineering, Sichuan University, Chengdu, Sichuan 610065, China; National Engineering Research Center for Flue Gas Desulfurization, Chengdu, Sichuan 610065, China
Fumo Yang Department of Environmental Science and Engineering, Sichuan University, Chengdu, Sichuan 610065, China; National Engineering Research Center for Flue Gas Desulfurization, Chengdu, Sichuan 610065, China
Yu Zhan Department of Environmental Science and Engineering, Sichuan University, Chengdu, Sichuan 610065, China; National Engineering Research Center for Flue Gas Desulfurization, Chengdu, Sichuan 610065, China; Yibin Institute of Industrial Technology, Sichuan University Yibin Park, Yibin 644000, China.

Collapse

The accuracy versus interpretability trade-off in fraud detection model. DATA & POLICY 2021. [DOI: 10.1017/dap.2021.3] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open

Xing X, Zhao Z, Liu JS. Controlling False Discovery Rate Using Gaussian Mirrors. J Am Stat Assoc 2021. [DOI: 10.1080/01621459.2021.1923510] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Mo W, Qi Z, Liu Y. Rejoinder: Learning Optimal Distributionally Robust Individualized Treatment Rules. J Am Stat Assoc 2021;116:699-707. [PMID: 34177008 PMCID: PMC8221610 DOI: 10.1080/01621459.2020.1866581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2020] [Accepted: 12/12/2020] [Indexed: 10/21/2022]

Knowledge Management for Sustainable Development in the Era of Continuously Accelerating Technological Revolutions: A Framework and Models. SUSTAINABILITY 2021. [DOI: 10.3390/su13063353] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Principles for data analysis workflows. PLoS Comput Biol 2021;17:e1008770. [PMID: 33735208 PMCID: PMC7971542 DOI: 10.1371/journal.pcbi.1008770] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open

Dimitriadis T, Gneiting T, Jordan AI. Stable reliability diagrams for probabilistic classifiers. Proc Natl Acad Sci U S A 2021;118:e2016191118. [PMID: 33597296 PMCID: PMC7923594 DOI: 10.1073/pnas.2016191118] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Ward OG, Huang Z, Davison A, Zheng T. Next waves in veridical network embedding*. Stat Anal Data Min 2021. [DOI: 10.1002/sam.11486] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Rothenhäusler D, Meinshausen N, Bühlmann P, Peters J. Anchor regression: Heterogeneous data meet causality. J R Stat Soc Series B Stat Methodol 2021. [DOI: 10.1111/rssb.12398] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Yu B. Independence and Diversity as Taught by My Mentors. LEADERSHIP IN STATISTICS AND DATA SCIENCE 2021:341-348. [DOI: 10.1007/978-3-030-60060-0_23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]

Candès E, Sabatti C. Discussion of the Paper “Prediction, Estimation, and Attribution” by B. Efron. Int Stat Rev 2020. [DOI: 10.1111/insr.12412] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Yu B, Barter R. The Data Science Process: One Culture. Int Stat Rev 2020. [DOI: 10.1111/insr.12416] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Dwivedi R, Tan YS, Park B, Wei M, Horgan K, Madigan D, Yu B. Stable Discovery of Interpretable Subgroups via Calibration in Causal Studies. Int Stat Rev 2020. [DOI: 10.1111/insr.12427] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

Hur C, Wi J, Kim Y. Facilitating the Development of Deep Learning Models with Visual Analytics for Electronic Health Records. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020;17:E8303. [PMID: 33182703 PMCID: PMC7697823 DOI: 10.3390/ijerph17228303] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Revised: 10/27/2020] [Accepted: 11/04/2020] [Indexed: 11/24/2022]

Bühlmann P, Ćevid D. Deconfounding and Causal Regularisation for Stability and External Validity. Int Stat Rev 2020. [DOI: 10.1111/insr.12426] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Veridical Causal Inference using Propensity Score Methods for Comparative Effectiveness Research with Medical Claims. HEALTH SERVICES AND OUTCOMES RESEARCH METHODOLOGY 2020;21:206-228. [PMID: 34040495 DOI: 10.1007/s10742-020-00222-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]

Toward causality and improving external validity. Proc Natl Acad Sci U S A 2020;117:25963-25965. [PMID: 33046646 PMCID: PMC7584988 DOI: 10.1073/pnas.2018002117] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open

Yu B, Barter R. The Data Science Process: One Culture. J Am Stat Assoc 2020. [DOI: 10.1080/01621459.2020.1762615] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]

Candès E, Sabatti C. Discussion of the Paper “Prediction, Estimation, and Attribution” by B. Efron. J Am Stat Assoc 2020. [DOI: 10.1080/01621459.2020.1762618] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]

QnAs with Bin Yu. Proc Natl Acad Sci U S A 2020;117:3893-3894. [DOI: 10.1073/pnas.2001302117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open