Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Efron B. Prediction, Estimation, and Attribution. Int Stat Rev 2020. [DOI: 10.1111/insr.12409] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Number

Cited by Other Article(s)

Sankaran K, Jeganathan P. mbtransfer: Microbiome intervention analysis using transfer functions and mirror statistics. PLoS Comput Biol 2024;20:e1012196. [PMID: 38875277 PMCID: PMC11210883 DOI: 10.1371/journal.pcbi.1012196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2024] [Revised: 06/27/2024] [Accepted: 05/27/2024] [Indexed: 06/16/2024] Open

Servius L, Pigoli D, Ng J, Fraternali F. Predicting class switch recombination in B-cells from antibody repertoire data. Biom J 2024;66:e2300171. [PMID: 38785212 DOI: 10.1002/bimj.202300171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 03/01/2024] [Accepted: 03/07/2024] [Indexed: 05/25/2024]

Zhu R, Luo W, Grieneisen ML, Zuoqiu S, Zhan Y, Yang F. A novel approach to deriving the fine-scale daily NO₂ dataset during 2005-2020 in China: Improving spatial resolution and temporal coverage to advance exposure assessment. ENVIRONMENTAL RESEARCH 2024;249:118381. [PMID: 38331142 DOI: 10.1016/j.envres.2024.118381] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/02/2023] [Revised: 01/22/2024] [Accepted: 01/30/2024] [Indexed: 02/10/2024]

Abstract

Surface NO2 pollution can result in serious health consequences such as cardiovascular disease, asthma, and premature mortality. Due to the extensive spatial variation in surface NO2, the spatial resolution of a NO2 dataset has a significant impact on the exposure and health impact assessment. There is currently no long-term, high-resolution, and publicly available NO2 dataset for China. To fill this gap, this study generated a NO2 dataset named RBE-DS-NO2 for China during 2005-2020 at 1 km and daily resolution. We employed the robust back-extrapolation via a data augmentation approach (RBE-DA) to ensure the predictive accuracy in back-extrapolation before 2013, and utilized an improved spatial downscaling technique (DS) to refine the spatial resolution from 10 km to 1 km. Back-extrapolation validation based on 2005-2012 observations from sites in Taiwan province yielded an R2 of 0.72 and RMSE of 10.7 μg/m3, while cross-validation across China during 2013-2020 showed an R2 of 0.73 and RMSE of 9.6 μg/m3. RBE-DS-NO2 better captured spatiotemporal variation of surface NO2 in China compared to the existing publicly available datasets. Exposure assessment using RBE-DS-NO2 show that the population living in non-attainment areas (NO2 ≥ 30 μg/m3) grew from 376 million in 2005 to 612 million in 2012, then declined to 404 million by 2020. Unlike this national trend, exposure levels in several major cities (e.g., Shanghai and Chengdu) continued to increase during 2012-2020, driven by population growth and urban migration. Furthermore, this study revealed that low-resolution dataset (i.e., the 10 km intermediate dataset before the downscaling) overestimated NO2 levels, due to the limited specificity of the low-resolution model in simulating the relationship between NO2 and the predictor variables. Such limited specificity likely biased previous long-term NO2 exposure and health impact studies employing low-resolution datasets. The RBE-DS-NO2 dataset enables robust long-term assessments of NO2 exposure and health impacts in China.

Collapse

Shen B, Coruzzi GM, Shasha D. Bipartite networks represent causality better than simple networks: evidence, algorithms, and applications. Front Genet 2024;15:1371607. [PMID: 38798697 PMCID: PMC11120958 DOI: 10.3389/fgene.2024.1371607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2024] [Accepted: 04/17/2024] [Indexed: 05/29/2024] Open

Abstract

A network, whose nodes are genes and whose directed edges represent positive or negative influences of a regulatory gene and its targets, is often used as a representation of causality. To infer a network, researchers often develop a machine learning model and then evaluate the model based on its match with experimentally verified "gold standard" edges. The desired result of such a model is a network that may extend the gold standard edges. Since networks are a form of visual representation, one can compare their utility with architectural or machine blueprints. Blueprints are clearly useful because they provide precise guidance to builders in construction. If the primary role of gene regulatory networks is to characterize causality, then such networks should be good tools of prediction because prediction is the actionable benefit of knowing causality. But are they? In this paper, we compare prediction quality based on "gold standard" regulatory edges from previous experimental work with non-linear models inferred from time series data across four different species. We show that the same non-linear machine learning models have better predictive performance, with improvements from 5.3% to 25.3% in terms of the reduction in the root mean square error (RMSE) compared with the same models based on the gold standard edges. Having established that networks fail to characterize causality properly, we suggest that causality research should focus on four goals: (i) predictive accuracy; (ii) a parsimonious enumeration of predictive regulatory genes for each target gene g; (iii) the identification of disjoint sets of predictive regulatory genes for each target g of roughly equal accuracy; and (iv) the construction of a bipartite network (whose node types are genes and models) representation of causality. We provide algorithms for all goals.

Collapse

Rafiee M, Jahangiri-Rad M, Mohseni-Bandpei A, Razmi E. Impacts of socioeconomic and environmental factors on neoplasms incidence rates using machine learning and GIS: a cross-sectional study in Iran. Sci Rep 2024;14:10604. [PMID: 38719879 PMCID: PMC11078954 DOI: 10.1038/s41598-024-61397-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Accepted: 05/06/2024] [Indexed: 05/12/2024] Open

Abstract

Neoplasm is an umbrella term used to describe either benign or malignant conditions. The correlations between socioeconomic and environmental factors and the occurrence of new-onset of neoplasms have already been demonstrated in a body of research. Nevertheless, few studies have specifically dealt with the nature of relationship, significance of risk factors, and geographic variation of them, particularly in low- and middle-income communities. This study, thus, set out to (1) analyze spatiotemporal variations of the age-adjusted incidence rate (AAIR) of neoplasms in Iran throughout five time periods, (2) investigate relationships between a collection of environmental and socioeconomic indicators and the AAIR of neoplasms all over the country, and (3) evaluate geographical alterations in their relative importance. Our cross-sectional study design was based on county-level data from 2010 to 2020. AAIR of neoplasms data was acquired from the Institute for Health Metrics and Evaluation (IHME). HotSpot analyses and Anselin Local Moran's I indices were deployed to precisely identify AAIR of neoplasms high- and low-risk clusters. Multi-scale geographically weight regression (MGWR) analysis was worked out to evaluate the association between each explanatory variable and the AAIR of neoplasms. Utilizing random forests (RF), we also examined the relationships between environmental (e.g., UV index and PM2.5 concentration) and socioeconomic (e.g., Gini coefficient and literacy rate) factors and AAIR of neoplasms. AAIR of neoplasms displayed a significant increasing trend over the study period. According to the MGWR, the only factor that significantly varied spatially and was associated with the AAIR of neoplasms in Iran was the UV index. A good accuracy RF model was confirmed for both training and testing data with correlation coefficients R2 greater than 0.91 and 0.92, respectively. UV index and Gini coefficient ranked the highest variables in the prediction of AAIR of neoplasms, based on the relative influence of each variable. More research using machine learning approaches taking the advantages of considering all possible determinants is required to assess health strategies outcomes and properly formulate policy planning.

Collapse

Chakraborty S, Guan Z, Begg CB, Shen R. Topical hidden genome: discovering latent cancer mutational topics using a Bayesian multilevel context-learning approach. Biometrics 2024;80:ujae030. [PMID: 38682463 PMCID: PMC11056772 DOI: 10.1093/biomtc/ujae030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2022] [Revised: 03/18/2024] [Accepted: 04/04/2024] [Indexed: 05/01/2024]

Huang B, Kong L, Wang C, Ju F, Zhang Q, Zhu J, Gong T, Zhang H, Yu C, Zheng WM, Bu D. Protein Structure Prediction: Challenges, Advances, and the Shift of Research Paradigms. GENOMICS, PROTEOMICS & BIOINFORMATICS 2023;21:913-925. [PMID: 37001856 PMCID: PMC10928435 DOI: 10.1016/j.gpb.2022.11.014] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/15/2022] [Revised: 11/23/2022] [Accepted: 11/30/2022] [Indexed: 03/31/2023]

Affiliation(s)

Bin Huang Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100049, China
Lupeng Kong Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; Changping Laboratory, Beijing 102206, China
Chao Wang Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
Fusong Ju Microsoft Research AI4Science, Beijing 100080, China
Qi Zhang Huawei Noah's Ark Lab, Wuhan 430206, China
Jianwei Zhu Microsoft Research AI4Science, Beijing 100080, China
Tiansu Gong Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100049, China
Haicang Zhang Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100049, China; Zhongke Big Data Academy, Zhengzhou 450046, China.
Chungong Yu Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100049, China; Zhongke Big Data Academy, Zhengzhou 450046, China.
Wei-Mou Zheng Institute of Theoretical Physics, Chinese Academy of Sciences, Beijing 100190, China.
Dongbo Bu Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100049, China; Zhongke Big Data Academy, Zhengzhou 450046, China.

Collapse

Ringwaldt EM, Brook BW, Buettel JC, Cunningham CX, Fuller C, Gardiner R, Hamer R, Jones M, Martin AM, Carver S. Host, environment, and anthropogenic factors drive landscape dynamics of an environmentally transmitted pathogen: Sarcoptic mange in the bare-nosed wombat. J Anim Ecol 2023;92:1786-1801. [PMID: 37221666 DOI: 10.1111/1365-2656.13960] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Accepted: 05/09/2023] [Indexed: 05/25/2023]

Abstract

Understanding the spatial dynamics and drivers of wildlife pathogens is constrained by sampling logistics, with implications for advancing the field of landscape epidemiology and targeted allocation of management resources. However, visually apparent wildlife diseases, when combined with remote-surveillance and distribution modelling technologies, present an opportunity to overcome this landscape-scale problem. Here, we investigated dynamics and drivers of landscape-scale wildlife disease, using clinical signs of sarcoptic mange (caused by Sarcoptes scabiei) in its bare-nosed wombat (BNW; Vombatus ursinus) host. We used 53,089 camera-trap observations from over 3261 locations across the 68,401 km2 area of Tasmania, Australia, combined with landscape data and ensemble species distribution modelling (SDM). We investigated: (1) landscape variables predicted to drive habitat suitability of the host; (2) host and landscape variables associated with clinical signs of disease in the host; and (3) predicted locations and environmental conditions at greatest risk of disease occurrence, including some Bass Strait islands where BNW translocations are proposed. We showed that the Tasmanian landscape, and ecosystems therein, are nearly ubiquitously suited to BNWs. Only high mean annual precipitation reduced habitat suitability for the host. In contrast, clinical signs of sarcoptic mange disease in BNWs were widespread, but heterogeneously distributed across the landscape. Mange (which is environmentally transmitted in BNWs) was most likely to be observed in areas of increased host habitat suitability, lower annual precipitation, near sources of freshwater and where topographic roughness was minimal (e.g. human modified landscapes, such as farmland and intensive land-use areas, shrub and grass lands). Thus, a confluence of host, environmental and anthropogenic variables appear to influence the risk of environmental transmission of S. scabiei. We identified that the Bass Strait Islands are highly suitable for BNWs and predicted a mix of high and low suitability for the pathogen. This study is the largest spatial assessment of sarcoptic mange in any host species, and advances understanding of the landscape epidemiology of environmentally transmitted S. scabiei. This research illustrates how host-pathogen co-suitability can be useful for allocating management resources in the landscape.

Collapse

Data driven contagion risk management in low-income countries using machine learning applications with COVID-19 in South Asia. Sci Rep 2023;13:3732. [PMID: 36878910 PMCID: PMC9987367 DOI: 10.1038/s41598-023-30348-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Accepted: 02/21/2023] [Indexed: 03/08/2023] Open

Candès E, Lei L, Ren Z. Conformalized survival analysis. J R Stat Soc Series B Stat Methodol 2023. [DOI: 10.1093/jrsssb/qkac004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]

Garrett KA, Bebber DP, Etherton BA, Gold KM, Plex Sulá AI, Selvaraj MG. Climate Change Effects on Pathogen Emergence: Artificial Intelligence to Translate Big Data for Mitigation. ANNUAL REVIEW OF PHYTOPATHOLOGY 2022;60:357-378. [PMID: 35650670 DOI: 10.1146/annurev-phyto-021021-042636] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]

Fokkema M, Iliescu D, Greiff S, Ziegler M. Machine Learning and Prediction in Psychological Assessment. EUROPEAN JOURNAL OF PSYCHOLOGICAL ASSESSMENT 2022. [DOI: 10.1027/1015-5759/a000714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Tan ZC, Murphy MC, Alpay HS, Taylor SD, Meyer AS. Tensor-structured decomposition improves systems serology analysis. Mol Syst Biol 2021;17:e10243. [PMID: 34487431 PMCID: PMC8420856 DOI: 10.15252/msb.202110243] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Revised: 08/12/2021] [Accepted: 08/16/2021] [Indexed: 01/04/2023] Open

Guerriero S, Pascual M, Ajossa S, Neri M, Musa E, Graupera B, Rodriguez I, Alcazar JL. Artificial intelligence (AI) in the detection of rectosigmoid deep endometriosis. Eur J Obstet Gynecol Reprod Biol 2021;261:29-33. [PMID: 33873085 DOI: 10.1016/j.ejogrb.2021.04.012] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2021] [Revised: 04/06/2021] [Accepted: 04/11/2021] [Indexed: 12/12/2022]

Abstract

OBJECTIVES

The aim of this study was to compare the accuracy of seven classical Machine Learning (ML) models trained with ultrasound (US) soft markers to raise suspicion of endometriotic bowel involvement.

MATERIALS AND METHODS

Input data to the models was retrieved from a database of a previously published study on bowel endometriosis performed on 333 patients. The following models have been tested: k-nearest neighbors algorithm (k-NN), Naive Bayes, Neural Networks (NNET-neuralnet), Support Vector Machine (SVM), Decision Tree, Random Forest, and Logistic Regression. The data driven strategy has been to split randomly the complete dataset in two different datasets. The training dataset and the test dataset with a 67 % and 33 % of the original cases respectively. All models were trained on the training dataset and the predictions have been evaluated using the test dataset. The best model was chosen based on the accuracy demonstrated on the test dataset. The information used in all the models were: age; presence of US signs of uterine adenomyosis; presence of an endometrioma; adhesions of the ovary to the uterus; presence of "kissing ovaries"; absence of sliding sign. All models have been trained using CARET package in R with ten repeated 10-fold cross-validation. Accuracy, Sensitivity, Specificity, positive (PPV) and negative (NPV) predictive value were calculated using a 50 % threshold. Presence of intestinal involvement was defined in all cases in the test dataset with an estimated probability greater than 0.5.

RESULTS

In our previous study from where the inputs were retrieved, 106 women had a final expert US diagnosis of rectosigmoid endometriosis. In term of diagnostic accuracy the best model was the Neural Net (Accuracy, 0.73; sensitivity, 0.72; specificity 0.73; PPV 0.52; and NPV 0.86) but without significant difference with the others.

CONCLUSIONS

The accuracy of ultrasound soft markers in raising suspicion of rectosigmoid endometriosis using Artificial Intelligence (AI) models showed similar results to the logistic model.

Collapse