Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Datta A, Banerjee S, Finley AO, Gelfand AE. On nearest-neighbor Gaussian process models for massive spatial data. ACTA ACUST UNITED AC 2016;8:162-171. [PMID: 29657666 DOI: 10.1002/wics.1383] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

For:	Datta A, Banerjee S, Finley AO, Gelfand AE. On nearest-neighbor Gaussian process models for massive spatial data. ACTA ACUST UNITED AC 2016;8:162-171. [PMID: 29657666 DOI: 10.1002/wics.1383] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Number

Cited by Other Article(s)

Ver Hoef JM, Dumelle M, Higham M, Peterson EE, Isaak DJ. Indexing and partitioning the spatial linear model for large data sets. PLoS One 2023;18:e0291906. [PMID: 37910525 PMCID: PMC10619847 DOI: 10.1371/journal.pone.0291906] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Accepted: 09/07/2023] [Indexed: 11/03/2023] Open

Abstract

We consider four main goals when fitting spatial linear models: 1) estimating covariance parameters, 2) estimating fixed effects, 3) kriging (making point predictions), and 4) block-kriging (predicting the average value over a region). Each of these goals can present different challenges when analyzing large spatial data sets. Current research uses a variety of methods, including spatial basis functions (reduced rank), covariance tapering, etc, to achieve these goals. However, spatial indexing, which is very similar to composite likelihood, offers some advantages. We develop a simple framework for all four goals listed above by using indexing to create a block covariance structure and nearest-neighbor predictions while maintaining a coherent linear model. We show exact inference for fixed effects under this block covariance construction. Spatial indexing is very fast, and simulations are used to validate methods and compare to another popular method. We study various sample designs for indexing and our simulations showed that indexing leading to spatially compact partitions are best over a range of sample sizes, autocorrelation values, and generating processes. Partitions can be kept small, on the order of 50 samples per partition. We use nearest-neighbors for kriging and block kriging, finding that 50 nearest-neighbors is sufficient. In all cases, confidence intervals for fixed effects, and prediction intervals for (block) kriging, have appropriate coverage. Some advantages of spatial indexing are that it is available for any valid covariance matrix, can take advantage of parallel computing, and easily extends to non-Euclidean topologies, such as stream networks. We use stream networks to show how spatial indexing can achieve all four goals, listed above, for very large data sets, in a matter of minutes, rather than days, for an example data set.

Collapse

Mukerjee R. Improving upon the effective sample size based on Godambe information for block likelihood inference. Comput Stat 2023. [DOI: 10.1007/s00180-023-01328-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/12/2023]

Saha A, Datta A, Banerjee S. Scalable Predictions for Spatial Probit Linear Mixed Models Using Nearest Neighbor Gaussian Processes. JOURNAL OF DATA SCIENCE : JDS 2022;20:533-544. [PMID: 37786782 PMCID: PMC10544813 DOI: 10.6339/22-jds1073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 10/04/2023]

Moran KR, Wheeler MW. Fast increased fidelity samplers for approximate Bayesian Gaussian process regression. J R Stat Soc Series B Stat Methodol 2022;84:1198-1228. [PMID: 36570797 PMCID: PMC9770094 DOI: 10.1111/rssb.12494] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]

Davies TM, Banerjee S, Martin AP, Turnbull RE. A nearest‐neighbour Gaussian process spatial factor model for censored, multi‐depth geochemical data. J R Stat Soc Ser C Appl Stat 2022. [DOI: 10.1111/rssc.12565] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]

A bioinformatic analysis of WFDC2 (HE4) expression in high grade serous ovarian cancer reveals tumor-specific changes in metabolic and extracellular matrix gene expression. Med Oncol 2022;39:71. [PMID: 35568777 PMCID: PMC9107348 DOI: 10.1007/s12032-022-01665-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Accepted: 01/22/2022] [Indexed: 10/31/2022]

Abstract

Human epididymis protein-4 (HE4/WFDC2) has been well-studied as an ovarian cancer clinical biomarker. To improve our understanding of its functional role in high grade serous ovarian cancer, we determined transcriptomic differences between ovarian tumors with high- versus low-WFDC2 mRNA levels in The Cancer Genome Atlas dataset. High-WFDC2 transcript levels were significantly associated with reduced survival in stage III/IV serous ovarian cancer patients. Differential expression and correlation analyses revealed secretory leukocyte peptidase inhibitor (SLPI/WFDC4) as the gene most positively correlated with WFDC2, while A kinase anchor protein-12 was most negatively correlated. WFDC2 and SLPI were strongly correlated across many cancers. Gene ontology analysis revealed enrichment of oxidative phosphorylation in differentially expressed genes associated with high-WFDC2 levels, while extracellular matrix organization was enriched among genes associated with low-WFDC2 levels. Immune cell subsets found to be positively correlated with WFDC2 levels were B cells and plasmacytoid dendritic cells, while neutrophils and endothelial cells were negatively correlated with WFDC2. Results were compared with DepMap cell culture gene expression data. Gene ontology analysis of k-means clustering revealed that genes associated with low-WFDC2 were also enriched in extracellular matrix and adhesion categories, while high-WFDC2 genes were enriched in epithelial cell proliferation and peptidase activity. These results support previous findings regarding the effect of HE4/WFDC2 on ovarian cancer pathogenesis in cell lines and mouse models, while adding another layer of complexity to its potential functions in ovarian tumor tissue. Further experimental explorations of these findings in the context of the tumor microenvironment are merited.

Collapse

Saha A, Basu S, Datta A. Random Forests for Spatially Dependent Data. J Am Stat Assoc 2021. [DOI: 10.1080/01621459.2021.1950003] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]

Chen J, Stein ML. Linear-Cost Covariance Functions for Gaussian Random Fields. J Am Stat Assoc 2021. [DOI: 10.1080/01621459.2021.1919122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Katzfuss M, Guinness J. A General Framework for Vecchia Approximations of Gaussian Processes. Stat Sci 2021. [DOI: 10.1214/19-sts755] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Liu H, Ong YS, Shen X, Cai J. When Gaussian Process Meets Big Data: A Review of Scalable GPs. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2020;31:4405-4423. [PMID: 31944966 DOI: 10.1109/tnnls.2019.2957109] [Citation(s) in RCA: 79] [Impact Index Per Article: 19.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]

Davis BJK, Curriero FC. Development and Evaluation of Geostatistical Methods for Non-Euclidean-Based Spatial Covariance Matrices. MATHEMATICAL GEOSCIENCES 2019;51:767-791. [PMID: 31827631 PMCID: PMC6905632 DOI: 10.1007/s11004-019-09791-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/16/2018] [Accepted: 02/23/2019] [Indexed: 06/01/2023]

Gelfand AE, Shirota S. Preferential sampling for presence/absence data and for fusion of presence/absence data with presence‐only data. ECOL MONOGR 2019. [DOI: 10.1002/ecm.1372] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Finley AO, Datta A, Cook BC, Morton DC, Andersen HE, Banerjee S. Efficient algorithms for Bayesian Nearest Neighbor Gaussian Processes. J Comput Graph Stat 2019;28:401-414. [PMID: 31543693 PMCID: PMC6753955 DOI: 10.1080/10618600.2018.1537924] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2017] [Revised: 07/18/2018] [Accepted: 10/09/2018] [Indexed: 10/27/2022]

Taylor-Rodriguez D, Finley AO, Datta A, Babcock C, Andersen HE, Cook BD, Morton DC, Banerjee S. Spatial Factor Models for High-Dimensional and Large Spatial Data: An Application in Forest Variable Mapping. Stat Sin 2019;29:1155-1180. [PMID: 33311955 PMCID: PMC7731981 DOI: 10.5705/ss.202018.0005] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Heaton MJ, Datta A, Finley AO, Furrer R, Guinness J, Guhaniyogi R, Gerber F, Gramacy RB, Hammerling D, Katzfuss M, Lindgren F, Nychka DW, Sun F, Zammit-Mangion A. A Case Study Competition Among Methods for Analyzing Large Spatial Data. JOURNAL OF AGRICULTURAL, BIOLOGICAL, AND ENVIRONMENTAL STATISTICS 2018;24:398-425. [PMID: 31496633 PMCID: PMC6709111 DOI: 10.1007/s13253-018-00348-w] [Citation(s) in RCA: 91] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/30/2018] [Accepted: 12/05/2018] [Indexed: 10/27/2022]

Guinness J. Permutation and Grouping Methods for Sharpening Gaussian Process Approximations. Technometrics 2018;60:415-429. [PMID: 31447491 DOI: 10.1080/00401706.2018.1437476] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

Gelfand AE, Banerjee S. Bayesian Modeling and Analysis of Geostatistical Data. ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION 2017;4:245-266. [PMID: 29392155 PMCID: PMC5790124 DOI: 10.1146/annurev-statistics-060116-054155] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]