Taylor & Francis Group
Browse
ucgs_a_1704293_sm7925.zip (1.44 MB)

A Fused Gaussian Process Model for Very Large Spatial Data

Download (1.44 MB)
Version 3 2021-09-29, 16:19
Version 2 2020-02-05, 19:46
Version 1 2020-01-15, 16:46
dataset
posted on 2021-09-29, 16:19 authored by Pulong Ma, Emily L. Kang

With the development of new remote sensing technology, large or even massive spatial datasets covering the globe become available. Statistical analysis of such data is challenging. This article proposes a semiparametric approach to model large or massive spatial datasets. In particular, a Gaussian process with additive components is proposed, with its covariance structure consisting of two components: one component is flexible without assuming a specific parametric covariance function but is able to achieve dimension reduction; the other is parametric and simultaneously induces sparsity. The inference algorithm for parameter estimation and spatial prediction is devised. The resulting spatial prediction methodology that we call fused Gaussian process (FGP), is applied to simulated data and a massive satellite dataset. The results demonstrate the computational and inferential benefits of FGP over competing methods and show that FGP is robust against model misspecification and captures spatial nonstationarity. Supplementary materials for this article are available online.

Funding

This work was supported in part by an allocation of computing time from the Ohio Supercomputer Center. This research was part of Ma’s Ph.D. dissertation supported by the Charles Phelps Taft Dissertation Fellowship at the University of Cincinnati. Ma’s research was partially supported by the National Science Foundation under grant DMS-1638521 to the Statistical and Applied Mathematical Sciences Institute. Kang’s research was supported by the Simons Foundation’s Collaboration Award (#317298), a NASA ROSES grant NNH18ZDA001N-SLSCVC, and the Taft Research Center at the University of Cincinnati.

History