TY - JOUR
T1 - Optimal design for correlated processes with input-dependent noise
AU - Boukouvalas, Alexis
AU - Cornford, D.
AU - Stehlík, M.
N1 - Funding Information:
The work on the paper was partially undertaken at the Isaac Newton Institute for Mathematical Sciences at Cambridge, UK, during the Design and Analysis of Experiments research programme hosted by the Institute. The 3rd author was partially supported by ANR project Desire. We also wish to thank Ian Vernon for useful discussions on the Prokaryotic Autoregulatory network used in Section 6 . This work was supported by EPSRC/RCUK as part of the MUCM Basic Technology project ( EP/D048893/1 ). We thank the editor and reviewers, whose insightful comments helped us to sharpen the paper considerably.
PY - 2014/3
Y1 - 2014/3
N2 - Optimal design for parameter estimation in Gaussian process regression models with input-dependent noise is examined. The motivation stems from the area of computer experiments, where computationally demanding simulators are approximated using Gaussian process emulators to act as statistical surrogates. In the case of stochastic simulators, which produce a random output for a given set of model inputs, repeated evaluations are useful, supporting the use of replicate observations in the experimental design. The findings are also applicable to the wider context of experimental design for Gaussian process regression and kriging. Designs are proposed with the aim of minimising the variance of the Gaussian process parameter estimates. A heteroscedastic Gaussian process model is presented which allows for an experimental design technique based on an extension of Fisher information to heteroscedastic models. It is empirically shown that the error of the approximation of the parameter variance by the inverse of the Fisher information is reduced as the number of replicated points is increased. Through a series of simulation experiments on both synthetic data and a systems biology stochastic simulator, optimal designs with replicate observations are shown to outperform space-filling designs both with and without replicate observations. Guidance is provided on best practice for optimal experimental design for stochastic response models.
AB - Optimal design for parameter estimation in Gaussian process regression models with input-dependent noise is examined. The motivation stems from the area of computer experiments, where computationally demanding simulators are approximated using Gaussian process emulators to act as statistical surrogates. In the case of stochastic simulators, which produce a random output for a given set of model inputs, repeated evaluations are useful, supporting the use of replicate observations in the experimental design. The findings are also applicable to the wider context of experimental design for Gaussian process regression and kriging. Designs are proposed with the aim of minimising the variance of the Gaussian process parameter estimates. A heteroscedastic Gaussian process model is presented which allows for an experimental design technique based on an extension of Fisher information to heteroscedastic models. It is empirically shown that the error of the approximation of the parameter variance by the inverse of the Fisher information is reduced as the number of replicated points is increased. Through a series of simulation experiments on both synthetic data and a systems biology stochastic simulator, optimal designs with replicate observations are shown to outperform space-filling designs both with and without replicate observations. Guidance is provided on best practice for optimal experimental design for stochastic response models.
KW - correlated observations
KW - emulation
KW - Gaussian process
KW - heteroscedastic noise
KW - optimal design of experiments
UR - http://www.scopus.com/inward/record.url?scp=84889097474&partnerID=8YFLogxK
U2 - 10.1016/j.csda.2013.09.024
DO - 10.1016/j.csda.2013.09.024
M3 - Article
AN - SCOPUS:84889097474
SN - 0167-9473
VL - 71
SP - 1088
EP - 1102
JO - Computational Statistics and Data Analysis
JF - Computational Statistics and Data Analysis
ER -