TY - JOUR
T1 - Modeling depth of the redox interface at high resolution at national scale using random forest and residual Gaussian simulation
AU - Koch, Julian
AU - Stisen, Simon
AU - Refsgaard, Jens C.
AU - Ernstsen, Vibeke
AU - Jakobsen, Peter R.
AU - Højberg, Anker L.
N1 - Publisher Copyright:
© 2019. American Geophysical Union.
PY - 2019/1/1
Y1 - 2019/1/1
N2 - The management of water resources needs robust methods to efficiently reduce nitrate loads. Knowledge on where natural denitrification takes place in the subsurface is thereby essential. Nitrate is naturally reduced in anoxic environments and high-resolution information of the redox interface, that is, the depth of the uppermost reduced zone is crucial to understand the variability of the denitrification potential. In this study we explore the opportunity to use random forest (RF) regression to model redox depth across Denmark at 100-m resolution based on ~13,000 boreholes as training data. We highlight the importance of expert knowledge to guide the RF model in areas where our conceptual understanding is not represented correctly in the training data set by addition of artificial observations. We apply random forest regression kriging in which sequential Gaussian simulation models the RF residuals. The RF model reaches a R
2 score of 0.48 for an independent validation test. Including sequential Gaussian simulation honors observations through local conditioning, and the spread of 800 realizations can be utilized to map uncertainty. Emphasis is put on adequate handling of nonstationarities in variance and spatial correlation of the RF residuals. The RF residuals show no spatial correlation for large parts of the modeling domain, and a local variance scaling method is applied to account for the nonstationary variance. Moreover, we present and exemplify a framework where newly acquired field data can easily be integrated into random forest regression kriging to quickly update local models.
AB - The management of water resources needs robust methods to efficiently reduce nitrate loads. Knowledge on where natural denitrification takes place in the subsurface is thereby essential. Nitrate is naturally reduced in anoxic environments and high-resolution information of the redox interface, that is, the depth of the uppermost reduced zone is crucial to understand the variability of the denitrification potential. In this study we explore the opportunity to use random forest (RF) regression to model redox depth across Denmark at 100-m resolution based on ~13,000 boreholes as training data. We highlight the importance of expert knowledge to guide the RF model in areas where our conceptual understanding is not represented correctly in the training data set by addition of artificial observations. We apply random forest regression kriging in which sequential Gaussian simulation models the RF residuals. The RF model reaches a R
2 score of 0.48 for an independent validation test. Including sequential Gaussian simulation honors observations through local conditioning, and the spread of 800 realizations can be utilized to map uncertainty. Emphasis is put on adequate handling of nonstationarities in variance and spatial correlation of the RF residuals. The RF residuals show no spatial correlation for large parts of the modeling domain, and a local variance scaling method is applied to account for the nonstationary variance. Moreover, we present and exemplify a framework where newly acquired field data can easily be integrated into random forest regression kriging to quickly update local models.
UR - http://www.scopus.com/inward/record.url?scp=85065987041&partnerID=8YFLogxK
U2 - 10.1029/2018WR023939
DO - 10.1029/2018WR023939
M3 - Article
SN - 0043-1397
VL - 55
SP - 1451
EP - 1469
JO - Water Resources Research
JF - Water Resources Research
IS - 2
ER -