TY - JOUR
T1 - Exploring the exceptional performance of a deep learning stream temperature model and the value of streamflow data
AU - Rahmani, Farshid
AU - Lawson, Kathryn
AU - Ouyang, Wenyu
AU - Appling, Alison
AU - Oliver, Samantha
AU - Shen, Chaopeng
N1 - Funding Information:
FR was supported by the Pennsylvania Water Resources Research Center graduate internship G19AC00425, with funding for that fellowship and AA and SO provided by the Integrated Water Prediction Program at the US Geological Survey. CS was supported by National Science Foundation Award OAC #1940190. Data sources have been cited in the paper, and all model inputs, outputs, and code are archived in a data release (Rahmani et al 2020). The LSTM code for modeling streamflow is available at https://github.com/mhpi/hydroDL. CS and KL have financial interests in HydroSapient, Inc., a company which could potentially benefit from the results of this research. This interest has been reviewed by the University in accordance with its Individual Conflict of Interest policy, for the purpose of maintaining the objectivity and the integrity of research at The Pennsylvania State University. Any use of trade, firm, or product names is for descriptive purposes only and does not imply endorsement by the US Government.
Publisher Copyright:
© 2021 The Author(s).
PY - 2021/2
Y1 - 2021/2
N2 - Stream water temperature (Ts) is a variable of critical importance for aquatic ecosystem health. Ts is strongly affected by groundwater-surface water interactions which can be learned from streamflow records, but previously such information was challenging to effectively absorb with process-based models due to parameter equifinality. Based on the long short-term memory (LSTM) deep learning architecture, we developed a basin-centric lumped daily mean Ts model, which was trained over 118 data-rich basins with no major dams in the conterminous United States, and showed strong results. At a national scale, we obtained a median root-mean-square error of 0.69°C, Nash-Sutcliffe model efficiency coefficient of 0.985, and correlation of 0.994, which are marked improvements over previous values reported in literature. The addition of streamflow observations as a model input strongly elevated the performance of this model. In the absence of measured streamflow, we showed that a two-stage model could be used, where simulated streamflow from a pre-trained LSTM model (Qsim) still benefited the Ts model even though no new information was brought directly into the inputs of the Ts model. The model indirectly used information learned from streamflow observations provided during the training of Qsim, potentially to improve internal representation of physically meaningful variables. Our results indicate that strong relationships exist between basin-averaged forcing variables, catchment attributes, and Ts that can be simulated by a single model trained by data on the continental scale.
AB - Stream water temperature (Ts) is a variable of critical importance for aquatic ecosystem health. Ts is strongly affected by groundwater-surface water interactions which can be learned from streamflow records, but previously such information was challenging to effectively absorb with process-based models due to parameter equifinality. Based on the long short-term memory (LSTM) deep learning architecture, we developed a basin-centric lumped daily mean Ts model, which was trained over 118 data-rich basins with no major dams in the conterminous United States, and showed strong results. At a national scale, we obtained a median root-mean-square error of 0.69°C, Nash-Sutcliffe model efficiency coefficient of 0.985, and correlation of 0.994, which are marked improvements over previous values reported in literature. The addition of streamflow observations as a model input strongly elevated the performance of this model. In the absence of measured streamflow, we showed that a two-stage model could be used, where simulated streamflow from a pre-trained LSTM model (Qsim) still benefited the Ts model even though no new information was brought directly into the inputs of the Ts model. The model indirectly used information learned from streamflow observations provided during the training of Qsim, potentially to improve internal representation of physically meaningful variables. Our results indicate that strong relationships exist between basin-averaged forcing variables, catchment attributes, and Ts that can be simulated by a single model trained by data on the continental scale.
UR - http://www.scopus.com/inward/record.url?scp=85100611472&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85100611472&partnerID=8YFLogxK
U2 - 10.1088/1748-9326/abd501
DO - 10.1088/1748-9326/abd501
M3 - Article
AN - SCOPUS:85100611472
SN - 1748-9318
VL - 16
JO - Environmental Research Letters
JF - Environmental Research Letters
IS - 2
M1 - 024025
ER -