TY - JOUR
T1 - Creating 1-km long-term (1980–2014) daily average air temperatures over the Tibetan Plateau by integrating eight types of reanalysis and land data assimilation products downscaled with MODIS-estimated temperature lapse rates based on machine learning
AU - Zhang, Hongbo
AU - Immerzeel, W.w.
AU - Zhang, Fan
AU - De Kok, Remco J.
AU - Gorrie, Sally J.
AU - Ye, Ming
N1 - Funding Information:
The produced 1-km daily air temperature of the Tibetan Plateau during 1980-2014 can be obtained at http://data.tpdc.ac.cn/en/data/62234872-c39c-4614-a7ac-348c9437e7d5/ , with DOI numbers 10.11888/Meteoro.tpdc.270377. The developed monthly TLRs with eight types of resolutions and the corresponding quality control files can be obtained at http://hongbozhang.net/wph/?page_id=504 . This study was supported by State Key Laboratory of Cryospheric Science, Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences (Grant Number: SKLCS-OP-2020-13), the Second Tibetan Plateau Scientific Expedition and Research Program (Grant No. 2019QZKK0203), State Key Laboratory of Hydrology-Water Resources and Hydraulic Engineering, Nanjing Hydraulic Research Institute (Grant No. 2019nkms02), the National Natural Science Foundation of China (Grant No. 41701079), the “Strategic Priority Research Program” of the Chinese Academy of Sciences (Grant No. XDA20100300 and XDA20060202), the European Research Council (ERC) under the European Union Horizon 2020 Research and Innovation Program (Grant Agreement No. 676819), Netherlands Organization for Scientific Research (Grant No. 016.181.308 and ALWOP.467) and the China Scholarship Council. We appreciate China Meteorological Administration and multiple field stations of the Institute of Tibetan Plateau Research, Chinese Academy of Sciences for providing daily or sub-daily air temperature observations. Thanks to Dr. Philip Kraaijenbrink for helping obtain the daily mean temperature data from ERA5.
Funding Information:
The produced 1-km daily air temperature of the Tibetan Plateau during 1980-2014 can be obtained at http://data.tpdc.ac.cn/en/data/62234872-c39c-4614-a7ac-348c9437e7d5/, with DOI numbers 10.11888/Meteoro.tpdc.270377. The developed monthly TLRs with eight types of resolutions and the corresponding quality control files can be obtained at http://hongbozhang.net/wph/?page_id=504. This study was supported by State Key Laboratory of Cryospheric Science, Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences (Grant Number: SKLCS-OP-2020-13), the Second Tibetan Plateau Scientific Expedition and Research Program (Grant No. 2019QZKK0203), State Key Laboratory of Hydrology-Water Resources and Hydraulic Engineering, Nanjing Hydraulic Research Institute (Grant No. 2019nkms02), the National Natural Science Foundation of China (Grant No. 41701079), the ?Strategic Priority Research Program? of the Chinese Academy of Sciences (Grant No. XDA20100300 and XDA20060202), the European Research Council (ERC) under the European Union Horizon 2020 Research and Innovation Program (Grant Agreement No. 676819), Netherlands Organization for Scientific Research (Grant No. 016.181.308 and ALWOP.467) and the China Scholarship Council. We appreciate China Meteorological Administration and multiple field stations of the Institute of Tibetan Plateau Research, Chinese Academy of Sciences for providing daily or sub-daily air temperature observations. Thanks to Dr. Philip Kraaijenbrink for helping obtain the daily mean temperature data from ERA5.
Publisher Copyright:
© 2021 The Authors
PY - 2021/5/1
Y1 - 2021/5/1
N2 - Air temperature (Tair) is critical to modeling environmental processes (e.g. snow/glacier melting) in high-elevation areas of the Tibetan Plateau (TP). To resolve the issue that Tair observations are scarce in the TP western part and at high elevation, many studies have estimated daily air temperatures by using MODIS land surface temperature (LST) and various reanalysis datasets. These estimates are however inadequate for supporting high-resolution long-term hydrological simulations or climate analysis due to the high cloud cover, short time span or low spatial resolution. To improve the Tair estimation, this study develops a novel machine-learning based method that uses the Gradient Boosting model to efficiently integrate observations from high-elevation stations with eight widely used air temperature reanalysis and assimilation datasets (i.e., NNRP-2, 20CRV2c, JRA-55, ERA-Interim, MERRA-2, CFSR, ERA5 and GLDAS2) downscaled with remote sensing-based temperature lapse rates (TLR). This method is used to generate a new dataset of daily air temperature with the 1-km resolution for the period of 1980–2014. To overcome the problem that TLR derived from limited stations may be unreliable, a new TLR estimation method is developed to first estimate spatially continuous monthly TLRs from MODIS LST and then downscale daily mean Tair from eight reanalysis and assimilation datasets to obtain Tair at the 1-km resolution using the MODIS-estimated TLRs. The Gradient Boosting (GB) model is selected for integrating the eight downscaled Tair and five other auxiliary variables. The models are trained and validated using observations from 100 common stations (i.e. China Meteorology Administration stations) and 13 independent high-elevation stations (4 on glaciers). The results show that the proposed TLR estimation method can efficiently reduce exceptional TLRs in the meantime keeping acceptable downscaling accuracy. The downscaled Tair from JRA-55 is the best among the eight downscaled datasets followed by ERA-Interim, MERRA-2, CFSR and others. Finally, the GB-integrated Tair further outperforms the downscaled JRA-55 Tair with the mean root-mean-squared-deviation (RMSD) of 1.7 °C versus 2.0 °C, especially in high-elevation stations with mean RMSD of 1.9 °C versus 2.7 °C. Both the MODIS-estimated TLR and the high-elevation training observations are demonstrated to significantly improve the air temperature estimation accuracy of the GB model in high-elevation stations. This study also provides a framework for integrating multiple reanalysis and assimilation temperature data with elevation correction in mountainous regions that is not restricted to the TP.
AB - Air temperature (Tair) is critical to modeling environmental processes (e.g. snow/glacier melting) in high-elevation areas of the Tibetan Plateau (TP). To resolve the issue that Tair observations are scarce in the TP western part and at high elevation, many studies have estimated daily air temperatures by using MODIS land surface temperature (LST) and various reanalysis datasets. These estimates are however inadequate for supporting high-resolution long-term hydrological simulations or climate analysis due to the high cloud cover, short time span or low spatial resolution. To improve the Tair estimation, this study develops a novel machine-learning based method that uses the Gradient Boosting model to efficiently integrate observations from high-elevation stations with eight widely used air temperature reanalysis and assimilation datasets (i.e., NNRP-2, 20CRV2c, JRA-55, ERA-Interim, MERRA-2, CFSR, ERA5 and GLDAS2) downscaled with remote sensing-based temperature lapse rates (TLR). This method is used to generate a new dataset of daily air temperature with the 1-km resolution for the period of 1980–2014. To overcome the problem that TLR derived from limited stations may be unreliable, a new TLR estimation method is developed to first estimate spatially continuous monthly TLRs from MODIS LST and then downscale daily mean Tair from eight reanalysis and assimilation datasets to obtain Tair at the 1-km resolution using the MODIS-estimated TLRs. The Gradient Boosting (GB) model is selected for integrating the eight downscaled Tair and five other auxiliary variables. The models are trained and validated using observations from 100 common stations (i.e. China Meteorology Administration stations) and 13 independent high-elevation stations (4 on glaciers). The results show that the proposed TLR estimation method can efficiently reduce exceptional TLRs in the meantime keeping acceptable downscaling accuracy. The downscaled Tair from JRA-55 is the best among the eight downscaled datasets followed by ERA-Interim, MERRA-2, CFSR and others. Finally, the GB-integrated Tair further outperforms the downscaled JRA-55 Tair with the mean root-mean-squared-deviation (RMSD) of 1.7 °C versus 2.0 °C, especially in high-elevation stations with mean RMSD of 1.9 °C versus 2.7 °C. Both the MODIS-estimated TLR and the high-elevation training observations are demonstrated to significantly improve the air temperature estimation accuracy of the GB model in high-elevation stations. This study also provides a framework for integrating multiple reanalysis and assimilation temperature data with elevation correction in mountainous regions that is not restricted to the TP.
KW - MODIS land surface temperature
KW - Reanalysis data
KW - Spatial downscaling
KW - Temperature lapse rate
KW - Tibetan Plateau
UR - http://www.scopus.com/inward/record.url?scp=85114367840&partnerID=8YFLogxK
U2 - 10.1016/j.jag.2021.102295
DO - 10.1016/j.jag.2021.102295
M3 - Article
SN - 0303-2434
VL - 97
SP - 1
EP - 18
JO - International Journal of Applied Earth Observation and Geoinformation
JF - International Journal of Applied Earth Observation and Geoinformation
M1 - 102295
ER -