Loading…

Machine learning for accelerating process‐based computation of land biogeochemical cycles

Global change ecology nowadays embraces ever‐growing large observational datasets (big‐data) and complex mathematical models that track hundreds of ecological processes (big‐model). The rapid advancement of the big‐data‐big‐model has reached its bottleneck: high computational requirements prevent fu...

Full description

Saved in:
Bibliographic Details
Published in:Global change biology 2023-06, Vol.29 (11), p.3221-3234
Main Authors: Sun, Yan, Goll, Daniel S., Huang, Yuanyuan, Ciais, Philippe, Wang, Ying‐Ping, Bastrikov, Vladislav, Wang, Yilong
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Global change ecology nowadays embraces ever‐growing large observational datasets (big‐data) and complex mathematical models that track hundreds of ecological processes (big‐model). The rapid advancement of the big‐data‐big‐model has reached its bottleneck: high computational requirements prevent further development of models that need to be integrated over long time‐scales to simulate the distribution of ecosystems carbon and nutrient pools and fluxes. Here, we introduce a machine‐learning acceleration (MLA) tool to tackle this grand challenge. We focus on the most resource‐consuming step in terrestrial biosphere models (TBMs): the equilibration of biogeochemical cycles (spin‐up), a prerequisite that can take up to 98% of the computational time. Through three members of the ORCHIDEE TBM family part of the IPSL Earth System Model, including versions that describe the complex interactions between nitrogen, phosphorus and carbon that do not have any analytical solution for the spin‐up, we show that an unoptimized MLA reduced the computation demand by 77%–80% for global studies via interpolating the equilibrated state of biogeochemical variables for a subset of model pixels. Despite small biases in the MLA‐derived equilibrium, the resulting impact on the predicted regional carbon balance over recent decades is minor. We expect a one‐order of magnitude lower computation demand by optimizing the choices of machine learning algorithms, their settings, and balancing the trade‐off between quality of MLA predictions and need for TBM simulations for training data generation and bias reduction. Our tool is agnostic to gridded models (beyond TBMs), compatible with existing spin‐up acceleration procedures, and opens the door to a wide variety of future applications, with complex non‐linear models benefit most from the computational efficiency. The rapid advancement of the big‐data‐big‐model has reached its bottleneck: high computational requirements prevent further development of Terrestrial Biosphere Models (TBM) that need to be integrated over long time scales to simulate the distribution of ecosystems carbon and nutrient pools and fluxes. To tackle this grand challenge, we developed a machine‐learning acceleration (MLA) tool for the most resource‐consuming step in TBMs: the equilibration of biogeochemical cycles (spin‐up). We show that an unoptimized MLA reduced the computation demand by 77%–80% for global studies via interpolating the equilibrated state of biogeoch
ISSN:1354-1013
1365-2486
DOI:10.1111/gcb.16623