MetaFlux: Meta-learning global carbon fluxes from sparse spatiotemporal observations

Nathaniel, Juan; Liu, Jiangong; Gentine, Pierre

doi:10.1038/s41597-023-02349-y

Download PDF

Data Descriptor
Open access
Published: 11 July 2023

MetaFlux: Meta-learning global carbon fluxes from sparse spatiotemporal observations

Scientific Data volume 10, Article number: 440 (2023) Cite this article

4253 Accesses
1 Citations
13 Altmetric
Metrics details

Subjects

Abstract

We provide a global, long-term carbon flux dataset of gross primary production and ecosystem respiration generated using meta-learning, called MetaFlux. The idea behind meta-learning stems from the need to learn efficiently given sparse data by learning how to learn broad features across tasks to better infer other poorly sampled ones. Using meta-trained ensemble of deep models, we generate global carbon products on daily and monthly timescales at a 0.25-degree spatial resolution from 2001 to 2021, through a combination of reanalysis and remote-sensing products. Site-level validation finds that MetaFlux ensembles have lower validation error by 5–7% compared to their non-meta-trained counterparts. In addition, they are more robust to extreme observations, with 4–24% lower errors. We also checked for seasonality, interannual variability, and correlation to solar-induced fluorescence of the upscaled product and found that MetaFlux outperformed other machine-learning based carbon product, especially in the tropics and semi-arids by 10–40%. Overall, MetaFlux can be used to study a wide range of biogeochemical processes.

The FLUXNET2015 dataset and the ONEFlux processing pipeline for eddy covariance data

Article Open access 09 July 2020

Eighteen years of upland grassland carbon flux data: reference datasets, processing, and gap-filling procedure

Article Open access 23 May 2023

The FLUXCOM ensemble of global land-atmosphere energy fluxes

Article Open access 27 May 2019

Background & Summary

Data sparsity is a prevalent challenge in climate science and ecology. For example, in-situ observations tend to be spatially and temporally sparse due to sensor malfunctions, limited sensor locations, or non-ideal climate conditions such as persistent cloud cover. Consequently, understanding many climate processes can be difficult because the data do not capture the full natural variability in both space and time. FLUXNET2015 is a global network of eddy-covariance stations that captures carbon, water, and energy exchanges between the atmosphere and biosphere and provides high-quality ecosystem-scale observations spanning many climate and ecosystem types¹. However, its coverage is neither continuous nor temporally dense, especially in the years prior to 2000¹. Furthermore, its distribution across climate zones is not balanced, with only around 8% and 11% of the current operational stations located in the tropics and semi-arid regions, which are regions of critical importance for the global carbon cycle². For instance, there is increasing evidence that most of the global interannual carbon variability can be attributed to the semi-arid ecosystems in the southern hemisphere³. Thus, the lack of high-resolution observations in these data-sparse, yet important areas may inhibit our overall understanding of the global carbon cycle, especially in light of climate change.

The machine-learning community has tried to tackle the data sparsity problem in many ways, including the development of several few-shot learning approaches^4,5. One of these is the meta-learning approach that “learns how to learn from different tasks”. The idea behind this learning paradigm closely resembles how humans learn: we extract high-level features from previously learned tasks to quickly solve new problems. For instance, we can memorize a new person’s face with very few samples because we understand how a face should look after seeing many other faces. Although applications of meta-learning have been limited^6,7, there has been a growing popularity in the applied sciences^8,9. However, as far as we know, there is little work being done on the use of meta-learning in climate and environmental sciences, especially with regards to sparse and extreme spatiotemporal observations. In addition, to the best of our knowledge, there has been no upscaling effort to date that uses an ensemble of meta-trained deep models to produce a spatiotemporally continuous climate product from sparse observations. And given the importance of carbon fluxes in diagnosing the earth’s changing climate^10,11, there is a growing need to have a globally continuous, high-resolution dataset that best represents critical regions that, unfortunately, tend to have few data points.

To bridge these gaps, we aim to (i) evaluate the performance of meta-learning in environments where spatiotemporal information is sparse, (ii) check its robustness when predicting extreme cases that are critical to the carbon cycle, and (iii) upscale point data to a globally continuous map using an ensemble of meta-trained deep models. In particular, we focus on gross primary production (GPP) and ecosystem respiration (R_eco) as specific applications of the broader terrestrial carbon cycle. The upscaled product is resolved at 0.25-degree spatial resolution, spanning either at daily or monthly temporal resolutions between the years 2001 and 2021. Preliminary analysis shows that our global product (“MetaFlux”) is internally consistent in terms of its seasonality, interannual trend, and variability, and has high correlation with satellite-based solar-induced fluorescence (SIF) – a proxy for photosynthesis – when compared with other data-driven products, especially in the tropics and semi-arid critical regions.

Finally, the dataset is freely accessible in Zenodo at https://doi.org/10.5281/zenodo.7761881¹², while the meta-learning code can be reproduced and extended from https://github.com/juannat7/metaflux with example notebooks to apply our approach to your specific use cases. The overall methodology is summarized in Fig. 1.

Methods

Meta-learning: learning how to learn

Meta-learning is a machine-learning paradigm that trains a model to learn new tasks from sparse data efficiently¹³, leveraging information that comes from tasks with more available data. In general and as illustrated in Fig. 3a, meta-learning involves two stages: a meta-training and a meta-update stage. During meta-training, the model, f_θ (a function f that is parameterized by θ), proposes intermediate parameters, ϕ, that minimize the loss of base tasks¹⁴. In the meta-update step, the model fine-tunes its parameter, θ, given ϕ and the target tasks¹⁴, seeking the optimal parameter θ*. We define target tasks as a collection of stations in data-sparse regions, including the tropics, semi-arid regions, and representative stations in each ecoregion defined by plant functional types (PFTs), while the base tasks consist of the complement of the former (i.e. stations in data-abundant regions). Given the two-step gradient update procedures in meta-training and meta-update loops, the optimal parameters θ*, are not biased toward data-abundant base tasks, as illustrated in Fig. 3a, whereas in the baseline case (Fig. 3b), the optimal learned parameters would be biased toward data-abundant tasks as each data point contributes similarly to model’s learning. The details on how the data is split, including how the base (D^base) and target (D^target) datasets are divided for training and testing, are provided in the “Training setup” section below and illustrated in Fig. 4. For this work, we use an optimization-based meta-learning approach that is adapted from the model-agnostic meta-learning (MAML)¹³ as detailed in Fig. 2a.

Differentiable learners

Next, we will discuss the different deep learning models used for this work, including a multilayer perceptron (MLP), long-short term memory (LSTM), and bi-directional LSTM (BiLSTM).

Multilayer perceptron (MLP)

MLP or the feedforward artificial neural network is a fully connected deep model that can capture nonlinear relationships between inputs and the response variable¹⁵. Generally, an MLP consists of the input and output layers with several hidden layers and is activated by a set of nonlinear functions. In this paper, we use the Leaky Rectified Linear Unit (LeakyReLU) which is formulated in Eq. 1, where α controls the extent of “leakiness” in the negative x direction. The MLP model receives instantaneous weather data and vegetation index.

$$R(x)=\left\{\begin{array}{cc}x & x > 0\\ \alpha x & \,x < =0\end{array}\right.$$

(1)

Long-Short Term Memory (LSTM)

Time-series representing environmental processes tends to be strongly autocorrelated in time¹⁶ and Recurrent Neural Networks (RNN) were first introduced to solve this issue. The LSTM model was then proposed by¹⁷ to address the issues of vanishing and exploding gradients commonly observed in RNNs. The model can preserve long-term dependencies of sequential data through its gated structure that controls how information flows across cells. It is able to leverage association across multiple timesteps to inform inferential tasks where time dependency is present and significant. This can be especially useful to represent water stress, for instance, that depends not only on current daily precipitation but also on previous time steps of precipitation (water supply) and evaporative demand (temperature, radiation, humidity).

Bi-directional LSTM (BiLSTM)

The BiLSTM model is trained on both forward and backward timesteps to best estimate the value at the current timestep, t¹⁸, similar to reanalysis products (compared to weather forecasts that only use past information like LSTMs). BiLSTM has been used in cases where the past is as important as the future contexts¹⁹. The equations describing a BiLSTM cell are similar to that in LSTM with a slight modification in the hidden state representations that have to capture the forward and backward timesteps.

Training setup

We train an ensemble of meta-trained deep networks. The purpose of training an ensemble is to quantify uncertainty and reduce the bias of each individual model²⁰. The final model architecture and hyperparameters, including batch size and learning rates, are determined after performing a k-fold cross validation (k = 5) on the training set. In all, we use a 3-layer MLP with a hidden size of 350. We replace the first layer with either the LSTM or BiLSTM modules for the LSTM and BiLSTM models respectively to capture temporal features prior to the final prediction layer. The optimized ensemble is trained by minimizing the mean squared error (MSE). We train our models to estimate GPP and R_eco at daily and monthly timescale across 206 FLUXNET2015 sites¹ (retrievable from https://fluxnet.org/data/fluxnet2015-dataset/) using a combination of meteorological and remote sensing inputs including precipitation, air temperature at 2-meter (Ta), vapor pressure deficit (VPD), and incoming short-wave radiation from ERA5 reanalysis data²¹ (retrievable from https://cds.climate.copernicus.eu/) and leaf area index (LAI) from the Moderate Resolution Imaging Spectroradiometer (MODIS) product²² (retrievable from https://modis-land.gsfc.nasa.gov/). We retrieve the associated time-series closest to each tower site. In particular, we use the night-time partitioning methods for GPP and R_eco and match each flux record with reanalysis and remote sensing data corresponding to the station of interest. Aside from the target variable, we perform a z-normalization of the inputs to improve the learning process (Eq. 2).

$$z-normalized\;X=\frac{X-E(X)}{{\sigma }_{X}}$$

(2)

where E(X) is the estimated expected value of the variable X on our training data, and σ_X the corresponding sample standard deviation.

Next, we define class C_i as a set of batches consisting of input variables and target flux observations, with i denoting the index of batches of size 256. The definition of batch differs between linear MLP and time-series models, such as LSTM and BiLSTM. A batch, in the linear MLP case, refers to a collection of instantaneous data points, while in time-series models, it corresponds to a set of 30-day continuous data points. The choice of a 30-day window to form a single data point in the latter case is to better capture seasonal water stress²³. We construct D^target by randomly selecting half of the stations in the tropical and semi-arid regions (defined by the Köppen classification²⁴, retrievable from http://www.gloh2o.org/koppen/), which are sparse, and one station from each plant functional type (PFT) including those in the cropland and boreal areas²⁵. By extension, the D^base consist of stations that are complement to D^target. Each dataset is divided into training (D_train) and testing (D_test) sets with an 80:20 split ratio. As the term suggest, the former is used to train the model f_θ, while the latter is used to validate the performance of the model. Overall, the set of all possible datasets, D, includes $\left({D}_{train}^{base},{D}_{test}^{base},{D}_{train}^{target},{D}_{test}^{target}\right)$. Figure 4 illustrates how the dataset for meta-learning is constructed, with base tasks data being used for meta-training and that from target tasks for meta-update steps. We meta-train three models: MLP, LSTM, and BiLSTM, with an ensemble of five members each; where their individual weights are randomly initialized.

The non-meta-learning baseline uses identical architectures and hyperparameters, but the models do not have a meta-update outer loop (see Fig. 2b). This learning paradigm is similar to a single-step gradient descent learning approach²⁶ where we backpropagate the gradient of the loss function to update f_θ. This learning mode, however, can be biased to representations of tasks that have a lot more data. To ensure that a similar data structure is used in the baseline case, we compile $\left({D}_{train}^{base},{D}_{train}^{target}\right)$ as the training set and $\left({D}_{test}^{base},{D}_{test}^{target}\right)$ as the testing set.

Upscaling of global products

For the upscaling portion of this work, we use a similar set of meteorological and remote sensing inputs as during training at either the daily or monthly timesteps. Since VPD is not available in the existing ERA5 catalogue, we estimate it from air and dewpoint temperatures through the saturated (SVP) and actual vapor pressure (AVP) relation: VPD = SVP-AVP, which are both functions of Ta and dewpoint temperatures (Td). Finally, the spatial resolution of the resulting data inputs is harmonized to 0.25-degree using an arithmetic averaging. The final product has four variables, including the ensemble mean estimate of GPP and R_eco, and its uncertainty as captured by the standard deviation.

Evaluation on the site and global level

First, we compare the performance of meta-trained versus non-meta-trained models in terms of their RMSE scores on the testing sets. In addition, we evaluate how robust meta-trained models are in predicting extreme fluxes. This is done by selecting GPP or R_eco fluxes that exceed a predefined z-normalized threshold, t, that we vary between 1.0 and 2.0 (i.e. higher threshold means more extreme observations away from the mean value).

Next, we evaluate the upscaled product by analyzing its seasonality and interannual trends across climate zones. Thereafter, we compute the interannual variability using the interannual coefficient of variation (CV; Eq. 3) at the pixel level:

$$CV=\frac{\sigma }{\mu }$$

(3)

where σ and μ are the interannual standard deviation and mean, respectively.

Finally, the Pearson correlation coefficient between GPP and solar-induced fluorescence (SIF) from CSIF²⁷ (retrievable from https://figshare.com/articles/dataset/CSIF/6387494) and TROPOMI SIF²⁸ (retrievable from http://ftp.sron.nl/open-access-data-2/TROPOMI/tropomi/sif/v2.1/l2b/) is calculated across climate zones on a monthly timescale, for the periods 2001–2018 and 2019–2020, respetively. To benchmark our product, we compare our GPP-SIF correlation estimate, r(GPP_metaflux, SIF) with that of Fluxcom data-driven product²⁹, r(GPP_fluxcom, SIF), between the years 2001 and 2020. Generally-speaking, a higher correlation corresponds to a better GPP estimate, though this is not always the case as different ecosystem regimes and physiological characteristics may manifest different associative patterns³⁰.

Data Records

The global products amount to around 50GB and are freely accessible in Zenodo at https://doi.org/10.5281/zenodo.7761881¹². The spatial resolution is 0.25-degree, extending between 90-degree north to 90-degree south, and between 180-degree west and 180-degree east. We mask out cold regions that consist of the Arctic circle and Antarctica. Each Network Common Data Form (NetCDF) file contains four variables: GPP, R_eco, GPP_std, R_eco_std that represent GPP, R_eco ensemble mean and their uncertainties respectively. Temporally, each file is resolved at either the daily or monthly timescale. For instance, Fig. 5 illustrates the annual ensemble mean, while Fig. 6 the ensemble uncertainties of GPP and R_eco for the year 2021. We note that GPP tends to have higher uncertainty than R_eco, especially in the equator and higher-latitude regions.

For the daily product, the naming convention for each .nc file is METAFLUX_GPP_RECO_daily_<year><month>.nc; where <year> takes a value between 2001 and 2021 and <month> between 01 and 12 for January and December.

For the monthly product, we perform identical training and upscaling steps but using monthly, rather than daily fluxes, reanalysis, and remote sensing products. The naming convention for each file is METAFLUX_GPP_RECO_monthly_<year>.nc; where <year> takes a value between 2001 and 2021.

Technical Validation

In this section, we first evaluate our meta-learning approach based on site-level validation RMSE and its robustness to extreme observations. Next, we examine the seasonality, interannual trend and variability, and correlation with independent SIF products.

Evaluation of meta-learning as a learning framework

Convergence and site-level performance

As illustrated in Fig. 7 and Table 1, meta-trained deep models generally perform better than their baseline non-meta-trained counterparts. For instance, the validation RMSE of the meta-trained MLP on GPP is 3.13 gC m⁻² d⁻¹ ± 0.06 as compared to 3.47 gC m⁻² d⁻¹ ± 0.07 in the baseline case. A similar result is observed for R_eco where the RMSE of the meta-trained MLP is 3.07 gC m⁻² d⁻¹ ± 0.05 as compared to 3.31 gC m⁻² d⁻¹ ± 0.07 in the baseline case. In addition, the choice of deep networks matters. Overall, models that incorporate temporal information, i.e. the LSTM and BiLSTM models, perform better than models that do not. In the GPP case, for example, the non-meta-trained BiLSTM model has the lowest validation error of 3.00 gC m⁻² d⁻¹ ± 0.04, followed by the meta-trained LSTM model with an RMSE of 3.06 gC m⁻² d⁻¹ ± 0.06. This confirms our physical intuition that water stress, which tends to regulate productivity, builds up over many days to months and thus requires a memory process as captured by the recurrent neural networks. Moreover, plant photosynthesis and respiration can acclimate to the prevailing environmental conditions, such as temperature, light and VPD^31,32, which tend to be captured more effectively by memory-informed models. Nonetheless, the addition of bi-directionality in the BiLSTM model does not appear to significantly reduce error in the meta-trained models. This can be because the concept of data assimilation from future context has been captured through the process of meta-learning itself or that the signal coming from unidirectional timeseries is sufficiently saturated to parameterize the model. In other words, since our meta-learning approach primarily considers the spatial heterogeneity of the fluxes (e.g., across climate zones and PFTs), this spatial information, along with the temporal signals coming from BiLSTM gradient steps, result in a more unstable learning due to signal oversaturation which is evident from the larger convergence spread across model runs. This can be regularized by considering not just spatial, but also the spatiotemporal heterogeneity in a meta-learning approach³³, though this will increase the complexity of the algorithm and could potentially limit its extrapolation capacity. This remains the subject of future work.

Table 1 Site-level validation RMSE (gCm⁻² d⁻¹) across differentiable models for GPP and R_eco.

Full size table

Robustness under extreme conditions

Making an accurate estimate for extreme cases is especially important in climate science because extreme weather tends to cause catastrophic damages, such as major droughts, wildfires, or plant mortality^34,35. Fig. 8 illustrates the performance of our meta-trained models under an increasing magnitude of extremes as defined by the z-normalized threshold, t. In general, our meta-trained models (orange line) are more robust in predicting extreme cases of observed GPP and R_eco (i.e. lower validation RMSE) than their baseline counterparts (blue line), with a difference of around 1.2 gC m⁻² d⁻¹ and 0.7 gC m⁻² d⁻¹ for GPP and R_eco respectively.

If we further examine model performance across climate zones (Tables 2, 3) and select extreme fluxes with a normalized-target threshold, t, that is greater than 1.0, we find that our meta-trained models outperform the baselines. The reason why we choose this threshold is to have sufficient extreme observations across climate zones such that a more meaningful comparison can be made. In the GPP case, for example, meta-trained ensemble has lower validation RMSE of 3.78 gC m⁻² d⁻¹ ± 0.33 (versus 4.10 gC m⁻² d⁻¹ ± 0.29) and 3.04 gC m⁻² d⁻¹ ± 0.02 (versus 3.45 gC m⁻² d⁻¹ ± 0.06) in the semi-arid and tropics, respectively. A similar finding is observed for R_eco where meta-trained ensemble has lower validation RMSE of 2.35 gC m⁻² d⁻¹ ± 0.04 (versus 2.65 gC m⁻² d⁻¹ ± 0.06) in the tropics. These results are promising as the representation of both the tropics and semi-arid regions in many upscaled products is often challenging due to the limited number of observations available and the complex, memory-like processes involved. For example, in the semi-arid regions, there is a build-up of time-dependent water stresses³⁶, while in the tropics, there is a complex seasonal cycle of leaf flushing and phenology^37,38. Our approach is superior in its ability to reproduce carbon fluxes in the tropics and semi-arid areas because limited data here are optimally enriched with shared information coming from other data-abundant regions through meta-learning.

Table 2 Robustness of meta-trained ensemble when inferring extreme GPP observations across climate zones at the normalized-target threshold, t > 1.0 (all units in gC m⁻² d⁻¹).

Full size table

Table 3 Robustness of meta-trained ensemble when inferring extreme R_eco observations across climate zones at the normalized-target threshold, t > 1.0 (all units in gC m⁻² d⁻¹).

Full size table

Evaluation of meta-learned global data

Now that we have validated our meta-learning framework on the site level, we proceed to evaluate the internal consistency of our upscaled product. This includes the analysis of seasonality, interannual variability, and comparison to SIF as an independent photosynthesis product.

Temporal analysis

First, we analyze the seasonality of our upscaled GPP and R_eco across months for the years between 2001 and 2021. As shown in Fig. 9, both fluxes exhibit similar seasonality albeit at different magnitudes. The tropics (including the dry and wet regions) contribute the most to the global GPP and R_eco, as expected^39,40, while the semi-arid regions contribute the least⁴¹. Carbon fluxes in the temperate (northern hemisphere) and continental regions exhibit unimodal variations that peak in the summer (June, July, and August - JJA), while those in the southern temperate regions peak in December, January, and February (DJF)⁴². On average, the temperate regions have higher carbon fluxes than the continental areas, which tend to be limited by light and temperature, with shorter growing seasons⁴³.

Another interesting analysis is to understand the long-term trends of our global carbon fluxes. As observed in Fig. 10, our meta-trained global carbon product shows an overall increase in GPP by 0.0113 PgCyr⁻¹ and R_eco by 0.0101 PgCyr⁻¹. We extend Fig. 10 by making a comparison with other carbon flux products, including those from light response function (LRF)⁴⁴, P-model⁴⁵, MODIS17 (MOD17)⁴⁶, Soil Moisture Active Passive (SMAP)⁴⁷, vegetation photosynthesis model (VPM)²⁷, and Global LAnd Surface Satellite (GLASS)⁴⁸ for GPP, as well as Fluxcom²⁹ for both GPP and R_eco. Overall, they show similar peaks and declines, albeit at varying magnitudes (between 100–140 PgC yr⁻¹) as shown in Fig. 11.

Interannual variability

We find that the semi-arid regions of Australia, America, and some parts of the northern latitudes have the largest interannual variability of GPP and R_eco (Fig. 12). This is consistent with results from² and³ that reported significant contribution of these regions, particularly of the Australian ecosystems, in explaining much of the global carbon interannual variability. As a result, the high turnover rate of carbon pools in these semi-arid environments warrants further research into how the climate and anthropogenic factors can account for this large interannual variability, such as the extent of carbon stock decomposition (e.g. due to wildfire) and accumulation during the dry and wet seasons. In addition, our upscaled product shows high interannual variability in the dry tropical regions of Asia. However, this variation becomes smaller in the tropical forests of Asia, Africa, and America owing to their relatively stable climate. This can be attributed to the region’s sensitivity to rainfall pattern driven by El Niño-Southern Oscillation (ENSO), or soil moisture^49,50 and rapid land-use changes⁵¹. In contrast to Fluxcom, our upscaled product does not show as much interannual variability, especially in desert regions (eg. Australia, Central America, South America, and Central Asia), which may be more accurate owing to the extremely low primary productivity there in the first place⁵². Nonetheless, we note that in some parts of the globe, especially along the Sahel and continental Western Europe, the interannual variability of carbon fluxes from MetaFlux is smaller. Physically, this phenomenon has been reported by⁵³ and⁵⁴ who observe how variations in terrestrial carbon productivity tend to be stronger in space rather than time. The second plausible reason would be that the ensemble captures much of this variability (i.e. expressed as standard deviation), where each member model learns a different temporal structure that can result in lower than expected mean interannual variability. Lastly, and as highlighted in Figs. 13, 14, meta-learning attempts to learn efficiently from historically underrepresented regions, such as the tropics, which tend to have low interannual variability. This potentially results in a reduction in such variability at higher latitudes, especially along the temperate and continental regions.

Comparison with Solar-induced fluorescence (SIF)

In order to evaluate the quality of the seasonal cycle of our product, in particular GPP, we measure its correlation coefficient with several SIF products. MetaFlux GPP demonstrates higher Pearson correlation coefficient with both CSIF and TROPOSIF (Figs. 13, 14, Tables 4, 5) than Fluxcom GPP across the temperate, semi-arid, and tropical regions with values higher than 0.8 to 0.9 even at the very northern latitudes. In particular, the correlation coefficient of our upscaled product with TROPOSIF in the semi-arid, tropics, and temperate regions are 0.856 ± 0.083 (versus 0.726 ± 0.165), 0.546 ± 0.299 (versus 0.343 ± 0.164), and 0.919 ± 0.002 (versus 0.826 ± 0.021), respectively. A similar trend is also observed in the CSIF case where the correlation in the semi-arid, tropics, and temperate regions are 0.925 ± 0.026 (versus 0.914 ± 0.060), 0.772 ± 0.080 (versus 0.608 ± 0.105), and 0.922 ± 0.022 (versus 0.914 ± 0.037), respectively. Across the two SIF products, however, we observe weaker correlation strength in the continental regions. Upon further inspection of Figs. 13, 14, the weaker association could be due to the slight uptick of our GPP estimate during the DJF period, which can be attributed to the lower quality of LAI retrievals because of snow cover.

Table 4 Mean Pearson correlation coefficient for the seasonality of GPP (MetaFlux and Fluxcom) and CSIF in the years 2001-2018 across climate zones.

Full size table

Table 5 Mean Pearson correlation coefficient for the seasonality of GPP (MetaFlux and Fluxcom) and TROPOSIF in the years 2019–2020 across climate zones. The numbers in bold represent product with higher GPP-SIF correlation.

Full size table

Finally, we inspect the pixel-level correlation distribution of MetaFlux and Fluxcom with long-term CSIF product, as illustrated in Fig. 15. In general, MetaFlux has lower correlation with SIF in the tropical rainforest of Indonesia and Amazon as well as the arid regions of Australia, Gobi, Arabian, Syrian, Karakum, Taklamakan, Gobi, and the Great Plains in Northern America. This trend is consistent with earlier reports by⁵⁵, for example, who showed how arid and extremely wet tropical regions (e.g. rainforests) tend to have low GPP-SIF correlation because of weak seasonality that essentially drop correlations to background noise level.

In summary, we have developed a new terrestrial carbon flux product, MetaFlux, using an ensemble of meta-learned deep networks. We have demonstrated how meta-learning can better estimate fluxes in data-sparse, yet critical regions (e.g. semi-arid and the tropics) and are more robust to predicting extreme observations. Our global product is able to outperform other reference product when evaluated against independent measurement, such as SIF or on flux tower networks. We believe that although data sparsity can be a major limiting factor to our complete understanding of many climate processes, leveraging knowledge in other similar domains can be powerful to better understand processes and their response to the environment.

Usage Notes

The data is permanently stored in Zenodo at https://doi.org/10.5281/zenodo.7761881¹² and is available at either the monthly or daily temporal scale. Each file contains four variables, GPP, R_eco, GPP_std, and R_eco_std, that are resolved continuously at a 0.25-degree spatial resolution. We purposely mask out the cold regions of Antarctica and the Arctic circle because we assume the lack of GPP and R_eco there. Although we do not mask out the arid regions (e.g. deserts), we would recommend users to do so in order to remove any artifical, though small, estimates. In addition, we do not estimate net ecosystem exchange (NEE) explicitly. One of the primary reasons is because their fluxes (and by extension, their magnitude of variability) are significantly lower than that of GPP and R_eco, making estimations from current input variables difficult and because the underlying drivers of GPP and R_eco can differ. Separating the fluxes will ensure better generalization across regimes. Nonetheless, since we use the night-time partitioning algorithm that extrapolates respiration-based NEE estimates (where GPP is assumed to be absent during night-time) to daytime⁵⁶, users are able to get an approximation of NEE by subtracting GPP from R_eco (i.e., R_eco - GPP). However, this approximation is still subject to broader validation, which we leave for future work.

Code availability

The meta-learning code is freely available and accessible at https://github.com/juannat7/metaflux. The repository contains notebooks that are customizable to one’s needs beyond the scope of this work. Further questions, feedback, or comments can be directed to the corresponding author.

References

Pastorello, G. et al. The fluxnet2015 dataset and the oneflux processing pipeline for eddy covariance data. Scientific data 7, 1–27 (2020).
Article Google Scholar
Poulter, B. et al. Contribution of semi-arid ecosystems to interannual variability of the global carbon cycle. Nature 509, 600–603 (2014).
Article CAS PubMed ADS Google Scholar
Ahlström, A. et al. The dominant role of semi-arid ecosystems in the trend and variability of the land co2 sink. Science 348, 895–899 (2015).
Article PubMed ADS Google Scholar
Sun, Q., Liu, Y., Chua, T.-S. & Schiele, B. Meta-transfer learning for few-shot learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 403–412 (2019).
Wang, Y., Yao, Q., Kwok, J. T. & Ni, L. M. Generalizing from a few examples: A survey on few-shot learning. ACM computing surveys (csur) 53, 1–34 (2020).
Google Scholar
Li, D., Yang, Y., Song, Y.-Z. & Hospedales, T. Learning to generalize: Meta-learning for domain generalization. In Proceedings of the AAAI conference on artificial intelligence, vol. 32 (2018).
Hospedales, T., Antoniou, A., Micaelli, P. & Storkey, A. Meta-learning in neural networks: A survey. IEEE transactions on pattern analysis and machine intelligence 44, 5149–5169 (2021).
Google Scholar
Tseng, G., Kerner, H., Nakalembe, C. & Becker-Reshef, I. Learning to predict crop type from heterogeneous sparse labels using meta-learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 1111–1120 (2021).
Pan, Z. et al. Spatio-temporal meta learning for urban traffic prediction. IEEE Transactions on Knowledge and Data Engineering 34, 1462–1476 (2020).
Article Google Scholar
Friedlingstein, P. et al. Global carbon budget 2020. Earth System Science Data 12, 3269–3340 (2020).
Article ADS Google Scholar
da Silva, A. F. et al. Netzeroco 2, an ai framework for accelerated nature-based carbon sequestration. In 2022 IEEE International Conference on Big Data (Big Data), 4881–4887 (IEEE, 2022).
Nathaniel, J., Liu, J. & Gentine, P. MetaFlux: Meta-learning global carbon fluxes from sparse spatiotemporal observations. Zenodo https://doi.org/10.5281/zenodo.7761881 (2023).
Finn, C., Abbeel, P. & Levine, S. Model-agnostic meta-learning for fast adaptation of deep networks. In International conference on machine learning, 1126–1135 (PMLR, 2017).
Nichol, A., Achiam, J. & Schulman, J. On first-order meta-learning algorithms. arXiv preprint arXiv:1803.02999 (2018).
Gardner, M. W. & Dorling, S. Artificial neural networks (the multilayer perceptron)–a review of applications in the atmospheric sciences. Atmospheric environment 32, 2627–2636 (1998).
Article CAS ADS Google Scholar
Buch, J., Williams, A. P., Juang, C. S., Hansen, W. D. & Gentine, P. Smlfire1. 0: a stochastic machine learning (sml) model for wildfire activity in the western united states. EGUsphere 1–39 (2022).
Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural computation 9, 1735–1780 (1997).
Article CAS PubMed Google Scholar
Schuster, M. & Paliwal, K. K. Bidirectional recurrent neural networks. IEEE transactions on Signal Processing 45, 2673–2681 (1997).
Article ADS Google Scholar
Li, C., Zhang, Y. & Ren, X. Modeling hourly soil temperature using deep bilstm neural network. Algorithms 13, 173 (2020).
Article CAS Google Scholar
Ganaie, M. A., Hu, M., Malik, A., Tanveer, M. & Suganthan, P. Ensemble deep learning: A review. Engineering Applications of Artificial Intelligence 115, 105151 (2022).
Article Google Scholar
Hersbach, H. et al. The era5 global reanalysis. Quarterly Journal of the Royal Meteorological Society 146, 1999–2049, https://doi.org/10.1002/qj.3803 (2020).
Article ADS Google Scholar
Justice, C. et al. The modis fire products. Remote sensing of Environment 83, 244–262, https://doi.org/10.1016/S0034-4257(02)00076-7 (2002).
Article ADS Google Scholar
Baraloto, C., Morneau, F., Bonal, D., Blanc, L. & Ferry, B. Seasonal water stress tolerance and habitat associations within four neotropical tree genera. Ecology 88, 478–489 (2007).
Article PubMed Google Scholar
Beck, H. E. et al. Present and future köppen-geiger climate classification maps at 1-km resolution. Scientific data 5, 1–12, https://doi.org/10.1038/sdata.2018.214 (2018).
Article Google Scholar
Poulter, B. et al. Plant functional type mapping for earth system models. Geoscientific Model Development 4, 993–1010, https://doi.org/10.5194/gmd-4-993-2011 (2011).
Article ADS Google Scholar
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. nature 521, 436–444 (2015).
Article CAS PubMed ADS Google Scholar
Zhang, Y., Joiner, J., Alemohammad, S. H., Zhou, S. & Gentine, P. A global spatially contiguous solar-induced fluorescence (csif) dataset using neural networks. Biogeosciences 15, 5779–5800, https://doi.org/10.6084/m9.figshare.6387494 (2018).
Article CAS ADS Google Scholar
Guanter, L. et al. The troposif global sun-induced fluorescence dataset from the sentinel-5p tropomi mission. Earth System Science Data 13, 5423–5440, https://doi.org/10.5270/esa-s5p_innovation-sif-20180501_20210320-v2.1-202104 (2021).
Article ADS Google Scholar
Jung, M. et al. Scaling carbon fluxes from eddy covariance sites to globe: synthesis and evaluation of the fluxcom approach. Biogeosciences https://doi.org/10.5194/bg-17-1343-2020 (2020).
Zhan, W. et al. Two for one: Partitioning co2 fluxes and understanding the relationship between solar-induced chlorophyll fluorescence and gross primary productivity using machine learning. Agricultural and Forest Meteorology 321, 108980 (2022).
Article ADS Google Scholar
Reich, P. B. et al. Boreal and temperate trees show strong acclimation of respiration to warming. Nature 531, 633–636 (2016).
Article CAS PubMed ADS Google Scholar
Berry, J. & Bjorkman, O. Photosynthetic response and adaptation to temperature in higher plants. Annual Review of plant physiology 31, 491–543 (1980).
Article Google Scholar
Zhang, K., Zhang, X., Song, H., Pan, H. & Wang, B. Air quality prediction model based on spatiotemporal data analysis and metalearning. Wireless Communications and Mobile Computing 2021, 1–11 (2021).
Article PubMed Google Scholar
Juang, C. S. et al. Rapid growth of large forest fires drives the exponential response of annual forest-fire area to aridity in the western united states. Geophysical Research Letters 49, e2021GL097131 (2022).
Article CAS PubMed PubMed Central ADS Google Scholar
Miralles, D. G., Teuling, A. J. & Van Heerwaarden, C. C. & Vilà-Guerau de Arellano, J. Mega-heatwave temperatures due to combined soil desiccation and atmospheric heat accumulation. Nature geoscience 7, 345–349 (2014).
Article CAS ADS Google Scholar
Falkenmark, M., Lundqvist, J. & Widstrand, C. Macro-scale water scarcity requires micro-scale approaches: Aspects of vulnerability in semi-arid development. In Natural resources forum, vol. 13, 258–267 (Wiley Online Library, 1989).
Singh, K. & Kushwaha, C. Emerging paradigms of tree phenology in dry tropics. Current Science 964–975 (2005).
Chen, X. et al. Vapor pressure deficit and sunlight explain seasonality of leaf phenology and photosynthesis across amazonian evergreen broadleaved forest. Global Biogeochemical Cycles 35, e2020GB006893 (2021).
Article CAS ADS Google Scholar
Chen, M. et al. Regional contribution to variability and trends of global gross primary productivity. Environmental Research Letters 12, 105005 (2017).
Article ADS Google Scholar
Nathaniel, J., Klein, L. J., Watson, C. D., Nyirjesy, G. & Albrecht, C. M. Aboveground carbon biomass estimate with physics-informed deep network. arXiv preprint arXiv:2210.13752 (2022).
Chen, Y. et al. Contrasting performance of the remotely-derived gpp products over different climate zones across china. Remote Sensing 11, 1855 (2019).
Article ADS Google Scholar
Falge, E. et al. Seasonality of ecosystem respiration and gross primary production as derived from fluxnet measurements. Agricultural and Forest Meteorology 113, 53–74 (2002).
Article ADS Google Scholar
Schwalm, C. R. et al. Photosynthetic light use efficiency of three biomes across an east–west continental-scale transect in canada. Agricultural and Forest Meteorology 140, 269–286 (2006).
Article ADS Google Scholar
Tagesson, T. et al. A physiology-based earth observation model indicates stagnation in the global gross primary production during recent decades. Global Change Biology 27, 836–854 (2021).
Article CAS PubMed ADS Google Scholar
Stocker, B. D. et al. Drought impacts on terrestrial primary production underestimated by satellite monitoring. Nature Geoscience 12, 264–270 (2019).
Article CAS ADS Google Scholar
Running, S. W. et al. A continuous satellite-derived measure of global terrestrial primary production. Bioscience 54, 547–560 (2004).
Article Google Scholar
Booth, B. B. et al. High sensitivity of future global warming to land carbon cycle processes. Environmental Research Letters 7, 024002 (2012).
Article ADS Google Scholar
Liang, S. et al. The global land surface satellite (glass) product suite. Bulletin of the American Meteorological Society 102, E323–E337 (2021).
Article Google Scholar
Allen, K. et al. Will seasonally dry tropical forests be sensitive or resistant to future changes in rainfall regimes? Environmental Research Letters 12, 023001 (2017).
Article ADS Google Scholar
Skulovich, O. & Gentine, P. A long-term consistent artificial intelligence and remote sensing-based soil moisture dataset. Scientific Data 10, 154 (2023).
Article PubMed PubMed Central Google Scholar
Miles, L. et al. A global overview of the conservation status of tropical dry forests. Journal of biogeography 33, 491–505 (2006).
Article Google Scholar
Hadley, N. F. & Szarek, S. R. Productivity of desert ecosystems. BioScience 31, 747–753 (1981).
Article Google Scholar
Sala, O. E., Gherardi, L. A., Reichmann, L., Jobbagy, E. & Peters, D. Legacies of precipitation fluctuations on primary production: theory and data synthesis. Philosophical Transactions of the Royal Society B: Biological Sciences 367, 3135–3144 (2012).
Article Google Scholar
Knapp, A. K., Ciais, P. & Smith, M. D. Reconciling inconsistencies in precipitation–productivity relationships: implications for climate change. New Phytologist 214, 41–47 (2017).
Article PubMed Google Scholar
Sanders, A. F. et al. Spaceborne sun-induced vegetation fluorescence time series from 2007 to 2015 evaluated with australian flux tower measurements. Remote Sensing 8, 895 (2016).
Article ADS Google Scholar
Reichstein, M. et al. On the separation of net ecosystem exchange into assimilation and ecosystem respiration: review and improved algorithm. Global change biology 11, 1424–1439 (2005).
Article ADS Google Scholar

Download references

Acknowledgements

We would like to thank Martin Jung and Ulrich Weber for giving us access to the latest Fluxcom dataset, and the two anonymous reviewers whose feedbacks have significantly improved the manuscript. The authors would like to acknowledge funding from the NSF LEAP Science and Technology Center award #2019625, Department of Energy grant, USMILE European Research Council grant and LEMONTREE Schmidt Futures funding.

Author information

Authors and Affiliations

Department of Earth and Environmental Engineering, Columbia University, New York, NY, 10027, USA
Juan Nathaniel, Jiangong Liu & Pierre Gentine
Climate School, Columbia University, New York, NY, 10027, USA
Pierre Gentine

Authors

Juan Nathaniel
View author publications
You can also search for this author in PubMed Google Scholar
Jiangong Liu
View author publications
You can also search for this author in PubMed Google Scholar
Pierre Gentine
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.G. conceived the idea and supervised the work, J.L. processed the data, J.N. and J.L. designed the experiments, J.N. conducted the experiments, analysed the results, and wrote the first manuscript draft. All authors reviewed the manuscript.

Corresponding author

Correspondence to Juan Nathaniel.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Nathaniel, J., Liu, J. & Gentine, P. MetaFlux: Meta-learning global carbon fluxes from sparse spatiotemporal observations. Sci Data 10, 440 (2023). https://doi.org/10.1038/s41597-023-02349-y

Download citation

Received: 24 March 2023
Accepted: 29 June 2023
Published: 11 July 2023
DOI: https://doi.org/10.1038/s41597-023-02349-y

Subjects

Abstract

Similar content being viewed by others

The FLUXNET2015 dataset and the ONEFlux processing pipeline for eddy covariance data

Eighteen years of upland grassland carbon flux data: reference datasets, processing, and gap-filling procedure

The FLUXCOM ensemble of global land-atmosphere energy fluxes

Background & Summary

Methods

Meta-learning: learning how to learn

Differentiable learners

Multilayer perceptron (MLP)

Long-Short Term Memory (LSTM)

Bi-directional LSTM (BiLSTM)

Training setup

Upscaling of global products

Evaluation on the site and global level

Data Records

Technical Validation

Evaluation of meta-learning as a learning framework

Convergence and site-level performance

Robustness under extreme conditions

Evaluation of meta-learned global data

Temporal analysis

Interannual variability

Comparison with Solar-induced fluorescence (SIF)

Usage Notes

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links