Abstract
We provide a global, long-term carbon flux dataset of gross primary production and ecosystem respiration generated using meta-learning, called MetaFlux. The idea behind meta-learning stems from the need to learn efficiently given sparse data by learning how to learn broad features across tasks to better infer other poorly sampled ones. Using a meta-trained ensemble of deep models, we generate global carbon products on daily and monthly timescales at a 0.25-degree spatial resolution from 2001 to 2021, through a combination of reanalysis and remote-sensing products. Site-level validation finds that MetaFlux ensembles have 5–7% lower validation error than their non-meta-trained counterparts. In addition, they are more robust to extreme observations, with 4–24% lower errors. We also evaluated the seasonality, interannual variability, and correlation with solar-induced fluorescence of the upscaled product and found that MetaFlux outperformed other machine-learning-based carbon products by 10–40%, especially in the tropics and semi-arid regions. Overall, MetaFlux can be used to study a wide range of biogeochemical processes.
Background & Summary
Data sparsity is a prevalent challenge in climate science and ecology. For example, in-situ observations tend to be spatially and temporally sparse due to sensor malfunctions, limited sensor locations, or non-ideal climate conditions such as persistent cloud cover. Consequently, understanding many climate processes can be difficult because the data do not capture the full natural variability in both space and time. FLUXNET2015 is a global network of eddy-covariance stations that captures carbon, water, and energy exchanges between the atmosphere and biosphere and provides high-quality ecosystem-scale observations spanning many climate and ecosystem types1. However, its coverage is neither continuous nor temporally dense1, especially in the years prior to 2000. Furthermore, its distribution across climate zones is not balanced, with only around 8% and 11% of the current operational stations located in the tropics and semi-arid regions, respectively, which are regions of critical importance for the global carbon cycle2. For instance, there is increasing evidence that most of the global interannual carbon variability can be attributed to the semi-arid ecosystems in the southern hemisphere3. Thus, the lack of high-resolution observations in these data-sparse, yet important areas may inhibit our overall understanding of the global carbon cycle, especially in light of climate change.
The machine-learning community has tried to tackle the data sparsity problem in many ways, including the development of several few-shot learning approaches4,5. One of these is the meta-learning approach that “learns how to learn from different tasks”. The idea behind this learning paradigm closely resembles how humans learn: we extract high-level features from previously learned tasks to quickly solve new problems. For instance, we can memorize a new person’s face with very few samples because we understand how a face should look after seeing many other faces. Although applications of meta-learning have been limited6,7, their popularity in the applied sciences has been growing8,9. However, as far as we know, little work has been done on the use of meta-learning in climate and environmental sciences, especially with regard to sparse and extreme spatiotemporal observations. In addition, to the best of our knowledge, there has been no upscaling effort to date that uses an ensemble of meta-trained deep models to produce a spatiotemporally continuous climate product from sparse observations. Given the importance of carbon fluxes in diagnosing the Earth’s changing climate10,11, there is a growing need for a globally continuous, high-resolution dataset that best represents critical regions that, unfortunately, tend to have few data points.
To bridge these gaps, we aim to (i) evaluate the performance of meta-learning in environments where spatiotemporal information is sparse, (ii) check its robustness when predicting extreme cases that are critical to the carbon cycle, and (iii) upscale point data to a globally continuous map using an ensemble of meta-trained deep models. In particular, we focus on gross primary production (GPP) and ecosystem respiration (Reco) as specific applications of the broader terrestrial carbon cycle. The upscaled product is resolved at 0.25-degree spatial resolution and at either daily or monthly temporal resolution between the years 2001 and 2021. Preliminary analysis shows that our global product (“MetaFlux”) is internally consistent in terms of its seasonality, interannual trend, and variability, and has a higher correlation with satellite-based solar-induced fluorescence (SIF), a proxy for photosynthesis, than other data-driven products, especially in the critical tropical and semi-arid regions.
Finally, the dataset is freely accessible on Zenodo at https://doi.org/10.5281/zenodo.7761881 (ref. 12), while the meta-learning code can be obtained, reproduced, and extended from https://github.com/juannat7/metaflux, which includes example notebooks for applying our approach to other use cases. The overall methodology is summarized in Fig. 1.
Methods
Meta-learning: learning how to learn
Meta-learning is a machine-learning paradigm that trains a model to learn new tasks from sparse data efficiently13, leveraging information that comes from tasks with more available data. In general and as illustrated in Fig. 3a, meta-learning involves two stages: a meta-training and a meta-update stage. During meta-training, the model, fθ (a function f that is parameterized by θ), proposes intermediate parameters, ϕ, that minimize the loss of the base tasks14. In the meta-update step, the model fine-tunes its parameters, θ, given ϕ and the target tasks14, seeking the optimal parameters θ*. We define target tasks as a collection of stations in data-sparse regions, including the tropics, semi-arid regions, and representative stations in each ecoregion defined by plant functional types (PFTs), while the base tasks consist of the complement of the former (i.e. stations in data-abundant regions). Given the two-step gradient update procedure in the meta-training and meta-update loops, the optimal parameters, θ*, are not biased toward data-abundant base tasks, as illustrated in Fig. 3a. In the baseline case (Fig. 3b), by contrast, the optimal learned parameters would be biased toward data-abundant tasks because each data point contributes similarly to the model’s learning. The details on how the data are split, including how the base (Dbase) and target (Dtarget) datasets are divided for training and testing, are provided in the “Training setup” section below and illustrated in Fig. 4. For this work, we use an optimization-based meta-learning approach that is adapted from model-agnostic meta-learning (MAML)13, as detailed in Fig. 2a.
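The two-stage update described above can be sketched on a toy problem. The snippet below is only an illustration and not the MetaFlux implementation: it assumes a scalar linear model, a first-order approximation of the meta-gradient (the second-order term of MAML is ignored), and hypothetical names (`mse_grad`, `meta_step`, `inner_lr`, `outer_lr`) and data.

```python
def mse_grad(theta, xs, ys):
    """Gradient of the mean squared error for the scalar model y_hat = theta * x."""
    return sum(2.0 * (theta * x - y) * x for x, y in zip(xs, ys)) / len(xs)


def meta_step(theta, base, target, inner_lr=0.01, outer_lr=0.05):
    """One meta-training step followed by one meta-update step (first-order)."""
    base_x, base_y = base
    target_x, target_y = target
    # Meta-training: propose intermediate parameters phi from the base task.
    phi = theta - inner_lr * mse_grad(theta, base_x, base_y)
    # Meta-update: adjust theta using the target-task loss evaluated at phi.
    # First-order approximation (assumption): d(phi)/d(theta) is treated as 1.
    theta_new = theta - outer_lr * mse_grad(phi, target_x, target_y)
    return theta_new, phi


# Hypothetical toy tasks: a data-abundant base task (slope 2) and a
# data-sparse target task (slope 3).
base_task = ([1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 6.0, 8.0])
target_task = ([1.0, 2.0], [3.0, 6.0])

theta = 0.0
for _ in range(200):
    theta, phi = meta_step(theta, base_task, target_task)

print(f"theta = {theta:.3f}, adapted phi = {phi:.3f}")
```

In this toy setting the adapted parameter `phi` settles at the target-task optimum even though the base task contributes more samples, which mirrors the argument that the learned parameters need not be dominated by data-abundant tasks.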
Differentiable learners
Next, we discuss the deep learning models used in this work: a multilayer perceptron (MLP), a long short-term memory (LSTM) network, and a bi-directional LSTM (BiLSTM).
Multilayer perceptron (MLP)
MLP, or the feedforward artificial neural network, is a fully connected deep model that can capture nonlinear relationships between inputs and the response variable15. Generally, an MLP consists of input and output layers with several hidden layers in between, each followed by a nonlinear activation function. In this paper, we use the Leaky Rectified Linear Unit (LeakyReLU), which is formulated in Eq. 1, where α controls the extent of “leakiness” in the negative x direction. The MLP model receives instantaneous weather data and vegetation index.
\(f(x)=\left\{\begin{array}{ll}x, & x\ge 0\\ \alpha x, & x < 0\end{array}\right.\)  (1)
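As a minimal sketch of the LeakyReLU activation described above, the function below applies the identity to non-negative inputs and scales negative inputs by α. The default `alpha=0.01` is an assumption for illustration; the value used for MetaFlux is not stated here.

```python
def leaky_relu(x, alpha=0.01):
    """LeakyReLU: x for x >= 0, alpha * x otherwise.

    alpha=0.01 is an assumed default, not a value taken from this work.
    """
    return x if x >= 0 else alpha * x


# A larger alpha lets more of the negative signal "leak" through.
for value in (-2.0, 0.0, 3.0):
    print(value, leaky_relu(value), leaky_relu(value, alpha=0.1))
```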
Long Short-Term Memory (LSTM)
Time series representing environmental processes tend to be strongly autocorrelated in time16, and Recurrent Neural Networks (RNNs) were first introduced to model such temporal dependence. The LSTM model was then proposed by Hochreiter and Schmidhuber17 to address the issues of vanishing and exploding gradients commonly observed in RNNs. The model can preserve long-term dependencies of sequential data through its gated structure that controls how information flows across cells. It is able to leverage associations across multiple timesteps to inform inferential tasks where time dependency is present and significant. This can be especially useful to represent water stress, for instance, which depends not only on current daily precipitation but also on previous time steps of precipitation (water supply) and evaporative demand (temperature, radiation, humidity).
Bi-directional LSTM (BiLSTM)
The BiLSTM model18 is trained on both forward and backward timesteps to best estimate the value at the current timestep, t, similar to reanalysis products (in contrast to weather forecasts, which, like LSTMs, only use past information). BiLSTM has been used in cases where past and future contexts are equally important19. The equations describing a BiLSTM cell are similar to those of the LSTM, with a slight modification in the hidden state representations, which have to capture both the forward and backward timesteps.
Training setup
We train an ensemble of meta-trained deep networks. The purpose of training an ensemble is to quantify uncertainty and reduce the bias of each individual model20. The final model architecture and hyperparameters, including batch size and learning rates, are determined after performing a k-fold cross validation (k = 5) on the training set. In all, we use a 3-layer MLP with a hidden size of 350. We replace the first layer with either the LSTM or BiLSTM modules for the LSTM and BiLSTM models respectively to capture temporal features prior to the final prediction layer. The optimized ensemble is trained by minimizing the mean squared error (MSE). We train our models to estimate GPP and Reco at daily and monthly timescales across 206 FLUXNET2015 sites1 (retrievable from https://fluxnet.org/data/fluxnet2015-dataset/) using a combination of meteorological and remote sensing inputs including precipitation, air temperature at 2 meters (Ta), vapor pressure deficit (VPD), and incoming short-wave radiation from ERA5 reanalysis data21 (retrievable from https://cds.climate.copernicus.eu/) and leaf area index (LAI) from the Moderate Resolution Imaging Spectroradiometer (MODIS) product22 (retrievable from https://modis-land.gsfc.nasa.gov/). We retrieve the associated time-series closest to each tower site. In particular, we use the night-time partitioning method for GPP and Reco and match each flux record with reanalysis and remote sensing data corresponding to the station of interest. Aside from the target variable, we perform a z-normalization of the inputs to improve the learning process (Eq. 2).
\({X}_{norm}=\frac{X-E(X)}{{\sigma }_{X}}\)  (2)
where E(X) is the estimated expected value of the variable X on our training data, and σX the corresponding sample standard deviation.
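The z-normalization above can be sketched as follows. This is a minimal illustration under the stated definition (mean and sample standard deviation estimated on the training data and then reused for other data); the function names and the temperature values are hypothetical.

```python
import statistics


def fit_zscore(train_values):
    """Estimate E(X) and the sample standard deviation on the training data."""
    return statistics.fmean(train_values), statistics.stdev(train_values)


def apply_zscore(values, mean, std):
    """Normalize values with statistics estimated on the training data."""
    return [(v - mean) / std for v in values]


# Hypothetical air-temperature values (degrees Celsius).
train_ta = [10.0, 12.0, 14.0, 16.0, 18.0]
test_ta = [14.0, 20.0]

ta_mean, ta_std = fit_zscore(train_ta)
train_norm = apply_zscore(train_ta, ta_mean, ta_std)
# The test set reuses the training statistics to avoid information leakage.
test_norm = apply_zscore(test_ta, ta_mean, ta_std)
print(train_norm, test_norm)
```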
Next, we define class Ci as a set of batches consisting of input variables and target flux observations, with i denoting the index of batches of size 256. The definition of a batch differs between the linear MLP and the time-series models, such as LSTM and BiLSTM. A batch, in the linear MLP case, refers to a collection of instantaneous data points, while in time-series models, it corresponds to a set of 30-day continuous data points. The choice of a 30-day window to form a single data point in the latter case is to better capture seasonal water stress23. We construct Dtarget by randomly selecting half of the stations in the tropical and semi-arid regions (defined by the Köppen classification24, retrievable from http://www.gloh2o.org/koppen/), which are sparse, and one station from each plant functional type (PFT) including those in the cropland and boreal areas25. By extension, Dbase consists of the stations that form the complement of Dtarget. Each dataset is divided into training (Dtrain) and testing (Dtest) sets with an 80:20 split ratio. As the terms suggest, the former is used to train the model fθ, while the latter is used to validate the performance of the model. Overall, the set of all possible datasets, D, includes \(\left({D}_{train}^{base},{D}_{test}^{base},{D}_{train}^{target},{D}_{test}^{target}\right)\). Figure 4 illustrates how the dataset for meta-learning is constructed, with data from the base tasks being used for meta-training and data from the target tasks for the meta-update steps. We meta-train three models (MLP, LSTM, and BiLSTM) with an ensemble of five members each, whose individual weights are randomly initialized.
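The dataset construction described above can be sketched as follows. This is a simplified illustration rather than the code used for MetaFlux: the station records, the dictionary keys (`id`, `climate`, `pft`), the function names, and the use of a shuffled 80:20 split are all assumptions.

```python
import random


def split_base_target(stations, sparse_zones=("tropical", "semi-arid"), seed=0):
    """Split station records into (base, target) lists.

    Each station is a dict with the hypothetical keys "id", "climate", "pft".
    """
    rng = random.Random(seed)
    # Randomly select half of the stations in the sparse climate zones.
    sparse = [s for s in stations if s["climate"] in sparse_zones]
    target_ids = {s["id"] for s in rng.sample(sparse, len(sparse) // 2)}
    # Add one randomly chosen station from each plant functional type.
    by_pft = {}
    for s in stations:
        by_pft.setdefault(s["pft"], []).append(s)
    for pft in sorted(by_pft):
        target_ids.add(rng.choice(by_pft[pft])["id"])
    target = [s for s in stations if s["id"] in target_ids]
    base = [s for s in stations if s["id"] not in target_ids]
    return base, target


def train_test_split(records, train_fraction=0.8, seed=0):
    """Shuffle records and split them into training and testing subsets."""
    rng = random.Random(seed)
    shuffled = list(records)
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


# Hypothetical station metadata for illustration only.
stations = [
    {"id": "S1", "climate": "tropical", "pft": "EBF"},
    {"id": "S2", "climate": "tropical", "pft": "EBF"},
    {"id": "S3", "climate": "semi-arid", "pft": "SAV"},
    {"id": "S4", "climate": "semi-arid", "pft": "GRA"},
    {"id": "S5", "climate": "temperate", "pft": "DBF"},
    {"id": "S6", "climate": "temperate", "pft": "CRO"},
    {"id": "S7", "climate": "continental", "pft": "ENF"},
    {"id": "S8", "climate": "continental", "pft": "GRA"},
    {"id": "S9", "climate": "temperate", "pft": "DBF"},
    {"id": "S10", "climate": "continental", "pft": "ENF"},
    {"id": "S11", "climate": "temperate", "pft": "CRO"},
    {"id": "S12", "climate": "temperate", "pft": "DBF"},
]

base, target = split_base_target(stations)
train_days, test_days = train_test_split(range(10))
print(len(base), len(target), len(train_days), len(test_days))
```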
The non-meta-learning baseline uses identical architectures and hyperparameters, but the models do not have a meta-update outer loop (see Fig. 2b). This learning paradigm is similar to a single-step gradient descent learning approach26 where we backpropagate the gradient of the loss function to update fθ. This learning mode, however, can be biased to representations of tasks that have a lot more data. To ensure that a similar data structure is used in the baseline case, we compile \(\left({D}_{train}^{base},{D}_{train}^{target}\right)\) as the training set and \(\left({D}_{test}^{base},{D}_{test}^{target}\right)\) as the testing set.
Upscaling of global products
For the upscaling portion of this work, we use a similar set of meteorological and remote sensing inputs as during training at either the daily or monthly timesteps. Since VPD is not available in the existing ERA5 catalogue, we estimate it from air and dewpoint temperatures through the saturated (SVP) and actual vapor pressure (AVP) relation: VPD = SVP − AVP, where SVP and AVP are functions of Ta and the dewpoint temperature (Td), respectively. Finally, the spatial resolution of the resulting data inputs is harmonized to 0.25-degree using arithmetic averaging. The final product has four variables: the ensemble mean estimates of GPP and Reco, and their uncertainties as captured by the ensemble standard deviation.
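The VPD relation above can be sketched as follows. The text does not state which saturation vapor pressure formula is used, so the snippet assumes the Tetens-type formula from FAO-56 (pressures in kPa, temperatures in degrees Celsius); the function names are hypothetical, and the Kelvin conversion is included because ERA5 reports temperatures in Kelvin.

```python
import math


def saturation_vapor_pressure_kpa(temp_c):
    """Saturation vapor pressure (kPa) at temp_c (deg C), Tetens-type formula.

    The choice of this formula is an assumption made for illustration.
    """
    return 0.6108 * math.exp(17.27 * temp_c / (temp_c + 237.3))


def vpd_kpa(ta_c, td_c):
    """VPD = SVP(Ta) - AVP, with AVP taken as SVP evaluated at the dewpoint."""
    return saturation_vapor_pressure_kpa(ta_c) - saturation_vapor_pressure_kpa(td_c)


def vpd_from_kelvin(ta_k, td_k):
    """Same as vpd_kpa, for air and dewpoint temperatures given in Kelvin."""
    return vpd_kpa(ta_k - 273.15, td_k - 273.15)


# Hypothetical example: 25 deg C air temperature with a 15 deg C dewpoint.
print(f"VPD = {vpd_kpa(25.0, 15.0):.2f} kPa")
```

When the dewpoint equals the air temperature the air is saturated and the sketch returns a VPD of zero.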
Evaluation on the site and global level
First, we compare the performance of meta-trained versus non-meta-trained models in terms of their RMSE scores on the testing sets. In addition, we evaluate how robust meta-trained models are in predicting extreme fluxes. This is done by selecting GPP or Reco fluxes that exceed a predefined z-normalized threshold, t, that we vary between 1.0 and 2.0 (i.e. higher threshold means more extreme observations away from the mean value).
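The extreme-value evaluation above can be sketched as follows. This is a minimal illustration with hypothetical flux values; it assumes a one-sided selection (observations whose z-score exceeds t) and computes the z-scores from the observations passed in, which may differ from the normalization statistics used in this work.

```python
import math
import statistics


def rmse(obs, pred):
    """Root mean squared error between paired observations and predictions."""
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(obs, pred)) / len(obs))


def rmse_above_threshold(obs, pred, t):
    """RMSE restricted to observations whose z-score exceeds t.

    Returns None when no observation exceeds the threshold.
    """
    mean = statistics.fmean(obs)
    std = statistics.stdev(obs)
    pairs = [(o, p) for o, p in zip(obs, pred) if (o - mean) / std > t]
    if not pairs:
        return None
    selected_obs, selected_pred = zip(*pairs)
    return rmse(selected_obs, selected_pred)


# Hypothetical daily GPP observations and predictions (gC m-2 d-1).
gpp_obs = [1.0, 2.0, 3.0, 4.0, 10.0]
gpp_pred = [1.0, 2.0, 3.0, 4.0, 8.0]

for threshold in (1.0, 1.5, 2.0):
    print(threshold, rmse_above_threshold(gpp_obs, gpp_pred, threshold))
```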
Next, we evaluate the upscaled product by analyzing its seasonality and interannual trends across climate zones. Thereafter, we compute the interannual variability using the interannual coefficient of variation (CV; Eq. 3) at the pixel level:
\(CV=\frac{\sigma }{\mu }\)  (3)
where σ and μ are the interannual standard deviation and mean, respectively.
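A minimal sketch of the pixel-level coefficient of variation is given below. The annual values and pixel keys are hypothetical, and the use of the population standard deviation is an assumption, since the estimator is not specified here.

```python
import math
import statistics


def interannual_cv(annual_values):
    """Coefficient of variation: interannual std divided by interannual mean.

    Uses the population standard deviation (an assumption) and returns NaN
    when the mean is zero, for example over masked or unproductive pixels.
    """
    mu = statistics.fmean(annual_values)
    if mu == 0:
        return math.nan
    return statistics.pstdev(annual_values) / mu


# Hypothetical annual GPP totals keyed by (latitude, longitude) pixel centers.
annual_gpp = {
    (-23.125, 133.875): [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0],
    (-3.125, -60.125): [30.0, 31.0, 29.0, 30.0, 30.0, 31.0, 29.0, 30.0],
}

cv_map = {pixel: interannual_cv(values) for pixel, values in annual_gpp.items()}
print(cv_map)
```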
Finally, the Pearson correlation coefficient between GPP and solar-induced fluorescence (SIF) from CSIF27 (retrievable from https://figshare.com/articles/dataset/CSIF/6387494) and TROPOMI SIF28 (retrievable from http://ftp.sron.nl/open-access-data-2/TROPOMI/tropomi/sif/v2.1/l2b/) is calculated across climate zones on a monthly timescale, for the periods 2001–2018 and 2019–2020, respectively. To benchmark our product, we compare our GPP-SIF correlation estimate, r(GPPmetaflux, SIF), with that of the Fluxcom data-driven product29, r(GPPfluxcom, SIF), between the years 2001 and 2020. Generally speaking, a higher correlation corresponds to a better GPP estimate, though this is not always the case as different ecosystem regimes and physiological characteristics may manifest different associative patterns30.
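The GPP-SIF comparison above can be sketched with a plain implementation of the Pearson correlation coefficient applied per climate zone. The monthly values and zone names below are hypothetical and serve only to illustrate the calculation.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical monthly means per climate zone: (GPP, SIF) pairs of series.
monthly = {
    "temperate": ([1.0, 2.0, 4.0, 6.0, 4.0, 2.0], [0.1, 0.2, 0.4, 0.6, 0.4, 0.2]),
    "tropics": ([7.0, 7.2, 6.9, 7.1, 7.0, 6.8], [0.50, 0.49, 0.52, 0.50, 0.51, 0.50]),
}

for zone, (gpp, sif) in monthly.items():
    print(zone, round(pearson_r(gpp, sif), 3))
```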
Data Records
The global products amount to around 50 GB and are freely accessible on Zenodo at https://doi.org/10.5281/zenodo.7761881 (ref. 12). The spatial resolution is 0.25-degree, extending from 90-degree north to 90-degree south, and from 180-degree west to 180-degree east. We mask out cold regions that consist of the Arctic circle and Antarctica. Each Network Common Data Form (NetCDF) file contains four variables: GPP, Reco, GPP_std, and Reco_std, which represent the GPP and Reco ensemble means and their uncertainties, respectively. Temporally, each file is resolved at either the daily or monthly timescale. For instance, Fig. 5 illustrates the annual ensemble mean, while Fig. 6 shows the ensemble uncertainties of GPP and Reco for the year 2021. We note that GPP tends to have higher uncertainty than Reco, especially near the equator and in higher-latitude regions.
For the daily product, the naming convention for each .nc file is METAFLUX_GPP_RECO_daily_<year><month>.nc, where <year> takes a value between 2001 and 2021 and <month> a value between 01 (January) and 12 (December).
For the monthly product, we perform identical training and upscaling steps but using monthly, rather than daily fluxes, reanalysis, and remote sensing products. The naming convention for each file is METAFLUX_GPP_RECO_monthly_<year>.nc; where <year> takes a value between 2001 and 2021.
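The naming conventions above can be turned into small helpers that build the expected file names. The function names are hypothetical; the patterns and value ranges follow the description above.

```python
def daily_filename(year, month):
    """File name of the daily product for a given year and month."""
    if not 2001 <= year <= 2021:
        raise ValueError("year must be between 2001 and 2021")
    if not 1 <= month <= 12:
        raise ValueError("month must be between 1 and 12")
    return f"METAFLUX_GPP_RECO_daily_{year}{month:02d}.nc"


def monthly_filename(year):
    """File name of the monthly product for a given year."""
    if not 2001 <= year <= 2021:
        raise ValueError("year must be between 2001 and 2021")
    return f"METAFLUX_GPP_RECO_monthly_{year}.nc"


# List every daily file for 2021, followed by the monthly file for that year.
files_2021 = [daily_filename(2021, m) for m in range(1, 13)]
files_2021.append(monthly_filename(2021))
print(files_2021[0], files_2021[-1])
```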
Technical Validation
In this section, we first evaluate our meta-learning approach based on site-level validation RMSE and its robustness to extreme observations. Next, we examine the seasonality, interannual trend and variability, and correlation with independent SIF products.
Evaluation of meta-learning as a learning framework
Convergence and site-level performance
As illustrated in Fig. 7 and Table 1, meta-trained deep models generally perform better than their baseline non-meta-trained counterparts. For instance, the validation RMSE of the meta-trained MLP on GPP is 3.13 ± 0.06 gC m−2 d−1 as compared to 3.47 ± 0.07 gC m−2 d−1 in the baseline case. A similar result is observed for Reco, where the RMSE of the meta-trained MLP is 3.07 ± 0.05 gC m−2 d−1 as compared to 3.31 ± 0.07 gC m−2 d−1 in the baseline case. In addition, the choice of deep networks matters. Overall, models that incorporate temporal information, i.e. the LSTM and BiLSTM models, perform better than models that do not. In the GPP case, for example, the non-meta-trained BiLSTM model has the lowest validation error of 3.00 ± 0.04 gC m−2 d−1, followed by the meta-trained LSTM model with an RMSE of 3.06 ± 0.06 gC m−2 d−1. This confirms our physical intuition that water stress, which tends to regulate productivity, builds up over many days to months and thus requires a memory process as captured by the recurrent neural networks. Moreover, plant photosynthesis and respiration can acclimate to the prevailing environmental conditions, such as temperature, light and VPD31,32, which tend to be captured more effectively by memory-informed models. Nonetheless, the addition of bi-directionality in the BiLSTM model does not appear to significantly reduce error in the meta-trained models. This may be because the concept of data assimilation from future context has already been captured through the process of meta-learning itself, or because the signal coming from the unidirectional time series is sufficiently saturated to parameterize the model. In other words, since our meta-learning approach primarily considers the spatial heterogeneity of the fluxes (e.g., across climate zones and PFTs), this spatial information, along with the temporal signals coming from BiLSTM gradient steps, results in more unstable learning due to signal oversaturation, which is evident from the larger convergence spread across model runs. This can be regularized by considering not just the spatial, but also the spatiotemporal heterogeneity in a meta-learning approach33, though this would increase the complexity of the algorithm and could potentially limit its extrapolation capacity. This remains the subject of future work.
Robustness under extreme conditions
Making accurate estimates for extreme cases is especially important in climate science because extreme weather tends to cause catastrophic damage, such as major droughts, wildfires, or plant mortality34,35. Fig. 8 illustrates the performance of our meta-trained models under an increasing magnitude of extremes as defined by the z-normalized threshold, t. In general, our meta-trained models (orange line) are more robust in predicting extreme cases of observed GPP and Reco (i.e. lower validation RMSE) than their baseline counterparts (blue line), with a difference of around 1.2 gC m−2 d−1 and 0.7 gC m−2 d−1 for GPP and Reco, respectively.
If we further examine model performance across climate zones (Tables 2, 3) and select extreme fluxes with a normalized-target threshold, t, that is greater than 1.0, we find that our meta-trained models outperform the baselines. We choose this threshold to retain sufficient extreme observations across climate zones so that a more meaningful comparison can be made. In the GPP case, for example, the meta-trained ensemble has a lower validation RMSE of 3.78 ± 0.33 gC m−2 d−1 (versus 4.10 ± 0.29 gC m−2 d−1) and 3.04 ± 0.02 gC m−2 d−1 (versus 3.45 ± 0.06 gC m−2 d−1) in the semi-arid regions and the tropics, respectively. A similar finding is observed for Reco, where the meta-trained ensemble has a lower validation RMSE of 2.35 ± 0.04 gC m−2 d−1 (versus 2.65 ± 0.06 gC m−2 d−1) in the tropics. These results are promising as the representation of both the tropics and semi-arid regions in many upscaled products is often challenging due to the limited number of observations available and the complex, memory-like processes involved. For example, in the semi-arid regions, there is a build-up of time-dependent water stresses36, while in the tropics, there is a complex seasonal cycle of leaf flushing and phenology37,38. Our approach better reproduces carbon fluxes in the tropics and semi-arid areas because the limited data there are enriched with shared information coming from data-abundant regions through meta-learning.
Evaluation of meta-learned global data
Now that we have validated our meta-learning framework on the site level, we proceed to evaluate the internal consistency of our upscaled product. This includes the analysis of seasonality, interannual variability, and comparison to SIF as an independent photosynthesis product.
Temporal analysis
First, we analyze the seasonality of our upscaled GPP and Reco across months for the years between 2001 and 2021. As shown in Fig. 9, both fluxes exhibit similar seasonality albeit at different magnitudes. The tropics (including the dry and wet regions) contribute the most to the global GPP and Reco, as expected39,40, while the semi-arid regions contribute the least41. Carbon fluxes in the temperate (northern hemisphere) and continental regions exhibit unimodal variations that peak in the summer (June, July, and August - JJA), while those in the southern temperate regions peak in December, January, and February (DJF)42. On average, the temperate regions have higher carbon fluxes than the continental areas, which tend to be limited by light and temperature, with shorter growing seasons43.
Another interesting analysis is to understand the long-term trends of our global carbon fluxes. As observed in Fig. 10, our meta-trained global carbon product shows an overall increase in GPP by 0.0113 PgC yr−1 and Reco by 0.0101 PgC yr−1. We extend Fig. 10 by making a comparison with other carbon flux products, including those from light response function (LRF)44, P-model45, MODIS17 (MOD17)46, Soil Moisture Active Passive (SMAP)47, vegetation photosynthesis model (VPM)27, and Global LAnd Surface Satellite (GLASS)48 for GPP, as well as Fluxcom29 for both GPP and Reco. Overall, they show similar peaks and declines, albeit at varying magnitudes (between 100 and 140 PgC yr−1) as shown in Fig. 11.
Interannual variability
We find that the semi-arid regions of Australia, America, and some parts of the northern latitudes have the largest interannual variability of GPP and Reco (Fig. 12). This is consistent with results from Poulter et al.2 and Ahlström et al.3, who reported a significant contribution of these regions, particularly the Australian ecosystems, in explaining much of the global carbon interannual variability. As a result, the high turnover rate of carbon pools in these semi-arid environments warrants further research into how climate and anthropogenic factors can account for this large interannual variability, such as the extent of carbon stock decomposition (e.g. due to wildfire) and accumulation during the dry and wet seasons. In addition, our upscaled product shows high interannual variability in the dry tropical regions of Asia, which can be attributed to the region’s sensitivity to rainfall patterns driven by the El Niño-Southern Oscillation (ENSO), or soil moisture49,50 and rapid land-use changes51. However, this variation becomes smaller in the tropical forests of Asia, Africa, and America owing to their relatively stable climate. In contrast to Fluxcom, our upscaled product does not show as much interannual variability, especially in desert regions (e.g. Australia, Central America, South America, and Central Asia), which may be more accurate owing to the extremely low primary productivity there in the first place52. Nonetheless, we note that in some parts of the globe, especially along the Sahel and continental Western Europe, the interannual variability of carbon fluxes from MetaFlux is smaller. There are several plausible reasons for this. First, physically, this phenomenon has been reported in refs. 53 and 54, which observe that variations in terrestrial carbon productivity tend to be stronger in space than in time. Second, the ensemble captures much of this variability (i.e. expressed as standard deviation), where each member model learns a different temporal structure, which can result in lower than expected mean interannual variability. Lastly, and as highlighted in Figs. 13, 14, meta-learning attempts to learn efficiently from historically underrepresented regions, such as the tropics, which tend to have low interannual variability. This potentially results in a reduction in such variability at higher latitudes, especially along the temperate and continental regions.
Comparison with Solar-induced fluorescence (SIF)
In order to evaluate the quality of the seasonal cycle of our product, in particular GPP, we measure its correlation coefficient with several SIF products. MetaFlux GPP demonstrates a higher Pearson correlation coefficient with both CSIF and TROPOSIF (Figs. 13, 14, Tables 4, 5) than Fluxcom GPP across the temperate, semi-arid, and tropical regions, with values higher than 0.8 to 0.9 even at the very northern latitudes. In particular, the correlation coefficients of our upscaled product with TROPOSIF in the semi-arid, tropical, and temperate regions are 0.856 ± 0.083 (versus 0.726 ± 0.165), 0.546 ± 0.299 (versus 0.343 ± 0.164), and 0.919 ± 0.002 (versus 0.826 ± 0.021), respectively. A similar trend is also observed in the CSIF case, where the correlations in the semi-arid, tropical, and temperate regions are 0.925 ± 0.026 (versus 0.914 ± 0.060), 0.772 ± 0.080 (versus 0.608 ± 0.105), and 0.922 ± 0.022 (versus 0.914 ± 0.037), respectively. Across the two SIF products, however, we observe weaker correlations in the continental regions. Upon further inspection of Figs. 13, 14, the weaker association could be due to the slight uptick of our GPP estimate during the DJF period, which can be attributed to the lower quality of LAI retrievals because of snow cover.
Finally, we inspect the pixel-level correlation distribution of MetaFlux and Fluxcom with the long-term CSIF product, as illustrated in Fig. 15. In general, MetaFlux has lower correlation with SIF in the tropical rainforests of Indonesia and the Amazon as well as in arid regions, including Australia, the Gobi, Arabian, Syrian, Karakum, and Taklamakan deserts, and the Great Plains of North America. This trend is consistent with earlier reports (e.g. ref. 55) showing that arid and extremely wet tropical regions (e.g. rainforests) tend to have low GPP-SIF correlation because weak seasonality essentially drops correlations to the background noise level.
In summary, we have developed a new terrestrial carbon flux product, MetaFlux, using an ensemble of meta-learned deep networks. We have demonstrated that meta-learning can better estimate fluxes in data-sparse, yet critical regions (e.g. semi-arid regions and the tropics) and is more robust when predicting extreme observations. Our global product outperforms other reference products when evaluated against independent measurements, such as SIF, or against flux tower networks. We believe that although data sparsity can be a major limiting factor to our complete understanding of many climate processes, leveraging knowledge from other similar domains can be a powerful way to better understand processes and their response to the environment.
Usage Notes
The data are permanently stored on Zenodo at https://doi.org/10.5281/zenodo.7761881 (ref. 12) and are available at either the monthly or daily temporal scale. Each file contains four variables, GPP, Reco, GPP_std, and Reco_std, that are resolved continuously at a 0.25-degree spatial resolution. We purposely mask out the cold regions of Antarctica and the Arctic circle because we assume that GPP and Reco are absent there. Although we do not mask out the arid regions (e.g. deserts), we recommend that users do so in order to remove any artificial, though small, estimates. In addition, we do not estimate net ecosystem exchange (NEE) explicitly. One of the primary reasons is that NEE (and, by extension, its magnitude of variability) is significantly smaller than GPP and Reco, making its estimation from the current input variables difficult; another is that the underlying drivers of GPP and Reco can differ. Separating the fluxes ensures better generalization across regimes. Nonetheless, since we use the night-time partitioning algorithm that extrapolates respiration-based NEE estimates (where GPP is assumed to be absent during night-time) to daytime56, users are able to get an approximation of NEE by subtracting GPP from Reco (i.e., Reco - GPP). However, this approximation is still subject to broader validation, which we leave for future work.
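The NEE approximation mentioned above can be sketched as an element-wise difference. The snippet uses plain Python lists and hypothetical flux values for illustration; in practice the same subtraction would be applied to the GPP and Reco variables read from the NetCDF files.

```python
def approximate_nee(gpp, reco):
    """Approximate NEE as Reco - GPP, element by element.

    With this sign convention, negative values indicate net carbon uptake.
    """
    if len(gpp) != len(reco):
        raise ValueError("gpp and reco must have the same length")
    return [r - g for g, r in zip(gpp, reco)]


# Hypothetical daily fluxes (gC m-2 d-1) for a single pixel.
gpp_values = [5.0, 3.0, 0.5]
reco_values = [4.0, 3.5, 0.5]
print(approximate_nee(gpp_values, reco_values))
```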
Code availability
The meta-learning code is freely available and accessible at https://github.com/juannat7/metaflux. The repository contains notebooks that are customizable to one’s needs beyond the scope of this work. Further questions, feedback, or comments can be directed to the corresponding author.
References
Pastorello, G. et al. The fluxnet2015 dataset and the oneflux processing pipeline for eddy covariance data. Scientific data 7, 1–27 (2020).
Poulter, B. et al. Contribution of semi-arid ecosystems to interannual variability of the global carbon cycle. Nature 509, 600–603 (2014).
Ahlström, A. et al. The dominant role of semi-arid ecosystems in the trend and variability of the land co2 sink. Science 348, 895–899 (2015).
Sun, Q., Liu, Y., Chua, T.-S. & Schiele, B. Meta-transfer learning for few-shot learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 403–412 (2019).
Wang, Y., Yao, Q., Kwok, J. T. & Ni, L. M. Generalizing from a few examples: A survey on few-shot learning. ACM computing surveys (csur) 53, 1–34 (2020).
Li, D., Yang, Y., Song, Y.-Z. & Hospedales, T. Learning to generalize: Meta-learning for domain generalization. In Proceedings of the AAAI conference on artificial intelligence, vol. 32 (2018).
Hospedales, T., Antoniou, A., Micaelli, P. & Storkey, A. Meta-learning in neural networks: A survey. IEEE transactions on pattern analysis and machine intelligence 44, 5149–5169 (2021).
Tseng, G., Kerner, H., Nakalembe, C. & Becker-Reshef, I. Learning to predict crop type from heterogeneous sparse labels using meta-learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 1111–1120 (2021).
Pan, Z. et al. Spatio-temporal meta learning for urban traffic prediction. IEEE Transactions on Knowledge and Data Engineering 34, 1462–1476 (2020).
Friedlingstein, P. et al. Global carbon budget 2020. Earth System Science Data 12, 3269–3340 (2020).
da Silva, A. F. et al. Netzeroco 2, an ai framework for accelerated nature-based carbon sequestration. In 2022 IEEE International Conference on Big Data (Big Data), 4881–4887 (IEEE, 2022).
Nathaniel, J., Liu, J. & Gentine, P. MetaFlux: Meta-learning global carbon fluxes from sparse spatiotemporal observations. Zenodo https://doi.org/10.5281/zenodo.7761881 (2023).
Finn, C., Abbeel, P. & Levine, S. Model-agnostic meta-learning for fast adaptation of deep networks. In International Conference on Machine Learning, 1126–1135 (PMLR, 2017).
Nichol, A., Achiam, J. & Schulman, J. On first-order meta-learning algorithms. arXiv preprint arXiv:1803.02999 (2018).
Gardner, M. W. & Dorling, S. Artificial neural networks (the multilayer perceptron)–a review of applications in the atmospheric sciences. Atmospheric Environment 32, 2627–2636 (1998).
Buch, J., Williams, A. P., Juang, C. S., Hansen, W. D. & Gentine, P. SMLFire1.0: a stochastic machine learning (SML) model for wildfire activity in the western United States. EGUsphere 1–39 (2022).
Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Computation 9, 1735–1780 (1997).
Schuster, M. & Paliwal, K. K. Bidirectional recurrent neural networks. IEEE Transactions on Signal Processing 45, 2673–2681 (1997).
Li, C., Zhang, Y. & Ren, X. Modeling hourly soil temperature using deep BiLSTM neural network. Algorithms 13, 173 (2020).
Ganaie, M. A., Hu, M., Malik, A., Tanveer, M. & Suganthan, P. Ensemble deep learning: A review. Engineering Applications of Artificial Intelligence 115, 105151 (2022).
Hersbach, H. et al. The ERA5 global reanalysis. Quarterly Journal of the Royal Meteorological Society 146, 1999–2049, https://doi.org/10.1002/qj.3803 (2020).
Justice, C. et al. The MODIS fire products. Remote Sensing of Environment 83, 244–262, https://doi.org/10.1016/S0034-4257(02)00076-7 (2002).
Baraloto, C., Morneau, F., Bonal, D., Blanc, L. & Ferry, B. Seasonal water stress tolerance and habitat associations within four neotropical tree genera. Ecology 88, 478–489 (2007).
Beck, H. E. et al. Present and future Köppen-Geiger climate classification maps at 1-km resolution. Scientific Data 5, 1–12, https://doi.org/10.1038/sdata.2018.214 (2018).
Poulter, B. et al. Plant functional type mapping for earth system models. Geoscientific Model Development 4, 993–1010, https://doi.org/10.5194/gmd-4-993-2011 (2011).
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
Zhang, Y., Joiner, J., Alemohammad, S. H., Zhou, S. & Gentine, P. A global spatially contiguous solar-induced fluorescence (CSIF) dataset using neural networks. Biogeosciences 15, 5779–5800, https://doi.org/10.6084/m9.figshare.6387494 (2018).
Guanter, L. et al. The TROPOSIF global sun-induced fluorescence dataset from the Sentinel-5P TROPOMI mission. Earth System Science Data 13, 5423–5440, https://doi.org/10.5270/esa-s5p_innovation-sif-20180501_20210320-v2.1-202104 (2021).
Jung, M. et al. Scaling carbon fluxes from eddy covariance sites to globe: synthesis and evaluation of the FLUXCOM approach. Biogeosciences https://doi.org/10.5194/bg-17-1343-2020 (2020).
Zhan, W. et al. Two for one: Partitioning CO2 fluxes and understanding the relationship between solar-induced chlorophyll fluorescence and gross primary productivity using machine learning. Agricultural and Forest Meteorology 321, 108980 (2022).
Reich, P. B. et al. Boreal and temperate trees show strong acclimation of respiration to warming. Nature 531, 633–636 (2016).
Berry, J. & Bjorkman, O. Photosynthetic response and adaptation to temperature in higher plants. Annual Review of Plant Physiology 31, 491–543 (1980).
Zhang, K., Zhang, X., Song, H., Pan, H. & Wang, B. Air quality prediction model based on spatiotemporal data analysis and metalearning. Wireless Communications and Mobile Computing 2021, 1–11 (2021).
Juang, C. S. et al. Rapid growth of large forest fires drives the exponential response of annual forest-fire area to aridity in the western United States. Geophysical Research Letters 49, e2021GL097131 (2022).
Miralles, D. G., Teuling, A. J., Van Heerwaarden, C. C. & Vilà-Guerau de Arellano, J. Mega-heatwave temperatures due to combined soil desiccation and atmospheric heat accumulation. Nature Geoscience 7, 345–349 (2014).
Falkenmark, M., Lundqvist, J. & Widstrand, C. Macro-scale water scarcity requires micro-scale approaches: Aspects of vulnerability in semi-arid development. In Natural Resources Forum, vol. 13, 258–267 (Wiley Online Library, 1989).
Singh, K. & Kushwaha, C. Emerging paradigms of tree phenology in dry tropics. Current Science 964–975 (2005).
Chen, X. et al. Vapor pressure deficit and sunlight explain seasonality of leaf phenology and photosynthesis across Amazonian evergreen broadleaved forest. Global Biogeochemical Cycles 35, e2020GB006893 (2021).
Chen, M. et al. Regional contribution to variability and trends of global gross primary productivity. Environmental Research Letters 12, 105005 (2017).
Nathaniel, J., Klein, L. J., Watson, C. D., Nyirjesy, G. & Albrecht, C. M. Aboveground carbon biomass estimate with physics-informed deep network. arXiv preprint arXiv:2210.13752 (2022).
Chen, Y. et al. Contrasting performance of the remotely-derived GPP products over different climate zones across China. Remote Sensing 11, 1855 (2019).
Falge, E. et al. Seasonality of ecosystem respiration and gross primary production as derived from FLUXNET measurements. Agricultural and Forest Meteorology 113, 53–74 (2002).
Schwalm, C. R. et al. Photosynthetic light use efficiency of three biomes across an east–west continental-scale transect in Canada. Agricultural and Forest Meteorology 140, 269–286 (2006).
Tagesson, T. et al. A physiology-based earth observation model indicates stagnation in the global gross primary production during recent decades. Global Change Biology 27, 836–854 (2021).
Stocker, B. D. et al. Drought impacts on terrestrial primary production underestimated by satellite monitoring. Nature Geoscience 12, 264–270 (2019).
Running, S. W. et al. A continuous satellite-derived measure of global terrestrial primary production. Bioscience 54, 547–560 (2004).
Booth, B. B. et al. High sensitivity of future global warming to land carbon cycle processes. Environmental Research Letters 7, 024002 (2012).
Liang, S. et al. The global land surface satellite (GLASS) product suite. Bulletin of the American Meteorological Society 102, E323–E337 (2021).
Allen, K. et al. Will seasonally dry tropical forests be sensitive or resistant to future changes in rainfall regimes? Environmental Research Letters 12, 023001 (2017).
Skulovich, O. & Gentine, P. A long-term consistent artificial intelligence and remote sensing-based soil moisture dataset. Scientific Data 10, 154 (2023).
Miles, L. et al. A global overview of the conservation status of tropical dry forests. Journal of Biogeography 33, 491–505 (2006).
Hadley, N. F. & Szarek, S. R. Productivity of desert ecosystems. BioScience 31, 747–753 (1981).
Sala, O. E., Gherardi, L. A., Reichmann, L., Jobbagy, E. & Peters, D. Legacies of precipitation fluctuations on primary production: theory and data synthesis. Philosophical Transactions of the Royal Society B: Biological Sciences 367, 3135–3144 (2012).
Knapp, A. K., Ciais, P. & Smith, M. D. Reconciling inconsistencies in precipitation–productivity relationships: implications for climate change. New Phytologist 214, 41–47 (2017).
Sanders, A. F. et al. Spaceborne sun-induced vegetation fluorescence time series from 2007 to 2015 evaluated with Australian flux tower measurements. Remote Sensing 8, 895 (2016).
Reichstein, M. et al. On the separation of net ecosystem exchange into assimilation and ecosystem respiration: review and improved algorithm. Global Change Biology 11, 1424–1439 (2005).
Acknowledgements
We thank Martin Jung and Ulrich Weber for giving us access to the latest Fluxcom dataset, and the two anonymous reviewers whose feedback has significantly improved the manuscript. The authors acknowledge funding from the NSF LEAP Science and Technology Center award #2019625, a Department of Energy grant, the USMILE European Research Council grant, and LEMONTREE Schmidt Futures funding.
Author information
Contributions
P.G. conceived the idea and supervised the work. J.L. processed the data. J.N. and J.L. designed the experiments. J.N. conducted the experiments, analysed the results, and wrote the first manuscript draft. All authors reviewed the manuscript.
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Nathaniel, J., Liu, J. & Gentine, P. MetaFlux: Meta-learning global carbon fluxes from sparse spatiotemporal observations. Sci Data 10, 440 (2023). https://doi.org/10.1038/s41597-023-02349-y
DOI: https://doi.org/10.1038/s41597-023-02349-y