1 km land use/land cover change of China under comprehensive socioeconomic and climate scenarios for 2020–2100

Luo, Meng; Hu, Guohua; Chen, Guangzhao; Liu, Xiaojuan; Hou, Haiyan; Li, Xia

doi:10.1038/s41597-022-01204-w

Download PDF

Data Descriptor
Open access
Published: 28 March 2022

1 km land use/land cover change of China under comprehensive socioeconomic and climate scenarios for 2020–2100

Scientific Data volume 9, Article number: 110 (2022) Cite this article

11k Accesses
17 Citations
12 Altmetric
Metrics details

Subjects

Abstract

In the past decades, China has undergone dramatic land use/land cover (LULC) changes. Such changes are expected to continue and profoundly affect our environment. To navigate future uncertainties toward sustainability, increasing efforts have been invested in projecting China’s future LULC following the Shared Socioeconomic Pathways (SSPs) and/or Representative Concentration Pathways (RCPs). To supplements existing datasets with a high spatial resolution, comprehensive pathway coverage, and delicate account for urban land change, here we present a 1-km gridded LULC dataset for China under 24 comprehensive SSP-RCP scenarios covering 2020–2100 at 10-year intervals. Our approach is to integrate the Global Change Analysis Model (GCAM) and Future Land Use Simulation (FLUS) model. This dataset shows good performance compared to remotely sensed CCI-LC data and is generally spatio-temporally consistent with the Land Use Harmonization version-2 dataset. This new dataset (available at https://doi.org/10.6084/m9.figshare.14776128.v1) provides a valuable alternative for multi-scenario-based research with high spatial resolution, such as earth system modeling, ecosystem services, and carbon neutrality.

Measurement(s)	Land Use and Land Cover Change
Technology Type(s)	computational modeling technique
Factor Type(s)	Shared Socioeconomic Pathways scenarios • Representative Concentration Pathways scenarios • land use and land cover change
Sample Characteristic - Environment	Land
Sample Characteristic - Location	China

Global land use for 2015–2100 at 0.05° resolution under diverse socioeconomic and climate scenarios

Article Open access 02 October 2020

Temporal-Spatial Analysis of the Warming Effect of Different Cultivated Land Urbanization of Metropolitan Area in China

Article Open access 17 February 2020

Global land projection based on plant functional types with a 1-km resolution under socio-climatic scenarios

Article Open access 30 March 2022

Background & Summary

Land use/land cover (LULC) plays a crucial role in the interactions between the human system and the Earth system¹, which relates directly to a wide range of issues that involve big stakes, e.g., biodiversity², energy balance³, carbon cycle², hydrologic cycle⁴, and climate extremes⁵. In this regard, LULC in China has been undergoing dramatic changes in the past few decades, with nationwide and worldwide social-environmental consequences^1,6,7. For instance, since the “Reform and Opening-up” in 1978, China’s rapid urban growth has prevailed by invading a large proportion of croplands^8,9,10. However, after implementing afforestation and reforestation programs, China has shown a significant vegetation greening trend, contributed mainly by forests (42%) and croplands (32%)^1,11. These complicated changes are subject to the influences of a variety of social and economic factors. Further investigating and predicting future LULC in China is of vital importance for future land use policy decisions and sustainable management of ecosystems. This could provide the crucial information to balance the anthropogenic climate change and social-economic development.

Combining socio-economic scenarios, emission pathways, and diverse sectoral information in a unified framework can be used to assess future LULC under different policies and global mitigation targets. Integrated Assessment Models (IAMs) are commonly used to quantify outcomes of LULC under different Shared Socioeconomic Pathways (SSPs, representing alternative socio-economic developments)¹² and Representative Concentration Pathways (RCPs, representing greenhouse gas concentration trajectories)¹³, such as the Asian-Pacific Integrated Model/computable general equilibrium (AIM/CGE)¹⁴, Integrated Model to Assess the Global Environment (IMAGE)¹⁵, and Global Change Analysis Model (GCAM)¹⁶. Among them, GCAM, an open-source global integrated and multi-sector model adopted by the Intergovernmental Panel on Climate Change (IPCC)¹⁷, has been widely used to project future LULC under diverse socioeconomic and emission scenarios at both regional and global scales^18,19,20. GCAM V5.2 explicitly incorporates modules of water supply and demands which is vital in the agriculture and land use sectors²¹. Besides, GCAM V5.2 also considers the assumptions of water technological advancements under different SSPs, which has significant impacts on water demands in a water-constrained world²².

Mapping spatially explicit LULC patterns at a high spatial resolution is important for analyzing the local spatial details of LULC and understanding the local interactions among human activities and ecological processes in the alternative future²³. Some research pointed out that LULC data with coarse spatial resolution could ignore a large proportion of small urban patches, with a severe underestimation of urban land area and urban growth²⁴. By contrast, LULC data with a fine spatial resolution (like 1-km) could provide more necessary spatial details and accurately reflect the heterogeneous spatial characteristics of LULC²⁴. However, the LULC data produced by IAMs are typically in the subregion levels (e.g., the regional, agroecological or water-basin levels)^16,25,26 or with coarse spatial resolution^14,27,28. Combining IAMs under global scenarios with spatially-explicit LULC models at a coherent framework provides a feasible scheme to project future LULC with a finer spatial resolution^18,19,20. For example, the Future Land Use Simulation (FLUS) model has been used to generate spatially-explicit LULC data by combining IAMs^19,24, which can simulate high-spatial-resolution LULC change with generally higher accuracy than the single neural network-based cellular automata (CA) model^29,30, the Conversion of Land Use and its Effects at Small regional extent (CLUE-S)³¹, and other models^24,32,33.

Recently, a set of SSP-RCP frameworks have been proposed^34,35 to describe potential pathways under diverse socio-economic and emission conditions. However, comprehensive assessments of China’s LULCs under full combinations of SSP and RCP scenarios with high resolutions remain to be conducted. It can facilitate the thorough analysis of our uncertain future under challenges of mitigation and adaptation³⁵ and is also crucial for the net-zero emission research^36,37. Some studies produced the LULC projections under the combinations of SSP and RCP scenarios at a coarse spatial resolution, such as the 0.5-degree LULC data projected by AIM/GEC¹⁴ and five arc-minute gridded LULC data produced by IMAGE²⁷. In contrast, some other studies generated the future LULC data with a high spatial resolution but only covered very limited scenarios. For example, Dong et al.¹⁹ developed 1-km resolution LULC data in China using the integrated GCAM and FLUS model. Cao et al.²⁰ spatialized global LULC data at a 1-km resolution based on the integrated GCAM and CA model. However, missing some important scenarios could hinder the applications in ecological and hydrological modelling³⁸,³⁹. For example, SSP5-RCP1.9, which is a combination of a strict climate target and a fossil-fueled development scenario, may be required to represent extreme conditions of human activities in the modelling. Projecting high-spatial-resolution LULC with all possible combinations of SSP and RCP, enables a comprehensive analysis of LULC under different socio-economic assumptions and mitigation policies and can support a deep understanding of local LULC dynamics with more spatial details.

In addition, the future urban land change has not been well considered in the existing future LULC data. Urban land is a key driver for many environmental and societal changes across scales⁴⁰ and is also crucial for studying LULC projections^32,41. Some models currently assume that there is no urban land change in the future, such as GCAM⁴². However, there has been a dramatic change of global urban land in the past decades⁴³, implying that assuming a constant coverage of the urban land in the future is unrealistic^24,40. Some other studies projected urban shrinkage simply based on the empirical relationship between urban land, Gross Domestic Product (GDP), population, and other factors^19,27. Nevertheless, the decrease of the urban population does not necessarily lead to numerous land conversions from urban to non-urban areas^44,45, especially in China⁴⁶. Recently, Chen et al. (2020a) developed a 1-km gridded dataset of globally future urban land expansion under SSPs, which provides a more reasonable projection of future urban land change. This dataset offers us a great opportunity to calibrate urban land change in LULC predictions.

The overarching goal of this study is to develop a high-resolution gridded LULC dataset in China under a full SSP/RCP matrix from 2020 to 2100, where the urban land change is well incorporated. First, we generated the LULC projections for China under all possible combinations of SSP and RCP using GCAM at the water-basin level. Then, we used the urban land dataset developed by Chen et al. (2020a) to calibrate the urban land demand (the projected total area of urban) projected by GCAM. Finally, we downscaled the water-basin-level LULC projections to 1-km grids using the FLUS model. This newly gridded dataset fills the gap between high spatial resolution and limited scenarios in the current LULC predictions, which can enhance climate change research under diverse socioeconomic and emission assumptions, provide support for making policies to limit global warming to below 2 °C or 1.5 °C by 2100, the target of the Paris agreement³⁶ and help reduce the uncertainties of the Earth system modelling. It will be helpful to those researches focusing on individual socio-economic or emission conditions. Besides, this dataset will be valuable for wide unified and comparable multi-scenario-based research, such as ecosystem services⁴⁷, biodiversity⁴⁸,soil erosion by water⁴⁹, and carbon neutrality in China.

Methods

Overall framework

Figure 1 shows the methodological framework of the integrated GCAM-FLUS model for producing high-resolution LULC dataset in China. First, we used the GCAM model to project the LULC demands of China from 2020 to 2100 under all possible SSP-RCP scenarios (24 scenarios in total, see Table 1) with a 10-year interval at a regional scale. Second, we harmonized the LULC types with a reclassification scheme (see Table 2) and calibrated the land demands of GCAM-generated LULC data based on the historical LULC data collected from the European Space Agency Climate Change Initiative (CCI-LC)⁵⁰. Further, we calibrated the urban land demands using a well-validated future urban expansion dataset under SSPs produced by Chen et al. (2020a), freely downloaded from http://www.geosimulation.cn/GlobalSSPsUrbanProduct.html. Finally, we integrate comprehensive SSP-RCP with a land use model (FLUS) to downscale the GCAM-based LULC data into 1-km spatial resolution.

Table 1 SSP by RCP scenarios matrix and the coupled scenarios covered in this study.

Full size table

Table 2 The LULC reclassification scheme in this study.

Full size table

The 24 SSP-RCP scenarios (listed in Table 1) are composed of five baseline scenarios that do not include any climate mitigation policy and 19 combined scenarios. In this study, the climate conditions are represented by three RCP levels (2.6, 4.5, 6.0 Wm⁻²) and two additional forcing levels (1.9 and 3.4 Wm⁻²). The level of 8.5 Wm⁻² is not included because the forcing levels of all five baseline scenarios in our simulations are lower than 8.5 Wm⁻². The two additional forcing levels are related to current policy targets³⁶ and belong to 1.5 °C and 2 °C scenarios^14,38, which play a key role in reducing the impact and risks of climate change^51,52.

Projection of future regional LULC demands

GCAM, a market-equilibrium dynamic-recursive model, well represents the interactions of five sub-systems: energy, water, agriculture and land use, economy, and climate²⁵. GCAM is one of the marker models used to quantify SSP and RCP scenarios and has been widely used to analyze future LULC changes under different scenarios. In the newly released GCAM v5.2, the land-use module subdivides 32 global geo-political regions into the water-basin levels, and there are totally 24 subregions in China (Figure S1). GCAM uses a logit model to represent the sharing of the economic decisions for land use in one region. Thus, there is a distribution of profit behind each competing land use. GCAM also uses a nesting strategy to reflect the differences in alternatives through different LULC types with logit exponents. Besides, GCAM implements both SSP and RCP scenarios. This study used GCAM v5.2 to project the water-basin level demands of different LULC types from 2020 to 2100 with a 10-year interval under the 24 scenarios.

Calibration of LULC types and land demands

There are some inconsistencies of LULC types between the output of GCAM and the initial LULC map (i.e., CCI-LC) used for spatial downscaling. GCAM mainly includes nine LULC types composed of Cropland, Forest, Pasture, Grassland, Shrub, Urban land, Tundra, Rock, Ice and Desert, and Biomass. By contrast, CCI-LC mainly includes Cropland, Grassland, Shrub, Forest, Bareland, Ice, Water and Urban areas. Besides, there are considerable differences in land areas between the GCAM-derived future land demands and CCI-LC dataset. These inconsistencies may cause significant errors and uncertainties in the future LULC simulations. Therefore, we first built a reclassification scheme composed of 8 LULC types (Table 2) and used it to harmonize the LULC types of these two datasets. Then, we further adjusted the GCAM-derived future land demands by:

$$A{(t+1)}^{k}=\left\{\begin{array}{c}\left(1+\frac{A{\left(t+1\right)}_{GCAM}^{k}-A{\left(t\right)}_{GCAM}^{k}}{A{\left(t\right)}_{GCAM}^{k}}\right)\times A{(t)}_{CCI-LC}^{k}\quad t=2010\\ \left(1+\frac{A{\left(t+1\right)}_{GCAM}^{k}-A{\left(t\right)}_{GCAM}^{k}}{A{\left(t\right)}_{GCAM}^{k}}\right)\times A{(t)}^{k}\quad 2020\le t\le 2100\end{array}\right.$$

(1)

where $A{(t+1)}^{k}$ is the calibrated area of type k at year t + 1, $A{\left(t+1\right)}_{GCAM}^{k}$ is the area of type k at year t-1 calculated by GCAM, and $A{(t)}_{CCI-LC}^{k}$ is the area of type k at year t from the CCI-LC data.

Calibration of urban demand

GCAM assumes that the future urban demands remain constant, which is unreasonable⁴⁰ and needs to be re-adjusted. Chen et al. (2020a) generated a 1-km future urban land expansion dataset based on the established relationships between urban land demand, GDP and urbanization rate. This dataset has a much higher spatial resolution which could capture more spatial details of urban land patterns than other urban expansion data and shows excellent performance in terms of the Figure of Merit (FoM)³². In addition, this urban land dataset considers more reasonable situations in the urban shrinkage stage^45,46. In this study, we used this urban land dataset to calibrate future GCAM-derived urban demands. We first calculated the urban areas of 24 subregions under different SSP scenarios based on this dataset, and used it to update the future urban demands of GCAM-derived LULC data. Then, we used the methods developed by Li et al. ²⁴ to adjust the areas of the remaining LUCCs proportionally. Equation 2 describes how we deduct the demand of urban encroachment in other LULC types.

$${A}_{adjust}{(t)}^{k}=\left(A{(t)}^{U}-A{(0)}^{U}\right)\times {H}_{n}^{k\to U}$$

(2)

where ${A}_{adjust}{(t)}^{k}$ is the area of type k encroached by urban at year t, $A{(t)}^{U}A$ is the projected area of urban land (U) for year t, $A{(0)}^{U}$ is the original area of urban in 2010, n is the number of LULC types (except urban, water and ice), and ${H}_{n}^{k\to U}$ is the empirical proportion of land loss encroached by urban of type k and the sum of ${H}_{n}^{k\to U}$ of all n types is 1.

In this study, we assumed that urban could encroach other LULC types except for water and ice, and the urban demand under the same SSP scenario is identical. We used the CCI-LC data in 2000 and 2010 to calculate the proportion of land loss due to the urban encroachment on each LULC type and derive ${{\rm{H}}}_{{\rm{n}}}^{{\rm{k}}\to {\rm{U}}}$ for each subregion (Table 3). Then we combined ${{\rm{H}}}_{{\rm{n}}}^{{\rm{k}}\to {\rm{U}}}$ and aforementioned urban land dataset to adjust the demands of other LULC types in the GCAM-derived LULC data.

Table 3 The calculated ${{\rm{H}}}_{{\rm{n}}}^{{\rm{k}}\to {\rm{U}}}$ for different LULC types in 24 subregions of China.

Full size table

Downscaling of regional LULC projections

Based on the GCAM-derived LULC demands calibrated through the steps abovementioned, we used the FLUS model to simulate the future LULC of China at 1-km resolution. FLUS integrates the top-down system dynamics and bottom-up CA and can explicitly simulate the spatial trajectories of multiple LULC types³³. The first part of FLUS aims to train and estimate the occurrence probabilities of LULC on a specific grid cell based on artificial neural networks (ANN). Specifically, we first collected the CCI-LC data in 2010 and 15 driving factors (shown in Table 4) as the training data. The driving factors were mainly selected based on relevant studies and can reflect different heterogeneous characteristics (i.e., climate, soil, topography, population, economics, transportation, etc.) related to LULC^24,32,33. All these driving factors are reprojected into 1-km × 1-km grids with a spatial reference of the Albers equal-area conic projection. Then we trained the ANN model based on a 1% uniform sample rate for each subregion and used the trained model to estimate the occurrence probabilities (OP, determined by the characteristics of each pixcel) of each pixel. The adopted ANN model has one input layer, one hidden layer with 10 neurons and one output layer in this study. Each neuron of the input layer is associated with a driving factor, and that of the output layer corresponds to OP for a specific LULC type. The sigmoid activation function was used for the hidden layer. The second part of the model, CA, considers OP, conversion cost, neighborhood condition and competition among the different LULC types to estimate the combined probability for LULC conversion. In this step, the LULC type with a higher OP estimated by the previous step is more likely to be predicted as the target LULC type. In contrast, those with a relatively lower OP can still be converted based on the roulette selection mechanism. During the allocation stage, we adopted several assumptions: first, urban expansion is irreversible; second, water and ice are not involved in LULC conversion; and third, bareland can only be infringed by urban or stay unchanged in the future, considering that GCAM cannot project the bareland change in the future and the Bareland can only change due to the urban expansion (see the previous section). Under these assumptions, combined with CCI-LC data in 2010 as an initial LULC map, we used the FLUS model to produce 1-km LULC dataset in China from 2020 to 2100.

Table 4 Specifications of the 15 driving factors used during the downscaling process.

Full size table

Data Records

The generated LULC dataset with 1-km spatial resolution and 10-year time step from 2020–2100 covers 24 SSP-RCP scenarios in total (Table 2). The dataset is publicly available in https://doi.org/10.6084/m9.figshare.14776128.v1⁵³ and http://www.geosimulation.cn/. All the data is stored in a commonly-used geotiff format with a spatial reference of the Albers equal-area conic projection, which can be easily accessed by ARCGIS, ENVI, MATLAB, etc. For the file naming and structure, all the files with the same SSP-RCP scenarios were grouped into the same folders with the name of “SSP-RCP” and each geotiff file is named as “SSP_RCP_Year.tif”, where “SSP” and “RCP” denote the SSP and RCP scenarios, and “Year” denotes the year of the LULC data. For example, the file storing the LULC data under SSP1-RCP1.9 in 2100 is named as “SSP1-RCP19_2100.tif”. Taking “SSP1-RCP19_2100.tif” as an example, Figure S2 shows the 2100 LULC spatial distributions under SSP1-RCP1.9. In each geotiff file, different integer values represent different LULC types: 1 Urban; 2 Cropland; 3 Grassland; 4 Shrub; 5 Forest; 6 Water; 7 Bareland; and 8 Ice.

Technical Validation

We used the CCI-LC data in 2000 to train the FLUS model and then used the CCI-LC data in 2010 to evaluate the reliability of our downscaled dataset. We also compared our gridded dataset with the widely-used Land-use Harmonisation (LUH2) data at a 0.25 degree resolution⁵⁴, used in the Coupled Model Inter-comparison Project Phase 6 (CMIP6)⁵⁵. Specifically, we chose all the overlapping scenarios between our dataset and LUH2, including SSP1-RCP1.9, SSP1-RCP1.6, SSP2-RCP4.5, SSP4-RCP3.4, and SSP5-RCP3.4 for comparison. In addition, considering the inconsistency in LULC types between our dataset and LUH2, we re-grouped the LUH2 data into five broad LULC types: Cropland, Forest, Bareland, Grassland, and Urban.

To quantitatively assess the simulated LULC during the downscaling process, we calculated the overall accuracy and the Cohen’s Kappa coefficient of each subregion by validating our dataset against the CCI-LC data. Compared with the Kappa coefficient, FoM can avoid overestimating the accuracy and has been demonstrated to effectively evaluate the accuracy and has been demonstrated to be effective to evaluate the accuracy of simulating LULC changes^56,57. Therefore, we further used the FoM metrics to assess the consistencies between the simulated LULC and remote sensing data (CCI-LC). Specifically, FoM represents the ratio of the correct predicted change to the sum of the observed and predicted change:

$${\rm{FoM}}=\frac{{\rm{B}}}{{\rm{A}}+{\rm{B}}+{\rm{C}}+{\rm{D}}}$$

(3)

where A represents the false area where the observed change is predicted as persistence, B represents the correct area where the observed change is predicted as change correctly, C represents the false area where the observed change is predicted as a change in the wrong LULC type, and D represents the false area where the observed persistence is predicted as change. Its value ranges from 0 to 1, and larger FoM represents a better performance on LULC simulations.

In addition, we used the Pearson correlation coefficient and root mean square difference (RMSD) to assess the spatio-temporal consistencies between our gridded LULC dataset and LUH2 data during 2020–2100.

Validation of the downscaling process

The statistical accuracy metrics of the LULC simulations of 2010 in each subregion and the whole of China are shown in Table 5. The Kappa coefficient ranges from 0.43 to 0.75 and the overall accuracy ranges from 0.66 to 0.88 across different subregions. These two metrics have values of 0.64 and 0.79 for China. FoM varies from 0.10 to 0.17 in different subregions, and has a value of 0.13 for China. This result is identical to existing studies which showed that the FoM values were usually in the range of 0.1 to 0.3, due to the path-dependent effects^32,33,58. The confusion matrix of the simulated LULC compared to CCI-LC data in 2010 shows that the number of mis-classified pixels is small (Table 6). These results demonstrate that the simulated LULC have a good agreement with CCI-LC in 2010.

Table 5 The statistical accuracy metrics of LULC simulations of 2010 in each subregion as well as the whole of China.

Full size table

Table 6 Confusion matrix of the simulated LULC compared to CCI-LC in 2010.

Full size table

We also compared the LULC spatial distributions of the actual LULC map retrieved by remote sensing data (CCI-LC, used as the base map), our downscaled data and LUH2 data in 2010 (Figure S3). It is worth noting that LUH2 data used a different historical data source⁵⁵. Overall, they show similar spatial patterns with minor differences in most regions. The Pearson’s correlation coefficients between our data and base map are 0.98, 0.99, 0.95, 0.96, and 0.90 for Cropland, Forest, Bareland, Grassland and Urban. The differences between LUH2 and base map are mainly distributed in the eastern and south eastern China, where the base map shows a lower proportion of Bareland and Grassland, but a higher proportion of Urban. We further compared the difference of the land amount proportion of each LULC type within a 10-km x 10-km gird between our simulations and base map (Figure S4), which shows that the overall spatial pattern of our simulated LULC is consistent with the base map. Some differences between them are mainly distributed in the northwest of China, where our simulation overestimates the area of Bareland and underestimates the areas of Grassland and Cropland. This may be because from 2000 to 2010, some areas of Bareland were converted into Grassland and Cropland in the northwest China. However, our downscaling strategy assumes the Bareland can only be infringed by urban. Thus, the conversion from Bareland to other LULC types is omitted, which can lead to the overestimation of the areas of Bareland and underestimation of the area of Grassland and Cropland in this region. The fraction differences of all the eight LULC types within each 10-km × 10-km gird range from −15% to 15% and most of them are smaller than ± 5% (Fig. 2). All these results demonstrate that the downscaling process can accurately simulate the LULC spatial distributions.

Spatio-temporal consistency with LUH2

We first analyzed the temporal change of the eight LULC types’ land amount in our dataset from 2010 to 2100 (Figure S5). Notably, Ice and Water are assumed to be intact in our LULC projections, and thus they will keep unchanged over time. Forest will increase in SSP1-, SSP2-, SSP4-based scenarios, but whether Forest increases or decreases in SSP3- and SSP5-based scenarios depends on specific RCP settings. Shrub shows an increasing trend in all the SSP scenarios. Grassland shows a decreasing trend in most of the SSP2-, SSP3-, SSP4-, and SSP5-based scenarios, a slight increase in the baseline scenarios of SSP4 and SSP5, and trivial changes in SSP1-based scenarios. Crop has a decreasing trend in all SSP1-, SSP4- and SSP5-based scenarios, but some SSP2- and SSP3-based scenarios show an increasing trend. In terms of RCP, more Croplands are required in RCP1.9, RCP2.6, and RCP3.4 than RCP4.5, RCP6, and baseline scenarios. Forest, Shrub, and Grassland will increase faster in RCP1.9, RCP2.6, and RCP3.4 than RCP4.5, RCP6, and baseline scenarios.

Then, we compared the LULC amounts of five overlapped scenarios between our dataset and LUH2 across the five LULC types. Figure 3 shows that the temporal trend of Forest is similar to LUH2 before 2060 and lightly differs from LUH2 after 2060, while the temporal trend of Grassland in our dataset is generally consistent with LUH2. Cropland in our dataset will decrease in all five scenarios, and presents different trends with LUH2. However, the trend of Cropland is similar with the results reported in Chen et.al. (2020b). The Urban shows an increasing trend from 2010 to the mid-21st century, consistent with LUH2. But Urban will remain stable after reaching its maximum values, which is different from the decreasing trends in LUH2. Bareland will slightly decrease from 2010 to mid-21st century and remain unchanged later, which is also different from LUH2. The difference in Urban and Bareland is mainly because these two types are assumed to be intact in GCAM, and we used an urban land dataset under SSPs to calibrate the future LULC change. This urban land dataset assumes that the urban will not convert to other LULC types in the city shrinking stage, which causes that the area of urban will keep unchanged after reaching the peak. Besides, we assume the Bareland can be infringed by urban in the expanding stage, leading to the slight decrease of Bareland from 2010 to the mid-21st century.

We further compared the spatial consistencies of our downscaled gridded LULC dataset with LUH2 from 2010–2100 (Fig. 4). Overall, the two datasets show good consistencies across the five LULC types. The mean values of the Pearson’s correlation coefficients for five sceneries are 0.88 (Crop), 0.76 (Forest), 0.66 (Bareland), 0.78 (Grassland) and 0.82 (Urban) respectively. The relatively low consistency for Bareland may result from two reasons. First, the Bareland change in our dataset is caused by the urban encroachment since the demands generated by GCAM remain stable in the future, but LUH2 adopted different assumptions. Second, the base map we used had a distinct spatial pattern of Bareland compared to LUH2 in 2010 (see Figure S3). The Pearson’s correlation coefficients between our dataset and LUH2 range from 0.56 to 0.92 in SSP1-RCP1.9, 0.56 to 0.92 in SSP1-RCP2.6, 0.68 to 0.91 in SSP2-RCP3.4, 0.69 to 0.92 in SSP4-RCP3.4 and 0.68 to 0.92 in SSP5-RCP4.5. Among the five scenarios, our dataset has the highest Pearson’s correlation coefficients with LUH2 in SSP4-RCP3.4, which can be explained by that LUH2 used the same GCAM model with a different version (v4) for the LULC projection. These results demonstrate that our dataset has a good spatial consistency with LUH2 for different LULC types under different scenarios.

We also compared the temporal consistencies (RMSD) of land area in five LULC types between our dataset and LUH2. As shown in Fig. 5, most of the temporal variation in our dataset is similar to LUH2, with only some minor differences. For all the five scenarios, the RMSD for Cropland ranges from 0.057 to 0.11; for Forest, it ranges from 0.052 to 0.073; for Bareland, it ranges from 0.001 to 0.026; for Grassland, it ranges from 0.062 to 0.11; and for Urban, it ranges from 0.0001 to 0.0006. Among all the LULC types, Urban shows the highest degree of similarity which may be because the urban dataset we used to calibrate the urban demand has a high consistency with LUH2³². But there are some differences in the temporal variation of Bareland and Grassland, and most of them occur in the northwest China, possibly because of the differences in the input data, the spatial downscaling strategy, and the new land cover classification scheme we used. The Bareland change is only caused by urban encroachment in our dataset, while this is not the case in LUH2. Some differences also occur in the southeast of China, which may result from the differences in type definition and classification standards between the base map and LUH2 (Figure S3). Overall our dataset shows high spatio-temporal consistency with LUH2. The discrepancy between our dataset and LUH2 can be caused by different IAM models, base map and LULC downscaling methods.

Besides, our dataset can reflect more spatial details because of its high spatial resolution (1-km) compared to LUH2 with coarse spatial resolution of 0.25° (Fig. 6). This means our dataset can be more helpful to investigate the local impacts of LULC on ecosystem services and many other studies under different socioeconomic and emission conditions.

Code availability

The GCAM v5.2 and FLUS models can be freely downloaded in https://github.com/JGCRI/gcam-core/releases and http://www.geosimulation.cn/FLUS.html, respectively.

References

Chen, C. et al. China and India lead in greening of the world through land-use management. Nat. Sustain. 2, 122–129 (2019).
Article PubMed PubMed Central Google Scholar
Eitelberg, D. A., van Vliet, J., Doelman, J. C., Stehfest, E. & Verburg, P. H. Demand for biodiversity protection and carbon storage as drivers of global land change scenarios. Glob. Environ. Chang. 40, 101–111 (2016).
Article Google Scholar
Duveiller, G., Hooker, J. & Cescatti, A. The mark of vegetation change on Earth’s surface energy balance. Nat. Commun. 9 (2018).
Bhattacharjee, K. & Behera, B. Does forest cover help prevent flood damage? Empirical evidence from India. Glob. Environ. Chang. 53, 78–89 (2018).
Article Google Scholar
Sy, S. & Quesada, B. Anthropogenic land cover change impact on climate extremes during the 21st century. Environ. Res. Lett. 15 (2020).
Jiyuan, L. et al. Spatial patterns and driving forces of land use change in China during the early 21st century. 20, 483–494 (2010).
Foley, J. A. Global Consequences of Land Use Global Consequences of Land Use. Science. 309, 570–574 (2005).
Article ADS CAS PubMed Google Scholar
Liu, X., Zhao, C. & Song, W. Review of the evolution of cultivated land protection policies in the period following China’s reform and liberalization. Land use policy 67, 660–669 (2017).
Article Google Scholar
Liu, Y. Introduction to land use and rural sustainability in China. Land use policy 74, 1–4 (2018).
Article Google Scholar
Rounsevell, M. D. A. et al. Challenges for land system science. Land use policy 29, 899–910 (2012).
Article Google Scholar
Piao, S. et al. Detection and attribution of vegetation greening trend in China over the last 30 years. Glob. Chang. Biol. 21, 1601–1609 (2015).
Article ADS PubMed Google Scholar
Riahi, K. et al. The Shared Socioeconomic Pathways and their energy, land use, and greenhouse gas emissions implications: An overview. Glob. Environ. Chang. 42, 153–168 (2017).
Article Google Scholar
van Vuuren, D. P. et al. The representative concentration pathways: An overview. Clim. Change 109, 5–31 (2011).
Article ADS Google Scholar
Fujimori, S., Hasegawa, T., Ito, A., Takahashi, K. & Masui, T. Data descriptor: Gridded emissions and land-use data for 2005–2100 under diverse socioeconomic and climate mitigation scenarios. Sci. Data 5, 1–13 (2018).
Article Google Scholar
van Vuuren, D. P. et al. Energy, land-use and greenhouse gas emissions trajectories under a green growth paradigm. Glob. Environ. Chang. 42, 237–250 (2017).
Article Google Scholar
Calvin, K. et al. The SSP4: A world of deepening inequality. Glob. Environ. Chang. 42, 284–296 (2017).
Article Google Scholar
IPCC Fifth Assessment Report: CSIROexperts comment. Ecos https://doi.org/10.1071/ec13228 (2013).
Chen, M. et al. Calibration and analysis of the uncertainty in downscaling global land use and land cover projections from GCAM using. 1753–1764 (2019).
Dong, N., You, L., Cai, W., Li, G. & Lin, H. Land use projections in China under global socioeconomic and emission scenarios: Utilizing a scenario-based land-use change assessment framework. Glob. Environ. Chang. 50, 164–177 (2018).
Article Google Scholar
Cao, M. et al. Spatial sequential modeling and predication of global land use and land cover changes by integrating a global change assessment model and cellular automata. Earth’s Future 7, 1102–1116 (2019).
Article ADS Google Scholar
Kim, S. H. et al. Balancing global water availability and use at basin scale in an integrated assessment model. Clim. Change 136, 217–231 (2016).
Article ADS Google Scholar
Graham, N. T. et al. Water Sector Assumptions for the Shared Socioeconomic Pathways in an Integrated Modeling Framework. Water Resour. Res. 54, 6423–6440 (2018).
Article ADS Google Scholar
Verburg, P. H., Ellis, E. C. & Letourneau, A. A global assessment of market accessibility and market influence for global environmental change studies. Environ. Res. Lett. 6, 1–12 (2011).
Article Google Scholar
Li, X. et al. A New Global Land-Use and Land-Cover Change Product at a 1-km Resolution for 2010 to 2100 Based on Human–Environment Interactions. Ann. Am. Assoc. Geogr. 107, 1040–1059 (2017).
Google Scholar
Calvin, K. et al. GCAM v5. 1: representing the linkages between energy, water, land, climate, and economic systems. 677–698 (2019).
Fujimori, S. et al. SSP3: AIM implementation of Shared Socioeconomic. Pathways. Glob. Environ. Chang. 42, 268–283 (2017).
Article Google Scholar
Doelman, J. C. et al. Exploring SSP land-use dynamics using the IMAGE model: Regional and gridded scenarios of land-use change and land-based climate change mitigation. Glob. Environ. Chang. 48, 119–135 (2018).
Article Google Scholar
Fricko, O. et al. The marker quantification of the Shared Socioeconomic Pathway 2: A middle-of-the-road scenario for the 21st century. Glob. Environ. Chang. 42, 251–267 (2017).
Article Google Scholar
Li, X. & Yeh, A. G. Neural-network-based cellular automata for simulating multiple land use changes using GIS. 16, 323–343 (2010).
Li, X. & Yeh, A. G. Modelling sustainable urban development by the integration of constrained cellular automata and GIS. 14, 131–152(2010).
Verburg, P. H. & Veldkamp, A. Modeling the Spatial Dynamics of Regional Land Use: The CLUE-S Model. 30, 391–405 (2002).
Chen, G. et al. Global projections of future urban land expansion under shared socioeconomic pathways. Nat. Commun. 11, 1–12 (2020).
Google Scholar
Liu, X. et al. Landscape and Urban Planning A future land use simulation model (FLUS) for simulating multiple land use scenarios by coupling human and natural e ff ects. Landsc. Urban Plan. 168, 94–116 (2017).
Article Google Scholar
Neill, B. C. O. et al. Achievements and needs for the climate change scenario framework. Nat Clim Chang. 25, 1–11 (2020).
Google Scholar
Vuuren, D. P. V, Kriegler, E. & Neill, B. C. O. A new scenario framework for Climate Change Research: scenario matrix architecture. 122, 373–386 (2014)
UNFCCC. FCCC/CP/2015/L.9/Rev.1: Adoption of the Paris Agreement. 1–32 (United Nations, 2015).
Pye, S., Li, F. G. N., Price, J. & Fais, B. Achieving net-zero emissions through the reframing of UK national targets in the post-Paris Agreement era. Nat. Energy 2, 1–8 (2017).
Google Scholar
Rogelj, J. et al. Scenarios towards limiting global mean temperature increase below 1.5 °C. Nat. Clim. Chang. 8 (2018).
IPCC. IPCC Special Report on the impacts of global warming of 1.5 °C. Ipcc - Sr15 (IPCC, 2018).
Gao, J. & O’Neill, B. C. Mapping global urban land for the 21st century with data-driven simulations and Shared Socioeconomic Pathways. Nat. Commun. 11, 1–12 (2020).
Article Google Scholar
Li, X. & Zhou, Y. Projecting Global Urban Area Growth Through 2100 Based on Historical Time Series Data and Future Shared Socioeconomic Pathways Earth’ s Future. 7, 351–362 (2019).
Chen, M. et al. Global land use projections for 2015–2100 at 0.05-degree resolution under diverse Shared Socioeconomic Pathways and Representative Concentration Pathways. Sci. Data Submitted 7, 320–331 (2020).
Article Google Scholar
Liu, X. et al. High-spatiotemporal-resolution mapping of global urban change from 1985 to 2015. Nat. Sustain. 3, 564–570 (2020).
Article Google Scholar
Martinez-Fernandez, C., Audirac, I., Fol, S. & Cunningham-Sabot, E. Shrinking Cities: Urban Challenges of Globalization. Int. J. Urban Reg. Res. 36, 213–225 (2012).
Article PubMed Google Scholar
Wiechmann, T. & Pallagst, K. M. Urban shrinkage in Germany and the USA: A Comparison of Transformation Patterns and Local Strategies. Int. J. Urban Reg. Res. 36, 261–280 (2012).
Article PubMed Google Scholar
Long, Y. & Wu, K. Shrinking cities in a rapidly urbanizing China. Environ. Plan. A 48, 220–222 (2016).
Article Google Scholar
Lawler, J. J. et al. Projected land-use change impacts on ecosystem services in the United States. Proc. Natl. Acad. Sci. USA 111, 7492–7497 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Hill, S. L. L. et al. Worldwide impacts of past and projected future land-use change on local species richness and the Biodiversity Intactness Index. preprint at https://www.biorxiv.org/content/10.1101/311787v1.
Borrelli, P. et al. Land use and climate change impacts on global soil erosion by water (2015–2070). Proc. Natl. Acad. Sci. 117, 21994–22001 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Bontemps, S. et al. Multi-year global land cover mapping at 300 M and characterization for climate modelling: Achievements of the land cover component of the ESA climate change initiative. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. - ISPRS Arch. 40, 323–328 (2015).
Article Google Scholar
Duan, H. et al. Assessing China’ s efforts to pursue the 1.5 °C warming limit. 385, 378–385 (2021).
Hare, W. L., Cramer, W., Schaeffer, M., Battaglini, A. & Jaeger, C. C. Climate hotspots: Key vulnerable regions, climate change and limits to warming. Reg. Environ. Chang. 11, 1–13 (2011).
Article Google Scholar
Luo, M. 1-km gridded LULC data under SSP-RCPs in China from 2020 to 2100. figshare. https://doi.org/10.6084/m9.figshare.14776128.v4 (2021).
Hurtt, G. et al. Harmonization of Global Land-Use Change and Management for the Period 850–2100 (LUH2) for CMIP6. Geosci. Model Dev. Discuss. 1–65, https://doi.org/10.5194/gmd-2019-360 (2020).
Hurtt, G. C. et al. Harmonization of global land use change and management for the period 850 – 2100 (LUH2) for CMIP6. 13, 5425–5464 (2020).
Pontius, R. G. & Millones, M. Death to Kappa: Birth of quantity disagreement and allocation disagreement for accuracy assessment. Int. J. Remote Sens. 32, 4407–4429 (2011).
Article Google Scholar
Gilmore, R. et al. Comparing the input, output, and validation maps for several models of land change. 42, 11–37 (2008).
Brown, D. G., Page, S., Riolo, R., Zellner, M. & Rand, W. Path dependence and the validation of agent-based spatial models of land use. Int. J. Geogr. Inf. Sci. 19, 153–174 (2005).
Article Google Scholar
Fischer, G.,et al. Global Agro-ecological Zones Assessment for Agriculture (GAEZ, 2008).
Gaughan, A. E. et al. Spatiotemporal patterns of population in mainland China, 1990 to 2010. Sci. Data 3, 1–11 (2016).
Kummu, M., Taka, M. & Guillaume, J. H. A. Gridded global datasets for Gross Domestic Product and Human Development Index over 1990–2015. Sci. Data 5, 1–15 (2018).
Article Google Scholar
Huang, Z., Wu, X., Garcia, A. J., Fik, T. J. & Tatem, A. J. An Open-Access Modeled Passenger Flow Matrix for the Global Air Network in 2010. 8, e64317 (2013).

Download references

Acknowledgements

This study was supported by Key National Natural Science Foundation of China (Grant No. 42130107), National Key R&D Program of China (No.2017YFA0604402), and the Open Fund of Key Laboratory of Geographic Information Science (Ministry of Education), East China Normal University (Grant No. KLGIS2020A02). Meng Luo thanks Xin Zhao for his help in running the GCAM model. Meng Luo thanks her boyfriend Dalei Hao for all his love and support. Meng Luo thanks her roommate Yvxiao Qi for all her help and encouragement.

Author information

Authors and Affiliations

Key Lab of Geographic Information Science (Ministry of Education), School of Geographic Sciences, East China Normal University, 500 Dongchuan Road, Shanghai, 200241, China
Meng Luo, Guohua Hu, Xiaojuan Liu, Haiyan Hou & Xia Li
Institute of Future Cities, The Chinese University of Hong Kong, Shatin, NT, Hong Kong SAR
Guangzhao Chen

Authors

Meng Luo
View author publications
You can also search for this author in PubMed Google Scholar
Guohua Hu
View author publications
You can also search for this author in PubMed Google Scholar
Guangzhao Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xiaojuan Liu
View author publications
You can also search for this author in PubMed Google Scholar
Haiyan Hou
View author publications
You can also search for this author in PubMed Google Scholar
Xia Li
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.L.: Methodology, Validation, Writing - Original Draft, Writing - Review & Editing. X.L.: Conceptualization, Writing - Review & Editing, Supervision. G.H.H.: Conceptualization, Writing - Review & Editing. H.Y.H.: Software. X.J.L.: Resources. G.Z.C.: Methodology, Resources.

Corresponding authors

Correspondence to Guohua Hu or Xia Li.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Luo, M., Hu, G., Chen, G. et al. 1 km land use/land cover change of China under comprehensive socioeconomic and climate scenarios for 2020–2100. Sci Data 9, 110 (2022). https://doi.org/10.1038/s41597-022-01204-w

Download citation

Received: 12 August 2021
Accepted: 14 February 2022
Published: 28 March 2022
DOI: https://doi.org/10.1038/s41597-022-01204-w

This article is cited by

Robustness of design flood estimates under nonstationary conditions: parameter sensitivity perspective
- Bin Xiong
- Shuchen Zheng
- Chong-Yu Xu
Stochastic Environmental Research and Risk Assessment (2024)
Spatiotemporal variations of non-farming use of cropland in China under different SSP-RCP scenarios
- Linna Linghu
- Peijun Sun
- Yue Wu
Regional Environmental Change (2024)
Mapping the spatial heterogeneity of global land use and land cover from 2020 to 2100 at a 1 km resolution
- Tianyuan Zhang
- Changxiu Cheng
- Xudong Wu
Scientific Data (2023)