May 13, 2025

Jo Mai Asian Culture

Embrace Artistry Here

Assessment and simulation of eco-environmental quality changes in rapid rural urbanization: Xiong’an New Area, China

Assessment and simulation of eco-environmental quality changes in rapid rural urbanization: Xiong’an New Area, China

RSEI ecological index

RSEI is a regional EEQ assessment index that is completely based on remote sensing technology. Compared to traditional ecological indices, the most notable improvement in RSEI is its selection of four important indicators, namely greenness, wetness, heat, and dryness, which directly reflect the ecological quality14. These indicators are used for dimensionality reduction calculation, which not only avoid one-sided evaluation of a single indicator, but also integrate the information from multiple indicators into a concise visual index or result. The functional form of RSEI can be expressed as:

$$RSEI = f\left( Greenness,Wetness,Heat,Dryness \right)$$

(1)

NDVI is capable of reliably assessing vegetation quality and productivity, and it has the advantages of being concise and easy to calculate30. NDVI is currently the most widely used remote sensing vegetation index. In this study, greenness indicator in RSEI is represented by NDVI. NDVI is expressed as:

$$NDVI = \left( NIR – Red \right)/\left( NIR + Red \right)$$

(2)

where NIR and Red are the near-infrared (NIR) band and red band of Landsat-8 reflectance images, respectively.

Tasseled cap transformation (TCT) is a valuable tool for compressing spectral data into a few components that correspond to the physical characteristics of a scene, while minimizing information loss31. Among them, the wetness component is associated with soil and plant moistures, water and other moist features32. We used wetness component to characterize the wetness indicator in RSEI. The wetness component of TCT is expressed as31:

$$wetness = 0.1511Blue + 0.1973Green + 0.3283Red + 0.3407NIR – 0.7117SWIR1 – 0.4559SWIR2$$

(3)

where Blue, Green, Red, NIR, SWIR1 and SWIR2 corresponding to the blue, green, red, NIR, short-infrared 1 (SWIR1), and short-infrared 2 (SWIR2) bands of Landsat-8 reflectance images, respectively.

Dryness indicator is mainly used to quantitatively depict the “dryness” characteristics on the land surface caused by the replacement of natural cover with artificial surface due to human activities in a given region. In studies focusing on urban built-up area, commonly used indices such as the building/built-up index or the impervious surface index were employed to represent this indicator. However, unlike urban built-up areas, Xiong’an New Area is located in rural areas. Apart from the rural settlements, there are currently large areas of bare soil in this region, which are primarily caused by land leveling activities during the early stages of engineering construction and crop harvesting. In this study, therefore, the normalized difference impervious surface index (NDISI)33 and the normalized difference soil index (NDSI)34 were combined to construct the normalized difference impervious surface and soil index (NDISSI) to represent the Dryness indicator in RSEI. The NDISSI is expressed as:

$$NDISSI = \left( NDISI_\textnor + NDSI_\textnor \right)/2$$

(4)

where NDISInor and NDSInor are normalized NDISI and NDSI, respectively. NDISI and NDSI are expressed as:

$$NDISI = \fracTIR – \left[ \left( WI + NIR + SWIR1 \right)/3 \right]TIR + \left[ \left( WI + NIR + SWIR1 \right)/3 \right]$$

(5)

$$NDSI = \left( SWIR1 – NIR \right)/\left( SWIR1 + NIR \right)$$

(6)

where TIR is the thermal band; WI represents water index-derived band and can be represented by the modification of normalized difference water index (MNDWI)35.

The normalization algorithm can be expressed as:

$$I_\textnor = \fracI_i – I_\min {I_\max – I_\min }$$

(7)

where Inor is the normalized index or indicator; Ii represents the value of the index or indicator in pixel i; Imax and Imin represent the maximum and minimum values of the index or indicator, respectively.

Heat indicator in RSEI is represented by LST. In this study, Landsat-8 TIRS band 10 was utilized. The single channel (SC) algorithm proposed by Jiménez-Muñoz and Sobrino36, Jiménez-Muñoz et al.37, and Cristóbal et al.38 was adopted to retrieve LST. The SC algorithm is expressed as follows:

$$LST = \gamma \times \left[ \varepsilon ^ – 1 \times \left( \psi _1 \times L + \psi _2 \right) + \psi _3 \right] + \delta$$

(8)

$$\gamma = T^2 /\left( b_\gamma \times L \right)$$

(9)

$$\delta \approx T – T^2 /b_\gamma $$

(10)

where L is the at-sensor spectral radiance of TIRS band 10; ε is the land surface emissivity; γ and δ are two parameters dependent on Planck’s function; bγ = 1324 for TIRS band 10; ψ1, ψ2, and ψ3 are the atmospheric functions calculated as:

$$\psi _1 = 1/\tau ,\psi _2 = – L^ \downarrow – L^ \uparrow /\tau ,\psi _3 = L^ \uparrow $$

(11)

where τ is the atmospheric transmissivity, and L and L are the upwelling and downwelling atmospheric radiance, respectively. T is the at-sensor brightness temperature calculated as follows:

$$T = K_2 /\ln \left( K_1 /L + 1 \right)$$

(12)

where K1 and K2 are the band-specific thermal conversion constants, K1 = 774.89 (W/m2/sr/µm) and K2 = 1321.08 K for TIRS band 10.

The PCA algorithm, a multivariate statistical method, was utilized in the construction of RSEI. Through rotating the coordinate axes of feature spectral space to maximize the removal of correlation among different indicators, the key information of multivariate is concentrated into the first few principal components, such as the first principal component (PC1). The advantage of PCA lies in its utilization of covariance matrix for automatic and objective calculation of the weight distribution for each indicator, and in the objective determination of the contribution of each principal component through the corresponding eigenvalue. Any weighting structure is susceptible to criticism, as assigning weights is a subjective value-dependent process39. While, PCA effectively avoids result biases caused by arbitrary subjective weighting of indicators14. The RSEI function in Eq. (1) can be further divided into the following two steps:

$$RSEI_0 = 1 – \textPC1\left[ f\left( NDVI,Wetness,LST,NDISSI \right) \right]$$

(13)

$$RSEI = RSEI_{{0\:{\textnor}}}$$

(14)

It is important to note that due to the different units of measurement among indicators, it is necessary to normalize each indicator using Eq. (7) before performing PCA. The normalization process ensures that all the indicators are on a unified scale within the range of 0 to 1. Additionally, if there are large water bodies in the study area, they may have an impact on the analysis of indicators loading in PCA14. In such cases, the MNDWI can be used to mask out the water information. The resulting RSEI is referred to as the normalized RSEI0 (RSEI0 nor), which has value range from 0 to 1. A higher value of RSEI, that is closer to 1, indicates a better EEQ, while a value of 0 denotes an extremely poor one.

XGBoost algorithm

The XGBoost algorithm, initially proposed by Chen et al.40, is an optimized distributed gradient boosting library that has found extensive applications in various fields, including the built environment41,42, natural geography43, and agricultural and biological44. It has demonstrated high accuracy in these domains. As a ensemble decision trees, XGBoost initializes with a base prediction and then iteratively constructs trees to fit the residuals (i.e., the differences between the observed and predicted values) from the previous iterations. This process repeats multiple times, allowing XGBoost to achieve highly accurate predictions45. One notable advantage of XGBoost is its ability to refine its predictions through multiple rounds of iterations over the residuals42.

XGBoost is a highly scalable machine learning system developed from the concept of Gradient Boost Decision Tree (GBDT)43, with multiple adjusted hyperparameters. Configuring these hyperparameters appropriately is a crucial step in achieving model accuracy. In this study, the seven hyperparameters, namely eta, gamma, max_depth, min_child_weight, subsample, colsample_bytree, and nrounds, were iteratively calculated within a preset hyperparameters tuning space (Table 2). A 5-fold cross validation approach46 was employed to determine the optimal hyperparameters combination that minimizes the model’s error. Furthermore, the Shapley Additive exPlanation (SHAP) analysis is based on game theory and relies on SHAP values to determine the importance of individual independent variables47. It takes into account their marginal contribution to the XGBoost regression model outcome.

Table 2 The hyperparameters of the XGBoost algorithm used in this study and their corresponding tuning space.

Multi-scenario simulation

This study was based on the 2021 conditions of study area as the basis for multi-scenario simulation. The illustration of multi-scenario simulation workflow is shown in Fig. 2. First, the built-up areas and bare soil areas under development within the study area were extracted using the NDISSI threshold, forming the initial region A1. Then, concentric buffer zones of 200 m, 400 m, and 600 m were created around A1, forming buffer regions A2, A3, A4, respectively. Next, relevant urban-rural construction features of the current study area in 2021 (S0) was obtained, including NDVInor, NDISInor, percentage of the building rooftop (PB), population density (PopD), and building height (BH). These construction features were then progressively changed by 5% starting from region A1 with a step length of 200 m to simulate the possibility of urban construction in multiple scenarios. Four scenarios were simulated as follows: Scenario One (S1) involved a 5% progressive change in the urban-rural construction features of region A1 based on current situation (S0), specifically a decrease of 5% in NDVInor and an increase of 5% in NDISInor, PB, PopD, and BH, while other non-A1 regions remain unchanged. The modified urban-rural construction features were used in the XGBoost regression model to simulate the RSEI, resulting in S1. Similarly, Scenario Two (S2) was built upon S1 with a progressive change of 5% in the construction features for regions A1 and A2. Likewise, Scenario Three (S3) was based on S2 and involved a change of 5% in the construction features for regions A1, A2, and A3. Finally, Scenario Four (S4) was built upon S3 with a change of 5% in the construction features for regions A1, A2, A3, and A4.

Fig. 2
figure 2

Illustration of multi-scenario simulation workflow.

It is worth noting that during the simulation, if NDVInor falls below 0, it is adjusted to 0. If NDISInor and PB exceed values of 1 or 100%, they are constrained to 1 or 100% accordingly. Additionally, in the newly added buffer region, if PopD and BH are both 0, the minimum non-zone values within that region are utilized as substitute.

link

Copyright © All rights reserved. | Newsphere by AF themes.