Farzaneh Parsaie; Ahmad Farrokhian Firouzi; Masoud Davari; Ruhollah Taghizadeh-Mehrjardi
Abstract
Surface soil saturated hydraulic conductivity (Ks), as one of the most important physical properties of soil, plays a key role in the distribution of water and nutrients within the soil environment and holds particular significance in water and soil resource management. This study aimed to digitally ...
Read More
Surface soil saturated hydraulic conductivity (Ks), as one of the most important physical properties of soil, plays a key role in the distribution of water and nutrients within the soil environment and holds particular significance in water and soil resource management. This study aimed to digitally model Ks using machine learning approaches in the Kilanah watershed, located in Kurdistan Province, covering an area of 12,000 hectares. Three machine learning algorithms, including Gradient Boosted Decision Tree (XGBoost), Random Forest (RF), and k-Nearest Neighbors (k-NN), were utilized, incorporating various environmental variables derived from the digital elevation model and Sentinel-2 satellite imagery. These variables included distance from the drainage channel, valley depth, relative slope position, channel base level, brightness index, wind effect index, Normalized Difference Vegetation Index (NDVI), Band 12, greenness index, and surface curvature. Additionally, soil parameters such as organic matter, lime content, bulk density, geometric mean particle diameter, soil texture, and near-soil spectroscopic data (Latent Variable) within the wavelength range of 400–2450 nm were used as proxies for pedogenic factors to model saturated hydraulic conductivity. The results indicated that the XGBoost model exhibited the highest accuracy for predicting Ks, with an R² value of 0.65 and an nRMSE of 0.25, outperforming the other models. Spectral data, topographic variables, and soil parameters, as model inputs, played a significant role in predicting the spatial variability of Ks. The XGBoost model was able to provide highly accurate predictions. The results demonstrated that topographic, physical, and spectral variables influence Ks; organic matter, soil texture, and topographic indices such as slope and relative position had the most substantial impact. The generated maps can be utilized for water and soil resource management and hydrological models.
Farzaneh Parsaie; Ahmad Farrokhian Firouzi; Masoud Davari; Ruhollah Taghizadeh-Mehrjardi
Abstract
Mechanical properties of soil, such as shear strength and penetration resistance, play a crucial role in optimizing crop productivity and proper soil management. The objective of the research was to produce digital map of soil shear strength and penetration resistance in Kielaneh watershed, located in ...
Read More
Mechanical properties of soil, such as shear strength and penetration resistance, play a crucial role in optimizing crop productivity and proper soil management. The objective of the research was to produce digital map of soil shear strength and penetration resistance in Kielaneh watershed, located in Kurdistan Province, covering an area of 12,000 hectares using Gradient Boosted Decision Trees (XGBoost), Random Forest (RF), and k-Nearest Neighbors (KNN). Soil penetration resistance and shear strength were measured using handheld penetrometers and vane shear devices at 150 observation points from the surface soil layer (0 to 10 centimeters). Spectral data and auxiliary variables derived from the Digital Elevation Model and Sentinel-2 satellite images were used to predict soil shear strength and penetration resistance. These variables include CHND, VD, RSP, CHNBL, Brightness, WE, NDVI, Band12, Greenness, PLC, as well as soil parameters such as organic matter, calcium carbonate, bulk density, geometric mean particle size, soil texture (percentages of clay, sand, silt), and visible near-infrared spectral data as latent variable (LT), representing soil formation factors. The results showed that the XGBoost had higher accuracy compared to other models for predicting shear strength in surface soil layer with an (R2) of 0.61 and an nRMSE of 0.16, as well as for predicting penetration resistance in the surface soil layer with an (R2) of 0.60 and an nRMSE of 0.11. In conclusion, the XGBoost model, using spectral data along with topographic variables and soil parameters, was able to estimate the spatial variability of soil mechanical properties with acceptable accuracy in the study area. The generated maps can be used to make necessary management decisions regarding of the region.
Fatemeh Ebrahimi Meymand; Hasan Ramezanpour; Nafiseh Yaghmaeian; Kamran Eftekhari
Abstract
In recent years, the use of digital soil mapping (DSM) based on machine learning algorithms with the aim of preparing soil maps has become widespread with the basis of soil class prediction with the help of modeling the relationships between them and environmental variables. One of this method's challenges ...
Read More
In recent years, the use of digital soil mapping (DSM) based on machine learning algorithms with the aim of preparing soil maps has become widespread with the basis of soil class prediction with the help of modeling the relationships between them and environmental variables. One of this method's challenges is the imbalanced nature of soil distribution in landscape, which leads to overfitting and underfitting of classes, and as a result, reduces the accuracy of many used models. This study was conducted to evaluate the ability of two machine learning algorithms, including random forests and support vector machines, for the digital mapping of soil classes with an imbalanced data set. This study was conducted on 95 soil profile classes at the family level, in 4000 hectares of land in the Honam sub-basin, Lorestan province. The issue of imbalance in soil classes was investigated by using six data sets, including the original soil data set and five data sets created by several resampling approaches including two manual classifications and three over-sampling, under-sampling, and Synthetic Minority Over-Sampling Techniques in the R software. The results showed that despite the low values of overall accuracy, the Geographical distribution of soils with high frequency in the study area in digital soil map obtained from the random forest and the original data set as well as Synthetic Minority Over-Sampling Technique, with conventional soil map of study area is significant. Therefore, the low observation number of other soil classes and as a result incorrect training of models can be considered as one of the main reasons for the low accuracy of the used models.
Zahra Barati; Ebrahim Omidvar; Ataollah Shirzadi
Abstract
Landslide susceptibility mapping is considered as the first important step in landslide risk assessment. The main purpose of this study is to compare the performance of a machine learning algorithm (a logistic model tree), and a statistical model (a logistic regression), for landslide susceptibility ...
Read More
Landslide susceptibility mapping is considered as the first important step in landslide risk assessment. The main purpose of this study is to compare the performance of a machine learning algorithm (a logistic model tree), and a statistical model (a logistic regression), for landslide susceptibility modeling in the Sarkhoon watershed, Chaharmahal and Bakhtiari province. For this purpose, at first, a landslide inventory map including a total of 98 landslide locations was constructed using historical landslides, and extensive field surveys. In addition, a total of 100 non-landslide locations were also identified to construct a database. The landslide and non-landslide locations were randomly selected and divided into two groups with a 70/30 ratio for modelling and validation processes. Twenty conditioning factors were selected based on literature review and geo-environmental properties in the study area. Subsequently, the logistic model tree (LMT) and the logistic regression (LR) models were applied to identify the influence of conditioning factors on landslide occurrence. Finally, the performance of the models in landslide susceptibility mapping was investigated using the area under the receiver operating characteristics curve (AUC). The results concluded that the LR model (AUC = 0.797) outperformed and outclassed the LMT (AUC = 0.740) model in the study area. Although both models were reliable tools for spatial prediction of landslide susceptibility; however, the LR model was more accurate that it can be proposed as an alternative tool for better management of areas prone to landslide in the study area.