Prediction of cadmium content using machine learning methods


Keçeci M., Gökmen F., Usul M., Koca C., UYGUR V.

Environmental Earth Sciences, cilt.83, sa.12, 2024 (SCI-Expanded, Scopus) identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 83 Sayı: 12
  • Basım Tarihi: 2024
  • Doi Numarası: 10.1007/s12665-024-11672-5
  • Dergi Adı: Environmental Earth Sciences
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Anahtar Kelimeler: Machine learning, Modelling, R statistics, Soil properties, XGBoost
  • Isparta Uygulamalı Bilimler Üniversitesi Adresli: Evet

Özet

Heavy metals are the most environmentally hazardous pollutions in agricultural soils, threatening humans and several ecosystem services. Cadmium (Cd) is a highly toxic element but distinctively different from other heavy metals with its high mobility in soil environments. The study aimed to evaluate the Cd concentration of soils in the Konya plain with a specific attribute to soil fertilization, mainly phosphorous fertilizers. A total of 538 surface (0–20 cm) soil samples were analyzed to determine basic physical and chemical properties and total phosphorus (P) and Cd concentrations. Descriptive statistics, machine learning, and regression models were used to assess the accumulation of Cd in soils. Decision Trees, Linear Regression, Random Forest, and XGBoost machine learning methods were used in Cd prediction. The XGBoost model proved to be the best prediction model, with a coefficient of determination of 98.1%. Electrical conductivity, pH, CaCO3, silt, and P were used in the Cd estimation of the XGBoost model and explained 56.51% of the total variance in relation to measured soil properties. The results revealed that a machine learning algorithm could be useful for estimating Cd concentration in soils using basic physical and chemical soil properties.