EFFECTIVENESS OF MACHINE LEARNING ALGORITHMS IN CREDIT RISK DETECTION
Keywords:
Credit risk detection, machine learning, credit scoring, logistic regression, Random Forest, XGBoost, artificial neural networks, financial technology, predictive analytics, explainable artificial intelligence.Abstract
Credit risk detection is one of the most significant tasks in the banking and financial sector because inaccurate assessment of borrowers may lead to substantial financial losses and instability in financial institutions. Traditional statistical approaches such as logistic regression have been widely used in credit scoring systems for decades. However, the growth of digital banking, large-scale financial datasets, and computational technologies has encouraged the adoption of machine learning algorithms for more accurate prediction of default risk. This article analyzes the effectiveness of major machine learning algorithms in credit risk detection, including Logistic Regression, Decision Trees, Random Forest, Support Vector Machines, Artificial Neural Networks, and XGBoost. The study is based on recent scientific literature, comparative experimental findings, and empirical evidence from financial datasets. Research findings demonstrate that ensemble learning methods such as Random Forest and XGBoost generally outperform traditional statistical models in predictive accuracy, Area Under Curve (AUC), recall, and precision metrics. At the same time, issues related to interpretability, fairness, regulatory compliance, and imbalanced datasets remain important challenges in practical implementation. The paper also discusses the significance of explainable artificial intelligence (XAI) methods in improving transparency in machine learning-based credit scoring systems. The study concludes that machine learning algorithms significantly improve credit risk detection performance when combined with appropriate preprocessing techniques, feature engineering, and interpretability frameworks.
References
Chen T., Guestrin C. XGBoost: A Scalable Tree Boosting System. 2016. pp. 785–794.
Breiman L. Random Forests. Machine Learning Journal. 2001. pp. 5–32.
Fitzpatrick T., Mues C. An Empirical Comparison of Classification Algorithms for Mortgage Default Prediction. 2015. pp. 1–18.
Alonso A., Carbo-Valverde S., Rodriguez-Fernandez F. Understanding the Performance of Machine Learning Models in Credit Default Prediction. 2021. pp. 10–42.
Mathibela M.R. Predictive Modelling of Credit Default Risk Using Machine Learning. 2026. pp. 15–33.
Ginting A.H., Sembiring R.W., Zamzami E.M. Comparison Analysis: Logistic Regression, Random Forest, XGBoost, and CatBoost in Credit Scoring. 2025. pp. 210–217.
Moscatelli M., et al. Machine Learning Approaches to Credit Risk Assessment. 2019. pp. 45–67.
Paz Á. Machine Learning and Metaheuristics Approach for Individual Credit Risk. 2025. pp. 1–25.
Zhao F. Evaluating Machine Learning Models for Personal Credit Risk. 2025. pp. 232–241.
Fekadu R., Getachew A., Tadele Y., Ali N., Goytom I. Machine Learning Models Evaluation and Feature Importance Analysis on NPL Dataset. 2022. pp. 1–14.
Chen D., Ye W., Ye J. Interpretable Selective Learning in Credit Risk. 2022. pp. 3–19.
Shreya, Pathak H. Explainable Artificial Intelligence Credit Risk Assessment using Machine Learning. 2025. pp. 1–21.
Yang S., Huang Z., Xiao W., Shen X. Interpretable Credit Default Prediction with Ensemble Learning and SHAP. 2025. pp. 5–24.






Azerbaijan
Türkiye
Uzbekistan
Kazakhstan
Turkmenistan
Kyrgyzstan
Republic of Korea
Japan
India
United States of America
Kosovo