Utilize este identificador para referenciar este registo:
http://hdl.handle.net/10362/159405
Título: | Big Data for Credit Risk Analysis |
Autor: | Ashofteh, Afshin |
Palavras-chave: | Credit score Big Data Machine learning Risk Management Finance SDG 8 - Decent Work and Economic Growth SDG 9 - Industry, Innovation, and Infrastructure SDG 10 - Reduced Inequalities SDG 17 - Partnerships for the Goals |
Data: | 19-Out-2023 |
Editora: | Springer, Cham |
Resumo: | Recently, Big Data has become an increasingly important source to support traditional credit scoring. Personal credit evaluation based on machine learning approaches focuses on the application data of clients in open banking and new banking platforms with challenges about Big Data quality and model risk. This paper represents a PySpark code for computationally efficient use of statistical learning and machine learning algorithms for the application scenario of personal credit evaluation with a performance comparison of models including logistic regression, decision tree, random forest, neural network, and support vector machine. The findings of this study reveal that the logistic regression methodology represents a more reasonable coefficient of determination and a lower false negative rate than other models. Additionally, it is computationally less expensive and more comprehensible. Finally, the paper highlights the steps, perils, and benefits of using Big Data and machine learning algorithms in credit scoring. |
Descrição: | Ashofteh, A. (2023). Big Data for Credit Risk Analysis: Efficient Machine Learning Models Using PySpark. In J. Pilz, V. B. Melas, & A. Bathke (Eds.), Statistical Modeling and Simulation for Experimental Design and Machine Learning Applications: Selected Contributions from SimStat 2019 and Invited Papers (pp. 245-265). (Contributions to Statistics). Springer, Cham. https://doi.org/10.1007/978-3-031-40055-1_14 |
Peer review: | yes |
URI: | http://hdl.handle.net/10362/159405 |
DOI: | https://doi.org/10.1007/978-3-031-40055-1_14 |
ISBN: | 978-3-031-40054-4 978-3-031-40057-5 978-3-031-40055-1 |
ISSN: | 1431-1968 |
Aparece nas colecções: | NIMS: MagIC - Documentos de conferências internacionais |
Ficheiros deste registo:
Ficheiro | Descrição | Tamanho | Formato | |
---|---|---|---|---|
Big_Data_for_Credit_Risk_Analysis_Efficient_Machine_Learning_Models_Using_PySpark_Author_Accepted_Manuscript.pdf | 2,47 MB | Adobe PDF | Ver/Abrir |
Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.