Publication

Churn Prediction in Digital Service Platforms

Name: TCDMAA5486.pdf
Size: 1.63 MB
Format: Adobe PDF

Abstract

Customer churn prediction has become an important task for companies operating in competitive digital environments, particularly in non-contractual digital platforms where churn is not directly observable and must be inferred from patterns of user inactivity. This study develops and evaluates machine learning models to predict customer churn in a Portuguese digital service platform characterised by irregular and heterogeneous user activity patterns. Churn is defined using a 180-day inactivity threshold, supported by the distribution of inter-purchase intervals. The project follows the Cross-Industry Standard Process for Data Mining (CRISP-DM) and includes data preparation, feature engineering, and model comparison across several machine learning algorithms, including Logistic Regression, Random Forest, Gradient Boosting, XGBoost, LightGBM, Neural Networks, and a Stacking Ensemble. Special attention is given to class imbalance, as the dataset presents a reversed imbalance structure in which active users represent the minority class. The results show that models trained on the original imbalanced data achieve misleadingly strong performance by favouring the majority class, while the application of SMOTE leads to more balanced predictions across both classes. Among the evaluated models, LightGBM achieved the best overall performance, obtaining the highest F1-score while maintaining good generalisation and computational efficiency. The results also show the importance of handling class imbalance appropriately, selecting suitable evaluation metrics, and designing features that capture customer engagement patterns. In addition, engineered transactional features were shown to provide useful predictive information for churn prediction in non-contractual digital platforms. Overall, the study shows that machine learning models can effectively predict churn in environments characterised by irregular user activity patterns and non-standard class distributions.
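
The inactivity-based churn definition described in the abstract can be illustrated with a minimal sketch. Only the 180-day threshold comes from the abstract; the function name `label_churn`, the data layout, the reference date, and the choice of a strict "more than 180 days" comparison are assumptions made for illustration and are not taken from the dissertation.

```python
from datetime import date

# The 180-day threshold is stated in the abstract; everything else here
# (names, data layout, strict ">" comparison) is a hypothetical illustration.
CHURN_THRESHOLD_DAYS = 180


def label_churn(purchases, reference_date, threshold_days=CHURN_THRESHOLD_DAYS):
    """Map each user ID to True (churned) or False (active).

    purchases: dict of user ID -> non-empty list of purchase dates.
    A user is labelled as churned when more than `threshold_days` have
    elapsed between their most recent purchase and `reference_date`.
    """
    labels = {}
    for user_id, dates in purchases.items():
        days_inactive = (reference_date - max(dates)).days
        labels[user_id] = days_inactive > threshold_days
    return labels


# Toy data: u1 last purchased about a year before the reference date,
# u2 purchased roughly two months before it.
purchases = {
    "u1": [date(2023, 1, 10), date(2023, 3, 2)],
    "u2": [date(2023, 11, 20), date(2024, 1, 5)],
}
labels = label_churn(purchases, reference_date=date(2024, 3, 1))
print(labels)  # {'u1': True, 'u2': False}
```

In a non-contractual setting such as the one studied, a label of this kind is an inference from inactivity rather than an observed cancellation, so the choice of threshold directly shapes the class distribution the models are trained on.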

Description

Dissertation presented as a partial requirement for obtaining a Master's degree in Data Science and Advanced Analytics, specialization in Data Science

Keywords

Churn Prediction; Machine Learning; Digital Service Platforms; Non-contractual Settings; Class Imbalance
