Logo do repositório
 
Publicação

Customer Churn Prediction in Subscription-Based Services: An Educational Technology Company Use Case

datacite.subject.fosCiências Naturais::Ciências da Computação e da Informaçãopt_PT
dc.contributor.advisorAntónio, Nuno Miguel da Conceição
dc.contributor.authorGavrilova, Liubov
dc.date.accessioned2025-11-18T11:59:03Z
dc.date.embargo2028-11-03
dc.date.issued2025-11-03
dc.descriptionDissertation presented as the partial requirement for obtaining a Master's degree in Data Driven Marketing, specialization in Digital Marketing and Analyticspt_PT
dc.description.abstractSubscription-based educational technology (EdTech) companies face significant revenue loss due to customer churn, making retention vital. Retaining engaged learners is generally more cost-effective than acquiring new ones. This study develops a predictive churn model using anonymised student data from Company X, an EdTech platform for children’s English learning, covering the period from April 2024 to February 2025. The real name of the company has been intentionally left out to ensure confidentiality. Following a CRISP-DM process, student activity, engagement, and financial features were engineered over multiple time windows, capturing evolving student behaviours. Four machine learning (ML) algorithms - logistic regression (LR), random forest (RF), neural networks (NN), and XGBoost (XGB) - were trained and compared, employing resampling techniques (RUS+SMOTE) to address class imbalance. The optimised XGB model achieved the best performance, with approximately 0.84 accuracy, 0.63 F1 score, 0.85 recall, and the area under the curve (AUC) of 0.85 on test data, effectively identifying likely churners. Shapley Additive exPlanations (SHAP) based analysis revealed that engagement metrics, particularly the number of paid classes (both current and mean over 8 weeks), total learning time, engagement at gamified features, and customer tenure, were the most influential predictors, confirming that highly engaged students are less likely to churn. This interpretable model provides actionable insights for retention strategies by predicting individual churn risk and highlighting key engagement drivers. Practically, even a 1% monthly reduction in churn could translate to multi-million-dollar annual savings for subscription EdTech providers. Overall, this research extends churn prediction into the EdTech domain, demonstrates the value of long-term engagement features, and applies explainable AI to enhance model transparency, thereby supporting its practical adoption.pt_PT
dc.identifier.tid204073448
dc.identifier.urihttp://hdl.handle.net/10362/190936
dc.language.isoengpt_PT
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/pt_PT
dc.subjectCustomer Churn Predictionpt_PT
dc.subjectMachine Learningpt_PT
dc.subjectEducational Technologypt_PT
dc.subjectSubscription Business Modelpt_PT
dc.subjectSDG 4 - Quality educationpt_PT
dc.subjectSDG 8 - Decent work and economic growthpt_PT
dc.subjectSDG 9 - Industry, innovation and infrastructurept_PT
dc.subjectSDG 11 - Sustainable cities and communitiespt_PT
dc.subjectSDG 12 - Responsible production and consumptionpt_PT
dc.titleCustomer Churn Prediction in Subscription-Based Services: An Educational Technology Company Use Casept_PT
dc.typemaster thesis
dspace.entity.typePublication
rcaap.embargofctI would like to publish paper in a scientific journal or conferencept_PT
rcaap.rightsembargoedAccesspt_PT
rcaap.typemasterThesispt_PT
thesis.degree.nameMestrado em Marketing Analítico, especialização em Digital Marketing and Analyticspt_PT

Ficheiros

Principais
A mostrar 1 - 1 de 1
Miniatura indisponível
Nome:
TDDM4269.pdf
Tamanho:
2.75 MB
Formato:
Adobe Portable Document Format
Licença
A mostrar 1 - 1 de 1
Miniatura indisponível
Nome:
license.txt
Tamanho:
348 B
Formato:
Item-specific license agreed upon to submission
Descrição: