Logo do repositório
 
Publicação

Machine Learning Techniques to reveal abnormal behaviour in client profiles and vehicle characteristics in the non-life insurance context

dc.contributor.advisorCastelli, Mauro
dc.contributor.authorCandeias, Catarina Sofia Domingues
dc.date.accessioned2023-04-27T10:14:17Z
dc.date.available2024-04-10T00:34:28Z
dc.date.issued2023-04-10
dc.descriptionInternship Report presented as the partial requirement for obtaining a Master's degree in Data Science and Advanced Analytics, specialization in Business Analyticspt_PT
dc.description.abstractAnomalies are everywhere, and neither can we discard such truth in the business context. From intrusion detection for computer network systems to fraud detection and credit risk analysis, abnormalities are an unavoidable component of practically every known system. Insurance companies have registered significant growth over the last few years with the support of machine learning techniques and technological advancements. Several studies have discussed the best-unsupervised anomaly detection algorithm for each business problem and domain. Algorithms’ enhancements and novel models’ proposals are the most typical subject addressed. Nonetheless, fewer studies have been made regarding the identification of abnormal behaviour in client profiles and vehicle characteristics that may influence the two main measures in a non-life insurance field: the frequency and the severity. This project aims to respond to this need by experimenting with different clustering techniques, such as DBSCAN and OPTICS, and distinct unsupervised anomaly detection models, such as Isolation Forest, Extended Isolation Forest and Local Outlier Factor, on a real-world dataset provided by an insurance company that operates in Portugal. In doing so, its impact on the pricing and underwriting rules allows the attribution of an equitable tariff for the insurance entity and its customers. The implementation of the Isolation Forest algorithm for the whole dataset outperforms the remaining models by achieving an AUC score of approximately 0.86. The development of this project, besides supporting the decision-making process on identifying unsought clients in the insurance context, also contributes to broadening the knowledge of existing state-of-the-art anomaly detection algorithms and their performances.pt_PT
dc.identifier.tid203268342pt_PT
dc.identifier.urihttp://hdl.handle.net/10362/152173
dc.language.isoengpt_PT
dc.subjectAnomaly Detectionpt_PT
dc.subjectUnsupervised Learningpt_PT
dc.subjectClusteringpt_PT
dc.subjectNon-life Insurancept_PT
dc.subjectIsolation Forestpt_PT
dc.subjectArea Under the Curvept_PT
dc.titleMachine Learning Techniques to reveal abnormal behaviour in client profiles and vehicle characteristics in the non-life insurance contextpt_PT
dc.typemaster thesis
dspace.entity.typePublication
rcaap.embargofct"Relativamente ao motivo de embargo, uma vez que se trata de informação providenciada por uma empresa, a mesma apresenta informação atualizada ao presente ano (na verdade, informação desde 2017 a 2022). Desta forma, daqui a algum tempo, a mesma informação não será tão relevante, e é mais viável a apresentação da mesma à sociedade, sem qualquer problema adicional."pt_PT
rcaap.rightsopenAccesspt_PT
rcaap.typemasterThesispt_PT
thesis.degree.nameMestrado em Ciência de Dados e Métodos Analíticos Avançados, especialização em Métodos Analíticos para a Gestãopt_PT

Ficheiros

Principais
A mostrar 1 - 1 de 1
A carregar...
Miniatura
Nome:
TCDMAA1734.pdf
Tamanho:
2.9 MB
Formato:
Adobe Portable Document Format
Licença
A mostrar 1 - 1 de 1
Miniatura indisponível
Nome:
license.txt
Tamanho:
348 B
Formato:
Item-specific license agreed upon to submission
Descrição: