Publicação
Evaluating Ensemble Neural Networks as an Alternative to Tree-Based Ensemble Methods for Heart Disease Prediction Using Oversampling Methods
| datacite.subject.fos | Ciências Naturais::Ciências da Computação e da Informação | pt_PT |
| dc.contributor.advisor | Orghian, Diana | |
| dc.contributor.advisor | Pinheiro, Flávio Luís Portas | |
| dc.contributor.author | Duran, Emrecan | |
| dc.date.accessioned | 2025-02-19T15:33:58Z | |
| dc.date.available | 2025-02-19T15:33:58Z | |
| dc.date.issued | 2025-02-13 | |
| dc.description | Dissertation presented as the partial requirement for obtaining a Master's degree in Data Driven Marketing, specialization in Data Science for Marketing | pt_PT |
| dc.description.abstract | Ensemble learning enhances predictive accuracy by combining multiple models, but it often struggles with imbalanced data, which can lead to biased results. To address this challenge, this study explores whether Ensemble Neural Networks (ENN) can be an alternative model to treebased methods like Random Forest (RF), Gradient Boosting (GB), and Extreme Gradient Boosting (XGB) in predicting cardiovascular disease (CVD) and whether it can provide improved results. Unlike single neural networks, ENN combines multiple neural network architectures, like how tree-based models use ensembles of decision trees. This approach might allow ENN to better capture and understand data patterns. To mitigate class imbalance, oversampling techniques such as Random oversampling (ROS), Synthetic minority oversampling technique (SMOTE), Borderline-smote (B-SMOTE), and Adaptive synthetic sampling (ADASYN) are applied. Performance is evaluated using accuracy, F-score, geometric mean (G-mean), and area under the curve (AUC) on three CVD datasets: Heart Disease Health Indicators, Framingham, and Statlog. Results show that ENN, when combined with SMOTE and B-SMOTE, offers a strong alternative for imbalanced classification tasks, though tree-based methods remain more robust in terms of overall performance. | pt_PT |
| dc.identifier.tid | 203921879 | |
| dc.identifier.uri | http://hdl.handle.net/10362/179331 | |
| dc.language.iso | eng | pt_PT |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | pt_PT |
| dc.subject | imbalanced learning | pt_PT |
| dc.subject | oversampling | pt_PT |
| dc.subject | tree-based ensemble algorithms | pt_PT |
| dc.subject | ensemble neural networks | pt_PT |
| dc.subject | SDG 3 - Good health and well-being | pt_PT |
| dc.title | Evaluating Ensemble Neural Networks as an Alternative to Tree-Based Ensemble Methods for Heart Disease Prediction Using Oversampling Methods | pt_PT |
| dc.type | master thesis | |
| dspace.entity.type | Publication | |
| rcaap.rights | openAccess | pt_PT |
| rcaap.type | masterThesis | pt_PT |
| thesis.degree.name | Mestrado em Marketing Analítico, especialização em Ciência de Dados Aplicada ao Marketing | pt_PT |
