| Nome: | Descrição: | Tamanho: | Formato: | |
|---|---|---|---|---|
| 1.06 MB | Adobe PDF |
Autores
Orientador(es)
Resumo(s)
Addressing the challenge of class imbalance in binary classification, this paper introduces Genetic Methods for OverSampling (GM4OS), an innovative technique leveraging the combined capabilities of Genetic Algorithms (GAs) and Genetic Programming (GP). Traditional oversampling methods like SMOTE and its variants depend on the selected data points and fixed synthetic data generation processes, often leading to suboptimal results. GM4OS advances this field by simultaneously evolving a resampling set and a synthetic data generation function. Individuals in GM4OS are made of two components, the GA component selects minority class observations for resampling, while the GP component evolves functions to create synthetic observations. This dual evolution process aims to optimize both the selection of data points and the creation of synthetic samples, enhancing the performance of classifiers on imbalanced datasets. We studied the performance of GM4OS across ten different test datasets and against five oversampling approaches commonly used in the literature. The results highlight how GM4OS is able to outperform the baseline methods in three out of ten test datasets, improving the algorithm performance.
Descrição
Farinati, D., & Vanneschi, L. (2025). An Empirical Study of GM4OS for Imbalanced Binary Classification. SN Computer Science, 6(5), Article 510. https://doi.org/10.1007/s42979-025-04048-4 --- Open access funding provided by FCT|FCCN (b-on). This work was supported by national funds through FCT (Fundação para a Ciência e a Tecnologia), under the project - UIDB/04152 - Centro de Investigação em Gestão de Informação (MagIC)/NOVA IMS.
Palavras-chave
Oversampling Imbalanced data Genetic programming Genetic algorithms General Computer Science Computer Science Applications Computer Networks and Communications Computer Graphics and Computer-Aided Design Computational Theory and Mathematics Artificial Intelligence
