Publicação
imbalanced-learn-extra
| dc.contributor.author | Douzas, Georgios | |
| dc.contributor.author | Bação, Fernando | |
| dc.contributor.institution | Information Management Research Center (MagIC) - NOVA Information Management School | |
| dc.contributor.institution | NOVA Information Management School (NOVA IMS) | |
| dc.contributor.pbl | Ubiquity Press | |
| dc.date.accessioned | 2026-01-23T10:43:01Z | |
| dc.date.available | 2026-01-23T10:43:01Z | |
| dc.date.issued | 2026-01-19 | |
| dc.description | Douzas, G., & Bação, F. (2026). imbalanced-learn-extra: A Python Package for Novel Oversampling Algorithms. Journal of Open Research Software, 14(1), Article 1. https://doi.org/10.5334/jors.459 --- This work was supported by national funds through FCT (Fundação para a Ciência e a Tecnologia), under the project – UID/04152/2025 – Centro de Investigação em Gestão de Informação (MagIC)/NOVA IMS – https://doi.org/10.54499/UID/04152/2025 (2025-01-01/2028-12-31) and UID/PRR/04152/2025 https://doi.org/10.54499/UID/PRR/04152/2025 (2025-01-01/2026-06-30) and the project ENHANCE – Enhancing Sustainable Travel in Small Cities and Outer Metropolitan Areas (DUT/0004/2022) https://doi.org/10.54499/DUT/0004/2022. | |
| dc.description.abstract | Learning from imbalanced data is a common challenge in supervised learning, as most classifiers assume balanced class distributions. Among the strategies to mitigate this issue, oversampling algorithms offer a flexible and model-agnostic solution by generating synthetic samples for minority classes. In this paper, we introduce the imbalanced-learn-extra Python library, an open-source extension of the imbalanced-learn ecosystem that provides additional oversampling techniques for research and practical use. The library integrates seamlessly with Scikit-Learn, allowing users to easily incorporate it into existing workflows. It implements Geometric SMOTE, a geometrically enhanced drop-in replacement for the original SMOTE algorithm, and clustering-based oversampling methods such as KMeans-SMOTE and G-SOMO, which combine existing imbalanced-learn oversamplers with Scikit-Learn clustering algorithms to address within-class imbalances. Rather than re-assessing the performance of these algorithms, which has already been thoroughly evaluated in prior studies, this paper focuses on their software design, implementation, and practical use within a unified framework. | en |
| dc.description.version | publishersversion | |
| dc.description.version | published | |
| dc.format.extent | 10 | |
| dc.format.extent | 1633004 | |
| dc.identifier.doi | 10.5334/jors.459 | |
| dc.identifier.issn | 2049-9647 | |
| dc.identifier.other | PURE: 151021603 | |
| dc.identifier.other | PURE UUID: 0535d688-f3d1-4142-b065-205f6ca58da1 | |
| dc.identifier.other | Scopus: 105030571433 | |
| dc.identifier.other | ORCID: /0000-0002-0834-0275/work/203443186 | |
| dc.identifier.uri | http://hdl.handle.net/10362/199676 | |
| dc.identifier.url | https://www.scopus.com/pages/publications/105030571433 | |
| dc.identifier.url | https://doi.org/10.5281/zenodo.15561407 | |
| dc.identifier.url | https://github.com/georgedouzas/imbalanced-learn-extra | |
| dc.language.iso | eng | |
| dc.peerreviewed | yes | |
| dc.relation | https://doi.org/10.54499/UID/04152/2025 | |
| dc.relation | https://doi.org/10.54499/UID/PRR/04152/2025 | |
| dc.relation | https://doi.org/10.54499/DUT/0004/2022 | |
| dc.subject | Machine Learning Classification | |
| dc.subject | Imbalanced Learning | |
| dc.subject | Oversampling | |
| dc.subject | Clustering | |
| dc.subject | Geometric SMOTE | |
| dc.subject | Scikit-Learn | |
| dc.subject | Software | |
| dc.subject | Information Systems | |
| dc.subject | Library and Information Sciences | |
| dc.title | imbalanced-learn-extra | en |
| dc.title.subtitle | A Python Package for Novel Oversampling Algorithms | en |
| dc.type | journal article | |
| degois.publication.issue | 1 | |
| degois.publication.title | Journal of Open Research Software | |
| degois.publication.volume | 14 | |
| dspace.entity.type | Publication | |
| rcaap.rights | openAccess |
Ficheiros
Principais
1 - 1 de 1
A carregar...
- Nome:
- imbalanced-learn-extra.pdf
- Tamanho:
- 1.56 MB
- Formato:
- Adobe Portable Document Format
