KDE-Based Ensemble Learning for Imbalanced Data

Kamalov, Firuz; Moussa, Sherif; Avante Reyes, Jorge

KDE-Based Ensemble Learning for Imbalanced Data

Files

Access Instruction 710.pdf (403.01 KB)

Date

2022-09

Authors

Kamalov, Firuz

Moussa, Sherif

Avante Reyes, Jorge

Publisher

MDPI

Abstract

Imbalanced class distribution affects many applications in machine learning, including medical diagnostics, text classification, intrusion detection and many others. In this paper, we propose a novel ensemble classification method designed to deal with imbalanced data. The proposed method trains each tree in the ensemble using uniquely generated synthetically balanced data. The data balancing is carried out via kernel density estimation, which offers a natural and effective approach to generating new sample points. We show that the proposed method results in a lower variance of the model estimator. The proposed method is tested against benchmark classifiers on a range of simulated and real-life data. The results of experiments show that the proposed classifier significantly outperforms the benchmark methods. © 2022 by the authors.

Keywords

data sampling, ensemble method, imbalanced data, kernel density estimate

Citation

Kamalov, F., Moussa, S., & Avante Reyes, J. (2022). KDE-based ensemble learning for imbalanced data. Electronics (Switzerland), 11(17). https://doi.org/10.3390/electronics11172703.

URI

https://doi.org/10.3390/electronics11172703
http://hdl.handle.net/20.500.12519/710

Collections

Department of Electrical Engineering

Full item page

KDE-Based Ensemble Learning for Imbalanced Data

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

item.page.type

item.page.format

Keywords

Citation

URI

DOI

Collections