Kernel density estimation-based sampling for neural network classification

Date
2021
Authors
Kamalov, Firuz
Elnagar, Ashraf
Journal Title
Journal ISSN
Volume Title
Publisher
Institute of Electrical and Electronics Engineers Inc.
Abstract
Imbalanced data occurs in a wide range of scenarios. The skewed distribution of the target variable elicits bias in machine learning algorithms. One of the popular methods to combat imbalanced data is to artificially balance the data through resampling. In this paper, we compare the efficacy of a recently proposed kernel density estimation (KDE) sampling technique in the context of artificial neural networks. We benchmark the KDE sampling method against two base sampling techniques and perform comparative experiments using 8 datasets and 3 neural networks architectures. The results show that KDE sampling produces the best performance on 6 out of 8 datasets. However, it must be used with caution on image datasets. We conclude that KDE sampling is capable of significantly improving the performance of neural networks. © 2021 IEEE.
Description
This conference paper is not available at CUD collection. The version of scholarly record of this paper is published in 2021 International Symposium on Networks, Computers and Communications, ISNCC (2021), available online at: https://doi.org/10.1109/ISNCC52172.2021.9615715
Keywords
Deep learning, Imbalanced data, KDE, Kernel density estimation, Neural networks, Sampling
Citation
Kamalov, F., & Elnagar, A. (2021). Kernel density estimation-based sampling for neural network classification. 2021 International Symposium on Networks, Computers and Communications, ISNCC. https://doi.org/10.1109/ISNCC52172.2021.9615715