Data Oversampling with Structure Preserving Variational Learning

Page view(s)
68
Checked on Feb 21, 2025
Data Oversampling with Structure Preserving Variational Learning
Title:
Data Oversampling with Structure Preserving Variational Learning
Journal Title:
Proceedings of the 31st ACM International Conference on Information & Knowledge Management
Publication Date:
17 October 2022
Citation:
Solomon, I., Jayavelu, S., Ferdaus, M. M., & Kumar, U. (2022). Data Oversampling with Structure Preserving Variational Learning. Proceedings of the 31st ACM International Conference on Information & Knowledge Management. https://doi.org/10.1145/3511808.3557575
Abstract:
Traditional oversampling methods are well explored for binary and multi-class imbalanced datasets. In most cases, the data space is adapted for oversampling the imbalanced classes. It leads to various issues like poor modelling of the structure of the data, resulting in data overlapping between minority and majority classes that lead to poor classification performance of minority class(es). To overcome these limitations, we propose a novel data oversampling architecture called Structure Preserving Variational Learning (SPVL). This technique captures an uncorrelated distribution among classes in the latent space using an encoder-decoder framework. Hence, minority samples are generated in the latent space, preserving the structure of the data distribution. The improved latent space distribution (oversampled training data) is evaluated by training an MLP classifier and testing with unseen test dataset. The proposed SPVL method is applied to various benchmark datasets with i) binary and multi-class imbalance data, ii) high-dimensional data and, iii) large or small-scale data. Extensive experimental results demonstrated that the proposed SPVL technique outperforms the state-of-the-art counterparts.
License type:
Publisher Copyright
Funding Info:
This research / project is supported by the A*STAR - AME Programmatic Fund
Grant Reference no. : A1898b0043

Mphasis Cognitive Computing Centre of Excellence. Grant No 7111
Description:
© { Author | ACM 2022. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in Proceedings of the 31st ACM International Conference on Information & Knowledge Management, http://dx.doi.org/10.1145/3511808.3557575
ISSN:
2381-8549
Files uploaded:

File Size Format Action
cikm-2022-spvl-upload-amended.pdf 1.11 MB PDF Open