Embedding Physical Augmentation and Wavelet Scattering Transform to Generative Adversarial Networks for Audio Classification with Limited Training Resources

Page view(s)

Checked on Sep 09, 2025

Please use this identifier to cite or link to this item: https://oar.a-star.edu.sg/communities-collections/articles/14382

Title:

Embedding Physical Augmentation and Wavelet Scattering Transform to Generative Adversarial Networks for Audio Classification with Limited Training Resources

Journal Title:

2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

DOI:

10.1109/ICASSP.2019.8683199

Publication URL:

https://doi.org/10.1109/ICASSP.2019.8683199

Authors:

Kah Kuan Teh, Huy Dat Tran

Keywords:

Audio Classification, Limited Training, Augmentation, generative adversarial networks, Wavelet Scattering Transform

Publication Date:

17 April 2019

Citation:

K.K.Teh, Tran Huy Dat, "Embedding Physical Augmentation and Wavelet Scattering Transform to Generative Adversarial Networks for Audio Classification with Limited Training Resources," In ICASSP, 2019

Abstract:

This paper addresses audio classification with limited training resources. We first investigate different types of data augmentation including physical modeling, wavelet scattering transform and Generative Adversarial Networks (GAN). We than propose a novel GAN method to embed physical augmentation and wavelet scattering transform in processing. The experimental results on Google Speech Command show significant improvements of the proposed method when training with limited resources. It could lift up classification accuracy from the best baselines of 62.06% and 77.29% on ResNet, to as far as 91.96% and 93.38%, when training with 10% and 25% training data, respectively.

License type:

PublisherCopyrights

Funding Info:

Description:

URI:

https://oar.a-star.edu.sg/communities-collections/articles/14382

ISSN:

1520-6149
1520-6149

Collections:

Institute for Infocomm Research

Files uploaded:

Manuscripts in This Item:

File	Size	Format	Action
There are no attached files.