Joint Learning Feature and Model Adaptation for Unsupervised Acoustic Modelling of Child Speech

Please use this identifier to cite or link to this item: https://oar.a-star.edu.sg/communities-collections/articles/19255

Title:

Joint Learning Feature and Model Adaptation for Unsupervised Acoustic Modelling of Child Speech

Journal Title:

INTERSPEECH 2023

DOI:

10.21437/Interspeech.2023-1302

Publication URL:

http://dx.doi.org/10.21437/interspeech.2023-1302

Authors:

Richeng Duan

Publication Date:

14 August 2023

Citation:

Duan, R. (2023). Joint Learning Feature and Model Adaptation for Unsupervised Acoustic Modelling of Child Speech. INTERSPEECH 2023. https://doi.org/10.21437/interspeech.2023-1302

Abstract:

Due to the high acoustic variability of child speech and the lack of publicly available datasets, acoustic modeling for child speech is challenging. In this work, we address these challenges by leveraging the large amounts of resources for adult speech (well-trained acoustic models and transcribed speech dataset) and proposing a joint acoustic feature and model adaptation framework to minimize acoustic mismatch between adult and child speech. Empirical results on three tasks of speech recognition, pronunciation assessment, and fluency assessment show that our proposed approach consistently outperforms competitive baselines, achieving up to 31.18% phone error reduction on speech recognition and around 7% gains on speech evaluation tasks.

License type:

Publisher Copyright

Funding Info:

This research is supported by core funding from: I2R
Grant Reference no.: SC20-RD120

URI:

https://oar.a-star.edu.sg/communities-collections/articles/19255

ISSN:

1990-9772

Collections:

Institute for Infocomm Research

Files uploaded:

Manuscripts in This Item:

File	Size	Format
thu-o1203.pdf	466.83 KB	PDF