I2R’s End-to-End Speech Translation System for IWSLT 2023 Offline Shared Task

Page view(s)

132

Checked on Sep 12, 2025

Please use this identifier to cite or link to this item: https://oar.a-star.edu.sg/communities-collections/articles/19216

Title:

I2R’s End-to-End Speech Translation System for IWSLT 2023 Offline Shared Task

Journal Title:

Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023)

DOI:

Publication URL:

https://aclanthology.org/2023.iwslt-1.16/

Authors:

Muhammad Huzaifah, Kye Min Tan, Richeng Duan

Keywords:

speech translation, Representation learning, multimodal learning

Publication Date:

14 July 2023

Citation:

Muhammad Huzaifah, Kye Min Tan, and Richeng Duan. 2023. I2R’s End-to-End Speech Translation System for IWSLT 2023 Offline Shared Task. In Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), pages 202–210, Toronto, Canada (in-person and online). Association for Computational Linguistics.

Abstract:

This paper describes I2R’s submission to the offline speech translation track for IWSLT 2023. We focus on an end-to-end approach for translation from English audio to German text, one of the three available language directions in this year’s edition. The I2R system leverages on pretrained models that have been exposed to large-scale audio and text data for our base model. We introduce several stages of additional pretraining followed by fine-tuning to adapt the system for the downstream speech translation task. The strategy is supplemented by other techniques such as data augmentation, domain tagging, knowledge distillation, and model ensemble, among others. We evaluate the system on several publicly available test sets for comparison.

License type:

Attribution 4.0 International (CC BY 4.0)

Funding Info:

There was no specific funding for the research done

Description:

URI:

https://oar.a-star.edu.sg/communities-collections/articles/19216

ISBN:

2023.iwslt-1.16

Collections:

Institute for Infocomm Research

Files uploaded:

https://aclanthology.org/2023.iwslt-1.16