Search results

Publication date | Communities | Collections | Article title | Author(s) | Journal/Conference
13 Aug 2025 | SERC | Institute for Infocomm Research | MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations | Ziyang Zhang, Yang Yu, Yucheng Chen, Xulei Yang, Si Yong Yeo | Computer Vision and Pattern Recognition Conference (CVPR)
12 Mar 2025 | SERC | Institute for Infocomm Research | MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders | Wenyu Zhang, Shuo Sun, Bin Wang, Xunlong Zou, Zhuohan Liu, Yingxu He, Geyu Lin, Nancy F. Chen, Ai Ti Aw | ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
10 Nov 2023 | SERC | Institute for Infocomm Research | Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention | Burak Satar, Hongyuan Zhu, Hanwang Zhang, Joo-Hwee Lim | BMVC2023
14 Jul 2023 | SERC | Institute for Infocomm Research | I2R’s End-to-End Speech Translation System for IWSLT 2023 Offline Shared Task | Muhammad Huzaifah, Kye Min Tan, Richeng Duan | Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023)
23 Oct 2022 | SERC | Institute for Infocomm Research | Comparing Classification and Generation Approaches to Situated Reasoning with Vision-language Pre-trained Models | Xin Huang, Hui Li Tan, Jung Jae Kim | European Conference on Computer Vision - Machine Visual Common Sense (ECCV-MVCS) workshop (2022)
7 May 2021 | SERC | Institute for Infocomm Research | Deep Multimodal Transfer Learning for Cross-Modal Retrieval | Liangli Zhen, Peng Hu, Xi Peng, Rick Siow Mong Goh, Joey Tianyi Zhou | IEEE Transactions on Neural Networks and Learning Systems