Search results

| Publication date | Communities | Collections | Article title | Author(s) | Journal/Conference |
| --- | --- | --- | --- | --- | --- |
| 13 Sep 2022 | SERC | Institute for Infocomm Research | Discriminative speaker embedding with serialized multi-layer multi-head attention | Hongning Zhu, Kong Aik Lee, Haizhou Li | Speech Communication |
| 27 Apr 2022 | SERC | Institute for Infocomm Research | MFA: TDNN with Multi-Scale Frequency-Channel Attention for Text-Independent Speaker Verification with Short Utterances | Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li | ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
| 27 Apr 2022 | SERC | Institute for Infocomm Research | Self-Supervised Speaker Recognition with Loss-Gated Learning | Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li | ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
| 3 Feb 2022 | SERC | Institute for Infocomm Research | PL-EESR: Perceptual Loss Based End-to-End Robust Speaker Representation Extraction | Yi Ma, Kong Aik Lee, Ville Hautamäki, Haizhou Li | 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) |
| 13 Jan 2022 | SERC | Institute for Infocomm Research | Neural Acoustic-Phonetic Approach for Speaker Verification with Phonetic Attention Mask | Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li | IEEE Signal Processing Letters |
| 30 Aug 2021 | SERC | Institute for Infocomm Research | Serialized Multi-Layer Multi-Head Attention for Neural Speaker Embedding | Hongning Zhu, Kong Aik Lee, Haizhou Li | Interspeech 2021 |
| 30 Aug 2021 | SERC | Institute for Infocomm Research | Multi-Level Transfer Learning from Near-Field to Far-Field Speaker Verification | Li Zhang, Qing Wang, Kong Aik Lee, Lei Xie, Haizhou Li | Interspeech 2021 |
| 29 Jul 2021 | SERC | Institute for Infocomm Research | Multi-Tone Phase Coding of Interaural Time Difference for Sound Source Localization With Spiking Neural Networks | Zihan Pan, Malu Zhang, Jibin Wu, Jiadong Wang, Haizhou Li | IEEE/ACM Transactions on Audio, Speech, and Language Processing |
| 7 May 2021 | SERC | Institute for Infocomm Research | The Psychoacoustics and Synthesis of Singing Harmony | Paul Yaozhu Chan, Minghui Dong, Haizhou Li | Digital Repository of NTU |
| 7 May 2021 | SERC | Institute for Infocomm Research | Deep Bidirectional LSTM Modeling of Timbre and Prosody for Emotional Voice Conversion | Huaiping Ming, Dongyan Huang, Lei Xie, Jie Wu, Minghui Dong, Haizhou Li | Interspeech 2016 |