Publication date | Communities | Collections | Article title | Author(s) | Journal/Conference |
---|---|---|---|---|---|
6 Dec 2023 | SERC | Institute for Infocomm Research | HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork | Giang Do, Khiem Le, Quang Pham, TrungTin Nguyen, Thanh-Nam Doan, Binh T. Nguyen, Chenghao Liu, Savitha Ramasamy, Xiaoli Li, Steven Hoi | The 2023 Conference on Empirical Methods in Natural Language Processing |