共 50 条
- [22] MoDE: A Mixture-of-Experts Model with Mutual Distillation among the Experts THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 14, 2024, : 16067 - 16075
- [25] On the Benefits of Learning to Route in Mixture-of-Experts Models 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 9376 - 9396
- [26] Measurement of the probability of insolvency with mixture-of-experts networks CLASSIFICATION IN THE INFORMATION AGE, 1999, : 421 - 429
- [27] Advances in using hierarchical mixture of experts for signal classification 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 3569 - 3572
- [28] Breaking the gridlock in Mixture-of-Experts: Consistent and Efficient Algorithms INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
- [29] A Mixture-of-Experts Model for Antonym-Synonym Discrimination ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 558 - 564