Several papers featuring MILES team members have been accepted to the Forty-Second International Conference on Machine Learning (ICML 2025):
- Wrapped Gaussian on the manifold of Symmetric Positive Definite Matrices by Thibault de Surrel, Fabien Lotte, Sylvain Chevallier and Florian Yger (MILES alumni);
- Improving Diversity in Language Models: When Temperature Fails, Change the Loss by Alexandre Vérine (MILES Alumni), Florian Le Bronnec, Kunhao Zheng, Alexandre Allauzen, Yann Chevaleyre, and Benjamin Negrevergne.
- Exploring Large Action Sets with Hyperspherical Embeddings using von Mises-Fisher Sampling by Walid Bendada, Guillaume Salha-Galvan, Romain Hennequin, Théo Bontempelli, Thomas Bouabca and Tristan Cazenave.
- Optimizing Language Models for Inference Time Objectives using Reinforcement Learning by Yunhao Tang, Kunhao Zheng, Gabriel Synnaeve and Rémi Munos
- RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning by Jonas Gehring, Kunhao Zheng, Jade Copet, Vegard Mella, Taco Cohen and Gabriel Synnaeve
- PILAF: Optimal Human Preference Sampling for Reward Modeling by Yuzhen Feng, Ariel Kwiatkowski, Kunhao Zheng, Julia Kempe and Yaqi Duan.