MILES@ICML 2025

Several papers featuring MILES team members have been accepted to the Forty-Second International Conference on Machine Learning (ICML 2025):

Wrapped Gaussian on the manifold of Symmetric Positive Definite Matrices by Thibault de Surrel, Fabien Lotte, Sylvain Chevallier and Florian Yger (MILES alumnus).
Improving Diversity in Language Models: When Temperature Fails, Change the Loss by Alexandre Vérine (MILES alumnus), Florian Le Bronnec, Kunhao Zheng, Alexandre Allauzen, Yann Chevaleyre and Benjamin Negrevergne.
Exploring Large Action Sets with Hyperspherical Embeddings using von Mises-Fisher Sampling by Walid Bendada, Guillaume Salha-Galvan, Romain Hennequin, Théo Bontempelli, Thomas Bouabca and Tristan Cazenave.
Optimizing Language Models for Inference Time Objectives using Reinforcement Learning by Yunhao Tang, Kunhao Zheng, Gabriel Synnaeve and Rémi Munos.
RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning by Jonas Gehring, Kunhao Zheng, Jade Copet, Vegard Mella, Taco Cohen and Gabriel Synnaeve.
PILAF: Optimal Human Preference Sampling for Reward Modeling by Yuzhen Feng, Ariel Kwiatkowski, Kunhao Zheng, Julia Kempe and Yaqi Duan.
