Search Results for author: Josselin Somerville Roberts

Found 5 papers, 4 papers with code

Magistral

no code implementations12 Jun 2025 Mistral-AI, :, Abhinav Rastogi, Albert Q. Jiang, Andy Lo, Gabrielle Berrada, Guillaume Lample, Jason Rute, Joep Barmentlo, Karmesh Yadav, Kartik Khandelwal, Khyathi Raghavi Chandu, Léonard Blier, Lucile Saulnier, Matthieu Dinot, Maxime Darrin, Neha Gupta, Roman Soletskyi, Sagar Vaze, Teven Le Scao, Yihan Wang, Adam Yang, Alexander H. Liu, Alexandre Sablayrolles, Amélie Héliou, Amélie Martin, Andy Ehrenberg, Anmol Agarwal, Antoine Roux, Arthur Darcet, Arthur Mensch, Baptiste Bout, Baptiste Rozière, Baudouin De Monicault, Chris Bamford, Christian Wallenwein, Christophe Renaudin, Clémence Lanfranchi, Darius Dabert, Devon Mizelle, Diego de Las Casas, Elliot Chane-Sane, Emilien Fugier, Emma Bou Hanna, Gauthier Delerce, Gauthier Guinet, Georgii Novikov, Guillaume Martin, Himanshu Jaju, Jan Ludziejewski, Jean-Hadrien Chabran, Jean-Malo Delignon, Joachim Studnia, Jonas Amar, Josselin Somerville Roberts, Julien Denize, Karan Saxena, Kush Jain, Lingxiao Zhao, Louis Martin, Luyu Gao, Lélio Renard Lavaud, Marie Pellat, Mathilde Guillaumin, Mathis Felardos, Maximilian Augustin, Mickaël Seznec, Nikhil Raghuraman, Olivier Duchenne, Patricia Wang, Patrick von Platen, Patryk Saffer, Paul Jacob, Paul Wambergue, Paula Kurylowicz, Pavankumar Reddy Muddireddy, Philomène Chagniot, Pierre Stock, Pravesh Agrawal, Romain Sauvestre, Rémi Delacourt, Sanchit Gandhi, Sandeep Subramanian, Shashwat Dalal, Siddharth Gandhi, Soham Ghosh, Srijan Mishra, Sumukh Aithal, Szymon Antoniak, Thibault Schueller, Thibaut Lavril, Thomas Robert, Thomas Wang, Timothée Lacroix, Valeriia Nemychnikova, Victor Paltz, Virgile Richard, Wen-Ding Li, William Marshall, Xuanyu Zhang, Yunhao Tang

We introduce Magistral, Mistral's first reasoning model and our own scalable reinforcement learning (RL) pipeline.

Instruction Following Reinforcement Learning (RL)

Image2Struct: Benchmarking Structure Extraction for Vision-Language Models

1 code implementation29 Oct 2024 Josselin Somerville Roberts, Tony Lee, Chi Heem Wong, Michihiro Yasunaga, Yifan Mai, Percy Liang

The structure is then rendered to produce an output image (e. g., rendered webpage), which is compared against the input image to produce a similarity score.

Benchmarking

VHELM: A Holistic Evaluation of Vision Language Models

1 code implementation9 Oct 2024 Tony Lee, Haoqin Tu, Chi Heem Wong, Wenhao Zheng, Yiyang Zhou, Yifan Mai, Josselin Somerville Roberts, Michihiro Yasunaga, Huaxiu Yao, Cihang Xie, Percy Liang

Current benchmarks for assessing vision-language models (VLMs) often focus on their perception or problem-solving capabilities and neglect other critical aspects such as fairness, multilinguality, or toxicity.

Fairness

Projected Task-Specific Layers for Multi-Task Reinforcement Learning

1 code implementation15 Sep 2023 Josselin Somerville Roberts, Julia Di

Multi-task reinforcement learning could enable robots to scale across a wide variety of manipulation tasks in homes and workplaces.

reinforcement-learning Reinforcement Learning

A Skeleton-based Approach For Rock Crack Detection Towards A Climbing Robot Application

1 code implementation10 Sep 2023 Josselin Somerville Roberts, Paul-Emile Giacomelli, Yoni Gozlan, Julia Di

A new group of metrics, LineAcc, has been proposed for thin object segmentation such that the impact of the object width on the score is minimized.

Object Segmentation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.