15 Nov 2023 • Miguel Moura Ramos, Patrick Fernandes, António Farinhas, André F. T. Martins
A core ingredient in the success of RLHF at aligning and improving large language models (LLMs) is its reward model, which is trained on human feedback over model outputs.
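The abstract does not spell out how such a reward model is trained; a common choice in the RLHF literature is a pairwise Bradley-Terry objective, which pushes the reward of the human-preferred response above that of the rejected one. The sketch below is a minimal, framework-free illustration of that standard loss, not the paper's own implementation; the function name and the scalar-reward setup are assumptions for illustration.

```python
import numpy as np

def preference_loss(r_chosen, r_rejected):
    """Bradley-Terry pairwise loss: -log sigmoid(r_chosen - r_rejected).

    r_chosen / r_rejected are scalar reward-model scores for the response
    the human preferred and the one they rejected, respectively.
    Uses log1p(exp(-x)) as a numerically stable form of -log(sigmoid(x)).
    (Illustrative helper, not from the paper.)
    """
    diff = np.asarray(r_chosen, dtype=float) - np.asarray(r_rejected, dtype=float)
    return float(np.mean(np.log1p(np.exp(-diff))))

# Scoring the preferred response higher yields a small loss;
# scoring it lower yields a large one.
low = preference_loss(2.0, 0.0)   # chosen scored above rejected
high = preference_loss(0.0, 2.0)  # chosen scored below rejected
```

Minimizing this loss over many human-labeled comparison pairs is what turns raw preference annotations into a scalar reward signal usable by RL.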