Search Results for author: Miguel Moura Ramos

Found 1 papers, 0 papers with code

Aligning Neural Machine Translation Models: Human Feedback in Training and Inference

no code implementations15 Nov 2023 Miguel Moura Ramos, Patrick Fernandes, António Farinhas, André F. T. Martins

A core ingredient in RLHF's success in aligning and improving large language models (LLMs) is its reward model, trained using human feedback on model outputs.

Language Modelling Machine Translation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.