Search Results for author: Robert Windesheim

Found 2 papers, 2 papers with code

SuperHF: Supervised Iterative Learning from Human Feedback

1 code implementation25 Oct 2023 Gabriel Mukobi, Peter Chatain, Su Fong, Robert Windesheim, Gitta Kutyniok, Kush Bhatia, Silas Alberti

Here, we focus on two prevalent methods used to align these models, Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF).

Language Modelling

Explaining Image Classifiers with Multiscale Directional Image Representation

1 code implementation CVPR 2023 Stefan Kolek, Robert Windesheim, Hector Andrade Loarca, Gitta Kutyniok, Ron Levie

However, the smoothness of a mask limits its ability to separate fine-detail patterns, that are relevant for the classifier, from nearby nuisance patterns, that do not affect the classifier.

Cannot find the paper you are looking for? You can Submit a new open access paper.