1 code implementation • 1 Aug 2023 • Itay Itzhak, Gabriel Stanovsky, Nir Rosenfeld, Yonatan Belinkov
Recent studies show that instruction tuning (IT) and reinforcement learning from human feedback (RLHF) dramatically improve the abilities of large language models (LMs).
1 code implementation • NAACL 2022 • Itay Itzhak, Omer Levy
Standard pretrained language models operate on sequences of subword tokens without direct access to the characters that compose each token's string representation.
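To make the subword setup concrete, here is a minimal sketch. It assumes a hypothetical toy vocabulary (`TOY_VOCAB`) and a simple greedy longest-match segmenter (`tokenize`); neither comes from the paper or from any particular model's tokenizer. It only illustrates that the model's input is a sequence of integer IDs, with no explicit record of the characters behind each ID.

```python
# Hypothetical toy vocabulary for illustration only; real subword
# tokenizers (e.g. BPE or WordPiece) learn their vocabularies from data.
TOY_VOCAB = {"un": 0, "break": 1, "able": 2, "cat": 3, "s": 4, "<unk>": 5}


def tokenize(text, vocab=TOY_VOCAB):
    """Segment `text` into subword token IDs by greedy longest match."""
    ids = []
    i = 0
    while i < len(text):
        # Try the longest substring starting at position i first.
        for j in range(len(text), i, -1):
            piece = text[i:j]
            if piece in vocab:
                ids.append(vocab[piece])
                i = j
                break
        else:
            # Nothing in the vocabulary matches here: emit the unknown
            # token for this single character and move on.
            ids.append(vocab["<unk>"])
            i += 1
    return ids


# The model would receive only these integers, not the letters themselves.
print(tokenize("unbreakable"))
print(tokenize("cats"))
```

Under this toy vocabulary, the ID for "able" says nothing about the token containing the letters a, b, l, e; any such knowledge would have to be learned indirectly.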