Faithfulness Critic

1 paper with code • 3 benchmarks • 1 dataset


Most implemented papers

Are self-explanations from Large Language Models faithful?

AndreasMadsen/llm-introspection 15 Jan 2024

For example, if an LLM says a set of words is important for making a prediction, then it should not be able to make its prediction without these words.
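The check described above can be sketched as a small redaction test. This is a simplified illustration, not the paper's evaluation protocol: the names `redact`, `is_faithful`, and `toy_predict` are hypothetical, the `[REDACTED]` mask token is an assumption, and a keyword-based stand-in replaces a real LLM so the example runs on its own.

```python
from typing import Callable, Iterable


def redact(text: str, important_words: Iterable[str], mask: str = "[REDACTED]") -> str:
    """Replace every whitespace-separated token matching an important word.

    Matching ignores case and surrounding punctuation; a matched token is
    replaced wholesale, so punctuation attached to it is dropped too.
    """
    targets = {w.lower() for w in important_words}
    out = []
    for token in text.split():
        core = token.strip(".,!?;:\"'()").lower()
        out.append(mask if core in targets else token)
    return " ".join(out)


def is_faithful(
    predict: Callable[[str], str],
    text: str,
    important_words: Iterable[str],
) -> bool:
    """Return True if removing the claimed-important words changes the prediction."""
    original = predict(text)
    without_words = predict(redact(text, important_words))
    return original != without_words


def toy_predict(text: str) -> str:
    """Hypothetical stand-in for an LLM classifier (keyword lookup only)."""
    lowered = text.lower()
    if "great" in lowered:
        return "positive"
    if "awful" in lowered:
        return "negative"
    return "unknown"


review = "The acting was great, but the plot dragged."

# Claimed explanation: "great" drives the prediction.
print(is_faithful(toy_predict, review, ["great"]))  # True

# Claimed explanation: "plot" drives the prediction.
print(is_faithful(toy_predict, review, ["plot"]))  # False
```

In this sketch, an explanation counts as faithful only when the prediction changes after the named words are masked. To test a real model, `toy_predict` would be replaced by a function that queries the model twice: once for the prediction on the original text and once on the redacted text.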