no code implementations • 12 Mar 2024 • Yanhong Bai, Jiabao Zhao, Tingjiang Wei, Qing Cai, Liang He
This paper thoroughly analyzes the interpretability of KT algorithms.
no code implementations • 21 Aug 2023 • Yanhong Bai, Jiabao Zhao, Jinxin Shi, Tingjiang Wei, Xingjiao Wu, Liang He
Detecting stereotypes and biases in Large Language Models (LLMs) can enhance fairness and reduce adverse impacts on individuals or groups when these LLMs are applied.