1 code implementation • 11 Mar 2024 • Eunsu Kim, Juyoung Suk, Philhoon Oh, Haneul Yoo, James Thorne, Alice Oh
Despite the rapid development of large language models (LLMs) for the Korean language, there remains an obvious lack of benchmark datasets that test the requisite Korean cultural and linguistic knowledge.
no code implementations • 4 Feb 2024 • EuiYul Song, Philhoon Oh, Sangryul Kim, James Thorne
Modern deterministic retrieval pipelines prioritize achieving state-of-the-art performance but often lack interpretability in decision-making.
1 code implementation • 27 Oct 2023 • Philhoon Oh, James Thorne
However, counter-intuitively, too much context can have a negative impact on the model when evaluated on common question answering (QA) datasets.
1 code implementation • 27 Oct 2023 • Yejoon Lee, Philhoon Oh, James Thorne
This error arises when the knowledge corpus used for retrieval is only a subset of the entire string space, potentially excluding more helpful passages that exist outside the corpus.