Search Results for author: Koki Wataoka

Found 3 papers, 0 papers with code

Self-Preference Bias in LLM-as-a-Judge

no code implementations29 Oct 2024 Koki Wataoka, Tsubasa Takahashi, Ryokan Ri

To explore the causes, we hypothesize that LLMs may favor outputs that are more familiar to them, as indicated by lower perplexity.

MergePrint: Robust Fingerprinting against Merging Large Language Models

no code implementations11 Oct 2024 Shojiro Yamabe, Tsubasa Takahashi, Futa Waseda, Koki Wataoka

As the cost of training large language models (LLMs) rises, protecting their intellectual property has become increasingly critical.

Verbosity Bias in Preference Labeling by Large Language Models

no code implementations16 Oct 2023 Keita Saito, Akifumi Wachi, Koki Wataoka, Youhei Akimoto

In recent years, Large Language Models (LLMs) have witnessed a remarkable surge in prevalence, altering the landscape of natural language processing and machine learning.

reinforcement-learning Reinforcement Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.