Search Results for author: Yuxi Xia

Found 6 papers, 3 papers with code

Influences on LLM Calibration: A Study of Response Agreement, Loss Functions, and Prompt Styles

no code implementations · 7 Jan 2025 · Yuxi Xia, Pedro Henrique Luz de Araujo, Klim Zaporojets, Benjamin Roth

Concretely, we build Calib-n, a novel framework that trains an auxiliary model for confidence estimation that aggregates responses from multiple LLMs to capture inter-model agreement.

Black-box Model Ensembling for Textual and Visual Question Answering via Information Fusion

1 code implementation · 4 Jul 2024 · Yuxi Xia, Klim Zaporojets, Benjamin Roth

A diverse range of large language models (LLMs), e.g., ChatGPT, and visual question answering (VQA) models, e.g., BLIP, have been developed for solving textual and visual question answering tasks.

Question Answering, Visual Question Answering

Exploring prompts to elicit memorization in masked language model-based named entity recognition

no code implementations · 5 May 2024 · Yuxi Xia, Anastasiia Sedova, Pedro Henrique Luz de Araujo, Vasiliki Kougia, Lisa Nußbaumer, Benjamin Roth

Finally, the prompt performance of detecting model memorization is quantified by the percentage of name pairs for which the model has higher confidence for the name from the training set.

Language Modeling, Language Modelling, +4
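
The metric described in the snippet above (the percentage of name pairs for which the model is more confident in the name from the training set) can be illustrated with a minimal sketch. The function name `memorization_detection_rate`, the pair layout, and the example confidence values are all hypothetical and not taken from the paper; counting ties as misses is also an assumption.

```python
from typing import Sequence, Tuple


def memorization_detection_rate(pairs: Sequence[Tuple[float, float]]) -> float:
    """Return the percentage of name pairs where the in-training name wins.

    Each pair is (confidence for the name seen in training,
                  confidence for the name not seen in training).
    Assumption: a tie does not count as a win (strict comparison).
    """
    if not pairs:
        raise ValueError("need at least one name pair")
    wins = sum(1 for in_train, out_train in pairs if in_train > out_train)
    return 100.0 * wins / len(pairs)


# Hypothetical confidences for four name pairs under one prompt.
example_pairs = [(0.91, 0.40), (0.55, 0.62), (0.73, 0.73), (0.80, 0.10)]
rate = memorization_detection_rate(example_pairs)
print(f"{rate:.1f}% of pairs favour the training-set name")
```

Under this reading, a higher percentage suggests that the prompt is better at surfacing memorized names.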

Specification Overfitting in Artificial Intelligence

no code implementations · 13 Mar 2024 · Benjamin Roth, Pedro Henrique Luz de Araujo, Yuxi Xia, Saskia Kaltenbrunner, Christoph Korab

Machine learning (ML) and artificial intelligence (AI) approaches are often criticized for their inherent bias and for their lack of control, accountability, and transparency.

Fairness

WAFFLE: Watermarking in Federated Learning

1 code implementation · 17 Aug 2020 · Buse Gul Atli, Yuxi Xia, Samuel Marchal, N. Asokan

In this paper, we present WAFFLE, the first approach to watermark DNN models trained using federated learning.

Federated Learning
