Search Results for author: Haoyi Qiu

Found 5 papers, 4 papers with code

VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models

1 code implementation • 22 Apr 2024 • Haoyi Qiu, WenBo Hu, Zi-Yi Dou, Nanyun Peng

To address these issues, we introduce a multi-dimensional benchmark covering objects, attributes, and relations, with challenging images selected based on associative biases.

Hallucination, Informativeness, +2

From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models

1 code implementation • 18 Mar 2024 • Kung-Hsiang Huang, Hou Pong Chan, Yi R. Fung, Haoyi Qiu, Mingyang Zhou, Shafiq Joty, Shih-Fu Chang, Heng Ji

This survey paper serves as a comprehensive resource for researchers and practitioners in the fields of natural language processing, computer vision, and data analysis, providing valuable insights and directions for future research in chart understanding leveraging large foundation models.

Data Visualization

New Job, New Gender? Measuring the Social Bias in Image Generation Models

no code implementations • 1 Jan 2024 • Wenxuan Wang, Haonan Bai, Jen-tse Huang, Yuxuan Wan, Youliang Yuan, Haoyi Qiu, Nanyun Peng, Michael R. Lyu

BiasPainter uses a diverse range of seed images of individuals and prompts the image generation models to edit these images using gender-, race-, and age-neutral queries.

Fairness, Image Generation

AMRFact: Enhancing Summarization Factuality Evaluation with AMR-Driven Negative Samples Generation

1 code implementation • 16 Nov 2023 • Haoyi Qiu, Kung-Hsiang Huang, Jingnong Qu, Nanyun Peng

Prior work on evaluating the factual consistency of summarization often takes an entailment-based approach that first generates perturbed (factually inconsistent) summaries and then trains a classifier on the generated data to detect factual inconsistencies at test time.

Abstractive Text Summarization, Natural Language Inference, +1

Gender Biases in Automatic Evaluation Metrics for Image Captioning

1 code implementation • 24 May 2023 • Haoyi Qiu, Zi-Yi Dou, Tianlu Wang, Asli Celikyilmaz, Nanyun Peng

Model-based evaluation metrics (e.g., CLIPScore and GPTScore) have demonstrated decent correlations with human judgments in various language generation tasks.

Fairness, Image Captioning, +1
