Search Results for author: Wenyuan Xue

Found 4 papers, 1 paper with code

A Survey on Hallucination in Large Vision-Language Models

no code implementations • 1 Feb 2024 • Hanchao Liu, Wenyuan Xue, Yifei Chen, Dapeng Chen, Xiutian Zhao, Ke Wang, Liping Hou, Rongjun Li, Wei Peng

In this comprehensive survey, we dissect LVLM-related hallucinations in an attempt to establish an overview and facilitate future mitigation.

Hallucination

Align before Adapt: Leveraging Entity-to-Region Alignments for Generalizable Video Action Recognition

no code implementations • 27 Nov 2023 • Yifei Chen, Dapeng Chen, Ruijin Liu, Sai Zhou, Wenyuan Xue, Wei Peng

With the aligned entities, we feed their text embeddings to a transformer-based video adapter as the queries, which can help extract the semantics of the most important entities from a video to a vector.

Action Recognition, Representation Learning, +1

ChartDETR: A Multi-shape Detection Network for Visual Chart Recognition

no code implementations • 15 Aug 2023 • Wenyuan Xue, Dapeng Chen, Baosheng Yu, Yifei Chen, Sai Zhou, Wei Peng

Visual chart recognition systems are gaining increasing attention due to the growing demand for automatically identifying table headers and values from chart images.

Keypoint Detection

TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition

1 code implementation • ICCV 2021 • Wenyuan Xue, Baosheng Yu, Wen Wang, DaCheng Tao, Qingyong Li

A table arranging data in rows and columns is a very effective data structure, which has been widely used in business and scientific research.

Cell Detection, Graph Reconstruction, +1
