Search Results for author: Wenyuan Xue

Found 4 papers, 1 papers with code

A Survey on Hallucination in Large Vision-Language Models

no code implementations1 Feb 2024 Hanchao Liu, Wenyuan Xue, Yifei Chen, Dapeng Chen, Xiutian Zhao, Ke Wang, Liping Hou, Rongjun Li, Wei Peng

In this comprehensive survey, we dissect LVLM-related hallucinations in an attempt to establish an overview and facilitate future mitigation.

Hallucination

Align before Adapt: Leveraging Entity-to-Region Alignments for Generalizable Video Action Recognition

no code implementations27 Nov 2023 Yifei Chen, Dapeng Chen, Ruijin Liu, Sai Zhou, Wenyuan Xue, Wei Peng

With the aligned entities, we feed their text embeddings to a transformer-based video adapter as the queries, which can help extract the semantics of the most important entities from a video to a vector.

Action Recognition Representation Learning +1

ChartDETR: A Multi-shape Detection Network for Visual Chart Recognition

no code implementations15 Aug 2023 Wenyuan Xue, Dapeng Chen, Baosheng Yu, Yifei Chen, Sai Zhou, Wei Peng

Visual chart recognition systems are gaining increasing attention due to the growing demand for automatically identifying table headers and values from chart images.

Keypoint Detection

TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition

1 code implementation ICCV 2021 Wenyuan Xue, Baosheng Yu, Wen Wang, DaCheng Tao, Qingyong Li

A table arranging data in rows and columns is a very effective data structure, which has been widely used in business and scientific research.

Cell Detection Graph Reconstruction +1

Cannot find the paper you are looking for? You can Submit a new open access paper.