Search Results for author: Yan Tai

Found 2 papers, 2 papers with code

REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding

1 code implementation 10 Mar 2025 Yan Tai, Luhao Zhu, Zhiqiang Chen, Ynan Ding, Yiying Dong, Xiaohong Liu, Guodong Guo

To address complex visual decoding scenarios, we introduce the Triplet-Based Referring Paradigm (TRP), which explicitly decouples three critical dimensions in visual decoding tasks through a triplet structure: concepts, decoding types, and targets.

Instruction Following, Keypoint Detection +4

Link-Context Learning for Multimodal LLMs

1 code implementation CVPR 2024 Yan Tai, Weichen Fan, Zhao Zhang, Feng Zhu, Rui Zhao, Ziwei Liu

The abilities to learn from context with novel concepts and to deliver appropriate responses are essential in human conversations.

Few-Shot Learning, In-Context Learning +1
