Search Results for author: Chenghang Lai

Found 2 papers, 1 paper with code

Weakly Supervised Gaussian Contrastive Grounding with Large Multimodal Models for Video Question Answering

no code implementations 19 Jan 2024 Haibo Wang, Chenghang Lai, Yixuan Sun, Weifeng Ge

GCG learns multiple Gaussian functions to characterize the temporal structure of the video, and samples question-critical frames as positive moments to be the visual inputs of LMMs.

Question Answering, Video Question Answering

Weakly Supervised Learning of Semantic Correspondence through Cascaded Online Correspondence Refinement

1 code implementation ICCV 2023 Yiwen Huang, Yixuan Sun, Chenghang Lai, Qing Xu, Xiaomei Wang, Xuli Shen, Weifeng Ge

Following the spirit of multiple instance learning (MIL), we decompose the weakly supervised correspondence learning problem into three stages: image-level matching, region-level matching, and pixel-level matching.

Multiple Instance Learning, Semantic Correspondence, +1
