Search Results for author: Zequn Zeng

Found 5 papers, 4 papers with code

MeaCap: Memory-Augmented Zero-shot Image Captioning

1 code implementation • 6 Mar 2024 • Zequn Zeng, Yan Xie, Hao Zhang, Chiyu Chen, Zhengjue Wang, Bo Chen

The framework of MeaCap achieves the state-of-the-art performance on a series of zero-shot IC settings.

Paper
Code

SnapCap: Efficient Snapshot Compressive Video Captioning

no code implementations • 10 Jan 2024 • JianQiao Sun, Yudi Su, Hao Zhang, Ziheng Cheng, Zequn Zeng, Zhengjue Wang, Bo Chen, Xin Yuan

To address these problems, in this paper, we propose a novel VC pipeline to generate captions directly from the compressed measurement, which can be captured by a snapshot compressive sensing camera and we dub our model SnapCap.

Compressive Sensing Video Captioning

Paper
Add Code

PatchCT: Aligning Patch Set and Label Set with Conditional Transport for Multi-Label Image Classification

1 code implementation • ICCV 2023 • Miaoge Li, Dongsheng Wang, Xinyang Liu, Zequn Zeng, Ruiying Lu, Bo Chen, Mingyuan Zhou

We find that by formulating the multi-label classification as a CT problem, we can exploit the interactions between the image and label efficiently by minimizing the bidirectional CT cost.

Multi-Label Classification Multi-Label Image Classification

Paper
Code

ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing

1 code implementation • CVPR 2023 • Zequn Zeng, Hao Zhang, Zhengjue Wang, Ruiying Lu, Dongsheng Wang, Bo Chen

Zero-shot capability has been considered as a new revolution of deep learning, letting machines work on tasks without curated training data.

Image Captioning Language Modelling

Paper
Code

Matching Visual Features to Hierarchical Semantic Topics for Image Paragraph Captioning

1 code implementation • 10 May 2021 • Dandan Guo, Ruiying Lu, Bo Chen, Zequn Zeng, Mingyuan Zhou

Inspired by recent successes in integrating semantic topics into this task, this paper develops a plug-and-play hierarchical-topic-guided image paragraph generation framework, which couples a visual extractor with a deep topic model to guide the learning of a language model.

Image Paragraph Captioning Language Modelling +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.