Search Results for author: Xiaoping Wu

Found 4 papers, 2 papers with code

RDPM: Solve Diffusion Probabilistic Models via Recurrent Token Prediction

no code implementations • 24 Dec 2024 • Xiaoping Wu, Jie Hu, Xiaoming Wei

By progressively introducing Gaussian noise into the latent representations of images and encoding them into vector-quantized tokens in a recurrent manner, RDPM facilitates a unique diffusion process on discrete-value domains.

Image Generation, multimodal generation, +2

Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input

no code implementations • 28 Aug 2024 • Jiajun Liu, Yibing Wang, Hanghang Ma, Xiaoping Wu, Xiaoqi Ma, Xiaoming Wei, Jianbin Jiao, Enhua Wu, Jie Hu

In particular, on benchmarks specialized for long videos, Kangaroo outperforms some larger models with over 10B parameters as well as some proprietary models.

Language Modeling, Language Modelling, +1

IP102: A Large-Scale Benchmark Dataset for Insect Pest Recognition

1 code implementation • CVPR 2019 • Xiaoping Wu, Chi Zhan, Yu-Kun Lai, Ming-Ming Cheng, Jufeng Yang

IP102 has a hierarchical taxonomy: insect pests that mainly affect one specific agricultural product are grouped into the same upper-level category.

Ranked #1 on Object Detection on IP102 (using extra training data)

Classification, Fine-Grained Image Classification, +3
