Search Results for author: Hanzi Mao

Found 5 papers, 4 papers with code

Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling

no code implementations4 Sep 2024 Kaiwen Zheng, Yongxin Chen, Hanzi Mao, Ming-Yu Liu, Jun Zhu, Qinsheng Zhang

Masked diffusion models (MDMs) have emerged as a popular research topic for generative modeling of discrete data, thanks to their superior performance over other discrete diffusion models, and are rivaling the auto-regressive models (ARMs) for language modeling tasks.

Language Modelling Text Generation

Exploring Plain Vision Transformer Backbones for Object Detection

10 code implementations30 Mar 2022 Yanghao Li, Hanzi Mao, Ross Girshick, Kaiming He

This design enables the original ViT architecture to be fine-tuned for object detection without needing to redesign a hierarchical backbone for pre-training.

Cross-Domain Few-Shot Object Detection Instance Segmentation +3

A ConvNet for the 2020s

49 code implementations CVPR 2022 Zhuang Liu, Hanzi Mao, Chao-yuan Wu, Christoph Feichtenhofer, Trevor Darrell, Saining Xie

The "Roaring 20s" of visual recognition began with the introduction of Vision Transformers (ViTs), which quickly superseded ConvNets as the state-of-the-art image classification model.

Classification Domain Generalization +3

Entailment as Few-Shot Learner

3 code implementations29 Apr 2021 Sinong Wang, Han Fang, Madian Khabsa, Hanzi Mao, Hao Ma

Large pre-trained language models (LMs) have demonstrated remarkable ability as few-shot learners.

Contrastive Learning Data Augmentation +8

Cannot find the paper you are looking for? You can Submit a new open access paper.