no code implementations • 30 Oct 2024 • Shuai Wang, Zexian Li, Tianhui Song, Xubin Li, Tiezheng Ge, Bo Zheng, LiMin Wang
Arbitrary-resolution image generation still remains a challenging task in AIGC, as it requires handling varying resolutions and aspect ratios while maintaining high visual quality.
no code implementations • arXiv 2022 • Qiang Chen, Jian Wang, Chuchu Han, Shan Zhang, Zexian Li, Xiaokang Chen, Jiahui Chen, Xiaodi Wang, Shuming Han, Gang Zhang, Haocheng Feng, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang
The training process consists of self-supervised pretraining and finetuning a ViT-Huge encoder on ImageNet-1K, pretraining the detector on Object365, and finally finetuning it on COCO.
Ranked #8 on Object Detection on COCO test-dev (using extra training data)
1 code implementation • 6 Sep 2022 • Zhendong Yang, Zhe Li, Ailing Zeng, Zexian Li, Chun Yuan, Yu Li
In this paper, we explore the way of feature-based distillation for ViT.
no code implementations • 19 Aug 2022 • Pan Xie, Qipeng Zhang, Taiyi Peng, Hao Tang, Yao Du, Zexian Li
Our approach focuses on the transformation of sign gloss sequences into their corresponding sign pose sequences (G2P).
1 code implementation • ICCV 2021 • Ke Yu, Zexian Li, Yue Peng, Chen Change Loy, Jinwei Gu
Image Signal Processor (ISP) is a crucial component in digital cameras that transforms sensor signals into images for us to perceive and understand.
no code implementations • 19 Aug 2021 • Pan Xie, Zexian Li, Xiaohui Hu
Conditional masked language models (CMLM) have shown impressive progress in non-autoregressive machine translation (NAT).