2 code implementations • 18 Nov 2024 • Xibo Sun, Jiarui Fang, Aoyu Li, Jinzhe Pan
Our experimental analysis reveals substantial variations in the distribution of redundancy across diffusion steps among different DiT models.
1 code implementation • 4 Nov 2024 • Jiarui Fang, Jinzhe Pan, Xibo Sun, Aoyu Li, Jiannan Wang
Parallel inference is essential for real-time DiT deployments, but relying on a single parallel method is impractical due to poor scalability at large scales.
2 code implementations • 23 May 2024 • Jiarui Fang, Jinzhe Pan, Jiannan Wang, Aoyu Li, Xibo Sun
This paper presents PipeFusion, an innovative parallel methodology to tackle the high-latency issues associated with generating high-resolution images using diffusion transformer (DiT) models.
no code implementations • 8 Feb 2024 • QiPeng Wang, Shiqi Jiang, Zhenpeng Chen, Xu Cao, Yuanchun Li, Aoyu Li, Yun Ma, Ting Cao, Xuanzhe Liu
The gap on mobile CPU and mobile GPU is 15.8 times and 7.8 times, respectively.
1 code implementation • 26 Jan 2023 • Haotong Qin, Mingyuan Zhang, Yifu Ding, Aoyu Li, Zhongang Cai, Ziwei Liu, Fisher Yu, Xianglong Liu
Network binarization emerges as one of the most promising compression approaches offering extraordinary computation and memory savings by minimizing the bit-width.
no code implementations • 18 Nov 2022 • Aoyu Li, Ikuro Sato, Kohta Ishikawa, Rei Kawakami, Rio Yokota
Among various supervised deep metric learning methods, proxy-based approaches have achieved high retrieval accuracies.