no code implementations • 21 Nov 2024 • Ruiyuan Gao, Kai Chen, Bo Xiao, Lanqing Hong, Zhenguo Li, Qiang Xu
The rapid advancement of diffusion models has greatly improved video synthesis, especially in controllable video generation, which is essential for applications like autonomous driving.
no code implementations • 23 May 2024 • Ruiyuan Gao, Kai Chen, Zhihao LI, Lanqing Hong, Zhenguo Li, Qiang Xu
While controllable generative models for images and videos have achieved remarkable success, high-quality models for 3D scenes, particularly in unbounded scenarios like autonomous driving, remain underdeveloped due to high data acquisition costs.
no code implementations • 16 Apr 2024 • Kai Chen, Yanze Li, Wenhua Zhang, Yanxin Liu, Pengxiang Li, Ruiyuan Gao, Lanqing Hong, Meng Tian, Xinhai Zhao, Zhenguo Li, Dit-yan Yeung, Huchuan Lu, Xu Jia
Moreover, with our CODA-LM, we build CODA-VLM, a new driving LVLM surpassing all open-sourced counterparts on CODA-LM.
no code implementations • CVPR 2024 • Yibo Wang, Ruiyuan Gao, Kai Chen, Kaiqiang Zhou, Yingjie Cai, Lanqing Hong, Zhenguo Li, Lihui Jiang, Dit-yan Yeung, Qiang Xu, Kai Zhang
Furthermore, image syntheses from DetDiffusion can effectively augment training data, significantly enhancing downstream detection performance.
1 code implementation • 3 Mar 2024 • Yijun Yang, Ruiyuan Gao, Xiao Yang, Jianyuan Zhong, Qiang Xu
Recent advancements in Text-to-Image (T2I) models have raised significant safety concerns about their potential misuse for generating inappropriate or Not-Safe-For-Work (NSFW) contents, despite existing countermeasures such as NSFW classifiers or model fine-tuning for inappropriate concept removal.
1 code implementation • 1 Dec 2023 • Pengxiang Li, Kai Chen, Zhili Liu, Ruiyuan Gao, Lanqing Hong, Guo Zhou, Hua Yao, Dit-yan Yeung, Huchuan Lu, Xu Jia
Despite remarkable achievements in video synthesis, achieving granular control over complex dynamics, such as nuanced movement among multiple interacting objects, still presents a significant hurdle for dynamic world modeling, compounded by the necessity to manage appearance and disappearance, drastic scale changes, and ensure consistency for instances across frames.
no code implementations • 30 Nov 2023 • Ziyang Zheng, Ruiyuan Gao, Qiang Xu
In diffusion models, deviations from a straight generative flow are a common issue, resulting in semantic inconsistencies and suboptimal generations.
2 code implementations • CVPR 2024 • Yijun Yang, Ruiyuan Gao, Xiaosen Wang, Tsung-Yi Ho, Nan Xu, Qiang Xu
In recent years, Text-to-Image (T2I) models have seen remarkable advancements, gaining widespread adoption.
no code implementations • 4 Oct 2023 • Ruiyuan Gao, Kai Chen, Enze Xie, Lanqing Hong, Zhenguo Li, Dit-yan Yeung, Qiang Xu
Recent advancements in diffusion models have significantly enhanced the data synthesis with 2D control.
1 code implementation • ICCV 2023 • Ruiyuan Gao, Chenchen Zhao, Lanqing Hong, Qiang Xu
There is a recent work that directly applies it to OOD detection, which employs a conditional Generative Adversarial Network (cGAN) to enlarge semantic mismatch in the image space.
Generative Adversarial Network Out-of-Distribution Detection
1 code implementation • 31 Jul 2022 • Yijun Yang, Ruiyuan Gao, Qiang Xu
This paper proposes a novel out-of-distribution (OOD) detection framework named MoodCat for image classifiers.
Out-of-Distribution Detection Out of Distribution (OOD) Detection
1 code implementation • 16 Mar 2022 • Ailing Zeng, Xuan Ju, Lei Yang, Ruiyuan Gao, Xizhou Zhu, Bo Dai, Qiang Xu
This paper proposes a simple baseline framework for video-based 2D/3D human pose estimation that can achieve 10 times efficiency improvement over existing works without any performance degradation, named DeciWatch.
Ranked #1 on 2D Human Pose Estimation on JHMDB (2D poses only)
1 code implementation • 24 Jan 2022 • Yijun Yang, Ruiyuan Gao, Yu Li, Qiuxia Lai, Qiang Xu
For legitimate inputs that are correctly inferred, the synthetic output tries to reconstruct the input.
no code implementations • ICLR 2022 • Minhao Liu, Ailing Zeng, Qiuxia Lai, Ruiyuan Gao, Min Li, Jing Qin, Qiang Xu
In this work, we propose a novel tree-structured wavelet neural network for time series signal analysis, namely T-WaveNet, by taking advantage of an inherent property of various types of signals, known as the dominant frequency range.
no code implementations • 30 May 2021 • Ailing Zeng, Minhao Liu, Zhiwei Liu, Ruiyuan Gao, Jing Qin, Qiang Xu
We propose a novel solution to addressing a long-standing dilemma in the representation learning of graph neural networks (GNNs): how to effectively capture and represent useful information embedded in long-distance nodes to improve the performance of nodes with low homophily without leading to performance degradation in nodes with high homophily.
no code implementations • 20 Apr 2021 • Yijun Yang, Ruiyuan Gao, Yu Li, Qiuxia Lai, Qiang Xu
Consequently, we propose a novel learning-based solution to model such contradictions for AE detection.
no code implementations • 10 Apr 2020 • Yaran Chen, Ruiyuan Gao, Fenggang Liu, Dongbin Zhao
Unlike previous search algorithms, and benefiting from inherited knowledge, our method is able to directly search for architectures in the macro space by NSGA-II algorithm without tuning parameters in these \textit{module}s. Experiments show that our strategy can efficiently evaluate the performance of new architecture even without tuning weights in convolutional layers.
no code implementations • 31 Dec 2019 • Ruiyuan Gao, Ming Dun, Hailong Yang, Zhongzhi Luan, Depei Qian
Existing research works rely on metrics that are either impractical or insufficient to measure the effectiveness of privacy protection methods in the above scenario, especially from the aspect of a single user.