Search Results for author: Ruiyuan Gao

Found 18 papers, 7 papers with code

MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control

no code implementations 21 Nov 2024 Ruiyuan Gao, Kai Chen, Bo Xiao, Lanqing Hong, Zhenguo Li, Qiang Xu

The rapid advancement of diffusion models has greatly improved video synthesis, especially in controllable video generation, which is essential for applications like autonomous driving.

Autonomous Driving Video Generation

MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes

no code implementations 23 May 2024 Ruiyuan Gao, Kai Chen, Zhihao LI, Lanqing Hong, Zhenguo Li, Qiang Xu

While controllable generative models for images and videos have achieved remarkable success, high-quality models for 3D scenes, particularly in unbounded scenarios like autonomous driving, remain underdeveloped due to high data acquisition costs.

3D Generation, Autonomous Driving, +2

GuardT2I: Defending Text-to-Image Models from Adversarial Prompts

1 code implementation 3 Mar 2024 Yijun Yang, Ruiyuan Gao, Xiao Yang, Jianyuan Zhong, Qiang Xu

Recent advancements in Text-to-Image (T2I) models have raised significant safety concerns about their potential misuse for generating inappropriate or Not-Safe-For-Work (NSFW) content, despite existing countermeasures such as NSFW classifiers or model fine-tuning for inappropriate concept removal.

Binary Classification, Language Modeling, +2

TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion Models

1 code implementation 1 Dec 2023 Pengxiang Li, Kai Chen, Zhili Liu, Ruiyuan Gao, Lanqing Hong, Guo Zhou, Hua Yao, Dit-yan Yeung, Huchuan Lu, Xu Jia

Despite remarkable achievements in video synthesis, achieving granular control over complex dynamics, such as nuanced movement among multiple interacting objects, still presents a significant hurdle for dynamic world modeling, compounded by the need to manage appearance and disappearance, handle drastic scale changes, and ensure consistency for instances across frames.

Image Classification, Multi-Object Tracking, +4

Non-Cross Diffusion for Semantic Consistency

no code implementations 30 Nov 2023 Ziyang Zheng, Ruiyuan Gao, Qiang Xu

In diffusion models, deviations from a straight generative flow are a common issue, resulting in semantic inconsistencies and suboptimal generations.

MMA-Diffusion: MultiModal Attack on Diffusion Models

2 code implementations CVPR 2024 Yijun Yang, Ruiyuan Gao, Xiaosen Wang, Tsung-Yi Ho, Nan Xu, Qiang Xu

In recent years, Text-to-Image (T2I) models have seen remarkable advancements, gaining widespread adoption.

DiffGuard: Semantic Mismatch-Guided Out-of-Distribution Detection using Pre-trained Diffusion Models

1 code implementation ICCV 2023 Ruiyuan Gao, Chenchen Zhao, Lanqing Hong, Qiang Xu

A recent work applies semantic mismatch directly to OOD detection, employing a conditional Generative Adversarial Network (cGAN) to enlarge the semantic mismatch in image space.

Generative Adversarial Network, Out-of-Distribution Detection

Out-of-Distribution Detection with Semantic Mismatch under Masking

1 code implementation 31 Jul 2022 Yijun Yang, Ruiyuan Gao, Qiang Xu

This paper proposes a novel out-of-distribution (OOD) detection framework named MoodCat for image classifiers.

Out-of-Distribution Detection, Out of Distribution (OOD) Detection

DeciWatch: A Simple Baseline for 10x Efficient 2D and 3D Pose Estimation

1 code implementation 16 Mar 2022 Ailing Zeng, Xuan Ju, Lei Yang, Ruiyuan Gao, Xizhou Zhu, Bo Dai, Qiang Xu

This paper proposes DeciWatch, a simple baseline framework for video-based 2D/3D human pose estimation that achieves a 10x efficiency improvement over existing works without any performance degradation.

2D Human Pose Estimation, 3D Human Pose Estimation, +2

T-WaveNet: A Tree-Structured Wavelet Neural Network for Time Series Signal Analysis

no code implementations ICLR 2022 Minhao Liu, Ailing Zeng, Qiuxia Lai, Ruiyuan Gao, Min Li, Jing Qin, Qiang Xu

In this work, we propose a novel tree-structured wavelet neural network for time series signal analysis, namely T-WaveNet, by taking advantage of an inherent property of various types of signals, known as the dominant frequency range.

Activity Recognition, Representation Learning, +3

Relational Graph Neural Network Design via Progressive Neural Architecture Search

no code implementations 30 May 2021 Ailing Zeng, Minhao Liu, Zhiwei Liu, Ruiyuan Gao, Jing Qin, Qiang Xu

We propose a novel solution to a long-standing dilemma in the representation learning of graph neural networks (GNNs): how to effectively capture and represent useful information embedded in long-distance nodes to improve the performance of nodes with low homophily without degrading performance on nodes with high homophily.

Graph Neural Network, Neural Architecture Search, +2

ModuleNet: Knowledge-inherited Neural Architecture Search

no code implementations 10 Apr 2020 Yaran Chen, Ruiyuan Gao, Fenggang Liu, Dongbin Zhao

Unlike previous search algorithms, and benefiting from inherited knowledge, our method is able to directly search for architectures in the macro space using the NSGA-II algorithm without tuning parameters in these modules. Experiments show that our strategy can efficiently evaluate the performance of new architectures even without tuning weights in convolutional layers.

Neural Architecture Search

Privacy for Rescue: A New Testimony Why Privacy is Vulnerable In Deep Models

no code implementations 31 Dec 2019 Ruiyuan Gao, Ming Dun, Hailong Yang, Zhongzhi Luan, Depei Qian

Existing research works rely on metrics that are either impractical or insufficient to measure the effectiveness of privacy protection methods in the above scenario, especially from the aspect of a single user.
