Search Results for author: Jiawei Ren

Found 25 papers, 13 papers with code

Move Anything with Layered Scene Diffusion

no code implementations • 10 Apr 2024 • Jiawei Ren, Mengmeng Xu, Jui-Chieh Wu, Ziwei Liu, Tao Xiang, Antoine Toisoul

Diffusion models generate images with an unprecedented level of quality, but how can we freely rearrange image layouts?

Paper
Add Code

InsActor: Instruction-driven Physics-based Characters

no code implementations • NeurIPS 2023 • Jiawei Ren, Mingyuan Zhang, Cunjun Yu, Xiao Ma, Liang Pan, Ziwei Liu

Generating animation of physics-based characters with intuitive control has long been a desirable task with numerous applications.

Motion Planning

Paper
Add Code

DreamGaussian4D: Generative 4D Gaussian Splatting

1 code implementation • 28 Dec 2023 • Jiawei Ren, Liang Pan, Jiaxiang Tang, Chi Zhang, Ang Cao, Gang Zeng, Ziwei Liu

Remarkable progress has been made in 4D content generation recently.

392

Paper
Code

FineMoGen: Fine-Grained Spatio-Temporal Motion Generation and Editing

1 code implementation • NeurIPS 2023 • Mingyuan Zhang, Huirong Li, Zhongang Cai, Jiawei Ren, Lei Yang, Ziwei Liu

Notably, FineMoGen further enables zero-shot motion editing capabilities with the aid of modern large language models (LLM), which faithfully manipulates motion sequences with fine-grained instructions.

Ranked #2 on Motion Synthesis on KIT Motion-Language

Motion Synthesis

Paper
Code

GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation

no code implementations • 7 Dec 2023 • Shoufa Chen, Mengmeng Xu, Jiawei Ren, Yuren Cong, Sen He, Yanping Xie, Animesh Sinha, Ping Luo, Tao Xiang, Juan-Manuel Perez-Rua

In this study, we explore Transformer-based diffusion models for image and video generation.

Text-to-Video Generation Video Generation

Paper
Add Code

FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing

no code implementations • 9 Oct 2023 • Yuren Cong, Mengmeng Xu, Christian Simon, Shoufa Chen, Jiawei Ren, Yanping Xie, Juan-Manuel Perez-Rua, Bodo Rosenhahn, Tao Xiang, Sen He

In this paper, for the first time, we introduce optical flow into the attention module in the diffusion model's U-Net to address the inconsistency issue for text-to-video editing.

Optical Flow Estimation Text-to-Video Editing +1

Paper
Add Code

DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation

1 code implementation • 28 Sep 2023 • Jiaxiang Tang, Jiawei Ren, Hang Zhou, Ziwei Liu, Gang Zeng

In contrast to the occupancy pruning used in Neural Radiance Fields, we demonstrate that the progressive densification of 3D Gaussians converges significantly faster for 3D generative tasks.

3D Generation

3,596

Paper
Code

Underwater-Art: Expanding Information Perspectives With Text Templates For Underwater Acoustic Target Recognition

no code implementations • 31 May 2023 • Yuan Xie, Jiawei Ren, Ji Xu

In our work, we propose to implement Underwater Acoustic Recognition based on Templates made up of rich relevant information (hereinafter called "UART").

Contrastive Learning Descriptive

Paper
Add Code

Adaptive ship-radiated noise recognition with learnable fine-grained wavelet transform

no code implementations • 31 May 2023 • Yuan Xie, Jiawei Ren, Ji Xu

Background noise and variable channel transmission environment make it complicated to implement accurate ship-radiated noise recognition.

Transfer Learning

Paper
Add Code

RoboBEV: Towards Robust Bird's Eye View Perception under Corruptions

1 code implementation • 13 Apr 2023 • Shaoyuan Xie, Lingdong Kong, Wenwei Zhang, Jiawei Ren, Liang Pan, Kai Chen, Ziwei Liu

Our experiments further demonstrate that pre-training and depth-free BEV transformation has the potential to enhance out-of-distribution robustness.

Robust Camera Only 3D Object Detection

284

Paper
Code

DiffMimic: Efficient Motion Mimicking with Differentiable Physics

2 code implementations • 6 Apr 2023 • Jiawei Ren, Cunjun Yu, Siwei Chen, Xiao Ma, Liang Pan, Ziwei Liu

Motion mimicking is a foundational task in physics-based character animation.

reinforcement-learning Reinforcement Learning (RL)

257

Paper
Code

Robo3D: Towards Robust and Reliable 3D Perception against Corruptions

1 code implementation • ICCV 2023 • Lingdong Kong, Youquan Liu, Xin Li, Runnan Chen, Wenwei Zhang, Jiawei Ren, Liang Pan, Kai Chen, Ziwei Liu

The robustness of 3D perception systems under natural corruptions from environments and sensors is pivotal for safety-critical applications.

Robust 3D Object Detection Robust 3D Semantic Segmentation

270

Paper
Code

OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation

1 code implementation • CVPR 2023 • Tong Wu, Jiarui Zhang, Xiao Fu, Yuxin Wang, Jiawei Ren, Liang Pan, Wayne Wu, Lei Yang, Jiaqi Wang, Chen Qian, Dahua Lin, Ziwei Liu

Recent advances in modeling 3D objects mostly rely on synthetic datasets due to the lack of large-scale realscanned 3D databases.

Novel View Synthesis Object +1

416

Paper
Code

LaserMix for Semi-Supervised LiDAR Semantic Segmentation

2 code implementations • CVPR 2023 • Lingdong Kong, Jiawei Ren, Liang Pan, Ziwei Liu

Densely annotating LiDAR point clouds is costly, which restrains the scalability of fully-supervised learning methods.

Ranked #1 on Semi-Supervised Semantic Segmentation on ScribbleKITTI

LIDAR Semantic Segmentation Segmentation +1

255

Paper
Code

Sparse Mixture-of-Experts are Domain Generalizable Learners

1 code implementation • 8 Jun 2022 • Bo Li, Yifei Shen, Jingkang Yang, Yezhen Wang, Jiawei Ren, Tong Che, Jun Zhang, Ziwei Liu

It is motivated by an empirical finding that transformer-based models trained with empirical risk minimization (ERM) outperform CNN-based models employing state-of-the-art (SOTA) DG algorithms on multiple DG datasets.

Ranked #11 on Domain Generalization on DomainNet (using extra training data)

Domain Generalization Object Recognition

279

Paper
Code

Balanced MSE for Imbalanced Visual Regression

1 code implementation • CVPR 2022 • Jiawei Ren, Mingyuan Zhang, Cunjun Yu, Ziwei Liu

Data imbalance exists ubiquitously in real-world visual regressions, e. g., age estimation and pose estimation, hurting the model's generalizability and fairness.

Age Estimation Fairness +3

349

Paper
Code

Benchmarking and Analyzing Point Cloud Classification under Corruptions

4 code implementations • 7 Feb 2022 • Jiawei Ren, Liang Pan, Ziwei Liu

3D perception, especially point cloud classification, has achieved substantial progress.

Ranked #7 on Point Cloud Classification on PointCloud-C

Benchmarking Classification +1

162

Paper
Code

Playing for 3D Human Recovery

no code implementations • 14 Oct 2021 • Zhongang Cai, Mingyuan Zhang, Jiawei Ren, Chen Wei, Daxuan Ren, Zhengyu Lin, Haiyu Zhao, Lei Yang, Chen Change Loy, Ziwei Liu

Specifically, we contribute GTA-Human, a large-scale 3D human dataset generated with the GTA-V game engine, featuring a highly diverse set of subjects, actions, and scenarios.

Paper
Add Code

Bayesian Imbalanced Regression Debiasing

no code implementations • 29 Sep 2021 • Jiawei Ren, Mingyuan Zhang, Cunjun Yu, Ziwei Liu

Compared to imbalanced and long-tailed classification, imbalanced regression has its unique challenges as the regression label space can be continuous, boundless, and high-dimensional.

Age Estimation imbalanced classification +2

Paper
Add Code

HAVANA: Hierarchical and Variation-Normalized Autoencoder for Person Re-identification

no code implementations • 6 Jan 2021 • Jiawei Ren, Xiao Ma, Chen Xu, Haiyu Zhao, Shuai Yi

Person Re-Identification (Re-ID) is of great importance to the many video surveillance systems.

Person Re-Identification

Paper
Add Code

REFINE: Prediction Fusion Network for Panoptic Segmentation

no code implementations • 15 Dec 2020 • Jiawei Ren, Cunjun Yu, Zhongang Cai, Mingyuan Zhang, Chongsong Chen, Haiyu Zhao, Shuai Yi, Hongsheng Li

Panoptic segmentation aims at generating pixel-wise class and instance predictions for each pixel in the input image, which is a challenging task and far more complicated than naively fusing the semantic and instance segmentation results.

Ranked #11 on Panoptic Segmentation on COCO test-dev

Instance Segmentation Panoptic Segmentation +1

Paper
Add Code

Balanced Activation for Long-tailed Visual Recognition

no code implementations • 24 Aug 2020 • Jiawei Ren, Cunjun Yu, Zhongang Cai, Haiyu Zhao

Deep classifiers have achieved great success in visual recognition.

object-detection Object Detection +1

Paper
Add Code

Leveraging Localization for Multi-camera Association

no code implementations • 7 Aug 2020 • Zhongang Cai, Cunjun Yu, Junzhe Zhang, Jiawei Ren, Haiyu Zhao

We present McAssoc, a deep learning approach to the as-sociation of detection bounding boxes in different views ofa multi-camera system.

Paper
Add Code

Balanced Meta-Softmax for Long-Tailed Visual Recognition

1 code implementation • NeurIPS 2020 • Jiawei Ren, Cunjun Yu, Shunan Sheng, Xiao Ma, Haiyu Zhao, Shuai Yi, Hongsheng Li

In our experiments, we demonstrate that Balanced Meta-Softmax outperforms state-of-the-art long-tailed classification solutions on both visual recognition and instance segmentation tasks.

Ranked #7 on Long-tail Learning on CIFAR-10-LT (ρ=10)

General Classification Instance Segmentation +2

Paper
Code

Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory Prediction

1 code implementation • ECCV 2020 • Cunjun Yu, Xiao Ma, Jiawei Ren, Haiyu Zhao, Shuai Yi

In this paper, we present STAR, a Spatio-Temporal grAph tRansformer framework, which tackles trajectory prediction by only attention mechanisms.

Autonomous Driving Pedestrian Trajectory Prediction +1

330

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.