Search Results for author: Yunbo Wang

Found 35 papers, 22 papers with code

Vid2Act: Activate Offline Videos for Visual RL

no code implementations • 6 Jun 2023 • Minting Pan, Yitao Zheng, Wendong Zhang, Yunbo Wang, Xiaokang Yang

Pretraining RL models on offline video datasets is a promising way to improve their training efficiency in online tasks, but challenging due to the inherent mismatch in tasks, dynamics, and behaviors across domains.

Knowledge Distillation

Paper
Add Code

Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning

no code implementations • 24 May 2023 • Qi Wang, Junming Yang, Yunbo Wang, Xin Jin, Wenjun Zeng, Xiaokang Yang

Training offline reinforcement learning (RL) models using visual inputs poses two significant challenges, i. e., the overfitting problem in representation learning and the overestimation bias for expected future rewards.

Offline RL Reinforcement Learning (RL) +2

Paper
Add Code

DynaVol: Unsupervised Learning for Dynamic Scenes through Object-Centric Voxelization

no code implementations • 30 Apr 2023 • Yanpeng Zhao, Siyu Gao, Yunbo Wang, Xiaokang Yang

The voxel features and global features are complementary and are both leveraged by a compositional NeRF decoder for volume rendering.

Neural Rendering Novel View Synthesis +3

Paper
Add Code

Model-Based Reinforcement Learning with Isolated Imaginations

1 code implementation • 27 Mar 2023 • Minting Pan, Xiangming Zhu, Yitao Zheng, Yunbo Wang, Xiaokang Yang

On top of our previous work, we further consider the sparse dependencies between controllable and noncontrollable states, address the training collapse problem of state decoupling, and validate our approach in transfer learning setups.

Autonomous Driving Model-based Reinforcement Learning +3

Paper
Code

Predictive Experience Replay for Continual Visual Control and Forecasting

2 code implementations • 12 Mar 2023 • Wendong Zhang, Geng Chen, Xiangming Zhu, Siyu Gao, Yunbo Wang, Xiaokang Yang

In this paper, we present a new continual learning approach for visual dynamics modeling and explore its efficacy in visual control and forecasting.

Continual Learning Model-based Reinforcement Learning +2

Paper
Code

Improving Masked Autoencoders by Learning Where to Mask

no code implementations • 12 Mar 2023 • Haijian Chen, Wendong Zhang, Yunbo Wang, Xiaokang Yang

Masked image modeling is a promising self-supervised learning method for visual data.

Image Reconstruction Self-Supervised Learning

Paper
Add Code

Iso-Dream: Isolating and Leveraging Noncontrollable Visual Dynamics in World Models

2 code implementations • 27 May 2022 • Minting Pan, Xiangming Zhu, Yunbo Wang, Xiaokang Yang

First, by optimizing the inverse dynamics, we encourage the world model to learn controllable and noncontrollable sources of spatiotemporal changes on isolated state transition branches.

Autonomous Driving Decision Making

Paper
Code

MetaSets: Meta-Learning on Point Sets for Generalizable Representations

no code implementations • CVPR 2021 • Chao Huang, Zhangjie Cao, Yunbo Wang, Jianmin Wang, Mingsheng Long

It is a challenging problem due to the substantial geometry shift from simulated to real data, such that most existing 3D models underperform due to overfitting the complete geometries in the source domain.

Domain Generalization Meta-Learning

Paper
Add Code

Continual Predictive Learning from Videos

1 code implementation • CVPR 2022 • Geng Chen, Wendong Zhang, Han Lu, Siyu Gao, Yunbo Wang, Mingsheng Long, Xiaokang Yang

Can we develop predictive learning algorithms that can deal with more realistic, non-stationary physical environments?

Continual Learning Test-time Adaptation +1

Paper
Code

NeuroFluid: Fluid Dynamics Grounding with Particle-Driven Neural Radiance Fields

no code implementations • 3 Mar 2022 • Shanyan Guan, Huayu Deng, Yunbo Wang, Xiaokang Yang

Deep learning has shown great potential for modeling the physical dynamics of complex particle systems such as fluids.

Paper
Add Code

Fully Context-Aware Image Inpainting with a Learned Semantic Pyramid

1 code implementation • 8 Dec 2021 • Wendong Zhang, Yunbo Wang, Bingbing Ni, Xiaokang Yang

We train the prior learner and the image generator as a unified model without any post-processing.

Image Inpainting Variational Inference

Paper
Code

Learning Transferable Features for Point Cloud Detection via 3D Contrastive Co-training

no code implementations • NeurIPS 2021 • Zeng Yihan, Chunwei Wang, Yunbo Wang, Hang Xu, Chaoqiang Ye, Zhen Yang, Chao Ma

First, 3D-CoCo is inspired by our observation that the bird-eye-view (BEV) features are more transferable than low-level geometry features.

Cloud Detection Domain Adaptation

Paper
Add Code

Out-of-Domain Human Mesh Reconstruction via Dynamic Bilevel Online Adaptation

1 code implementation • 7 Nov 2021 • Shanyan Guan, Jingwei Xu, Michelle Z. He, Yunbo Wang, Bingbing Ni, Xiaokang Yang

We consider a new problem of adapting a human mesh reconstruction model to out-of-domain streaming videos, where performance of existing SMPL-based models are significantly affected by the distribution shift represented by different camera parameters, bone lengths, backgrounds, and occlusions.

Ranked #1 on 3D Absolute Human Pose Estimation on Surreal

3D Absolute Human Pose Estimation Bilevel Optimization

221

Paper
Code

ModeRNN: Harnessing Spatiotemporal Mode Collapse in Unsupervised Predictive Learning

1 code implementation • 8 Oct 2021 • Zhiyu Yao, Yunbo Wang, Haixu Wu, Jianmin Wang, Mingsheng Long

To this end, we propose ModeRNN, which introduces a novel method to learn structured hidden representations between recurrent states.

Inductive Bias

Paper
Code

Context-Aware Image Inpainting with Learned Semantic Priors

1 code implementation • 14 Jun 2021 • Wendong Zhang, Junwei Zhu, Ying Tai, Yunbo Wang, Wenqing Chu, Bingbing Ni, Chengjie Wang, Xiaokang Yang

Based on the semantic priors, we further propose a context-aware image inpainting model, which adaptively integrates global semantics and local features in a unified image generator.

Image Inpainting Knowledge Distillation

Paper
Code

MetaSets：Meta-Learning on Point Sets for Generalizable Representations

no code implementations • CVPR 2021 • Chao Huang, Zhangjie Cao, Yunbo Wang, Jianmin Wang, Mingsheng Long

Domain Generalization

Paper
Add Code

Bilevel Online Adaptation for Out-of-Domain Human Mesh Reconstruction

1 code implementation • CVPR 2021 • Shanyan Guan, Jingwei Xu, Yunbo Wang, Bingbing Ni, Xiaokang Yang

This paper considers a new problem of adapting a pre-trained model of human mesh reconstruction to out-of-domain streaming videos.

Ranked #39 on 3D Human Pose Estimation on 3DPW

3D Human Pose Estimation

Paper
Code

PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive Learning

3 code implementations • 17 Mar 2021 • Yunbo Wang, Haixu Wu, Jianjin Zhang, Zhifeng Gao, Jianmin Wang, Philip S. Yu, Mingsheng Long

This paper models these structures by presenting PredRNN, a new recurrent network, in which a pair of memory cells are explicitly decoupled, operate in nearly independent transition manners, and finally form unified representations of the complex environment.

Ranked #1 on Video Prediction on KTH (Cond metric)

Video Prediction Weather Forecasting

571

Paper
Code

Source Data-absent Unsupervised Domain Adaptation through Hypothesis Transfer and Labeling Transfer

2 code implementations • 14 Dec 2020 • Jian Liang, Dapeng Hu, Yunbo Wang, Ran He, Jiashi Feng

Furthermore, we propose a new labeling transfer strategy, which separates the target data into two splits based on the confidence of predictions (labeling information), and then employ semi-supervised learning to improve the accuracy of less-confident predictions in the target domain.

Classification General Classification +3

106

Paper
Code

Towards Good Practices of U-Net for Traffic Forecasting

1 code implementation • 4 Dec 2020 • Jingwei Xu, Jianjin Zhang, Zhiyu Yao, Yunbo Wang

This technical report presents a solution for the 2020 Traffic4Cast Challenge.

Paper
Code

Unsupervised Transfer Learning for Spatiotemporal Predictive Networks

1 code implementation • ICML 2020 • Zhiyu Yao, Yunbo Wang, Mingsheng Long, Jian-Min Wang

This paper explores a new research problem of unsupervised transfer learning across multiple spatiotemporal prediction tasks.

Transfer Learning

Paper
Code

Probabilistic Video Prediction From Noisy Data With a Posterior Confidence

no code implementations • CVPR 2020 • Yunbo Wang, Jiajun Wu, Mingsheng Long, Joshua B. Tenenbaum

It is also challenging because it involves two levels of uncertainty: the perceptual uncertainty from noisy observations and the dynamics uncertainty in forward modeling.

Video Prediction

Paper
Add Code

A Balanced and Uncertainty-aware Approach for Partial Domain Adaptation

1 code implementation • ECCV 2020 • Jian Liang, Yunbo Wang, Dapeng Hu, Ran He, Jiashi Feng

On one hand, negative transfer results in misclassification of target samples to the classes only present in the source domain.

Ranked #2 on Partial Domain Adaptation on ImageNet-Caltech

Partial Domain Adaptation Unsupervised Domain Adaptation

Paper
Code

VideoDG: Generalizing Temporal Relations in Videos to Novel Domains

1 code implementation • 8 Dec 2019 • Zhiyu Yao, Yunbo Wang, Jianmin Wang, Philip S. Yu, Mingsheng Long

This paper introduces video domain generalization where most video classification networks degenerate due to the lack of exposure to the target domains of divergent distributions.

Action Recognition Data Augmentation +5

Paper
Code

DualSMC: Tunneling Differentiable Filtering and Planning under Continuous POMDPs

1 code implementation • 28 Sep 2019 • Yunbo Wang, Bo Liu, Jiajun Wu, Yuke Zhu, Simon S. Du, Li Fei-Fei, Joshua B. Tenenbaum

A major difficulty of solving continuous POMDPs is to infer the multi-modal distribution of the unobserved true states and to make the planning algorithm dependent on the perceived uncertainty.

Continuous Control

Paper
Code

DELTA: A DEep learning based Language Technology plAtform

2 code implementations • 2 Aug 2019 • Kun Han, Junwen Chen, HUI ZHANG, Haiyang Xu, Yiping Peng, Yun Wang, Ning Ding, Hui Deng, Yonghu Gao, Tingwei Guo, Yi Zhang, Yahao He, Baochang Ma, Yu-Long Zhou, Kangli Zhang, Chao Liu, Ying Lyu, Chenxi Wang, Cheng Gong, Yunbo Wang, Wei Zou, Hui Song, Xiangang Li

In this paper we present DELTA, a deep learning based language technology platform.

Ranked #3 on Text Classification on Yahoo! Answers

Abstractive Text Summarization Intent Detection +9

1,584

Paper
Code

Z-Order Recurrent Neural Networks for Video Prediction

no code implementations • IEEE International Conference on Multimedia and Expo (ICME) 2019 • Jianjin Zhang, Yunbo Wang, Mingsheng Long, Wang Jianmin, Philip S Yu

First, we propose a new RNN architecture for modeling the deterministic dynamics, which updates hidden states along a z-order curve to enhance the consistency of the features of mirrored layers.

Ranked #1 on Video Prediction on KTH (Cond metric)

Video Prediction

Paper
Add Code

Eidetic 3D LSTM: A Model for Video Prediction and Beyond

3 code implementations • ICLR 2019 • Yunbo Wang, Lu Jiang, Ming-Hsuan Yang, Li-Jia Li, Mingsheng Long, Li Fei-Fei

We first evaluate the E3D-LSTM network on widely-used future video prediction datasets and achieve the state-of-the-art performance.

Ranked #1 on Video Prediction on KTH (Cond metric)

Activity Recognition Video Prediction +1

571

Paper
Code

Spatiotemporal Pyramid Network for Video Action Recognition

no code implementations • CVPR 2017 • Yunbo Wang, Mingsheng Long, Jian-Min Wang, Philip S. Yu

From the technical perspective, we introduce the spatiotemporal compact bilinear operator into video analysis tasks.

Action Recognition Temporal Action Localization

Paper
Add Code

Multi-Task Learning of Generalizable Representations for Video Action Recognition

no code implementations • 20 Nov 2018 • Zhiyu Yao, Yunbo Wang, Mingsheng Long, Jian-Min Wang, Philip S. Yu, Jiaguang Sun

Rev2Net is shown to be effective on the classic action recognition task.

Action Recognition Multi-Task Learning +4

Paper
Add Code

Memory In Memory: A Predictive Neural Network for Learning Higher-Order Non-Stationarity from Spatiotemporal Dynamics

4 code implementations • CVPR 2019 • Yunbo Wang, Jianjin Zhang, Hongyu Zhu, Mingsheng Long, Jian-Min Wang, Philip S. Yu

Natural spatiotemporal processes can be highly non-stationary in many ways, e. g. the low-level non-stationarity such as spatial correlations or temporal dependencies of local pixel values; and the high-level variations such as the accumulation, deformation or dissipation of radar echoes in precipitation forecasting.

Ranked #5 on Video Prediction on Human3.6M