Search Results for author: Ruihan Yang

Found 28 papers, 14 papers with code

Visual Whole-Body Control for Legged Loco-Manipulation

no code implementations • 25 Mar 2024 • Minghuan Liu, Zixuan Chen, Xuxin Cheng, Yandong Ji, Rizhao Qiu, Ruihan Yang, Xiaolong Wang

That is, the robot can control the legs and the arm at the same time to extend its workspace.

Position

Paper
Add Code

Learning Generalizable Feature Fields for Mobile Manipulation

no code implementations • 12 Mar 2024 • Ri-Zhao Qiu, Yafei Hu, Ge Yang, Yuchen Song, Yang Fu, Jianglong Ye, Jiteng Mu, Ruihan Yang, Nikolay Atanasov, Sebastian Scherer, Xiaolong Wang

An open problem in mobile manipulation is how to represent objects and scenes in a unified manner, so that robots can use it both for navigating in the environment and manipulating objects.

Novel View Synthesis

Paper
Add Code

Expressive Whole-Body Control for Humanoid Robots

no code implementations • 26 Feb 2024 • Xuxin Cheng, Yandong Ji, Junming Chen, Ruihan Yang, Ge Yang, Xiaolong Wang

Can we enable humanoid robots to generate rich, diverse, and expressive motions in the real world?

Imitation Learning

Paper
Add Code

GumbelSoft: Diversified Language Model Watermarking via the GumbelMax-trick

1 code implementation • 20 Feb 2024 • Jiayi Fu, Xuandong Zhao, Ruihan Yang, Yuansen Zhang, Jiangjie Chen, Yanghua Xiao

Large language models (LLMs) excellently generate human-like text, but also raise concerns about misuse in fake news and academic dishonesty.

Language Modelling

Paper
Code

Precipitation Downscaling with Spatiotemporal Video Diffusion

no code implementations • 11 Dec 2023 • Prakhar Srivastava, Ruihan Yang, Gavin Kerrigan, Gideon Dresdner, Jeremy McGibbon, Christopher Bretherton, Stephan Mandt

In climate science and meteorology, high-resolution local precipitation (rain and snowfall) predictions are limited by the computational costs of simulation-based methods.

Optical Flow Estimation Super-Resolution

Paper
Add Code

Harmonic Mobile Manipulation

no code implementations • 11 Dec 2023 • Ruihan Yang, Yejin Kim, Aniruddha Kembhavi, Xiaolong Wang, Kiana Ehsani

Recent advancements in robotics have enabled robots to navigate complex scenes or manipulate diverse objects independently.

Navigate

Paper
Add Code

CMMD: Contrastive Multi-Modal Diffusion for Video-Audio Conditional Modeling

no code implementations • 8 Dec 2023 • Ruihan Yang, Hannes Gamper, Sebastian Braun

We introduce a multi-modal diffusion model tailored for the bi-directional conditional generation of video and audio.

Audio Generation

Paper
Add Code

Generalized Animal Imitator: Agile Locomotion with Versatile Motion Prior

no code implementations • 2 Oct 2023 • Ruihan Yang, Zhuoqun Chen, Jianhan Ma, Chongyi Zheng, Yiyu Chen, Quan Nguyen, Xiaolong Wang

This paper introduces the Versatile Instructable Motion prior (VIM) - a Reinforcement Learning framework designed to incorporate a range of agile locomotion tasks suitable for advanced robotic applications.

Paper
Add Code

Neural Volumetric Memory for Visual Locomotion Control

no code implementations • CVPR 2023 • Ruihan Yang, Ge Yang, Xiaolong Wang

To solve this problem, we follow the paradigm in computer vision that explicitly models the 3D geometry of the scene and propose Neural Volumetric Memory (NVM), a geometric memory architecture that explicitly accounts for the SE(3) equivariance of the 3D world.

Paper
Add Code

Lossy Image Compression with Conditional Diffusion Models

1 code implementation • NeurIPS 2023 • Ruihan Yang, Stephan Mandt

This paper outlines an end-to-end optimized lossy image compression framework using diffusion generative models.

Image Compression Image Quality Assessment

Paper
Code

SC2 Benchmark: Supervised Compression for Split Computing

1 code implementation • 16 Mar 2022 • Yoshitomo Matsubara, Ruihan Yang, Marco Levorato, Stephan Mandt

With the increasing demand for deep learning models on mobile devices, splitting neural network computation between the device and a more powerful edge server has become an attractive solution.

Data Compression Edge-computing +2

Paper
Code

Diffusion Probabilistic Modeling for Video Generation

1 code implementation • 16 Mar 2022 • Ruihan Yang, Prakhar Srivastava, Stephan Mandt

Denoising diffusion probabilistic models are a promising new class of generative models that mark a milestone in high-quality image generation.

Denoising Image Generation +2

Paper
Code

Vision-Guided Quadrupedal Locomotion in the Wild with Multi-Modal Delay Randomization

1 code implementation • 29 Sep 2021 • Chieko Sarah Imai, Minghao Zhang, Yuchen Zhang, Marcin Kierebinski, Ruihan Yang, Yuzhe Qin, Xiaolong Wang

While Reinforcement Learning (RL) provides a promising paradigm for agile locomotion skills with vision inputs in simulation, it is still very challenging to deploy the RL policy in the real world.

Reinforcement Learning (RL)

196

Paper
Code

Supervised Compression for Resource-Constrained Edge Computing Systems

2 code implementations • 21 Aug 2021 • Yoshitomo Matsubara, Ruihan Yang, Marco Levorato, Stephan Mandt

There has been much interest in deploying deep learning algorithms on low-powered devices, including smartphones, drones, and medical sensors.

Data Compression Edge-computing +2

Paper
Code

DexMV: Imitation Learning for Dexterous Manipulation from Human Videos

1 code implementation • 12 Aug 2021 • Yuzhe Qin, Yueh-Hua Wu, Shaowei Liu, Hanwen Jiang, Ruihan Yang, Yang Fu, Xiaolong Wang

While significant progress has been made on understanding hand-object interactions in computer vision, it is still very challenging for robots to perform complex dexterous manipulation.

Imitation Learning motion retargeting +1

Paper
Code

Insights from Generative Modeling for Neural Video Compression

1 code implementation • 28 Jul 2021 • Ruihan Yang, Yibo Yang, Joseph Marino, Stephan Mandt

While recent machine learning research has revealed connections between deep generative models such as VAEs and rate-distortion losses used in learned compression, most of this work has focused on images.

Video Compression

Paper
Code

Learning Vision-Guided Quadrupedal Locomotion End-to-End with Cross-Modal Transformers

1 code implementation • ICLR 2022 • Ruihan Yang, Minghao Zhang, Nicklas Hansen, Huazhe Xu, Xiaolong Wang

Our key insight is that proprioceptive states only offer contact measurements for immediate reaction, whereas an agent equipped with visual sensory observations can learn to proactively maneuver environments with obstacles and uneven terrain by anticipating changes in the environment many steps ahead.

Reinforcement Learning (RL)

196

Paper
Code

SCALE SPACE FLOW WITH AUTOREGRESSIVE PRIORS

no code implementations • ICLR Workshop Neural_Compression 2021 • Ruihan Yang, Yibo Yang, Joseph Marino, Stephan Mandt

There has been a recent surge of interest in neural video compression models that combines data-driven dimensionality reduction with learned entropy coding.

Dimensionality Reduction Open-Ended Question Answering +1

Paper
Add Code

Generative Video Compression as Hierarchical Variational Inference

no code implementations • pproximateinference AABI Symposium 2021 • Ruihan Yang, Yibo Yang, Joseph Marino, Stephan Mandt

Recent work by Marino et al. (2020) showed improved performance in sequential density estimation by combining masked autoregressive flows with hierarchical latent variable models.

Density Estimation Variational Inference +1

Paper
Add Code

Hierarchical Autoregressive Modeling for Neural Video Compression

3 code implementations • ICLR 2021 • Ruihan Yang, Yibo Yang, Joseph Marino, Stephan Mandt

Recent work by Marino et al. (2020) showed improved performance in sequential density estimation by combining masked autoregressive flows with hierarchical latent variable models.

Density Estimation Video Compression

Paper
Code

PIANOTREE VAE: Structured Representation Learning for Polyphonic Music

2 code implementations • 17 Aug 2020 • Ziyu Wang, Yiyi Zhang, Yixiao Zhang, Junyan Jiang, Ruihan Yang, Junbo Zhao, Gus Xia

The dominant approach for music representation learning involves the deep unsupervised model family variational autoencoder (VAE).

Music Generation Representation Learning

Paper
Code

Multi-Task Reinforcement Learning with Soft Modularization

1 code implementation • NeurIPS 2020 • Ruihan Yang, Huazhe Xu, Yi Wu, Xiaolong Wang

While training multiple tasks jointly allow the policies to share parameters across different tasks, the optimization problem becomes non-trivial: It remains unclear what parameters in the network should be reused across tasks, and how the gradients from different tasks may interfere with each other.

Ranked #1 on Meta-Learning on MT50

Meta-Learning Multi-Task Learning +2

100

Paper
Code

Suphx: Mastering Mahjong with Deep Reinforcement Learning

no code implementations • 30 Mar 2020 • Junjie Li, Sotetsu Koyamada, Qiwei Ye, Guoqing Liu, Chao Wang, Ruihan Yang, Li Zhao, Tao Qin, Tie-Yan Liu, Hsiao-Wuen Hon

Artificial Intelligence (AI) has achieved great success in many domains, and game AI is widely regarded as its beachhead since the dawn of AI.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Deep Music Analogy Via Latent Representation Disentanglement

3 code implementations • 9 Jun 2019 • Ruihan Yang, Dingsu Wang, Ziyu Wang, Tianyao Chen, Junyan Jiang, Gus Xia

Analogy-making is a key method for computer algorithms to generate both natural and creative music pieces.

Disentanglement

Paper
Code

Learning Efficient and Effective Exploration Policies with Counterfactual Meta Policy

no code implementations • 28 May 2019 • Ruihan Yang, Qiwei Ye, Tie-Yan Liu

Based on that, We proposed an end-to-end algorithm to learn exploration policy by meta-learning.

counterfactual Efficient Exploration +1

Paper
Add Code

Inspecting and Interacting with Meaningful Music Representations using VAE

no code implementations • 18 Apr 2019 • Ruihan Yang, Tianyao Chen, Yiyi Zhang, Gus Xia

Variational Autoencoders(VAEs) have already achieved great results on image generation and recently made promising progress on music generation.

Disentanglement Image Generation +1

Paper
Add Code

Artificial Intelligence for Prosthetics - challenge solutions

1 code implementation • 7 Feb 2019 • Łukasz Kidziński, Carmichael Ong, Sharada Prasanna Mohanty, Jennifer Hicks, Sean F. Carroll, Bo Zhou, Hongsheng Zeng, Fan Wang, Rongzhong Lian, Hao Tian, Wojciech Jaśkowski, Garrett Andersen, Odd Rune Lykkebø, Nihat Engin Toklu, Pranav Shyam, Rupesh Kumar Srivastava, Sergey Kolesnikov, Oleksii Hrinchuk, Anton Pechenko, Mattias Ljungström, Zhen Wang, Xu Hu, Zehong Hu, Minghui Qiu, Jun Huang, Aleksei Shpilman, Ivan Sosin, Oleg Svidchenko, Aleksandra Malysheva, Daniel Kudenko, Lance Rane, Aditya Bhatt, Zhengfei Wang, Penghui Qi, Zeyang Yu, Peng Peng, Quan Yuan, Wenxin Li, Yunsheng Tian, Ruihan Yang, Pingchuan Ma, Shauharda Khadka, Somdeb Majumdar, Zach Dwiel, Yinyin Liu, Evren Tumer, Jeremy Watson, Marcel Salathé, Sergey Levine, Scott Delp

In the NeurIPS 2018 Artificial Intelligence for Prosthetics challenge, participants were tasked with building a controller for a musculoskeletal model with a goal of matching a given time-varying velocity vector.

Imitation Learning reinforcement-learning +1

Paper
Code

MatchBench: An Evaluation of Feature Matchers

no code implementations • 7 Aug 2018 • Jia-Wang Bian, Ruihan Yang, Yun Liu, Le Zhang, Ming-Ming Cheng, Ian Reid, WenHai Wu

This leads to a critical absence in this field that there is no standard datasets and evaluation metrics to evaluate different feature matchers fairly.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.