Search Results for author: Xuxin Cheng

Found 19 papers, 5 papers with code

Visual Whole-Body Control for Legged Loco-Manipulation

no code implementations • 25 Mar 2024 • Minghuan Liu, Zixuan Chen, Xuxin Cheng, Yandong Ji, Rizhao Qiu, Ruihan Yang, Xiaolong Wang

That is, the robot can control the legs and the arm at the same time to extend its workspace.

Position

Paper
Add Code

Retrieval is Accurate Generation

no code implementations • 27 Feb 2024 • Bowen Cao, Deng Cai, Leyang Cui, Xuxin Cheng, Wei Bi, Yuexian Zou, Shuming Shi

To address this, we propose to initialize the training oracles using linguistic heuristics and, more importantly, bootstrap the oracles through iterative self-reinforcement.

Language Modelling Retrieval +1

Paper
Add Code

Expressive Whole-Body Control for Humanoid Robots

no code implementations • 26 Feb 2024 • Xuxin Cheng, Yandong Ji, Junming Chen, Ruihan Yang, Ge Yang, Xiaolong Wang

Can we enable humanoid robots to generate rich, diverse, and expressive motions in the real world?

Imitation Learning

Paper
Add Code

Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning

1 code implementation • 30 Jan 2024 • Bang Yang, Yong Dai, Xuxin Cheng, Yaowei Li, Asif Raza, Yuexian Zou

To alleviate CF raised by covariate shift and lexical overlap, we further propose a novel approach that ensures the identical distribution of all token embeddings during initialization and regularizes token embedding learning during training.

Text Retrieval

Paper
Code

ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding

no code implementations • 19 Nov 2023 • Xuxin Cheng, Bowen Cao, Qichen Ye, Zhihong Zhu, Hongxiang Li, Yuexian Zou

Specifically, in fine-tuning, we apply mutual learning and train two SLU models on the manual transcripts and the ASR transcripts, respectively, aiming to iteratively share knowledge between these two models.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language Model

1 code implementation • 13 Oct 2023 • Qichen Ye, Junling Liu, Dading Chong, Peilin Zhou, Yining Hua, Fenglin Liu, Meng Cao, ZiMing Wang, Xuxin Cheng, Zhu Lei, Zhenhua Guo

In the CPT and SFT phases, Qilin-Med achieved 38. 4% and 40. 0% accuracy on the CMExam test set, respectively.

Knowledge Graphs Language Modelling +2

Paper
Code

Extreme Parkour with Legged Robots

no code implementations • 25 Sep 2023 • Xuxin Cheng, Kexin Shi, Ananye Agarwal, Deepak Pathak

In this paper, we take a similar approach to developing robot parkour on a small low-cost robot with imprecise actuation and a single front-facing depth camera for perception which is low-frequency, jittery, and prone to artifacts.

Paper
Add Code

G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory

1 code implementation • ICCV 2023 • Hongxiang Li, Meng Cao, Xuxin Cheng, Yaowei Li, Zhihong Zhu, Yuexian Zou

Due to two annoying issues in video grounding: (1) the co-existence of some visual entities in both ground truth and other moments, \ie semantic overlapping; (2) only a few moments in the video are annotated, \ie sparse annotation dilemma, vanilla contrastive learning is unable to model the correlations between temporally distant moments and learned inconsistent video representations.

Contrastive Learning Video Grounding

Paper
Code

PolyVoice: Language Models for Speech to Speech Translation

no code implementations • 5 Jun 2023 • Qianqian Dong, Zhiying Huang, Qiao Tian, Chen Xu, Tom Ko, Yunlong Zhao, Siyuan Feng, Tang Li, Kexin Wang, Xuxin Cheng, Fengpeng Yue, Ye Bai, Xi Chen, Lu Lu, Zejun Ma, Yuping Wang, Mingxuan Wang, Yuxuan Wang

For the speech synthesis part, we adopt the existing VALL-E X approach and build a unit-based audio language model.

Language Modelling Speech Synthesis +2

Paper
Add Code

Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation

no code implementations • ICCV 2023 • Yaowei Li, Bang Yang, Xuxin Cheng, Zhihong Zhu, Hongxiang Li, Yuexian Zou

Automatic radiology report generation has attracted enormous research interest due to its practical value in reducing the workload of radiologists.

Sentence

Paper
Add Code

Legs as Manipulator: Pushing Quadrupedal Agility Beyond Locomotion

no code implementations • 20 Mar 2023 • Xuxin Cheng, Ashish Kumar, Deepak Pathak

Locomotion has seen dramatic progress for walking or running across challenging terrains.

Paper
Add Code

PoseRAC: Pose Saliency Transformer for Repetitive Action Counting

1 code implementation • 15 Mar 2023 • Ziyu Yao, Xuxin Cheng, Yuexian Zou

Moreover, we introduce a pose-level method, PoseRAC, which is based on this representation and achieves state-of-the-art performance on two new version datasets by using Pose Saliency Annotation to annotate salient poses for training.

Ranked #1 on Repetitive Action Counting on RepCount

Repetitive Action Counting

Paper
Code

Exploiting Auxiliary Caption for Video Grounding

no code implementations • 15 Jan 2023 • Hongxiang Li, Meng Cao, Xuxin Cheng, Zhihong Zhu, Yaowei Li, Yuexian Zou

Video grounding aims to locate a moment of interest matching the given query sentence from an untrimmed video.

Contrastive Learning Dense Video Captioning +2

Paper
Add Code

M3ST: Mix at Three Levels for Speech Translation

no code implementations • 7 Dec 2022 • Xuxin Cheng, Qianqian Dong, Fengpeng Yue, Tom Ko, Mingxuan Wang, Yuexian Zou

How to solve the data scarcity problem for end-to-end speech-to-text translation (ST)?

Data Augmentation Machine Translation +3

Paper
Add Code

A Dynamic Graph Interactive Framework with Label-Semantic Injection for Spoken Language Understanding

1 code implementation • 8 Nov 2022 • Zhihong Zhu, Weiyuan Xu, Xuxin Cheng, Tengtao Song, Yuexian Zou

Multi-intent detection and slot filling joint models are gaining increasing traction since they are closer to complicated real-world scenarios.

Intent Detection slot-filling +2

Paper
Code

Deep Whole-Body Control: Learning a Unified Policy for Manipulation and Locomotion

no code implementations • 18 Oct 2022 • Zipeng Fu, Xuxin Cheng, Deepak Pathak

The standard hierarchical control pipeline for such legged manipulators is to decouple the controller into that of manipulation and locomotion.

Paper
Add Code

Reinforcement Learning for Robust Parameterized Locomotion Control of Bipedal Robots

no code implementations • 26 Mar 2021 • Zhongyu Li, Xuxin Cheng, Xue Bin Peng, Pieter Abbeel, Sergey Levine, Glen Berseth, Koushil Sreenath

Developing robust walking controllers for bipedal robots is a challenging endeavor.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Automated Lane Change Strategy using Proximal Policy Optimization-based Deep Reinforcement Learning

no code implementations • 7 Feb 2020 • Fei Ye, Xuxin Cheng, Pin Wang, Ching-Yao Chan, Jiucai Zhang

The simulation results demonstrate the lane change maneuvers can be efficiently learned and executed in a safe, smooth, and efficient manner.

Autonomous Driving reinforcement-learning +1

Paper
Add Code

Driving Decision and Control for Autonomous Lane Change based on Deep Reinforcement Learning

no code implementations • 23 Apr 2019 • Tianyu Shi, Pin Wang, Xuxin Cheng, Ching-Yao Chan, Ding Huang

We apply Deep Q-network (DQN) with the consideration of safety during the task for deciding whether to conduct the maneuver.

Autonomous Driving Decision Making +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.