Search Results for author: Jie Cheng

Found 26 papers, 10 papers with code

Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning

1 code implementation21 Apr 2025 Jie Cheng, Ruixi Qiao, Lijun Li, Chao Guo, Junle Wang, Gang Xiong, Yisheng Lv, Fei-Yue Wang

In this paper, we identify the main cause of PRM-induced reward hacking: the canonical summation-form credit assignment in reinforcement learning (RL), which defines the value as cumulative gamma-decayed future rewards, easily induces LLMs to hack steps with high rewards.

All Form +2

ElimPCL: Eliminating Noise Accumulation with Progressive Curriculum Labeling for Source-Free Domain Adaptation

no code implementations31 Mar 2025 Jie Cheng, Hao Zheng, Meiguang Zheng, Lei Wang, Hao Wu, Jian Zhang

Source-Free Domain Adaptation (SFDA) aims to train a target model without source data, and the key is to generate pseudo-labels using a pre-trained source model.

Source-Free Domain Adaptation

Offline Reinforcement Learning with Discrete Diffusion Skills

no code implementations26 Mar 2025 Ruixi Qiao, Jie Cheng, Xingyuan Dai, Yonglin Tian, Yisheng Lv

Skills have been introduced to offline reinforcement learning (RL) as temporal abstractions to tackle complex, long-horizon tasks, promoting consistent behavior and enabling meaningful exploration.

Decoder Offline RL +3

Establishing Reality-Virtuality Interconnections in Urban Digital Twins for Superior Intelligent Road Inspection

no code implementations23 Dec 2024 Yikang Zhang, Chuang-Wei Liu, Jiahang Li, Yingbing Chen, Jie Cheng, Rui Fan

Road inspection is essential for ensuring road maintenance and traffic safety, as road defects gradually emerge and compromise road functionality.

LHPF: Look back the History and Plan for the Future in Autonomous Driving

no code implementations26 Nov 2024 Sheng Wang, Yao Tian, Xiaodong Mei, Ge Sun, Jie Cheng, Fulong Ma, Pedro V. Sander, Junwei Liang

However, these algorithms typically assess the current and historical plans independently, leading to discontinuities in driving intentions and an accumulation of errors with each step in a discontinuous plan.

Autonomous Driving Imitation Learning

Ada-K Routing: Boosting the Efficiency of MoE-based LLMs

no code implementations14 Oct 2024 Tongtian Yue, Longteng Guo, Jie Cheng, Xuange Gao, Jing Liu

In this paper, we propose a novel Ada-K routing strategy that dynamically adjusts the number of activated experts for each token, thereby improving the balance between computational efficiency and model performance.

Computational Efficiency Mixture-of-Experts

Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining

1 code implementation1 Oct 2024 Jie Cheng, Ruixi Qiao, Gang Xiong, Qinghai Miao, Yingwei Ma, Binhua Li, Yongbin Li, Yisheng Lv

Experimental results indicate that our largest agent, with 150 million parameters, achieves 78. 9% human-level performance on pretrained games using only 10% subsampled offline data, outperforming existing state-of-the-art large-scale offline RL baselines by 31. 6% on averange.

Atari Games model +3

Multipath Identification and Mitigation with FDA-MIMO Radar

no code implementations25 Jul 2024 Yizhen Jia, Jie Cheng, Wen-Qin Wang, Hui Chen

In smart city development, the automatic detection of structures and vehicles within urban or suburban areas via array radar (airborne or vehicle platforms) becomes crucial.

Diversity

Toward Motion Robustness: A masked attention regularization framework in remote photoplethysmography

no code implementations9 Jul 2024 Pengfei Zhao, Qigong Sun, Xiaolin Tian, Yige Yang, Shuo Tao, Jie Cheng, Jiantong Chen

There has been growing interest in facial video-based remote photoplethysmography (rPPG) measurement recently, with a focus on assessing various vital signs such as heart rate and heart rate variability.

Heart Rate Variability

SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models

1 code implementation CVPR 2024 Tongtian Yue, Jie Cheng, Longteng Guo, Xingyuan Dai, Zijia Zhao, Xingjian He, Gang Xiong, Yisheng Lv, Jing Liu

In this paper, we present and delve into the self-consistency capability of LVLMs, a crucial aspect that reflects the models' ability to both generate informative captions for specific objects and subsequently utilize these captions to accurately re-identify the objects in a closed-loop process.

RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences

1 code implementation27 Feb 2024 Jie Cheng, Gang Xiong, Xingyuan Dai, Qinghai Miao, Yisheng Lv, Fei-Yue Wang

Our experiments on robotic manipulation and locomotion tasks demonstrate that RIME significantly enhances the robustness of the state-of-the-art PbRL method.

reinforcement-learning Reinforcement Learning

On Smart Morphing Wing Aircraft Robust Adaptive Beamforming

no code implementations22 Dec 2023 Yizhen Jia, Hui Chen, Wen-Qin Wang, Jie Cheng

To overcome this problem and ensure robust beamforming for FCA, deviations in array control parameters (ACPs) and array perturbations, the effect of mutual coupling in addition to looking-direction errors should be considered.

Forecast-MAE: Self-supervised Pre-training for Motion Forecasting with Masked Autoencoders

2 code implementations ICCV 2023 Jie Cheng, Xiaodong Mei, Ming Liu

This study explores the application of self-supervised learning (SSL) to the task of motion forecasting, an area that has not yet been extensively investigated despite the widespread success of SSL in computer vision and natural language processing.

Inductive Bias Motion Forecasting +1

Accurate and Efficient Event-based Semantic Segmentation Using Adaptive Spiking Encoder-Decoder Network

no code implementations24 Apr 2023 Rui Zhang, Luziwei Leng, Kaiwei Che, Hu Zhang, Jie Cheng, Qinghai Guo, Jiangxing Liao, Ran Cheng

Moreover, we develop a dual-path Spiking Spatially-Adaptive Modulation module, which is specifically tailored to enhance the representation of sparse events and multi-modal inputs, thereby considerably improving network performance.

Decoder Event-based vision +1

E-MLB: Multilevel Benchmark for Event-Based Camera Denoising

1 code implementation21 Mar 2023 Saizhe Ding, Jinze Chen, Yang Wang, Yu Kang, Weiguo Song, Jie Cheng, Yang Cao

Event cameras, such as dynamic vision sensors (DVS), are biologically inspired vision sensors that have advanced over conventional cameras in high dynamic range, low latency and low power consumption, showing great application potential in many fields.

Denoising

Neuro-Modulated Hebbian Learning for Fully Test-Time Adaptation

no code implementations CVPR 2023 Yushun Tang, Ce Zhang, Heng Xu, Shuoshuo Chen, Jie Cheng, Luziwei Leng, Qinghai Guo, Zhihai He

We observe that the performance of this feed-forward Hebbian learning for fully test-time adaptation can be significantly improved by incorporating a feedback neuro-modulation layer.

Test-time Adaptation

DKT: Diverse Knowledge Transfer Transformer for Class Incremental Learning

no code implementations CVPR 2023 Xinyuan Gao, Yuhang He, Songlin Dong, Jie Cheng, Xing Wei, Yihong Gong

Deep neural networks suffer from catastrophic forgetting in class incremental learning, where the classification accuracy of old classes drastically deteriorates when the networks learn the knowledge of new classes.

class-incremental learning Class Incremental Learning +3

Time Series Prediction by Multi-task GPR with Spatiotemporal Information Transformation

1 code implementation26 Apr 2022 Peng Tao, Xiaohu Hao, Jie Cheng, Luonan Chen

Making an accurate prediction of an unknown system only from a short-term time series is difficult due to the lack of sufficient information, especially in a multi-step-ahead manner.

GPR Prediction +2

Discrete Time Convolution for Fast Event-Based Stereo

1 code implementation CVPR 2022 Kaixuan Zhang, Kaiwei Che, JianGuo Zhang, Jie Cheng, Ziyang Zhang, Qinghai Guo, Luziwei Leng

Inspired by continuous dynamics of biological neuron models, we propose a novel encoding method for sparse events - continuous time convolution (CTC) - which learns to model the spatial feature of the data with intrinsic dynamics.

Depth Estimation Stereo Matching

GSECnet: Ground Segmentation of Point Clouds for Edge Computing

no code implementations5 Apr 2021 Dong He, Jie Cheng, Jong-Hwan Kim

This paper proposes the GSECnet - Ground Segmentation network for Edge Computing, an efficient ground segmentation framework of point clouds specifically designed to be deployable on a low-power edge computing unit.

Edge-computing Segmentation

LogDet Rank Minimization with Application to Subspace Clustering

no code implementations3 Jul 2015 Zhao Kang, Chong Peng, Jie Cheng, Qiang Chen

Most of the recent studies use the nuclear norm as a convex surrogate of the rank operator.

Clustering Face Clustering +1

High-dimensional Mixed Graphical Models

no code implementations9 Apr 2013 Jie Cheng, Tianxi Li, Elizaveta Levina, Ji Zhu

While graphical models for continuous data (Gaussian graphical models) and discrete data (Ising models) have been extensively studied, there is little work on graphical models linking both continuous and discrete variables (mixed data), which are common in many scientific applications.

Computational Efficiency Vocal Bursts Intensity Prediction

Cannot find the paper you are looking for? You can Submit a new open access paper.