Search Results for author: Chenyang Zhao

Found 24 papers, 13 papers with code

Revisiting Training-Inference Trigger Intensity in Backdoor Attacks

1 code implementation15 Mar 2025 Chenhao Lin, Chenyang Zhao, Shiwei Wang, Longtian Wang, Chao Shen, Zhengyu Zhao

Backdoor attacks typically place a specific trigger on certain training data, such that the model makes prediction errors on inputs with that trigger during inference.

ParallelComp: Parallel Long-Context Compressor for Length Extrapolation

no code implementations20 Feb 2025 Jing Xiong, Jianghan Shen, Chuanyang Zheng, Zhongwei Wan, Chenyang Zhao, Chiwun Yang, Fanghua Ye, Hongxia Yang, Lingpeng Kong, Ngai Wong

To mitigate the attention sink issue, we propose an attention calibration strategy that reduces biases, ensuring more stable long-range attention.

4k 8k

Evolution of cooperation in a bimodal mixture of conditional cooperators

1 code implementation11 Feb 2025 Chenyang Zhao, Xinshi Feng, Guozhong Zheng, Weiran Cai, Jiqiang Zhang, Li Chen

To explore the effects of hard and soft conditional cooperators, we examine their interactions in two scenarios: structural mixture (SM) and probabilistic mixture (PM), where the two behavioral modes are fixed and probabilistically adopted, respectively.

Q-Learning

RGBDS-SLAM: A RGB-D Semantic Dense SLAM Based on 3D Multi Level Pyramid Gaussian Splatting

1 code implementation2 Dec 2024 Zhenzhong Cao, Chenyang Zhao, Qianyi Zhang, Jinzheng Guang, Yinuo Song Jingtai Liu

To address this, we propose RGBDS-SLAM, a RGB-D semantic dense SLAM system based on 3D multi-level pyramid gaussian splatting, which enables high-quality dense reconstruction of scene RGB, depth, and semantics. In this system, we introduce a 3D multi-level pyramid gaussian splatting method that restores scene details by extracting multi-level image pyramids for gaussian splatting training, ensuring consistency in RGB, depth, and semantic reconstructions.

Navigating Spatial Inequities in Freight Truck Crash Severity via Counterfactual Inference in Los Angeles

no code implementations26 Nov 2024 Yichen Wang, Hao Yin, Yifan Yang, Chenyang Zhao, Siqin Wang

Freight truck-related crashes pose significant challenges, leading to substantial economic losses, injuries, and fatalities, with pronounced spatial disparities across different regions.

counterfactual Counterfactual Inference

CoPS: Empowering LLM Agents with Provable Cross-Task Experience Sharing

1 code implementation22 Oct 2024 Chen Yang, Chenyang Zhao, Quanquan Gu, Dongruo Zhou

In detail, CoPS leverages agents' experiences on previous tasks, selecting distribution-matched experiences via a provable pessimism-based strategy to maximize utility while minimizing risks from distribution shifts.

How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making?

no code implementations21 Oct 2024 Zuojin Tang, Bin Hu, Chenyang Zhao, De Ma, Gang Pan, Bin Liu

We provide a definitive answer to this question by developing a new model architecture termed Visual Language Action model for Chatting and Decision Making (VLA4CD), and further demonstrating its performance in challenging autonomous driving tasks.

Autonomous Driving Decision Making

Configurable Foundation Models: Building LLMs from a Modular Perspective

no code implementations4 Sep 2024 Chaojun Xiao, Zhengyan Zhang, Chenyang Song, Dazhi Jiang, Feng Yao, Xu Han, Xiaozhi Wang, Shuo Wang, Yufei Huang, GuanYu Lin, Yingfa Chen, Weilin Zhao, Yuge Tu, Zexuan Zhong, Ao Zhang, Chenglei Si, Khai Hao Moo, Chenyang Zhao, Huimin Chen, Yankai Lin, Zhiyuan Liu, Jingbo Shang, Maosong Sun

We first formalize modules into emergent bricks - functional neuron partitions that emerge during the pre-training phase, and customized bricks - bricks constructed via additional post-training to improve the capabilities and knowledge of LLMs.

Computational Efficiency Mixture-of-Experts

SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning

1 code implementation16 Jul 2024 Chenyang Zhao, Xueying Jia, Vijay Viswanathan, Tongshuang Wu, Graham Neubig

Large language models (LLMs) hold the promise of solving diverse tasks when provided with appropriate natural language prompts.

Instruction Following Language Modelling

Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence

1 code implementation9 Jul 2024 Weize Chen, Ziming You, Ran Li, Yitong Guan, Chen Qian, Chenyang Zhao, Cheng Yang, Ruobing Xie, Zhiyuan Liu, Maosong Sun

The rapid advancement of large language models (LLMs) has paved the way for the development of highly capable autonomous agents.

Leveraging Local Patch Alignment to Seam-cutting for Large Parallax Image Stitching

1 code implementation30 Nov 2023 Tianli Liao, Chenyang Zhao, Lei LI, Heling Cao

In this paper, we argue that by adding a simple Local Patch Alignment Module (LPAM) into the seam cutting, the final result can be efficiently improved for large parallax image stitching.

Image Stitching

Large Language Model as a Policy Teacher for Training Reinforcement Learning Agents

1 code implementation22 Nov 2023 ZiHao Zhou, Bin Hu, Chenyang Zhao, Pu Zhang, Bin Liu

By incorporating the guidance from the teacher agent, the student agent can distill the prior knowledge of the LLM into its own model.

Decision Making Language Modeling +4

Prompt2Model: Generating Deployable Models from Natural Language Instructions

1 code implementation23 Aug 2023 Vijay Viswanathan, Chenyang Zhao, Amanda Bertsch, Tongshuang Wu, Graham Neubig

In this paper, we propose Prompt2Model, a general-purpose method that takes a natural language task description like the prompts provided to LLMs, and uses it to train a special-purpose model that is conducive to deployment.

Data-free Knowledge Distillation Dataset Generation +1

Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach

1 code implementation6 Jun 2023 Bin Hu, Chenyang Zhao, Pu Zhang, ZiHao Zhou, Yuanhang Yang, Zenglin Xu, Bin Liu

We find that this problem can be naturally formulated by a Markov decision process (MDP), and propose When2Ask, a reinforcement learning based approach that learns when it is necessary to query LLMs for high-level instructions to accomplish a target task.

Decision Making Sequential Decision Making +1

ODAM: Gradient-based instance-specific visual explanations for object detection

1 code implementation13 Apr 2023 Chenyang Zhao, Antoni B. Chan

We propose the gradient-weighted Object Detector Activation Maps (ODAM), a visualized explanation technique for interpreting the predictions of object detectors.

Attribute Object +2

On Context Distribution Shift in Task Representation Learning for Offline Meta RL

1 code implementation1 Apr 2023 Chenyang Zhao, ZiHao Zhou, Bin Liu

Offline Meta Reinforcement Learning (OMRL) aims to learn transferable knowledge from offline datasets to enhance the learning process for new target tasks.

continuous-control Continuous Control +5

Robust Domain Randomised Reinforcement Learning through Peer-to-Peer Distillation

no code implementations9 Dec 2020 Chenyang Zhao, Timothy Hospedales

In reinforcement learning, domain randomisation is an increasingly popular technique for learning more general policies that are robust to domain-shifts at deployment.

continuous-control Continuous Control +3

Cannot find the paper you are looking for? You can Submit a new open access paper.