Search Results for author: Chenyang Zhao

Found 13 papers, 4 papers with code

Seam-guided local alignment and stitching for large parallax images

no code implementations • 30 Nov 2023 • Tianli Liao, Chenyang Zhao, Lei LI, Heling Cao

However, the effectiveness of seam-cutting usually depends on the images being roughly aligned, so that there exists a local region where a plausible seam can be found.

Image Stitching

Large Language Model as a Policy Teacher for Training Reinforcement Learning Agents

1 code implementation • 22 Nov 2023 • ZiHao Zhou, Bin Hu, Chenyang Zhao, Pu Zhang, Bin Liu

By incorporating the guidance from the teacher agent, the student agent can distill the prior knowledge of the LLM into its own model.

Decision Making, Language Modelling, +2

TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise

no code implementations • 29 Oct 2023 • Nan He, Hanyu Lai, Chenyang Zhao, Zirui Cheng, Junting Pan, Ruoyu Qin, Ruofan Lu, Rui Lu, Yunchen Zhang, Gangming Zhao, Zhaohui Hou, Zhiyuan Huang, Shaoqing Lu, Ding Liang, Mingjie Zhan

Based on TeacherLM-7.1B, we augmented 58 NLP datasets and taught various student models with different parameters from OPT and BLOOM series in a multi-task setting.

Data Augmentation, Language Modelling

Prompt2Model: Generating Deployable Models from Natural Language Instructions

1 code implementation • 23 Aug 2023 • Vijay Viswanathan, Chenyang Zhao, Amanda Bertsch, Tongshuang Wu, Graham Neubig

In this paper, we propose Prompt2Model, a general-purpose method that takes a natural language task description like the prompts provided to LLMs, and uses it to train a special-purpose model that is conducive to deployment.

Retrieval

ODAM: Gradient-based instance-specific visual explanations for object detection

no code implementations • 13 Apr 2023 • Chenyang Zhao, Antoni B. Chan

We propose the gradient-weighted Object Detector Activation Maps (ODAM), a visualized explanation technique for interpreting the predictions of object detectors.

Attribute, Object, +2

On Context Distribution Shift in Task Representation Learning for Offline Meta RL

1 code implementation • 1 Apr 2023 • Chenyang Zhao, ZiHao Zhou, Bin Liu

Offline Meta Reinforcement Learning (OMRL) aims to learn transferable knowledge from offline datasets to enhance the learning process for new target tasks.

Continuous Control, Meta Reinforcement Learning, +3

Towards Trustworthy Multi-label Sewer Defect Classification via Evidential Deep Learning

no code implementations • 25 Oct 2022 • Chenyang Zhao, Chuanfei Hu, Hang Shao, Zhe Wang, Yongxiong Wang

Automatic vision-based sewer inspection plays a key role in the sewage system of a modern city.

Robust Domain Randomised Reinforcement Learning through Peer-to-Peer Distillation

no code implementations • 9 Dec 2020 • Chenyang Zhao, Timothy Hospedales

In reinforcement learning, domain randomisation is an increasingly popular technique for learning more general policies that are robust to domain-shifts at deployment.

Continuous Control, reinforcement-learning, +1
