Search Results for author: Boyuan Zheng

Found 16 papers, 7 papers with code

A Trembling House of Cards? Mapping Adversarial Attacks against Language Agents

1 code implementation15 Feb 2024 Lingbo Mo, Zeyi Liao, Boyuan Zheng, Yu Su, Chaowei Xiao, Huan Sun

There is a surprisingly large gap between the speed and scale of their development and deployment and our understanding of their safety risks.

Dual-View Visual Contextualization for Web Navigation

no code implementations6 Feb 2024 Jihyung Kil, Chan Hee Song, Boyuan Zheng, Xiang Deng, Yu Su, Wei-Lun Chao

Automatic web navigation aims to build a web agent that can follow language instructions to execute complex and diverse tasks on real-world websites.

The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts

no code implementations23 Jan 2024 Lingfeng Shen, Weiting Tan, Sihao Chen, Yunmo Chen, Jingyu Zhang, Haoran Xu, Boyuan Zheng, Philipp Koehn, Daniel Khashabi

As the influence of large language models (LLMs) spans across global communities, their safety challenges in multilingual settings become paramount for alignment research.

GPT-4V(ision) is a Generalist Web Agent, if Grounded

1 code implementation3 Jan 2024 Boyuan Zheng, Boyu Gou, Jihyung Kil, Huan Sun, Yu Su

The recent development on large multimodal models (LMMs), especially GPT-4V(ision) and Gemini, has been quickly expanding the capability boundaries of multimodal models beyond traditional tasks like image captioning and visual question answering.

Image Captioning Question Answering +1

Mind2Web: Towards a Generalist Agent for the Web

1 code implementation NeurIPS 2023 Xiang Deng, Yu Gu, Boyuan Zheng, Shijie Chen, Samuel Stevens, Boshi Wang, Huan Sun, Yu Su

We introduce Mind2Web, the first dataset for developing and evaluating generalist agents for the web that can follow language instructions to complete complex tasks on any website.

Flatness-Aware Prompt Selection Improves Accuracy and Sample Efficiency

1 code implementation18 May 2023 Lingfeng Shen, Weiting Tan, Boyuan Zheng, Daniel Khashabi

We provide theoretical foundations for this metric and its relationship with other prompt selection metrics, providing a comprehensive understanding of existing methods.

Genetic Imitation Learning by Reward Extrapolation

no code implementations3 Jan 2023 Boyuan Zheng, Jianlong Zhou, Fang Chen

Imitation learning demonstrates remarkable performance in various domains.

Imitation Learning

Explaining Imitation Learning through Frames

no code implementations3 Jan 2023 Boyuan Zheng, Jianlong Zhou, Chunjie Liu, Yiqiao Li, Fang Chen

As one of the prevalent methods to achieve automation systems, Imitation Learning (IL) presents a promising performance in a wide range of domains.

Explainable artificial intelligence Imitation Learning

GANExplainer: GAN-based Graph Neural Networks Explainer

no code implementations30 Dec 2022 Yiqiao Li, Jianlong Zhou, Boyuan Zheng, Fang Chen

With the rapid deployment of graph neural networks (GNNs) based techniques into a wide range of applications such as link prediction, node classification, and graph classification the explainability of GNNs has become an indispensable component for predictive and trustworthy decision-making.

Decision Making Generative Adversarial Network +3

An Empirical Study on Finding Spans

no code implementations13 Oct 2022 Weiwei Gu, Boyuan Zheng, Yunmo Chen, Tongfei Chen, Benjamin Van Durme

We present an empirical study on methods for span finding, the selection of consecutive tokens in text for some downstream tasks.

Multilingual Coreference Resolution in Multiparty Dialogue

1 code implementation2 Aug 2022 Boyuan Zheng, Patrick Xia, Mahsa Yarmohammadi, Benjamin Van Durme

Existing multiparty dialogue datasets for entity coreference resolution are nascent, and many challenges are still unaddressed.

coreference-resolution Data Augmentation

Learn To Remember: Transformer with Recurrent Memory for Document-Level Machine Translation

no code implementations Findings (NAACL) 2022 Yukun Feng, Feng Li, Ziang Song, Boyuan Zheng, Philipp Koehn

We conduct experiments on three popular datasets for document-level machine translation and our model has an average improvement of 0. 91 s-BLEU over the sentence-level baseline.

Document Level Machine Translation Machine Translation +2

Exploring Generalization Ability of Pretrained Language Models on Arithmetic and Logical Reasoning

no code implementations15 Aug 2021 Cunxiang Wang, Boyuan Zheng, Yuchen Niu, Yue Zhang

To quantitatively and intuitively explore the generalization ability of pre-trained language models (PLMs), we have designed several tasks of arithmetic and logical reasoning.

Logical Reasoning

Imitation Learning: Progress, Taxonomies and Challenges

no code implementations23 Jun 2021 Boyuan Zheng, Sunny Verma, Jianlong Zhou, Ivor Tsang, Fang Chen

Imitation learning aims to extract knowledge from human experts' demonstrations or artificially created agents in order to replicate their behaviors.

Autonomous Driving Imitation Learning

SemEval-2021 Task 4: Reading Comprehension of Abstract Meaning

1 code implementation SEMEVAL 2021 Boyuan Zheng, Xiaoyu Yang, Yu-Ping Ruan, ZhenHua Ling, Quan Liu, Si Wei, Xiaodan Zhu

Given a passage and the corresponding question, a participating system is expected to choose the correct answer from five candidates of abstract concepts in a cloze-style machine reading comprehension setup.

Machine Reading Comprehension

Cannot find the paper you are looking for? You can Submit a new open access paper.