no code implementations • 27 May 2024 • Boyuan Zheng, Jianlong Zhou, Fang Chen
Natural language, moreover, serves as the primary medium through which humans acquire new knowledge, presenting a potentially intuitive bridge for translating concepts understandable by humans into formats that can be learned by machines.
no code implementations • 20 May 2024 • Hai Zhang, Boyuan Zheng, Anqi Guo, Tianying Ji, Pheng-Ann Heng, Junqiao Zhao, Lanqing Li
Previous context-based approaches predominantly rely on the intuition that maximizing the mutual information between the task and the task representation ($I(Z;M)$) can lead to performance improvements.
1 code implementation • 15 Feb 2024 • Lingbo Mo, Zeyi Liao, Boyuan Zheng, Yu Su, Chaowei Xiao, Huan Sun
There is a surprisingly large gap between the speed and scale of their development and deployment and our understanding of their safety risks.
no code implementations • CVPR 2024 • Jihyung Kil, Chan Hee Song, Boyuan Zheng, Xiang Deng, Yu Su, Wei-Lun Chao
Automatic web navigation aims to build a web agent that can follow language instructions to execute complex and diverse tasks on real-world websites.
no code implementations • 23 Jan 2024 • Lingfeng Shen, Weiting Tan, Sihao Chen, Yunmo Chen, Jingyu Zhang, Haoran Xu, Boyuan Zheng, Philipp Koehn, Daniel Khashabi
As the influence of large language models (LLMs) spans across global communities, their safety challenges in multilingual settings become paramount for alignment research.
1 code implementation • 3 Jan 2024 • Boyuan Zheng, Boyu Gou, Jihyung Kil, Huan Sun, Yu Su
The recent development on large multimodal models (LMMs), especially GPT-4V(ision) and Gemini, has been quickly expanding the capability boundaries of multimodal models beyond traditional tasks like image captioning and visual question answering.
4 code implementations • CVPR 2024 • Xiang Yue, Yuansheng Ni, Kai Zhang, Tianyu Zheng, Ruoqi Liu, Ge Zhang, Samuel Stevens, Dongfu Jiang, Weiming Ren, Yuxuan Sun, Cong Wei, Botao Yu, Ruibin Yuan, Renliang Sun, Ming Yin, Boyuan Zheng, Zhenzhu Yang, Yibo Liu, Wenhao Huang, Huan Sun, Yu Su, Wenhu Chen
We introduce MMMU: a new benchmark designed to evaluate multimodal models on massive multi-discipline tasks demanding college-level subject knowledge and deliberate reasoning.
1 code implementation • NeurIPS 2023 • Xiang Deng, Yu Gu, Boyuan Zheng, Shijie Chen, Samuel Stevens, Boshi Wang, Huan Sun, Yu Su
We introduce Mind2Web, the first dataset for developing and evaluating generalist agents for the web that can follow language instructions to complete complex tasks on any website.
1 code implementation • 18 May 2023 • Lingfeng Shen, Weiting Tan, Boyuan Zheng, Daniel Khashabi
We provide theoretical foundations for this metric and its relationship with other prompt selection metrics, providing a comprehensive understanding of existing methods.
no code implementations • 3 Jan 2023 • Boyuan Zheng, Jianlong Zhou, Chunjie Liu, Yiqiao Li, Fang Chen
As one of the prevalent methods to achieve automation systems, Imitation Learning (IL) presents a promising performance in a wide range of domains.
no code implementations • 3 Jan 2023 • Boyuan Zheng, Jianlong Zhou, Fang Chen
Imitation learning demonstrates remarkable performance in various domains.
no code implementations • 30 Dec 2022 • Yiqiao Li, Jianlong Zhou, Boyuan Zheng, Fang Chen
With the rapid deployment of graph neural networks (GNNs) based techniques into a wide range of applications such as link prediction, node classification, and graph classification the explainability of GNNs has become an indispensable component for predictive and trustworthy decision-making.
no code implementations • 13 Oct 2022 • Weiwei Gu, Boyuan Zheng, Yunmo Chen, Tongfei Chen, Benjamin Van Durme
We present an empirical study on methods for span finding, the selection of consecutive tokens in text for some downstream tasks.
1 code implementation • 2 Aug 2022 • Boyuan Zheng, Patrick Xia, Mahsa Yarmohammadi, Benjamin Van Durme
Existing multiparty dialogue datasets for entity coreference resolution are nascent, and many challenges are still unaddressed.
no code implementations • Findings (NAACL) 2022 • Yukun Feng, Feng Li, Ziang Song, Boyuan Zheng, Philipp Koehn
We conduct experiments on three popular datasets for document-level machine translation and our model has an average improvement of 0. 91 s-BLEU over the sentence-level baseline.
no code implementations • 15 Aug 2021 • Cunxiang Wang, Boyuan Zheng, Yuchen Niu, Yue Zhang
To quantitatively and intuitively explore the generalization ability of pre-trained language models (PLMs), we have designed several tasks of arithmetic and logical reasoning.
no code implementations • 23 Jun 2021 • Boyuan Zheng, Sunny Verma, Jianlong Zhou, Ivor Tsang, Fang Chen
Imitation learning aims to extract knowledge from human experts' demonstrations or artificially created agents in order to replicate their behaviors.
1 code implementation • SEMEVAL 2021 • Boyuan Zheng, Xiaoyu Yang, Yu-Ping Ruan, ZhenHua Ling, Quan Liu, Si Wei, Xiaodan Zhu
Given a passage and the corresponding question, a participating system is expected to choose the correct answer from five candidates of abstract concepts in a cloze-style machine reading comprehension setup.