Search Results for author: Bohan Zhou

Found 5 papers, 1 papers with code

UniCode: Learning a Unified Codebook for Multimodal Large Language Models

no code implementations14 Mar 2024 Sipeng Zheng, Bohan Zhou, Yicheng Feng, Ye Wang, Zongqing Lu

In this paper, we propose \textbf{UniCode}, a novel approach within the domain of multimodal large language models (MLLMs) that learns a unified codebook to efficiently tokenize visual, text, and potentially other types of signals.

Quantization Visual Question Answering (VQA)

Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study

2 code implementations5 Mar 2024 Weihao Tan, Ziluo Ding, Wentao Zhang, Boyu Li, Bohan Zhou, Junpeng Yue, Haochong Xia, Jiechuan Jiang, Longtao Zheng, Xinrun Xu, Yifei Bi, Pengjie Gu, Xinrun Wang, Börje F. Karlsson, Bo An, Zongqing Lu

Despite the success in specific tasks and scenarios, existing foundation agents, empowered by large models (LMs) and advanced tools, still cannot generalize to different scenarios, mainly due to dramatic differences in the observations and actions across scenarios.

Efficient Exploration

Learning from Visual Observation via Offline Pretrained State-to-Go Transformer

no code implementations NeurIPS 2023 Bohan Zhou, Ke Li, Jiechuan Jiang, Zongqing Lu

Learning from visual observation (LfVO), aiming at recovering policies from only visual observation data, is promising yet a challenging problem.

reinforcement-learning

GFIE: A Dataset and Baseline for Gaze-Following From 2D to 3D in Indoor Environments

no code implementations CVPR 2023 Zhengxi Hu, Yuxue Yang, Xiaolin Zhai, Dingye Yang, Bohan Zhou, Jingtai Liu

Gaze-following is a kind of research that requires locating where the person in the scene is looking automatically under the topic of gaze estimation.

Gaze Estimation

Distilling Neuron Spike with High Temperature in Reinforcement Learning Agents

no code implementations5 Aug 2021 Ling Zhang, Jian Cao, Yuan Zhang, Bohan Zhou, Shuo Feng

This method uses distillation to effectively avoid the weakness of STBP, which can achieve SOTA performance in classification, and can obtain a smaller, faster convergence and lower power consumption SNN reinforcement learning model.

reinforcement-learning Reinforcement Learning (RL) +1

Cannot find the paper you are looking for? You can Submit a new open access paper.