no code implementations • 12 Mar 2024 • Hanxu Hu, Simon Yu, Pinzhen Chen, Edoardo M. Ponti
Despite the success of existing instruction-tuned models, we find that they usually struggle to respond to queries with multiple instructions.
no code implementations • 15 Nov 2023 • Wenhong Zhu, Hongkun Hao, Zhiwei He, Yunze Song, Yumeng Zhang, Hanxu Hu, Yiran Wei, Rui Wang, Hongyuan Lu
The best candidate is finally selected from this set based on the BLEURT score.
no code implementations • 24 May 2023 • Hanxu Hu, Frank Keller
Current pre-trained vison-language models (PVLMs) achieve excellent performance on a range of multi-modal datasets.
1 code implementation • 17 May 2023 • Hanxu Hu, Hongyuan Lu, Huajian Zhang, Yun-Ze Song, Wai Lam, Yue Zhang
To this end, we propose a novel method called CoS (Chain-of-Symbol Prompting) that represents the complex environments with condensed symbolic spatial representations during the chained intermediate thinking steps.
1 code implementation • 20 Feb 2023 • Hanxu Hu, Yunqing Liu, Zhongyi Yu, Laura Perez-Beltrachini
In this work we study user controlled table-to-text generation where users explore the content in a table by selecting cells and reading a natural language description thereof automatically produce by a natural language generator.
3 code implementations • IEEE International Conference on Multimedia and Expo (ICME) 2022 • Yang shen, Xuhao Sun, Xiu-Shen Wei, Hanxu Hu, Zhipeng Chen
In this paper, we propose a simple but effective method for dealing with the challenging fine-grained cross-modal retrieval task where it aims to enable flexible retrieval among subor-dinate categories across different modalities.