Search Results for author: Haowei Liu

Found 10 papers, 4 papers with code

ChainRank-DPO: Chain Rank Direct Preference Optimization for LLM Rankers

no code implementations • 18 Dec 2024 • Haowei Liu, Xuyang Wu, Guohao Sun, Zhiqiang Tao, Yi Fang

Large language models (LLMs) have demonstrated remarkable effectiveness in text reranking through works like RankGPT, leveraging their human-like reasoning about relevance.

MMLU Text Reranking

CLERF: Contrastive LEaRning for Full Range Head Pose Estimation

no code implementations • 3 Dec 2024 • Ting-Ruen Wei, Haowei Liu, Huei-Chung Hu, Xuyang Wu, Yi Fang, Hsin-Tai Wu

Experiments show that our methodology performs on par with state-of-the-art models on standard test datasets and outperforms them when images are slightly rotated/flipped or full range head pose.

Contrastive Learning Head Pose Estimation +2

mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models

1 code implementation • 9 Aug 2024 • Jiabo Ye, Haiyang Xu, Haowei Liu, Anwen Hu, Ming Yan, Qi Qian, Ji Zhang, Fei Huang, Jingren Zhou

Multi-modal Large Language Models (MLLMs) have demonstrated remarkable capabilities in executing instructions for a variety of single-image tasks.

Language Modeling Language Modelling +3

Full-range Head Pose Geometric Data Augmentations

no code implementations • 2 Aug 2024 • Huei-Chung Hu, Xuyang Wu, Haowei Liu, Ting-Ruen Wei, Hsin-Tai Wu

Many head pose estimation (HPE) methods promise the ability to create full-range datasets, theoretically allowing the estimation of the rotation and positioning of the head from various angles.

Dataset Generation Head Pose Estimation +1

MIBench: Evaluating Multimodal Large Language Models over Multiple Images

no code implementations • 21 Jul 2024 • Haowei Liu, Xi Zhang, Haiyang Xu, Yaya Shi, Chaoya Jiang, Ming Yan, Ji Zhang, Fei Huang, Chunfeng Yuan, Bing Li, Weiming Hu

However, most existing MLLMs and benchmarks primarily focus on single-image input scenarios, leaving the performance of MLLMs when handling realistic multiple images underexplored.

In-Context Learning Multiple-choice

Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval

no code implementations • 26 Feb 2024 • Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li, Weiming Hu

In this work, we propose the UNIFY framework, which learns lexicon representations to capture fine-grained semantics and combines the strengths of latent and lexicon representations for video-text retrieval.

Text Retrieval Video-Text Retrieval

CriticBench: Benchmarking LLMs for Critique-Correct Reasoning

1 code implementation • 22 Feb 2024 • Zicheng Lin, Zhibin Gou, Tian Liang, Ruilin Luo, Haowei Liu, Yujiu Yang

Utilizing CriticBench, we evaluate and dissect the performance of 17 LLMs in generation, critique, and correction reasoning, i.e., GQC reasoning.

Benchmarking

NFT1000: A Cross-Modal Dataset for Non-Fungible Token Retrieval

1 code implementation • 29 Jan 2024 • Shuxun Wang, Yunfei Lei, Ziqi Zhang, Wei Liu, Haowei Liu, Li Yang, Wenjuan Li, Bing Li, Weiming Hu

In this paper, we introduce a benchmark dataset named "NFT Top1000 Visual-Text Dataset" (NFT1000), containing 7.56 million image-text pairs collected from the 1000 most famous PFP NFT collections by sales volume on the Ethereum blockchain.

Retrieval
