Search Results for author: Xiaona Zhou

Found 1 paper, 0 papers with code

PRIMA: Multi-Image Vision-Language Models for Reasoning Segmentation

No code implementations · 19 Dec 2024 · Muntasir Wahed, Kiet A. Nguyen, Adheesh Sunil Juvekar, Xinzhuo Li, Xiaona Zhou, Vedant Shah, Tianjiao Yu, Pinar Yanardag, Ismini Lourentzou

Despite significant advancements in Large Vision-Language Models (LVLMs), existing pixel-grounding models operate on single-image settings, limiting their ability to perform detailed, fine-grained comparisons across multiple images.

Reasoning Segmentation
