Search Results for author: Hanoona Abdul Rasheed

Found 2 papers, 2 papers with code

PG-Video-LLaVA: Pixel Grounding Large Video-Language Models

1 code implementation • 22 Nov 2023 • Shehan Munasinghe, Rusiru Thushara, Muhammad Maaz, Hanoona Abdul Rasheed, Salman Khan, Mubarak Shah, Fahad Khan

Extending image-based Large Multimodal Models (LMMs) to videos is challenging due to the inherent complexity of video data.

Benchmarking, Phrase Grounding +4

Self-Supervised Learning for Fine-Grained Visual Categorization

1 code implementation • 18 May 2021 • Muhammad Maaz, Hanoona Abdul Rasheed, Dhanalaxmi Gaddam

The deconstruction learning forces the model to focus on local object parts, while reconstruction learning helps in learning the correlation between the parts.

Fine-Grained Visual Categorization, Representation Learning +1
