Search Results for author: Hanoona Abdul Rasheed

Found 2 papers, 2 papers with code

PG-Video-LLaVA: Pixel Grounding Large Video-Language Models

1 code implementation • 22 Nov 2023 • Shehan Munasinghe, Rusiru Thushara, Muhammad Maaz, Hanoona Abdul Rasheed, Salman Khan, Mubarak Shah, Fahad Khan

Extending image-based Large Multimodal Models (LMMs) to videos is challenging due to the inherent complexity of video data.

Benchmarking Phrase Grounding +4

198

Paper
Code

Self-Supervised Learning for Fine-Grained Visual Categorization

1 code implementation • 18 May 2021 • Muhammad Maaz, Hanoona Abdul Rasheed, Dhanalaxmi Gaddam

The deconstruction learning forces the model to focus on local object parts, while reconstruction learning helps in learning the correlation between the parts.

Fine-Grained Visual Categorization Representation Learning +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.