Search Results for author: Darshan Singh S

Found 2 papers, 0 papers with code

FiGCLIP: Fine-Grained CLIP Adaptation via Densely Annotated Videos

no code implementations • 15 Jan 2024 • Darshan Singh S, Zeeshan Khan, Makarand Tapaswi

We use the SRL and verb information to create rule-based detailed captions, making sure they capture most of the visual concepts.

Paper
Add Code

Unsupervised Audio-Visual Lecture Segmentation

no code implementations • 29 Oct 2022 • Darshan Singh S, Anchit Gupta, C. V. Jawahar, Makarand Tapaswi

We formulate lecture segmentation as an unsupervised task that leverages visual, textual, and OCR cues from the lecture, while clip representations are fine-tuned on a pretext self-supervised task of matching the narration with the temporally aligned visual content.

Navigate Optical Character Recognition (OCR) +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.