Search Results for author: Raffay Hamid

Found 15 papers, 1 papers with code

LEMaRT: Label-Efficient Masked Region Transform for Image Harmonization

no code implementations • CVPR 2023 • Sheng Liu, Cong Phuoc Huynh, Cong Chen, Maxim Arap, Raffay Hamid

We present a simple yet effective self-supervised pre-training method for image harmonization which can leverage large-scale unannotated image datasets.

Image Harmonization

Paper
Add Code

Selective Structured State-Spaces for Long-Form Video Understanding

no code implementations • CVPR 2023 • Jue Wang, Wentao Zhu, Pichao Wang, Xiang Yu, Linda Liu, Mohamed Omar, Raffay Hamid

To address this limitation, we present a novel Selective S4 (i. e., S5) model that employs a lightweight mask generator to adaptively select informative image tokens resulting in more efficient and accurate modeling of long-term spatiotemporal dependencies in videos.

Ranked #2 on Video Classification on Breakfast

Contrastive Learning Token Reduction +2

Paper
Add Code

Scalable Temporal Localization of Sensitive Activities in Movies and TV Episodes

no code implementations • 16 Jun 2022 • Xiang Hao, Jingxiang Chen, Shixing Chen, Ahmed Saad, Raffay Hamid

To help customers make better-informed viewing choices, video-streaming services try to moderate their content and provide more visibility into which portions of their movies and TV episodes contain age-appropriate material (e. g., nudity, sex, violence, or drug-use).

Temporal Localization

Paper
Add Code

Robust Cross-Modal Representation Learning with Progressive Self-Distillation

no code implementations • CVPR 2022 • Alex Andonian, Shixing Chen, Raffay Hamid

The learning objective of vision-language approach of CLIP does not effectively account for the noisy many-to-many correspondences found in web-harvested image captioning datasets, which contributes to its compute and data inefficiency.

Ranked #98 on Image Classification on ObjectNet (using extra training data)

Contrastive Learning Image Captioning +5

Paper
Add Code

Depth-Guided Sparse Structure-from-Motion for Movies and TV Shows

1 code implementation • CVPR 2022 • Sheng Liu, Xiaohan Nie, Raffay Hamid

We demonstrate that our approach: (a) significantly improves the quality of 3-D reconstruction for our small-parallax setting, (b) does not cause any degradation for data with large-parallax, and (c) maintains the generalizability and scalability of geometry-based sparse SfM.

Paper
Code

Movies2Scenes: Using Movie Metadata to Learn Scene Representation

no code implementations • CVPR 2023 • Shixing Chen, Chun-Hao Liu, Xiang Hao, Xiaohan Nie, Maxim Arap, Raffay Hamid

However, labeling individual scenes is a time-consuming process.

Contrastive Learning Scene Understanding

Paper
Add Code

Shot Contrastive Self-Supervised Learning for Scene Boundary Detection

no code implementations • CVPR 2021 • Shixing Chen, Xiaohan Nie, David Fan, Dongqing Zhang, Vimal Bhat, Raffay Hamid

To assess the effectiveness of ShotCoL on novel applications of scene boundary detection, we take on the problem of finding timestamps in movies and TV episodes where video-ads can be inserted while offering a minimally disruptive viewing experience.

Boundary Detection Contrastive Learning +1

Paper
Add Code

Geospatial Correspondences for Multimodal Registration

no code implementations • CVPR 2016 • Diego Marcos, Raffay Hamid, Devis Tuia

The growing availability of very high resolution (<1 m/pixel) satellite and aerial images has opened up unprecedented opportunities to monitor and analyze the evolution of land-cover and land-use across the world.

Change Detection

Paper
Add Code

Large-Scale Damage Detection Using Satellite Imagery

no code implementations • CVPR 2015 • Lionel Gueguen, Raffay Hamid

Satellite imagery is a valuable source of information for assessing damages in distressed areas undergoing a calamity, such as an earthquake or an armed conflict.

Paper
Add Code

Hardware Compliant Approximate Image Codes

no code implementations • CVPR 2015 • Da Kuang, Alex Gittens, Raffay Hamid

In recent years, several feature encoding schemes for the bags-of-visual-words model have been proposed.

Computational Efficiency General Classification

Paper
Add Code

Fast Approximate Matching of Cell-Phone Videos for Robust Background Subtraction

no code implementations • 22 Apr 2014 • Raffay Hamid, Atish Das Sarma, Dennis Decoste, Neel Sundaresan

We identify a novel instance of the background subtraction problem that focuses on extracting near-field foreground objects captured using handheld cameras.

Object

Paper
Add Code

piCholesky: Polynomial Interpolation of Multiple Cholesky Factors for Efficient Approximate Cross-Validation

no code implementations • 2 Apr 2014 • Da Kuang, Alex Gittens, Raffay Hamid

The dominant cost in solving least-square problems using Newton's method is often that of factorizing the Hessian matrix over multiple values of the regularization parameter ($\lambda$).

Paper
Add Code

Compact Random Feature Maps

no code implementations • 17 Dec 2013 • Raffay Hamid, Ying Xiao, Alex Gittens, Dennis Decoste

Kernel approximation using randomized feature maps has recently gained a lot of interest.

Paper
Add Code

Large-Scale Video Summarization Using Web-Image Priors

no code implementations • CVPR 2013 • Aditya Khosla, Raffay Hamid, Chih-Jen Lin, Neel Sundaresan

Given the enormous growth in user-generated videos, it is becoming increasingly important to be able to navigate them efficiently.

Navigate Video Summarization

Paper
Add Code

Dense Non-rigid Point-Matching Using Random Projections

no code implementations • CVPR 2013 • Raffay Hamid, Dennis Decoste, Chih-Jen Lin

We present a robust and efficient technique for matching dense sets of points undergoing non-rigid spatial transformations.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.