no code implementations • CVPR 2023 • Sheng Liu, Cong Phuoc Huynh, Cong Chen, Maxim Arap, Raffay Hamid
We present a simple yet effective self-supervised pre-training method for image harmonization which can leverage large-scale unannotated image datasets.
no code implementations • CVPR 2023 • Jue Wang, Wentao Zhu, Pichao Wang, Xiang Yu, Linda Liu, Mohamed Omar, Raffay Hamid
To address this limitation, we present a novel Selective S4 (i. e., S5) model that employs a lightweight mask generator to adaptively select informative image tokens resulting in more efficient and accurate modeling of long-term spatiotemporal dependencies in videos.
Ranked #2 on Video Classification on Breakfast
no code implementations • 16 Jun 2022 • Xiang Hao, Jingxiang Chen, Shixing Chen, Ahmed Saad, Raffay Hamid
To help customers make better-informed viewing choices, video-streaming services try to moderate their content and provide more visibility into which portions of their movies and TV episodes contain age-appropriate material (e. g., nudity, sex, violence, or drug-use).
no code implementations • CVPR 2022 • Alex Andonian, Shixing Chen, Raffay Hamid
The learning objective of vision-language approach of CLIP does not effectively account for the noisy many-to-many correspondences found in web-harvested image captioning datasets, which contributes to its compute and data inefficiency.
Ranked #98 on Image Classification on ObjectNet (using extra training data)
1 code implementation • CVPR 2022 • Sheng Liu, Xiaohan Nie, Raffay Hamid
We demonstrate that our approach: (a) significantly improves the quality of 3-D reconstruction for our small-parallax setting, (b) does not cause any degradation for data with large-parallax, and (c) maintains the generalizability and scalability of geometry-based sparse SfM.
no code implementations • CVPR 2023 • Shixing Chen, Chun-Hao Liu, Xiang Hao, Xiaohan Nie, Maxim Arap, Raffay Hamid
However, labeling individual scenes is a time-consuming process.
no code implementations • CVPR 2021 • Shixing Chen, Xiaohan Nie, David Fan, Dongqing Zhang, Vimal Bhat, Raffay Hamid
To assess the effectiveness of ShotCoL on novel applications of scene boundary detection, we take on the problem of finding timestamps in movies and TV episodes where video-ads can be inserted while offering a minimally disruptive viewing experience.
no code implementations • CVPR 2016 • Diego Marcos, Raffay Hamid, Devis Tuia
The growing availability of very high resolution (<1 m/pixel) satellite and aerial images has opened up unprecedented opportunities to monitor and analyze the evolution of land-cover and land-use across the world.
no code implementations • CVPR 2015 • Lionel Gueguen, Raffay Hamid
Satellite imagery is a valuable source of information for assessing damages in distressed areas undergoing a calamity, such as an earthquake or an armed conflict.
no code implementations • CVPR 2015 • Da Kuang, Alex Gittens, Raffay Hamid
In recent years, several feature encoding schemes for the bags-of-visual-words model have been proposed.
no code implementations • 22 Apr 2014 • Raffay Hamid, Atish Das Sarma, Dennis Decoste, Neel Sundaresan
We identify a novel instance of the background subtraction problem that focuses on extracting near-field foreground objects captured using handheld cameras.
no code implementations • 2 Apr 2014 • Da Kuang, Alex Gittens, Raffay Hamid
The dominant cost in solving least-square problems using Newton's method is often that of factorizing the Hessian matrix over multiple values of the regularization parameter ($\lambda$).
no code implementations • 17 Dec 2013 • Raffay Hamid, Ying Xiao, Alex Gittens, Dennis Decoste
Kernel approximation using randomized feature maps has recently gained a lot of interest.
no code implementations • CVPR 2013 • Aditya Khosla, Raffay Hamid, Chih-Jen Lin, Neel Sundaresan
Given the enormous growth in user-generated videos, it is becoming increasingly important to be able to navigate them efficiently.
no code implementations • CVPR 2013 • Raffay Hamid, Dennis Decoste, Chih-Jen Lin
We present a robust and efficient technique for matching dense sets of points undergoing non-rigid spatial transformations.