no code implementations • 20 Sep 2024 • Steven Grosz, Rui Zhao, Rajeev Ranjan, Hongcheng Wang, Manoj Aggarwal, Gerard Medioni, Anil Jain
This paper improves upon existing data pruning methods for image classification by introducing a novel pruning metric and pruning procedure based on importance sampling.
1 code implementation • 29 Oct 2023 • Alon Shoshan, Nadav Bhonker, Emanuel Ben Baruch, Ori Nizan, Igor Kviatkovsky, Joshua Engelsma, Manoj Aggarwal, Gerard Medioni
We demonstrate the merits of FPGAN-Control, both quantitatively and qualitatively, in terms of identity preservation level, degree of appearance control, and low synthetic-to-real domain gap.
no code implementations • 14 Apr 2023 • Rohan Sarkar, Achal Dave, Gerard Medioni, Benjamin Biggs
This paper presents Shape of You (SoY), an approach to improve the accuracy of 3D body shape estimation for vision-based clothing recommendation systems.
Ranked #4 on
3D Human Shape Estimation
on SSP-3D
no code implementations • 8 Apr 2023 • Jinming Li, Wentao Zhang, Tian Wang, Guanglei Xiong, Alan Lu, Gerard Medioni
The generated queries naturally serve as interpretable representations of user interests and can be searched to recommend cold-start items.
no code implementations • 30 Mar 2023 • Ori Linial, Alon Shoshan, Nadav Bhonker, Elad Hirsch, Lior Zamir, Igor Kviatkovsky, Gerard Medioni
In this setting, a large model is used for indexing the gallery while a lightweight model is used for querying.
no code implementations • 25 Jul 2022 • Karin Sevegnani, Arjun Seshadri, Tian Wang, Anurag Beniwal, Julian McAuley, Alan Lu, Gerard Medioni
Recommender systems and search are both indispensable in facilitating personalization and ease of browsing in online fashion platforms.
no code implementations • 22 Apr 2022 • Jiuhong Xiao, Lavisha Aggarwal, Prithviraj Banerjee, Manoj Aggarwal, Gerard Medioni
We present a novel Identity Preserving Reconstruction (IPR) loss function which achieves Bits-Per-Pixel (BPP) values that are ~38% and ~42% of CRF-23 HEVC compression for LFW (low-resolution) and CelebA-HQ (high-resolution) datasets, respectively, while maintaining parity in recognition accuracy.
2 code implementations • 11 Apr 2022 • Rohan Sarkar, Navaneeth Bodla, Mariya I. Vasileva, Yen-Liang Lin, Anurag Beniwal, Alan Lu, Gerard Medioni
For compatibility prediction, we design an outfit token to capture a global outfit representation and train the framework using a classification loss.
no code implementations • CVPR 2022 • Jialian Wu, Sudhir Yarram, Hui Liang, Tian Lan, Junsong Yuan, Jayan Eledath, Gerard Medioni
In addition, VisTR is not fully end-to-end learnable in multiple video clips as it requires a hand-crafted data association to link instance tracklets between successive clips.
no code implementations • 3 May 2021 • Alon Shoshan, Nadav Bhonker, Igor Kviatkovsky, Matan Fintz, Gerard Medioni
In contrast to using synthetic data for training, in this work we explore whether synthetic data can be beneficial for model selection.
1 code implementation • CVPR 2021 • Mohammed Suhail, Abhay Mittal, Behjat Siddiquie, Chris Broaddus, Jayan Eledath, Gerard Medioni, Leonid Sigal
The proposed formulation allows for efficiently incorporating the structure of scene graphs in the output space.
Ranked #4 on
Scene Graph Generation
on Visual Genome
1 code implementation • ICCV 2021 • Alon Shoshan, Nadav Bhonker, Igor Kviatkovsky, Gerard Medioni
We present a framework for training GANs with explicit control over generated images.
no code implementations • 3 Jun 2020 • Igor Kviatkovsky, Nadav Bhonker, Gerard Medioni
We present a method for synthesizing naturally looking images of multiple people interacting in a specific scenario.
1 code implementation • CVPR 2020 • Maxim Berman, Leonid Pishchulin, Ning Xu, Matthew B. Blaschko, Gerard Medioni
We introduce a novel efficient one-shot NAS approach to optimally search for channel numbers, given latency constraints on a specific hardware.
no code implementations • 18 Feb 2020 • Donghyun Kim, Tian Lan, Chuhang Zou, Ning Xu, Bryan A. Plummer, Stan Sclaroff, Jayan Eledath, Gerard Medioni
We embed the attention module in a ``slow-fast'' architecture, where the slower network runs on sparsely sampled keyframes and the light-weight shallow network runs on non-keyframes at a high frame rate.
1 code implementation • 2 Feb 2018 • Feng-Ju Chang, Anh Tuan Tran, Tal Hassner, Iacopo Masi, Ram Nevatia, Gerard Medioni
Our ExpNet CNN is applied directly to the intensities of a face image and regresses a 29D vector of 3D expression coefficients.
Ranked #1 on
3D Facial Expression Recognition
on 2017_test set
(using extra training data)
1 code implementation • CVPR 2018 • Anh Tuan Tran, Tal Hassner, Iacopo Masi, Eran Paz, Yuval Nirkin, Gerard Medioni
Motivated by the concept of bump mapping, we propose a layered approach which decouples estimation of a global shape from its mid-level details (e. g., wrinkles).
5 code implementations • 24 Aug 2017 • Feng-Ju Chang, Anh Tuan Tran, Tal Hassner, Iacopo Masi, Ram Nevatia, Gerard Medioni
Instead, we compare our FPN with existing methods by evaluating how they affect face recognition accuracy on the IJB-A and IJB-B benchmarks: using the same recognition pipeline, but varying the face alignment method.
Ranked #1 on
Facial Landmark Detection
on 300W
(Mean Error Rate metric)
2 code implementations • 22 Apr 2017 • Yuval Nirkin, Iacopo Masi, Anh Tuan Tran, Tal Hassner, Gerard Medioni
To this end, we use the Labeled Faces in the Wild (LFW) benchmark and measure the effect of intra- and inter-subject face swapping on recognition.
no code implementations • 30 Mar 2017 • Donghyun Kim, Matthias Hernandez, Jongmoo Choi, Gerard Medioni
We also propose a 3D face augmentation technique which synthesizes a number of different facial expressions from a single 3D face scan.
5 code implementations • CVPR 2017 • Anh Tuan Tran, Tal Hassner, Iacopo Masi, Gerard Medioni
The 3D shapes of faces are well known to be discriminative.
Ranked #4 on
3D Face Reconstruction
on Florence
(Average 3D Error metric)
no code implementations • 29 Nov 2016 • Shay Deutsch, Antonio Ortega, Gerard Medioni
We propose a new framework for manifold denoising based on processing in the graph Fourier frequency domain, derived from the spectral decomposition of the discrete graph Laplacian.
no code implementations • 6 Jul 2016 • Tal Hassner, Iacopo Masi, Jungyeon Kim, Jongmoo Choi, Shai Harel, Prem Natarajan, Gerard Medioni
We propose a novel approach to template based face recognition.
no code implementations • CVPR 2016 • Iacopo Masi, Stephen Rawls, Gerard Medioni, Prem Natarajan
We propose a method to push the frontiers of unconstrained face recognition in the wild, focusing on the problem of extreme pose variations.
no code implementations • 11 Apr 2016 • Ruizhe Wang, Lingyu Wei, Etienne Vouga, Qi-Xing Huang, Duygu Ceylan, Gerard Medioni, Hao Li
We present an end-to-end system for reconstructing complete watertight and textured models of moving subjects such as clothed humans and animals, using only three or four handheld sensors.
no code implementations • 28 Mar 2016 • Bor-Jeng Chen, Gerard Medioni
Tracking many vehicles in wide coverage aerial imagery is crucial for understanding events in a large field of view.
no code implementations • 23 Mar 2016 • Wael Abd-Almageed, Yue Wua, Stephen Rawlsa, Shai Harel, Tal Hassner, Iacopo Masi, Jongmoo Choi, Jatuporn Toy Leksut, Jungyeon Kim, Prem Natarajan, Ram Nevatia, Gerard Medioni
In our representation, a face image is processed by several pose-specific deep convolutional neural network (CNN) models to generate multiple pose-specific features.
Ranked #15 on
Face Verification
on IJB-A
no code implementations • 23 Mar 2016 • Iacopo Masi, Anh Tuan Tran, Jatuporn Toy Leksut, Tal Hassner, Gerard Medioni
Face recognition capabilities have recently made extraordinary leaps.
Ranked #13 on
Face Verification
on IJB-A
no code implementations • 19 Jan 2016 • Tai-Pang Wu, Sai-Kit Yeung, Jiaya Jia, Chi-Keung Tang, Gerard Medioni
We prove a closed-form solution to tensor voting (CFTV): given a point set in any dimensions, our closed-form solution provides an exact, continuous and efficient algorithm for computing a structure-aware tensor that simultaneously achieves salient structure detection and outlier attenuation.
no code implementations • 12 Nov 2015 • Yue Wu, Tal Hassner, KangGeon Kim, Gerard Medioni, Prem Natarajan
We present a novel convolutional neural network (CNN) design for facial landmark coordinate regression.
no code implementations • CVPR 2014 • Ruizhe Wang, Jongmoo Choi, Gerard Medioni
We propose here a novel approach which leverages contour coherence and allows us to align two wide baseline range scans with limited overlap from a poor initialization.
no code implementations • CVPR 2014 • Jan Prokaj, Gerard Medioni
Persistent surveillance of large geographic areas from unmanned aerial vehicles allows us to learn much about the daily activities in the region of interest.