Search Results for author: Ali Thabet

Found 34 papers, 21 papers with code

Bespoke Non-Stationary Solvers for Fast Sampling of Diffusion and Flow Models

no code implementations • 2 Mar 2024 • Neta Shaul, Uriel Singer, Ricky T. Q. Chen, Matthew Le, Ali Thabet, Albert Pumarola, Yaron Lipman

This paper introduces Bespoke Non-Stationary (BNS) Solvers, a solver distillation approach to improve sample efficiency of Diffusion and Flow models.

Audio Generation Conditional Image Generation +1

Paper
Add Code

Animated Stickers: Bringing Stickers to Life with Video Diffusion

no code implementations • 8 Feb 2024 • David Yan, Winnie Zhang, Luxin Zhang, Anmol Kalia, Dingkang Wang, Ankit Ramchandani, Miao Liu, Albert Pumarola, Edgar Schoenfeld, Elliot Blanchard, Krishna Narni, Yaqiao Luo, Lawrence Chen, Guan Pang, Ali Thabet, Peter Vajda, Amy Bearman, Licheng Yu

Our model is built on top of the state-of-the-art Emu text-to-image model, with the addition of temporal layers to model motion.

Paper
Add Code

fMPI: Fast Novel View Synthesis in the Wild with Layered Scene Representations

no code implementations • 26 Dec 2023 • Jonas Kohler, Nicolas Griffiths Sanchez, Luca Cavalli, Catherine Herold, Albert Pumarola, Alberto Garcia Garcia, Ali Thabet

In this study, we propose two novel input processing paradigms for novel view synthesis (NVS) methods based on layered scene representations that significantly improve their runtime without compromising quality.

Novel View Synthesis

Paper
Add Code

Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models

1 code implementation • 19 Dec 2023 • Angela Castillo, Jonas Kohler, Juan C. Pérez, Juan Pablo Pérez, Albert Pumarola, Bernard Ghanem, Pablo Arbeláez, Ali Thabet

Our findings provide insights into the efficiency of the conditional denoising process that contribute to more practical and swift deployment of text-conditioned diffusion models.

Denoising Neural Architecture Search

208

Paper
Code

Bespoke Solvers for Generative Flow Models

no code implementations • 29 Oct 2023 • Neta Shaul, Juan Perez, Ricky T. Q. Chen, Ali Thabet, Albert Pumarola, Yaron Lipman

For example, a Bespoke solver for a CIFAR10 model produces samples with Fr\'echet Inception Distance (FID) of 2. 73 with 10 NFE, and gets to 1% of the Ground Truth (GT) FID (2. 59) for this model with only 20 NFE.

Paper
Add Code

BoDiffusion: Diffusing Sparse Observations for Full-Body Human Motion Synthesis

no code implementations • 21 Apr 2023 • Angela Castillo, Maria Escobar, Guillaume Jeanneret, Albert Pumarola, Pablo Arbeláez, Ali Thabet, Artsiom Sanakoyeu

To the best of our knowledge, this is the first approach that uses the reverse diffusion process to model full-body tracking as a conditional sequence generation task.

Mixed Reality Motion Synthesis

Paper
Add Code

Avatars Grow Legs: Generating Smooth Human Motion from Sparse Tracking Inputs with Diffusion Model

1 code implementation • CVPR 2023 • Yuming Du, Robin Kips, Albert Pumarola, Sebastian Starke, Ali Thabet, Artsiom Sanakoyeu

A particular challenge is that only a sparse tracking signal is available from standalone HMDs (Head Mounted Devices), often limited to tracking the user's head and wrists.

226

Paper
Code

VisCo Grids: Surface Reconstruction with Viscosity and Coarea Grids

no code implementations • 25 Mar 2023 • Albert Pumarola, Artsiom Sanakoyeu, Lior Yariv, Ali Thabet, Yaron Lipman

Surface reconstruction has been seeing a lot of progress lately by utilizing Implicit Neural Representations (INRs).

Inductive Bias Surface Reconstruction

Paper
Add Code

Re-ReND: Real-time Rendering of NeRFs across Devices

1 code implementation • ICCV 2023 • Sara Rojas, Jesus Zarzar, Juan Camilo Perez, Artsiom Sanakoyeu, Ali Thabet, Albert Pumarola, Bernard Ghanem

Re-ReND is designed to achieve real-time performance by converting the NeRF into a representation that can be efficiently processed by standard graphics pipelines.

Paper
Code

Towards Assessing and Characterizing the Semantic Robustness of Face Recognition

no code implementations • 10 Feb 2022 • Juan C. Pérez, Motasem Alfarra, Ali Thabet, Pablo Arbeláez, Bernard Ghanem

We propose a methodology for assessing and characterizing the robustness of FRMs against semantic perturbations to their input.

Face Recognition

Paper
Add Code

Snapshot HDR Video Construction Using Coded Mask

no code implementations • 5 Dec 2021 • Masheal Alghamdi, Qiang Fu, Ali Thabet, Wolfgang Heidrich

This paper study the reconstruction of High Dynamic Range (HDR) video from snapshot-coded LDR video.

Demosaicking Denoising +1

Paper
Add Code

ASSANet: An Anisotropic Separable Set Abstraction for Efficient Point Cloud Representation Learning

1 code implementation • NeurIPS 2021 • Guocheng Qian, Hasan Abed Al Kader Hammoud, Guohao Li, Ali Thabet, Bernard Ghanem

We then introduce a new Anisotropic Reduction function into our Separable SA module and propose an Anisotropic Separable SA (ASSA) module that substantially increases the network's accuracy.

Ranked #33 on 3D Part Segmentation on ShapeNet-Part

3D Part Segmentation 3D Point Cloud Classification +2

Paper
Code

MovieCuts: A New Dataset and Benchmark for Cut Type Recognition

1 code implementation • 12 Sep 2021 • Alejandro Pardo, Fabian Caba Heilbron, Juan León Alcázar, Ali Thabet, Bernard Ghanem

Advances in automatic Cut-type recognition can unleash new experiences in the video editing industry, such as movie analysis for education, video re-editing, virtual cinematography, machine-assisted trailer generation, machine-assisted video editing, among others.

Video Editing Vocal Bursts Type Prediction

Paper
Code

Learning to Cut by Watching Movies

1 code implementation • ICCV 2021 • Alejandro Pardo, Fabian Caba Heilbron, Juan León Alcázar, Ali Thabet, Bernard Ghanem

Video content creation keeps growing at an incredible pace; yet, creating engaging stories remains challenging and requires non-trivial video editing expertise.

Contrastive Learning Video Editing

Paper
Code

Enhancing Adversarial Robustness via Test-time Transformation Ensembling

1 code implementation • 29 Jul 2021 • Juan C. Pérez, Motasem Alfarra, Guillaume Jeanneret, Laura Rueda, Ali Thabet, Bernard Ghanem, Pablo Arbeláez

Deep learning models are prone to being fooled by imperceptible perturbations known as adversarial attacks.

Adversarial Robustness

Paper
Code

Combating Adversaries with Anti-Adversaries

1 code implementation • ICML Workshop AML 2021 • Motasem Alfarra, Juan C. Pérez, Ali Thabet, Adel Bibi, Philip H. S. Torr, Bernard Ghanem

Deep neural networks are vulnerable to small input perturbations known as adversarial attacks.

Paper
Code

MAAS: Multi-modal Assignation for Active Speaker Detection

1 code implementation • ICCV 2021 • Juan León-Alcázar, Fabian Caba Heilbron, Ali Thabet, Bernard Ghanem

Active speaker detection requires a solid integration of multi-modal cues.

Ranked #13 on Audio-Visual Active Speaker Detection on AVA-ActiveSpeaker

Audio-Visual Active Speaker Detection

Paper
Code

DeeperGCN: Training Deeper GCNs with Generalized Aggregation Functions

no code implementations • 1 Jan 2021 • Guohao Li, Chenxin Xiong, Ali Thabet, Bernard Ghanem

We add our generalized aggregation into a deep GCN framework and show it achieves state-of-the-art results in six benchmarks from OGB.

Point Cloud Classification Representation Learning

Paper
Add Code

SALA: Soft Assignment Local Aggregation for Parameter Efficient 3D Semantic Segmentation

no code implementations • 29 Dec 2020 • Hani Itani, Silvio Giancola, Ali Thabet, Bernard Ghanem

Since it is learnable, this mapping is allowed to be different per layer instead of being applied uniformly throughout the depth of the network.

3D Semantic Segmentation

Paper
Add Code

Video Self-Stitching Graph Network for Temporal Action Localization

1 code implementation • ICCV 2021 • Chen Zhao, Ali Thabet, Bernard Ghanem

We have two key components in VSGN: video self-stitching (VSS) and cross-scale graph pyramid network (xGPN).

Ranked #16 on Temporal Action Localization on ActivityNet-1.3

Temporal Action Localization

Paper
Code

LC-NAS: Latency Constrained Neural Architecture Search for Point Cloud Networks

no code implementations • 24 Aug 2020 • Guohao Li, Mengmeng Xu, Silvio Giancola, Ali Thabet, Bernard Ghanem

In this paper, we introduce a new NAS framework, dubbed LC-NAS, where we search for point cloud architectures that are constrained to a target latency.

Neural Architecture Search Point Cloud Classification +2

Paper
Add Code

Rethinking Clustering for Robustness

1 code implementation • 13 Jun 2020 • Motasem Alfarra, Juan C. Pérez, Adel Bibi, Ali Thabet, Pablo Arbeláez, Bernard Ghanem

This paper studies how encouraging semantically-aligned features during deep neural network training can increase network robustness.

Clustering

Paper
Code

DeeperGCN: All You Need to Train Deeper GCNs

3 code implementations • 13 Jun 2020 • Guohao Li, Chenxin Xiong, Ali Thabet, Bernard Ghanem

Graph Convolutional Networks (GCNs) have been drawing significant attention with the power of representation learning on graphs.

Ranked #1 on Node Property Prediction on ogbn-proteins

Graph Learning Graph Property Prediction +3

13,001

Paper
Code

Gabor Layers Enhance Network Robustness

1 code implementation • ECCV 2020 • Juan C. Pérez, Motasem Alfarra, Guillaume Jeanneret, Adel Bibi, Ali Thabet, Bernard Ghanem, Pablo Arbeláez

We revisit the benefits of merging classical vision concepts with deep learning models.

Paper
Code

AdvPC: Transferable Adversarial Perturbations on 3D Point Clouds

1 code implementation • ECCV 2020 • Abdullah Hamdi, Sara Rojas, Ali Thabet, Bernard Ghanem

Our proposed attack increases the attack success rate by up to 40% for those transferred to unseen networks (transferability), while maintaining a high success rate on the attacked network.

Adversarial Attack Classify 3D Point Clouds

Paper
Code

SGAS: Sequential Greedy Architecture Search

1 code implementation • CVPR 2020 • Guohao Li, Guocheng Qian, Itzel C. Delgadillo, Matthias Müller, Ali Thabet, Bernard Ghanem

Architecture design has become a crucial component of successful deep learning.

Ranked #4 on Node Classification on PPI

Classification General Classification +4

160

Paper
Code

PU-GCN: Point Cloud Upsampling using Graph Convolutional Networks

1 code implementation • CVPR 2021 • Guocheng Qian, Abdulellah Abualshour, Guohao Li, Ali Thabet, Bernard Ghanem

We combine Inception DenseGCN with NodeShuffle into a new point upsampling pipeline called PU-GCN.

3D Reconstruction Point Cloud Super Resolution +1

165

Paper
Code

G-TAD: Sub-Graph Localization for Temporal Action Detection

7 code implementations • CVPR 2020 • Mengmeng Xu, Chen Zhao, David S. Rojas, Ali Thabet, Bernard Ghanem

In this work, we propose a graph convolutional network (GCN) model to adaptively incorporate multi-level semantic context into video features and cast temporal action detection as a sub-graph localization problem.

Ranked #5 on Temporal Action Localization on EPIC-KITCHENS-100

Temporal Action Localization

216

Paper
Code

DeepGCNs: Making GCNs Go as Deep as CNNs

4 code implementations • 15 Oct 2019 • Guohao Li, Matthias Müller, Guocheng Qian, Itzel C. Delgadillo, Abdulellah Abualshour, Ali Thabet, Bernard Ghanem

This work transfers concepts such as residual/dense connections and dilated convolutions from CNNs to GCNs in order to successfully train very deep GCNs.

Ranked #5 on 3D Semantic Segmentation on PartNet

3D Point Cloud Classification 3D Semantic Segmentation +2

1,120

Paper
Code

BAOD: Budget-Aware Object Detection

no code implementations • 10 Apr 2019 • Alejandro Pardo, Mengmeng Xu, Ali Thabet, Pablo Arbelaez, Bernard Ghanem

We adopt a hybrid supervised learning framework to train the object detector from both these types of annotation.

Active Learning Object +2

Paper
Add Code

DeepGCNs: Can GCNs Go as Deep as CNNs?

1 code implementation • ICCV 2019 • Guohao Li, Matthias Müller, Ali Thabet, Bernard Ghanem

Finally, we use these new concepts to build a very deep 56-layer GCN, and show how it significantly boosts performance (+3. 7% mIoU over state-of-the-art) in the task of point cloud semantic segmentation.

3D Semantic Segmentation Graph Classification +1

627

Paper
Code

RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization

1 code implementation • 30 Mar 2019 • Alejandro Pardo, Humam Alwassel, Fabian Caba Heilbron, Ali Thabet, Bernard Ghanem

RefineLoc shows competitive results with the state-of-the-art in weakly-supervised temporal localization.

Temporal Localization Weakly Supervised Action Localization +2

Paper
Code

MortonNet: Self-Supervised Learning of Local Features in 3D Point Clouds

1 code implementation • 30 Mar 2019 • Ali Thabet, Humam Alwassel, Bernard Ghanem

In fact, we show how Morton features can be used to significantly improve performance (+3% for 2 popular semantic segmentation algorithms) in the task of semantic segmentation of point clouds on the challenging and large-scale S3DIS dataset.

Segmentation Self-Supervised Learning +1

Paper
Code

Robust Manhattan Frame Estimation From a Single RGB-D Image

no code implementations • CVPR 2015 • Bernard Ghanem, Ali Thabet, Juan Carlos Niebles, Fabian Caba Heilbron

This paper proposes a new framework for estimating the Manhattan Frame (MF) of an indoor scene from a single RGB-D image.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.