Search Results for author: Afshin Dehghan

Found 18 papers, 7 papers with code

Cubify Anything: Scaling Indoor 3D Object Detection

no code implementations5 Dec 2024 Justin Lazarow, David Griffiths, Gefen Kohavi, Francisco Crespo, Afshin Dehghan

Furthermore, by pre-training on CA-1M, CuTR can outperform point-based methods on a more diverse variant of SUN RGB-D - supporting the notion that while inductive biases in 3D are useful at the smaller sizes of existing datasets, they fail to scale to the data-rich regime of CA-1M.

3D Object Detection Object +1

4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities

1 code implementation13 Jun 2024 Roman Bachmann, Oğuzhan Fatih Kar, David Mizrahi, Ali Garjani, Mingfei Gao, David Griffiths, Jiaming Hu, Afshin Dehghan, Amir Zamir

In this paper, we expand upon the capabilities of them by training a single model on tens of highly diverse modalities and by performing co-training on large-scale multimodal datasets and text corpora.

Instance Segmentation multimodal generation +1

4M: Massively Multimodal Masked Modeling

1 code implementation NeurIPS 2023 David Mizrahi, Roman Bachmann, Oğuzhan Fatih Kar, Teresa Yeo, Mingfei Gao, Afshin Dehghan, Amir Zamir

Current machine learning models for vision are often highly specialized and limited to a single modality and task.

Decoder

GAUDI: A Neural Architect for Immersive 3D Scene Generation

1 code implementation27 Jul 2022 Miguel Angel Bautista, Pengsheng Guo, Samira Abnar, Walter Talbott, Alexander Toshev, Zhuoyuan Chen, Laurent Dinh, Shuangfei Zhai, Hanlin Goh, Daniel Ulbricht, Afshin Dehghan, Josh Susskind

We introduce GAUDI, a generative model capable of capturing the distribution of complex and realistic 3D scenes that can be rendered immersively from a moving camera.

Image Generation Scene Generation

ARKitScenes: A Diverse Real-World Dataset For 3D Indoor Scene Understanding Using Mobile RGB-D Data

1 code implementation17 Nov 2021 Gilad Baruch, Zhuoyuan Chen, Afshin Dehghan, Tal Dimry, Yuri Feigin, Peter Fu, Thomas Gebauer, Brandon Joffe, Daniel Kurz, Arik Schwartz, Elad Shulman

It is not only the first RGB-D dataset that is captured with a now widely available depth sensor, but to our best knowledge, it also is the largest indoor scene understanding data released.

3D Object Detection object-detection +1

DAGER: Deep Age, Gender and Emotion Recognition Using Convolutional Neural Network

2 code implementations14 Feb 2017 Afshin Dehghan, Enrique. G. Ortiz, Guang Shu, Syed Zain Masood

This paper describes the details of Sighthound's fully automated age, gender and emotion recognition system.

Emotion Recognition

View Independent Vehicle Make, Model and Color Recognition Using Convolutional Neural Network

1 code implementation6 Feb 2017 Afshin Dehghan, Syed Zain Masood, Guang Shu, Enrique. G. Ortiz

The backbone of our system is a deep convolutional neural network that is not only computationally inexpensive, but also provides state-of-the-art results on several competitive benchmarks.

Binary Quadratic Programing for Online Tracking of Hundreds of People in Extremely Crowded Scenes

no code implementations30 Mar 2016 Afshin Dehghan, Mubarak Shah

In this paper, we propose a tracker that addresses the aforementioned problems and is capable of tracking hundreds of people efficiently.

Multi-Object Tracking

Deep Tracking: Visual Tracking Using Deep Convolutional Networks

no code implementations13 Dec 2015 Meera Hahn, Si Chen, Afshin Dehghan

In this paper, we study a discriminatively trained deep convolutional network for the task of visual tracking.

Visual Tracking

Target Identity-Aware Network Flow for Online Multiple Target Tracking

no code implementations CVPR 2015 Afshin Dehghan, Yicong Tian, Philip H. S. Torr, Mubarak Shah

In this paper we show that multiple object tracking (MOT) can be formulated in a framework, where the detection and data-association are performed simultaneously.

Multiple Object Tracking object-detection +1

Improving Semantic Concept Detection through the Dictionary of Visually-distinct Elements

no code implementations CVPR 2014 Afshin Dehghan, Haroon Idrees, Mubarak Shah

A video captures a sequence and interactions of concepts that can be static, for instance, objects or scenes, or dynamic, such as actions.

Improving an Object Detector and Extracting Regions Using Superpixels

no code implementations CVPR 2013 Guang Shu, Afshin Dehghan, Mubarak Shah

In general, our method takes detection bounding boxes of a generic detector as input and generates the detection output with higher average precision and precise object regions.

Object Superpixels

Cannot find the paper you are looking for? You can Submit a new open access paper.