Search Results for author: Ali Mahdavi-Amiri

Found 23 papers, 7 papers with code

Survey on Modeling of Articulated Objects

no code implementations22 Mar 2024 Jiayi Liu, Manolis Savva, Ali Mahdavi-Amiri

3D modeling of articulated objects is a research problem within computer vision, graphics, and robotics.

Object

AFreeCA: Annotation-Free Counting for All

no code implementations7 Mar 2024 Adriano D'Alessandro, Ali Mahdavi-Amiri, Ghassan Hamarneh

Consequently, we can generate counting data for any type of object and count them in an unsupervised manner.

Object Object Counting

AnaMoDiff: 2D Analogical Motion Diffusion via Disentangled Denoising

no code implementations5 Feb 2024 Maham Tanveer, Yizhi Wang, Ruiqi Wang, Nanxuan Zhao, Ali Mahdavi-Amiri, Hao Zhang

We present AnaMoDiff, a novel diffusion-based method for 2D motion analogies that is applied to raw, unannotated videos of articulated characters.

Denoising Optical Flow Estimation

CAGE: Controllable Articulation GEneration

no code implementations15 Dec 2023 Jiayi Liu, Hou In Ivan Tam, Ali Mahdavi-Amiri, Manolis Savva

We address the challenge of generating 3D articulated objects in a controllable fashion.

Denoising Object

Slice3D: Multi-Slice, Occlusion-Revealing, Single View 3D Reconstruction

no code implementations3 Dec 2023 Yizhi Wang, Wallace Lira, Wenqi Wang, Ali Mahdavi-Amiri, Hao Zhang

Our key observation is that object slicing is more advantageous than altering views to reveal occluded structures.

3D Reconstruction Denoising +1

CLiC: Concept Learning in Context

no code implementations28 Nov 2023 Mehdi Safaee, Aryan Mikaeili, Or Patashnik, Daniel Cohen-Or, Ali Mahdavi-Amiri

This paper addresses the challenge of learning a local visual pattern of an object from one image, and generating images depicting objects with that pattern.

Object

SYRAC: Synthesize, Rank, and Count

1 code implementation2 Oct 2023 Adriano D'Alessandro, Ali Mahdavi-Amiri, Ghassan Hamarneh

To address this, we use latent diffusion models to create two types of synthetic data: one by removing pedestrians from real images, which generates ranked image pairs with a weak but reliable object quantity signal, and the other by generating synthetic images with a predetermined number of objects, offering a strong but noisy counting signal.

Crowd Counting

TExplain: Explaining Learned Visual Features via Pre-trained (Frozen) Language Models

no code implementations1 Sep 2023 Saeid Asgari Taghanaki, Aliasghar Khani, Amir Khasahmadi, Aditya Sanghi, Karl D. D. Willis, Ali Mahdavi-Amiri

These sentences are then used to extract the most frequent words, providing a comprehensive understanding of the learned features and patterns within the classifier.

Decision Making

PARIS: Part-level Reconstruction and Motion Analysis for Articulated Objects

1 code implementation ICCV 2023 Jiayi Liu, Ali Mahdavi-Amiri, Manolis Savva

Our approach improves reconstruction relative to state-of-the-art baselines with a Chamfer-L1 distance reduction of 3. 94 (45. 2%) for objects and 26. 79 (84. 5%) for parts, and achieves 5% error rate for motion estimation across 10 object categories.

Motion Estimation Object

BRICS: Bi-level feature Representation of Image CollectionS

no code implementations29 May 2023 Dingdong Yang, Yizhi Wang, Ali Mahdavi-Amiri, Hao Zhang

Our key codes and feature grids are jointly trained continuously with well-defined gradient flows, leading to high usage rates of the feature grids and improved generative modeling compared to discrete Vector Quantization (VQ).

Image Generation Quantization

SKED: Sketch-guided Text-based 3D Editing

no code implementations ICCV 2023 Aryan Mikaeili, Or Perel, Mehdi Safaee, Daniel Cohen-Or, Ali Mahdavi-Amiri

To ensure the generated output adheres to the provided sketches, we propose novel loss functions to generate the desired edits while preserving the density and radiance of the base instance.

Text to 3D

DS-Fusion: Artistic Typography via Discriminated and Stylized Diffusion

1 code implementation ICCV 2023 Maham Tanveer, Yizhi Wang, Ali Mahdavi-Amiri, Hao Zhang

We introduce a novel method to automatically generate an artistic typography by stylizing one or more letter fonts to visually convey the semantics of an input word, while ensuring that the output remains readable.

Denoising

MaskTune: Mitigating Spurious Correlations by Forcing to Explore

1 code implementation30 Sep 2022 Saeid Asgari Taghanaki, Aliasghar Khani, Fereshte Khani, Ali Gholami, Linh Tran, Ali Mahdavi-Amiri, Ghassan Hamarneh

A fundamental challenge of over-parameterized deep learning models is learning meaningful data representations that yield good performance on a downstream task without over-fitting spurious input features.

SAC-GAN: Structure-Aware Image Composition

1 code implementation13 Dec 2021 Hang Zhou, Rui Ma, Ling-Xiao Zhang, Lin Gao, Ali Mahdavi-Amiri, Hao Zhang

Specifically, our network takes the semantic layout features from the input scene image, features encoded from the edges and silhouette in the input object patch, as well as a latent code as inputs, and generates a 2D spatial affine transform defining the translation and scaling of the object patch.

Image Augmentation Object

UNIST: Unpaired Neural Implicit Shape Translation Network

no code implementations CVPR 2022 Qimin Chen, Johannes Merz, Aditya Sanghi, Hooman Shayani, Ali Mahdavi-Amiri, Hao Zhang

We introduce UNIST, the first deep neural implicit model for general-purpose, unpaired shape-to-shape translation, in both 2D and 3D domains.

Position Style Transfer +1

Multimodal Shape Completion via IMLE

no code implementations30 Jun 2021 Himanshu Arora, Saurabh Mishra, Shichong Peng, Ke Li, Ali Mahdavi-Amiri

Shape completion is the problem of completing partial input shapes such as partial scans.

CAPRI-Net: Learning Compact CAD Shapes with Adaptive Primitive Assembly

no code implementations CVPR 2022 Fenggen Yu, Zhiqin Chen, Manyi Li, Aditya Sanghi, Hooman Shayani, Ali Mahdavi-Amiri, Hao Zhang

We introduce CAPRI-Net, a neural network for learning compact and interpretable implicit representations of 3D computer-aided design (CAD) models, in the form of adaptive primitive assemblies.

CAD Reconstruction

Data to Physicalization: A Survey of the Physical Rendering Process

no code implementations22 Feb 2021 Hessam Djavaherpour, Faramarz Samavati, Ali Mahdavi-Amiri, Fatemeh Yazdanbakhsh, Samuel Huron, Richard Levy, Yvonne Jansen, Lora Oehlberg

Physical representations of data offer physical and spatial ways of looking at, navigating, and interacting with data.

Graphics I.3.5; I.3.8

Semi-Supervised Co-Analysis of 3D Shape Styles from Projected Lines

no code implementations18 Apr 2018 Fenggen Yu, Yan Zhang, Kai Xu, Ali Mahdavi-Amiri, Hao Zhang

We present a semi-supervised co-analysis method for learning 3D shape styles from projected feature lines, achieving style patch localization with only weak supervision.

Clustering

Cannot find the paper you are looking for? You can Submit a new open access paper.