Search Results for author: Mubbasir Kapadia

Found 31 papers, 14 papers with code

Laying the Foundations of Deep Long-Term Crowd Flow Prediction

1 code implementation ECCV 2020 Samuel S. Sohn, Honglu Zhou, Seonghyeon Moon, Sejong Yoon, Vladimir Pavlovic, Mubbasir Kapadia

Predicting the crowd behavior in complex environments is a key requirement for crowd and disaster management, architectural design, and urban planning.

Management

On the Equivalency, Substitutability, and Flexibility of Synthetic Data

no code implementations24 Mar 2024 Che-Jui Chang, Danrui Li, Seonghyeon Moon, Mubbasir Kapadia

In addition, our study of the impact of synthetic data distributions on downstream performance reveals the importance of flexible data generators in narrowing domain gaps for improved model adaptability.

The Importance of Multimodal Emotion Conditioning and Affect Consistency for Embodied Conversational Agents

no code implementations26 Sep 2023 Che-Jui Chang, Samuel S. Sohn, Sen Zhang, Rajath Jayashankar, Muhammad Usman, Mubbasir Kapadia

We have conducted a user study with 199 participants to assess how the average person judges the affects perceived from multimodal behaviors that are consistent and inconsistent with respect to a driving affect.

Procedure-Aware Pretraining for Instructional Video Understanding

1 code implementation CVPR 2023 Honglu Zhou, Roberto Martín-Martín, Mubbasir Kapadia, Silvio Savarese, Juan Carlos Niebles

This graph can then be used to generate pseudo labels to train a video representation that encodes the procedural knowledge in a more accessible form to generalize to multiple procedure understanding tasks.

Video Understanding

An Information-Theoretic Approach for Estimating Scenario Generalization in Crowd Motion Prediction

no code implementations2 Nov 2022 Gang Qiao, Kaidong Hu, Seonghyeon Moon, Samuel S. Sohn, Sejong Yoon, Mubbasir Kapadia, Vladimir Pavlovic

Learning-based approaches to modeling crowd motion have become increasingly successful but require training and evaluation on large datasets, coupled with complex model selection and parameter tuning.

Model Selection motion prediction

Optimizing Indoor Navigation Policies For Spatial Distancing

no code implementations4 Jun 2022 Xun Zhang, Mathew Schwartz, Muhammad Usman, Petros Faloutsos, Mubbasir Kapadia

In this paper, we focus on the modification of policies that can lead to movement patterns and directional guidance of occupants, which are represented as agents in a 3D simulation engine.

HM: Hybrid Masking for Few-Shot Segmentation

1 code implementation24 Mar 2022 Seonghyeon Moon, Samuel S. Sohn, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Muhammad Haris Khan, Mubbasir Kapadia

A fundamental limitation of FM is the inability to preserve the fine-grained spatial details that affect the accuracy of segmentation mask, especially for small target objects.

Few-Shot Semantic Segmentation Segmentation +1

MUSE-VAE: Multi-Scale VAE for Environment-Aware Long Term Trajectory Prediction

no code implementations CVPR 2022 Mihee Lee, Samuel S. Sohn, Seonghyeon Moon, Sejong Yoon, Mubbasir Kapadia, Vladimir Pavlovic

Accurate long-term trajectory prediction in complex scenes, where multiple agents (e. g., pedestrians or vehicles) interact with each other and the environment while attempting to accomplish diverse and often unknown goals, is a challenging stochastic forecasting problem.

Trajectory Prediction

Cross-Modal Coherence for Text-to-Image Retrieval

1 code implementation22 Sep 2021 Malihe Alikhani, Fangda Han, Hareesh Ravi, Mubbasir Kapadia, Vladimir Pavlovic, Matthew Stone

Common image-text joint understanding techniques presume that images and the associated text can universally be characterized by a single implicit model.

Image Retrieval Retrieval

AESOP: Abstract Encoding of Stories, Objects, and Pictures

2 code implementations ICCV 2021 Hareesh Ravi, Kushal Kafle, Scott Cohen, Jonathan Brandt, Mubbasir Kapadia

Visual storytelling and story comprehension are uniquely human skills that play a central role in how we learn about and experience the world.

Story Completion Visual Storytelling

Graph-Based Generative Representation Learning of Semantically and Behaviorally Augmented Floorplans

no code implementations8 Dec 2020 Vahid Azizi, Muhammad Usman, Honglu Zhou, Petros Faloutsos, Mubbasir Kapadia

We present a floorplan embedding technique that uses an attributed graph to represent the geometric information as well as design semantics and behavioral features of the inhabitants as node and edge attributes.

Representation Learning

GitEvolve: Predicting the Evolution of GitHub Repositories

1 code implementation9 Oct 2020 Honglu Zhou, Hareesh Ravi, Carlos M. Muniz, Vahid Azizi, Linda Ness, Gerard de Melo, Mubbasir Kapadia

Given its crucial role, there is a need to better understand and model the dynamics of GitHub as a social platform.

Representation Learning

HID: Hierarchical Multiscale Representation Learning for Information Diffusion

2 code implementations19 Apr 2020 Honglu Zhou, Shuyuan Xu, Zuohui Fu, Gerard de Melo, Yongfeng Zhang, Mubbasir Kapadia

In this paper, we present a Hierarchical Information Diffusion (HID) framework by integrating user representation learning and multiscale modeling.

Representation Learning

Knowledge as Priors: Cross-Modal Knowledge Generalization for Datasets without Superior Knowledge

no code implementations CVPR 2020 Long Zhao, Xi Peng, Yuxiao Chen, Mubbasir Kapadia, Dimitris N. Metaxas

Our key idea is to generalize the distilled cross-modal knowledge learned from a Source dataset, which contains paired examples from both modalities, to the Target dataset by modeling knowledge as priors on parameters of the Student.

3D Hand Pose Estimation Knowledge Distillation

Deep Crowd-Flow Prediction in Built Environments

no code implementations13 Oct 2019 Samuel S. Sohn, Seonghyeon Moon, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Mubbasir Kapadia

In this paper, we propose an approach to instantly predict the long-term flow of crowds in arbitrarily large, realistic environments.

Management

Cognitive Agent Based Simulation Model For Improving Disaster Response Procedures

no code implementations2 Oct 2019 Rohit K. Dubey, Samuel S. Sohn, Christoph Hoelscher, Mubbasir Kapadia

In this paper, we propose an agent-based simulation tool, which is grounded in human cognition and decision-making, for evaluating and improving the effectiveness of building evacuation procedures and guidance systems during a disaster.

Decision Making Decision Making Under Uncertainty +1

Domain Authoring Assistant for Intelligent Virtual Agents

no code implementations5 Apr 2019 Sepehr Janghorbani, Ashutosh Modi, Jakob Buhmann, Mubbasir Kapadia

The process of creating such characters often involves a team of creative authors who describe different aspects of the characters in natural language, and planning experts that translate this description into a planning domain.

Affect-Driven Dialog Generation

no code implementations NAACL 2019 Pierre Colombo, Wojciech Witon, Ashutosh Modi, James Kennedy, Mubbasir Kapadia

The majority of current systems for end-to-end dialog generation focus on response quality without an explicit control over the affective content of the responses.

Topic Spotting using Hierarchical Networks with Self Attention

no code implementations NAACL 2019 Pooja Chitkara, Ashutosh Modi, Pravalika Avvaru, Sepehr Janghorbani, Mubbasir Kapadia

Additionally, in contrast to offline processing of dialog, we also analyze the performance of our model in a more realistic setting i. e. in an online setting where the topic is identified in real time as the dialog progresses.

text-classification Text Classification

Learning to Forecast and Refine Residual Motion for Image-to-Video Generation

1 code implementation ECCV 2018 Long Zhao, Xi Peng, Yu Tian, Mubbasir Kapadia, Dimitris Metaxas

We consider the problem of image-to-video translation, where an input image is translated into an output video containing motions of a single object.

Human Pose Forecasting Image to Video Generation +1

Cartoonish sketch-based face editing in videos using identity deformation transfer

no code implementations25 Mar 2017 Long Zhao, Fangda Han, Xi Peng, Xun Zhang, Mubbasir Kapadia, Vladimir Pavlovic, Dimitris N. Metaxas

We first recover the facial identity and expressions from the video by fitting a face morphable model for each frame.

Face Model

Cannot find the paper you are looking for? You can Submit a new open access paper.