1 code implementation • ECCV 2020 • Samuel S. Sohn, Honglu Zhou, Seonghyeon Moon, Sejong Yoon, Vladimir Pavlovic, Mubbasir Kapadia
Predicting the crowd behavior in complex environments is a key requirement for crowd and disaster management, architectural design, and urban planning.
no code implementations • 18 Mar 2025 • Jinseok Bae, Inwoo Hwang, Young Yoon Lee, Ziyu Guo, Joseph Liu, Yizhak Ben-Shabat, Young Min Kim, Mubbasir Kapadia
This severely limits the performance of generative motion models for downstream tasks.
no code implementations • 10 Feb 2025 • Danrui Li, Sen Zhang, Sam S. Sohn, Kaidong Hu, Muhammad Usman, Mubbasir Kapadia
The prototyping of computer games, particularly card games, requires extensive human effort in creative ideation and gameplay evaluation.
no code implementations • 4 Feb 2025 • Che-Jui Chang, Qingze Tony Liu, Honglu Zhou, Vladimir Pavlovic, Mubbasir Kapadia
Recent advances in generative modeling and tokenization have driven significant progress in text-to-motion generation, leading to enhanced quality and realism in generated motions.
1 code implementation • 14 Oct 2024 • Qingze, Liu, Danrui Li, Samuel S. Sohn, Sejong Yoon, Mubbasir Kapadia, Vladimir Pavlovic
Accurate prediction of human or vehicle trajectories with good diversity that captures their stochastic nature is an essential task for many applications.
no code implementations • 15 Jun 2024 • Samuel S. Sohn, Danrui Li, Sen Zhang, Che-Jui Chang, Mubbasir Kapadia
Digital storytelling, essential in entertainment, education, and marketing, faces challenges in production scalability and flexibility.
no code implementations • 14 Jun 2024 • Alexander Gao, Maurice Chu, Mubbasir Kapadia, Ming C. Lin, Hsueh-Ti Derek Liu
Vector fields are widely used to represent and model flows for many science and engineering applications.
no code implementations • 24 Mar 2024 • Che-Jui Chang, Danrui Li, Seonghyeon Moon, Mubbasir Kapadia
In addition, our study of the impact of synthetic data distributions on downstream performance reveals the importance of flexible data generators in narrowing domain gaps for improved model adaptability.
no code implementations • CVPR 2024 • Che-Jui Chang, Danrui Li, Deep Patel, Parth Goel, Honglu Zhou, Seonghyeon Moon, Samuel S. Sohn, Sejong Yoon, Vladimir Pavlovic, Mubbasir Kapadia
The study of complex human interactions and group activities has become a focal point in human-centric computer vision.
no code implementations • 26 Sep 2023 • Che-Jui Chang, Samuel S. Sohn, Sen Zhang, Rajath Jayashankar, Muhammad Usman, Mubbasir Kapadia
We have conducted a user study with 199 participants to assess how the average person judges the affects perceived from multimodal behaviors that are consistent and inconsistent with respect to a driving affect.
no code implementations • 29 Jun 2023 • Che-Jui Chang, Danrui Li, Deep Patel, Parth Goel, Honglu Zhou, Seonghyeon Moon, Samuel S. Sohn, Sejong Yoon, Vladimir Pavlovic, Mubbasir Kapadia
The study of complex human interactions and group activities has become a focal point in human-centric computer vision.
1 code implementation • CVPR 2023 • Honglu Zhou, Roberto Martín-Martín, Mubbasir Kapadia, Silvio Savarese, Juan Carlos Niebles
This graph can then be used to generate pseudo labels to train a video representation that encodes the procedural knowledge in a more accessible form to generalize to multiple procedure understanding tasks.
1 code implementation • ICCV 2023 • Seonghyeon Moon, Samuel S. Sohn, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Muhammad Haris Khan, Mubbasir Kapadia
To extract information relevant to the target class, a dominant approach in best-performing FSS methods removes background features using a support mask.
Ranked #3 on
Few-Shot Semantic Segmentation
on FSS-1000 (1-shot)
no code implementations • 2 Nov 2022 • Gang Qiao, Kaidong Hu, Seonghyeon Moon, Samuel S. Sohn, Sejong Yoon, Mubbasir Kapadia, Vladimir Pavlovic
Learning-based approaches to modeling crowd motion have become increasingly successful but require training and evaluation on large datasets, coupled with complex model selection and parameter tuning.
no code implementations • 4 Jun 2022 • Xun Zhang, Mathew Schwartz, Muhammad Usman, Petros Faloutsos, Mubbasir Kapadia
In this paper, we focus on the modification of policies that can lead to movement patterns and directional guidance of occupants, which are represented as agents in a 3D simulation engine.
1 code implementation • 24 Mar 2022 • Seonghyeon Moon, Samuel S. Sohn, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Muhammad Haris Khan, Mubbasir Kapadia
A fundamental limitation of FM is the inability to preserve the fine-grained spatial details that affect the accuracy of segmentation mask, especially for small target objects.
Ranked #5 on
Few-Shot Semantic Segmentation
on FSS-1000 (1-shot)
1 code implementation • CVPR 2022 • Mihee Lee, Samuel S. Sohn, Seonghyeon Moon, Sejong Yoon, Mubbasir Kapadia, Vladimir Pavlovic
Accurate long-term trajectory prediction in complex scenes, where multiple agents (e. g., pedestrians or vehicles) interact with each other and the environment while attempting to accomplish diverse and often unknown goals, is a challenging stochastic forecasting problem.
1 code implementation • 22 Dec 2021 • Honglu Zhou, Advith Chegu, Samuel S. Sohn, Zuohui Fu, Gerard de Melo, Mubbasir Kapadia
Digraph Representation Learning (DRL) aims to learn representations for directed homogeneous graphs (digraphs).
1 code implementation • 11 Dec 2021 • Honglu Zhou, Asim Kadav, Aviv Shamsian, Shijie Geng, Farley Lai, Long Zhao, Ting Liu, Mubbasir Kapadia, Hans Peter Graf
Group Activity Recognition detects the activity collectively performed by a group of actors, which requires compositional reasoning of actors and objects.
Ranked #2 on
Group Activity Recognition
on Collective Activity
1 code implementation • 22 Sep 2021 • Malihe Alikhani, Fangda Han, Hareesh Ravi, Mubbasir Kapadia, Vladimir Pavlovic, Matthew Stone
Common image-text joint understanding techniques presume that images and the associated text can universally be characterized by a single implicit model.
1 code implementation • ICLR 2021 • Honglu Zhou, Asim Kadav, Farley Lai, Alexandru Niculescu-Mizil, Martin Renqiang Min, Mubbasir Kapadia, Hans Peter Graf
We evaluate over CATER dataset and find that Hopper achieves 73. 2% Top-1 accuracy using just 1 FPS by hopping through just a few critical frames.
Ranked #5 on
Video Object Tracking
on CATER
2 code implementations • ICCV 2021 • Hareesh Ravi, Kushal Kafle, Scott Cohen, Jonathan Brandt, Mubbasir Kapadia
Visual storytelling and story comprehension are uniquely human skills that play a central role in how we learn about and experience the world.
no code implementations • 8 Dec 2020 • Vahid Azizi, Muhammad Usman, Honglu Zhou, Petros Faloutsos, Mubbasir Kapadia
We present a floorplan embedding technique that uses an attributed graph to represent the geometric information as well as design semantics and behavioral features of the inhabitants as node and edge attributes.
1 code implementation • 9 Oct 2020 • Honglu Zhou, Hareesh Ravi, Carlos M. Muniz, Vahid Azizi, Linda Ness, Gerard de Melo, Mubbasir Kapadia
Given its crucial role, there is a need to better understand and model the dynamics of GitHub as a social platform.
2 code implementations • 19 Apr 2020 • Honglu Zhou, Shuyuan Xu, Zuohui Fu, Gerard de Melo, Yongfeng Zhang, Mubbasir Kapadia
In this paper, we present a Hierarchical Information Diffusion (HID) framework by integrating user representation learning and multiscale modeling.
no code implementations • CVPR 2020 • Long Zhao, Xi Peng, Yuxiao Chen, Mubbasir Kapadia, Dimitris N. Metaxas
Our key idea is to generalize the distilled cross-modal knowledge learned from a Source dataset, which contains paired examples from both modalities, to the Target dataset by modeling knowledge as priors on parameters of the Student.
no code implementations • 13 Oct 2019 • Samuel S. Sohn, Seonghyeon Moon, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Mubbasir Kapadia
In this paper, we propose an approach to instantly predict the long-term flow of crowds in arbitrarily large, realistic environments.
no code implementations • 2 Oct 2019 • Rohit K. Dubey, Samuel S. Sohn, Christoph Hoelscher, Mubbasir Kapadia
In this paper, we propose an agent-based simulation tool, which is grounded in human cognition and decision-making, for evaluating and improving the effectiveness of building evacuation procedures and guidance systems during a disaster.
no code implementations • 25 Sep 2019 • Glen Berseth, Brandon haworth, Seonghyeon Moon, Mubbasir Kapadia, Petros Faloutsos
Multi-agent reinforcement learning is a particularly challenging problem.
no code implementations • SEMEVAL 2019 • Yeyao Zhang, Eleftheria Tsipidi, Sasha Schriber, Mubbasir Kapadia, Markus Gross, Ashutosh Modi
However, translating natural language text into animation is a challenging task.
5 code implementations • CVPR 2019 • Long Zhao, Xi Peng, Yu Tian, Mubbasir Kapadia, Dimitris N. Metaxas
In this paper, we study the problem of learning Graph Convolutional Networks (GCNs) for regression.
Ranked #27 on
Monocular 3D Human Pose Estimation
on Human3.6M
no code implementations • 5 Apr 2019 • Sepehr Janghorbani, Ashutosh Modi, Jakob Buhmann, Mubbasir Kapadia
The process of creating such characters often involves a team of creative authors who describe different aspects of the characters in natural language, and planning experts that translate this description into a planning domain.
no code implementations • NAACL 2019 • Pierre Colombo, Wojciech Witon, Ashutosh Modi, James Kennedy, Mubbasir Kapadia
The majority of current systems for end-to-end dialog generation focus on response quality without an explicit control over the affective content of the responses.
no code implementations • NAACL 2019 • Pooja Chitkara, Ashutosh Modi, Pravalika Avvaru, Sepehr Janghorbani, Mubbasir Kapadia
Additionally, in contrast to offline processing of dialog, we also analyze the performance of our model in a more realistic setting i. e. in an online setting where the topic is identified in real time as the dialog progresses.
no code implementations • WS 2018 • Wojciech Witon, Pierre Colombo, Ashutosh Modi, Mubbasir Kapadia
This paper describes our participating system in the WASSA 2018 shared task on emotion prediction.
1 code implementation • ECCV 2018 • Long Zhao, Xi Peng, Yu Tian, Mubbasir Kapadia, Dimitris Metaxas
We consider the problem of image-to-video translation, where an input image is translated into an output video containing motions of a single object.
1 code implementation • CVPR 2018 • Hareesh Ravi, Lezi Wang, Carlos Muniz, Leonid Sigal, Dimitris Metaxas, Mubbasir Kapadia
We propose an end-to-end network for the visual illustration of a sequence of sentences forming a story.
no code implementations • 25 Mar 2017 • Long Zhao, Fangda Han, Xi Peng, Xun Zhang, Mubbasir Kapadia, Vladimir Pavlovic, Dimitris N. Metaxas
We first recover the facial identity and expressions from the video by fitting a face morphable model for each frame.