1 code implementation • ECCV 2020 • Samuel S. Sohn, Honglu Zhou, Seonghyeon Moon, Sejong Yoon, Vladimir Pavlovic, Mubbasir Kapadia
Predicting the crowd behavior in complex environments is a key requirement for crowd and disaster management, architectural design, and urban planning.
1 code implementation • 16 Jul 2024 • Kalliopi Basioti, Mohamed A. Abdelsalam, Federico Fancellu, Vladimir Pavlovic, Afsaneh Fazly
We use this Structured Semantic Augmentation (SSA) framework to augment existing image-caption datasets with the grounded controlled captions, increasing their spatial and semantic diversity and focal coverage.
no code implementations • CVPR 2024 • Che-Jui Chang, Danrui Li, Deep Patel, Parth Goel, Honglu Zhou, Seonghyeon Moon, Samuel S. Sohn, Sejong Yoon, Vladimir Pavlovic, Mubbasir Kapadia
The study of complex human interactions and group activities has become a focal point in human-centric computer vision.
1 code implementation • ICCV 2023 • Yuting Wang, Velibor Ilic, Jiatong Li, Branislav Kisacanin, Vladimir Pavlovic
In this work, we propose ALWOD, a new framework that addresses this problem by fusing active learning (AL) with weakly and semi-supervised object detection paradigms.
1 code implementation • 5 Aug 2023 • JianFeng Wang, Daniela Massiceti, Xiaolin Hu, Vladimir Pavlovic, Thomas Lukasiewicz
This is useful in a wide range of real-world applications where collecting pixel-wise labels is not feasible in time or cost.
no code implementations • 29 Jun 2023 • Che-Jui Chang, Danrui Li, Deep Patel, Parth Goel, Honglu Zhou, Seonghyeon Moon, Samuel S. Sohn, Sejong Yoon, Vladimir Pavlovic, Mubbasir Kapadia
The study of complex human interactions and group activities has become a focal point in human-centric computer vision.
1 code implementation • ICCV 2023 • Seonghyeon Moon, Samuel S. Sohn, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Muhammad Haris Khan, Mubbasir Kapadia
To extract information relevant to the target class, a dominant approach in best-performing FSS methods removes background features using a support mask.
no code implementations • 2 Dec 2022 • Yuting Wang, Ricardo Guerrero, Vladimir Pavlovic
In its warm-up domain adaptation stage, the model learns a fully-supervised object detector (FSOD) to improve the precision of the object proposals in the target domain, and at the same time learns target-domain-specific and detection-aware proposal features.
no code implementations • 2 Nov 2022 • Gang Qiao, Kaidong Hu, Seonghyeon Moon, Samuel S. Sohn, Sejong Yoon, Mubbasir Kapadia, Vladimir Pavlovic
Learning-based approaches to modeling crowd motion have become increasingly successful but require training and evaluation on large datasets, coupled with complex model selection and parameter tuning.
no code implementations • 26 Oct 2022 • Mohamed Ashraf Abdelsalam, Zhan Shi, Federico Fancellu, Kalliopi Basioti, Dhaivat J. Bhatt, Vladimir Pavlovic, Afsaneh Fazly
The success of scene graphs for visual scene understanding has brought attention to the benefits of abstracting a visual input (e. g., image) into a structured representation, where entities (people and objects) are nodes connected by edges specifying their relations.
1 code implementation • 3 Jul 2022 • JianFeng Wang, Thomas Lukasiewicz, Daniela Massiceti, Xiaolin Hu, Vladimir Pavlovic, Alexandros Neophytou
Semi-supervised learning (SSL) has been widely explored in recent years, and it is an effective way of leveraging unlabeled data to reduce the reliance on labeled data.
no code implementations • 18 Jun 2022 • Pritish Sahu, Kalliopi Basioti, Vladimir Pavlovic
We present a novel computational model, "SAViR-T", for the family of visual reasoning problems embodied in the Raven's Progressive Matrices (RPM).
1 code implementation • 24 Mar 2022 • Seonghyeon Moon, Samuel S. Sohn, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Muhammad Haris Khan, Mubbasir Kapadia
A fundamental limitation of FM is the inability to preserve the fine-grained spatial details that affect the accuracy of segmentation mask, especially for small target objects.
Ranked #5 on Few-Shot Semantic Segmentation on FSS-1000 (1-shot)
no code implementations • CVPR 2022 • Mihee Lee, Samuel S. Sohn, Seonghyeon Moon, Sejong Yoon, Mubbasir Kapadia, Vladimir Pavlovic
Accurate long-term trajectory prediction in complex scenes, where multiple agents (e. g., pedestrians or vehicles) interact with each other and the environment while attempting to accomplish diverse and often unknown goals, is a challenging stochastic forecasting problem.
1 code implementation • 22 Oct 2021 • Fangda Han, Guoyao Hao, Ricardo Guerrero, Vladimir Pavlovic
To synthesize a pizza image with view attributesoutside the range of natural training images, we design a CGI pizza dataset PizzaView using 3D pizza models and employ it to train a view attribute regressor to regularize the generation process, bridging the real and CGI training datasets.
no code implementations • 27 Sep 2021 • Pritish Sahu, Kalliopi Basioti, Vladimir Pavlovic
Computational learning approaches to solving visual reasoning tests, such as Raven's Progressive Matrices (RPM), critically depend on the ability to identify the visual concepts used in the test (i. e., the representation) as well as the latent rules based on those concepts (i. e., the reasoning).
1 code implementation • 22 Sep 2021 • Malihe Alikhani, Fangda Han, Hareesh Ravi, Mubbasir Kapadia, Vladimir Pavlovic, Matthew Stone
Common image-text joint understanding techniques presume that images and the associated text can universally be characterized by a single implicit model.
no code implementations • 5 Feb 2021 • Minyoung Kim, Vladimir Pavlovic
In this paper, we address the problem in a completely different way by considering a random inference model, where we model the mean and variance functions of the variational posterior as random Gaussian processes (GP).
1 code implementation • 4 Feb 2021 • Hai X. Pham, Ricardo Guerrero, Jiatong Li, Vladimir Pavlovic
Despite the abundance of multi-modal data, such as image-text pairs, there has been little effort in understanding the individual entities and their different roles in the construction of these data instances.
no code implementations • 23 Dec 2020 • Mihee Lee, Vladimir Pavlovic
Multi-modal generative models represent an important family of deep models, whose goal is to facilitate representation learning on data with multiple views or modalities.
1 code implementation • 4 Dec 2020 • Fangda Han, Guoyao Hao, Ricardo Guerrero, Vladimir Pavlovic
Because of the complex nature of the multilabel image generation problem, we also regularize synthetic image by predicting the corresponding ingredients as well as encourage the discriminator to distinguish between matched image and mismatched image.
1 code implementation • 2 Dec 2020 • Ricardo Guerrero, Hai Xuan Pham, Vladimir Pavlovic
A key to making CFA possible is multi-modal shared representation learning, which aims to create a joint representation of the multiple views (text and image) of the data.
Ranked #5 on Cross-Modal Retrieval on Recipe1M
no code implementations • 1 Dec 2020 • Minyoung Kim, Ricardo Guerrero, Vladimir Pavlovic
We deal with the problem of learning the underlying disentangled latent factors that are shared between the paired bi-modal data in cross-modal retrieval.
no code implementations • NeurIPS 2020 • Minyoung Kim, Vladimir Pavlovic
Using the functional gradient approach, we devise an intuitive learning criteria for selecting a new mixture component: the new component has to improve the data likelihood (lower bound) and, at the same time, be as divergent from the current mixture distribution as possible, thus increasing representational diversity.
no code implementations • 17 Oct 2020 • Jiatong Li, Fangda Han, Ricardo Guerrero, Vladimir Pavlovic
Increased awareness of the impact of food consumption on health and lifestyle today has given rise to novel data-driven food analysis systems.
no code implementations • 7 Sep 2020 • Minyoung Kim, Vladimir Pavlovic
In deep representational learning, it is often desired to isolate a particular factor (termed {\em content}) from other factors (referred to as {\em style}).
1 code implementation • 25 Feb 2020 • Fangda Han, Ricardo Guerrero, Vladimir Pavlovic
In this work we propose a new computational framework, based on generative deep models, for synthesis of photo-realistic food meal images from textual list of its ingredients.
no code implementations • 13 Oct 2019 • Samuel S. Sohn, Seonghyeon Moon, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Mubbasir Kapadia
In this paper, we propose an approach to instantly predict the long-term flow of crowds in arbitrarily large, realistic environments.
no code implementations • 26 Sep 2019 • Behnam Gholami, Pritish Sahu, Minyoung Kim, Vladimir Pavlovic
In this paper, we improve the performance of DA by introducing a discriminative discrepancy measure which takes advantage of auxiliary information available in the source and the target domains to better align the source and target distributions.
no code implementations • 26 Sep 2019 • Mihee Lee, Ognjen Rudovic, Vladimir Pavlovic, Maja Pantic
In this paper, we propose a deep learning approach for facial AU detection that can easily and in a fast manner adapt to a new AU or target subject by leveraging only a few labeled samples from the new task (either an AU or subject).
no code implementations • 26 Sep 2019 • Jiatong Li, Ricardo Guerrero, Vladimir Pavlovic
In this paper, we study the novel problem of not only predicting ingredients from a food image, but also predicting the relative amounts of the detected ingredients.
1 code implementation • ICCV 2019 • Minyoung Kim, Yuting Wang, Pritish Sahu, Vladimir Pavlovic
We propose a family of novel hierarchical Bayesian deep auto-encoder models capable of identifying disentangled factors of variability in data.
1 code implementation • 16 May 2019 • Issam H. Laradji, Mark Schmidt, Vladimir Pavlovic, Minyoung Kim
The key advantage is that the combination of GP and DRF leads to a tractable model that can both handle a variable-sized input as well as learn deep long-range dependency structures of the data.
1 code implementation • 9 May 2019 • Fangda Han, Ricardo Guerrero, Vladimir Pavlovic
In this work we propose a new computational framework, based on generative deep models, for synthesis of photo-realistic food meal images from textual descriptions of its ingredients.
no code implementations • 6 May 2019 • Lu Sheng, Jianfei Cai, Tat-Jen Cham, Vladimir Pavlovic, King Ngi Ngan
In this paper, we propose a generative framework that unifies depth-based 3D facial pose tracking and face model adaptation on-the-fly, in the unconstrained scenarios with heavy occlusions and arbitrary facial expression variations.
1 code implementation • CVPR 2019 • Minyoung Kim, Pritish Sahu, Behnam Gholami, Vladimir Pavlovic
The latter can be achieved by minimizing the maximum discrepancy of predictors (classifiers).
Ranked #3 on Synthetic-to-Real Translation on Syn2Real-C
1 code implementation • 5 Feb 2019 • Minyoung Kim, Yuting Wang, Pritish Sahu, Vladimir Pavlovic
We propose a novel VAE-based deep auto-encoder model that can learn disentangled latent representations in a fully unsupervised manner, endowed with the ability to identify all meaningful sources of variation and their cardinality.
no code implementations • ICLR 2019 • Behnam Gholami, Pritish Sahu, Ognjen Rudovic, Konstantinos Bousmalis, Vladimir Pavlovic
Unsupervised domain adaptation (uDA) models focus on pairwise adaptation settings where there is a single, labeled, source and a single target domain.
no code implementations • 21 Mar 2018 • Hai X. Pham, Yuting Wang, Vladimir Pavlovic
This paper presents Generative Adversarial Talking Head (GATH), a novel deep generative neural network that enables fully automatic facial expression synthesis of an arbitrary portrait with continuous action unit (AU) coefficients.
no code implementations • 2 Oct 2017 • Hai X. Pham, Yuting Wang, Vladimir Pavlovic
We present a deep learning framework for real-time speech-driven 3D facial animation from just raw waveforms.
no code implementations • ICCV 2017 • Behnam Gholami, Ognjen (Oggi) Rudovic, Vladimir Pavlovic
This paper introduces a probabilistic latent variable model to address unsupervised domain adaptation problems.
no code implementations • 29 Sep 2017 • Cuong D. Tran, Ognjen Rudovic, Vladimir Pavlovic
We study the task of unsupervised domain adaptation, where no labeled data from the target domain is provided during training time.
no code implementations • CVPR 2017 • Lu Sheng, Jianfei Cai, Tat-Jen Cham, Vladimir Pavlovic, King Ngi Ngan
We consider the problem of depth-based robust 3D facial pose tracking under unconstrained scenarios with heavy occlusions and arbitrary facial expression variations.
no code implementations • CVPR 2017 • Behnam Gholami, Vladimir Pavlovic
In this paper, we propose a unified non-parametric generative framework for temporal subspace clustering to segment data drawn from a sequentially ordered union of subspaces that deals with the missing features in a principled way.
no code implementations • CVPR 2017 • Robert Walecki, Ognjen, Rudovic, Vladimir Pavlovic, Björn Schuller, Maja Pantic
The goal of this paper is to model these structures and estimate complex feature representations simultaneously by combining conditional random field (CRF) encoded AU dependencies with deep learning.
no code implementations • 25 Mar 2017 • Long Zhao, Fangda Han, Xi Peng, Xun Zhang, Mubbasir Kapadia, Vladimir Pavlovic, Dimitris N. Metaxas
We first recover the facial identity and expressions from the video by fitting a face morphable model for each frame.
no code implementations • 26 Sep 2016 • Shahriar Shariat, Vladimir Pavlovic
Traditional pairwise sequence alignment is based on matching individual samples from two sequences, under time monotonicity constraints.
no code implementations • CVPR 2016 • Robert Walecki, Ognjen Rudovic, Vladimir Pavlovic, Maja Pantic
Joint modeling of the intensity of facial action units (AUs) from face images is challenging due to the large number of AUs (30+) and their intensity levels (6).
no code implementations • 13 Oct 2015 • Robert Walecki, Ognjen Rudovic, Vladimir Pavlovic, Maja Pantic
For instance, in the case of AU detection, the goal is to discriminate between the segments of an image sequence in which this AU is active or inactive.
no code implementations • 10 Jul 2015 • Hai X. Pham, Chongyu Chen, Luc N. Dao, Vladimir Pavlovic, Jianfei Cai, Tat-Jen Cham
We introduce a novel robust hybrid 3D face tracking framework from RGBD video streams, which is capable of tracking head pose and facial actions without pre-calibration or intervention from a user.
no code implementations • 9 Jul 2015 • Chintan A. Dalal, Vladimir Pavlovic, Robert E. Kopp
Designing a covariance function that represents the underlying correlation is a crucial step in modeling complex natural systems, such as climate models.
no code implementations • 3 Jul 2015 • Behnam Babagholami-Mohamadabadi, Sejong Yoon, Vladimir Pavlovic
Bayesian models provide a framework for probabilistic modelling of complex datasets.
no code implementations • 30 Jun 2015 • Changkyu Song, Sejong Yoon, Vladimir Pavlovic
We propose new methods to speed up convergence of the Alternating Direction Method of Multipliers (ADMM), a common optimization tool in the context of large scale and distributed learning.
no code implementations • 30 Jun 2015 • Cuong Tran, Vladimir Pavlovic, Robert Kopp
We study the Gaussian Process regression model in the context of training data with noise in both input and output.
no code implementations • ICCV 2015 • Saehoon Yi, Vladimir Pavlovic
Video segmentation is a stepping stone to understanding video context.
no code implementations • 30 Jun 2015 • Jongpil Kim, Vladimir Pavlovic
We also propose a new framework to recognize the Roman coins which exploits hierarchical structure of the ancient Roman coins using the state-of-the-art classification power of the CNNs adopted to a new task of coin classification.
no code implementations • 22 Jan 2013 • Ognjen Rudovic, Maja Pantic, Vladimir Pavlovic
We propose a novel method for automatic pain intensity estimation from facial images based on the framework of kernel Conditional Ordinal Random Fields (KCORF).
no code implementations • NeurIPS 2012 • Sejong Yoon, Vladimir Pavlovic
In this work we present an approach to estimation and learning of generative probabilistic models in a distributed context where certain sensor data can be missing.
no code implementations • NeurIPS 2008 • Pavel P. Kuksa, Pai-Hsi Huang, Vladimir Pavlovic
We present a new family of linear time algorithms based on sufficient statistics for string comparison with mismatches under the string kernels framework.