no code implementations • 19 Mar 2025 • Fereshteh Forghani, Jason J. Yu, Tristan Aumentado-Armstrong, Konstantinos G. Derpanis, Marcus A. Brubaker
We study the effect of scene scale ambiguity in GNVS when sampled from a single image by isolating its effect on the resulting models and, based on these intuitions, define new metrics that measure the scale inconsistency of generated views.
no code implementations • 18 Feb 2025 • Ahmad Salimi, Tristan Aumentado-Armstrong, Marcus A. Brubaker, Konstantinos G. Derpanis
In this paper, we focus on 3D scene inpainting, where parts of an input image set, captured from different viewpoints, are masked out.
no code implementations • CVPR 2024 • Matthew Kowal, Richard P. Wildes, Konstantinos G. Derpanis
Understanding what deep network models capture in their learned representations is a fundamental challenge in computer vision.
no code implementations • 28 Feb 2024 • Jason J. Yu, Tristan Aumentado-Armstrong, Fereshteh Forghani, Konstantinos G. Derpanis, Marcus A. Brubaker
This paper considers the problem of generative novel view synthesis (GNVS), generating novel, plausible views of a scene given a limited number of known views.
no code implementations • CVPR 2024 • Matthew Kowal, Achal Dave, Rares Ambrus, Adrien Gaidon, Konstantinos G. Derpanis, Pavel Tokmakov
Concretely, we seek to explain the decision-making process of video transformers based on high-level, spatiotemporal concepts that are automatically discovered.
no code implementations • 27 Oct 2023 • Tristan Aumentado-Armstrong, Ashkan Mirzaei, Marcus A. Brubaker, Jonathan Kelly, Alex Levinshtein, Konstantinos G. Derpanis, Igor Gilitschenski
The resulting latent-space NeRF can produce novel views with higher quality than standard colour-space NeRFs, as the AE can correct certain visual artifacts, while rendering over three times faster.
1 code implementation • ICCV 2023 • Mohamed Ashraf Abdelsalam, Samrudhdhi B. Rangrej, Isma Hadji, Nikita Dvornik, Konstantinos G. Derpanis, Afsaneh Fazly
While most previous work focus on the problem of data scarcity in procedural video datasets, another core challenge of future anticipation is how to account for multiple plausible future realizations in natural settings.
no code implementations • 16 Sep 2023 • Shayan shekarforoush, Amanpreet Walia, Marcus A. Brubaker, Konstantinos G. Derpanis, Alex Levinshtein
Recent image enhancement methods have shown the advantages of using a pair of long and short-exposure images for low-light photography.
no code implementations • 17 Aug 2023 • Ashkan Mirzaei, Tristan Aumentado-Armstrong, Marcus A. Brubaker, Jonathan Kelly, Alex Levinshtein, Konstantinos G. Derpanis, Igor Gilitschenski
A field is trained on relevance maps of training views, denoted as the relevance field, defining the 3D region within which modifications should be made.
no code implementations • CVPR 2023 • Nikita Dvornik, Isma Hadji, Ran Zhang, Konstantinos G. Derpanis, Animesh Garg, Richard P. Wildes, Allan D. Jepson
This motivates the need to temporally localize the instruction steps in such videos, i. e. the task called key-step localization.
1 code implementation • ICCV 2023 • Jason J. Yu, Fereshteh Forghani, Konstantinos G. Derpanis, Marcus A. Brubaker
In this paper, we propose a novel generative model capable of producing a sequence of photorealistic images consistent with a specified camera trajectory, and a single starting image.
no code implementations • ICCV 2023 • Ashkan Mirzaei, Tristan Aumentado-Armstrong, Marcus A. Brubaker, Jonathan Kelly, Alex Levinshtein, Konstantinos G. Derpanis, Igor Gilitschenski
The popularity of Neural Radiance Fields (NeRFs) for view synthesis has led to a desire for NeRF editing tools.
no code implementations • CVPR 2023 • Ashkan Mirzaei, Tristan Aumentado-Armstrong, Konstantinos G. Derpanis, Jonathan Kelly, Marcus A. Brubaker, Igor Gilitschenski, Alex Levinshtein
We refer to this task as 3D inpainting.
no code implementations • 3 Nov 2022 • Matthew Kowal, Mennatullah Siam, Md Amirul Islam, Neil D. B. Bruce, Richard P. Wildes, Konstantinos G. Derpanis
(ii) Some datasets that are assumed to be biased toward dynamics are actually biased toward static information.
1 code implementation • 31 Oct 2022 • Avery Ma, Nikita Dvornik, Ran Zhang, Leila Pishdad, Konstantinos G. Derpanis, Afsaneh Fazly
For image classification, the most popular data augmentation techniques range from simple photometric and geometrical transformations, to more complex methods that use visual saliency to craft new training examples.
1 code implementation • CVPR 2022 • Matthew Kowal, Mennatullah Siam, Md Amirul Islam, Neil D. B. Bruce, Richard P. Wildes, Konstantinos G. Derpanis
To show the efficacy of our approach, we analyse two widely studied tasks, action recognition and video object segmentation.
1 code implementation • CVPR 2022 • He Zhao, Isma Hadji, Nikita Dvornik, Konstantinos G. Derpanis, Richard P. Wildes, Allan D. Jepson
Our model is based on a transformer equipped with a memory module, which maps the start and goal observations to a sequence of plausible actions.
no code implementations • 20 Apr 2022 • Leila Pishdad, Ran Zhang, Konstantinos G. Derpanis, Allan Jepson, Afsaneh Fazly
Probabilistic embeddings have proven useful for capturing polysemous word meanings, as well as ambiguity in image matching.
1 code implementation • 12 Apr 2022 • Karl Schmeckpeper, Philip R. Osteen, Yufu Wang, Georgios Pavlakos, Kenneth Chaney, Wyatt Jordan, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis
Empirically, we show that our approach can accurately recover the 6-DoF object pose for both instance- and class-based scenarios even against a cluttered background.
1 code implementation • 27 Mar 2022 • Mennatullah Siam, Konstantinos G. Derpanis, Richard P. Wildes
In this paper, we present a simple but effective temporal transductive inference (TTI) approach that leverages temporal consistency in the unlabelled video frames during few-shot inference.
no code implementations • 20 Oct 2021 • Md Amirul Islam, Matthew Kowal, Sen Jia, Konstantinos G. Derpanis, Neil D. B. Bruce
Extensive experiments demonstrate the high quality of our generated pseudo-labels and effectiveness of the proposed framework in a variety of domains.
no code implementations • NeurIPS 2021 • Nikita Dvornik, Isma Hadji, Konstantinos G. Derpanis, Animesh Garg, Allan D. Jepson
In our experiments, we show that Drop-DTW is a robust similarity measure for sequence retrieval and demonstrate its effectiveness as a training loss on diverse applications.
no code implementations • 23 Aug 2021 • Md Amirul Islam, Matthew Kowal, Konstantinos G. Derpanis, Neil D. B. Bruce
The premise is based on the notion of feature binding, which is defined as the process by which activations spread across space and layers in the network are successfully integrated to arrive at a correct inference decision.
1 code implementation • ICCV 2021 • Md Amirul Islam, Matthew Kowal, Sen Jia, Konstantinos G. Derpanis, Neil D. B. Bruce
In this paper, we challenge the common assumption that collapsing the spatial dimensions of a 3D (spatial-channel) tensor in a convolutional neural network (CNN) into a vector via global pooling removes all spatial information.
1 code implementation • CVPR 2021 • Isma Hadji, Konstantinos G. Derpanis, Allan D. Jepson
We introduce a weakly supervised method for representation learning based on aligning temporal sequences (e. g., videos) of the same process (e. g., human action).
1 code implementation • CVPR 2021 • Michael Dorkenwald, Timo Milbich, Andreas Blattmann, Robin Rombach, Konstantinos G. Derpanis, Björn Ommer
Video understanding calls for a model to learn the characteristic interplay between static scene content and its dynamics: Given an image, the model must be able to predict a future progression of the portrayed scene and, conversely, a video should be explained in terms of its static image content and all the remaining characteristics not present in the initial frame.
no code implementations • 28 Jan 2021 • Md Amirul Islam, Matthew Kowal, Sen Jia, Konstantinos G. Derpanis, Neil D. B. Bruce
; (ii) Does position encoding affect the learning of semantic representations?
no code implementations • 27 Jan 2021 • Md Amirul Islam, Matthew Kowal, Patrick Esser, Sen Jia, Bjorn Ommer, Konstantinos G. Derpanis, Neil Bruce
Contrasting the previous evidence that neurons in the later layers of a Convolutional Neural Network (CNN) respond to complex object shapes, recent studies have shown that CNNs actually exhibit a `texture bias': given an image with both texture and shape cues (e. g., a stylized image), a CNN is biased towards predicting the category corresponding to the texture.
no code implementations • ICLR 2021 • Md Amirul Islam, Matthew Kowal, Patrick Esser, Sen Jia, Björn Ommer, Konstantinos G. Derpanis, Neil Bruce
Contrasting the previous evidence that neurons in the later layers of a Convolutional Neural Network (CNN) respond to complex object shapes, recent studies have shown that CNNs actually exhibit a 'texture bias': given an image with both texture and shape cues (e. g., a stylized image), a CNN is biased towards predicting the category corresponding to the texture.
no code implementations • 1 Jan 2021 • Md Amirul Islam, Matthew Kowal, Sen Jia, Konstantinos G. Derpanis, Neil Bruce
Finally, we demonstrate the implications of these findings on a number of real-world tasks to show that position information can act as a feature or a bug.
2 code implementations • 16 Nov 2020 • Tristan Aumentado-Armstrong, Alex Levinshtein, Stavros Tsogkas, Konstantinos G. Derpanis, Allan D. Jepson
In the context of computer vision, this corresponds to a learnable module that serves two purposes: (i) generate a realistic rendering of a 3D object (shape-to-image translation) and (ii) infer a realistic 3D shape from an image (image-to-shape translation).
1 code implementation • NeurIPS 2020 • Jason J. Yu, Konstantinos G. Derpanis, Marcus A. Brubaker
Normalizing flows are a class of probabilistic generative models which allow for both fast density computation and efficient sampling and are effective at modelling complex distributions like images.
no code implementations • 13 Aug 2020 • Md Amirul Islam, Matthew Kowal, Konstantinos G. Derpanis, Neil D. B. Bruce
In this paper, we present a strategy for training convolutional neural networks to effectively resolve interference arising from competing hypotheses relating to inter-categorical information throughout the network.
2 code implementations • CVPR 2021 • Mahmoud Afifi, Konstantinos G. Derpanis, Björn Ommer, Michael S. Brown
In contrast, our proposed method targets both over- and underexposure errors in photographs.
Ranked #4 on
Image Enhancement
on Exposure-Errors
no code implementations • 25 Sep 2019 • Karl Pertsch, Oleh Rybkin, Jingyun Yang, Konstantinos G. Derpanis, Kostas Daniilidis, Joseph J. Lim, Andrew Jaegle
To flexibly and efficiently reason about temporal sequences, abstract representations that compactly represent the important information in the sequence are needed.
1 code implementation • ICCV 2019 • Daniel Gehrig, Antonio Loquercio, Konstantinos G. Derpanis, Davide Scaramuzza
Event cameras are vision sensors that record asynchronous streams of per-pixel brightness changes, referred to as "events".
Ranked #3 on
Classification
on N-CARS
(using extra training data)
no code implementations • L4DC 2020 • Karl Pertsch, Oleh Rybkin, Jingyun Yang, Shenghao Zhou, Konstantinos G. Derpanis, Kostas Daniilidis, Joseph Lim, Andrew Jaegle
We propose a model that learns to discover these important events and the times when they occur and uses them to represent the full sequence.
no code implementations • 16 Jan 2019 • Tony Joseph, Konstantinos G. Derpanis, Faisal Z. Qureshi
In this paper, we propose a novel approach that learns to sequentially attend to different Convolutional Neural Networks (CNN) layers (i. e., ``what'' feature abstraction to attend to) and different spatial locations of the selected feature map (i. e., ``where'') to perform the task at hand.
no code implementations • ICLR 2019 • Oleh Rybkin, Karl Pertsch, Konstantinos G. Derpanis, Kostas Daniilidis, Andrew Jaegle
We introduce a loss term that encourages the network to capture the composability of visual sequences and show that it leads to representations that disentangle the structure of actions.
1 code implementation • 3 May 2018 • Titus Cieslewski, Konstantinos G. Derpanis, Davide Scaramuzza
In certain cases, our detector is able to obtain an equivalent amount of inliers with as little as 60% of the amount of points of other detectors.
no code implementations • 26 Mar 2018 • Andrew Jaegle, Oleh Rybkin, Konstantinos G. Derpanis, Kostas Daniilidis
We couple this latent state with a recurrent neural network (RNN) core that predicts future frames by transforming past states into future states by applying the accumulated state transformation with a learned operator.
1 code implementation • ICCV 2017 • Adam W. Harley, Konstantinos G. Derpanis, Iasonas Kokkinos
We introduce an approach to integrate segmentation information within a convolutional neural network (CNN).
1 code implementation • CVPR 2018 • Matthew Tesfaldet, Marcus A. Brubaker, Konstantinos G. Derpanis
Given an input dynamic texture, statistics of filter responses from the object recognition ConvNet encapsulate the per-frame appearance of the input texture, while statistics of filter responses from the optical flow ConvNet model its dynamics.
no code implementations • CVPR 2017 • Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis
In this paper, we present a geometry-driven approach to automatically collect annotations for human pose prediction tasks.
Ranked #28 on
Weakly-supervised 3D Human Pose Estimation
on Human3.6M
1 code implementation • 14 Mar 2017 • Georgios Pavlakos, Xiaowei Zhou, Aaron Chan, Konstantinos G. Derpanis, Kostas Daniilidis
This paper presents a novel approach to estimating the continuous six degree of freedom (6-DoF) pose (3D translation and rotation) of an object from a single RGB image.
Ranked #1 on
Keypoint Detection
on Pascal3D+
no code implementations • 23 Feb 2017 • Domenic Curro, Konstantinos G. Derpanis, Andriy V. Miranskyy
Our goal is to construct an automatic approach to extract information about user actions from instructional videos.
4 code implementations • CVPR 2017 • Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis
This paper addresses the challenge of 3D human pose estimation from a single color image.
Ranked #18 on
3D Human Pose Estimation
on HumanEva-I
no code implementations • 20 Aug 2016 • Jason J. Yu, Adam W. Harley, Konstantinos G. Derpanis
Recently, convolutional networks (convnets) have proven useful for predicting optical flow.
no code implementations • 13 Nov 2015 • Adam W. Harley, Konstantinos G. Derpanis, Iasonas Kokkinos
That is, for any two pixels on the same object, the embeddings are trained to be similar; for any pair that straddles an object boundary, the embeddings are trained to be dissimilar.
no code implementations • 25 Feb 2015 • Adam W. Harley, Alex Ufkes, Konstantinos G. Derpanis
This paper presents a new state-of-the-art for document image classification and retrieval, using features learned by deep convolutional neural networks (CNNs).