Search Results for author: Konstantinos G. Derpanis

Found 48 papers, 18 papers with code

Visual Concept Connectome (VCC): Open World Concept Discovery and their Interlayer Connections in Deep Models

no code implementations2 Apr 2024 Matthew Kowal, Richard P. Wildes, Konstantinos G. Derpanis

Understanding what deep network models capture in their learned representations is a fundamental challenge in computer vision.

Image Classification

PolyOculus: Simultaneous Multi-view Image-based Novel View Synthesis

no code implementations28 Feb 2024 Jason J. Yu, Tristan Aumentado-Armstrong, Fereshteh Forghani, Konstantinos G. Derpanis, Marcus A. Brubaker

This paper considers the problem of generative novel view synthesis (GNVS), generating novel, plausible views of a scene given a limited number of known views.

Novel View Synthesis

Understanding Video Transformers via Universal Concept Discovery

no code implementations19 Jan 2024 Matthew Kowal, Achal Dave, Rares Ambrus, Adrien Gaidon, Konstantinos G. Derpanis, Pavel Tokmakov

Concretely, we seek to explain the decision-making process of video transformers based on high-level, spatiotemporal concepts that are automatically discovered.

Decision Making Fine-grained Action Recognition +3

Reconstructive Latent-Space Neural Radiance Fields for Efficient 3D Scene Representations

no code implementations27 Oct 2023 Tristan Aumentado-Armstrong, Ashkan Mirzaei, Marcus A. Brubaker, Jonathan Kelly, Alex Levinshtein, Konstantinos G. Derpanis, Igor Gilitschenski

The resulting latent-space NeRF can produce novel views with higher quality than standard colour-space NeRFs, as the AE can correct certain visual artifacts, while rendering over three times faster.

Continual Learning Novel View Synthesis

GePSAn: Generative Procedure Step Anticipation in Cooking Videos

no code implementations ICCV 2023 Mohamed Ashraf Abdelsalam, Samrudhdhi B. Rangrej, Isma Hadji, Nikita Dvornik, Konstantinos G. Derpanis, Afsaneh Fazly

While most previous work focus on the problem of data scarcity in procedural video datasets, another core challenge of future anticipation is how to account for multiple plausible future realizations in natural settings.

Dual-Camera Joint Deblurring-Denoising

no code implementations16 Sep 2023 Shayan shekarforoush, Amanpreet Walia, Marcus A. Brubaker, Konstantinos G. Derpanis, Alex Levinshtein

Recent image enhancement methods have shown the advantages of using a pair of long and short-exposure images for low-light photography.

Deblurring Denoising +1

Watch Your Steps: Local Image and Scene Editing by Text Instructions

no code implementations17 Aug 2023 Ashkan Mirzaei, Tristan Aumentado-Armstrong, Marcus A. Brubaker, Jonathan Kelly, Alex Levinshtein, Konstantinos G. Derpanis, Igor Gilitschenski

A field is trained on relevance maps of training views, denoted as the relevance field, defining the 3D region within which modifications should be made.

Denoising Image Generation

Long-Term Photometric Consistent Novel View Synthesis with Diffusion Models

1 code implementation ICCV 2023 Jason J. Yu, Fereshteh Forghani, Konstantinos G. Derpanis, Marcus A. Brubaker

In this paper, we propose a novel generative model capable of producing a sequence of photorealistic images consistent with a specified camera trajectory, and a single starting image.

Novel View Synthesis

SAGE: Saliency-Guided Mixup with Optimal Rearrangements

1 code implementation31 Oct 2022 Avery Ma, Nikita Dvornik, Ran Zhang, Leila Pishdad, Konstantinos G. Derpanis, Afsaneh Fazly

For image classification, the most popular data augmentation techniques range from simple photometric and geometrical transformations, to more complex methods that use visual saliency to craft new training examples.

Data Augmentation Domain Generalization +2

P3IV: Probabilistic Procedure Planning from Instructional Videos with Weak Supervision

1 code implementation CVPR 2022 He Zhao, Isma Hadji, Nikita Dvornik, Konstantinos G. Derpanis, Richard P. Wildes, Allan D. Jepson

Our model is based on a transformer equipped with a memory module, which maps the start and goal observations to a sequence of plausible actions.

Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations

no code implementations20 Apr 2022 Leila Pishdad, Ran Zhang, Konstantinos G. Derpanis, Allan Jepson, Afsaneh Fazly

Probabilistic embeddings have proven useful for capturing polysemous word meanings, as well as ambiguity in image matching.

Image Retrieval Image-text matching +2

Semantic keypoint-based pose estimation from single RGB frames

1 code implementation12 Apr 2022 Karl Schmeckpeper, Philip R. Osteen, Yufu Wang, Georgios Pavlakos, Kenneth Chaney, Wyatt Jordan, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis

Empirically, we show that our approach can accurately recover the 6-DoF object pose for both instance- and class-based scenarios even against a cluttered background.

Object Pose Estimation

Temporal Transductive Inference for Few-Shot Video Object Segmentation

1 code implementation27 Mar 2022 Mennatullah Siam, Konstantinos G. Derpanis, Richard P. Wildes

In this paper, we present a simple but effective temporal transductive inference (TTI) approach that leverages temporal consistency in the unlabelled video frames during few-shot inference.

Meta-Learning Object +3

Simpler Does It: Generating Semantic Labels with Objectness Guidance

no code implementations20 Oct 2021 Md Amirul Islam, Matthew Kowal, Sen Jia, Konstantinos G. Derpanis, Neil D. B. Bruce

Extensive experiments demonstrate the high quality of our generated pseudo-labels and effectiveness of the proposed framework in a variety of domains.

Multi-Task Learning Object +1

Drop-DTW: Aligning Common Signal Between Sequences While Dropping Outliers

no code implementations NeurIPS 2021 Nikita Dvornik, Isma Hadji, Konstantinos G. Derpanis, Animesh Garg, Allan D. Jepson

In our experiments, we show that Drop-DTW is a robust similarity measure for sequence retrieval and demonstrate its effectiveness as a training loss on diverse applications.

Dynamic Time Warping Representation Learning +1

SegMix: Co-occurrence Driven Mixup for Semantic Segmentation and Adversarial Robustness

no code implementations23 Aug 2021 Md Amirul Islam, Matthew Kowal, Konstantinos G. Derpanis, Neil D. B. Bruce

The premise is based on the notion of feature binding, which is defined as the process by which activations spread across space and layers in the network are successfully integrated to arrive at a correct inference decision.

Adversarial Robustness Denoising +1

Global Pooling, More than Meets the Eye: Position Information is Encoded Channel-Wise in CNNs

1 code implementation ICCV 2021 Md Amirul Islam, Matthew Kowal, Sen Jia, Konstantinos G. Derpanis, Neil D. B. Bruce

In this paper, we challenge the common assumption that collapsing the spatial dimensions of a 3D (spatial-channel) tensor in a convolutional neural network (CNN) into a vector via global pooling removes all spatial information.

Data Augmentation Position +2

Representation Learning via Global Temporal Alignment and Cycle-Consistency

1 code implementation CVPR 2021 Isma Hadji, Konstantinos G. Derpanis, Allan D. Jepson

We introduce a weakly supervised method for representation learning based on aligning temporal sequences (e. g., videos) of the same process (e. g., human action).

Action Classification Dynamic Time Warping +5

Stochastic Image-to-Video Synthesis using cINNs

1 code implementation CVPR 2021 Michael Dorkenwald, Timo Milbich, Andreas Blattmann, Robin Rombach, Konstantinos G. Derpanis, Björn Ommer

Video understanding calls for a model to learn the characteristic interplay between static scene content and its dynamics: Given an image, the model must be able to predict a future progression of the portrayed scene and, conversely, a video should be explained in terms of its static image content and all the remaining characteristics not present in the initial frame.

Video Understanding

Shape or Texture: Understanding Discriminative Features in CNNs

no code implementations27 Jan 2021 Md Amirul Islam, Matthew Kowal, Patrick Esser, Sen Jia, Bjorn Ommer, Konstantinos G. Derpanis, Neil Bruce

Contrasting the previous evidence that neurons in the later layers of a Convolutional Neural Network (CNN) respond to complex object shapes, recent studies have shown that CNNs actually exhibit a `texture bias': given an image with both texture and shape cues (e. g., a stylized image), a CNN is biased towards predicting the category corresponding to the texture.

Boundary Effects in CNNs: Feature or Bug?

no code implementations1 Jan 2021 Md Amirul Islam, Matthew Kowal, Sen Jia, Konstantinos G. Derpanis, Neil Bruce

Finally, we demonstrate the implications of these findings on a number of real-world tasks to show that position information can act as a feature or a bug.

Position

Shape or Texture: Disentangling Discriminative Features in CNNs

no code implementations ICLR 2021 Md Amirul Islam, Matthew Kowal, Patrick Esser, Sen Jia, Björn Ommer, Konstantinos G. Derpanis, Neil Bruce

Contrasting the previous evidence that neurons in the later layers of a Convolutional Neural Network (CNN) respond to complex object shapes, recent studies have shown that CNNs actually exhibit a 'texture bias': given an image with both texture and shape cues (e. g., a stylized image), a CNN is biased towards predicting the category corresponding to the texture.

Cycle-Consistent Generative Rendering for 2D-3D Modality Translation

2 code implementations16 Nov 2020 Tristan Aumentado-Armstrong, Alex Levinshtein, Stavros Tsogkas, Konstantinos G. Derpanis, Allan D. Jepson

In the context of computer vision, this corresponds to a learnable module that serves two purposes: (i) generate a realistic rendering of a 3D object (shape-to-image translation) and (ii) infer a realistic 3D shape from an image (image-to-shape translation).

Image Generation Translation

Wavelet Flow: Fast Training of High Resolution Normalizing Flows

1 code implementation NeurIPS 2020 Jason J. Yu, Konstantinos G. Derpanis, Marcus A. Brubaker

Normalizing flows are a class of probabilistic generative models which allow for both fast density computation and efficient sampling and are effective at modelling complex distributions like images.

Super-Resolution Vocal Bursts Intensity Prediction

Feature Binding with Category-Dependant MixUp for Semantic Segmentation and Adversarial Robustness

no code implementations13 Aug 2020 Md Amirul Islam, Matthew Kowal, Konstantinos G. Derpanis, Neil D. B. Bruce

In this paper, we present a strategy for training convolutional neural networks to effectively resolve interference arising from competing hypotheses relating to inter-categorical information throughout the network.

Adversarial Robustness Denoising +1

Keyframing the Future: Discovering Temporal Hierarchy with Keyframe-Inpainter Prediction

no code implementations25 Sep 2019 Karl Pertsch, Oleh Rybkin, Jingyun Yang, Konstantinos G. Derpanis, Kostas Daniilidis, Joseph J. Lim, Andrew Jaegle

To flexibly and efficiently reason about temporal sequences, abstract representations that compactly represent the important information in the sequence are needed.

Temporal Sequences

Joint Spatial and Layer Attention for Convolutional Networks

no code implementations16 Jan 2019 Tony Joseph, Konstantinos G. Derpanis, Faisal Z. Qureshi

In this paper, we propose a novel approach that learns to sequentially attend to different Convolutional Neural Networks (CNN) layers (i. e., ``what'' feature abstraction to attend to) and different spatial locations of the selected feature map (i. e., ``where'') to perform the task at hand.

Camera Localization Classification +2

Learning what you can do before doing anything

no code implementations ICLR 2019 Oleh Rybkin, Karl Pertsch, Konstantinos G. Derpanis, Kostas Daniilidis, Andrew Jaegle

We introduce a loss term that encourages the network to capture the composability of visual sequences and show that it leads to representations that disentangle the structure of actions.

Video Prediction

SIPs: Succinct Interest Points from Unsupervised Inlierness Probability Learning

1 code implementation3 May 2018 Titus Cieslewski, Konstantinos G. Derpanis, Davide Scaramuzza

In certain cases, our detector is able to obtain an equivalent amount of inliers with as little as 60% of the amount of points of other detectors.

Interest Point Detection Pose Estimation +1

Predicting the Future with Transformational States

no code implementations26 Mar 2018 Andrew Jaegle, Oleh Rybkin, Konstantinos G. Derpanis, Kostas Daniilidis

We couple this latent state with a recurrent neural network (RNN) core that predicts future frames by transforming past states into future states by applying the accumulated state transformation with a learned operator.

Two-Stream Convolutional Networks for Dynamic Texture Synthesis

1 code implementation CVPR 2018 Matthew Tesfaldet, Marcus A. Brubaker, Konstantinos G. Derpanis

Given an input dynamic texture, statistics of filter responses from the object recognition ConvNet encapsulate the per-frame appearance of the input texture, while statistics of filter responses from the optical flow ConvNet model its dynamics.

Object Recognition Optical Flow Estimation +3

6-DoF Object Pose from Semantic Keypoints

1 code implementation14 Mar 2017 Georgios Pavlakos, Xiaowei Zhou, Aaron Chan, Konstantinos G. Derpanis, Kostas Daniilidis

This paper presents a novel approach to estimating the continuous six degree of freedom (6-DoF) pose (3D translation and rotation) of an object from a single RGB image.

Keypoint Detection Object +1

Building Usage Profiles Using Deep Neural Nets

no code implementations23 Feb 2017 Domenic Curro, Konstantinos G. Derpanis, Andriy V. Miranskyy

Our goal is to construct an automatic approach to extract information about user actions from instructional videos.

Learning Dense Convolutional Embeddings for Semantic Segmentation

no code implementations13 Nov 2015 Adam W. Harley, Konstantinos G. Derpanis, Iasonas Kokkinos

That is, for any two pixels on the same object, the embeddings are trained to be similar; for any pair that straddles an object boundary, the embeddings are trained to be dissimilar.

General Classification Object +1

Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval

no code implementations25 Feb 2015 Adam W. Harley, Alex Ufkes, Konstantinos G. Derpanis

This paper presents a new state-of-the-art for document image classification and retrieval, using features learned by deep convolutional neural networks (CNNs).

Descriptive Document Image Classification +2

Cannot find the paper you are looking for? You can Submit a new open access paper.