Search Results for author: Konstantinos G. Derpanis

Found 48 papers, 18 papers with code

Visual Concept Connectome (VCC): Open World Concept Discovery and their Interlayer Connections in Deep Models

no code implementations • 2 Apr 2024 • Matthew Kowal, Richard P. Wildes, Konstantinos G. Derpanis

Understanding what deep network models capture in their learned representations is a fundamental challenge in computer vision.

Image Classification

Paper
Add Code

PolyOculus: Simultaneous Multi-view Image-based Novel View Synthesis

no code implementations • 28 Feb 2024 • Jason J. Yu, Tristan Aumentado-Armstrong, Fereshteh Forghani, Konstantinos G. Derpanis, Marcus A. Brubaker

This paper considers the problem of generative novel view synthesis (GNVS), generating novel, plausible views of a scene given a limited number of known views.

Novel View Synthesis

Paper
Add Code

Understanding Video Transformers via Universal Concept Discovery

no code implementations • 19 Jan 2024 • Matthew Kowal, Achal Dave, Rares Ambrus, Adrien Gaidon, Konstantinos G. Derpanis, Pavel Tokmakov

Concretely, we seek to explain the decision-making process of video transformers based on high-level, spatiotemporal concepts that are automatically discovered.

Decision Making Fine-grained Action Recognition +3

Paper
Add Code

Reconstructive Latent-Space Neural Radiance Fields for Efficient 3D Scene Representations

no code implementations • 27 Oct 2023 • Tristan Aumentado-Armstrong, Ashkan Mirzaei, Marcus A. Brubaker, Jonathan Kelly, Alex Levinshtein, Konstantinos G. Derpanis, Igor Gilitschenski

The resulting latent-space NeRF can produce novel views with higher quality than standard colour-space NeRFs, as the AE can correct certain visual artifacts, while rendering over three times faster.

Continual Learning Novel View Synthesis

Paper
Add Code

GePSAn: Generative Procedure Step Anticipation in Cooking Videos

no code implementations • ICCV 2023 • Mohamed Ashraf Abdelsalam, Samrudhdhi B. Rangrej, Isma Hadji, Nikita Dvornik, Konstantinos G. Derpanis, Afsaneh Fazly

While most previous work focus on the problem of data scarcity in procedural video datasets, another core challenge of future anticipation is how to account for multiple plausible future realizations in natural settings.

Paper
Add Code

Dual-Camera Joint Deblurring-Denoising

no code implementations • 16 Sep 2023 • Shayan shekarforoush, Amanpreet Walia, Marcus A. Brubaker, Konstantinos G. Derpanis, Alex Levinshtein

Recent image enhancement methods have shown the advantages of using a pair of long and short-exposure images for low-light photography.

Deblurring Denoising +1

Paper
Add Code

Watch Your Steps: Local Image and Scene Editing by Text Instructions

no code implementations • 17 Aug 2023 • Ashkan Mirzaei, Tristan Aumentado-Armstrong, Marcus A. Brubaker, Jonathan Kelly, Alex Levinshtein, Konstantinos G. Derpanis, Igor Gilitschenski

A field is trained on relevance maps of training views, denoted as the relevance field, defining the 3D region within which modifications should be made.

Denoising Image Generation

Paper
Add Code

StepFormer: Self-supervised Step Discovery and Localization in Instructional Videos

no code implementations • CVPR 2023 • Nikita Dvornik, Isma Hadji, Ran Zhang, Konstantinos G. Derpanis, Animesh Garg, Richard P. Wildes, Allan D. Jepson

This motivates the need to temporally localize the instruction steps in such videos, i. e. the task called key-step localization.

Paper
Add Code

Long-Term Photometric Consistent Novel View Synthesis with Diffusion Models

1 code implementation • ICCV 2023 • Jason J. Yu, Fereshteh Forghani, Konstantinos G. Derpanis, Marcus A. Brubaker

In this paper, we propose a novel generative model capable of producing a sequence of photorealistic images consistent with a specified camera trajectory, and a single starting image.

Novel View Synthesis

Paper
Code

Reference-guided Controllable Inpainting of Neural Radiance Fields

no code implementations • ICCV 2023 • Ashkan Mirzaei, Tristan Aumentado-Armstrong, Marcus A. Brubaker, Jonathan Kelly, Alex Levinshtein, Konstantinos G. Derpanis, Igor Gilitschenski

The popularity of Neural Radiance Fields (NeRFs) for view synthesis has led to a desire for NeRF editing tools.

Paper
Add Code

SPIn-NeRF: Multiview Segmentation and Perceptual Inpainting with Neural Radiance Fields

no code implementations • CVPR 2023 • Ashkan Mirzaei, Tristan Aumentado-Armstrong, Konstantinos G. Derpanis, Jonathan Kelly, Marcus A. Brubaker, Igor Gilitschenski, Alex Levinshtein

We refer to this task as 3D inpainting.

3D Inpainting 3D Instance Segmentation +3

Paper
Add Code

Quantifying and Learning Static vs. Dynamic Information in Deep Spatiotemporal Networks

no code implementations • 3 Nov 2022 • Matthew Kowal, Mennatullah Siam, Md Amirul Islam, Neil D. B. Bruce, Richard P. Wildes, Konstantinos G. Derpanis

(ii) Some datasets that are assumed to be biased toward dynamics are actually biased toward static information.

Action Recognition Instance Segmentation +4

Paper
Add Code

SAGE: Saliency-Guided Mixup with Optimal Rearrangements

1 code implementation • 31 Oct 2022 • Avery Ma, Nikita Dvornik, Ran Zhang, Leila Pishdad, Konstantinos G. Derpanis, Afsaneh Fazly

For image classification, the most popular data augmentation techniques range from simple photometric and geometrical transformations, to more complex methods that use visual saliency to craft new training examples.

Data Augmentation Domain Generalization +2

Paper
Code

A Deeper Dive Into What Deep Spatiotemporal Networks Encode: Quantifying Static vs. Dynamic Information

1 code implementation • CVPR 2022 • Matthew Kowal, Mennatullah Siam, Md Amirul Islam, Neil D. B. Bruce, Richard P. Wildes, Konstantinos G. Derpanis

To show the efficacy of our approach, we analyse two widely studied tasks, action recognition and video object segmentation.

Action Recognition Semantic Segmentation +2

Paper
Code

P3IV: Probabilistic Procedure Planning from Instructional Videos with Weak Supervision

1 code implementation • CVPR 2022 • He Zhao, Isma Hadji, Nikita Dvornik, Konstantinos G. Derpanis, Richard P. Wildes, Allan D. Jepson

Our model is based on a transformer equipped with a memory module, which maps the start and goal observations to a sequence of plausible actions.

Paper
Code

Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations

no code implementations • 20 Apr 2022 • Leila Pishdad, Ran Zhang, Konstantinos G. Derpanis, Allan Jepson, Afsaneh Fazly

Probabilistic embeddings have proven useful for capturing polysemous word meanings, as well as ambiguity in image matching.

Image Retrieval Image-text matching +2

Paper
Add Code

Semantic keypoint-based pose estimation from single RGB frames

1 code implementation • 12 Apr 2022 • Karl Schmeckpeper, Philip R. Osteen, Yufu Wang, Georgios Pavlakos, Kenneth Chaney, Wyatt Jordan, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis

Empirically, we show that our approach can accurately recover the 6-DoF object pose for both instance- and class-based scenarios even against a cluttered background.

Object Pose Estimation

Paper
Code

Temporal Transductive Inference for Few-Shot Video Object Segmentation

1 code implementation • 27 Mar 2022 • Mennatullah Siam, Konstantinos G. Derpanis, Richard P. Wildes

In this paper, we present a simple but effective temporal transductive inference (TTI) approach that leverages temporal consistency in the unlabelled video frames during few-shot inference.

Meta-Learning Object +3

Paper
Code

Simpler Does It: Generating Semantic Labels with Objectness Guidance

no code implementations • 20 Oct 2021 • Md Amirul Islam, Matthew Kowal, Sen Jia, Konstantinos G. Derpanis, Neil D. B. Bruce

Extensive experiments demonstrate the high quality of our generated pseudo-labels and effectiveness of the proposed framework in a variety of domains.

Multi-Task Learning Object +1

Paper
Add Code

Drop-DTW: Aligning Common Signal Between Sequences While Dropping Outliers

no code implementations • NeurIPS 2021 • Nikita Dvornik, Isma Hadji, Konstantinos G. Derpanis, Animesh Garg, Allan D. Jepson

In our experiments, we show that Drop-DTW is a robust similarity measure for sequence retrieval and demonstrate its effectiveness as a training loss on diverse applications.

Dynamic Time Warping Representation Learning +1

Paper
Add Code

SegMix: Co-occurrence Driven Mixup for Semantic Segmentation and Adversarial Robustness

no code implementations • 23 Aug 2021 • Md Amirul Islam, Matthew Kowal, Konstantinos G. Derpanis, Neil D. B. Bruce

The premise is based on the notion of feature binding, which is defined as the process by which activations spread across space and layers in the network are successfully integrated to arrive at a correct inference decision.

Adversarial Robustness Denoising +1

Paper
Add Code

Global Pooling, More than Meets the Eye: Position Information is Encoded Channel-Wise in CNNs

1 code implementation • ICCV 2021 • Md Amirul Islam, Matthew Kowal, Sen Jia, Konstantinos G. Derpanis, Neil D. B. Bruce

In this paper, we challenge the common assumption that collapsing the spatial dimensions of a 3D (spatial-channel) tensor in a convolutional neural network (CNN) into a vector via global pooling removes all spatial information.

Data Augmentation Position +2

Paper
Code

Representation Learning via Global Temporal Alignment and Cycle-Consistency

1 code implementation • CVPR 2021 • Isma Hadji, Konstantinos G. Derpanis, Allan D. Jepson

We introduce a weakly supervised method for representation learning based on aligning temporal sequences (e. g., videos) of the same process (e. g., human action).

Action Classification Dynamic Time Warping +5

Paper
Code

Stochastic Image-to-Video Synthesis using cINNs

1 code implementation • CVPR 2021 • Michael Dorkenwald, Timo Milbich, Andreas Blattmann, Robin Rombach, Konstantinos G. Derpanis, Björn Ommer

Video understanding calls for a model to learn the characteristic interplay between static scene content and its dynamics: Given an image, the model must be able to predict a future progression of the portrayed scene and, conversely, a video should be explained in terms of its static image content and all the remaining characteristics not present in the initial frame.

Video Understanding

179

Paper
Code

Position, Padding and Predictions: A Deeper Look at Position Information in CNNs

no code implementations • 28 Jan 2021 • Md Amirul Islam, Matthew Kowal, Sen Jia, Konstantinos G. Derpanis, Neil D. B. Bruce

; (ii) Does position encoding affect the learning of semantic representations?

Position

Paper
Add Code

Shape or Texture: Understanding Discriminative Features in CNNs

no code implementations • 27 Jan 2021 • Md Amirul Islam, Matthew Kowal, Patrick Esser, Sen Jia, Bjorn Ommer, Konstantinos G. Derpanis, Neil Bruce

Contrasting the previous evidence that neurons in the later layers of a Convolutional Neural Network (CNN) respond to complex object shapes, recent studies have shown that CNNs actually exhibit a `texture bias': given an image with both texture and shape cues (e. g., a stylized image), a CNN is biased towards predicting the category corresponding to the texture.

Paper
Add Code

Boundary Effects in CNNs: Feature or Bug?

no code implementations • 1 Jan 2021 • Md Amirul Islam, Matthew Kowal, Sen Jia, Konstantinos G. Derpanis, Neil Bruce

Finally, we demonstrate the implications of these findings on a number of real-world tasks to show that position information can act as a feature or a bug.

Position

Paper
Add Code

Shape or Texture: Disentangling Discriminative Features in CNNs

no code implementations • ICLR 2021 • Md Amirul Islam, Matthew Kowal, Patrick Esser, Sen Jia, Björn Ommer, Konstantinos G. Derpanis, Neil Bruce

Contrasting the previous evidence that neurons in the later layers of a Convolutional Neural Network (CNN) respond to complex object shapes, recent studies have shown that CNNs actually exhibit a 'texture bias': given an image with both texture and shape cues (e. g., a stylized image), a CNN is biased towards predicting the category corresponding to the texture.

Paper
Add Code

Cycle-Consistent Generative Rendering for 2D-3D Modality Translation

2 code implementations • 16 Nov 2020 • Tristan Aumentado-Armstrong, Alex Levinshtein, Stavros Tsogkas, Konstantinos G. Derpanis, Allan D. Jepson

In the context of computer vision, this corresponds to a learnable module that serves two purposes: (i) generate a realistic rendering of a 3D object (shape-to-image translation) and (ii) infer a realistic 3D shape from an image (image-to-shape translation).

Image Generation Translation

Paper
Code

Wavelet Flow: Fast Training of High Resolution Normalizing Flows

1 code implementation • NeurIPS 2020 • Jason J. Yu, Konstantinos G. Derpanis, Marcus A. Brubaker

Normalizing flows are a class of probabilistic generative models which allow for both fast density computation and efficient sampling and are effective at modelling complex distributions like images.

Super-Resolution Vocal Bursts Intensity Prediction

Paper
Code

Feature Binding with Category-Dependant MixUp for Semantic Segmentation and Adversarial Robustness

no code implementations • 13 Aug 2020 • Md Amirul Islam, Matthew Kowal, Konstantinos G. Derpanis, Neil D. B. Bruce

In this paper, we present a strategy for training convolutional neural networks to effectively resolve interference arising from competing hypotheses relating to inter-categorical information throughout the network.

Adversarial Robustness Denoising +1

Paper
Add Code

Learning Multi-Scale Photo Exposure Correction

2 code implementations • CVPR 2021 • Mahmoud Afifi, Konstantinos G. Derpanis, Björn Ommer, Michael S. Brown

In contrast, our proposed method targets both over- and underexposure errors in photographs.

Ranked #3 on Image Enhancement on Exposure-Errors

Image Enhancement

485

Paper
Code

Keyframing the Future: Discovering Temporal Hierarchy with Keyframe-Inpainter Prediction

no code implementations • 25 Sep 2019 • Karl Pertsch, Oleh Rybkin, Jingyun Yang, Konstantinos G. Derpanis, Kostas Daniilidis, Joseph J. Lim, Andrew Jaegle

To flexibly and efficiently reason about temporal sequences, abstract representations that compactly represent the important information in the sequence are needed.

Temporal Sequences

Paper
Add Code

End-to-End Learning of Representations for Asynchronous Event-Based Data

1 code implementation • ICCV 2019 • Daniel Gehrig, Antonio Loquercio, Konstantinos G. Derpanis, Davide Scaramuzza

Event cameras are vision sensors that record asynchronous streams of per-pixel brightness changes, referred to as "events".

Ranked #15 on Robust classification on N-ImageNet

Classification Object Recognition +2

130

Paper
Code

Keyframing the Future: Keyframe Discovery for Visual Prediction and Planning

no code implementations • L4DC 2020 • Karl Pertsch, Oleh Rybkin, Jingyun Yang, Shenghao Zhou, Konstantinos G. Derpanis, Kostas Daniilidis, Joseph Lim, Andrew Jaegle

We propose a model that learns to discover these important events and the times when they occur and uses them to represent the full sequence.

Video Prediction

Paper
Add Code

Joint Spatial and Layer Attention for Convolutional Networks

no code implementations • 16 Jan 2019 • Tony Joseph, Konstantinos G. Derpanis, Faisal Z. Qureshi

In this paper, we propose a novel approach that learns to sequentially attend to different Convolutional Neural Networks (CNN) layers (i. e., ``what'' feature abstraction to attend to) and different spatial locations of the selected feature map (i. e., ``where'') to perform the task at hand.

Camera Localization Classification +2

Paper
Add Code

Learning what you can do before doing anything

no code implementations • ICLR 2019 • Oleh Rybkin, Karl Pertsch, Konstantinos G. Derpanis, Kostas Daniilidis, Andrew Jaegle

We introduce a loss term that encourages the network to capture the composability of visual sequences and show that it leads to representations that disentangle the structure of actions.

Video Prediction

Paper
Add Code

SIPs: Succinct Interest Points from Unsupervised Inlierness Probability Learning

1 code implementation • 3 May 2018 • Titus Cieslewski, Konstantinos G. Derpanis, Davide Scaramuzza

In certain cases, our detector is able to obtain an equivalent amount of inliers with as little as 60% of the amount of points of other detectors.

Interest Point Detection Pose Estimation +1

Paper
Code

Predicting the Future with Transformational States

no code implementations • 26 Mar 2018 • Andrew Jaegle, Oleh Rybkin, Konstantinos G. Derpanis, Kostas Daniilidis

We couple this latent state with a recurrent neural network (RNN) core that predicts future frames by transforming past states into future states by applying the accumulated state transformation with a learned operator.

Paper
Add Code

Segmentation-Aware Convolutional Networks Using Local Attention Masks

1 code implementation • ICCV 2017 • Adam W. Harley, Konstantinos G. Derpanis, Iasonas Kokkinos

We introduce an approach to integrate segmentation information within a convolutional neural network (CNN).

Optical Flow Estimation Segmentation +1

144

Paper
Code

Two-Stream Convolutional Networks for Dynamic Texture Synthesis

1 code implementation • CVPR 2018 • Matthew Tesfaldet, Marcus A. Brubaker, Konstantinos G. Derpanis

Given an input dynamic texture, statistics of filter responses from the object recognition ConvNet encapsulate the per-frame appearance of the input texture, while statistics of filter responses from the optical flow ConvNet model its dynamics.

Object Recognition Optical Flow Estimation +3

191

Paper
Code

Harvesting Multiple Views for Marker-less 3D Human Pose Annotations

no code implementations • CVPR 2017 • Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis

In this paper, we present a geometry-driven approach to automatically collect annotations for human pose prediction tasks.

Ranked #28 on Weakly-supervised 3D Human Pose Estimation on Human3.6M

Pose Prediction Weakly-supervised 3D Human Pose Estimation

Paper
Add Code

6-DoF Object Pose from Semantic Keypoints

1 code implementation • 14 Mar 2017 • Georgios Pavlakos, Xiaowei Zhou, Aaron Chan, Konstantinos G. Derpanis, Kostas Daniilidis

This paper presents a novel approach to estimating the continuous six degree of freedom (6-DoF) pose (3D translation and rotation) of an object from a single RGB image.

Ranked #1 on Keypoint Detection on Pascal3D+

Keypoint Detection Object +1

Paper
Code

Building Usage Profiles Using Deep Neural Nets

no code implementations • 23 Feb 2017 • Domenic Curro, Konstantinos G. Derpanis, Andriy V. Miranskyy

Our goal is to construct an automatic approach to extract information about user actions from instructional videos.

Paper
Add Code

Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose

3 code implementations • CVPR 2017 • Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis

This paper addresses the challenge of 3D human pose estimation from a single color image.

Ranked #16 on 3D Human Pose Estimation on HumanEva-I

3D Human Pose Estimation

Paper
Code

Back to Basics: Unsupervised Learning of Optical Flow via Brightness Constancy and Motion Smoothness

no code implementations • 20 Aug 2016 • Jason J. Yu, Adam W. Harley, Konstantinos G. Derpanis

Recently, convolutional networks (convnets) have proven useful for predicting optical flow.

Optical Flow Estimation

Paper
Add Code

Learning Dense Convolutional Embeddings for Semantic Segmentation

no code implementations • 13 Nov 2015 • Adam W. Harley, Konstantinos G. Derpanis, Iasonas Kokkinos

That is, for any two pixels on the same object, the embeddings are trained to be similar; for any pair that straddles an object boundary, the embeddings are trained to be dissimilar.

General Classification Object +1

Paper
Add Code

Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval

no code implementations • 25 Feb 2015 • Adam W. Harley, Alex Ufkes, Konstantinos G. Derpanis

This paper presents a new state-of-the-art for document image classification and retrieval, using features learned by deep convolutional neural networks (CNNs).

Descriptive Document Image Classification +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.