Search Results for author: Theo Gevers

Found 48 papers, 18 papers with code

Joint 3D Layout and Depth Prediction from a Single Indoor Panorama Image

no code implementations ECCV 2020 Wei Zeng, Sezer Karaoglu, Theo Gevers

Leveraging the layout depth map as an intermediate representation, our proposed method outperforms existing methods for both panorama layout prediction and depth estimation.

Depth Estimation Depth Prediction

LumiNet: Latent Intrinsics Meets Diffusion Models for Indoor Scene Relighting

no code implementations29 Nov 2024 Xiaoyan Xing, Konrad Groh, Sezer Karaoglu, Theo Gevers, Anand Bhattad

Our approach makes two key contributions: a data curation strategy from the StyleGAN-based relighting model for our training, and a modified diffusion-based ControlNet that processes both latent intrinsic properties from the source image and latent extrinsic properties from the target image.

MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM

no code implementations25 Nov 2024 Vladimir Yugay, Theo Gevers, Martin R. Oswald

Unfortunately, existing methods are slow, cannot accurately render real-world data, are restricted to two agents, and have limited tracking accuracy.

Autonomous Driving Novel View Synthesis +1

FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training

no code implementations4 Nov 2024 Ruihong Yin, Vladimir Yugay, Yue Li, Sezer Karaoglu, Theo Gevers

To address this, we present a 3D Gaussian-based novel view synthesis method using sparse input images that can accurately render the scene from the viewpoints not covered by the training images.

Depth Estimation Novel View Synthesis

RealDiff: Real-world 3D Shape Completion using Self-Supervised Diffusion Models

no code implementations16 Sep 2024 Başak Melis Öcal, Maxim Tatarchenko, Sezer Karaoglu, Theo Gevers

Specifically, RealDiff simulates a diffusion process at the missing object parts while conditioning the generation on the partial input to address the multimodal nature of the task.

Object Point Cloud Completion

Ray-Distance Volume Rendering for Neural Scene Reconstruction

no code implementations28 Aug 2024 Ruihong Yin, Yunlu Chen, Sezer Karaoglu, Theo Gevers

To tackle this issue, our work proposes a novel approach for indoor scene reconstruction, which instead parameterizes the density function with the Signed Ray Distance Function (SRDF).

3D geometry Indoor Scene Reconstruction

ShapeSplat: A Large-scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining

no code implementations20 Aug 2024 Qi Ma, Yue Li, Bin Ren, Nicu Sebe, Ender Konukoglu, Theo Gevers, Luc van Gool, Danda Pani Paudel

In particular, we show that (1) the distribution of the optimized GS centroids significantly differs from the uniformly sampled point cloud (used for initialization) counterpart; (2) this change in distribution results in degradation in classification but improvement in segmentation tasks when using only the centroids; (3) to leverage additional Gaussian parameters, we propose Gaussian feature grouping in a normalized feature space, along with splats pooling layer, offering a tailored solution to effectively group and embed similar Gaussians, which leads to notable improvement in finetuning tasks.

Representation Learning

SceneTeller: Language-to-3D Scene Generation

no code implementations30 Jul 2024 Başak Melis Öcal, Maxim Tatarchenko, Sezer Karaoglu, Theo Gevers

Designing high-quality indoor 3D scenes is important in many practical applications, such as room planning or game development.

In-Context Learning Scene Generation

Retinex-Diffusion: On Controlling Illumination Conditions in Diffusion Models via Retinex Theory

no code implementations29 Jul 2024 Xiaoyan Xing, Vincent Tao Hu, Jan Hendrik Metzen, Konrad Groh, Sezer Karaoglu, Theo Gevers

This paper introduces a novel approach to illumination manipulation in diffusion models, addressing the gap in conditional image generation with a focus on lighting conditions.

Conditional Image Generation

Auto-Vocabulary Segmentation for LiDAR Points

no code implementations13 Jun 2024 Weijie Wei, Osman Ülger, Fatemeh Karimi Nejadasl, Theo Gevers, Martin R. Oswald

Existing perception methods for autonomous driving fall short of recognizing unknown entities not covered in the training data.

Autonomous Driving Object +3

Modeling Weather Uncertainty for Multi-weather Co-Presence Estimation

no code implementations29 Mar 2024 Qi Bi, ShaoDi You, Theo Gevers

In this paper, we start with solid revisit of the physics definition of weather and how it can be described as a continuous machine learning and computer vision task.

Classification Multi-Label Classification +2

Learning Generalized Segmentation for Foggy-scenes by Bi-directional Wavelet Guidance

1 code implementation Association for the Advancement of Artificial Intelligence (AAAI) 2024 Qi Bi, ShaoDi You, Theo Gevers

We argue that an ideal segmentation model that can be well generalized to foggy-scenes need to simultaneously enhance the content, de-correlate the urban-scene style and de-correlate the fog style.

Autonomous Driving Domain Generalization +4

Gaussian-SLAM: Photo-realistic Dense SLAM with Gaussian Splatting

no code implementations6 Dec 2023 Vladimir Yugay, Yue Li, Theo Gevers, Martin R. Oswald

We present a dense simultaneous localization and mapping (SLAM) method that uses 3D Gaussians as a scene representation.

Simultaneous Localization and Mapping

Relational Prior Knowledge Graphs for Detection and Instance Segmentation

1 code implementation11 Oct 2023 Osman Ülger, Yu Wang, Ysbrand Galama, Sezer Karaoglu, Theo Gevers, Martin R. Oswald

Humans have a remarkable ability to perceive and reason about the world around them by understanding the relationships between objects.

Instance Segmentation Knowledge Graphs +5

HaarNet: Large-scale Linear-Morphological Hybrid Network for RGB-D Semantic Segmentation

no code implementations11 Oct 2023 Rick Groenendijk, Leo Dorst, Theo Gevers

In the network, morphological Haar sampling is applied to both feature channels in several layers, which splits extreme values and high-frequency information such that both can be processed to improve both modalities.

Semantic Segmentation

APNet: Urban-level Scene Segmentation of Aerial Images and Point Clouds

1 code implementation29 Sep 2023 Weijie Wei, Martin R. Oswald, Fatemeh Karimi Nejadasl, Theo Gevers

To leverage the different properties of each branch, we employ a geometry-aware fusion module that is learned to combine the results of each branch.

Scene Segmentation

Intrinsic Image Decomposition Using Point Cloud Representation

1 code implementation20 Jul 2023 Xiaoyan Xing, Konrad Groh, Sezer Karaoglu, Theo Gevers

The purpose of intrinsic decomposition is to separate an image into its albedo (reflective properties) and shading components (illumination properties).

Intrinsic Image Decomposition

Interactive Learning of Intrinsic and Extrinsic Properties for All-day Semantic Segmentation

1 code implementation IEEE Transactions on Image Processing 2023 Qi Bi, ShaoDi You, Theo Gevers

In this paper, in contrast to existing methods, we tackle this challenge from the perspective of image formulation itself, where the image appearance is determined by both intrinsic (e. g., semantic category, structure) and extrinsic (e. g., lighting) properties.

All-day Semantic Segmentation Domain Adaptation +1

Learning Content-enhanced Mask Transformer for Domain Generalized Urban-Scene Segmentation

1 code implementation1 Jul 2023 Qi Bi, ShaoDi You, Theo Gevers

Unlike domain gap challenges, USSS is unique in that the semantic categories are often similar in different urban scenes, while the styles can vary significantly due to changes in urban landscapes, weather conditions, lighting, and other factors.

Decoder Domain Generalization +4

MorphPool: Efficient Non-linear Pooling & Unpooling in CNNs

1 code implementation25 Nov 2022 Rick Groenendijk, Leo Dorst, Theo Gevers

Pooling is essentially an operation from the field of Mathematical Morphology, with max pooling as a limited special case.

Decoder

SIGNet: Intrinsic Image Decomposition by a Semantic and Invariant Gradient Driven Network for Indoor Scenes

1 code implementation30 Aug 2022 Partha Das, Sezer Karaoglu, Arjan Gijsenij, Theo Gevers

An ablation study is conducted showing that the use of the proposed priors and progressive CNN increase the IID performance.

Intrinsic Image Decomposition

Multi-person 3D pose estimation from a single image captured by a fisheye camera

no code implementations Computer Vision and Image Understanding 2022 YaHui Zhang, ShaoDi You, Sezer Karaoglu, Theo Gevers

Multi-person 3D pose estimation with absolute depths for a fisheye camera is a challenging task but with valuable applications in daily life, especially for video surveillance.

3D Human Pose Estimation 3D Pose Estimation

Invariant Descriptors for Intrinsic Reflectance Optimization

no code implementations8 Apr 2022 Anil S. Baslamisli, Theo Gevers

We improve upon their model by introducing illumination invariant image descriptors: color ratios.

Intrinsic Image Decomposition

PIE-Net: Photometric Invariant Edge Guided Network for Intrinsic Image Decomposition

1 code implementation CVPR 2022 Partha Das, Sezer Karaoglu, Theo Gevers

An extensive ablation study and large scale experiments are conducted showing that it is beneficial for edge-driven hybrid IID networks to make use of illumination invariant descriptors and that separating global and local cues helps in improving the performance of the network.

Intrinsic Image Decomposition

Spatio-temporal Features for Generalized Detection of Deepfake Videos

no code implementations22 Oct 2020 Ipek Ganiyusufoglu, L. Minh Ngô, Nedko Savov, Sezer Karaoglu, Theo Gevers

In this paper, we empirically show that existing approaches on image and sequence classifiers generalize poorly to new manipulation techniques.

DeepFake Detection Face Swapping

Novel View Synthesis from Single Images via Point Cloud Transformation

no code implementations17 Sep 2020 Hoang-An Le, Thomas Mensink, Partha Das, Theo Gevers

In this paper the argument is made that for true novel view synthesis of objects, where the object can be synthesized from any viewpoint, an explicit 3D shape representation isdesired.

3D Shape Representation Novel View Synthesis

Physics-based Shading Reconstruction for Intrinsic Image Decomposition

no code implementations3 Sep 2020 Anil S. Baslamisli, Yang Liu, Sezer Karaoglu, Theo Gevers

We investigate the use of photometric invariance and deep learning to compute intrinsic images (albedo and shading).

Intrinsic Image Decomposition

Multi-Loss Weighting with Coefficient of Variations

1 code implementation3 Sep 2020 Rick Groenendijk, Sezer Karaoglu, Theo Gevers, Thomas Mensink

In this paper, we propose a weighting scheme based on the coefficient of variations and set the weights based on properties observed while training the model.

Monocular Depth Estimation Multi-Task Learning +1

Kinship Identification through Joint Learning Using Kinship Verification Ensembles

1 code implementation ECCV 2020 Wei Wang, ShaoDi You, Sezer Karaoglu, Theo Gevers

The experiments further show significant performance improvement of kinship verification when trained on the same dataset with more realistic distributions.

Kinship Verification

On the Benefit of Adversarial Training for Monocular Depth Estimation

1 code implementation29 Oct 2019 Rick Groenendijk, Sezer Karaoglu, Theo Gevers, Thomas Mensink

For the quality of the image reconstruction and disparity prediction, a combination of different losses is used, including L1 image reconstruction losses and left-right disparity smoothness.

Depth Prediction Generative Adversarial Network +2

Improving Face Detection Performance with 3D-Rendered Synthetic Data

no code implementations18 Dec 2018 Jian Han, Sezer Karaoglu, Hoang-An Le, Theo Gevers

In this paper, we provide a synthetic data generator methodology with fully controlled, multifaceted variations based on a new 3D face dataset (3DU-Face).

Data Augmentation Face Detection

Color Constancy by GANs: An Experimental Survey

no code implementations7 Dec 2018 Partha Das, Anil S. Baslamisli, Yang Liu, Sezer Karaoglu, Theo Gevers

In this paper, we formulate the color constancy task as an image-to-image translation problem using GANs.

Color Constancy Image-to-Image Translation +2

Automatic Generation of Dense Non-rigid Optical Flow

1 code implementation5 Dec 2018 Hoàng-Ân Lê, Tushar Nimbhorkar, Thomas Mensink, Anil S. Baslamisli, Sezer Karaoglu, Theo Gevers

There hardly exists any large-scale datasets with dense optical flow of non-rigid motion from real-world imagery as of today.

Optical Flow Estimation

Three for one and one for three: Flow, Segmentation, and Surface Normals

1 code implementation19 Jul 2018 Hoang-An Le, Anil S. Baslamisli, Thomas Mensink, Theo Gevers

Optical flow, semantic segmentation, and surface normals represent different information modalities, yet together they bring better cues for scene understanding problems.

Optical Flow Estimation Scene Understanding +2

CNN based Learning using Reflection and Retinex Models for Intrinsic Image Decomposition

no code implementations CVPR 2018 Anil S. Baslamisli, Hoang-An Le, Theo Gevers

On the other hand, recent research use deep learning models as in-and-out black box and do not consider the well-established, traditional image formation process as the basis of their intrinsic learning process.

Deep Learning Intrinsic Image Decomposition

Detect2Rank : Combining Object Detectors Using Learning to Rank

no code implementations26 Dec 2014 Sezer Karaoglu, Yang Liu, Theo Gevers

Experiments on the PASCAL VOC07 and VOC10 datasets show that the proposed method significantly outperforms single object detectors, DPM (8. 4%), CN (6. 8%) and EES (17. 0%) on VOC07 and DPM (6. 5%), CN (5. 5%) and EES (16. 2%) on VOC10.

Learning-To-Rank Object +2

Road Detection by One-Class Color Classification: Dataset and Experiments

no code implementations11 Dec 2014 Jose M. Alvarez, Theo Gevers, Antonio M. Lopez

These algorithms reduce the effect of lighting variations and weather conditions by exploiting the discriminant/invariant properties of different color representations.

Autonomous Driving Classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.