Search Results for author: Shariq Farooq Bhat

Found 9 papers, 6 papers with code

LooseControl: Lifting ControlNet for Generalized Depth Conditioning

no code implementations5 Dec 2023 Shariq Farooq Bhat, Niloy J. Mitra, Peter Wonka

We present LooseControl to allow generalized depth conditioning for diffusion-based image generation.

Attribute Image Generation

LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts

1 code implementation16 Oct 2023 Hanan Gani, Shariq Farooq Bhat, Muzammal Naseer, Salman Khan, Peter Wonka

Diffusion-based generative models have significantly advanced text-to-image generation but encounter challenges when processing lengthy and intricate text prompts describing complex scenes with multiple objects.

Layout-to-Image Generation Object +2

ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth

3 code implementations23 Feb 2023 Shariq Farooq Bhat, Reiner Birkl, Diana Wofk, Peter Wonka, Matthias Müller

Finally, ZoeD-M12-NK is the first model that can jointly train on multiple datasets (NYU Depth v2 and KITTI) without a significant drop in performance and achieve unprecedented zero-shot generalization performance to eight unseen datasets from both indoor and outdoor domains.

Ranked #13 on Monocular Depth Estimation on NYU-Depth V2 (using extra training data)

Monocular Depth Estimation Zero-shot Generalization

LocalBins: Improving Depth Estimation by Learning Local Distributions

1 code implementation28 Mar 2022 Shariq Farooq Bhat, Ibraheem Alhashim, Peter Wonka

We build on AdaBins which estimates a global distribution of depth values for the input image and evolve the architecture in two ways.

Monocular Depth Estimation

Self-Supervised Learning of Domain Invariant Features for Depth Estimation

no code implementations4 Jun 2021 Hiroyasu Akada, Shariq Farooq Bhat, Ibraheem Alhashim, Peter Wonka

Specifically, we extend self-supervised learning from traditional representation learning, which works on images from a single domain, to domain invariant representation learning, which works on images from two different domains by utilizing an image-to-image translation network.

Depth Estimation Domain Adaptation +5

Cannot find the paper you are looking for? You can Submit a new open access paper.