1 code implementation • 11 Oct 2024 • Ling Yang, Zixiang Zhang, Junlin Han, Bohan Zeng, Runjia Li, Philip Torr, Wentao Zhang
To overcome these challenges, we introduce a novel SDS approach, Semantic Score Distillation Sampling (SemanticSDS), designed to effectively improve the expressiveness and accuracy of compositional text-to-3D generation.
no code implementations • 1 Oct 2024 • Junlin Han, Jianyuan Wang, Andrea Vedaldi, Philip Torr, Filippos Kokkinos
We employ a fine-tuned multi-view image diffusion model and a video diffusion model to generate a pool of candidate views, enabling a rich representation of the target 3D object.
no code implementations • 12 Sep 2024 • Runjia Li, Junlin Han, Luke Melas-Kyriazi, Chunyi Sun, Zhaochong An, Zhongrui Gui, Shuyang Sun, Philip Torr, Tomas Jakab
Existing SDS methods often struggle with this generation task due to a limited understanding of part-level semantics in text-to-image diffusion models.
no code implementations • 27 Aug 2024 • Fangjinhua Wang, Qingtian Zhu, Di Chang, Quankai Gao, Junlin Han, Tong Zhang, Richard Hartley, Marc Pollefeys
3D reconstruction aims to recover the dense 3D structure of a scene.
no code implementations • 18 Mar 2024 • Junlin Han, Filippos Kokkinos, Philip Torr
This results in a significant disparity in scale compared to the vast quantities of other types of data.
1 code implementation • 15 Mar 2024 • Zeyu Zhang, Junlin Han, Chenhui Gou, Hongdong Li, Liang Zheng
To address this need, we add controllability to the blind image decomposition process, allowing users to enter which types of degradation to remove or retain.
1 code implementation • 27 Nov 2023 • Haoqin Tu, Chenhang Cui, Zijun Wang, Yiyang Zhou, Bingchen Zhao, Junlin Han, Wangchunshu Zhou, Huaxiu Yao, Cihang Xie
Different from prior studies, we shift our focus from evaluating standard performance to introducing a comprehensive safety evaluation suite, covering both out-of-distribution (OOD) generalization and adversarial robustness.
no code implementations • 19 Oct 2023 • Chunyi Sun, Junlin Han, Weijian Deng, Xinlong Wang, Zishan Qin, Stephen Gould
Our work highlights the potential of LLMs in 3D modeling, offering a basic framework for future advancements in scene generation and animation.
no code implementations • ICCV 2023 • Jie Hong, Zeeshan Hayder, Junlin Han, Pengfei Fang, Mehrtash Harandi, Lars Petersson
Audio-visual zero-shot learning aims to classify samples consisting of a pair of corresponding audio and video sequences from classes that are not present during training.
Ranked #2 on GZSL Video Classification on ActivityNet-GZSL (cls)
no code implementations • 7 Dec 2022 • Chunyi Sun, Yanbin Liu, Junlin Han, Stephen Gould
Specifically, we use a NeRF model to generate numerous image-angle pairs to train an adjustor, which can adjust the StyleGAN latent code to generate high-fidelity stylized images for any given angle.
1 code implementation • 14 Nov 2022 • Junlin Han, Huangying Zhan, Jie Hong, Pengfei Fang, Hongdong Li, Lars Petersson, Ian Reid
This paper studies the problem of measuring and predicting how memorable an image is to pattern recognition machines, as a path to explore machine intelligence.
no code implementations • 2 Aug 2022 • Jie Hong, Pengfei Fang, Weihao Li, Junlin Han, Lars Petersson, Mehrtash Harandi
Learning a latent embedding to understand the underlying nature of data distribution is often formulated in Euclidean spaces with zero curvature.
1 code implementation • 31 May 2022 • Junlin Han, Lars Petersson, Hongdong Li, Ian Reid
We present a simple method, CropMix, for the purpose of producing a rich input distribution from the original dataset distribution.
no code implementations • 23 Mar 2022 • Jie Hong, Weihao Li, Junlin Han, Jiyang Zheng, Pengfei Fang, Mehrtash Harandi, Lars Petersson
In this paper, we present and study a new image segmentation task, called Generalized Open-set Semantic Segmentation (GOSS).
1 code implementation • 28 Jan 2022 • Junlin Han, Pengfei Fang, Weihao Li, Jie Hong, Mohammad Ali Armin, Ian Reid, Lars Petersson, Hongdong Li
We present You Only Cut Once (YOCO) for performing data augmentations.
1 code implementation • 25 Aug 2021 • Junlin Han, Weihao Li, Pengfei Fang, Chunyi Sun, Jie Hong, Mohammad Ali Armin, Lars Petersson, Hongdong Li
We propose and study a novel task named Blind Image Decomposition (BID), which requires separating a superimposed image into constituent underlying images in a blind setting, that is, both the source components involved in mixing as well as the mixing mechanism are unknown.
1 code implementation • 20 Jun 2021 • Junlin Han, Mehrdad Shoeiby, Tim Malthus, Elizabeth Botha, Janet Anstee, Saeed Anwar, Ran Wei, Mohammad Ali Armin, Hongdong Li, Lars Petersson
There are 2000 reference restored images and 6003 original underwater images in the unpaired training set.
3 code implementations • 15 Apr 2021 • Junlin Han, Mehrdad Shoeiby, Lars Petersson, Mohammad Ali Armin
Unsupervised image-to-image translation tasks aim to find a mapping between a source domain X and a target domain Y from unpaired training data.
1 code implementation • 17 Mar 2021 • Junlin Han, Mehrdad Shoeiby, Tim Malthus, Elizabeth Botha, Janet Anstee, Saeed Anwar, Ran Wei, Lars Petersson, Mohammad Ali Armin
Underwater image restoration attracts significant attention due to its importance in unveiling the underwater world.