1 code implementation • 6 Oct 2023 • Qingguo Liu, Pan Gao, Kang Han, Ningzhong Liu, Wei Xiang
In particular, we integrate both CNN and Transformer components into the SR network, where we first use the CNN modulated by the degradation information to extract local features, and then employ the degradation-aware Transformer to extract global semantic features.
no code implementations • 15 Jun 2023 • Lu Yu, Wei Xiang, Kang Han
To address this challenge, we propose the Edit-DiffNeRF framework, which is composed of a frozen diffusion model, a proposed delta module to edit the latent semantic space of the diffusion model, and a NeRF.
1 code implementation • CVPR 2023 • Kang Han, Wei Xiang
Instead of encoding view directions to model view-dependent effects, we further propose to encode the rendering equation in the feature space by employing the anisotropic spherical Gaussian mixture predicted from the proposed multiscale representation.
no code implementations • 13 May 2022 • Jianan Liu, Hao Li, Tao Huang, Euijoon Ahn, Kang Han, Adeel Razi, Wei Xiang, Jinman Kim, David Dagan Feng
However, the difference in degradation representations between synthetic and authentic LR images suppresses the quality of SR images reconstructed from authentic LR images.
no code implementations • 19 Sep 2018 • Kang Han, Chen Defeng
Compared with traditional methods, the segmentation algorithms based on fully convolutional neural network greatly improve the accuracy of semantic segmentation.
no code implementations • 26 Oct 2017 • Haiyan Yao, Kang Han, Wanggen Wan, Li Hou
Computer vision techniques have been used to produce accurate and generic crowd count estimators in recent years.
1 code implementation • 12 Jun 2017 • Kang Han, Wanggen Wan, Haiyan Yao, Li Hou
In this paper, we propose a method called Convolutional Neural Network-Markov Random Field (CNN-MRF) to estimate the crowd count in a still image.