1 code implementation • 3 Feb 2023 • Lanqing Guo, Siyu Huang, Ding Liu, Hao Cheng, Bihan Wen
It is still challenging for the deep shadow removal model to exploit the global contextual correlation between shadow and non-shadow regions.
Ranked #1 on
Shadow Removal
on Adjusted ISTD
no code implementations • 20 Dec 2022 • Siyu Huang, Tianyang Wang, Haoyi Xiong, Bihan Wen, Jun Huan, Dejing Dou
Inspired by the fact that the samples with higher loss are usually more informative to the model than the samples with lower loss, in this paper we present a novel deep active learning approach that queries the oracle for data annotation when the unlabeled sample is believed to incorporate high loss.
no code implementations • CVPR 2023 • Siyu Huang, Jie An, Donglai Wei, Jiebo Luo, Hanspeter Pfister
The mechanism of existing style transfer algorithms is by minimizing a hybrid loss function to push the generated image toward high similarities in both content and style.
no code implementations • CVPR 2023 • Lanqing Guo, Chong Wang, Wenhan Yang, Siyu Huang, YuFei Wang, Hanspeter Pfister, Bihan Wen
Recent deep learning methods have achieved promising results in image shadow removal.
1 code implementation • 5 Oct 2022 • Liangyu Chen, Yutong Bai, Siyu Huang, Yongyi Lu, Bihan Wen, Alan L. Yuille, Zongwei Zhou
However, we uncover a striking contradiction to this promise: active learning fails to select data as efficiently as random selection at the first few choices.
no code implementations • 6 Apr 2022 • Leander Lauenburg, Zudi Lin, Ruihan Zhang, Márcia dos Santos, Siyu Huang, Ignacio Arganda-Carreras, Edward S. Boyden, Hanspeter Pfister, Donglai Wei
Instance segmentation for unlabeled imaging modalities is a challenging but essential task as collecting expert annotation can be expensive and time-consuming.
1 code implementation • 10 Dec 2021 • Tianyang Wang, Xingjian Li, Pengkun Yang, Guosheng Hu, Xiangrui Zeng, Siyu Huang, Cheng-Zhong Xu, Min Xu
In this work, we explore such an impact by theoretically proving that selecting unlabeled data of higher gradient norm leads to a lower upper-bound of test loss, resulting in better test performance.
no code implementations • 11 Nov 2021 • Lanqing Guo, Siyu Huang, Haosen Liu, Bihan Wen
One of the fundamental challenges in image restoration is denoising, where the objective is to estimate the clean image from its noisy measurements.
1 code implementation • 21 Sep 2021 • Yihang Yin, Qingzhong Wang, Siyu Huang, Haoyi Xiong, Xiang Zhang
Most of the existing contrastive learning methods employ pre-defined view generation methods, e. g., node drop or edge perturbation, which usually cannot adapt to input data or preserve the original semantic structures well.
no code implementations • 2 Sep 2021 • Xuhong LI, Haoyi Xiong, Siyu Huang, Shilei Ji, Dejing Dou
Existing interpretation algorithms have found that, even deep models make the same and right predictions on the same image, they might rely on different sets of input features for classification.
1 code implementation • ICCV 2021 • Siyu Huang, Tianyang Wang, Haoyi Xiong, Jun Huan, Dejing Dou
To lower the cost of data annotation, active learning has been proposed to interactively query an oracle to annotate a small proportion of informative samples in an unlabeled dataset.
1 code implementation • 13 Jul 2021 • Rongkai Zhang, Lanqing Guo, Siyu Huang, Bihan Wen
Low-light image enhancement (LLIE) is a pervasive yet challenging problem, since: 1) low-light measurements may vary due to different imaging conditions in practice; 2) images can be enlightened subjectively according to diverse preferences by each individual.
no code implementations • 20 Jun 2021 • Xuanyu Wu, Xuhong LI, Haoyi Xiong, Xiao Zhang, Siyu Huang, Dejing Dou
Incorporating with a set of randomized strategies for well-designed data transformations over the training set, ContRE adopts classification errors and Fisher ratios on the generated contrastive examples to assess and analyze the generalization performance of deep models in complement with a testing set.
1 code implementation • 19 Apr 2021 • Yihang Yin, Siyu Huang, Xiang Zhang
Deep neural networks (DNNs) have shown superior performances on various multimodal learning problems.
1 code implementation • CVPR 2021 • Jie An, Siyu Huang, Yibing Song, Dejing Dou, Wei Liu, Jiebo Luo
The forward inference projects input images into deep features, while the backward inference remaps deep features back to input images in a lossless and unbiased way.
no code implementations • 1 Jan 2021 • Xuhong LI, Haoyi Xiong, Siyu Huang, Shilei Ji, Yanjie Fu, Dejing Dou
Given any task/dataset, Consensus first obtains the interpretation results using existing tools, e. g., LIME (Ribeiro et al., 2016), for every model in the committee, then aggregates the results from the entire committee and approximates the “ground truth” of interpretations through voting.
no code implementations • Findings of the Association for Computational Linguistics 2020 • Tao Jin, Siyu Huang, Yingming Li, Zhongfei Zhang
Tensor-based fusion methods have been proven effective in multimodal fusion tasks.
no code implementations • 7 Oct 2020 • Fangbo Qin, Jie Qin, Siyu Huang, De Xu
For the novel CPI extraction task, we built the Object Contour Primitives dataset using online public images, and the Robotic Object Contour Measurement dataset using a camera mounted on a robot.
2 code implementations • ECCV 2020 • Siyu Huang, Fangbo Qin, Pengfei Xiong, Ning Ding, Yijia He, Xiao Liu
To realize one-step detection with a faster and more compact model, we introduce the tri-points representation, converting the line segment detection to the end-to-end prediction of a root-point and two endpoints for each line segment.
Ranked #2 on
Line Segment Detection
on York Urban Dataset
(sAP5 metric)
no code implementations • 23 Jul 2020 • Tao Jin, Siyu Huang, Ming Chen, Yingming Li, Zhongfei Zhang
However, video captioning is a multimodal learning problem, and the video features have much redundancy between different time steps.
1 code implementation • 17 Jul 2020 • Siyu Huang, Haoyi Xiong, Zhi-Qi Cheng, Qingzhong Wang, Xingran Zhou, Bihan Wen, Jun Huan, Dejing Dou
Generation of high-quality person images is challenging, due to the sophisticated entanglements among image factors, e. g., appearance, pose, foreground, background, local details, global structures, etc.
no code implementations • 29 May 2020 • Siyu Huang, Wensha Gou, Hongbo Cai, Xiaomeng Li, Qinghua Chen
In addition, we apply the network to reflect the purity of the trade relations among countries.
1 code implementation • 17 Mar 2020 • Siyu Huang, Haoyi Xiong, Tianyang Wang, Bihan Wen, Qingzhong Wang, Zeyu Chen, Jun Huan, Dejing Dou
This paper further presents a real-time feed-forward model to leverage Style Projection for arbitrary image style transfer, which includes a regularization term for matching the semantics between input contents and stylized outputs.
no code implementations • IJCNLP 2019 • Tao Jin, Siyu Huang, Yingming Li, Zhongfei Zhang
This paper addresses the challenging task of video captioning which aims to generate descriptions for video data.
no code implementations • CVPR 2019 • Xingran Zhou, Siyu Huang, Bin Li, Yingming Li, Jiachen Li, Zhongfei Zhang
This paper presents a novel method to manipulate the visual appearance (pose and attribute) of a person image according to natural language descriptions.
no code implementations • 29 Nov 2018 • Siyu Huang, Zhi-Qi Cheng, Xi Li, Xiao Wu, Zhongfei Zhang, Alexander Hauptmann
To tackle this challenge, we present a novel pipeline comprised of an Observer Engine and a Physicist Engine by respectively imitating the actions of an observer and a physicist in the real world.
1 code implementation • 22 Aug 2018 • Siyu Huang, Xi Li, Zhi-Qi Cheng, Zhongfei Zhang, Alexander Hauptmann
In this work, we explore the cross-scale similarity in crowd counting scenario, in which the regions of different scales often exhibit high visual similarity.
no code implementations • 19 Apr 2018 • Siyu Huang, Xi Li, Zhi-Qi Cheng, Zhongfei Zhang, Alexander Hauptmann
A key problem in deep multi-attribute learning is to effectively discover the inter-attribute correlation structures.
no code implementations • 27 Jan 2016 • Siyu Huang, Xi Li, Zhongfei Zhang, Zhouzhou He, Fei Wu, Wei Liu, Jinhui Tang, Yueting Zhuang
The highly effective visual representation and deep context models ensure that our framework makes a deep semantic understanding of the scene and motion pattern, consequently improving the performance of the visual path prediction task.