no code implementations • 20 Mar 2025 • Xi Liu, Chaoyi Zhou, Nanxuan Zhao, Siyu Huang
Differentiable vector graphics (VGs) are widely used in image vectorization and vector synthesis, while existing representations are costly to optimize and struggle to achieve high-quality rendering results for high-resolution images.
no code implementations • 13 Feb 2025 • Chaoyi Zhou, Xi Liu, Feng Luo, Siyu Huang
The framework consists of three stages: (1) a correspondence-aware autoencoding method that enhances the 3D consistency of 2D latent representations, (2) a latent radiance field (LRF) that lifts these 3D-aware 2D representations into 3D space, and (3) a VAE-Radiance Field (VAE-RF) alignment strategy that improves image decoding from the rendered 2D representations.
no code implementations • 21 Oct 2024 • Xi Liu, Chaoyi Zhou, Siyu Huang
Novel-view synthesis aims to generate novel views of a scene from multiple input images or videos, and recent advancements like 3D Gaussian splatting (3DGS) have achieved notable success in producing photorealistic renderings with efficient pipelines.
1 code implementation • 17 Oct 2024 • Yifeng Wang, Xueying Zhan, Siyu Huang
AutoAL consists of two neural nets, named SearchNet and FitNet, which are optimized concurrently under a differentiable bi-level optimization framework.
no code implementations • 17 Oct 2024 • Weiyi Zhang, Jiancheng Yang, Ruoyu Chen, Siyu Huang, Pusheng Xu, Xiaolan Chen, Shanfu Lu, Hongyu Cao, Mingguang He, Danli Shi
Fundus fluorescein angiography (FFA) is crucial for diagnosing and monitoring retinal vascular issues but is limited by its invasive nature and restricted accessibility compared to color fundus (CF) imaging.
no code implementations • 18 Sep 2024 • Yuchang Su, Renping Zhou, Siyu Huang, Xingjian Li, Tianyang Wang, Ziyue Wang, Min Xu
Generalized Category Discovery (GCD) aims to classify inputs into both known and novel categories, a task crucial for open-world scientific discoveries.
1 code implementation • 11 Sep 2024 • Xinrui Wang, Lanqing Guo, Xiyu Wang, Siyu Huang, Bihan Wen
In view of this, inspired by the physical model of shadow formation, we introduce novel soft shadow masks specifically designed for shadow removal.
1 code implementation • 10 Sep 2024 • Danli Shi, Weiyi Zhang, Jiancheng Yang, Siyu Huang, Xiaolan Chen, Mayinuer Yusufu, Kai Jin, Shan Lin, Shunming Liu, Qing Zhang, Mingguang He
Early detection of eye diseases like glaucoma, macular degeneration, and diabetic retinopathy is crucial for preventing vision loss.
1 code implementation • 27 Aug 2024 • Weiyi Zhang, Siyu Huang, Jiancheng Yang, Ruoyu Chen, ZongYuan Ge, Yingfeng Zheng, Danli Shi, Mingguang He
In this work, we pioneer dynamic FFA video generation from static CF images.
1 code implementation • 11 Jul 2024 • Laniqng Guo, Chong Wang, YuFei Wang, Yi Yu, Siyu Huang, Wenhan Yang, Alex C. Kot, Bihan Wen
In this paper, we are the first to provide a comprehensive survey to cover various aspects ranging from technical details to applications.
1 code implementation • 31 May 2024 • Nerea Aranjuelo, Siyu Huang, Ignacio Arganda-Carreras, Luis Unzueta, Oihana Otaegui, Hanspeter Pfister, Donglai Wei
In this work, we present a generative framework to create annotated gaze data by leveraging the benefits of labeled and unlabeled data sources.
no code implementations • 18 May 2024 • Danli Shi, Weiyi Zhang, Xiaolan Chen, Yexin Liu, Jiancheng Yang, Siyu Huang, Yih Chung Tham, Yingfeng Zheng, Mingguang He
EyeFound provides a generalizable solution to improve model performance and lessen the annotation burden on experts, facilitating widespread clinical AI applications for retinal imaging.
1 code implementation • 16 Feb 2024 • Lanqing Guo, Yingqing He, Haoxin Chen, Menghan Xia, Xiaodong Cun, YuFei Wang, Siyu Huang, Yong Zhang, Xintao Wang, Qifeng Chen, Ying Shan, Bihan Wen
Diffusion models have proven to be highly effective in image and video generation; however, they still face composition challenges when generating images of varying sizes due to single-scale training data.
no code implementations • 25 Oct 2023 • Kangxian Xie, Siyu Huang, Sebastian Cajas Ordone, Hanspeter Pfister, Donglai Wei
Deep-learning models have been successful in biomedical image segmentation.
no code implementations • 3 Oct 2023 • Weibin Liao, Haoyi Xiong, Qingzhong Wang, Yan Mo, Xuhong LI, Yi Liu, Zeyu Chen, Siyu Huang, Dejing Dou
In this work, we study a novel self-supervised pre-training pipeline, namely Multi-task Self-super-vised Continual Learning (MUSCLE), for multiple medical imaging tasks, such as classification and segmentation, using X-ray images collected from multiple body parts, including heads, lungs, and bones.
1 code implementation • 26 Jun 2023 • Siyu Huang, Jie An, Donglai Wei, Zudi Lin, Jiebo Luo, Hanspeter Pfister
However, given a UNIT model trained on certain domains, it is difficult for current methods to incorporate new domains because they often need to train the full model on both existing and new domains.
2 code implementations • 3 Feb 2023 • Lanqing Guo, Siyu Huang, Ding Liu, Hao Cheng, Bihan Wen
It is still challenging for the deep shadow removal model to exploit the global contextual correlation between shadow and non-shadow regions.
Ranked #1 on
Shadow Removal
on ISTD
no code implementations • 20 Dec 2022 • Siyu Huang, Tianyang Wang, Haoyi Xiong, Bihan Wen, Jun Huan, Dejing Dou
Inspired by the fact that the samples with higher loss are usually more informative to the model than the samples with lower loss, in this paper we present a novel deep active learning approach that queries the oracle for data annotation when the unlabeled sample is believed to incorporate high loss.
1 code implementation • CVPR 2023 • Siyu Huang, Jie An, Donglai Wei, Jiebo Luo, Hanspeter Pfister
The mechanism of existing style transfer algorithms is by minimizing a hybrid loss function to push the generated image toward high similarities in both content and style.
1 code implementation • CVPR 2023 • Lanqing Guo, Chong Wang, Wenhan Yang, Siyu Huang, YuFei Wang, Hanspeter Pfister, Bihan Wen
Recent deep learning methods have achieved promising results in image shadow removal.
Ranked #9 on
Shadow Removal
on ISTD+
1 code implementation • 5 Oct 2022 • Liangyu Chen, Yutong Bai, Siyu Huang, Yongyi Lu, Bihan Wen, Alan L. Yuille, Zongwei Zhou
However, we uncover a striking contradiction to this promise: active learning fails to select data as efficiently as random selection at the first few choices.
no code implementations • 6 Apr 2022 • Leander Lauenburg, Zudi Lin, Ruihan Zhang, Márcia dos Santos, Siyu Huang, Ignacio Arganda-Carreras, Edward S. Boyden, Hanspeter Pfister, Donglai Wei
Instance segmentation for unlabeled imaging modalities is a challenging but essential task as collecting expert annotation can be expensive and time-consuming.
1 code implementation • 10 Dec 2021 • Tianyang Wang, Xingjian Li, Pengkun Yang, Guosheng Hu, Xiangrui Zeng, Siyu Huang, Cheng-Zhong Xu, Min Xu
In this work, we explore such an impact by theoretically proving that selecting unlabeled data of higher gradient norm leads to a lower upper-bound of test loss, resulting in better test performance.
no code implementations • 11 Nov 2021 • Lanqing Guo, Siyu Huang, Haosen Liu, Bihan Wen
One of the fundamental challenges in image restoration is denoising, where the objective is to estimate the clean image from its noisy measurements.
1 code implementation • 21 Sep 2021 • Yihang Yin, Qingzhong Wang, Siyu Huang, Haoyi Xiong, Xiang Zhang
Most of the existing contrastive learning methods employ pre-defined view generation methods, e. g., node drop or edge perturbation, which usually cannot adapt to input data or preserve the original semantic structures well.
no code implementations • 2 Sep 2021 • Xuhong LI, Haoyi Xiong, Siyu Huang, Shilei Ji, Dejing Dou
Existing interpretation algorithms have found that, even deep models make the same and right predictions on the same image, they might rely on different sets of input features for classification.
1 code implementation • ICCV 2021 • Siyu Huang, Tianyang Wang, Haoyi Xiong, Jun Huan, Dejing Dou
To lower the cost of data annotation, active learning has been proposed to interactively query an oracle to annotate a small proportion of informative samples in an unlabeled dataset.
1 code implementation • 13 Jul 2021 • Rongkai Zhang, Lanqing Guo, Siyu Huang, Bihan Wen
Low-light image enhancement (LLIE) is a pervasive yet challenging problem, since: 1) low-light measurements may vary due to different imaging conditions in practice; 2) images can be enlightened subjectively according to diverse preferences by each individual.
no code implementations • 20 Jun 2021 • Xuanyu Wu, Xuhong LI, Haoyi Xiong, Xiao Zhang, Siyu Huang, Dejing Dou
Incorporating with a set of randomized strategies for well-designed data transformations over the training set, ContRE adopts classification errors and Fisher ratios on the generated contrastive examples to assess and analyze the generalization performance of deep models in complement with a testing set.
1 code implementation • 19 Apr 2021 • Yihang Yin, Siyu Huang, Xiang Zhang
Deep neural networks (DNNs) have shown superior performances on various multimodal learning problems.
1 code implementation • CVPR 2021 • Jie An, Siyu Huang, Yibing Song, Dejing Dou, Wei Liu, Jiebo Luo
The forward inference projects input images into deep features, while the backward inference remaps deep features back to input images in a lossless and unbiased way.
no code implementations • 1 Jan 2021 • Xuhong LI, Haoyi Xiong, Siyu Huang, Shilei Ji, Yanjie Fu, Dejing Dou
Given any task/dataset, Consensus first obtains the interpretation results using existing tools, e. g., LIME (Ribeiro et al., 2016), for every model in the committee, then aggregates the results from the entire committee and approximates the “ground truth” of interpretations through voting.
no code implementations • Findings of the Association for Computational Linguistics 2020 • Tao Jin, Siyu Huang, Yingming Li, Zhongfei Zhang
Tensor-based fusion methods have been proven effective in multimodal fusion tasks.
no code implementations • 7 Oct 2020 • Fangbo Qin, Jie Qin, Siyu Huang, De Xu
For the novel CPI extraction task, we built the Object Contour Primitives dataset using online public images, and the Robotic Object Contour Measurement dataset using a camera mounted on a robot.
2 code implementations • ECCV 2020 • Siyu Huang, Fangbo Qin, Pengfei Xiong, Ning Ding, Yijia He, Xiao Liu
To realize one-step detection with a faster and more compact model, we introduce the tri-points representation, converting the line segment detection to the end-to-end prediction of a root-point and two endpoints for each line segment.
Ranked #6 on
Line Segment Detection
on York Urban Dataset
no code implementations • 23 Jul 2020 • Tao Jin, Siyu Huang, Ming Chen, Yingming Li, Zhongfei Zhang
However, video captioning is a multimodal learning problem, and the video features have much redundancy between different time steps.
1 code implementation • 17 Jul 2020 • Siyu Huang, Haoyi Xiong, Zhi-Qi Cheng, Qingzhong Wang, Xingran Zhou, Bihan Wen, Jun Huan, Dejing Dou
Generation of high-quality person images is challenging, due to the sophisticated entanglements among image factors, e. g., appearance, pose, foreground, background, local details, global structures, etc.
no code implementations • 29 May 2020 • Siyu Huang, Wensha Gou, Hongbo Cai, Xiaomeng Li, Qinghua Chen
In addition, we apply the network to reflect the purity of the trade relations among countries.
1 code implementation • 17 Mar 2020 • Siyu Huang, Haoyi Xiong, Tianyang Wang, Bihan Wen, Qingzhong Wang, Zeyu Chen, Jun Huan, Dejing Dou
This paper further presents a real-time feed-forward model to leverage Style Projection for arbitrary image style transfer, which includes a regularization term for matching the semantics between input contents and stylized outputs.
no code implementations • IJCNLP 2019 • Tao Jin, Siyu Huang, Yingming Li, Zhongfei Zhang
This paper addresses the challenging task of video captioning which aims to generate descriptions for video data.
no code implementations • CVPR 2019 • Xingran Zhou, Siyu Huang, Bin Li, Yingming Li, Jiachen Li, Zhongfei Zhang
This paper presents a novel method to manipulate the visual appearance (pose and attribute) of a person image according to natural language descriptions.
no code implementations • 29 Nov 2018 • Siyu Huang, Zhi-Qi Cheng, Xi Li, Xiao Wu, Zhongfei Zhang, Alexander Hauptmann
To tackle this challenge, we present a novel pipeline comprised of an Observer Engine and a Physicist Engine by respectively imitating the actions of an observer and a physicist in the real world.
1 code implementation • 22 Aug 2018 • Siyu Huang, Xi Li, Zhi-Qi Cheng, Zhongfei Zhang, Alexander Hauptmann
In this work, we explore the cross-scale similarity in crowd counting scenario, in which the regions of different scales often exhibit high visual similarity.
no code implementations • 19 Apr 2018 • Siyu Huang, Xi Li, Zhi-Qi Cheng, Zhongfei Zhang, Alexander Hauptmann
A key problem in deep multi-attribute learning is to effectively discover the inter-attribute correlation structures.
no code implementations • 27 Jan 2016 • Siyu Huang, Xi Li, Zhongfei Zhang, Zhouzhou He, Fei Wu, Wei Liu, Jinhui Tang, Yueting Zhuang
The highly effective visual representation and deep context models ensure that our framework makes a deep semantic understanding of the scene and motion pattern, consequently improving the performance of the visual path prediction task.