no code implementations • 9 Jan 2025 • Xueyi Ke, Satoshi Tsutsui, Yayun Zhang, Bihan Wen
Ultimately, our work bridges cognitive science and computer vision by analyzing the internal representations of a computational model trained on an infant's visual and linguistic inputs.
1 code implementation • 17 Dec 2024 • Shiqi Huang, Shuting He, Bihan Wen
Instance segmentation algorithms in remote sensing are typically based on conventional methods, limiting their application to seen scenarios and closed-set predictions.
no code implementations • 28 Nov 2024 • Yinjie Zhao, Heng Zhao, Bihan Wen, Yew-Soon Ong, Joey Tianyi Zhou
Video \textbf{sets} have two dimensions of redundancies: within-sample and inter-sample redundancies.
1 code implementation • 17 Sep 2024 • Zixuan Fu, Lanqing Guo, Chong Wang, YuFei Wang, Zhihao LI, Bihan Wen
Recent advancements in deep learning have shown impressive results in image and video denoising, leveraging extensive pairs of noisy and noise-free data for supervision.
1 code implementation • 11 Sep 2024 • Xinrui Wang, Lanqing Guo, Xiyu Wang, Siyu Huang, Bihan Wen
In view of this, inspired by the physical model of shadow formation, we introduce novel soft shadow masks specifically designed for shadow removal.
1 code implementation • 25 Jul 2024 • Kailai Zhou, Lijing Cai, Yibo Wang, Mengya Zhang, Bihan Wen, Qiu Shen, Xun Cao
The integration of miniaturized spectrometers into mobile devices offers new avenues for image quality enhancement and facilitates novel downstream tasks.
no code implementations • 18 Jul 2024 • Shuting He, Henghui Ding, Xudong Jiang, Bihan Wen
Despite significant progress in 3D point cloud segmentation, existing methods primarily address specific tasks and depend on explicit instructions to identify targets, lacking the capability to infer and understand implicit user intentions in a unified framework.
1 code implementation • 11 Jul 2024 • Laniqng Guo, Chong Wang, YuFei Wang, Yi Yu, Siyu Huang, Wenhan Yang, Alex C. Kot, Bihan Wen
In this paper, we are the first to provide a comprehensive survey to cover various aspects ranging from technical details to applications.
1 code implementation • 9 Jul 2024 • Winnie Pang, Xueyi Ke, Satoshi Tsutsui, Bihan Wen
Concept bottleneck models (CBMs), which predict human-interpretable concepts (e. g., nucleus shapes in cell images) before predicting the final output (e. g., cell type), provide insights into the decision-making processes of the model.
no code implementations • 12 Jun 2024 • Zhihao LI, YuFei Wang, Alex Kot, Bihan Wen
Our study reveals that 3D Gaussian Splatting (3DGS) is particularly susceptible to this noise, leading to numerous elongated Gaussian shapes that overfit the noise, thereby significantly degrading reconstruction quality and reducing inference speed, especially in scenarios with limited views.
1 code implementation • 31 May 2024 • YuFei Wang, Zhihao LI, Lanqing Guo, Wenhan Yang, Alex C. Kot, Bihan Wen
Recently, 3D Gaussian Splatting (3DGS) has become a promising framework for novel view synthesis, offering fast rendering speeds and high fidelity.
1 code implementation • 30 May 2024 • Honghao Fu, YuFei Wang, Wenhan Yang, Bihan Wen
To our knowledge, DP-IQA is the first method to apply pre-trained diffusion priors in blind IQA.
no code implementations • 20 May 2024 • Xiyu Wang, YuFei Wang, Satoshi Tsutsui, Weisi Lin, Bihan Wen, Alex C. Kot
Additionally, to mitigate the character confusion of generated results, we propose EpicEvo, a method that customizes a diffusion-based visual story generation model with a single story featuring the new characters seamlessly integrating them into established character dynamics.
no code implementations • 1 May 2024 • Deng Li, Xin Liu, Bohao Xing, Baiqiang Xia, Yuan Zong, Bihan Wen, Heikki Kälviäinen
In contrast, long sequential videos can reveal authentic emotions; 2) Previous studies commonly utilize various signals such as facial, speech, and even sensitive biological signals (e. g., electrocardiogram).
1 code implementation • CVPR 2024 • Chong Wang, Lanqing Guo, YuFei Wang, Hao Cheng, Yi Yu, Bihan Wen
Starting from decomposing the original maximum-a-posteriori problem of accelerated MRI, we present a rigorous derivation of the proposed PDAC framework, which could be further unfolded into an end-to-end trainable network.
no code implementations • 15 Mar 2024 • Chong Wang, Yi Yu, Lanqing Guo, Bihan Wen
This is primarily due to the unique characteristic of spatially varying illumination within shadow images.
1 code implementation • 16 Feb 2024 • Lanqing Guo, Yingqing He, Haoxin Chen, Menghan Xia, Xiaodong Cun, YuFei Wang, Siyu Huang, Yong Zhang, Xintao Wang, Qifeng Chen, Ying Shan, Bihan Wen
Diffusion models have proven to be highly effective in image and video generation; however, they still face composition challenges when generating images of varying sizes due to single-scale training data.
1 code implementation • CVPR 2024 • Haonan Zhang, Longjun Liu, Yuqi Huang, Zhao Yang, Xinyu Lei, Bihan Wen
To address these issues we propose a simple yet effective Category-aware Knowledge Distillation and Pruning (CaKDP) framework for compressing 3D detectors.
1 code implementation • CVPR 2024 • YuFei Wang, Wenhan Yang, Xinyuan Chen, Yaohui Wang, Lanqing Guo, Lap-Pui Chau, Ziwei Liu, Yu Qiao, Alex C. Kot, Bihan Wen
Extensive experiments conducted on synthetic and real-world datasets demonstrate that the proposed method can achieve comparable or even superior performance compared to both previous SOTA methods and the teacher model, in just one sampling step, resulting in a remarkable up to x10 speedup for inference.
no code implementations • 12 Sep 2023 • Purui Zhang, Xingchao Jian, Feng Ji, Wee Peng Tay, Bihan Wen
We recall the notion of a complexon as the limit of a simplicial complex sequence [1].
1 code implementation • ICCV 2023 • YuFei Wang, Yi Yu, Wenhan Yang, Lanqing Guo, Lap-Pui Chau, Alex C. Kot, Bihan Wen
Different from a vanilla diffusion model that has to perform Gaussian denoising, with the injected physics-based exposure model, our restoration process can directly start from a noisy image instead of pure noise.
Ranked #1 on Image Denoising on Image Denoising on SID x300
no code implementations • 9 Jul 2023 • Shulin Tian, YuFei Wang, Renjie Wan, Wenhan Yang, Alex C. Kot, Bihan Wen
In this work, we propose a novel approach to increase the visibility of images captured under low-light environments by removing the in-camera infrared (IR) cut-off filter, which allows for the capture of more photons and results in improved signal-to-noise ratio due to the inclusion of information from the IR spectrum.
1 code implementation • NeurIPS 2023 • Satoshi Tsutsui, Winnie Pang, Bihan Wen
We then annotated ten thousand WBC images with these attributes.
1 code implementation • 21 Jun 2023 • YuFei Wang, Yi Yu, Wenhan Yang, Lanqing Guo, Lap-Pui Chau, Alex C. Kot, Bihan Wen
Besides, we propose a novel design of the context model, which can better predict the order masks of encoding/decoding based on both the sRGB image and the masks of already processed features.
3 code implementations • 15 May 2023 • Yuanzhi Zhu, Kai Zhang, Jingyun Liang, JieZhang Cao, Bihan Wen, Radu Timofte, Luc van Gool
Although diffusion models have shown impressive performance for high-quality image synthesis, their potential to serve as a generative denoiser prior to the plug-and-play IR methods remains to be further explored.
no code implementations • 31 Mar 2023 • Tao Bai, Chen Chen, Lingjuan Lyu, Jun Zhao, Bihan Wen
Recent studies show that models trained by continual learning can achieve the comparable performances as the standard supervised learning and the learning flexibility of continual learning models enables their wide applications in the real world.
no code implementations • 18 Mar 2023 • Xiyu Wang, Yuecong Xu, Jianfei Yang, Bihan Wen, Alex C. Kot
The second module compares the outputs of augmented data from the current model to the outputs of weakly augmented data from the source model, forming a novel consistency regularization on the model to alleviate the accumulation of prediction errors.
no code implementations • 3 Mar 2023 • Ziwang Xu, Lanqing Guo, Shuyan Zhang, Alex C. Kot, Bihan Wen
In this work, we propose a novel unsupervised deep learning framework for the digital staining of cell images using knowledge distillation and generative adversarial networks (GANs).
1 code implementation • 3 Mar 2023 • Satoshi Tsutsui, Zhengyang Su, Bihan Wen
Recognizing the types of white blood cells (WBCs) in microscopic images of human blood smears is a fundamental task in the fields of pathology and hematology.
1 code implementation • CVPR 2023 • YuFei Wang, Yi Yu, Wenhan Yang, Lanqing Guo, Lap-Pui Chau, Alex Kot, Bihan Wen
While raw images exhibit advantages over sRGB images (e. g., linearity and fine-grained quantization level), they are not widely used by common users due to the large storage requirements.
1 code implementation • 11 Feb 2023 • YuFei Wang, Renjie Wan, Wenhan Yang, Bihan Wen, Lap-Pui Chau, Alex C. Kot
Removing image artifacts from the scratched lens protector is inherently challenging due to the occasional flare artifacts and the co-occurring interference within mixed artifacts.
2 code implementations • 3 Feb 2023 • Lanqing Guo, Siyu Huang, Ding Liu, Hao Cheng, Bihan Wen
It is still challenging for the deep shadow removal model to exploit the global contextual correlation between shadow and non-shadow regions.
Ranked #1 on Shadow Removal on ISTD
no code implementations • ICCV 2023 • Lanqing Guo, Chong Wang, Wenhan Yang, YuFei Wang, Bihan Wen
Recent deep learning methods have achieved superior results in shadow removal.
no code implementations • ICCV 2023 • Hao Cheng, Siyuan Yang, Joey Tianyi Zhou, Lanqing Guo, Bihan Wen
Few-shot classification aims to learn a discriminative feature representation to recognize unseen classes with few labeled support samples.
1 code implementation • CVPR 2023 • Zixuan Fu, Lanqing Guo, Bihan Wen
Modeling and synthesizing real noise in the standard RGB (sRGB) domain is challenging due to the complicated noise distribution.
no code implementations • 20 Dec 2022 • Siyu Huang, Tianyang Wang, Haoyi Xiong, Bihan Wen, Jun Huan, Dejing Dou
Inspired by the fact that the samples with higher loss are usually more informative to the model than the samples with lower loss, in this paper we present a novel deep active learning approach that queries the oracle for data annotation when the unlabeled sample is believed to incorporate high loss.
1 code implementation • CVPR 2023 • Lanqing Guo, Chong Wang, Wenhan Yang, Siyu Huang, YuFei Wang, Hanspeter Pfister, Bihan Wen
Recent deep learning methods have achieved promising results in image shadow removal.
Ranked #10 on Shadow Removal on ISTD+
1 code implementation • 5 Oct 2022 • Liangyu Chen, Yutong Bai, Siyu Huang, Yongyi Lu, Bihan Wen, Alan L. Yuille, Zongwei Zhou
However, we uncover a striking contradiction to this promise: active learning fails to select data as efficiently as random selection at the first few choices.
1 code implementation • 13 Sep 2022 • Rongkai Zhang, Cong Zhang, Zhiguang Cao, Wen Song, Puay Siew Tan, Jie Zhang, Bihan Wen, Justin Dauwels
We propose a manager-worker framework based on deep reinforcement learning to tackle a hard yet nontrivial variant of Travelling Salesman Problem (TSP), \ie~multiple-vehicle TSP with time window and rejections (mTSPTWR), where customers who cannot be served before the deadline are subject to rejections.
no code implementations • 25 Jul 2022 • Chong Wang, Rongkai Zhang, Saiprasad Ravishankar, Bihan Wen
To this end, we propose a novel deep reinforcement learning (DRL) based PnP framework, dubbed RePNP, by leveraging a light-weight DRL-based denoiser for robust image restoration tasks.
no code implementations • 10 Jun 2022 • Jieyi Ye, Jiafei Duan, Samson Yu, Bihan Wen, Cheston Tan
How can the most common 1, 000 concepts (89\% of everyday use) be learnt in a naturalistic children's setting?
1 code implementation • 17 Mar 2022 • Zhiyuan Zha, Bihan Wen, Xin Yuan, Saiprasad Ravishankar, Jiantao Zhou, Ce Zhu
Furthermore, we present a unified framework for incorporating various GSR and LR models and discuss the relationship between GSR and LR models.
no code implementations • 10 Jan 2022 • Lanqing Guo, Renjie Wan, Wenhan Yang, Alex Kot, Bihan Wen
Images captured in the low-light condition suffer from low visibility and various imaging artifacts, e. g., real noise.
no code implementations • 11 Nov 2021 • Lanqing Guo, Siyu Huang, Haosen Liu, Bihan Wen
One of the fundamental challenges in image restoration is denoising, where the objective is to estimate the clean image from its noisy measurements.
no code implementations • 15 Oct 2021 • Tao Bai, Jun Zhao, Lanqing Guo, Bihan Wen
Deep learning models are vulnerable to adversarial examples and make incomprehensible mistakes, which puts a threat on their real-world deployment.
1 code implementation • 26 Sep 2021 • Hao Cheng, YuFei Wang, Haoliang Li, Alex C. Kot, Bihan Wen
In this work, we propose a novel Disentangled Feature Representation framework, dubbed DFR, for few-shot learning applications.
no code implementations • 13 Sep 2021 • YuFei Wang, Haoliang Li, Hao Cheng, Bihan Wen, Lap-Pui Chau, Alex C. Kot
Domain generalization aims to learn an invariant model that can generalize well to the unseen target domain.
no code implementations • 10 Sep 2021 • Jiafei Duan, Samson Yu, Soujanya Poria, Bihan Wen, Cheston Tan
However, there is a lack of intuitive physics models that are tested on long physical interaction sequences with multiple interactions among different objects.
Ranked #1 on Semantic Object Interaction Classification on SPACE
1 code implementation • 13 Jul 2021 • Rongkai Zhang, Lanqing Guo, Siyu Huang, Bihan Wen
Low-light image enhancement (LLIE) is a pervasive yet challenging problem, since: 1) low-light measurements may vary due to different imaging conditions in practice; 2) images can be enlightened subjectively according to diverse preferences by each individual.
no code implementations • 12 Jul 2021 • Rongkai Zhang, Jiang Zhu, Zhiyuan Zha, Justin Dauwels, Bihan Wen
To benchmark the effectiveness of reinforcement learning in R3L, we train a recurrent neural network with the same architecture for residual recovery using the deterministic loss, thus to analyze how the two different training strategies affect the denoising performance.
Ranked #1 on Image Denoising on BSD68 sigma30
1 code implementation • 1 Jun 2021 • Hao Cheng, Kim-Hui Yap, Bihan Wen
Recent image classification algorithms, by learning deep features from large-scale datasets, have achieved significantly better results comparing to the classic feature-based approaches.
no code implementations • 1 Mar 2021 • Way Tan, Bihan Wen, Xulei Yang
StyleGAN is one of the state-of-the-art image generators which is well-known for synthesizing high-resolution and hyper-realistic face images.
no code implementations • 2 Feb 2021 • Tao Bai, Jinqi Luo, Jun Zhao, Bihan Wen, Qian Wang
Adversarial training is one of the most effective approaches defending against adversarial examples for deep learning models.
no code implementations • 14 Jan 2021 • Yanjun Li, Bihan Wen, Hao Cheng, Yoram Bresler
In this paper, we propose a supervised dimensionality reduction method that learns linear embeddings jointly for two feature vectors representing data of different modalities or data from distinct types of entities.
no code implementations • 21 Sep 2020 • Tao Bai, Jinnan Chen, Jun Zhao, Bihan Wen, Xudong Jiang, Alex Kot
In this paper, we propose a novel approach called Guided Adversarial Contrastive Distillation (GACD), to effectively transfer adversarial robustness from teacher to student with features.
no code implementations • 2 Aug 2020 • Xuankai Liu, Fengting Li, Bihan Wen, Qi Li
In this paper, we benchmark the robustness of watermarking, and propose a novel backdoor-based watermark removal framework using limited data, dubbed WILD.
1 code implementation • 17 Jul 2020 • Siyu Huang, Haoyi Xiong, Zhi-Qi Cheng, Qingzhong Wang, Xingran Zhou, Bihan Wen, Jun Huan, Dejing Dou
Generation of high-quality person images is challenging, due to the sophisticated entanglements among image factors, e. g., appearance, pose, foreground, background, local details, global structures, etc.
no code implementations • 14 Jul 2020 • Hao Cheng, Joey Tianyi Zhou, Wee Peng Tay, Bihan Wen
Graph Neural Networks (GNN) has demonstrated the superior performance in many challenging applications, including the few-shot learning tasks.
1 code implementation • 24 Jun 2020 • Lanqing Guo, Zhiyuan Zha, Saiprasad Ravishankar, Bihan Wen
Experimental results demonstrate that (1) Self-Convolution can significantly speed up most of the popular non-local image restoration algorithms, with two-fold to nine-fold faster block matching, and (2) the proposed multi-modality image restoration scheme achieves superior denoising results in both efficiency and effectiveness on RGB-NIR images.
no code implementations • 14 Jun 2020 • Zerui Shao, Yi-Fei PU, Jiliu Zhou, Bihan Wen, Yi Zhang
Robust Principal Component Analysis (RPCA), as one of the most popular moving object modelling methods, aims to separate the temporally varying (i. e., moving) foreground objects from the static background in video, assuming the background frames to be low-rank while the foreground to be spatially sparse.
no code implementations • 16 May 2020 • Zhiyuan Zha, Xin Yuan, Joey Tianyi Zhou, Jiantao Zhou, Bihan Wen, Ce Zhu
In this paper, we propose a joint low-rank and deep (LRD) image model, which contains a pair of triply complementary priors, namely \textit{external} and \textit{internal}, \textit{deep} and \textit{shallow}, and \textit{local} and \textit{non-local} priors.
no code implementations • 29 Mar 2020 • Bihan Wen, Yanjun Li, Yuqi Li, Yoram Bresler
Furthermore, we relate the denoising performance improvement by combining multiple models, to the image model relationships.
1 code implementation • 17 Mar 2020 • Siyu Huang, Haoyi Xiong, Tianyang Wang, Bihan Wen, Qingzhong Wang, Zeyu Chen, Jun Huan, Dejing Dou
This paper further presents a real-time feed-forward model to leverage Style Projection for arbitrary image style transfer, which includes a regularization term for matching the semantics between input contents and stylized outputs.
2 code implementations • 22 May 2019 • Sicheng Wang, Bihan Wen, Junru Wu, DaCheng Tao, Zhangyang Wang
Several recent works discussed application-driven image restoration neural networks, which are capable of not only removing noise in images but also preserving their semantic-aware details, making them suitable for various high-level computer vision tasks as the pre-processing step.
no code implementations • 25 Mar 2019 • Bihan Wen, Saiprasad Ravishankar, Luke Pfister, Yoram Bresler
The model could be pre-learned from datasets, or learned simultaneously with the reconstruction, i. e., blind CS (BCS).
1 code implementation • 6 Sep 2018 • Ding Liu, Bihan Wen, Jianbo Jiao, Xian-Ming Liu, Zhangyang Wang, Thomas S. Huang
Second we propose a deep neural network solution that cascades two modules for image denoising and various high-level tasks, respectively, and use the joint loss for updating only the denoising network via back-propagation.
no code implementations • 3 Aug 2018 • Bihan Wen, Yanjun Li, Yoram Bresler
Recent works on adaptive sparse and on low-rank signal modeling have demonstrated their usefulness in various image / video processing applications.
1 code implementation • 6 Jul 2018 • Zhiyuan Zha, Xin Yuan, Bihan Wen, Jiantao Zhou, Jiachao Zhang, Ce Zhu
Towards this end, we first obtain a good reference of the original image groups by using the image NSS prior, and then the rank residual of the image groups between this reference and the degraded image is minimized to achieve a better estimate to the desired image.
1 code implementation • NeurIPS 2018 • Ding Liu, Bihan Wen, Yuchen Fan, Chen Change Loy, Thomas S. Huang
The main contributions of this work are: (1) Unlike existing methods that measure self-similarity in an isolated manner, the proposed non-local module can be flexibly integrated into existing deep networks for end-to-end training to capture deep feature correlation between each location and its neighborhood.
Ranked #1 on Grayscale Image Denoising on Set12 sigma30
1 code implementation • 3 Oct 2017 • Bihan Wen, Saiprasad Ravishankar, Yoram Bresler
Transform learning methods involve cheap computations and have been demonstrated to perform well in applications such as image denoising and medical image reconstruction.
1 code implementation • ICCV 2017 • Bihan Wen, Yanjun Li, Luke Pfister, Yoram Bresler
In this work, we propose a novel video denoising method, based on an online tensor reconstruction scheme with a joint adaptive sparse and low-rank model, dubbed SALT.
no code implementations • 12 Sep 2017 • Zhiyuan Zha, Xin Yuan, Bihan Wen, Jiantao Zhou, Jiachao Zhang, Ce Zhu
Sparse coding has achieved a great success in various image processing tasks.
2 code implementations • 14 Jun 2017 • Ding Liu, Bihan Wen, Xianming Liu, Zhangyang Wang, Thomas S. Huang
Conventionally, image denoising and high-level vision tasks are handled separately in computer vision.
no code implementations • 16 Aug 2016 • Zhiyuan Zha, Bihan Wen, Jiachao Zhang, Jiantao Zhou, Ce Zhu
Inspired by enhancing sparsity of the weighted L1-norm minimization in comparison with L1-norm minimization in sparse representation, we thus explain that WNNM is more effective than NMM.
no code implementations • 9 Jun 2016 • Soumyabrata Dev, Bihan Wen, Yee Hui Lee, Stefan Winkler
Ground-based whole sky cameras have opened up new opportunities for monitoring the earth's atmosphere.
1 code implementation • journals 2016 • Ding Liu, Zhaowen Wang, Bihan Wen, Student Member, Jianchao Yang, Member, Wei Han, and Thomas S. Huang, Fellow, IEEE
We demonstrate that a sparse coding model particularly designed for SR can be incarnated as a neural network with the merit of end-to-end optimization over training data.
no code implementations • 19 Nov 2015 • Bihan Wen, Saiprasad Ravishankar, Yoram Bresler
Features based on sparse representation, especially using the synthesis dictionary model, have been heavily exploited in signal processing and computer vision.