no code implementations • ECCV 2020 • Qili Deng, Ziling Huang, Chung-Chi Tsai, Chia-Wen Lin
In this paper, we present a Haze-Aware Representation Distillation Generative Adversarial Network named HardGAN for single-image dehazing.
no code implementations • 16 Sep 2024 • Hao-Chiang Shao, Guan-Yu Chen, Yu-Hsien Lin, Chia-Wen Lin, Shao-Yun Fang, Pin-Yian Tsai, Yan-Hsiu Liu
However, by taking an object detection network as the backbone, recent learning-based hotspot detectors learn to recognize only the problematic layout patterns in the training data.
no code implementations • 5 Sep 2024 • Jingcheng Ke, Dele Wang, Jun-Cheng Chen, I-Hong Jhuo, Chia-Wen Lin, Yen-Yu Lin
Extensive experimental results on the RefCOCO, RefCOCO+, RefCOCOg, Flickr30K, RefClef, and Ref-reasoning datasets demonstrate the effectiveness of the DGC module and the EGR strategy in consistently boosting the performances of various graph-based REC methods.
no code implementations • 2 Sep 2024 • Taorong Liu, Jing Xiao, Liang Liao, Chia-Wen Lin
Online Domain Adaptation (OnDA) is designed to handle unforeseeable domain changes at minimal cost that occur during the deployment of the model, lacking clear boundaries between the domain, such as sudden weather events.
no code implementations • 1 Sep 2024 • Xujie Wan, Wenjie Li, Guangwei Gao, Huimin Lu, Jian Yang, Chia-Wen Lin
Recently, CNN and Transformer hybrid networks demonstrated excellent performance in face super-resolution (FSR) tasks.
1 code implementation • 12 Jul 2024 • Jin-Ting He, Fu-Jen Tsai, Jia-Hao Wu, Yan-Tsung Peng, Chung-Chi Tsai, Chia-Wen Lin, Yen-Yu Lin
Next, we utilize a blurring model to produce blurred images based on the pseudo-sharp images extracted during testing.
1 code implementation • 8 Jul 2024 • Chunwei Tian, Menghua Zheng, Chia-Wen Lin, Zhiwu Li, David Zhang
To make a tradeoff between distance modeling and denoising time, we propose a heterogeneous window transformer (HWformer) for image denoising.
1 code implementation • 5 Jul 2024 • Wengyi Zhan, Mingbao Lin, Chia-Wen Lin, Rongrong Ji
As a contrast to off-the-shelf methods that solve SR tasks across various scales with the same computing costs, our AnySR innovates in: 1) building arbitrary-scale tasks as any-resource implementation, reducing resource requirements for smaller scales without additional parameters; 2) enhancing any-scale performance in a feature-interweaving fashion, inserting scale pairs into features at regular intervals and ensuring correct feature/scale processing.
1 code implementation • 8 May 2024 • Yi Xiao, Qiangqiang Yuan, Kui Jiang, Yuzeng Chen, Qiang Zhang, Chia-Wen Lin
To alleviate these issues, we develop the first attempt to integrate the Vision State Space Model (Mamba) for RSI-SR, which specializes in processing large-scale RSI by capturing long-range dependency with linear complexity.
no code implementations • 7 May 2024 • Ammarah Hashmi, Sahibzada Adil Shahzad, Chia-Wen Lin, Yu Tsao, Hsin-Min Wang
Can we humans correctly perceive the authenticity of the content of the videos we watch?
no code implementations • 14 Apr 2024 • Chih-Ling Chang, Fu-Jen Tsai, Zi-Ling Huang, Lin Gu, Chia-Wen Lin
Image dehazing faces challenges when dealing with hazy images in real-world scenarios.
1 code implementation • 9 Mar 2024 • Chunwei Tian, Menghua Zheng, Tiancai Jiao, WangMeng Zuo, Yanning Zhang, Chia-Wen Lin
Popular convolutional neural networks mainly use paired images in a supervised way for image watermark removal.
no code implementations • 2 Mar 2024 • Shufan Pei, Junhong Lin, Wenxi Liu, Tiesong Zhao, Chia-Wen Lin
Thereby, we obtain an image free of low light and light effects, which improves the performance of nighttime object detection.
1 code implementation • 24 Feb 2024 • Chunwei Tian, Xuanyu Zhang, Tao Wang, WangMeng Zuo, Yanning Zhang, Chia-Wen Lin
The lower network utilizes a symmetric architecture to enhance relations of different layers to mine more structural information, which is complementary with a upper network for image super-resolution.
no code implementations • 22 Dec 2023 • Fu-Jen Tsai, Yan-Tsung Peng, Chen-Yu Chang, Chan-Yu Li, Yen-Yu Lin, Chung-Chi Tsai, Chia-Wen Lin
Besides, ViStripformer is an effective and efficient transformer architecture with much lower memory usage than the vanilla transformer.
1 code implementation • CVPR 2024 • Jia-Hao Wu, Fu-Jen Tsai, Yan-Tsung Peng, Chung-Chi Tsai, Chia-Wen Lin, Yen-Yu Lin
Since continuous motion causes blurred artifacts during image exposure, we aspire to develop a groundbreaking blur augmentation method to generate diverse blurred images by simulating motion trajectories in a continuous space.
Ranked #2 on Deblurring on RealBlur-J
no code implementations • 19 Oct 2023 • Ammarah Hashmi, Sahibzada Adil Shahzad, Chia-Wen Lin, Yu Tsao, Hsin-Min Wang
For a detailed analysis, we evaluate AVTENet, its variants, and several existing methods on multiple test sets of the FakeAVCeleb dataset.
1 code implementation • 27 Sep 2023 • Wenjie Li, Mei Wang, Kai Zhang, Juncheng Li, Xiaoming Li, Yuhang Zhang, Guangwei Gao, Weihong Deng, Chia-Wen Lin
We also discuss notable benchmarks commonly utilized in the field.
1 code implementation • 20 Jun 2023 • Liang Liao, Taorong Liu, Delin Chen, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh
For precise utilization of the reference features for guidance, a reference-patch alignment (Ref-PA) module is proposed to align the patch features of the reference and corrupted images and harmonize their style differences, while a reference-patch transformer (Ref-PT) module is proposed to refine the embedded reference feature.
no code implementations • 28 Apr 2023 • Weng-Tai Su, Yi-Chun Hung, Po-Jen Yu, Shang-Hua Yang, Chia-Wen Lin
Terahertz (THz) tomographic imaging has recently attracted significant attention thanks to its non-invasive, non-destructive, non-ionizing, material-classification, and ultra-fast nature for object exploration and inspection.
2 code implementations • 10 Apr 2023 • Yi Xiao, Qiangqiang Yuan, Kui Jiang, Xianyu Jin, Jiang He, Liangpei Zhang, Chia-Wen Lin
To explore the global dependency in the entire frame sequence, a Long-term Temporal Difference Module (L-TDM) is proposed, where the differences between forward and backward segments are incorporated and activated to guide the modulation of the temporal feature, leading to a holistic global compensation.
no code implementations • CVPR 2023 • Haoqian Wu, Keyu Chen, Haozhe Liu, Mingchen Zhuge, Bing Li, Ruizhi Qiao, Xiujun Shu, Bei Gan, Liangsheng Xu, Bo Ren, Mengmeng Xu, Wentian Zhang, Raghavendra Ramachandra, Chia-Wen Lin, Bernard Ghanem
Temporal video segmentation is the get-to-go automatic video analysis, which decomposes a long-form video into smaller components for the following-up understanding tasks.
1 code implementation • 29 Dec 2022 • Wenjie Li, Juncheng Li, Guangwei Gao, Weihong Deng, Jian Yang, Guo-Jun Qi, Chia-Wen Lin
Lightweight image super-resolution aims to reconstruct high-resolution images from low-resolution images using low computational costs.
1 code implementation • 14 Oct 2022 • Po-Sheng Liu, Fu-Jen Tsai, Yan-Tsung Peng, Chung-Chi Tsai, Chia-Wen Lin, Yen-Yu Lin
Most previous deblurring methods were built with a generic model trained on blurred images and their sharp counterparts.
1 code implementation • 26 Sep 2022 • Chunwei Tian, Yanning Zhang, WangMeng Zuo, Chia-Wen Lin, David Zhang, Yixuan Yuan
To prevent loss of original information, a multi-level enhancement mechanism guides a CNN to achieve a symmetric architecture for promoting expressive ability of HGSRCNN.
1 code implementation • 21 Jul 2022 • Kui Jiang, Zhongyuan Wang, Chen Chen, Zheng Wang, Laizhong Cui, Chia-Wen Lin
Convolutional neural network (CNN) and Transformer have achieved great success in multimedia applications.
1 code implementation • 10 Jun 2022 • Liang Liao, WenYi Chen, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh
Specifically, based on the two discoveries of local spatial similarity and adjacent temporal correspondence of the sequential image data, we propose a novel Target-Domain driven pseudo label Diffusion (TDo-Dif) scheme.
1 code implementation • 29 May 2022 • Chunwei Tian, Yixuan Yuan, Shichao Zhang, Chia-Wen Lin, WangMeng Zuo, David Zhang
In this paper, we present an enhanced super-resolution group CNN (ESRGCNN) with a shallow architecture by fully fusing deep and wide channel features to extract more accurate low-frequency information in terms of correlations of different channels in single image super-resolution (SISR).
no code implementations • 30 Apr 2022 • Weng-Tai Su, Yi-Chun Hung, Po-Jen Yu, Chia-Wen Lin, Shang-Hua Yang
Visualizing information inside objects is an ever-lasting need to bridge the world from physics, chemistry, biology to computation.
no code implementations • 28 Apr 2022 • Chunwei Tian, Xuanyu Zhang, Jerry Chun-Wei Lin, WangMeng Zuo, Yanning Zhang, Chia-Wen Lin
Second, we present popular architectures for GANs in big and small samples for image applications.
2 code implementations • 10 Apr 2022 • Fu-Jen Tsai, Yan-Tsung Peng, Yen-Yu Lin, Chung-Chi Tsai, Chia-Wen Lin
Images taken in dynamic scenes may contain unwanted motion blur, which significantly degrades visual quality.
Ranked #8 on Deblurring on RealBlur-R
1 code implementation • 15 Feb 2022 • Mingbao Lin, Liujuan Cao, Yuxin Zhang, Ling Shao, Chia-Wen Lin, Rongrong Ji
Then, we introduce a recommendation-based filter selection scheme where each filter recommends a group of its closest filters.
no code implementations • 24 Jan 2022 • Hao-Chiang Shao, Hsing-Lei Ping, Kuo-shiuan Chen, Weng-Tai Su, Chia-Wen Lin, Shao-Yun Fang, Pin-Yian Tsai, Yan-Hsiu Liu
To address the problem, we propose a deep learning-based layout novelty detection scheme to identify novel (unseen) layout patterns, which cannot be well predicted by a pre-trained pre-simulation model.
no code implementations • CVPR 2022 • Xianzheng Ma, Zhixiang Wang, Yacheng Zhan, Yinqiang Zheng, Zheng Wang, Dengxin Dai, Chia-Wen Lin
Unlike previous methods that mainly focus on closing the domain gap caused by fog -- defogging the foggy images or fogging the clear images, we propose to alleviate the domain gap by considering fog influence and style variation simultaneously.
Ranked #4 on Domain Adaptation on Cityscapes-to-FoggyZurich
no code implementations • 21 Oct 2021 • Sadid Sahami, Gene Cheung, Chia-Wen Lin
We prove that, after partitioning $\mathcal{G}$ into $Q$ sub-graphs $\{\mathcal{G}^q\}^Q_{q=1}$, the smallest Gershgorin circle theorem (GCT) lower bound of $Q$ corresponding coefficient matrices -- $\min_q \lambda^-_{\min}(\mathbf{B}^q)$ -- is a lower bound for $\lambda_{\min}(\mathbf{B})$.
1 code implementation • CVPR 2021 • Qiong Wu, Pingyang Dai, Jie Chen, Chia-Wen Lin, Yongjian Wu, Feiyue Huang, Bineng Zhong, Rongrong Ji
In this paper, we propose a joint Modality and Pattern Alignment Network (MPANet) to discover cross-modality nuances in different patterns for visible-infrared person Re-ID, which introduces a modality alleviation module and a pattern alignment module to jointly extract discriminative features.
1 code implementation • 24 Apr 2021 • Yuxin Zhang, Mingbao Lin, Chia-Wen Lin, Jie Chen, Feiyue Huang, Yongjian Wu, Yonghong Tian, Rongrong Ji
Specifically, to model the contribution of each channel to differentiating categories, we develop a class-wise mask for each channel, implemented in a dynamic training manner w. r. t.
1 code implementation • 25 Mar 2021 • Chunwei Tian, Yong Xu, WangMeng Zuo, Chia-Wen Lin, David Zhang
In this paper, we propose an asymmetric CNN (ACNet) comprising an asymmetric block (AB), a memory enhancement block (MEB) and a high-frequency feature enhancement block (HFFEB) for image super-resolution.
1 code implementation • 19 Mar 2021 • Kui Jiang, Zhongyuan Wang, Zheng Wang, Chen Chen, Peng Yi, Tao Lu, Chia-Wen Lin
Different from existing methods tending to accomplish the relighting task directly by ignoring the fidelity and naturalness recovery, we investigate the intrinsic degradation and relight the low-light image while refining the details and color in two steps.
no code implementations • 13 Mar 2021 • Hao-Chiang Shao, Hsin-Chieh Wang, Weng-Tai Su, Chia-Wen Lin
Here we focus on the problem that noisy labels are primarily mislabeled samples, which tend to be concentrated near decision boundaries, rather than uniformly distributed, and whose features should be equivocal.
3 code implementations • 24 Feb 2021 • Bing Li, Yuanlue Zhu, Yitong Wang, Chia-Wen Lin, Bernard Ghanem, Linlin Shen
Specifically, a new generator architecture is proposed to simultaneously transfer color/texture styles and transform local facial shapes into anime-like counterparts based on the style of a reference anime-face, while preserving the global structure of the source photo-face.
2 code implementations • 16 Feb 2021 • Mingbao Lin, Rongrong Ji, Zihan Xu, Baochang Zhang, Fei Chao, Chia-Wen Lin, Ling Shao
In this paper, we show that our weight binarization provides an analytical solution by encoding high-magnitude weights into +1s, and 0s otherwise.
1 code implementation • 19 Jan 2021 • Fu-Jen Tsai, Yan-Tsung Peng, Yen-Yu Lin, Chung-Chi Tsai, Chia-Wen Lin
Image motion blur results from a combination of object motions and camera shakes, and such blurring effect is generally directional and non-uniform.
Ranked #10 on Deblurring on RealBlur-R
1 code implementation • 16 Jan 2021 • Yunpeng Luo, Jiayi Ji, Xiaoshuai Sun, Liujuan Cao, Yongjian Wu, Feiyue Huang, Chia-Wen Lin, Rongrong Ji
Descriptive region features extracted by object detection networks have played an important role in the recent advancements of image captioning.
no code implementations • ICCV 2021 • Bing Li, Chia-Wen Lin, Cheng Zheng, Shan Liu, Junsong Yuan, Bernard Ghanem, C.-C. Jay Kuo
In the second stage, we derive another warping model to refine warping results in less important regions by eliminating serious distortions in shape, disparity and 3D structure.
Vocal Bursts Intensity Prediction Vocal Bursts Valence Prediction
no code implementations • CVPR 2021 • Liang Liao, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh
In this paper, we introduce coherence priors between the semantics and textures which make it possible to concentrate on completing separate textures in a semantic-wise manner.
1 code implementation • 28 Oct 2020 • Hao-Chiang Shao, Ya-Jen Cheng, Meng-Yun Duh, Chia-Wen Lin
Recently, falsified images have been found in papers involved in research misconducts.
2 code implementations • NeurIPS 2020 • Mingbao Lin, Rongrong Ji, Zihan Xu, Baochang Zhang, Yan Wang, Yongjian Wu, Feiyue Huang, Chia-Wen Lin
In this paper, for the first time, we explore the influence of angular bias on the quantization error and then introduce a Rotated Binary Neural Network (RBNN), which considers the angle alignment between the full-precision weight vector and its binarized version.
no code implementations • 5 Aug 2020 • Wei Hu, Jiahao Pang, Xian-Ming Liu, Dong Tian, Chia-Wen Lin, Anthony Vetro
Geometric data acquired from real-world scenes, e. g., 2D depth images, 3D point clouds, and 4D dynamic point clouds, have found a wide range of applications including immersive telepresence, autonomous driving, surveillance, etc.
1 code implementation • 21 Jul 2020 • Xian Zhong, Cheng Gu, Wenxin Huang, Lin Li, Shuqin Chen, Chia-Wen Lin
As a result, a meta-learner cannot be trained well in a high-dimensional parameter space to generalize to new tasks.
Ranked #18 on Few-Shot Image Classification on FC100 5-way (5-shot)
1 code implementation • 8 Jul 2020 • Chunwei Tian, Ruibin Zhuge, Zhihao Wu, Yong Xu, WangMeng Zuo, Chen Chen, Chia-Wen Lin
Finally, the IRB uses coarse high-frequency features from the RB to learn more accurate SR features and construct a SR image.
Ranked #55 on Image Super-Resolution on Set14 - 4x upscaling
1 code implementation • 8 Jul 2020 • Chunwei Tian, Yong Xu, WangMeng Zuo, Bo Du, Chia-Wen Lin, David Zhang
The enhancement block gathers and fuses the global and local features to provide complementary information for the latter network.
no code implementations • ECCV 2020 • Liang Liao, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh
Completing a corrupted image with correct structures and reasonable textures for a mixed scene remains an elusive challenge.
no code implementations • 23 Feb 2020 • Hao-Chiang Shao, Kang-Yu Liu, Chia-Wen Lin, Jiwen Lu
With their aid, DotFAN can learn a disentangled face representation and effectively generate face images of various facial attributes while preserving the identity of augmented faces.
no code implementations • 11 Feb 2020 • Hao-Chiang Shao, Chao-Yi Peng, Jun-Rei Wu, Chia-Wen Lin, Shao-Yun Fang, Pin-Yen Tsai, Yan-Hsiu Liu
By learning the shape correspondences between pairs of layout design patterns and their scanning electron microscope (SEM) images of the product wafer thereof, given an IC layout pattern, LithoNet can mimic the fabrication process to predict its fabricated circuit shape.
no code implementations • 31 Dec 2019 • Chunwei Tian, Lunke Fei, Wenxian Zheng, Yong Xu, WangMeng Zuo, Chia-Wen Lin
However, there are substantial differences in the various types of deep learning methods dealing with image denoising.
1 code implementation • 7 Dec 2019 • Yiyi Zhou, Rongrong Ji, Gen Luo, Xiaoshuai Sun, Jinsong Su, Xinghao Ding, Chia-Wen Lin, Qi Tian
Referring Expression Comprehension (REC) is an emerging research spot in computer vision, which refers to detecting the target region in an image given an text description.
no code implementations • 9 Oct 2019 • Fuhai Chen, Rongrong Ji, Chengpeng Dai, Xiaoshuai Sun, Chia-Wen Lin, Jiayi Ji, Baochang Zhang, Feiyue Huang, Liujuan Cao
Specially, we propose a novel Structured-Spatial Semantic Embedding model for image deblurring (termed S3E-Deblur), which introduces a novel Structured-Spatial Semantic tree model (S3-tree) to bridge two basic tasks in computer vision: image deblurring (ImD) and image captioning (ImC).
1 code implementation • NeurIPS 2019 • Jie Hu, Rongrong Ji, Shengchuan Zhang, Xiaoshuai Sun, Qixiang Ye, Chia-Wen Lin, Qi Tian
Learning representations with diversified information remains as an open problem.
no code implementations • 13 May 2019 • Ziling Huang, Zheng Wang, Chung-Chi Tsai, Shin'ichi Satoh, Chia-Wen Lin
To gain the superiority of deep learning models, we treat a group as multiple persons and transfer the domain of a labeled ReID dataset to a G-ReID target dataset style to learn single representations.
no code implementations • 2 Feb 2019 • Yuting Yang, Juan Cao, Mingyan Lu, Jintao Li, Chia-Wen Lin
SNQAM performs excellently on predicting quality, presenting interpretable quality score and giving accessible suggestions on how to improve it according to writing guidelines we referred to.
1 code implementation • 22 Jul 2018 • Chih-Chung Hsu, Chia-Wen Lin, Weng-Tai Su, Gene Cheung
Despite generative adversarial networks (GANs) can hallucinate photo-realistic high-resolution (HR) faces from low-resolution (LR) faces, they cannot guarantee preserving the identities of hallucinated HR faces, making the HR faces poorly recognizable.
no code implementations • CVPR 2018 • Bing Li, Chia-Wen Lin, Boxin Shi, Tiejun Huang, Wen Gao, C. -C. Jay Kuo
As compared with traditional video retargeting, stereo video retargeting poses new challenges because stereo video contains the depth information of salient objects and its time dynamics.
no code implementations • 19 May 2017 • Chih-Chung Hsu, Chia-Wen Lin
Given a large unlabeled set of images, how to efficiently and effectively group them into clusters based on extracted visual representations remains a challenging problem.
no code implementations • 10 Feb 2017 • Weng-Tai Su, Gene Cheung, Chia-Wen Lin
Recent advent in graph signal processing (GSP) has led to the development of new graph-based transforms and wavelets for image / video coding, where the underlying graph describes inter-pixel correlations.
no code implementations • 15 Nov 2016 • Gene Cheung, Weng-Tai Su, Yu Mao, Chia-Wen Lin
In response, we derive an optimal perturbation matrix $\boldsymbol{\Delta}$ - based on a fast lower-bound computation of the minimum eigenvalue of $\mathbf{L}$ via a novel application of the Haynsworth inertia additivity formula---so that $\mathbf{L} + \boldsymbol{\Delta}$ is positive semi-definite, resulting in a stable signal prior.