no code implementations • ECCV 2020 • Wending Yan, Robby T. Tan, Dengxin Dai
Given an RGB foggy nighttime image, our grayscale module takes the grayscale version of the image as input, and decomposes it into high and low frequency layers.
no code implementations • ECCV 2020 • Yuan Liu, Ruoteng Li, Yu Cheng, Robby T. Tan, Xiubao Sui
To facilitate the future prediction ability, we follow three key observations: 1) object motion trajectory is affected significantly by camera motion; 2) the past trajectory of an object can act as a salient cue to estimate the object motion in the spatial domain; 3) previous frames contain the surroundings and appearance of the target object, which is useful for predicting the target object’s future locations.
no code implementations • 22 Aug 2024 • Aishik Nagar, Yutong Liu, Andy T. Liu, Viktor Schlegel, Vijay Prakash Dwivedi, Arun-Kumar Kaliya-Perumal, Guna Pratheep Kalanchiam, Yili Tang, Robby T. Tan
Current methods often sacrifice key information for faithfulness or introduce confabulations when prioritizing informativeness.
no code implementations • 17 Aug 2024 • Xiao Cao, Beibei Lin, Bo wang, Zhiyong Huang, Robby T. Tan
To address these artifacts and enhance robustness, we propose SSNeRF, a sparse view semi supervised NeRF method based on a teacher student framework.
no code implementations • 22 Jul 2024 • Yihao Ai, Yifei Qi, Bo wang, Yu Cheng, Xinchao Wang, Robby T. Tan
Our primary novelty lies in leveraging two complementary-teacher networks to generate more reliable pseudo labels, enabling our model achieves competitive performance on extremely low-light images without the need for training with low-light ground truths.
no code implementations • 23 May 2024 • Yiming Chen, Chen Zhang, Danqing Luo, Luis Fernando D'Haro, Robby T. Tan, Haizhou Li
Specifically, inspired by the recent success of large language models (LLMs) in text generation and evaluation, we adopt strong LLMs as both the data generator and gold evaluator.
no code implementations • CVPR 2024 • Mikhail Kennerley, Jian-Gang Wang, Bharadwaj Veeravalli, Robby T. Tan
We propose Class-Aware Teacher (CAT) to address the class bias issue in the domain adaptation setting.
no code implementations • 12 Mar 2024 • Beibei Lin, Yeying Jin, Wending Yan, Wei Ye, Yuan Yuan, Robby T. Tan
By increasing the noise values to approach as high as the pixel intensity values of the glow and light effect blended images, our augmentation becomes severe, resulting in stronger priors.
no code implementations • 15 Jan 2024 • Xin Yang, Wending Yan, Yuan Yuan, Michael Bi Mi, Robby T. Tan
They struggle to acquire new knowledge while also retaining previously learned knowledge. To address these problems, we propose a semantic segmentation method for multiple adverse weather conditions that incorporates adaptive knowledge acquisition, pseudolabel blending, and weather composition replay.
no code implementations • CVPR 2024 • Shuwei Li, Robby T. Tan
To specifically address the unique lighting conditions of nighttime and ensure the robustness of pseudo labels we propose adaptive channel masking and light uncertainty.
no code implementations • 29 Dec 2023 • Xin Zhang, Jinheng Xie, Yuan Yuan, Michael Bi Mi, Robby T. Tan
Further, to ensure the distinguishability among various regions, we introduce a region-level contrastive clustering loss to pull closer similar regions across images.
no code implementations • 17 Dec 2023 • Qinqian Lei, Bo wang, Robby T. Tan
In our proposed method, we introduce novel label-uncertain query augmentation techniques to enhance the diversity of the query inputs, aiming to distinguish the positive HOI class from the negative ones.
no code implementations • 21 Sep 2023 • Yu Cheng, Bo wang, Robby T. Tan
In 3D human shape and pose estimation from a monocular video, models trained with limited labeled data cannot generalize well to videos with occlusion, which is common in the wild videos.
1 code implementation • ICCV 2023 • Chenxin Xu, Robby T. Tan, Yuhong Tan, Siheng Chen, Xinchao Wang, Yanfeng Wang
To work with auxiliary tasks, we propose a novel auxiliary-adapted transformer, which can handle incomplete, corrupted motion data and achieve coordinate recovery via capturing spatial-temporal dependencies.
Ranked #4 on Human Pose Forecasting on Human3.6M
1 code implementation • 3 Aug 2023 • Yeying Jin, Beibei Lin, Wending Yan, Yuan Yuan, Wei Ye, Robby T. Tan
In this paper, we enhance the visibility from a single nighttime haze image by suppressing glow and enhancing low-light regions.
1 code implementation • 20 May 2023 • Yiming Chen, Simin Chen, Zexin Li, Wei Yang, Cong Liu, Robby T. Tan, Haizhou Li
Despite much success in natural language processing (NLP), pre-trained language models typically lead to a high computational cost during inference.
1 code implementation • CVPR 2023 • Heyuan Li, Bo wang, Yu Cheng, Mohan Kankanhalli, Robby T. Tan
Thanks to the proposed fusion module, our method is robust not only to occlusion and large pitch and roll view angles, which is the benefit of our image space approach, but also to noise and large yaw angles, which is the benefit of our model space method.
Ranked #1 on 3D Face Reconstruction on AFLW2000-3D (Mean NME metric)
1 code implementation • CVPR 2023 • Jiadong Wang, Xinyuan Qian, Malu Zhang, Robby T. Tan, Haizhou Li
To address the problem, we propose using a lip-reading expert to improve the intelligibility of the generated lip regions by penalizing the incorrect generation results.
1 code implementation • CVPR 2023 • Mikhail Kennerley, Jian-Gang Wang, Bharadwaj Veeravalli, Robby T. Tan
Object detection at night is a challenging problem due to the absence of night image annotations.
1 code implementation • CVPR 2023 • Chenxin Xu, Robby T. Tan, Yuhong Tan, Siheng Chen, Yu Guang Wang, Xinchao Wang, Yanfeng Wang
In motion prediction tasks, maintaining motion equivariance under Euclidean geometric transformations and invariance of agent interaction is a critical and fundamental principle.
Ranked #1 on Human Pose Forecasting on HARPER
1 code implementation • ICCV 2023 • Weilong Yan, Robby T. Tan, Bing Zeng, Shuaicheng Liu
In this work, we adopt a more straightforward method to learn deep homography mixture motion between an RS image and its corresponding GS image, without large solution space or strict restrictions on image features.
1 code implementation • 27 Nov 2022 • Yeying Jin, Ruoteng Li, Wenhan Yang, Robby T. Tan
To further enforce the reflectance layer to be independent of shadows and specularities in the second-stage refinement, we introduce an S-Aware network that distinguishes the reflectance image from the input image.
1 code implementation • 24 Nov 2022 • Xin Yang, Michael Bi Mi, Yuan Yuan, Xin Wang, Robby T. Tan
In our DA framework, we retain the depth and background information during the domain feature alignment.
no code implementations • 16 Nov 2022 • Shuwei Li, Jikai Wang, Michael S. Brown, Robby T. Tan
To have better cues of the local surface/light colors under multiple light color conditions, we design a novel multi-task learning framework.
no code implementations • 15 Nov 2022 • Beibei Lin, Chen Liu, Ming Wang, Lincheng Li, Shunli Zhang, Robby T. Tan, Xin Yu
Existing gait recognition frameworks retrieve an identity in the gallery based on the distance between a probe sample and the identities in the gallery.
1 code implementation • 15 Nov 2022 • Yeying Jin, Wei Ye, Wenhan Yang, Yuan Yuan, Robby T. Tan
Most existing methods rely on binary shadow masks, without considering the ambiguous boundaries of soft and self shadows.
1 code implementation • 6 Oct 2022 • Yeying Jin, Wending Yan, Wenhan Yang, Robby T. Tan
Few existing image defogging or dehazing methods consider dense and non-uniform particle distributions, which usually happen in smoke, dust and fog.
Ranked #2 on Nonhomogeneous Image Dehazing on NH-HAZE validation
no code implementations • 25 Aug 2022 • Yu Cheng, Yihao Ai, Bo wang, Xinchao Wang, Robby T. Tan
In multi-person 2D pose estimation, the bottom-up methods simultaneously predict poses for all persons, and unlike the top-down methods, do not rely on human detection.
1 code implementation • ICCV 2021 • Yeying Jin, Aashish Sharma, Robby T. Tan
To address the problem, in this paper, we propose an unsupervised domain-classifier guided shadow removal network, DC-ShadowNet.
Ranked #4 on Shadow Removal on SRD
1 code implementation • 21 Jul 2022 • Yeying Jin, Wenhan Yang, Robby T. Tan
To address this problem, we need to suppress the light effects in bright regions while, at the same time, boosting the intensity of dark regions.
Ranked #26 on Low-Light Image Enhancement on LOL
no code implementations • 29 May 2022 • Wending Yan, Lu Xu, Wenhan Yang, Robby T. Tan
Our single image module employs a raindrop removal network to generate initial raindrop removal results, and create a mask representing the differences between the input and initial output.
1 code implementation • 2 May 2022 • Yu Cheng, Bo wang, Robby T. Tan
Most of the methods focus on single persons, which estimate the poses in the person-centric coordinates, i. e., the coordinates based on the center of the target person.
Ranked #1 on 3D Human Pose Estimation on JTA
3D Multi-Person Pose Estimation (absolute) 3D Multi-Person Pose Estimation (root-relative) +4
no code implementations • CVPR 2021 • Aashish Sharma, Robby T. Tan
In this paper, given a single nighttime image as input, our goal is to enhance its visibility by increasing the dynamic range of the intensity, and thus can boost the intensity of the low light regions, and at the same time, suppress the light effects (glow, glare) simultaneously.
1 code implementation • CVPR 2021 • Wending Yan, Robby T. Tan, Wenhan Yang, Dengxin Dai
In this paper, we address the problems of rain streaks and rain accumulation removal in video, by developing a self-aligned network with transmission-depth consistency.
no code implementations • 7 May 2021 • Lu Liu, Robby T. Tan
At inference, we propose a human-object regrouping approach by considering the object-exclusive property of an action, where the target object should not be shared by more than one human.
1 code implementation • CVPR 2021 • Yu Cheng, Bo wang, Bo Yang, Robby T. Tan
Besides the integration of top-down and bottom-up networks, unlike existing pose discriminators that are designed solely for single person, and consequently cannot assess natural inter-person interactions, we propose a two-person pose discriminator that enforces natural two-person interactions.
3D Multi-Person Pose Estimation (absolute) 3D Multi-Person Pose Estimation (root-relative) +2
1 code implementation • 22 Dec 2020 • Yu Cheng, Bo wang, Bo Yang, Robby T. Tan
To tackle this problem, we propose a novel framework integrating graph convolutional networks (GCNs) and temporal convolutional networks (TCNs) to robustly estimate camera-centric multi-person 3D poses that do not require camera parameters.
Ranked #1 on Root Joint Localization on Human3.6M
3D Absolute Human Pose Estimation 3D Multi-Person Pose Estimation (absolute) +5
no code implementations • 15 Oct 2020 • YuAn Liu, Ruoteng Li, Robby T. Tan, Yu Cheng, Xiubao Sui
Our trajectory prediction module predicts the target object's locations in the current and future frames based on the object's past trajectory.
no code implementations • 13 Oct 2020 • Cheng Yu, Bo wang, Bo Yang, Robby T. Tan
Addressing these problems, we introduce a spatio-temporal network for robust 3D human pose estimation.
no code implementations • 8 Oct 2020 • Aashish Sharma, Robby T. Tan, Loong-Fah Cheong
Second, we employ a gradual refinement scheme in which we start from a simple CRF model to generate a result which is more robust to noise but less accurate, and then we gradually increase the model's complexity to improve the result.
no code implementations • AAAI Conference on Artificial Intelligence, AAAI 2020 2020 • Yu Cheng, Bo Yang, Bo wang, Robby T. Tan
Estimating 3D poses from a monocular video is still a challenging task, despite the significant progress that has been made in recent years.
Ranked #3 on 3D Human Pose Estimation on HumanEva-I
no code implementations • CVPR 2020 • Wending Yan, Aashish Sharma, Robby T. Tan
Initially, given a pair of synthetic fog images, its corresponding clean images and optical flow ground-truths, in one training batch we train our network in a supervised manner.
no code implementations • 16 Dec 2019 • Wenhan Yang, Robby T. Tan, Shiqi Wang, Yuming Fang, Jiaying Liu
The goal of single-image deraining is to restore the rain-free background scenes of an image degraded by rain streaks and rain accumulation.
2 code implementations • 30 Sep 2019 • Aashish Sharma, Lionel Heng, Loong-Fah Cheong, Robby T. Tan
To address the problem, we introduce a network joining day/night translation and stereo.
no code implementations • 24 Jul 2019 • Shaodi You, Erqi Huang, Shuaizhe Liang, Yongrong Zheng, Yunxiang Li, Fan Wang, Sen Lin, Qiu Shen, Xun Cao, Diming Zhang, Yuanjiang Li, Yu Li, Ying Fu, Boxin Shi, Feng Lu, Yinqiang Zheng, Robby T. Tan
This document introduces the background and the usage of the Hyperspectral City Dataset and the benchmark.
no code implementations • 23 Jul 2019 • Angelica I. Aviles-Rivero, Nicolas Papadakis, Ruoteng Li, Philip Sellars, Qingnan Fan, Robby T. Tan, Carola-Bibiane Schönlieb
The task of classifying X-ray data is a problem of both theoretical and clinical interest.
1 code implementation • 10 Apr 2019 • Ruotent Li, Loong Fah Cheong, Robby T. Tan
This filtering is guided by a rain-free residue image --- its content is used to set the passbands for the two channels in a spatially-variant manner so that the background details do not get mixed up with the rain-streaks.
no code implementations • 17 Jan 2019 • Lu Liu, Robby T. Tan
Specifically, we propose two approaches, i. e. Filtering CCL and Temperature CCL to either filter out uncertain predictions or pay less attention on them in the consistency regularization.
no code implementations • 11 Dec 2018 • Lu Liu, Robby T. Tan, ShaoDi You
This requirement of bounding boxes as part of the input is needed to enable the methods to ignore irrelevant contexts and extract only human features.
no code implementations • ECCV 2018 • Ruoteng Li, Robby T. Tan, Loong-Fah Cheong
Optical flow estimation in rainy scenes is challenging due to degradation caused by rain streaks and rain accumulation, where the latter refers to the poor visibility of remote scenes due to intense rainfall.
no code implementations • 19 Dec 2017 • Ruoteng Li, Loong-Fah Cheong, Robby T. Tan
Given a single input rainy image, our goal is to visually remove rain streaks and the veiling effect caused by scattering and transmission of rain streaks and rain droplets.
3 code implementations • CVPR 2018 • Rui Qian, Robby T. Tan, Wenhan Yang, Jiajun Su, Jiaying Liu
This injection of visual attention to both generative and discriminative networks is the main contribution of this paper.
no code implementations • 18 Apr 2017 • Ruoteng Li, Robby T. Tan, Loong-Fah Cheong
To handle rain accumulation, our method decomposes the image into a piecewise-smooth background layer and a high-frequency detail layer.
2 code implementations • CVPR 2017 • Wenhan Yang, Robby T. Tan, Jiashi Feng, Jiaying Liu, Zongming Guo, Shuicheng Yan
Based on the first model, we develop a multi-task deep learning architecture that learns the binary rain streak map, the appearance of rain streaks, and the clean background, which is our ultimate output.
no code implementations • 21 Jul 2016 • Yu Li, ShaoDi You, Michael S. Brown, Robby T. Tan
This paper provides a comprehensive survey of methods dealing with visibility enhancement of images taken in hazy or foggy scenes.
no code implementations • CVPR 2016 • Yu Li, Robby T. Tan, Xiaojie Guo, Jiangbo Lu, Michael S. Brown
This paper addresses the problem of rain streak removal from a single image.
no code implementations • CVPR 2016 • Jiaolong Yang, Hongdong Li, Yuchao Dai, Robby T. Tan
This paper deals with a challenging, frequently encountered, yet not properly investigated problem in two-frame optical flow estimation.
no code implementations • 4 Apr 2016 • Shaodi You, Robby T. Tan, Rei Kawakami, Yasuhiro Mukaigawa, Katsushi Ikeuchi
(2) The imagery inside a water-drop is determined by the water-drop 3D shape and total reflection at the boundary.
no code implementations • ICCV 2015 • Yu Li, Robby T. Tan, Michael S. Brown
We demonstrate the effectiveness of our nighttime haze model and correction method on a number of examples and compare our results with existing daytime and nighttime dehazing methods' results.
no code implementations • 2 Nov 2015 • Pascal Mettes, Robby T. Tan, Remco C. Veltkamp
Experimental evaluation on the Video Water Database and the DynTex database indicates the effectiveness of the proposed algorithm, outperforming multiple algorithms for dynamic texture recognition and material recognition by ca.
no code implementations • CVPR 2015 • Zhuwen Li, Ping Tan, Robby T. Tan, Danping Zou, Steven Zhiying Zhou, Loong-Fah Cheong
We present a method to jointly estimate scene depth and recover the clear latent image from a foggy video sequence.
no code implementations • CVPR 2013 • Shaodi You, Robby T. Tan, Rei Kawakami, Katsushi Ikeuchi
First, it detects raindrops based on the motion and the intensity temporal derivatives of the input video.