1 code implementation • ECCV 2020 • Yu Liu, Sarah Parisot, Gregory Slabaugh, Xu Jia, Ales Leonardis, Tinne Tuytelaars
Since those regularization strategies are mostly associated with classifier outputs, we propose a MUlti-Classifier (MUC) incremental learning paradigm that integrates an ensemble of auxiliary classifiers to estimate more effective regularization constraints.
no code implementations • 2 Dec 2024 • Xiaomin Li, Xu Jia, Qinghe Wang, Haiwen Diao, Mengmeng Ge, Pengxiang Li, You He, Huchuan Lu
They often fail to effectively decouple motion and the appearance in the limited reference videos, thereby weakening the modeling capability of motion patterns.
1 code implementation • 26 Nov 2024 • Yicheng Yang, Pengxiang Li, Lu Zhang, Liqian Ma, Ping Hu, Siyu Du, Yunzhi Zhuge, Xu Jia, Huchuan Lu
Extensive experiments demonstrate that DreamMix effectively balances identity preservation and attribute editability across various application scenarios, including object insertion, attribute editing, and small object inpainting.
1 code implementation • 18 Nov 2024 • ZiYi Yang, Zaibin Zhang, Zirui Zheng, Yuxian Jiang, Ziyue Gan, Zhiyu Wang, Zijian Ling, Jinsong Chen, Martz Ma, Bowen Dong, Prateek Gupta, Shuyue Hu, Zhenfei Yin, Guohao Li, Xu Jia, Lijun Wang, Bernard Ghanem, Huchuan Lu, Chaochao Lu, Wanli Ouyang, Yu Qiao, Philip Torr, Jing Shao
There has been a growing interest in enhancing rule-based agent-based models (ABMs) for social media platforms (i. e., X, Reddit) with more realistic large language model (LLM) agents, thereby allowing for a more nuanced study of complex systems.
no code implementations • 17 Jul 2024 • Pengyu Zhang, Hao Yin, Zeren Wang, Wenyue Chen, Shengming Li, Dong Wang, Huchuan Lu, Xu Jia
Sign language is one of the most effective communication tools for people with hearing difficulties.
1 code implementation • 10 Jul 2024 • Haiwen Diao, Bo Wan, Xu Jia, Yunzhi Zhuge, Ying Zhang, Huchuan Lu, Long Chen
Parameter-efficient transfer learning (PETL) has emerged as a flourishing research field for adapting large pre-trained models to downstream tasks, greatly reducing trainable parameters while grappling with memory challenges during fine-tuning.
no code implementations • 29 May 2024 • Wangbo Yu, Chaoran Feng, Jiye Tang, Jiashu Yang, Xu Jia, Yuchao Yang, Li Yuan, Yonghong Tian
Capitalizing on the high temporal resolution and dynamic range offered by the event camera, we leverage the event streams to explicitly model the formation process of motion-blurred images and guide the deblurring reconstruction of 3D-GS.
1 code implementation • 24 Apr 2024 • Qinghe Wang, Baolu Li, Xiaomin Li, Bing Cao, Liqian Ma, Huchuan Lu, Xu Jia
In this work, we propose CharacterFactory, a framework that allows sampling new characters with consistent identities in the latent space of GANs for diffusion models.
no code implementations • 16 Apr 2024 • Kai Chen, Yanze Li, Wenhua Zhang, Yanxin Liu, Pengxiang Li, Ruiyuan Gao, Lanqing Hong, Meng Tian, Xinhai Zhao, Zhenguo Li, Dit-yan Yeung, Huchuan Lu, Xu Jia
Moreover, with our CODA-LM, we build CODA-VLM, a new driving LVLM surpassing all open-sourced counterparts on CODA-LM.
1 code implementation • 29 Jan 2024 • Qinghe Wang, Xu Jia, Xiaomin Li, Taiqing Li, Liqian Ma, Yunzhi Zhuge, Huchuan Lu
We believe that the proposed StableIdentity is an important step to unify image, video, and 3D customized generation models.
1 code implementation • 1 Dec 2023 • Pengxiang Li, Kai Chen, Zhili Liu, Ruiyuan Gao, Lanqing Hong, Guo Zhou, Hua Yao, Dit-yan Yeung, Huchuan Lu, Xu Jia
Despite remarkable achievements in video synthesis, achieving granular control over complex dynamics, such as nuanced movement among multiple interacting objects, still presents a significant hurdle for dynamic world modeling, compounded by the necessity to manage appearance and disappearance, drastic scale changes, and ensure consistency for instances across frames.
1 code implementation • 11 Oct 2023 • Ruotong Liao, Xu Jia, Yangzhe Li, Yunpu Ma, Volker Tresp
Extensive experiments have shown that GenTKG outperforms conventional methods of temporal relational forecasting with low computation resources using extremely limited training data as few as 16 samples.
1 code implementation • CVPR 2024 • Haiwen Diao, Bo Wan, Ying Zhang, Xu Jia, Huchuan Lu, Long Chen
Parameter-efficient transfer learning (PETL), i. e., fine-tuning a small portion of parameters, is an effective strategy for adapting pre-trained models to downstream domains.
1 code implementation • 23 May 2023 • Xinyu Zhang, Hefei Huang, Xu Jia, Dong Wang, Huchuan Lu
In this work, we aim to re-expose the captured photo in post-processing to provide a more flexible way of addressing those issues within a unified framework.
Ranked #5 on Deblurring on GoPro (using extra training data)
1 code implementation • 13 May 2023 • Wenjie Xu, Ben Liu, Miao Peng, Xu Jia, Min Peng
We train our model with a masking strategy to convert TKGC task into a masked token prediction task, which can leverage the semantic information in pre-trained language models.
no code implementations • CVPR 2023 • Jianchuan Chen, Wentao Yi, Liqian Ma, Xu Jia, Huchuan Lu
The results demonstrate that our approach outperforms state-of-the-art methods in terms of novel view synthesis and geometric reconstruction.
1 code implementation • 17 Mar 2023 • Dongsheng Wang, Xu Jia, Yang Zhang, Xinyu Zhang, Yaoyuan Wang, Ziyang Zhang, Dong Wang, Huchuan Lu
To fully exploit information with event streams to detect objects, a dual-memory aggregation network (DMANet) is proposed to leverage both long and short memory along event streams to aggregate effective information for object detection.
1 code implementation • CVPR 2023 • Yingwei Wang, Xu Jia, Xin Tao, Takashi Isobe, Huchuan Lu, Yu-Wing Tai
Videos stored on mobile devices or delivered on the Internet are usually in compressed format and are of various unknown compression parameters, but most video super-resolution (VSR) methods often assume ideal inputs resulting in large performance gap between experimental settings and real-world applications.
1 code implementation • CVPR 2022 • Canqian Yang, Meiguang Jin, Xu Jia, Yi Xu, Ying Chen
They adopt a sub-optimal uniform sampling point allocation, limiting the expressiveness of the learned LUTs since the (tri-)linear interpolation between uniform sampling points in the LUT transform might fail to model local non-linearities of the color transform.
Ranked #2 on Photo Retouching on MIT-Adobe 5k
1 code implementation • CVPR 2022 • Takashi Isobe, Xu Jia, Xin Tao, Changlin Li, Ruihuang Li, Yongjie Shi, Jing Mu, Huchuan Lu, Yu-Wing Tai
Instead of directly feeding consecutive frames into a VSR model, we propose to compute the temporal difference between frames and divide those pixels into two subsets according to the level of difference.
no code implementations • 4 Apr 2022 • Liqian Ma, Stamatios Georgoulis, Xu Jia, Luc van Gool
The ability to make educated predictions about their surroundings, and associate them with certain confidence, is important for intelligent systems, like autonomous vehicles and robots.
no code implementations • CVPR 2022 • Weihua He, Kaichao You, Zhendong Qiao, Xu Jia, Ziyang Zhang, Wenhui Wang, Huchuan Lu, Yaoyuan Wang, Jianxing Liao
Since event camera is a novel sensor, its potential has not been fulfilled due to the lack of processing algorithms.
no code implementations • 21 Mar 2022 • Xiaodong Cun, Zhendong Wang, Chi-Man Pun, Jianzhuang Liu, Wengang Zhou, Xu Jia, Houqiang Li
Color constancy aims to restore the constant colors of a scene under different illuminants.
1 code implementation • CVPR 2022 • Ruihuang Li, Shuai Li, Chenhang He, Yabin Zhang, Xu Jia, Lei Zhang
One popular solution to this challenging task is self-training, which selects high-scoring predictions on target samples as pseudo labels for training.
Ranked #9 on Image-to-Image Translation on SYNTHIA-to-Cityscapes
no code implementations • 1 Dec 2021 • Yue Wang, Xu Jia, Lu Zhang, Yuke Li, James Elder, Huchuan Lu
TFFM conducts a sufficient feature fusion by integrating features from multiple scales and two modalities over all positions simultaneously.
no code implementations • ICCV 2021 • Fang Xu, Lei Yu, Bishan Wang, Wen Yang, Gui-Song Xia, Xu Jia, Zhendong Qiao, Jianzhuang Liu
In this paper, we propose an end-to-end learning framework for event-based motion deblurring in a self-supervised manner, where real-world events are exploited to alleviate the performance degradation caused by data inconsistency.
1 code implementation • 3 Aug 2021 • Tianhong Dai, Wei Li, Xilei Cao, Jianzhuang Liu, Xu Jia, Ales Leonardis, Youliang Yan, Shanxin Yuan
The frequency-guided upsampling module reconstructs details from multiple frequency-specific components with rich details.
1 code implementation • ICCV 2021 • Ruihuang Li, Xu Jia, Jianzhong He, Shuaijun Chen, QinGhua Hu
Most existing domain adaptation methods focus on adaptation from only one source domain, however, in practice there are a number of relevant sources that could be leveraged to help improve performance on target domain.
Ranked #2 on Unsupervised Domain Adaptation on PACS
1 code implementation • 25 Jun 2021 • Jianchuan Chen, Ying Zhang, Di Kang, Xuefei Zhe, Linchao Bao, Xu Jia, Huchuan Lu
We present animatable neural radiance fields (animatable NeRF) for detailed human avatar creation from monocular videos.
no code implementations • CVPR 2021 • Takashi Isobe, Xu Jia, Shuaijun Chen, Jianzhong He, Yongjie Shi, Jianzhuang Liu, Huchuan Lu, Shengjin Wang
To obtain a single model that works across multiple target domains, we propose to simultaneously learn a student model which is trained to not only imitate the output of each expert on the corresponding target domain, but also to pull different expert close to each other with regularization on their weights.
Ranked #4 on Domain Adaptation on GTAV to Cityscapes+Mapillary
1 code implementation • CVPR 2021 • Shuaijun Chen, Xu Jia, Jianzhong He, Yongjie Shi, Jianzhuang Liu
To address the task of SSDA, a novel framework based on dual-level domain mixing is proposed.
no code implementations • CVPR 2021 • Jianzhong He, Xu Jia, Shuaijun Chen, Jianzhuang Liu
Multi-source unsupervised domain adaptation~(MSDA) aims at adapting models trained on multiple labeled source domains to an unlabeled target domain.
Ranked #1 on Domain Adaptation on GTA5+Synscapes to Cityscapes
Multi-Source Unsupervised Domain Adaptation Semantic Segmentation +1
12 code implementations • CVPR 2021 • Tao Huang, Songjiang Li, Xu Jia, Huchuan Lu, Jianzhuang Liu
In this paper, we present a very simple yet effective method named Neighbor2Neighbor to train an effective image denoising model with only noisy images.
2 code implementations • 13 Aug 2020 • Takashi Isobe, Fang Zhu, Xu Jia, Shengjin Wang
Video super-resolution plays an important role in surveillance video analysis and ultra-high-definition video display, which has drawn much attention in both the research and industrial communities.
Ranked #1 on Video Super-Resolution on SPMCS - 4x upscaling
2 code implementations • ECCV 2020 • Takashi Isobe, Xu Jia, Shuhang Gu, Songjiang Li, Shengjin Wang, Qi Tian
Most video super-resolution methods super-resolve a single reference frame with the help of neighboring frames in a temporal sliding window.
1 code implementation • CVPR 2020 • Takashi Isobe, Songjiang Li, Xu Jia, Shanxin Yuan, Gregory Slabaugh, Chunjing Xu, Ya-Li Li, Shengjin Wang, Qi Tian
Video super-resolution, which aims at producing a high-resolution video from its corresponding low-resolution version, has recently drawn increasing attention.
no code implementations • CVPR 2020 • Linjun Zhou, Peng Cui, Xu Jia, Shiqiang Yang, Qi Tian
Few-shot learning has attracted intensive research attention in recent years.
1 code implementation • CVPR 2020 • Matthias De Lange, Xu Jia, Sarah Parisot, Ales Leonardis, Gregory Slabaugh, Tinne Tuytelaars
This framework flexibly disentangles user-adaptation into model personalization on the server and local data regularization on the user device, with desirable properties regarding scalability and privacy constraints.
no code implementations • ICLR 2020 • Tao Huang, Zhen Han, Xu Jia, Hanyuan Hang
In this paper, we propose a novel kind of kernel, random forest kernel, to enhance the empirical performance of MMD GAN.
no code implementations • 7 Oct 2019 • Zhen Han, Enyan Dai, Xu Jia, Xiaoying Ren, Shuaijun Chen, Chunjing Xu, Jianzhuang Liu, Qi Tian
The task of single image super-resolution (SISR) aims at reconstructing a high-resolution (HR) image from a low-resolution (LR) image.
3 code implementations • 25 Sep 2019 • Dehua Song, Chang Xu, Xu Jia, Yiyi Chen, Chunjing Xu, Yunhe Wang
Focusing on this issue, we propose an efficient residual dense block search algorithm with multiple objectives to hunt for fast, lightweight and accurate networks for image super-resolution.
1 code implementation • 18 Sep 2019 • Matthias De Lange, Rahaf Aljundi, Marc Masana, Sarah Parisot, Xu Jia, Ales Leonardis, Gregory Slabaugh, Tinne Tuytelaars
Artificial neural networks thrive in solving the classification problem for a particular rigid task, acquiring knowledge through generalized learning behaviour from a distinct training phase.
2 code implementations • ICCV 2019 • Han Shu, Yunhe Wang, Xu Jia, Kai Han, Hanting Chen, Chunjing Xu, Qi Tian, Chang Xu
Generative adversarial networks (GANs) have been successfully used for considerable computer vision tasks, especially the image-to-image translation.
2 code implementations • CVPR 2019 • Junting Pan, Chengyu Wang, Xu Jia, Jing Shao, Lu Sheng, Junjie Yan, Xiaogang Wang
This paper proposes the novel task of video generation conditioned on a SINGLE semantic label map, which provides a good balance between flexibility and quality in the generation process.
no code implementations • ICLR 2019 • Liqian Ma, Xu Jia, Stamatios Georgoulis, Tinne Tuytelaars, Luc van Gool
Experimental results on various datasets show that EGSC-IT does not only translate the source image to diverse instances in the target domain, but also preserves the semantic consistency during the process.
no code implementations • 18 Dec 2017 • Xu Jia, Hong Chang, Tinne Tuytelaars
In this work, we revisit the more traditional interpolation-based methods, that were popular before, now with the help of deep learning.
no code implementations • 11 Dec 2017 • Yu-Hui Huang, Xu Jia, Stamatios Georgoulis, Tinne Tuytelaars, Luc van Gool
Pixelwise semantic image labeling is an important, yet challenging, task with many applications.
2 code implementations • NeurIPS 2017 • Liqian Ma, Xu Jia, Qianru Sun, Bernt Schiele, Tinne Tuytelaars, Luc van Gool
This paper proposes the novel Pose Guided Person Generation Network (PG$^2$) that allows to synthesize person images in arbitrary poses, based on an image of that person and a novel pose.
Ranked #3 on Gesture-to-Gesture Translation on Senz3D
1 code implementation • NeurIPS 2016 • Bert De Brabandere, Xu Jia, Tinne Tuytelaars, Luc van Gool
In a traditional convolutional layer, the learned filters stay fixed after training.
Ranked #1 on Video Prediction on KTH (Cond metric)
no code implementations • ICCV 2015 • Xu Jia, Efstratios Gavves, Basura Fernando, Tinne Tuytelaars
In this work we focus on the problem of image caption generation.
no code implementations • 26 Nov 2015 • Amir Ghodrati, Xu Jia, Marco Pedersoli, Tinne Tuytelaars
Learning the distribution of images in order to generate new samples is a challenging task due to the high dimensionality of the data and the highly non-linear relations that are involved.
1 code implementation • 16 Sep 2015 • Xu Jia, Efstratios Gavves, Basura Fernando, Tinne Tuytelaars
In this work we focus on the problem of image caption generation.