no code implementations • CVPR 2018 • Ying-Cong Chen, Huaijia Lin, Michelle Shu, Ruiyu Li, Xin Tao, Yangang Ye, Xiaoyong Shen, Jiaya Jia
Digital face manipulation has become a popular and fascinating way to touch images with the prevalence of smartphones and social networks.
no code implementations • ICCV 2017 • Xiaoyong Shen, Hongyun Gao, Xin Tao, Chao Zhou, Jiaya Jia
Estimating correspondence between two images and extracting the foreground object are two challenges in computer vision.
no code implementations • 7 Apr 2017 • Xiaoyong Shen, Ying-Cong Chen, Xin Tao, Jiaya Jia
We propose a principled convolutional neural pyramid (CNP) framework for general low-level vision and image processing tasks.
no code implementations • CVPR 2015 • Ziyang Ma, Renjie Liao, Xin Tao, Li Xu, Jiaya Jia, Enhua Wu
Ubiquitous motion blur easily fails multi-frame super-resolution (MFSR).
no code implementations • ICCV 2015 • Renjie Liao, Xin Tao, Ruiyu Li, Ziyang Ma, Jiaya Jia
We propose a new direction for fast video super-resolution (VideoSR) via a SR draft ensemble, which is defined as the set of high-resolution patch candidates before final image deconvolution.
no code implementations • 2 Jul 2019 • Ruizheng Wu, Xiaodong Gu, Xin Tao, Xiaoyong Shen, Yu-Wing Tai, Jiaya Jia
In this paper, we are interested in generating an cartoon face of a person by using unpaired training data between real faces and cartoon ones.
no code implementations • ECCV 2020 • Ruizheng Wu, Xin Tao, Ying-Cong Chen, Xiaoyong Shen, Jiaya Jia
Unpaired image-to-image translation aims to translate images from the source class to target one by providing sufficient data for these classes.
no code implementations • 28 May 2021 • Xin Tao, Jonas Mårtensson, Håkan Warnquist, Anna Pernestål
We also present a maintenance planning model using a risk-based decision-making method, which identifies the maintenance decision with minimal economic risk of the truck company.
no code implementations • SEMEVAL 2020 • Xin Tao, Xiaobing Zhou
We only participated in the first subtask, and a neural sequence model was used to perform the sequence tagging task.
no code implementations • SEMEVAL 2021 • Yingjia Zhao, Xin Tao
This article introduces the submission of subtask 1 and subtask 2 that we participate in SemEval-2021 Task 7: HaHackathon: Detecting and Rating Humor and Offense, we use a model based on ALBERT that uses ALBERT as the module for extracting text features.
no code implementations • 16 Oct 2021 • Xinhai Zhang, Jianbo Tao, Kaige Tan, Martin Törngren, José Manuel Gaspar Sánchez, Muhammad Rusyadi Ramli, Xin Tao, Magnus Gyllenhammar, Franz Wotawa, Naveen Mohan, Mihai Nica, Hermann Felbinger
The main contributions are: (i) introducing a comprehensive taxonomy for critical scenario identification methods; (ii) giving an overview of the state-of-the-art research based on the taxonomy encompassing 86 papers between 2017 and 2020; and (iii) identifying open issues and directions for further research.
no code implementations • EACL (LTEDI) 2021 • Yingjia Zhao, Xin Tao
We use the attention mechanism to adjust the weight of all the output layers of XLM-RoBERTa to make full use of the information extracted from each layer, and use the weighted sum of all the output layers to complete the classification task.
no code implementations • EACL (DravidianLangTech) 2021 • Yingjia Zhao, Xin Tao
The development of online media platforms has given users more opportunities to post and comment freely, but the negative impact of offensive language has become increasingly apparent.
no code implementations • 9 Feb 2022 • Xin Tao, Zhao Yuan
This paper addresses the challenges of decision-making for autonomous vehicles under faults during a transport mission.
no code implementations • 27 May 2022 • Xin Tao, Mladen Čičić, Jonas Mårtensson
With the proposed method, alternate decisions can be derived to reduce the risks of public time loss significantly with a low increase in the risk of mission delay.
no code implementations • 28 Sep 2022 • Jiayin Cai, Changlin Li, Xin Tao, Chun Yuan, Yu-Wing Tai
This paper proposes a novel video inpainting method.
no code implementations • 21 Nov 2022 • Changlin Li, Guangyang Wu, Yanan sun, Xin Tao, Chi-Keung Tang, Yu-Wing Tai
The learnt deformable kernel is then utilized in convolving the input frames for predicting the interpolated frame.
no code implementations • 7 Aug 2023 • Huimin Zeng, Weinong Wang, Xin Tao, Zhiwei Xiong, Yu-Wing Tai, Wenjie Pei
First, our model decouples the learning of source image semantics from the encoding of user guidance to process two types of input domains separately.
no code implementations • 9 Aug 2023 • Songlin Tang, Wenjie Pei, Xin Tao, Tanghui Jia, Guangming Lu, Yu-Wing Tai
Existing methods for interactive segmentation in radiance fields entail scene-specific optimization and thus cannot generalize across different scenes, which greatly limits their applicability.
no code implementations • 29 Mar 2024 • Luozhou Wang, Guibao Shen, Yixun Liang, Xin Tao, Pengfei Wan, Di Zhang, Yijun Li, Yingcong Chen
In this research, we present a novel approach to motion customization in video generation, addressing the widespread gap in the thorough exploration of motion representation within video generative models.
no code implementations • 10 Apr 2024 • Guangyang Wu, Xin Tao, Changlin Li, Wenyi Wang, Xiaohong Liu, Qingqing Zheng
In practice, motion estimates often prove to be error-prone, resulting in misaligned features.
no code implementations • 15 Apr 2024 • Zhaokun Zhou, Qiulin Wang, Bin Lin, Yiwei Su, Rui Chen, Xin Tao, Amin Zheng, Li Yuan, Pengfei Wan, Di Zhang
To further evaluate the IAA capability of MLLMs, we construct the UNIAA-Bench, which consists of three aesthetic levels: Perception, Description, and Assessment.
1 code implementation • 21 Nov 2019 • Ligong Han, Ruijiang Gao, Mun Kim, Xin Tao, Bo Liu, Dimitris Metaxas
Conditional generative adversarial networks have shown exceptional generation performance over the past few years.
1 code implementation • CVPR 2022 • Takashi Isobe, Xu Jia, Xin Tao, Changlin Li, Ruihuang Li, Yongjie Shi, Jing Mu, Huchuan Lu, Yu-Wing Tai
Instead of directly feeding consecutive frames into a VSR model, we propose to compute the temporal difference between frames and divide those pixels into two subsets according to the level of difference.
1 code implementation • ICCV 2019 • Ruizheng Wu, Xin Tao, Xiaodong Gu, Xiaoyong Shen, Jiaya Jia
Current image translation methods, albeit effective to produce high-quality results in various applications, still do not consider much geometric transform.
1 code implementation • CVPR 2023 • Yingwei Wang, Xu Jia, Xin Tao, Takashi Isobe, Huchuan Lu, Yu-Wing Tai
Videos stored on mobile devices or delivered on the Internet are usually in compressed format and are of various unknown compression parameters, but most video super-resolution (VSR) methods often assume ideal inputs resulting in large performance gap between experimental settings and real-world applications.
1 code implementation • ICCV 2017 • Xin Tao, Chao Zhou, Xiaoyong Shen, Jue Wang, Jiaya Jia
In this paper, we study an unconventional but practically meaningful reversibility problem of commonly used image filters.
2 code implementations • ECCV 2020 • Yi Wang, Ying-Cong Chen, Xin Tao, Jiaya Jia
Blind inpainting is a task to automatically complete visual contents without specifying masks for missing areas in an image.
1 code implementation • 27 Nov 2023 • Qi Fan, Xin Tao, Lei Ke, Mingqiao Ye, Yuan Zhang, Pengfei Wan, Zhongyuan Wang, Yu-Wing Tai, Chi-Keung Tang
Thus, our solution, termed Stable-SAM, offers several advantages: 1) improved SAM's segmentation stability across a wide range of prompt qualities, while 2) retaining SAM's powerful promptable segmentation efficiency and generality, with 3) minimal learnable parameters (0. 08 M) and fast adaptation (by 1 training epoch).
1 code implementation • 20 Dec 2023 • Tao Zhang, Xingye Tian, Yikang Zhou, Shunping Ji, Xuebo Wang, Xin Tao, Yuan Zhang, Pengfei Wan, Zhongyuan Wang, Yu Wu
We present the \textbf{D}ecoupled \textbf{VI}deo \textbf{S}egmentation (DVIS) framework, a novel approach for the challenging task of universal video segmentation, including video instance segmentation (VIS), video semantic segmentation (VSS), and video panoptic segmentation (VPS).
Ranked #1 on Video Semantic Segmentation on VSPW
1 code implementation • 7 Jun 2023 • Tao Zhang, Xingye Tian, Haoran Wei, Yu Wu, Shunping Ji, Xuebo Wang, Xin Tao, Yuan Zhang, Pengfei Wan
In this report, we successfully validated the effectiveness of the decoupling strategy in video panoptic segmentation.
1 code implementation • 28 Aug 2023 • Tao Zhang, Xingye Tian, Yikang Zhou, Yu Wu, Shunping Ji, Cilin Yan, Xuebo Wang, Xin Tao, Yuan Zhang, Pengfei Wan
Video instance segmentation is a challenging task that serves as the cornerstone of numerous downstream applications, including video editing and autonomous driving.
1 code implementation • CVPR 2021 • Liying Lu, Wenbo Li, Xin Tao, Jiangbo Lu, Jiaya Jia
Therefore, high-quality correspondence matching is critical.
1 code implementation • ECCV 2020 • Wenbo Li, Xin Tao, Taian Guo, Lu Qi, Jiangbo Lu, Jiaya Jia
Motivated by these findings, we propose a temporal multi-correspondence aggregation strategy to leverage similar patches across frames, and a cross-scale nonlocal-correspondence aggregation scheme to explore self-similarity of images across scales.
1 code implementation • ICCV 2017 • Xin Tao, Hongyun Gao, Renjie Liao, Jue Wang, Jiaya Jia
In this paper, we show that proper frame alignment and motion compensation is crucial for achieving high quality results.
Ranked #11 on Video Super-Resolution on Vid4 - 4x upscaling
2 code implementations • NeurIPS 2018 • Yi Wang, Xin Tao, Xiaojuan Qi, Xiaoyong Shen, Jiaya Jia
In this paper, we propose a generative multi-column network for image inpainting.
4 code implementations • CVPR 2018 • Xin Tao, Hongyun Gao, Yi Wang, Xiaoyong Shen, Jue Wang, Jiaya Jia
In single image deblurring, the "coarse-to-fine" scheme, i. e. gradually restoring the sharp image on different resolutions in a pyramid, is very successful in both traditional optimization-based methods and recent neural-network-based approaches.
Ranked #3 on Image Deblurring on GoPro (Params (M) metric, using extra training data)