Search Results for author: Xin Tao

Found 37 papers, 15 papers with code

Particularity beyond Commonality: Unpaired Identity Transfer with Multiple References

no code implementations • ECCV 2020 • Ruizheng Wu, Xin Tao, Ying-Cong Chen, Xiaoyong Shen, Jiaya Jia

Unpaired image-to-image translation aims to translate images from the source class to the target one by providing sufficient data for these classes.

Image-to-Image Translation Translation

ZYJ123@DravidianLangTech-EACL2021: Offensive Language Identification based on XLM-RoBERTa with DPCNN

no code implementations • EACL (DravidianLangTech) 2021 • Yingjia Zhao, Xin Tao

The development of online media platforms has given users more opportunities to post and comment freely, but the negative impact of offensive language has become increasingly apparent.

Language Identification

ZYJ@LT-EDI-EACL2021:XLM-RoBERTa-Based Model with Attention for Hope Speech Detection

no code implementations • EACL (LTEDI) 2021 • Yingjia Zhao, Xin Tao

We use the attention mechanism to adjust the weight of all the output layers of XLM-RoBERTa to make full use of the information extracted from each layer, and use the weighted sum of all the output layers to complete the classification task.

Hope Speech Detection
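
The layer-weighting idea in the summary above can be illustrated with a minimal sketch. It assumes softmax-normalized scalar scores, one per layer, and uses toy two-dimensional vectors in place of real XLM-RoBERTa outputs; the function names and the scoring scheme are hypothetical and are not taken from the paper.

```python
import math


def softmax(scores):
    """Turn raw per-layer scores into weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def weighted_layer_sum(layer_outputs, layer_scores):
    """Combine per-layer feature vectors into one vector.

    layer_outputs: list of equal-length vectors, one per layer (toy data here).
    layer_scores: one raw attention score per layer (an assumption of this sketch).
    """
    weights = softmax(layer_scores)
    dim = len(layer_outputs[0])
    return [
        sum(w * layer[i] for w, layer in zip(weights, layer_outputs))
        for i in range(dim)
    ]


# Three hypothetical layer outputs with equal scores, so each weight is 1/3.
layers = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
scores = [0.0, 0.0, 0.0]
pooled = weighted_layer_sum(layers, scores)
```

In a trained model the scores would be learned parameters, and the pooled vector would feed a classification head; neither is shown here.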

UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark

no code implementations • 15 Apr 2024 • Zhaokun Zhou, Qiulin Wang, Bin Lin, Yiwei Su, Rui Chen, Xin Tao, Amin Zheng, Li Yuan, Pengfei Wan, Di Zhang

To further evaluate the IAA capability of MLLMs, we construct the UNIAA-Bench, which consists of three aesthetic levels: Perception, Description, and Assessment.

Language Modelling Large Language Model

Perception-Oriented Video Frame Interpolation via Asymmetric Blending

no code implementations • 10 Apr 2024 • Guangyang Wu, Xin Tao, Changlin Li, Wenyi Wang, Xiaohong Liu, Qingqing Zheng

In practice, motion estimates often prove to be error-prone, resulting in misaligned features.

Video Frame Interpolation

Motion Inversion for Video Customization

no code implementations • 29 Mar 2024 • Luozhou Wang, Guibao Shen, Yixun Liang, Xin Tao, Pengfei Wan, Di Zhang, Yijun Li, Yingcong Chen

In this research, we present a novel approach to motion customization in video generation, addressing the widespread gap in the thorough exploration of motion representation within video generative models.

Video Generation

DVIS++: Improved Decoupled Framework for Universal Video Segmentation

1 code implementation • 20 Dec 2023 • Tao Zhang, Xingye Tian, Yikang Zhou, Shunping Ji, Xuebo Wang, Xin Tao, Yuan Zhang, Pengfei Wan, Zhongyuan Wang, Yu Wu

We present the Decoupled VIdeo Segmentation (DVIS) framework, a novel approach for the challenging task of universal video segmentation, including video instance segmentation (VIS), video semantic segmentation (VSS), and video panoptic segmentation (VPS).

Ranked #1 on Video Semantic Segmentation on VSPW

Contrastive Learning Denoising +6

Stable Segment Anything Model

1 code implementation • 27 Nov 2023 • Qi Fan, Xin Tao, Lei Ke, Mingqiao Ye, Yuan Zhang, Pengfei Wan, Zhongyuan Wang, Yu-Wing Tai, Chi-Keung Tang

Thus, our solution, termed Stable-SAM, offers several advantages: 1) improved SAM's segmentation stability across a wide range of prompt qualities, while 2) retaining SAM's powerful promptable segmentation efficiency and generality, with 3) minimal learnable parameters (0.08 M) and fast adaptation (by 1 training epoch).

Segmentation

1st Place Solution for the 5th LSVOS Challenge: Video Instance Segmentation

1 code implementation • 28 Aug 2023 • Tao Zhang, Xingye Tian, Yikang Zhou, Yu Wu, Shunping Ji, Cilin Yan, Xuebo Wang, Xin Tao, Yuan Zhang, Pengfei Wan

Video instance segmentation is a challenging task that serves as the cornerstone of numerous downstream applications, including video editing and autonomous driving.

Autonomous Driving Denoising +6

Scene-Generalizable Interactive Segmentation of Radiance Fields

no code implementations • 9 Aug 2023 • Songlin Tang, Wenjie Pei, Xin Tao, Tanghui Jia, Guangming Lu, Yu-Wing Tai

Existing methods for interactive segmentation in radiance fields entail scene-specific optimization and thus cannot generalize across different scenes, which greatly limits their applicability.

Interactive Segmentation Segmentation +1

Feature Decoupling-Recycling Network for Fast Interactive Segmentation

no code implementations • 7 Aug 2023 • Huimin Zeng, Weinong Wang, Xin Tao, Zhiwei Xiong, Yu-Wing Tai, Wenjie Pei

First, our model decouples the learning of source image semantics from the encoding of user guidance to process two types of input domains separately.

Image Segmentation Interactive Segmentation +3

1st Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation

1 code implementation • 7 Jun 2023 • Tao Zhang, Xingye Tian, Haoran Wei, Yu Wu, Shunping Ji, Xuebo Wang, Xin Tao, Yuan Zhang, Pengfei Wan

In this report, we successfully validated the effectiveness of the decoupling strategy in video panoptic segmentation.

Autonomous Driving Segmentation +2

Compression-Aware Video Super-Resolution

1 code implementation • CVPR 2023 • Yingwei Wang, Xu Jia, Xin Tao, Takashi Isobe, Huchuan Lu, Yu-Wing Tai

Videos stored on mobile devices or delivered on the Internet are usually in compressed format with various unknown compression parameters, but most video super-resolution (VSR) methods assume ideal inputs, resulting in a large performance gap between experimental settings and real-world applications.

Model Compression Video Enhancement +1

H-VFI: Hierarchical Frame Interpolation for Videos with Large Motions

no code implementations • 21 Nov 2022 • Changlin Li, Guangyang Wu, Yanan Sun, Xin Tao, Chi-Keung Tang, Yu-Wing Tai

The learnt deformable kernel is then utilized in convolving the input frames for predicting the interpolated frame.

Video Frame Interpolation

DeViT: Deformed Vision Transformers in Video Inpainting

no code implementations • 28 Sep 2022 • Jiayin Cai, Changlin Li, Xin Tao, Chun Yuan, Yu-Wing Tai

This paper proposes a novel video inpainting method.

Video Inpainting

Multi-criteria Decision-making of Intelligent Vehicles under Fault Condition Enhancing Public-private Partnership

no code implementations • 27 May 2022 • Xin Tao, Mladen Čičić, Jonas Mårtensson

With the proposed method, alternate decisions can be derived to reduce the risks of public time loss significantly with a low increase in the risk of mission delay.

Decision Making

Look Back and Forth: Video Super-Resolution with Explicit Temporal Difference Modeling

1 code implementation • CVPR 2022 • Takashi Isobe, Xu Jia, Xin Tao, Changlin Li, Ruihuang Li, Yongjie Shi, Jing Mu, Huchuan Lu, Yu-Wing Tai

Instead of directly feeding consecutive frames into a VSR model, we propose to compute the temporal difference between frames and divide those pixels into two subsets according to the level of difference.

Motion Compensation Optical Flow Estimation +1
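
The split described in the summary above can be sketched on toy data. This is only an illustration: it assumes grayscale frames stored as nested lists and a single fixed threshold on the absolute difference, and the function name and threshold value are hypothetical. The listing does not state the paper's actual criterion.

```python
def split_by_temporal_difference(prev_frame, next_frame, threshold):
    """Split pixel coordinates into low- and high-difference subsets.

    prev_frame, next_frame: 2D lists of grayscale intensities, same shape.
    threshold: cut-off on the absolute difference (an assumption of this sketch).
    Returns (low, high), each a list of (row, column) coordinates.
    """
    low, high = [], []
    for y, (row_a, row_b) in enumerate(zip(prev_frame, next_frame)):
        for x, (a, b) in enumerate(zip(row_a, row_b)):
            diff = abs(b - a)  # temporal difference at this pixel
            if diff > threshold:
                high.append((y, x))
            else:
                low.append((y, x))
    return low, high


# Two tiny 2x2 frames: only the bottom-left pixel changes strongly.
prev_frame = [[10, 10], [10, 10]]
next_frame = [[10, 12], [40, 10]]
low, high = split_by_temporal_difference(prev_frame, next_frame, threshold=5)
```

Here `high` holds the one pixel whose intensity changed by more than the threshold, and `low` holds the remaining three.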

Real-time decision-making for autonomous vehicles under faults

no code implementations • 9 Feb 2022 • Xin Tao, Zhao Yuan

This paper addresses the challenges of decision-making for autonomous vehicles under faults during a transport mission.

Autonomous Vehicles Computational Efficiency +2

Finding Critical Scenarios for Automated Driving Systems: A Systematic Literature Review

no code implementations • 16 Oct 2021 • Xinhai Zhang, Jianbo Tao, Kaige Tan, Martin Törngren, José Manuel Gaspar Sánchez, Muhammad Rusyadi Ramli, Xin Tao, Magnus Gyllenhammar, Franz Wotawa, Naveen Mohan, Mihai Nica, Hermann Felbinger

The main contributions are: (i) introducing a comprehensive taxonomy for critical scenario identification methods; (ii) giving an overview of the state-of-the-art research based on the taxonomy encompassing 86 papers between 2017 and 2020; and (iii) identifying open issues and directions for further research.

Autonomous Driving

ZYJ at SemEval-2021 Task 7: HaHackathon: Detecting and Rating Humor and Offense with ALBERT-Based Model

no code implementations • SEMEVAL 2021 • Yingjia Zhao, Xin Tao

This article introduces our submissions to subtask 1 and subtask 2 of SemEval-2021 Task 7: HaHackathon: Detecting and Rating Humor and Offense. We use an ALBERT-based model in which ALBERT serves as the module for extracting text features.

MASA-SR: Matching Acceleration and Spatial Adaptation for Reference-Based Image Super-Resolution

1 code implementation • CVPR 2021 • Liying Lu, Wenbo Li, Xin Tao, Jiangbo Lu, Jiaya Jia

Therefore, high-quality correspondence matching is critical.

Image Super-Resolution

Short-term Maintenance Planning of Autonomous Trucks for Minimizing Economic Risk

no code implementations • 28 May 2021 • Xin Tao, Jonas Mårtensson, Håkan Warnquist, Anna Pernestål

We also present a maintenance planning model using a risk-based decision-making method, which identifies the maintenance decision with minimal economic risk of the truck company.

Autonomous Driving Decision Making +1

YNUtaoxin at SemEval-2020 Task 11: Identification Fragments of Propaganda Technique by Neural Sequence Labeling Models with Different Tagging Schemes and Pre-trained Language Model

no code implementations • SEMEVAL 2020 • Xin Tao, Xiaobing Zhou

We only participated in the first subtask, and a neural sequence model was used to perform the sequence tagging task.

Language Modelling

MuCAN: Multi-Correspondence Aggregation Network for Video Super-Resolution

1 code implementation • ECCV 2020 • Wenbo Li, Xin Tao, Taian Guo, Lu Qi, Jiangbo Lu, Jiaya Jia

Motivated by these findings, we propose a temporal multi-correspondence aggregation strategy to leverage similar patches across frames, and a cross-scale nonlocal-correspondence aggregation scheme to explore self-similarity of images across scales.

Optical Flow Estimation Video Super-Resolution

VCNet: A Robust Approach to Blind Image Inpainting

2 code implementations • ECCV 2020 • Yi Wang, Ying-Cong Chen, Xin Tao, Jiaya Jia

Blind inpainting is a task to automatically complete visual contents without specifying masks for missing areas in an image.

Image Inpainting

Robust Conditional GAN from Uncertainty-Aware Pairwise Comparisons

1 code implementation • 21 Nov 2019 • Ligong Han, Ruijiang Gao, Mun Kim, Xin Tao, Bo Liu, Dimitris Metaxas

Conditional generative adversarial networks have shown exceptional generation performance over the past few years.

Attribute Generative Adversarial Network

Attribute-Driven Spontaneous Motion in Unpaired Image Translation

1 code implementation • ICCV 2019 • Ruizheng Wu, Xin Tao, Xiaodong Gu, Xiaoyong Shen, Jiaya Jia

Current image translation methods, albeit effective to produce high-quality results in various applications, still do not consider much geometric transform.

Attribute Motion Estimation +1

Landmark Assisted CycleGAN for Cartoon Face Generation

no code implementations • 2 Jul 2019 • Ruizheng Wu, Xiaodong Gu, Xin Tao, Xiaoyong Shen, Yu-Wing Tai, Jiaya Jia

In this paper, we are interested in generating a cartoon face of a person by using unpaired training data between real faces and cartoon ones.

Face Generation

Image Inpainting via Generative Multi-column Convolutional Neural Networks

2 code implementations • NeurIPS 2018 • Yi Wang, Xin Tao, Xiaojuan Qi, Xiaoyong Shen, Jiaya Jia

In this paper, we propose a generative multi-column network for image inpainting.

Image Inpainting

Facelet-Bank for Fast Portrait Manipulation

no code implementations • CVPR 2018 • Ying-Cong Chen, Huaijia Lin, Michelle Shu, Ruiyu Li, Xin Tao, Yangang Ye, Xiaoyong Shen, Jiaya Jia

Digital face manipulation has become a popular and fascinating way to touch images with the prevalence of smartphones and social networks.

Facial Editing

Scale-recurrent Network for Deep Image Deblurring

4 code implementations • CVPR 2018 • Xin Tao, Hongyun Gao, Yi Wang, Xiaoyong Shen, Jue Wang, Jiaya Jia

In single image deblurring, the "coarse-to-fine" scheme, i.e., gradually restoring the sharp image on different resolutions in a pyramid, is very successful in both traditional optimization-based methods and recent neural-network-based approaches.

Ranked #3 on Image Deblurring on GoPro (Params (M) metric, using extra training data)

Deblurring Image Deblurring +1

Zero-order Reverse Filtering

1 code implementation • ICCV 2017 • Xin Tao, Chao Zhou, Xiaoyong Shen, Jue Wang, Jiaya Jia

In this paper, we study an unconventional but practically meaningful reversibility problem of commonly used image filters.

Detail-revealing Deep Video Super-resolution

1 code implementation • ICCV 2017 • Xin Tao, Hongyun Gao, Renjie Liao, Jue Wang, Jiaya Jia

In this paper, we show that proper frame alignment and motion compensation are crucial for achieving high-quality results.

Ranked #11 on Video Super-Resolution on Vid4 - 4x upscaling

Image Super-Resolution Motion Compensation +1

Convolutional Neural Pyramid for Image Processing

no code implementations • 7 Apr 2017 • Xiaoyong Shen, Ying-Cong Chen, Xin Tao, Jiaya Jia

We propose a principled convolutional neural pyramid (CNP) framework for general low-level vision and image processing tasks.

Colorization Image Enhancement +2

High-Quality Correspondence and Segmentation Estimation for Dual-Lens Smart-Phone Portraits

no code implementations • ICCV 2017 • Xiaoyong Shen, Hongyun Gao, Xin Tao, Chao Zhou, Jiaya Jia

Estimating correspondence between two images and extracting the foreground object are two challenges in computer vision.

Video Super-Resolution via Deep Draft-Ensemble Learning

no code implementations • ICCV 2015 • Renjie Liao, Xin Tao, Ruiyu Li, Ziyang Ma, Jiaya Jia

We propose a new direction for fast video super-resolution (VideoSR) via a SR draft ensemble, which is defined as the set of high-resolution patch candidates before final image deconvolution.

Ensemble Learning Image Deconvolution +1

Handling Motion Blur in Multi-Frame Super-Resolution

no code implementations • CVPR 2015 • Ziyang Ma, Renjie Liao, Xin Tao, Li Xu, Jiaya Jia, Enhua Wu

Ubiquitous motion blur easily fails multi-frame super-resolution (MFSR).

Image Reconstruction Multi-Frame Super-Resolution
