Search Results for author: Xin Tao

Found 37 papers, 15 papers with code

Particularity beyond Commonality: Unpaired Identity Transfer with Multiple References

no code implementations ECCV 2020 Ruizheng Wu, Xin Tao, Ying-Cong Chen, Xiaoyong Shen, Jiaya Jia

Unpaired image-to-image translation aims to translate images from the source class to target one by providing sufficient data for these classes.

Image-to-Image Translation Translation

ZYJ123@DravidianLangTech-EACL2021: Offensive Language Identification based on XLM-RoBERTa with DPCNN

no code implementations EACL (DravidianLangTech) 2021 Yingjia Zhao, Xin Tao

The development of online media platforms has given users more opportunities to post and comment freely, but the negative impact of offensive language has become increasingly apparent.

Language Identification

ZYJ@LT-EDI-EACL2021:XLM-RoBERTa-Based Model with Attention for Hope Speech Detection

no code implementations EACL (LTEDI) 2021 Yingjia Zhao, Xin Tao

We use the attention mechanism to adjust the weight of all the output layers of XLM-RoBERTa to make full use of the information extracted from each layer, and use the weighted sum of all the output layers to complete the classification task.

Hope Speech Detection

UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark

no code implementations15 Apr 2024 Zhaokun Zhou, Qiulin Wang, Bin Lin, Yiwei Su, Rui Chen, Xin Tao, Amin Zheng, Li Yuan, Pengfei Wan, Di Zhang

To further evaluate the IAA capability of MLLMs, we construct the UNIAA-Bench, which consists of three aesthetic levels: Perception, Description, and Assessment.

Language Modelling Large Language Model

Motion Inversion for Video Customization

no code implementations29 Mar 2024 Luozhou Wang, Guibao Shen, Yixun Liang, Xin Tao, Pengfei Wan, Di Zhang, Yijun Li, Yingcong Chen

In this research, we present a novel approach to motion customization in video generation, addressing the widespread gap in the thorough exploration of motion representation within video generative models.

Video Generation

DVIS++: Improved Decoupled Framework for Universal Video Segmentation

1 code implementation20 Dec 2023 Tao Zhang, Xingye Tian, Yikang Zhou, Shunping Ji, Xuebo Wang, Xin Tao, Yuan Zhang, Pengfei Wan, Zhongyuan Wang, Yu Wu

We present the \textbf{D}ecoupled \textbf{VI}deo \textbf{S}egmentation (DVIS) framework, a novel approach for the challenging task of universal video segmentation, including video instance segmentation (VIS), video semantic segmentation (VSS), and video panoptic segmentation (VPS).

Contrastive Learning Denoising +6

Stable Segment Anything Model

1 code implementation27 Nov 2023 Qi Fan, Xin Tao, Lei Ke, Mingqiao Ye, Yuan Zhang, Pengfei Wan, Zhongyuan Wang, Yu-Wing Tai, Chi-Keung Tang

Thus, our solution, termed Stable-SAM, offers several advantages: 1) improved SAM's segmentation stability across a wide range of prompt qualities, while 2) retaining SAM's powerful promptable segmentation efficiency and generality, with 3) minimal learnable parameters (0. 08 M) and fast adaptation (by 1 training epoch).

Segmentation

1st Place Solution for the 5th LSVOS Challenge: Video Instance Segmentation

1 code implementation28 Aug 2023 Tao Zhang, Xingye Tian, Yikang Zhou, Yu Wu, Shunping Ji, Cilin Yan, Xuebo Wang, Xin Tao, Yuan Zhang, Pengfei Wan

Video instance segmentation is a challenging task that serves as the cornerstone of numerous downstream applications, including video editing and autonomous driving.

Autonomous Driving Denoising +6

Scene-Generalizable Interactive Segmentation of Radiance Fields

no code implementations9 Aug 2023 Songlin Tang, Wenjie Pei, Xin Tao, Tanghui Jia, Guangming Lu, Yu-Wing Tai

Existing methods for interactive segmentation in radiance fields entail scene-specific optimization and thus cannot generalize across different scenes, which greatly limits their applicability.

Interactive Segmentation Segmentation +1

Feature Decoupling-Recycling Network for Fast Interactive Segmentation

no code implementations7 Aug 2023 Huimin Zeng, Weinong Wang, Xin Tao, Zhiwei Xiong, Yu-Wing Tai, Wenjie Pei

First, our model decouples the learning of source image semantics from the encoding of user guidance to process two types of input domains separately.

Image Segmentation Interactive Segmentation +3

1st Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation

1 code implementation7 Jun 2023 Tao Zhang, Xingye Tian, Haoran Wei, Yu Wu, Shunping Ji, Xuebo Wang, Xin Tao, Yuan Zhang, Pengfei Wan

In this report, we successfully validated the effectiveness of the decoupling strategy in video panoptic segmentation.

Autonomous Driving Segmentation +2

Compression-Aware Video Super-Resolution

1 code implementation CVPR 2023 Yingwei Wang, Xu Jia, Xin Tao, Takashi Isobe, Huchuan Lu, Yu-Wing Tai

Videos stored on mobile devices or delivered on the Internet are usually in compressed format and are of various unknown compression parameters, but most video super-resolution (VSR) methods often assume ideal inputs resulting in large performance gap between experimental settings and real-world applications.

Model Compression Video Enhancement +1

H-VFI: Hierarchical Frame Interpolation for Videos with Large Motions

no code implementations21 Nov 2022 Changlin Li, Guangyang Wu, Yanan sun, Xin Tao, Chi-Keung Tang, Yu-Wing Tai

The learnt deformable kernel is then utilized in convolving the input frames for predicting the interpolated frame.

Video Frame Interpolation

Multi-criteria Decision-making of Intelligent Vehicles under Fault Condition Enhancing Public-private Partnership

no code implementations27 May 2022 Xin Tao, Mladen Čičić, Jonas Mårtensson

With the proposed method, alternate decisions can be derived to reduce the risks of public time loss significantly with a low increase in the risk of mission delay.

Decision Making

Look Back and Forth: Video Super-Resolution with Explicit Temporal Difference Modeling

1 code implementation CVPR 2022 Takashi Isobe, Xu Jia, Xin Tao, Changlin Li, Ruihuang Li, Yongjie Shi, Jing Mu, Huchuan Lu, Yu-Wing Tai

Instead of directly feeding consecutive frames into a VSR model, we propose to compute the temporal difference between frames and divide those pixels into two subsets according to the level of difference.

Motion Compensation Optical Flow Estimation +1

Real-time decision-making for autonomous vehicles under faults

no code implementations9 Feb 2022 Xin Tao, Zhao Yuan

This paper addresses the challenges of decision-making for autonomous vehicles under faults during a transport mission.

Autonomous Vehicles Computational Efficiency +2

Finding Critical Scenarios for Automated Driving Systems: A Systematic Literature Review

no code implementations16 Oct 2021 Xinhai Zhang, Jianbo Tao, Kaige Tan, Martin Törngren, José Manuel Gaspar Sánchez, Muhammad Rusyadi Ramli, Xin Tao, Magnus Gyllenhammar, Franz Wotawa, Naveen Mohan, Mihai Nica, Hermann Felbinger

The main contributions are: (i) introducing a comprehensive taxonomy for critical scenario identification methods; (ii) giving an overview of the state-of-the-art research based on the taxonomy encompassing 86 papers between 2017 and 2020; and (iii) identifying open issues and directions for further research.

Autonomous Driving

ZYJ at SemEval-2021 Task 7: HaHackathon: Detecting and Rating Humor and Offense with ALBERT-Based Model

no code implementations SEMEVAL 2021 Yingjia Zhao, Xin Tao

This article introduces the submission of subtask 1 and subtask 2 that we participate in SemEval-2021 Task 7: HaHackathon: Detecting and Rating Humor and Offense, we use a model based on ALBERT that uses ALBERT as the module for extracting text features.

Short-term Maintenance Planning of Autonomous Trucks for Minimizing Economic Risk

no code implementations28 May 2021 Xin Tao, Jonas Mårtensson, Håkan Warnquist, Anna Pernestål

We also present a maintenance planning model using a risk-based decision-making method, which identifies the maintenance decision with minimal economic risk of the truck company.

Autonomous Driving Decision Making +1

MuCAN: Multi-Correspondence Aggregation Network for Video Super-Resolution

1 code implementation ECCV 2020 Wenbo Li, Xin Tao, Taian Guo, Lu Qi, Jiangbo Lu, Jiaya Jia

Motivated by these findings, we propose a temporal multi-correspondence aggregation strategy to leverage similar patches across frames, and a cross-scale nonlocal-correspondence aggregation scheme to explore self-similarity of images across scales.

Optical Flow Estimation Video Super-Resolution

VCNet: A Robust Approach to Blind Image Inpainting

2 code implementations ECCV 2020 Yi Wang, Ying-Cong Chen, Xin Tao, Jiaya Jia

Blind inpainting is a task to automatically complete visual contents without specifying masks for missing areas in an image.

Image Inpainting

Robust Conditional GAN from Uncertainty-Aware Pairwise Comparisons

1 code implementation21 Nov 2019 Ligong Han, Ruijiang Gao, Mun Kim, Xin Tao, Bo Liu, Dimitris Metaxas

Conditional generative adversarial networks have shown exceptional generation performance over the past few years.

Attribute Generative Adversarial Network

Attribute-Driven Spontaneous Motion in Unpaired Image Translation

1 code implementation ICCV 2019 Ruizheng Wu, Xin Tao, Xiaodong Gu, Xiaoyong Shen, Jiaya Jia

Current image translation methods, albeit effective to produce high-quality results in various applications, still do not consider much geometric transform.

Attribute Motion Estimation +1

Landmark Assisted CycleGAN for Cartoon Face Generation

no code implementations2 Jul 2019 Ruizheng Wu, Xiaodong Gu, Xin Tao, Xiaoyong Shen, Yu-Wing Tai, Jiaya Jia

In this paper, we are interested in generating an cartoon face of a person by using unpaired training data between real faces and cartoon ones.

Face Generation

Facelet-Bank for Fast Portrait Manipulation

no code implementations CVPR 2018 Ying-Cong Chen, Huaijia Lin, Michelle Shu, Ruiyu Li, Xin Tao, Yangang Ye, Xiaoyong Shen, Jiaya Jia

Digital face manipulation has become a popular and fascinating way to touch images with the prevalence of smartphones and social networks.

Facial Editing

Scale-recurrent Network for Deep Image Deblurring

4 code implementations CVPR 2018 Xin Tao, Hongyun Gao, Yi Wang, Xiaoyong Shen, Jue Wang, Jiaya Jia

In single image deblurring, the "coarse-to-fine" scheme, i. e. gradually restoring the sharp image on different resolutions in a pyramid, is very successful in both traditional optimization-based methods and recent neural-network-based approaches.

Ranked #3 on Image Deblurring on GoPro (Params (M) metric, using extra training data)

Deblurring Image Deblurring +1

Zero-order Reverse Filtering

1 code implementation ICCV 2017 Xin Tao, Chao Zhou, Xiaoyong Shen, Jue Wang, Jiaya Jia

In this paper, we study an unconventional but practically meaningful reversibility problem of commonly used image filters.

Convolutional Neural Pyramid for Image Processing

no code implementations7 Apr 2017 Xiaoyong Shen, Ying-Cong Chen, Xin Tao, Jiaya Jia

We propose a principled convolutional neural pyramid (CNP) framework for general low-level vision and image processing tasks.

Colorization Image Enhancement +2

High-Quality Correspondence and Segmentation Estimation for Dual-Lens Smart-Phone Portraits

no code implementations ICCV 2017 Xiaoyong Shen, Hongyun Gao, Xin Tao, Chao Zhou, Jiaya Jia

Estimating correspondence between two images and extracting the foreground object are two challenges in computer vision.

Video Super-Resolution via Deep Draft-Ensemble Learning

no code implementations ICCV 2015 Renjie Liao, Xin Tao, Ruiyu Li, Ziyang Ma, Jiaya Jia

We propose a new direction for fast video super-resolution (VideoSR) via a SR draft ensemble, which is defined as the set of high-resolution patch candidates before final image deconvolution.

Ensemble Learning Image Deconvolution +1

Cannot find the paper you are looking for? You can Submit a new open access paper.