Search Results for author: Xiaoyu Shi

Found 17 papers, 10 papers with code

Decoupled Spatial-Temporal Transformer for Video Inpainting

1 code implementation • 14 Apr 2021 • Rui Liu, Hanming Deng, Yangyi Huang, Xiaoyu Shi, Lewei Lu, Wenxiu Sun, Xiaogang Wang, Jifeng Dai, Hongsheng Li

Seamless combination of these two novel designs forms a better spatial-temporal attention scheme and our proposed model achieves better performance than state-of-the-art video inpainting approaches with significant boosted efficiency.

Video Inpainting

Paper
Code

Exploring the Quality of GAN Generated Images for Person Re-Identification

no code implementations • 23 Aug 2021 • Yiqi Jiang, Weihua Chen, Xiuyu Sun, Xiaoyu Shi, Fan Wang, Hao Li

Recently, GAN based method has demonstrated strong effectiveness in generating augmentation data for person re-identification (ReID), on account of its ability to bridge the gap between domains and enrich the data variety in feature space.

Person Re-Identification Unsupervised Domain Adaptation

Paper
Add Code

FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting

1 code implementation • ICCV 2021 • Rui Liu, Hanming Deng, Yangyi Huang, Xiaoyu Shi, Lewei Lu, Wenxiu Sun, Xiaogang Wang, Jifeng Dai, Hongsheng Li

On the contrary, the soft composition operates by stitching different patches into a whole feature map where pixels in overlapping regions are summed up.

Ranked #3 on Video Inpainting on DAVIS

Seeing Beyond the Visible Video Inpainting

102

Paper
Code

FlowFormer: A Transformer Architecture for Optical Flow

1 code implementation • 30 Mar 2022 • Zhaoyang Huang, Xiaoyu Shi, Chao Zhang, Qiang Wang, Ka Chun Cheung, Hongwei Qin, Jifeng Dai, Hongsheng Li

We introduce optical Flow transFormer, dubbed as FlowFormer, a transformer-based neural network architecture for learning optical flow.

Ranked #1 on Optical Flow Estimation on Sintel-final

Optical Flow Estimation

374

Paper
Code

A Simple Baseline for Video Restoration with Grouped Spatial-temporal Shift

1 code implementation • CVPR 2023 • Dasong Li, Xiaoyu Shi, Yi Zhang, Ka Chun Cheung, Simon See, Xiaogang Wang, Hongwei Qin, Hongsheng Li

In this study, we propose a simple yet effective framework for video restoration.

Ranked #1 on Deblurring on GoPro (using extra training data)

Deblurring Denoising +3

Paper
Code

Learning the policy for mixed electric platoon control of automated and human-driven vehicles at signalized intersection: a random search approach

no code implementations • 24 Jun 2022 • Xia Jiang, Jian Zhang, Xiaoyu Shi, Jian Cheng

Meanwhile, the simulation results demonstrate the effectiveness of the delay reward, which is designed to outperform distributed reward mechanism} Compared with normal car-following behavior, the sensitivity analysis reveals that the energy can be saved to different extends (39. 27%-82. 51%) by adjusting the relative importance of the optimization goal.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Unsupervised Domain Adaptive Fundus Image Segmentation with Category-level Regularization

1 code implementation • 8 Jul 2022 • Wei Feng, Lin Wang, Lie Ju, Xin Zhao, Xin Wang, Xiaoyu Shi, ZongYuan Ge

Existing unsupervised domain adaptation methods based on adversarial learning have achieved good performance in several medical imaging tasks.

Image Segmentation Semantic Segmentation +1

Paper
Code

FlowFormer++: Masked Cost Volume Autoencoding for Pretraining Optical Flow Estimation

1 code implementation • CVPR 2023 • Xiaoyu Shi, Zhaoyang Huang, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li

FlowFormer introduces a transformer architecture into optical flow estimation and achieves state-of-the-art performance.

Optical Flow Estimation

Paper
Code

KBNet: Kernel Basis Network for Image Restoration

1 code implementation • 6 Mar 2023 • Yi Zhang, Dasong Li, Xiaoyu Shi, Dailan He, Kangning Song, Xiaogang Wang, Hongwei Qin, Hongsheng Li

In this paper, we propose a kernel basis attention (KBA) module, which introduces learnable kernel bases to model representative image patterns for spatial information aggregation.

Ranked #1 on Color Image Denoising on McMaster sigma50

Color Image Denoising Deblurring +4

176

Paper
Code

BlinkFlow: A Dataset to Push the Limits of Event-based Optical Flow Estimation

no code implementations • 14 Mar 2023 • Yijin Li, Zhaoyang Huang, Shuo Chen, Xiaoyu Shi, Hongsheng Li, Hujun Bao, Zhaopeng Cui, Guofeng Zhang

BlinkSim consists of a configurable rendering engine and a flexible engine for event data simulation.

Event-based Optical Flow Optical Flow Estimation

Paper
Add Code

VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation

1 code implementation • ICCV 2023 • Xiaoyu Shi, Zhaoyang Huang, Weikang Bian, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li

We first propose a TRi-frame Optical Flow (TROF) module that estimates bi-directional optical flows for the center frame in a three-frame manner.

Optical Flow Estimation

211

Paper
Code

Context-PIPs: Persistent Independent Particles Demands Spatial Context Features

no code implementations • 3 Jun 2023 • Weikang Bian, Zhaoyang Huang, Xiaoyu Shi, Yitong Dong, Yijin Li, Hongsheng Li

We tackle the problem of Persistent Independent Particles (PIPs), also called Tracking Any Point (TAP), in videos, which specifically aims at estimating persistent long-term trajectories of query points in videos.

Point Tracking

Paper
Add Code

FlowFormer: A Transformer Architecture and Its Masked Cost Volume Autoencoding for Optical Flow

no code implementations • 8 Jun 2023 • Zhaoyang Huang, Xiaoyu Shi, Chao Zhang, Qiang Wang, Yijin Li, Hongwei Qin, Jifeng Dai, Xiaogang Wang, Hongsheng Li

This paper introduces a novel transformer-based network architecture, FlowFormer, along with the Masked Cost Volume AutoEncoding (MCVA) for pretraining it to tackle the problem of optical flow estimation.

Optical Flow Estimation

Paper
Add Code

Cross-modality Attention Adapter: A Glioma Segmentation Fine-tuning Method for SAM Using Multimodal Brain MR Images

no code implementations • 3 Jul 2023 • Xiaoyu Shi, Shurong Chai, Yinhao Li, Jingliang Cheng, Jie Bai, Guohua Zhao, Yen-Wei Chen

However, for medical images with small dataset sizes, deep learning methods struggle to achieve better results on real-world image datasets.

Paper
Add Code

Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling

no code implementations • 29 Jan 2024 • Xiaoyu Shi, Zhaoyang Huang, Fu-Yun Wang, Weikang Bian, Dasong Li, Yi Zhang, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li

For the first stage, we propose a diffusion-based motion field predictor, which focuses on deducing the trajectories of the reference image's pixels.

Image to Video Generation

Paper
Add Code

AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning

1 code implementation • 1 Feb 2024 • Fu-Yun Wang, Zhaoyang Huang, Xiaoyu Shi, Weikang Bian, Guanglu Song, Yu Liu, Hongsheng Li

We validate the proposed strategy in image-conditioned video generation and layout-conditioned video generation, all achieving top-performing results.

Conditional Image Generation Denoising +1

405

Paper
Code

Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation

1 code implementation • 20 Mar 2024 • Fu-Yun Wang, Xiaoshi Wu, Zhaoyang Huang, Xiaoyu Shi, Dazhong Shen, Guanglu Song, Yu Liu, Hongsheng Li

We introduce MOTIA Mastering Video Outpainting Through Input-Specific Adaptation, a diffusion-based pipeline that leverages both the intrinsic data-specific patterns of the source video and the image/video generative prior for effective outpainting.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.