no code implementations • ECCV 2020 • Yuan Tian, Zhaohui Che, Wenbo Bao, Guangtao Zhai, Zhiyong Gao
Motion representation is key to many computer vision problems but has never been well studied in the literature.
1 code implementation • ICCV 2023 • Yuan Tian, Guo Lu, Guangtao Zhai, Zhiyong Gao
Most video compression methods aim to improve the decoded video visual quality, instead of particularly guaranteeing the semantic-completeness, which deteriorates downstream video analysis tasks, e. g., action recognition.
2 code implementations • 25 Jun 2022 • Wang Shen, Cheng Ming, Wenbo Bao, Guangtao Zhai, Li Chen, Zhiyong Gao
With AutoFI and SktFI, the interpolated animation frames show high perceptual quality.
1 code implementation • 6 Feb 2022 • Yuan Tian, Guo Lu, Yichao Yan, Guangtao Zhai, Li Chen, Zhiyong Gao
The framework is optimized by ensuring that a transportation-efficient semantic representation of the video is preserved w. r. t.
no code implementations • NeurIPS 2021 • Cong Geng, Jia Wang, Zhiyong Gao, Jes Frellsen, Søren Hauberg
Energy-based models (EBMs) provide an elegant framework for density estimation, but they are notoriously difficult to train.
no code implementations • 2 Aug 2021 • Jingqian Sun, Pei Wang, Zhiyong Gao, Zichu Liu, Yaxin Li, Xiaozheng Gan
Tree point cloud was classified into wood points and leaf points by using intensity threshold, neighborhood density and voxelization successively.
1 code implementation • ICCV 2021 • Yuan Tian, Guo Lu, Xiongkuo Min, Zhaohui Che, Guangtao Zhai, Guodong Guo, Zhiyong Gao
After optimization, the downscaled video by our framework preserves more meaningful information, which is beneficial for both the upscaling step and the downstream tasks, e. g., video action recognition task.
1 code implementation • 22 Jul 2021 • Yuan Tian, Yichao Yan, Guangtao Zhai, Guodong Guo, Zhiyong Gao
In this paper, we propose a unified action recognition framework to investigate the dynamic nature of video content by introducing the following designs.
Ranked #15 on
Action Recognition
on Something-Something V1
no code implementations • 17 Mar 2021 • Wang Shen, Wenbo Bao, Guangtao Zhai, Charlie L Wang, Jerry W Hu, Zhiyong Gao
An effective approach is to transmit frames in lower-quality under poor bandwidth conditions, such as using scalable video coding.
no code implementations • 23 Sep 2020 • Cong Geng, Jia Wang, Li Chen, Zhiyong Gao
Variational Autoencoder (VAE) and its variations are classic generative models by learning a low-dimensional latent representation to satisfy some prior distribution (e. g., Gaussian distribution).
no code implementations • 22 Jul 2020 • Yuan Tian, Guangtao Zhai, Zhiyong Gao
More specifically, an \textit{action perceptron synthesizer} is proposed to generate the kernels from a bag of fixed-size kernels that are interacted by dense routing paths.
no code implementations • ECCV 2020 • Guo Lu, Chunlei Cai, Xiaoyun Zhang, Li Chen, Wanli Ouyang, Dong Xu, Zhiyong Gao
Therefore, the encoder is adaptive to different video contents and achieves better compression performance by reducing the domain gap between the training and testing datasets.
1 code implementation • CVPR 2020 • Wang Shen, Wenbo Bao, Guangtao Zhai, Li Chen, Xiongkuo Min, Zhiyong Gao
Existing works reduce motion blur and up-convert frame rate through two separate ways, including frame deblurring and frame interpolation.
no code implementations • 12 Feb 2020 • Cong Geng, Jia Wang, Li Chen, Wenbo Bao, Chu Chu, Zhiyong Gao
Based on this defined Riemannian metric, we introduce a constant speed loss and a minimizing geodesic loss to regularize the interpolation network to generate uniform interpolation along the learned geodesic on the manifold.
5 code implementations • CVPR 2019 • Wenbo Bao, Wei-Sheng Lai, Chao Ma, Xiaoyun Zhang, Zhiyong Gao, Ming-Hsuan Yang
The proposed model then warps the input frames, depth maps, and contextual features based on the optical flow and local interpolation kernels for synthesizing the output frame.
Ranked #5 on
Video Frame Interpolation
on Middlebury
4 code implementations • CVPR 2019 • Guo Lu, Wanli Ouyang, Dong Xu, Xiaoyun Zhang, Chunlei Cai, Zhiyong Gao
Conventional video compression approaches use the predictive coding architecture and encode the corresponding motion information and residual information.
1 code implementation • 20 Oct 2018 • Wenbo Bao, Wei-Sheng Lai, Xiaoyun Zhang, Zhiyong Gao, Ming-Hsuan Yang
Recently, a number of data-driven frame interpolation methods based on convolutional neural networks have been proposed.
Ranked #21 on
Video Frame Interpolation
on Vimeo90K
1 code implementation • arXiv 2018 • Wenbo Bao, Wei-Sheng Lai, Xiaoyun Zhang, Zhiyong Gao, Ming-Hsuan Yang
In this work, we propose a motion estimation and motion compensation driven neural network for video frame interpolation.
Ranked #6 on
Video Frame Interpolation
on Middlebury
1 code implementation • ECCV 2018 • Guo Lu, Wanli Ouyang, Dong Xu, Xiaoyun Zhang, Zhiyong Gao, Ming-Ting Sun
In this paper, we model the video artifact reduction task as a Kalman filtering procedure and restore decoded frames through a deep Kalman filtering network.