1 code implementation • CVPR 2020 • Jiayu Yang, Wei Mao, Jose M. Alvarez, Miaomiao Liu
We propose a cost volume-based neural network for depth inference from multi-view images.
Ranked #14 on 3D Reconstruction on DTU
no code implementations • 26 Mar 2021 • Dewang Hou, Yang Zhao, Yuyao Ye, Jiayu Yang, Jian Zhang, Ronggang Wang
Scaling and lossy coding are widely used in video transmission and storage.
1 code implementation • CVPR 2021 • Jiayu Yang, Jose M. Alvarez, Miaomiao Liu
Here, we propose a self-supervised learning framework for multi-view stereo that exploit pseudo labels from the input data.
1 code implementation • 21 Apr 2021 • Ren Yang, Radu Timofte, Jing Liu, Yi Xu, Xinjian Zhang, Minyi Zhao, Shuigeng Zhou, Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy, Xin Li, Fanglong Liu, He Zheng, Lielin Jiang, Qi Zhang, Dongliang He, Fu Li, Qingqing Dang, Yibin Huang, Matteo Maggioni, Zhongqian Fu, Shuai Xiao, Cheng Li, Thomas Tanay, Fenglong Song, Wentao Chao, Qiang Guo, Yan Liu, Jiang Li, Xiaochao Qu, Dewang Hou, Jiayu Yang, Lyn Jiang, Di You, Zhenyu Zhang, Chong Mou, Iaroslav Koshelev, Pavel Ostyakov, Andrey Somov, Jia Hao, Xueyi Zou, Shijie Zhao, Xiaopeng Sun, Yiting Liao, Yuanzhi Zhang, Qing Wang, Gen Zhan, Mengxi Guo, Junlin Li, Ming Lu, Zhan Ma, Pablo Navarrete Michelini, Hai Wang, Yiyun Chen, Jingyu Guo, Liliang Zhang, Wenming Yang, Sijung Kim, Syehoon Oh, Yucong Wang, Minjie Cai, Wei Hao, Kangdi Shi, Liangyan Li, Jun Chen, Wei Gao, Wang Liu, XiaoYu Zhang, Linjie Zhou, Sixin Lin, Ru Wang
This paper reviews the first NTIRE challenge on quality enhancement of compressed video, with a focus on the proposed methods and results.
1 code implementation • CVPR 2022 • Jiayu Yang, Jose M. Alvarez, Miaomiao Liu
Boundary pixels usually follow a multi-modal distribution as they represent different depths; Therefore, the assumption results in an erroneous depth prediction at the coarser level of the cost volume pyramid and can not be corrected in the refinement levels leading to wrong depth predictions.
1 code implementation • 14 Nov 2022 • Wei Jiang, Jiayu Yang, Yongqi Zhai, Peirong Ning, Feng Gao, Ronggang Wang
Based on MEM and MEM$^+$, we propose image compression models MLIC and MLIC$^+$.
Ranked #1 on Image Compression on kodak
1 code implementation • ICCV 2023 • Jiayu Yang, Enze Xie, Miaomiao Liu, Jose M. Alvarez
In contrast, we propose to use parametric depth distribution modeling for feature transformation.
no code implementations • 6 Mar 2023 • Feng Wang, Haihang Ruan, Fei Xiong, Jiayu Yang, Litian Li, Ronggang Wang
Using more reference frames can significantly improve the compression efficiency in neural video compression.
no code implementations • 19 Apr 2023 • Wei Jiang, Peirong Ning, Jiayu Yang, Yongqi Zhai, Feng Gao, Ronggang Wang
To demonstrate the effectiveness of proposed transform coding, we align the entropy model to compare with existing transform methods and obtain models LLIC-STF, LLIC-ELIC, LLIC-TCM.
1 code implementation • 9 Jul 2023 • Jiayu Yang, Enze Xie, Miaomiao Liu, Jose M. Alvarez
In contrast, we propose to use parametric depth distribution modeling for feature transformation.
1 code implementation • 28 Jul 2023 • Wei Jiang, Jiayu Yang, Yongqi Zhai, Feng Gao, Ronggang Wang
Additionally, to capture global contexts, we propose the linear complexity attention-based global correlations capturing by leveraging the decomposition of the softmax operation.
Ranked #1 on Image Compression on kodak
no code implementations • 8 Sep 2023 • Ziang Cheng, Jiayu Yang, Hongdong Li
One of the major difficulties is the lack of high-quality indoor video stereo training datasets captured by head-mounted VR/AR glasses.
no code implementations • 21 Sep 2023 • Jiakang Li, Songning Lai, Zhihao Shuai, Yuan Tan, Yifan Jia, Mianyang Yu, Zichen Song, Xiaokang Peng, Ziyang Xu, Yongxin Ni, Haifeng Qiu, Jiayu Yang, Yutong Liu, Yonggang Lu
This review article delves into the topic of community detection in graphs, which serves as a thorough exposition of various community detection methods from perspectives of modularity-based method, spectral clustering, probabilistic modelling, and deep learning.
1 code implementation • 16 Oct 2023 • Jiayu Yang, Ziang Cheng, Yunfei Duan, Pan Ji, Hongdong Li
Given a single image of a 3D object, this paper proposes a novel method (named ConsistNet) that is able to generate multiple images of the same object, as if seen they are captured from different viewpoints, while the 3D (multi-view) consistencies among those multiple generated images are effectively exploited.
no code implementations • 2 Feb 2024 • Jiayu Yang, Wei Jiang, Yongqi Zhai, Chunhui Yang, Ronggang Wang
This paper presents a learned video compression method in response to video compression track of the 6th Challenge on Learned Image Compression (CLIC), at DCC 2024. Specifically, we propose a unified contextual video compression framework (UCVC) for joint P-frame and B-frame coding.