no code implementations • CVPR 2024 • Xingtong Ge, Jixiang Luo, Xinjie Zhang, Tongda Xu, Guo Lu, Dailan He, Jing Geng, Yan Wang, Jun Zhang, Hongwei Qin
Prior research on deep video compression (DVC) for machine tasks typically necessitates training a unique codec for each specific task, mandating a dedicated decoder per task.
no code implementations • 19 Mar 2024 • Jixiang Luo, Yan Wang, Hongwei Qin
MSE-based models aim to improve objective metrics while generative models are leveraged to improve visual quality measured by subjective metrics.
1 code implementation • 13 Mar 2024 • Xinjie Zhang, Xingtong Ge, Tongda Xu, Dailan He, Yan Wang, Hongwei Qin, Guo Lu, Jing Geng, Jun Zhang
In response, we propose a groundbreaking paradigm of image representation and compression by 2D Gaussian Splatting, named GaussianImage.
1 code implementation • CVPR 2024 • Xinjie Zhang, Ren Yang, Dailan He, Xingtong Ge, Tongda Xu, Yan Wang, Hongwei Qin, Jun Zhang
Implicit neural representations (INRs) have emerged as a promising approach for video storage and processing, showing remarkable versatility across various video tasks.
no code implementations • 9 Feb 2024 • Tongda Xu, Ziran Zhu, Jian Li, Dailan He, Yuanyuan Wang, Ming Sun, Ling Li, Hongwei Qin, Yan Wang, Jingjing Liu, Ya-Qin Zhang
Diffusion Inverse Solvers (DIS) are designed to sample from the conditional distribution $p_{\theta}(X_0|y)$, with a predefined diffusion model $p_{\theta}(X_0)$, an operator $f(\cdot)$, and a measurement $y=f(x'_0)$ derived from an unknown image $x'_0$.
no code implementations • 29 Jan 2024 • Xiaoyu Shi, Zhaoyang Huang, Fu-Yun Wang, Weikang Bian, Dasong Li, Yi Zhang, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li
For the first stage, we propose a diffusion-based motion field predictor, which focuses on deducing the trajectories of the reference image's pixels.
1 code implementation • 17 Jan 2024 • Tongda Xu, Ziran Zhu, Dailan He, Yanghao Li, Lina Guo, Yuanyuan Wang, Zhe Wang, Hongwei Qin, Yan Wang, Jingjing Liu, Ya-Qin Zhang
However, we find that theoretically: 1) Conditional generative model-based perceptual codec satisfies idempotence; 2) Unconditional generative model with idempotence constraint is equivalent to conditional generative codec.
no code implementations • 5 Dec 2023 • Jianghui Zhang, Yuanyuan Wang, Lina Guo, Jixiang Luo, Tongda Xu, Yan Wang, Zhi Wang, Hongwei Qin
Most image compression algorithms only consider uncompressed original image, while ignoring a large number of already existing JPEG images.
no code implementations • 25 Aug 2023 • Lina Guo, Yuanyuan Wang, Tongda Xu, Jixiang Luo, Dailan He, Zhenjun Ji, Shanshan Wang, Yang Wang, Hongwei Qin
Second, we propose pipeline parallel context model (PPCM) and compressed checkerboard context model (CCCM) for the effective conditional modeling and efficient decoding within luma and chroma components.
no code implementations • 16 Aug 2023 • Tongda Xu, Qian Zhang, Yanghao Li, Dailan He, Zhe Wang, Yuanyuan Wang, Hongwei Qin, Yan Wang, Jingjing Liu, Ya-Qin Zhang
We propose conditional perceptual quality, an extension of the perceptual quality defined in \citet{blau2018perception}, by conditioning it on user defined information.
no code implementations • 8 Jun 2023 • Zhaoyang Huang, Xiaoyu Shi, Chao Zhang, Qiang Wang, Yijin Li, Hongwei Qin, Jifeng Dai, Xiaogang Wang, Hongsheng Li
This paper introduces a novel transformer-based network architecture, FlowFormer, along with the Masked Cost Volume AutoEncoding (MCVA) for pretraining it to tackle the problem of optical flow estimation.
1 code implementation • ICCV 2023 • Xiaoyu Shi, Zhaoyang Huang, Weikang Bian, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li
We first propose a TRi-frame Optical Flow (TROF) module that estimates bi-directional optical flows for the center frame in a three-frame manner.
1 code implementation • 6 Mar 2023 • Yi Zhang, Dasong Li, Xiaoyu Shi, Dailan He, Kangning Song, Xiaogang Wang, Hongwei Qin, Hongsheng Li
In this paper, we propose a kernel basis attention (KBA) module, which introduces learnable kernel bases to model representative image patterns for spatial information aggregation.
Ranked #1 on
Grayscale Image Denoising
on BSD68 sigma25
1 code implementation • CVPR 2023 • Xiaoyu Shi, Zhaoyang Huang, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li
FlowFormer introduces a transformer architecture into optical flow estimation and achieves state-of-the-art performance.
no code implementations • 29 Sep 2022 • Tongda Xu, Yifan Shao, Yan Wang, Hongwei Qin
In recent years, there has been widespread attention drawn to convolutional neural network (CNN) based blind image quality assessment (IQA).
no code implementations • 29 Sep 2022 • Tongda Xu, Han Gao, Yuanyuan Wang, Hongwei Qin, Yan Wang, Jingjing Liu, Ya-Qin Zhang
In this paper, we investigate the problem of bit allocation in Neural Video Compression (NVC).
no code implementations • 28 Sep 2022 • Tongda Xu, Yan Wang, Dailan He, Chenjian Gao, Han Gao, Kunzan Liu, Hongwei Qin
This paper considers the problem of lossy neural image compression (NIC).
1 code implementation • 20 Sep 2022 • Tongda Xu, Han Gao, Chenjian Gao, Yuanyuan Wang, Dailan He, Jinyong Pi, Jixiang Luo, Ziyu Zhu, Mao Ye, Hongwei Qin, Yan Wang, Jingjing Liu, Ya-Qin Zhang
In this paper, we consider the problem of bit allocation in Neural Video Compression (NVC).
no code implementations • 19 Sep 2022 • Chenjian Gao, Tongda Xu, Dailan He, Hongwei Qin, Yan Wang
Neural image compression (NIC) has outperformed traditional image codecs in rate-distortion (R-D) performance.
1 code implementation • 10 Aug 2022 • Dasong Li, Yi Zhang, Ka Chun Cheung, Xiaogang Wang, Hongwei Qin, Hongsheng Li
With the integration, MSDI-Net can handle various and complicated blurry patterns adaptively.
Ranked #21 on
Image Deblurring
on GoPro
no code implementations • 29 Jul 2022 • Hongjiu Yu, Qiancheng Sun, Jin Hu, Xingyuan Xue, Jixiang Luo, Dailan He, Yilong Li, Pengbo Wang, Yuanyuan Wang, Yaxu Dai, Yan Wang, Hongwei Qin
On CPU, the latency of our implementation is comparable with JPEG XL.
1 code implementation • CVPR 2023 • Dasong Li, Xiaoyu Shi, Yi Zhang, Ka Chun Cheung, Simon See, Xiaogang Wang, Hongwei Qin, Hongsheng Li
In this study, we propose a simple yet effective framework for video restoration.
Ranked #2 on
Deblurring
on GoPro
(using extra training data)
1 code implementation • 28 May 2022 • Dailan He, Ziming Yang, Hongjiu Yu, Tongda Xu, Jixiang Luo, Yuan Chen, Chenjian Gao, Xinjie Shi, Hongwei Qin, Yan Wang
In the past years, learned image compression (LIC) has achieved remarkable performance.
no code implementations • 10 May 2022 • Dasong Li, Yi Zhang, Ka Lung Law, Xiaogang Wang, Hongwei Qin, Hongsheng Li
As for each sub-network, we propose an efficient multi-frequency denoising network to remove noise of different frequencies.
1 code implementation • 30 Mar 2022 • Zhaoyang Huang, Xiaoyu Shi, Chao Zhang, Qiang Wang, Ka Chun Cheung, Hongwei Qin, Jifeng Dai, Hongsheng Li
We introduce optical Flow transFormer, dubbed as FlowFormer, a transformer-based neural network architecture for learning optical flow.
Ranked #1 on
Optical Flow Estimation
on Sintel-final
no code implementations • CVPR 2022 • Lina Guo, Xinjie Shi, Dailan He, Yuanyuan Wang, Rui Ma, Hongwei Qin, Yan Wang
JPEG is a popular image compression method widely used by individuals, data center, cloud storage and network filesystems.
6 code implementations • CVPR 2022 • Dailan He, Ziming Yang, Weikun Peng, Rui Ma, Hongwei Qin, Yan Wang
Recently, learned image compression techniques have achieved remarkable performance, even surpassing the best manually designed lossy image coders.
Ranked #1 on
Image Compression
on kodak
no code implementations • 15 Feb 2022 • Dailan He, Ziming Yang, Yuan Chen, Qi Zhang, Hongwei Qin, Yan Wang
It has been witnessed that learned image compression has outperformed conventional image coding techniques and tends to be practical in industrial applications.
1 code implementation • CVPR 2022 • Yi Zhang, Dasong Li, Ka Lung Law, Xiaogang Wang, Hongwei Qin, Hongsheng Li
To evaluate raw image denoising performance in real-world applications, we build a high-quality raw image dataset SenseNoise-500 that contains 500 real-life scenes.
1 code implementation • ICCV 2021 • Yi Zhang, Hongwei Qin, Xiaogang Wang, Hongsheng Li
However, the real raw image noise is contributed by many noise sources and varies greatly among different sensors.
Ranked #2 on
Image Denoising
on SID SonyA7S2 x100
no code implementations • 30 Sep 2021 • Baocheng Sun, Meng Gu, Dailan He, Tongda Xu, Yan Wang, Hongwei Qin
Learned image compression is making good progress in recent years.
no code implementations • 29 Sep 2021 • Dailan He, Ziming Yang, Yan Wang, Yuan Chen, Qi Zhang, Hongwei Qin
It has been witnessed that learned image compression has outperformed conventional image coding techniques and tends to be practical in industrial applications.
4 code implementations • CVPR 2021 • Dailan He, Yaoyan Zheng, Baocheng Sun, Yan Wang, Hongwei Qin
To the best of our knowledge, this is the first exploration on parallelization-friendly spatial context model for learned image compression.
4 code implementations • 14 Jan 2020 • Yongqiang Yao, Yan Wang, Yu Guo, Jiaojiao Lin, Hongwei Qin, Junjie Yan
Given two or more already labeled datasets that target for different object classes, cross-dataset training aims to detect the union of the different classes, so that we do not have to label all the classes for all the datasets.
no code implementations • ICLR 2019 • Wei Gao, Yi Wei, Quanquan Li, Hongwei Qin, Wanli Ouyang, Junjie Yan
Hints can improve the performance of student model by transferring knowledge from teacher model.
no code implementations • ECCV 2018 • Yi Wei, Xinyu Pan, Hongwei Qin, Wanli Ouyang, Junjie Yan
To the best of our knowledge, our method, called Quantization Mimic, is the first one focusing on very tiny networks.
no code implementations • 16 Dec 2017 • Congrui Hetang, Hongwei Qin, Shaohui Liu, Junjie Yan
Video object detection is more challenging compared to image object detection.
no code implementations • CVPR 2017 • Zekun Hao, Yu Liu, Hongwei Qin, Junjie Yan, Xiu Li, Xiaolin Hu
Then the scale histogram guides the zoom-in and zoom-out of the image.
no code implementations • CVPR 2016 • Hongwei Qin, Junjie Yan, Xiu Li, Xiaolin Hu
Cascade has been widely used in face detection, where classifier with low computation cost can be firstly used to shrink most of the background while keeping the recall.