1 code implementation • 3 Oct 2024 • Ming Lu, Zhihao Duan, Wuyang Cong, Dandan Ding, Fengqing Zhu, Zhan Ma
This feature-space processing operates from the lowest to the highest scale of each frame, completely eliminating the need for the complexity-intensive motion estimation and compensation techniques that have been standard in video codecs for decades.
1 code implementation • 29 Sep 2024 • Xu Zhang, Peiyao Guo, Ming Lu, Zhan Ma
Experimental results show that MPA achieves performance comparable to state-of-the-art methods in both task-specific and multi-objective optimization across human viewing and machine analysis tasks.
no code implementations • 2 Sep 2024 • Muchen Dong, Ming Lu, Zhan Ma
Despite the unprecedented compression efficiency achieved by deep learned image compression (LIC), existing methods usually approximate the desired bitrate by adjusting a single quality factor for a given input image, which may compromise the rate control results.
no code implementations • 1 Aug 2024 • Qianyun He, Xinya Ji, Yicheng Gong, Yuanxun Lu, Zhengyu Diao, Linjia Huang, Yao Yao, Siyu Zhu, Zhan Ma, Songcen Xu, Xiaofei Wu, Zixiao Zhang, Xun Cao, Hao Zhu
We present a novel approach for synthesizing 3D talking heads with controllable emotion, featuring enhanced lip synchronization and rendering quality.
1 code implementation • 31 Jul 2024 • Junqi Shi, Mingyi Jiang, Ming Lu, Tong Chen, Xun Cao, Zhan Ma
For downstream classification on compressed HSI, we theoretically demonstrate the task accuracy is not only related to the classification loss but also to the reconstruction fidelity through a first-order expansion of the accuracy degradation, and accordingly adapt the reconstruction by introducing Adaptive Spectral Weighting.
no code implementations • 28 Jul 2024 • Hao Zhu, Zhen Liu, Qi Zhang, Jingde Fu, Weibing Deng, Zhan Ma, Yanwen Guo, Xun Cao
By initializing the bias of the neural network with different ranges, sub-functions with various frequencies in the variable-periodic function are selected for activation.
no code implementations • 11 Jul 2024 • Delong Wu, Hao Zhu, Qi Zhang, You Li, Zhan Ma, Xun Cao
To tackle this issue, we introduce the Neural Poisson Solver, a plug-and-play and universally applicable framework across different signal dimensions for blending visual signals represented by INRs.
no code implementations • CVPR 2024 • Qi Zhao, M. Salman Asif, Zhan Ma
To address this issue, we introduce the Pyramidal Neural Representation for Videos (PNeRV), which is built on a multi-scale information connection and comprises a lightweight rescaling operator, Kronecker Fully-connected layer (KFc), and a Benign Selective Memory (BSM) mechanism.
1 code implementation • 10 Apr 2024 • Kang You, Pan Gao, Zhan Ma
In this paper, we propose PoLoPCAC, an efficient and generic lossless PCAC method that achieves high compression efficiency and strong generalizability simultaneously.
1 code implementation • CVPR 2024 • Zhihao Duan, Ming Lu, Justin Yang, Jiangpeng He, Zhan Ma, Fengqing Zhu
This paper explores the possibility of extending the capability of pre-trained neural image compressors (e. g., adapting to new data or target bitrates) without breaking backward compatibility, the ability to decode bitstreams encoded by the original model.
no code implementations • 21 Jan 2024 • Yichi Zhang, Zhihao Duan, Ming Lu, Dandan Ding, Fengqing Zhu, Zhan Ma
While convolution and self-attention are extensively used in learned image compression (LIC) for transform coding, this paper proposes an alternative called Contextual Clustering based LIC (CLIC) which primarily relies on clustering operations and local attention for correlation characterization and compact representation of an image.
no code implementations • 12 Dec 2023 • Ming Lu, Zhihao Duan, Fengqing Zhu, Zhan Ma
Recently, probabilistic predictive coding that directly models the conditional distribution of latent features across successive frames for temporal redundancy removal has yielded promising results.
no code implementations • CVPR 2024 • Zhen Liu, Hao Zhu, Qi Zhang, Jingde Fu, Weibing Deng, Zhan Ma, Yanwen Guo, Xun Cao
Implicit Neural Representation (INR), which utilizes a neural network to map coordinate inputs to corresponding attributes, is causing a revolution in the field of signal processing.
no code implementations • 4 Oct 2023 • Chengkang Shen, Hao Zhu, You Zhou, Yu Liu, Si Yi, Lili Dong, Weipeng Zhao, David J. Brady, Xun Cao, Zhan Ma, Yi Lin
Myocardial motion tracking stands as an essential clinical tool in the prevention and detection of cardiovascular diseases (CVDs), the foremost cause of death globally.
no code implementations • 22 Sep 2023 • Hao Zhu, Fengyi Liu, Qi Zhang, Xun Cao, Zhan Ma
This connection ensures a seamless backpropagation of gradients from the network's output back to the input coordinates, thereby enhancing regularization.
no code implementations • 20 Aug 2023 • Bowei Xu, Hao Chen, Zhan Ma
Unlike direct observation-to-action mapping, Karma recurrently maintains a multi-dimensional time series of observations, returns, and actions as input and employs causal sequence modeling via a decision transformer to determine the next action.
no code implementations • CVPR 2023 • Qi Zhao, M. Salman Asif, Zhan Ma
DNeRV achieves competitive results against the state-of-the-art neural compression approaches and outperforms existing implicit methods on downstream inpainting and interpolation for $960 \times 1920$ videos.
no code implementations • 3 Apr 2023 • Hao Zhu, Shaowen Xie, Zhen Liu, Fengyi Liu, Qi Zhang, You Zhou, Yi Lin, Zhan Ma, Xun Cao
However, the expressive power of INR is limited by the spectral bias in the network training.
no code implementations • 22 Mar 2023 • Jianqiang Wang, Dandan Ding, Zhan Ma
With this aim, we extensively exploit cross-scale, cross-group, and cross-color correlations of point cloud attribute to ensure accurate probability estimation and thus high coding efficiency.
2 code implementations • 16 Feb 2023 • Zhihao Duan, Ming Lu, Jack Ma, Yuning Huang, Zhan Ma, Fengqing Zhu
This paper addresses the problem of lossy image compression, a fundamental problem in image processing and information theory that is involved in many real-world applications.
no code implementations • 28 Jan 2023 • Jianqiang Wang, Dandan Ding, Hao Chen, Zhan Ma
This work extends the Multiscale Sparse Representation (MSR) framework developed for static Point Cloud Geometry Compression (PCGC) to support the dynamic PCGC through the use of multiscale inter conditional coding.
1 code implementation • 15 Dec 2022 • Zhihao LI, Ming Lu, Xu Zhang, Xin Feng, M. Salman Asif, Zhan Ma
Conventional cameras capture image irradiance on a sensor and convert it to RGB images using an image signal processor (ISP).
no code implementations • CVPR 2023 • Shaowen Xie, Hao Zhu, Zhen Liu, Qi Zhang, You Zhou, Xun Cao, Zhan Ma
Implicit neural representation (INR) characterizes the attributes of a signal as a function of corresponding coordinates which emerges as a sharp weapon for solving inverse problems.
no code implementations • 5 Nov 2022 • Junqi Shi, Ming Lu, Zhan Ma
Quantizing a floating-point neural network to its fixed-point representation is crucial for Learned Image Compression (LIC) because it improves decoding consistency for interoperability and reduces space-time complexity for implementation.
no code implementations • 17 Sep 2022 • Dandan Ding, Junzhe Zhang, Jianqiang Wang, Zhan Ma
A learning-based adaptive loop filter is developed for the Geometry-based Point Cloud Compression (G-PCC) standard to reduce attribute compression artifacts.
2 code implementations • 27 Aug 2022 • Zhihao Duan, Ming Lu, Zhan Ma, Fengqing Zhu
Recent research has shown a strong theoretical connection between variational autoencoders (VAEs) and the rate-distortion theory.
no code implementations • 26 Aug 2022 • Ruixiang Xue, Jianqiang Wang, Zhan Ma
Although convolutional representation of multiscale sparse tensor demonstrated its superior efficiency to accurately model the occupancy probability for the compression of geometry component of dense object point clouds, its capacity for representing sparse LiDAR point cloud geometry (PCG) was largely limited.
no code implementations • 4 Aug 2022 • Ming Cheng, Yiling Xu, Wang Shen, M. Salman Asif, Chao Ma, Jun Sun, Zhan Ma
We utilize a disparity network to transfer spatiotemporal information across views even in large disparity scenes, based on which, we propose disparity-guided flow-based warping for LSR-HFR view and complementary warping for HSR-LFR view.
no code implementations • 17 Jun 2022 • Zhan Ma, Zisheng Ye, Wenxiao Pan
Predicting the dynamic behaviors of particles in suspension subject to hydrodynamic interaction (HI) and external drive can be critical for many applications.
1 code implementation • 25 Apr 2022 • Ming Lu, Fangdong Chen, ShiLiang Pu, Zhan Ma
To this end, Integrated Convolution and Self-Attention (ICSA) unit is first proposed to form a content-adaptive transform to characterize and embed neighborhood information dynamically of any input.
1 code implementation • 19 Apr 2022 • Zhihao LI, Si Yi, Zhan Ma
Image signal processing (ISP) is crucial for camera imaging, and neural networks (NN) solutions are extensively deployed for daytime scenes.
1 code implementation • 11 Apr 2022 • Bin Jiang, Zhihao LI, M. Salman Asif, Xun Cao, Zhan Ma
The event camera's low power consumption and ability to capture microsecond brightness changes make it attractive for various computer vision tasks.
1 code implementation • 3 Apr 2022 • Jianqiang Wang, Zhan Ma
Recently, numerous learning-based compression methods have been developed with outstanding performance for the coding of the geometry information of point clouds.
no code implementations • 26 Feb 2022 • Zhihao Duan, Ming Lu, Zhan Ma, Fengqing Zhu
End-to-end learned lossy image coders (LICs), as opposed to hand-crafted image codecs, have shown increasing superiority in terms of the rate-distortion performance.
no code implementations • 16 Dec 2021 • Tong Chen, Zhan Ma
Deep neural network-based image compression has been extensively studied.
2 code implementations • 20 Nov 2021 • Jianqiang Wang, Dandan Ding, Zhu Li, Xiaoxing Feng, Chuntong Cao, Zhan Ma
We call this compression method SparsePCGC.
no code implementations • 12 Nov 2021 • Ming Lu, Peiyao Guo, Huiqing Shi, Chuntong Cao, Zhan Ma
A Transformer-based Image Compression (TIC) approach is developed which reuses the canonical variational autoencoder (VAE) architecture with paired main and hyper encoder-decoders.
no code implementations • 15 Sep 2021 • Yipeng Liu, Qi Yang, Yiling Xu, Zhan Ma
Point cloud compression (PCC) has made remarkable achievement in recent years.
no code implementations • 5 Aug 2021 • Haojie Liu, Ming Lu, Zhiqi Chen, Xun Cao, Zhan Ma, Yao Wang
We further design a one-to-many decoder pipeline to generate multiple predictions from the CSTR, including vector-based resampling, adaptive kernel-based resampling, compensation mode selection maps and texture enhancements, and combines them adaptively to achieve more accurate inter prediction.
no code implementations • 26 Apr 2021 • Jiaoyang Yin, Yiling Xu, Hao Chen, Yunfei Zhang, Steve Appleby, Zhan Ma
Adaptive Bit Rate (ABR) decision plays a crucial role for ensuring satisfactory Quality of Experience (QoE) in video streaming applications, in which past network statistics are mainly leveraged for future network bandwidth prediction.
1 code implementation • 21 Apr 2021 • Ren Yang, Radu Timofte, Jing Liu, Yi Xu, Xinjian Zhang, Minyi Zhao, Shuigeng Zhou, Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy, Xin Li, Fanglong Liu, He Zheng, Lielin Jiang, Qi Zhang, Dongliang He, Fu Li, Qingqing Dang, Yibin Huang, Matteo Maggioni, Zhongqian Fu, Shuai Xiao, Cheng Li, Thomas Tanay, Fenglong Song, Wentao Chao, Qiang Guo, Yan Liu, Jiang Li, Xiaochao Qu, Dewang Hou, Jiayu Yang, Lyn Jiang, Di You, Zhenyu Zhang, Chong Mou, Iaroslav Koshelev, Pavel Ostyakov, Andrey Somov, Jia Hao, Xueyi Zou, Shijie Zhao, Xiaopeng Sun, Yiting Liao, Yuanzhi Zhang, Qing Wang, Gen Zhan, Mengxi Guo, Junlin Li, Ming Lu, Zhan Ma, Pablo Navarrete Michelini, Hai Wang, Yiyun Chen, Jingyu Guo, Liliang Zhang, Wenming Yang, Sijung Kim, Syehoon Oh, Yucong Wang, Minjie Cai, Wei Hao, Kangdi Shi, Liangyan Li, Jun Chen, Wei Gao, Wang Liu, XiaoYu Zhang, Linjie Zhou, Sixin Lin, Ru Wang
This paper reviews the first NTIRE challenge on quality enhancement of compressed video, with a focus on the proposed methods and results.
1 code implementation • 4 Mar 2021 • Qi Yang, Yujie Zhang, Siheng Chen, Yiling Xu, Jun Sun, Zhan Ma
In this paper, we propose a new distortion quantification method for point clouds, the multiscale potential energy discrepancy (MPED).
no code implementations • 1 Dec 2020 • Ming Lu, Tong Chen, Dandan Ding, Fengqing Zhu, Zhan Ma
Inspired by the facts that retinal cells actually segregate the visual scene into different attributes (e. g., spatial details, temporal motion) for respective neuronal processing, we propose to first decompose the input video into respective spatial texture frames (STF) at its native spatial resolution that preserve the rich spatial details, and the other temporal motion frames (TMF) at a lower spatial resolution that retain the motion smoothness; then compress them together using any popular video coder; and finally synthesize decoded STFs and TMFs for high-fidelity video reconstruction at the same resolution as its native input.
3 code implementations • 7 Nov 2020 • Jianqiang Wang, Dandan Ding, Zhu Li, Zhan Ma
Recent years have witnessed the growth of point cloud based applications because of its realistic and fine-grained representation of 3D objects and scenes.
no code implementations • 9 Jul 2020 • Haojie Liu, Ming Lu, Zhan Ma, Fan Wang, Zhihuang Xie, Xun Cao, Yao Wang
Over the past two decades, traditional block-based video coding has made remarkable progress and spawned a series of well-known standards such as MPEG-4, H. 264/AVC and H. 265/HEVC.
1 code implementation • 31 May 2020 • Qi Yang, Zhan Ma, Yiling Xu, Zhu Li, Jun Sun
We propose the GraphSIM -- an objective metric to accurately predict the subjective quality of point cloud with superimposed geometry and color impairments.
no code implementations • 18 Mar 2020 • Qi Xia, Haojie Liu, Zhan Ma
The Object-Based Image Coding (OBIC) that was extensively studied about two decades ago, promised a vast application perspective for both ultra-low bitrate communication and high-level semantical content understanding, but it had rarely been used due to the inefficient compact representation of object with arbitrary shape.
2 code implementations • 10 Feb 2020 • Yueyu Hu, Wenhan Yang, Zhan Ma, Jiaying Liu
In this paper, we first conduct a comprehensive literature survey of learned image compression methods.
no code implementations • 13 Dec 2019 • Haojie Liu, Han Shen, Lichao Huang, Ming Lu, Tong Chen, Zhan Ma
Traditional video compression technologies have been developed over decades in pursuit of higher coding efficiency.
1 code implementation • 11 Oct 2019 • Tong Chen, Haojie Liu, Zhan Ma, Qiu Shen, Xun Cao, Yao Wang
This paper proposes a novel Non-Local Attention optmization and Improved Context modeling-based image compression (NLAIC) algorithm, which is built on top of the deep nerual network (DNN)-based variational auto-encoder (VAE) structure.
no code implementations • 28 Sep 2019 • Ming Cheng, Zhan Ma, M. Salman Asif, Yiling Xu, Haojie Liu, Wenbo Bao, Jun Sun
This paper presents a dual camera system for high spatiotemporal resolution (HSTR) video acquisition, where one camera shoots a video with high spatial resolution and low frame rate (HSR-LFR) and another one captures a low spatial resolution and high frame rate (LSR-HFR) video.
2 code implementations • 26 Sep 2019 • Jianqiang Wang, Hao Zhu, Zhan Ma, Tong Chen, Haojie Liu, Qiu Shen
This paper presents a novel end-to-end Learned Point Cloud Geometry Compression (a. k. a., Learned-PCGC) framework, to efficiently compress the point cloud geometry (PCG) using deep neural networks (DNN) based variational autoencoders (VAE).
no code implementations • 3 May 2019 • Ming Lu, Ming Cheng, Yiling Xu, ShiLiang Pu, Qiu Shen, Zhan Ma
Networked video applications, e. g., video conferencing, often suffer from poor visual quality due to unexpected network fluctuation and limited bandwidth.
no code implementations • 22 Apr 2019 • Haojie Liu, Tong Chen, Peiyao Guo, Qiu Shen, Xun Cao, Yao Wang, Zhan Ma
This paper proposes a novel Non-Local Attention Optimized Deep Image Compression (NLAIC) framework, which is built on top of the popular variational auto-encoder (VAE) structure.
no code implementations • 8 Apr 2019 • Chao Huang, Haojie Liu, Tong Chen, Qiu Shen, Zhan Ma
We propose a MultiScale AutoEncoder(MSAE) based extreme image compression framework to offer visually pleasing reconstruction at a very low bitrate.
no code implementations • 27 Feb 2019 • Haojie Liu, Tong Chen, Peiyao Guo, Qiu Shen, Zhan Ma
Besides, a field study on perceptual quality is also given via a dedicated subjective assessment, to compare the efficiency of our proposed methods and other conventional image compression methods.
1 code implementation • 5 Jun 2018 • Haojie Liu, Tong Chen, Qiu Shen, Tao Yue, Zhan Ma
We present a lossy image compression method based on deep convolutional neural networks (CNNs), which outperforms the existing BPG, WebP, JPEG2000 and JPEG as measured via multi-scale structural similarity (MS-SSIM), at the same bit rate.