Search Results for author: Zhan Ma

Found 50 papers, 17 papers with code

PNeRV: Enhancing Spatial Consistency via Pyramidal Neural Representation for Videos

no code implementations • 13 Apr 2024 • Qi Zhao, M. Salman Asif, Zhan Ma

To address this issue, we introduce the Pyramidal Neural Representation for Videos (PNeRV), which is built on a multi-scale information connection and comprises a lightweight rescaling operator, Kronecker Fully-connected layer (KFc), and a Benign Selective Memory (BSM) mechanism.

SSIM Tensor Decomposition

Paper
Add Code

Efficient and Generic Point Model for Lossless Point Cloud Attribute Compression

2 code implementations • 10 Apr 2024 • Kang You, Pan Gao, Zhan Ma

In this paper, we propose PoLoPCAC, an efficient and generic lossless PCAC method that achieves high compression efficiency and strong generalizability simultaneously.

2k Attribute

131

Paper
Code

Towards Backward-Compatible Continual Learning of Image Compression

1 code implementation • 29 Feb 2024 • Zhihao Duan, Ming Lu, Justin Yang, Jiangpeng He, Zhan Ma, Fengqing Zhu

This paper explores the possibility of extending the capability of pre-trained neural image compressors (e. g., adapting to new data or target bitrates) without breaking backward compatibility, the ability to decode bitstreams encoded by the original model.

Continual Learning Image Compression +1

Paper
Code

Another Way to the Top: Exploit Contextual Clustering in Learned Image Coding

no code implementations • 21 Jan 2024 • Yichi Zhang, Zhihao Duan, Ming Lu, Dandan Ding, Fengqing Zhu, Zhan Ma

While convolution and self-attention are extensively used in learned image compression (LIC) for transform coding, this paper proposes an alternative called Contextual Clustering based LIC (CLIC) which primarily relies on clustering operations and local attention for correlation characterization and compact representation of an image.

Clustering Image Compression +3

Paper
Add Code

Deep Hierarchical Video Compression

no code implementations • 12 Dec 2023 • Ming Lu, Zhihao Duan, Fengqing Zhu, Zhan Ma

Recently, probabilistic predictive coding that directly models the conditional distribution of latent features across successive frames for temporal redundancy removal has yielded promising results.

Video Compression

Paper
Add Code

FINER: Flexible spectral-bias tuning in Implicit NEural Representation by Variable-periodic Activation Functions

no code implementations • 5 Dec 2023 • Zhen Liu, Hao Zhu, Qi Zhang, Jingde Fu, Weibing Deng, Zhan Ma, Yanwen Guo, Xun Cao

Implicit Neural Representation (INR), which utilizes a neural network to map coordinate inputs to corresponding attributes, is causing a revolution in the field of signal processing.

Paper
Add Code

Tracking Anything in Heart All at Once

no code implementations • 4 Oct 2023 • Chengkang Shen, Hao Zhu, You Zhou, Yu Liu, Si Yi, Lili Dong, Weipeng Zhao, David J. Brady, Xun Cao, Zhan Ma, Yi Lin

Myocardial motion tracking stands as an essential clinical tool in the prevention and detection of Cardiovascular Diseases (CVDs), the foremost cause of death globally.

Motion Estimation

Paper
Add Code

RHINO: Regularizing the Hash-based Implicit Neural Representation

no code implementations • 22 Sep 2023 • Hao Zhu, Fengyi Liu, Qi Zhang, Xun Cao, Zhan Ma

This connection ensures a seamless backpropagation of gradients from the network's output back to the input coordinates, thereby enhancing regularization.

Paper
Add Code

Karma: Adaptive Video Streaming via Causal Sequence Modeling

no code implementations • 20 Aug 2023 • Bowei Xu, Hao Chen, Zhan Ma

Unlike direct observation-to-action mapping, Karma recurrently maintains a multi-dimensional time series of observations, returns, and actions as input and employs causal sequence modeling via a decision transformer to determine the next action.

Paper
Add Code

DNeRV: Modeling Inherent Dynamics via Difference Neural Representation for Videos

no code implementations • CVPR 2023 • Qi Zhao, M. Salman Asif, Zhan Ma

DNeRV achieves competitive results against the state-of-the-art neural compression approaches and outperforms existing implicit methods on downstream inpainting and interpolation for $960 \times 1920$ videos.

Video Compression

Paper
Add Code

Disorder-invariant Implicit Neural Representation

no code implementations • 3 Apr 2023 • Hao Zhu, Shaowen Xie, Zhen Liu, Fengyi Liu, Qi Zhang, You Zhou, Yi Lin, Zhan Ma, Xun Cao

However, the expressive power of INR is limited by the spectral bias in the network training.

Attribute Retrieval

Paper
Add Code

Lossless Point Cloud Attribute Compression Using Cross-scale, Cross-group, and Cross-color Prediction

no code implementations • 22 Mar 2023 • Jianqiang Wang, Dandan Ding, Zhan Ma

With this aim, we extensively exploit cross-scale, cross-group, and cross-color correlations of point cloud attribute to ensure accurate probability estimation and thus high coding efficiency.

Attribute

Paper
Add Code

QARV: Quantization-Aware ResNet VAE for Lossy Image Compression

2 code implementations • 16 Feb 2023 • Zhihao Duan, Ming Lu, Jack Ma, Yuning Huang, Zhan Ma, Fengqing Zhu

This paper addresses the problem of lossy image compression, a fundamental problem in image processing and information theory that is involved in many real-world applications.

Image Compression Quantization

Paper
Code

Dynamic Point Cloud Geometry Compression Using Multiscale Inter Conditional Coding

no code implementations • 28 Jan 2023 • Jianqiang Wang, Dandan Ding, Hao Chen, Zhan Ma

This work extends the Multiscale Sparse Representation (MSR) framework developed for static Point Cloud Geometry Compression (PCGC) to support the dynamic PCGC through the use of multiscale inter conditional coding.

Paper
Add Code

Efficient Visual Computing with Camera RAW Snapshots

1 code implementation • 15 Dec 2022 • Zhihao LI, Ming Lu, Xu Zhang, Xin Feng, M. Salman Asif, Zhan Ma

Conventional cameras capture image irradiance on a sensor and convert it to RGB images using an image signal processor (ISP).

Autonomous Driving Image Compression +2

Paper
Code

DINER: Disorder-Invariant Implicit Neural Representation

no code implementations • CVPR 2023 • Shaowen Xie, Hao Zhu, Zhen Liu, Qi Zhang, You Zhou, Xun Cao, Zhan Ma

Implicit neural representation (INR) characterizes the attributes of a signal as a function of corresponding coordinates which emerges as a sharp weapon for solving inverse problems.

Retrieval

Paper
Add Code

Rate-Distortion Optimized Post-Training Quantization for Learned Image Compression

no code implementations • 5 Nov 2022 • Junqi Shi, Ming Lu, Zhan Ma

Quantizing a floating-point neural network to its fixed-point representation is crucial for Learned Image Compression (LIC) because it improves decoding consistency for interoperability and reduces space-time complexity for implementation.

Image Classification Image Compression +2

Paper
Add Code

CARNet:Compression Artifact Reduction for Point Cloud Attribute

no code implementations • 17 Sep 2022 • Dandan Ding, Junzhe Zhang, Jianqiang Wang, Zhan Ma

A learning-based adaptive loop filter is developed for the Geometry-based Point Cloud Compression (G-PCC) standard to reduce attribute compression artifacts.

Attribute

Paper
Add Code

Lossy Image Compression with Quantized Hierarchical VAEs

2 code implementations • 27 Aug 2022 • Zhihao Duan, Ming Lu, Zhan Ma, Fengqing Zhu

Recent research has shown a strong theoretical connection between variational autoencoders (VAEs) and the rate-distortion theory.

Image Compression Quantization

Paper
Code

Efficient LiDAR Point Cloud Geometry Compression Through Neighborhood Point Attention

no code implementations • 26 Aug 2022 • Ruixiang Xue, Jianqiang Wang, Zhan Ma

Although convolutional representation of multiscale sparse tensor demonstrated its superior efficiency to accurately model the occupancy probability for the compression of geometry component of dense object point clouds, its capacity for representing sparse LiDAR point cloud geometry (PCG) was largely limited.

Paper
Add Code

H2-Stereo: High-Speed, High-Resolution Stereoscopic Video System

no code implementations • 4 Aug 2022 • Ming Cheng, Yiling Xu, Wang Shen, M. Salman Asif, Chao Ma, Jun Sun, Zhan Ma

We utilize a disparity network to transfer spatiotemporal information across views even in large disparity scenes, based on which, we propose disparity-guided flow-based warping for LSR-HFR view and complementary warping for HSR-LFR view.

Super-Resolution Vocal Bursts Intensity Prediction

Paper
Add Code

Fast Simulation of Particulate Suspensions Enabled by Graph Neural Network

no code implementations • 17 Jun 2022 • Zhan Ma, Zisheng Ye, Wenxiao Pan

Predicting the dynamic behaviors of particles in suspension subject to hydrodynamic interaction (HI) and external drive can be critical for many applications.

Computational Efficiency

Paper
Add Code

High-Efficiency Lossy Image Coding Through Adaptive Neighborhood Information Aggregation

1 code implementation • 25 Apr 2022 • Ming Lu, Fangdong Chen, ShiLiang Pu, Zhan Ma

To this end, Integrated Convolution and Self-Attention (ICSA) unit is first proposed to form a content-adaptive transform to characterize and embed neighborhood information dynamically of any input.

Vocal Bursts Intensity Prediction

Paper
Code

Rendering Nighttime Image Via Cascaded Color and Brightness Compensation

1 code implementation • 19 Apr 2022 • Zhihao LI, Si Yi, Zhan Ma

Image signal processing (ISP) is crucial for camera imaging, and neural networks (NN) solutions are extensively deployed for daytime scenes.

Tone Mapping

Paper
Code

Event Transformer

no code implementations • 11 Apr 2022 • Zhihao LI, M. Salman Asif, Zhan Ma

The event camera is a bio-vision inspired camera with high dynamic range, high response speed, and low power consumption, recently attracting extensive attention for its use in vast vision tasks.

Event-based vision

Paper
Add Code

Sparse Tensor-based Point Cloud Attribute Compression

1 code implementation • 3 Apr 2022 • Jianqiang Wang, Zhan Ma

Recently, numerous learning-based compression methods have been developed with outstanding performance for the coding of the geometry information of point clouds.

Attribute

2,290

Paper
Code

Opening the Black Box of Learned Image Coders

no code implementations • 26 Feb 2022 • Zhihao Duan, Ming Lu, Zhan Ma, Fengqing Zhu

End-to-end learned lossy image coders (LICs), as opposed to hand-crafted image codecs, have shown increasing superiority in terms of the rate-distortion performance.

Paper
Add Code

Towards Robust Neural Image Compression: Adversarial Attack and Model Finetuning

no code implementations • 16 Dec 2021 • Tong Chen, Zhan Ma

Deep neural network-based image compression has been extensively studied.

Adversarial Attack Image Compression

Paper
Add Code

Sparse Tensor-based Multiscale Representation for Point Cloud Geometry Compression

2 code implementations • 20 Nov 2021 • Jianqiang Wang, Dandan Ding, Zhu Li, Xiaoxing Feng, Chuntong Cao, Zhan Ma

We call this compression method SparsePCGC.

Attribute

Paper
Code

Transformer-based Image Compression

no code implementations • 12 Nov 2021 • Ming Lu, Peiyao Guo, Huiqing Shi, Chuntong Cao, Zhan Ma

A Transformer-based Image Compression (TIC) approach is developed which reuses the canonical variational autoencoder (VAE) architecture with paired main and hyper encoder-decoders.

Image Compression Image Reconstruction

Paper
Add Code

Which One is Better: Assessing Objective Metrics for Point Cloud Compression

no code implementations • 15 Sep 2021 • Yipeng Liu, Qi Yang, Yiling Xu, Zhan Ma

Point cloud compression (PCC) has made remarkable achievement in recent years.

Point Cloud Quality Assessment

Paper
Add Code

End-to-end Neural Video Coding Using a Compound Spatiotemporal Representation

no code implementations • 5 Aug 2021 • Haojie Liu, Ming Lu, Zhiqi Chen, Xun Cao, Zhan Ma, Yao Wang

We further design a one-to-many decoder pipeline to generate multiple predictions from the CSTR, including vector-based resampling, adaptive kernel-based resampling, compensation mode selection maps and texture enhancements, and combines them adaptively to achieve more accurate inter prediction.

Motion Compensation MS-SSIM +3

Paper
Add Code

ANT: Learning Accurate Network Throughput for Better Adaptive Video Streaming

no code implementations • 26 Apr 2021 • Jiaoyang Yin, Yiling Xu, Hao Chen, Yunfei Zhang, Steve Appleby, Zhan Ma

Adaptive Bit Rate (ABR) decision plays a crucial role for ensuring satisfactory Quality of Experience (QoE) in video streaming applications, in which past network statistics are mainly leveraged for future network bandwidth prediction.

Reinforcement Learning (RL)

Paper
Add Code

NTIRE 2021 Challenge on Quality Enhancement of Compressed Video: Methods and Results

1 code implementation • 21 Apr 2021 • Ren Yang, Radu Timofte, Jing Liu, Yi Xu, Xinjian Zhang, Minyi Zhao, Shuigeng Zhou, Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy, Xin Li, Fanglong Liu, He Zheng, Lielin Jiang, Qi Zhang, Dongliang He, Fu Li, Qingqing Dang, Yibin Huang, Matteo Maggioni, Zhongqian Fu, Shuai Xiao, Cheng Li, Thomas Tanay, Fenglong Song, Wentao Chao, Qiang Guo, Yan Liu, Jiang Li, Xiaochao Qu, Dewang Hou, Jiayu Yang, Lyn Jiang, Di You, Zhenyu Zhang, Chong Mou, Iaroslav Koshelev, Pavel Ostyakov, Andrey Somov, Jia Hao, Xueyi Zou, Shijie Zhao, Xiaopeng Sun, Yiting Liao, Yuanzhi Zhang, Qing Wang, Gen Zhan, Mengxi Guo, Junlin Li, Ming Lu, Zhan Ma, Pablo Navarrete Michelini, Hai Wang, Yiyun Chen, Jingyu Guo, Liliang Zhang, Wenming Yang, Sijung Kim, Syehoon Oh, Yucong Wang, Minjie Cai, Wei Hao, Kangdi Shi, Liangyan Li, Jun Chen, Wei Gao, Wang Liu, XiaoYu Zhang, Linjie Zhou, Sixin Lin, Ru Wang

This paper reviews the first NTIRE challenge on quality enhancement of compressed video, with a focus on the proposed methods and results.

Paper
Code

MPED: Quantifying Point Cloud Distortion based on Multiscale Potential Energy Discrepancy

1 code implementation • 4 Mar 2021 • Qi Yang, Yujie Zhang, Siheng Chen, Yiling Xu, Jun Sun, Zhan Ma

In this paper, we propose a new distortion quantification method for point clouds, the multiscale potential energy discrepancy (MPED).

Point cloud reconstruction

Paper
Code

Decomposition, Compression, and Synthesis (DCS)-based Video Coding: A Neural Exploration via Resolution-Adaptive Learning

no code implementations • 1 Dec 2020 • Ming Lu, Tong Chen, Dandan Ding, Fengqing Zhu, Zhan Ma

Inspired by the facts that retinal cells actually segregate the visual scene into different attributes (e. g., spatial details, temporal motion) for respective neuronal processing, we propose to first decompose the input video into respective spatial texture frames (STF) at its native spatial resolution that preserve the rich spatial details, and the other temporal motion frames (TMF) at a lower spatial resolution that retain the motion smoothness; then compress them together using any popular video coder; and finally synthesize decoded STFs and TMFs for high-fidelity video reconstruction at the same resolution as its native input.

Motion Compensation Super-Resolution +2

Paper
Add Code

Multiscale Point Cloud Geometry Compression

3 code implementations • 7 Nov 2020 • Jianqiang Wang, Dandan Ding, Zhu Li, Zhan Ma

Recent years have witnessed the growth of point cloud based applications because of its realistic and fine-grained representation of 3D objects and scenes.

Attribute

Paper
Code

Neural Video Coding using Multiscale Motion Compensation and Spatiotemporal Context Model

no code implementations • 9 Jul 2020 • Haojie Liu, Ming Lu, Zhan Ma, Fan Wang, Zhihuang Xie, Xun Cao, Yao Wang

Over the past two decades, traditional block-based video coding has made remarkable progress and spawned a series of well-known standards such as MPEG-4, H. 264/AVC and H. 265/HEVC.

Motion Compensation MS-SSIM +2

Paper
Add Code

Inferring Point Cloud Quality via Graph Similarity

1 code implementation • 31 May 2020 • Qi Yang, Zhan Ma, Yiling Xu, Zhu Li, Jun Sun

We propose the GraphSIM -- an objective metric to accurately predict the subjective quality of point cloud with superimposed geometry and color impairments.

Graph Similarity

Paper
Code

Object-Based Image Coding: A Learning-Driven Revisit

no code implementations • 18 Mar 2020 • Qi Xia, Haojie Liu, Zhan Ma

The Object-Based Image Coding (OBIC) that was extensively studied about two decades ago, promised a vast application perspective for both ultra-low bitrate communication and high-level semantical content understanding, but it had rarely been used due to the inefficient compact representation of object with arbitrary shape.

Image Compression Object +1

Paper
Add Code

Learning End-to-End Lossy Image Compression: A Benchmark

2 code implementations • 10 Feb 2020 • Yueyu Hu, Wenhan Yang, Zhan Ma, Jiaying Liu

In this paper, we first conduct a comprehensive literature survey of learned image compression methods.

Image Compression

Paper
Code

Learned Video Compression via Joint Spatial-Temporal Correlation Exploration

no code implementations • 13 Dec 2019 • Haojie Liu, Han Shen, Lichao Huang, Ming Lu, Tong Chen, Zhan Ma

Traditional video compression technologies have been developed over decades in pursuit of higher coding efficiency.

Optical Flow Estimation Video Compression

Paper
Add Code

Neural Image Compression via Non-Local Attention Optimization and Improved Context Modeling

1 code implementation • 11 Oct 2019 • Tong Chen, Haojie Liu, Zhan Ma, Qiu Shen, Xun Cao, Yao Wang

This paper proposes a novel Non-Local Attention optmization and Improved Context modeling-based image compression (NLAIC) algorithm, which is built on top of the deep nerual network (DNN)-based variational auto-encoder (VAE) structure.

Image Compression MS-SSIM +1

Paper
Code

A Dual Camera System for High Spatiotemporal Resolution Video Acquisition

no code implementations • 28 Sep 2019 • Ming Cheng, Zhan Ma, M. Salman Asif, Yiling Xu, Haojie Liu, Wenbo Bao, Jun Sun

This paper presents a dual camera system for high spatiotemporal resolution (HSTR) video acquisition, where one camera shoots a video with high spatial resolution and low frame rate (HSR-LFR) and another one captures a low spatial resolution and high frame rate (LSR-HFR) video.

Vocal Bursts Intensity Prediction

Paper
Add Code

Learned Point Cloud Geometry Compression

2 code implementations • 26 Sep 2019 • Jianqiang Wang, Hao Zhu, Zhan Ma, Tong Chen, Haojie Liu, Qiu Shen

This paper presents a novel end-to-end Learned Point Cloud Geometry Compression (a. k. a., Learned-PCGC) framework, to efficiently compress the point cloud geometry (PCG) using deep neural networks (DNN) based variational autoencoders (VAE).

Surface Reconstruction

Paper
Code

Learned Quality Enhancement via Multi-Frame Priors for HEVC Compliant Low-Delay Applications

no code implementations • 3 May 2019 • Ming Lu, Ming Cheng, Yiling Xu, ShiLiang Pu, Qiu Shen, Zhan Ma

Networked video applications, e. g., video conferencing, often suffer from poor visual quality due to unexpected network fluctuation and limited bandwidth.

Video Compression

Paper
Add Code

Non-local Attention Optimized Deep Image Compression

no code implementations • 22 Apr 2019 • Haojie Liu, Tong Chen, Peiyao Guo, Qiu Shen, Xun Cao, Yao Wang, Zhan Ma

This paper proposes a novel Non-Local Attention Optimized Deep Image Compression (NLAIC) framework, which is built on top of the popular variational auto-encoder (VAE) structure.

Image Compression MS-SSIM +1

Paper
Add Code

Extreme Image Coding via Multiscale Autoencoders With Generative Adversarial Optimization

no code implementations • 8 Apr 2019 • Chao Huang, Haojie Liu, Tong Chen, Qiu Shen, Zhan Ma

We propose a MultiScale AutoEncoder(MSAE) based extreme image compression framework to offer visually pleasing reconstruction at a very low bitrate.

Generative Adversarial Network Image Compression

Paper
Add Code

Gated Context Model with Embedded Priors for Deep Image Compression

no code implementations • 27 Feb 2019 • Haojie Liu, Tong Chen, Peiyao Guo, Qiu Shen, Zhan Ma

Besides, a field study on perceptual quality is also given via a dedicated subjective assessment, to compare the efficiency of our proposed methods and other conventional image compression methods.

Image Compression Image Reconstruction +2

Paper
Add Code

Deep Image Compression via End-to-End Learning

1 code implementation • 5 Jun 2018 • Haojie Liu, Tong Chen, Qiu Shen, Tao Yue, Zhan Ma

We present a lossy image compression method based on deep convolutional neural networks (CNNs), which outperforms the existing BPG, WebP, JPEG2000 and JPEG as measured via multi-scale structural similarity (MS-SSIM), at the same bit rate.

Image Compression MS-SSIM +3

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.