Search Results for author: Zhan Ma

Found 57 papers, 21 papers with code

High-Efficiency Neural Video Compression via Hierarchical Predictive Learning

1 code implementation3 Oct 2024 Ming Lu, Zhihao Duan, Wuyang Cong, Dandan Ding, Fengqing Zhu, Zhan Ma

This feature-space processing operates from the lowest to the highest scale of each frame, completely eliminating the need for the complexity-intensive motion estimation and compensation techniques that have been standard in video codecs for decades.

Motion Estimation Video Compression

All-in-One Image Coding for Joint Human-Machine Vision with Multi-Path Aggregation

1 code implementation29 Sep 2024 Xu Zhang, Peiyao Guo, Ming Lu, Zhan Ma

Experimental results show that MPA achieves performance comparable to state-of-the-art methods in both task-specific and multi-objective optimization across human viewing and machine analysis tasks.

Data Compression Feature Importance +3

Accelerating block-level rate control for learned image compression

no code implementations2 Sep 2024 Muchen Dong, Ming Lu, Zhan Ma

Despite the unprecedented compression efficiency achieved by deep learned image compression (LIC), existing methods usually approximate the desired bitrate by adjusting a single quality factor for a given input image, which may compromise the rate control results.

Image Compression

EmoTalk3D: High-Fidelity Free-View Synthesis of Emotional 3D Talking Head

no code implementations1 Aug 2024 Qianyun He, Xinya Ji, Yicheng Gong, Yuanxun Lu, Zhengyu Diao, Linjia Huang, Yao Yao, Siyu Zhu, Zhan Ma, Songcen Xu, Xiaofei Wu, Zixiao Zhang, Xun Cao, Hao Zhu

We present a novel approach for synthesizing 3D talking heads with controllable emotion, featuring enhanced lip synchronization and rendering quality.

3D geometry Motion Generation

HINER: Neural Representation for Hyperspectral Image

1 code implementation31 Jul 2024 Junqi Shi, Mingyi Jiang, Ming Lu, Tong Chen, Xun Cao, Zhan Ma

For downstream classification on compressed HSI, we theoretically demonstrate the task accuracy is not only related to the classification loss but also to the reconstruction fidelity through a first-order expansion of the accuracy degradation, and accordingly adapt the reconstruction by introducing Adaptive Spectral Weighting.

Classification Data Augmentation +1

FINER++: Building a Family of Variable-periodic Functions for Activating Implicit Neural Representation

no code implementations28 Jul 2024 Hao Zhu, Zhen Liu, Qi Zhang, Jingde Fu, Weibing Deng, Zhan Ma, Yanwen Guo, Xun Cao

By initializing the bias of the neural network with different ranges, sub-functions with various frequencies in the variable-periodic function are selected for activation.

Neural Poisson Solver: A Universal and Continuous Framework for Natural Signal Blending

no code implementations11 Jul 2024 Delong Wu, Hao Zhu, Qi Zhang, You Li, Zhan Ma, Xun Cao

To tackle this issue, we introduce the Neural Poisson Solver, a plug-and-play and universally applicable framework across different signal dimensions for blending visual signals represented by INRs.

PNeRV: Enhancing Spatial Consistency via Pyramidal Neural Representation for Videos

no code implementations CVPR 2024 Qi Zhao, M. Salman Asif, Zhan Ma

To address this issue, we introduce the Pyramidal Neural Representation for Videos (PNeRV), which is built on a multi-scale information connection and comprises a lightweight rescaling operator, Kronecker Fully-connected layer (KFc), and a Benign Selective Memory (BSM) mechanism.

SSIM Tensor Decomposition

Efficient and Generic Point Model for Lossless Point Cloud Attribute Compression

1 code implementation10 Apr 2024 Kang You, Pan Gao, Zhan Ma

In this paper, we propose PoLoPCAC, an efficient and generic lossless PCAC method that achieves high compression efficiency and strong generalizability simultaneously.

2k Attribute

Towards Backward-Compatible Continual Learning of Image Compression

1 code implementation CVPR 2024 Zhihao Duan, Ming Lu, Justin Yang, Jiangpeng He, Zhan Ma, Fengqing Zhu

This paper explores the possibility of extending the capability of pre-trained neural image compressors (e. g., adapting to new data or target bitrates) without breaking backward compatibility, the ability to decode bitstreams encoded by the original model.

Continual Learning Image Compression +1

Another Way to the Top: Exploit Contextual Clustering in Learned Image Coding

no code implementations21 Jan 2024 Yichi Zhang, Zhihao Duan, Ming Lu, Dandan Ding, Fengqing Zhu, Zhan Ma

While convolution and self-attention are extensively used in learned image compression (LIC) for transform coding, this paper proposes an alternative called Contextual Clustering based LIC (CLIC) which primarily relies on clustering operations and local attention for correlation characterization and compact representation of an image.

Clustering Image Compression +3

Deep Hierarchical Video Compression

no code implementations12 Dec 2023 Ming Lu, Zhihao Duan, Fengqing Zhu, Zhan Ma

Recently, probabilistic predictive coding that directly models the conditional distribution of latent features across successive frames for temporal redundancy removal has yielded promising results.

Video Compression

FINER: Flexible spectral-bias tuning in Implicit NEural Representation by Variable-periodic Activation Functions

no code implementations CVPR 2024 Zhen Liu, Hao Zhu, Qi Zhang, Jingde Fu, Weibing Deng, Zhan Ma, Yanwen Guo, Xun Cao

Implicit Neural Representation (INR), which utilizes a neural network to map coordinate inputs to corresponding attributes, is causing a revolution in the field of signal processing.

Continuous 3D Myocardial Motion Tracking via Echocardiography

no code implementations4 Oct 2023 Chengkang Shen, Hao Zhu, You Zhou, Yu Liu, Si Yi, Lili Dong, Weipeng Zhao, David J. Brady, Xun Cao, Zhan Ma, Yi Lin

Myocardial motion tracking stands as an essential clinical tool in the prevention and detection of cardiovascular diseases (CVDs), the foremost cause of death globally.

Motion Estimation

RHINO: Regularizing the Hash-based Implicit Neural Representation

no code implementations22 Sep 2023 Hao Zhu, Fengyi Liu, Qi Zhang, Xun Cao, Zhan Ma

This connection ensures a seamless backpropagation of gradients from the network's output back to the input coordinates, thereby enhancing regularization.

Karma: Adaptive Video Streaming via Causal Sequence Modeling

no code implementations20 Aug 2023 Bowei Xu, Hao Chen, Zhan Ma

Unlike direct observation-to-action mapping, Karma recurrently maintains a multi-dimensional time series of observations, returns, and actions as input and employs causal sequence modeling via a decision transformer to determine the next action.

DNeRV: Modeling Inherent Dynamics via Difference Neural Representation for Videos

no code implementations CVPR 2023 Qi Zhao, M. Salman Asif, Zhan Ma

DNeRV achieves competitive results against the state-of-the-art neural compression approaches and outperforms existing implicit methods on downstream inpainting and interpolation for $960 \times 1920$ videos.

Video Compression

Disorder-invariant Implicit Neural Representation

no code implementations3 Apr 2023 Hao Zhu, Shaowen Xie, Zhen Liu, Fengyi Liu, Qi Zhang, You Zhou, Yi Lin, Zhan Ma, Xun Cao

However, the expressive power of INR is limited by the spectral bias in the network training.

Attribute Retrieval

Lossless Point Cloud Attribute Compression Using Cross-scale, Cross-group, and Cross-color Prediction

no code implementations22 Mar 2023 Jianqiang Wang, Dandan Ding, Zhan Ma

With this aim, we extensively exploit cross-scale, cross-group, and cross-color correlations of point cloud attribute to ensure accurate probability estimation and thus high coding efficiency.

Attribute

QARV: Quantization-Aware ResNet VAE for Lossy Image Compression

2 code implementations16 Feb 2023 Zhihao Duan, Ming Lu, Jack Ma, Yuning Huang, Zhan Ma, Fengqing Zhu

This paper addresses the problem of lossy image compression, a fundamental problem in image processing and information theory that is involved in many real-world applications.

Image Compression Quantization

Dynamic Point Cloud Geometry Compression Using Multiscale Inter Conditional Coding

no code implementations28 Jan 2023 Jianqiang Wang, Dandan Ding, Hao Chen, Zhan Ma

This work extends the Multiscale Sparse Representation (MSR) framework developed for static Point Cloud Geometry Compression (PCGC) to support the dynamic PCGC through the use of multiscale inter conditional coding.

Efficient Visual Computing with Camera RAW Snapshots

1 code implementation15 Dec 2022 Zhihao LI, Ming Lu, Xu Zhang, Xin Feng, M. Salman Asif, Zhan Ma

Conventional cameras capture image irradiance on a sensor and convert it to RGB images using an image signal processor (ISP).

Autonomous Driving Image Compression +2

DINER: Disorder-Invariant Implicit Neural Representation

no code implementations CVPR 2023 Shaowen Xie, Hao Zhu, Zhen Liu, Qi Zhang, You Zhou, Xun Cao, Zhan Ma

Implicit neural representation (INR) characterizes the attributes of a signal as a function of corresponding coordinates which emerges as a sharp weapon for solving inverse problems.

Retrieval

Rate-Distortion Optimized Post-Training Quantization for Learned Image Compression

no code implementations5 Nov 2022 Junqi Shi, Ming Lu, Zhan Ma

Quantizing a floating-point neural network to its fixed-point representation is crucial for Learned Image Compression (LIC) because it improves decoding consistency for interoperability and reduces space-time complexity for implementation.

Image Classification Image Compression +2

CARNet:Compression Artifact Reduction for Point Cloud Attribute

no code implementations17 Sep 2022 Dandan Ding, Junzhe Zhang, Jianqiang Wang, Zhan Ma

A learning-based adaptive loop filter is developed for the Geometry-based Point Cloud Compression (G-PCC) standard to reduce attribute compression artifacts.

Attribute

Lossy Image Compression with Quantized Hierarchical VAEs

2 code implementations27 Aug 2022 Zhihao Duan, Ming Lu, Zhan Ma, Fengqing Zhu

Recent research has shown a strong theoretical connection between variational autoencoders (VAEs) and the rate-distortion theory.

Image Compression Quantization

Efficient LiDAR Point Cloud Geometry Compression Through Neighborhood Point Attention

no code implementations26 Aug 2022 Ruixiang Xue, Jianqiang Wang, Zhan Ma

Although convolutional representation of multiscale sparse tensor demonstrated its superior efficiency to accurately model the occupancy probability for the compression of geometry component of dense object point clouds, its capacity for representing sparse LiDAR point cloud geometry (PCG) was largely limited.

H2-Stereo: High-Speed, High-Resolution Stereoscopic Video System

no code implementations4 Aug 2022 Ming Cheng, Yiling Xu, Wang Shen, M. Salman Asif, Chao Ma, Jun Sun, Zhan Ma

We utilize a disparity network to transfer spatiotemporal information across views even in large disparity scenes, based on which, we propose disparity-guided flow-based warping for LSR-HFR view and complementary warping for HSR-LFR view.

Super-Resolution Vocal Bursts Intensity Prediction

Fast Simulation of Particulate Suspensions Enabled by Graph Neural Network

no code implementations17 Jun 2022 Zhan Ma, Zisheng Ye, Wenxiao Pan

Predicting the dynamic behaviors of particles in suspension subject to hydrodynamic interaction (HI) and external drive can be critical for many applications.

Computational Efficiency Graph Neural Network

High-Efficiency Lossy Image Coding Through Adaptive Neighborhood Information Aggregation

1 code implementation25 Apr 2022 Ming Lu, Fangdong Chen, ShiLiang Pu, Zhan Ma

To this end, Integrated Convolution and Self-Attention (ICSA) unit is first proposed to form a content-adaptive transform to characterize and embed neighborhood information dynamically of any input.

Vocal Bursts Intensity Prediction

Rendering Nighttime Image Via Cascaded Color and Brightness Compensation

1 code implementation19 Apr 2022 Zhihao LI, Si Yi, Zhan Ma

Image signal processing (ISP) is crucial for camera imaging, and neural networks (NN) solutions are extensively deployed for daytime scenes.

Tone Mapping

Event Transformer

1 code implementation11 Apr 2022 Bin Jiang, Zhihao LI, M. Salman Asif, Xun Cao, Zhan Ma

The event camera's low power consumption and ability to capture microsecond brightness changes make it attractive for various computer vision tasks.

Event-based vision Optical Flow Estimation

Sparse Tensor-based Point Cloud Attribute Compression

1 code implementation3 Apr 2022 Jianqiang Wang, Zhan Ma

Recently, numerous learning-based compression methods have been developed with outstanding performance for the coding of the geometry information of point clouds.

Attribute

Opening the Black Box of Learned Image Coders

no code implementations26 Feb 2022 Zhihao Duan, Ming Lu, Zhan Ma, Fengqing Zhu

End-to-end learned lossy image coders (LICs), as opposed to hand-crafted image codecs, have shown increasing superiority in terms of the rate-distortion performance.

Transformer-based Image Compression

no code implementations12 Nov 2021 Ming Lu, Peiyao Guo, Huiqing Shi, Chuntong Cao, Zhan Ma

A Transformer-based Image Compression (TIC) approach is developed which reuses the canonical variational autoencoder (VAE) architecture with paired main and hyper encoder-decoders.

Image Compression Image Reconstruction

End-to-end Neural Video Coding Using a Compound Spatiotemporal Representation

no code implementations5 Aug 2021 Haojie Liu, Ming Lu, Zhiqi Chen, Xun Cao, Zhan Ma, Yao Wang

We further design a one-to-many decoder pipeline to generate multiple predictions from the CSTR, including vector-based resampling, adaptive kernel-based resampling, compensation mode selection maps and texture enhancements, and combines them adaptively to achieve more accurate inter prediction.

Motion Compensation MS-SSIM +3

ANT: Learning Accurate Network Throughput for Better Adaptive Video Streaming

no code implementations26 Apr 2021 Jiaoyang Yin, Yiling Xu, Hao Chen, Yunfei Zhang, Steve Appleby, Zhan Ma

Adaptive Bit Rate (ABR) decision plays a crucial role for ensuring satisfactory Quality of Experience (QoE) in video streaming applications, in which past network statistics are mainly leveraged for future network bandwidth prediction.

Reinforcement Learning (RL)

MPED: Quantifying Point Cloud Distortion based on Multiscale Potential Energy Discrepancy

1 code implementation4 Mar 2021 Qi Yang, Yujie Zhang, Siheng Chen, Yiling Xu, Jun Sun, Zhan Ma

In this paper, we propose a new distortion quantification method for point clouds, the multiscale potential energy discrepancy (MPED).

Point cloud reconstruction

Decomposition, Compression, and Synthesis (DCS)-based Video Coding: A Neural Exploration via Resolution-Adaptive Learning

no code implementations1 Dec 2020 Ming Lu, Tong Chen, Dandan Ding, Fengqing Zhu, Zhan Ma

Inspired by the facts that retinal cells actually segregate the visual scene into different attributes (e. g., spatial details, temporal motion) for respective neuronal processing, we propose to first decompose the input video into respective spatial texture frames (STF) at its native spatial resolution that preserve the rich spatial details, and the other temporal motion frames (TMF) at a lower spatial resolution that retain the motion smoothness; then compress them together using any popular video coder; and finally synthesize decoded STFs and TMFs for high-fidelity video reconstruction at the same resolution as its native input.

Motion Compensation Super-Resolution +2

Multiscale Point Cloud Geometry Compression

3 code implementations7 Nov 2020 Jianqiang Wang, Dandan Ding, Zhu Li, Zhan Ma

Recent years have witnessed the growth of point cloud based applications because of its realistic and fine-grained representation of 3D objects and scenes.

Attribute

Neural Video Coding using Multiscale Motion Compensation and Spatiotemporal Context Model

no code implementations9 Jul 2020 Haojie Liu, Ming Lu, Zhan Ma, Fan Wang, Zhihuang Xie, Xun Cao, Yao Wang

Over the past two decades, traditional block-based video coding has made remarkable progress and spawned a series of well-known standards such as MPEG-4, H. 264/AVC and H. 265/HEVC.

Motion Compensation MS-SSIM +2

Inferring Point Cloud Quality via Graph Similarity

1 code implementation31 May 2020 Qi Yang, Zhan Ma, Yiling Xu, Zhu Li, Jun Sun

We propose the GraphSIM -- an objective metric to accurately predict the subjective quality of point cloud with superimposed geometry and color impairments.

Graph Similarity

Object-Based Image Coding: A Learning-Driven Revisit

no code implementations18 Mar 2020 Qi Xia, Haojie Liu, Zhan Ma

The Object-Based Image Coding (OBIC) that was extensively studied about two decades ago, promised a vast application perspective for both ultra-low bitrate communication and high-level semantical content understanding, but it had rarely been used due to the inefficient compact representation of object with arbitrary shape.

Image Compression Object +1

Learning End-to-End Lossy Image Compression: A Benchmark

2 code implementations10 Feb 2020 Yueyu Hu, Wenhan Yang, Zhan Ma, Jiaying Liu

In this paper, we first conduct a comprehensive literature survey of learned image compression methods.

Image Compression

Learned Video Compression via Joint Spatial-Temporal Correlation Exploration

no code implementations13 Dec 2019 Haojie Liu, Han Shen, Lichao Huang, Ming Lu, Tong Chen, Zhan Ma

Traditional video compression technologies have been developed over decades in pursuit of higher coding efficiency.

Optical Flow Estimation Video Compression

Neural Image Compression via Non-Local Attention Optimization and Improved Context Modeling

1 code implementation11 Oct 2019 Tong Chen, Haojie Liu, Zhan Ma, Qiu Shen, Xun Cao, Yao Wang

This paper proposes a novel Non-Local Attention optmization and Improved Context modeling-based image compression (NLAIC) algorithm, which is built on top of the deep nerual network (DNN)-based variational auto-encoder (VAE) structure.

Image Compression MS-SSIM +1

A Dual Camera System for High Spatiotemporal Resolution Video Acquisition

no code implementations28 Sep 2019 Ming Cheng, Zhan Ma, M. Salman Asif, Yiling Xu, Haojie Liu, Wenbo Bao, Jun Sun

This paper presents a dual camera system for high spatiotemporal resolution (HSTR) video acquisition, where one camera shoots a video with high spatial resolution and low frame rate (HSR-LFR) and another one captures a low spatial resolution and high frame rate (LSR-HFR) video.

Vocal Bursts Intensity Prediction

Learned Point Cloud Geometry Compression

2 code implementations26 Sep 2019 Jianqiang Wang, Hao Zhu, Zhan Ma, Tong Chen, Haojie Liu, Qiu Shen

This paper presents a novel end-to-end Learned Point Cloud Geometry Compression (a. k. a., Learned-PCGC) framework, to efficiently compress the point cloud geometry (PCG) using deep neural networks (DNN) based variational autoencoders (VAE).

Surface Reconstruction

Learned Quality Enhancement via Multi-Frame Priors for HEVC Compliant Low-Delay Applications

no code implementations3 May 2019 Ming Lu, Ming Cheng, Yiling Xu, ShiLiang Pu, Qiu Shen, Zhan Ma

Networked video applications, e. g., video conferencing, often suffer from poor visual quality due to unexpected network fluctuation and limited bandwidth.

Decoder Video Compression

Non-local Attention Optimized Deep Image Compression

no code implementations22 Apr 2019 Haojie Liu, Tong Chen, Peiyao Guo, Qiu Shen, Xun Cao, Yao Wang, Zhan Ma

This paper proposes a novel Non-Local Attention Optimized Deep Image Compression (NLAIC) framework, which is built on top of the popular variational auto-encoder (VAE) structure.

Image Compression MS-SSIM +1

Extreme Image Coding via Multiscale Autoencoders With Generative Adversarial Optimization

no code implementations8 Apr 2019 Chao Huang, Haojie Liu, Tong Chen, Qiu Shen, Zhan Ma

We propose a MultiScale AutoEncoder(MSAE) based extreme image compression framework to offer visually pleasing reconstruction at a very low bitrate.

Generative Adversarial Network Image Compression

Gated Context Model with Embedded Priors for Deep Image Compression

no code implementations27 Feb 2019 Haojie Liu, Tong Chen, Peiyao Guo, Qiu Shen, Zhan Ma

Besides, a field study on perceptual quality is also given via a dedicated subjective assessment, to compare the efficiency of our proposed methods and other conventional image compression methods.

Decoder Image Compression +3

Deep Image Compression via End-to-End Learning

1 code implementation5 Jun 2018 Haojie Liu, Tong Chen, Qiu Shen, Tao Yue, Zhan Ma

We present a lossy image compression method based on deep convolutional neural networks (CNNs), which outperforms the existing BPG, WebP, JPEG2000 and JPEG as measured via multi-scale structural similarity (MS-SSIM), at the same bit rate.

Image Compression MS-SSIM +3

Cannot find the paper you are looking for? You can Submit a new open access paper.