no code implementations • 4 Oct 2023 • Chengkang Shen, Hao Zhu, You Zhou, Yu Liu, Si Yi, Lili Dong, Weipeng Zhao, David J. Brady, Xun Cao, Zhan Ma, Yi Lin
Myocardial motion tracking stands as an essential clinical tool in the prevention and detection of Cardiovascular Diseases (CVDs), the foremost cause of death globally.
no code implementations • 22 Sep 2023 • Hao Zhu, Fengyi Liu, Qi Zhang, Xun Cao, Zhan Ma
This connection ensures a seamless backpropagation of gradients from the network's output back to the input coordinates, thereby enhancing regularization.
no code implementations • 20 Aug 2023 • Bowei Xu, Hao Chen, Zhan Ma
Unlike direct observation-to-action mapping, Karma recurrently maintains a multi-dimensional time series of observations, returns, and actions as input and employs causal sequence modeling via a decision transformer to determine the next action.
no code implementations • CVPR 2023 • Qi Zhao, M. Salman Asif, Zhan Ma
DNeRV achieves competitive results against the state-of-the-art neural compression approaches and outperforms existing implicit methods on downstream inpainting and interpolation for $960 \times 1920$ videos.
no code implementations • 3 Apr 2023 • Hao Zhu, Shaowen Xie, Zhen Liu, Fengyi Liu, Qi Zhang, You Zhou, Yi Lin, Zhan Ma, Xun Cao
However, the expressive power of INR is limited by the spectral bias in the network training.
no code implementations • 22 Mar 2023 • Jianqiang Wang, Dandan Ding, Zhan Ma
With this aim, we extensively exploit cross-scale, cross-group, and cross-color correlations of point cloud attribute to ensure accurate probability estimation and thus high coding efficiency.
2 code implementations • 16 Feb 2023 • Zhihao Duan, Ming Lu, Jack Ma, Yuning Huang, Zhan Ma, Fengqing Zhu
This paper addresses the problem of lossy image compression, a fundamental problem in image processing and information theory that is involved in many real-world applications.
no code implementations • 28 Jan 2023 • Jianqiang Wang, Dandan Ding, Hao Chen, Zhan Ma
This work extends the Multiscale Sparse Representation (MSR) framework developed for static Point Cloud Geometry Compression (PCGC) to support the dynamic PCGC through the use of multiscale inter conditional coding.
1 code implementation • 15 Dec 2022 • Zhihao LI, Ming Lu, Xu Zhang, Xin Feng, M. Salman Asif, Zhan Ma
Conventional cameras capture image irradiance on a sensor and convert it to RGB images using an image signal processor (ISP).
no code implementations • CVPR 2023 • Shaowen Xie, Hao Zhu, Zhen Liu, Qi Zhang, You Zhou, Xun Cao, Zhan Ma
Implicit neural representation (INR) characterizes the attributes of a signal as a function of corresponding coordinates which emerges as a sharp weapon for solving inverse problems.
no code implementations • 5 Nov 2022 • Junqi Shi, Ming Lu, Zhan Ma
Quantizing a floating-point neural network to its fixed-point representation is crucial for Learned Image Compression (LIC) because it improves decoding consistency for interoperability and reduces space-time complexity for implementation.
no code implementations • 17 Sep 2022 • Dandan Ding, Junzhe Zhang, Jianqiang Wang, Zhan Ma
A learning-based adaptive loop filter is developed for the Geometry-based Point Cloud Compression (G-PCC) standard to reduce attribute compression artifacts.
2 code implementations • 27 Aug 2022 • Zhihao Duan, Ming Lu, Zhan Ma, Fengqing Zhu
Recent research has shown a strong theoretical connection between variational autoencoders (VAEs) and the rate-distortion theory.
no code implementations • 26 Aug 2022 • Ruixiang Xue, Jianqiang Wang, Zhan Ma
Although convolutional representation of multiscale sparse tensor demonstrated its superior efficiency to accurately model the occupancy probability for the compression of geometry component of dense object point clouds, its capacity for representing sparse LiDAR point cloud geometry (PCG) was largely limited.
no code implementations • 4 Aug 2022 • Ming Cheng, Yiling Xu, Wang Shen, M. Salman Asif, Chao Ma, Jun Sun, Zhan Ma
We utilize a disparity network to transfer spatiotemporal information across views even in large disparity scenes, based on which, we propose disparity-guided flow-based warping for LSR-HFR view and complementary warping for HSR-LFR view.
no code implementations • 17 Jun 2022 • Zhan Ma, Zisheng Ye, Wenxiao Pan
Predicting the dynamic behaviors of particles in suspension subject to hydrodynamic interaction (HI) and external drive can be critical for many applications.
1 code implementation • 25 Apr 2022 • Ming Lu, Fangdong Chen, ShiLiang Pu, Zhan Ma
To this end, Integrated Convolution and Self-Attention (ICSA) unit is first proposed to form a content-adaptive transform to characterize and embed neighborhood information dynamically of any input.
1 code implementation • 19 Apr 2022 • Zhihao LI, Si Yi, Zhan Ma
Image signal processing (ISP) is crucial for camera imaging, and neural networks (NN) solutions are extensively deployed for daytime scenes.
no code implementations • 11 Apr 2022 • Zhihao LI, M. Salman Asif, Zhan Ma
The event camera is a bio-vision inspired camera with high dynamic range, high response speed, and low power consumption, recently attracting extensive attention for its use in vast vision tasks.
1 code implementation • 3 Apr 2022 • Jianqiang Wang, Zhan Ma
Recently, numerous learning-based compression methods have been developed with outstanding performance for the coding of the geometry information of point clouds.
no code implementations • 26 Feb 2022 • Zhihao Duan, Ming Lu, Zhan Ma, Fengqing Zhu
End-to-end learned lossy image coders (LICs), as opposed to hand-crafted image codecs, have shown increasing superiority in terms of the rate-distortion performance.
no code implementations • 16 Dec 2021 • Tong Chen, Zhan Ma
Deep neural network-based image compression has been extensively studied.
2 code implementations • 20 Nov 2021 • Jianqiang Wang, Dandan Ding, Zhu Li, Xiaoxing Feng, Chuntong Cao, Zhan Ma
We call this compression method SparsePCGC.
no code implementations • 12 Nov 2021 • Ming Lu, Peiyao Guo, Huiqing Shi, Chuntong Cao, Zhan Ma
A Transformer-based Image Compression (TIC) approach is developed which reuses the canonical variational autoencoder (VAE) architecture with paired main and hyper encoder-decoders.
no code implementations • 15 Sep 2021 • Yipeng Liu, Qi Yang, Yiling Xu, Zhan Ma
Point cloud compression (PCC) has made remarkable achievement in recent years.
no code implementations • 5 Aug 2021 • Haojie Liu, Ming Lu, Zhiqi Chen, Xun Cao, Zhan Ma, Yao Wang
We further design a one-to-many decoder pipeline to generate multiple predictions from the CSTR, including vector-based resampling, adaptive kernel-based resampling, compensation mode selection maps and texture enhancements, and combines them adaptively to achieve more accurate inter prediction.
no code implementations • 26 Apr 2021 • Jiaoyang Yin, Yiling Xu, Hao Chen, Yunfei Zhang, Steve Appleby, Zhan Ma
Adaptive Bit Rate (ABR) decision plays a crucial role for ensuring satisfactory Quality of Experience (QoE) in video streaming applications, in which past network statistics are mainly leveraged for future network bandwidth prediction.
1 code implementation • 21 Apr 2021 • Ren Yang, Radu Timofte, Jing Liu, Yi Xu, Xinjian Zhang, Minyi Zhao, Shuigeng Zhou, Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy, Xin Li, Fanglong Liu, He Zheng, Lielin Jiang, Qi Zhang, Dongliang He, Fu Li, Qingqing Dang, Yibin Huang, Matteo Maggioni, Zhongqian Fu, Shuai Xiao, Cheng Li, Thomas Tanay, Fenglong Song, Wentao Chao, Qiang Guo, Yan Liu, Jiang Li, Xiaochao Qu, Dewang Hou, Jiayu Yang, Lyn Jiang, Di You, Zhenyu Zhang, Chong Mou, Iaroslav Koshelev, Pavel Ostyakov, Andrey Somov, Jia Hao, Xueyi Zou, Shijie Zhao, Xiaopeng Sun, Yiting Liao, Yuanzhi Zhang, Qing Wang, Gen Zhan, Mengxi Guo, Junlin Li, Ming Lu, Zhan Ma, Pablo Navarrete Michelini, Hai Wang, Yiyun Chen, Jingyu Guo, Liliang Zhang, Wenming Yang, Sijung Kim, Syehoon Oh, Yucong Wang, Minjie Cai, Wei Hao, Kangdi Shi, Liangyan Li, Jun Chen, Wei Gao, Wang Liu, XiaoYu Zhang, Linjie Zhou, Sixin Lin, Ru Wang
This paper reviews the first NTIRE challenge on quality enhancement of compressed video, with a focus on the proposed methods and results.
1 code implementation • 4 Mar 2021 • Qi Yang, Yujie Zhang, Siheng Chen, Yiling Xu, Jun Sun, Zhan Ma
In this paper, we propose a new distortion quantification method for point clouds, the multiscale potential energy discrepancy (MPED).
no code implementations • 1 Dec 2020 • Ming Lu, Tong Chen, zhenyu Dai, Dong Wang, Dandan Ding, Zhan Ma
This paper proposes a decoder-side Cross Resolution Synthesis (CRS) module to pursue better compression efficiency beyond the latest Versatile Video Coding (VVC), where we encode intra frames at original high resolution (HR), compress inter frames at a lower resolution (LR), and then super-resolve decoded LR inter frames with the help from preceding HR intra and neighboring LR inter frames.
3 code implementations • 7 Nov 2020 • Jianqiang Wang, Dandan Ding, Zhu Li, Zhan Ma
Recent years have witnessed the growth of point cloud based applications because of its realistic and fine-grained representation of 3D objects and scenes.
no code implementations • 9 Jul 2020 • Haojie Liu, Ming Lu, Zhan Ma, Fan Wang, Zhihuang Xie, Xun Cao, Yao Wang
Over the past two decades, traditional block-based video coding has made remarkable progress and spawned a series of well-known standards such as MPEG-4, H. 264/AVC and H. 265/HEVC.
1 code implementation • 31 May 2020 • Qi Yang, Zhan Ma, Yiling Xu, Zhu Li, Jun Sun
We propose the GraphSIM -- an objective metric to accurately predict the subjective quality of point cloud with superimposed geometry and color impairments.
no code implementations • 18 Mar 2020 • Qi Xia, Haojie Liu, Zhan Ma
The Object-Based Image Coding (OBIC) that was extensively studied about two decades ago, promised a vast application perspective for both ultra-low bitrate communication and high-level semantical content understanding, but it had rarely been used due to the inefficient compact representation of object with arbitrary shape.
2 code implementations • 10 Feb 2020 • Yueyu Hu, Wenhan Yang, Zhan Ma, Jiaying Liu
In this paper, we first conduct a comprehensive literature survey of learned image compression methods.
no code implementations • 13 Dec 2019 • Haojie Liu, Han Shen, Lichao Huang, Ming Lu, Tong Chen, Zhan Ma
Traditional video compression technologies have been developed over decades in pursuit of higher coding efficiency.
1 code implementation • 11 Oct 2019 • Tong Chen, Haojie Liu, Zhan Ma, Qiu Shen, Xun Cao, Yao Wang
This paper proposes a novel Non-Local Attention optmization and Improved Context modeling-based image compression (NLAIC) algorithm, which is built on top of the deep nerual network (DNN)-based variational auto-encoder (VAE) structure.
no code implementations • 28 Sep 2019 • Ming Cheng, Zhan Ma, M. Salman Asif, Yiling Xu, Haojie Liu, Wenbo Bao, Jun Sun
This paper presents a dual camera system for high spatiotemporal resolution (HSTR) video acquisition, where one camera shoots a video with high spatial resolution and low frame rate (HSR-LFR) and another one captures a low spatial resolution and high frame rate (LSR-HFR) video.
2 code implementations • 26 Sep 2019 • Jianqiang Wang, Hao Zhu, Zhan Ma, Tong Chen, Haojie Liu, Qiu Shen
This paper presents a novel end-to-end Learned Point Cloud Geometry Compression (a. k. a., Learned-PCGC) framework, to efficiently compress the point cloud geometry (PCG) using deep neural networks (DNN) based variational autoencoders (VAE).
no code implementations • 3 May 2019 • Ming Lu, Ming Cheng, Yiling Xu, ShiLiang Pu, Qiu Shen, Zhan Ma
Networked video applications, e. g., video conferencing, often suffer from poor visual quality due to unexpected network fluctuation and limited bandwidth.
no code implementations • 22 Apr 2019 • Haojie Liu, Tong Chen, Peiyao Guo, Qiu Shen, Xun Cao, Yao Wang, Zhan Ma
This paper proposes a novel Non-Local Attention Optimized Deep Image Compression (NLAIC) framework, which is built on top of the popular variational auto-encoder (VAE) structure.
no code implementations • 8 Apr 2019 • Chao Huang, Haojie Liu, Tong Chen, Qiu Shen, Zhan Ma
We propose a MultiScale AutoEncoder(MSAE) based extreme image compression framework to offer visually pleasing reconstruction at a very low bitrate.
no code implementations • 27 Feb 2019 • Haojie Liu, Tong Chen, Peiyao Guo, Qiu Shen, Zhan Ma
Besides, a field study on perceptual quality is also given via a dedicated subjective assessment, to compare the efficiency of our proposed methods and other conventional image compression methods.
1 code implementation • 5 Jun 2018 • Haojie Liu, Tong Chen, Qiu Shen, Tao Yue, Zhan Ma
We present a lossy image compression method based on deep convolutional neural networks (CNNs), which outperforms the existing BPG, WebP, JPEG2000 and JPEG as measured via multi-scale structural similarity (MS-SSIM), at the same bit rate.