no code implementations • 10 Oct 2024 • Zhiyi Pan, Wei Gao, Shan Liu, Ge Li
Despite alleviating the dependence on dense annotations inherent to fully supervised methods, weakly supervised point cloud semantic segmentation suffers from inadequate supervision signals.
no code implementations • 13 Aug 2024 • Zihao Qi, Chen Feng, Fan Zhang, Xiaozhong Xu, Shan Liu, David Bull
Based on this collected subjective data, we benchmarked the performance of 10 full-reference and 11 no-reference quality metrics.
1 code implementation • 4 Jul 2024 • Yujie Zhang, Qi Yang, Yiling Xu, Shan Liu
To bridge the gap, in this paper, we propose a perception-guided hybrid metric (PHM) that adaptively leverages two visual strategies with respect to distortion degree to predict point cloud quality: to measure visible difference in high-quality samples, PHM takes into account the masking effect and employs texture complexity as an effective compensatory factor for absolute difference; on the other hand, PHM leverages spectral graph theory to evaluate appearance degradation in low-quality samples.
no code implementations • 3 Jun 2024 • Han Gao, Xin Zhao, Tianqi Liu, Shan Liu
The input to the mapping is a group of reconstructed luma samples, and the output is an offset value applied on the center luma or co-located chroma sample.
no code implementations • 28 Apr 2024 • Weijie Bao, Yuantong Zhang, Jianghao Jia, Zhenzhong Chen, Shan Liu
During RFS, two reconstructed frames are sent into STENet's synthesis pipeline to synthesize a virtual reference frame, similar to the current to-be-coded frame.
1 code implementation • 24 Apr 2024 • Marcos V. Conde, Saman Zadtootaghaj, Nabajeet Barman, Radu Timofte, Chenlong He, Qi Zheng, Ruoxi Zhu, Zhengzhong Tu, Haiqiang Wang, Xiangguang Chen, Wenhui Meng, Xiang Pan, Huiying Shi, Han Zhu, Xiaozhong Xu, Lei Sun, Zhenzhong Chen, Shan Liu, ZiCheng Zhang, HaoNing Wu, Yingjie Zhou, Chunyi Li, Xiaohong Liu, Weisi Lin, Guangtao Zhai, Wei Sun, Yuqin Cao, Yanwei Jiang, Jun Jia, Zhichao Zhang, Zijian Chen, Weixia Zhang, Xiongkuo Min, Steve Göring, Zihao Qi, Chen Feng
The performance of the top-5 submissions is reviewed and provided here as a survey of diverse deep models for efficient video quality assessment of user-generated content.
1 code implementation • 17 Apr 2024 • Xin Li, Kun Yuan, Yajing Pei, Yiting Lu, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Wei Sun, HaoNing Wu, ZiCheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, Guangtao Zhai, Jianhui Sun, Tianyi Wang, Lei LI, Han Kong, Wenxuan Wang, Bing Li, Cheng Luo, Haiqiang Wang, Xiangguang Chen, Wenhui Meng, Xiang Pan, Huiying Shi, Han Zhu, Xiaozhong Xu, Lei Sun, Zhenzhong Chen, Shan Liu, Fangyuan Kong, Haotian Fan, Yifang Xu, Haoran Xu, Mengduo Yang, Jie zhou, Jiaze Li, Shijie Wen, Mai Xu, Da Li, Shunyu Yao, Jiazhi Du, WangMeng Zuo, Zhibo Li, Shuai He, Anlong Ming, Huiyuan Fu, Huadong Ma, Yong Wu, Fie Xue, Guozhi Zhao, Lina Du, Jie Guo, Yu Zhang, huimin zheng, JunHao Chen, Yue Liu, Dulan Zhou, Kele Xu, Qisheng Xu, Tao Sun, Zhixiang Ding, Yuhang Hu
This paper reviews the NTIRE 2024 Challenge on Shortform UGC Video Quality Assessment (S-UGC VQA), where various excellent solutions are submitted and evaluated on the collected dataset KVQ from popular short-form video platform, i. e., Kuaishou/Kwai Platform.
no code implementations • 15 Mar 2024 • Ziyu Shan, Yujie Zhang, Qi Yang, Haichen Yang, Yiling Xu, Shan Liu
Furthermore, in the model fine-tuning stage, the learned content-aware features serve as a guide to fuse the point cloud quality features extracted from different perspectives.
no code implementations • CVPR 2024 • Ziyu Shan, Yujie Zhang, Qi Yang, Haichen Yang, Yiling Xu, Jenq-Neng Hwang, Xiaozhong Xu, Shan Liu
Furthermore, in the model fine-tuning stage, we propose a semantic-guided multi-view fusion module to effectively integrate the features of projected images from multiple perspectives.
1 code implementation • 1 Mar 2024 • Jinyan Hou, Shan Liu, Ya zhang, Haotong Qin
To tackle these challenges, this paper introduces a novel graph construction method tailored to free-floating traffic mode.
no code implementations • 26 Feb 2024 • Wen-Yang Lu, Eduardo Pavez, Antonio Ortega, Xin Zhao, Shan Liu
Current video coding standards, including H. 264/AVC, HEVC, and VVC, employ discrete cosine transform (DCT), discrete sine transform (DST), and secondary to Karhunen-Loeve transforms (KLTs) decorrelate the intra-prediction residuals.
no code implementations • 6 Jan 2024 • Yang Sui, Zhuohang Li, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Zhenzhong Chen
Adversarial attacks can readily disrupt the image classification system, revealing the vulnerability of DNN-based recognition tasks.
1 code implementation • CVPR 2024 • Zhe Zhang, Huairui Wang, Zhenzhong Chen, Shan Liu
To tackle these issues we introduce Bit Plane Slicing (BPS) splitting images in the bit plane dimension with the considerations on different importance for latent variables.
no code implementations • 19 Dec 2023 • Zihao Qi, Chen Feng, Duolikun Danier, Fan Zhang, Xiaozhong Xu, Shan Liu, David Bull
In this work, we observe that existing full-/no-reference quality metrics fail to accurately predict the perceptual quality difference between transcoded UGC content and the corresponding unpristine references.
1 code implementation • 11 Dec 2023 • Zhiyi Pan, Nan Zhang, Wei Gao, Shan Liu, Ge Li
Utilizing uniformly distributed sparse annotations, weakly supervised learning alleviates the heavy reliance on fine-grained annotations in point cloud semantic segmentation tasks.
no code implementations • 29 Nov 2023 • Yang Sui, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Bo Yuan, Zhenzhong Chen
To tackle this issue, we conduct an in-depth analysis of the performance degradation observed in existing parallel context models, focusing on two aspects: the Quantity and Quality of information utilized for context prediction and decoding.
no code implementations • 27 Sep 2023 • Bingyang Cui, Qi Yang, Kaifa Yang, Yiling Xu, Xiaozhong Xu, Shan Liu
However, little research has been done on the quality assessment of textured meshes, which hinders the development of quality-oriented applications, such as mesh compression and enhancement.
no code implementations • 17 Aug 2023 • Huairui Wang, Nianxiang Fu, Zhenzhong Chen, Shan Liu
In this paper, we focus on extending spatial aggregation capability and propose a dynamic kernel-based transform coding.
no code implementations • 9 Aug 2023 • Qi Yang, Joel Jung, Xiaozhong Xu, Shan Liu
A two-step patch cropping algorithm and a patch texture mapping module refine the size of 1-hop geodesic patches and build the relationship between the mesh geometry and color information, resulting in the generation of 1-hop textured geodesic patches.
no code implementations • 3 Aug 2023 • Qi Yang, Joel Jung, Timon Deschamps, Xiaozhong Xu, Shan Liu
Dynamic colored meshes (DCM) are widely used in various applications; however, these meshes may undergo different processes, such as compression or transmission, which can distort them and degrade their quality.
no code implementations • 3 Aug 2023 • Qi Yang, Joel Jung, Haiqiang Wang, Xiaozhong Xu, Shan Liu
Static meshes with texture map are widely used in modern industrial and manufacturing sectors, attracting considerable attention in the mesh compression community due to its huge amount of data.
no code implementations • 20 Jul 2023 • Yafang Zheng, Lei Lin, Shuangtao Li, Yuxuan Yuan, Zhaohong Lai, Shan Liu, Biao Fu, Yidong Chen, Xiaodong Shi
Inspired by this, we propose LRF, a novel \textbf{L}ayer-wise \textbf{R}epresentation \textbf{F}usion framework for CG, which learns to fuse previous layers' information back into the encoding and decoding process effectively through introducing a \emph{fuse-attention module} at each encoder and decoder layer.
no code implementations • 4 Jul 2023 • Yipeng Liu, Qi Yang, Yujie Zhang, Yiling Xu, Le Yang, Xiaozhong Xu, Shan Liu
Second, to reduce the significant domain discrepancy, we establish an intermediate domain, the description domain, based on insights from subjective experiments, by considering the domain relevance among samples located in the perception domain and learning a structured latent space.
no code implementations • 1 Jun 2023 • Yang Sui, Zhuohang Li, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Zhenzhong Chen
Learned Image Compression (LIC) has recently become the trending technique for image transmission due to its notable performance.
no code implementations • 20 May 2023 • Lei Lin, Shuangtao Li, Yafang Zheng, Biao Fu, Shan Liu, Yidong Chen, Xiaodong Shi
There is mounting evidence that one of the reasons hindering CG is the representation of the encoder uppermost layer is entangled, i. e., the syntactic and semantic representations of sequences are entangled.
no code implementations • CVPR 2023 • Haozheng Yu, Lu He, Bing Jian, Weiwei Feng, Shan Liu
To reduce the negative impact of panoramic distortion, we incorporate a panel geometry embedding network that encodes both the local and global geometric features of a panel.
no code implementations • 20 Mar 2023 • Min Zhang, Jintang Xue, Pranav Kadam, Hardik Prajapati, Shan Liu, C. -C. Jay Kuo
On the other hand, the model size and inference complexity of DGCNN are 42X and 1203X of those of Green-PointHop, respectively.
no code implementations • 27 Feb 2023 • Pranav Kadam, Jiahao Gu, Shan Liu, C. -C. Jay Kuo
An efficient 3D scene flow estimation method called PointFlowHop is proposed in this work.
no code implementations • 22 Feb 2023 • Pranav Kadam, Hardik Prajapati, Min Zhang, Jintang Xue, Shan Liu, C. -C. Jay Kuo
Many point cloud classification methods are developed under the assumption that all point clouds in the dataset are well aligned with the canonical axes so that the 3D Cartesian point coordinates can be employed to learn features.
no code implementations • 13 Feb 2023 • Qingyang Zhou, Shan Liu, C. -C. Jay Kuo
A low-complexity point cloud compression method called the Green Point Cloud Geometry Codec (GPCGC), is proposed to encode the 3D spatial coordinates of static point clouds efficiently.
no code implementations • CVPR 2023 • Rui Song, Chunyang Fu, Shan Liu, Ge Li
Learning an accurate entropy model is a fundamental way to remove the redundancy in point cloud compression.
no code implementations • 30 Oct 2022 • Kai Zhang, Shan Liu, Momiao Xiong
We urgently need to shift the paradigm for data analysis from the classical Euclidean data analysis to both Euclidean and non Euclidean data analysis and develop more and more innovative methods for describing, estimating and inferring non Euclidean geometries of modern real datasets.
no code implementations • 29 Oct 2022 • Ziyu Shan, Qi Yang, Rui Ye, Yujie Zhang, Yiling Xu, Xiaozhong Xu, Shan Liu
To extract effective features for PCQA, we propose a new graph convolution kernel, i. e., GPAConv, which attentively captures the perturbation of structure and texture.
1 code implementation • 11 Oct 2022 • Xiangguang Chen, Ye Zhu, Yu Li, Bingtao Fu, Lei Sun, Ying Shan, Shan Liu
Unlike previous works, our framework is data efficient, which requires a small amount of matting ground-truth to learn to estimate high quality object mattes.
no code implementations • 30 Sep 2022 • Zhengyu Wang, Yujie Zhang, Qi Yang, Yiling Xu, Jun Sun, Shan Liu
Considering the importance of saliency detection in quality assessment, we propose an effective full-reference PCQA metric which makes the first attempt to utilize the saliency information to facilitate quality prediction, called point cloud quality assessment using 3D saliency maps (PQSM).
1 code implementation • 18 Jul 2022 • Chen Feng, Zihao Qi, Duolikun Danier, Fan Zhang, Xiaozhong Xu, Shan Liu, David Bull
In this work, we modify the MFRNet network architecture to enable multiple frame processing, and the new network, multi-frame MFRNet, has been integrated into the EBDA framework using two Versatile Video Coding (VVC) host codecs: VTM 16. 2 and the Fraunhofer Versatile Video Encoder (VVenC 1. 4. 0).
no code implementations • 18 Jul 2022 • Han Zhu, Zhenzhong Chen, Shan Liu
In addition, the KRNets are optimized in a meta-learning manner to ensure the knowledge transferring and the student learning are beneficial to improving the reconstructed quality of the student.
no code implementations • 8 Jul 2022 • Zhengang Li, Sheng Lin, Shan Liu, Songnan Li, Xue Lin, Wei Wang, Wei Jiang
Recently, high-quality video conferencing with fewer transmission bits has become a very hot and challenging problem.
no code implementations • CVPR 2022 • Zhihao Hu, Guo Lu, Jinyang Guo, Shan Liu, Wei Jiang, Dong Xu
The previous deep video compression approaches only use the single scale motion compensation strategy and rarely adopt the mode prediction technique from the traditional standards like H. 264/H. 265 for both motion and residual compression.
1 code implementation • CVPR 2022 • Yurui Ren, Xiaoqing Fan, Ge Li, Shan Liu, Thomas H. Li
Our model is trained to predict human images in arbitrary poses, which encourages it to extract disentangled and expressive neural textures representing the appearance of different semantic entities.
no code implementations • 16 Feb 2022 • Pranav Kadam, Qingyang Zhou, Shan Liu, C. -C. Jay Kuo
An unsupervised point cloud object retrieval and pose estimation method, called PCRP, is proposed in this work.
1 code implementation • 12 Feb 2022 • Chunyang Fu, Ge Li, Rui Song, Wei Gao, Shan Liu
In point cloud compression, sufficient contexts are significant for modeling the point cloud distribution.
no code implementations • CVPR 2022 • Zhenghao Chen, Guo Lu, Zhihao Hu, Shan Liu, Wei Jiang, Dong Xu
In this work, we propose the first end-to-end optimized framework for compressing automotive stereo videos (i. e., stereo videos from autonomous driving applications) from both left and right views.
no code implementations • 8 Dec 2021 • Pranav Kadam, Min Zhang, Jiahao Gu, Shan Liu, C. -C. Jay Kuo
GreenPCO is an unsupervised learning method that predicts object motion by matching features of consecutive point cloud scans.
no code implementations • 16 Nov 2021 • Wei Jiang, Wei Wang, Songnan Li, Shan Liu
This work addresses two major issues of end-to-end learned image compression (LIC) based on deep neural networks: variable-rate learning where separate networks are required to generate compressed images with varying qualities, and the train-test mismatch between differentiable approximate quantization and true hard quantization.
no code implementations • 24 Sep 2021 • Min Zhang, Pranav Kadam, Shan Liu, C. -C. Jay Kuo
It is named GSIP (Green Segmentation of Indoor Point clouds) and its performance is evaluated on a representative large-scale benchmark -- the Stanford 3D Indoor Segmentation (S3DIS) dataset.
1 code implementation • ICCV 2021 • Yurui Ren, Ge Li, Yuanqi Chen, Thomas H. Li, Shan Liu
The proposed model can generate photo-realistic portrait images with accurate movements according to intuitive modifications.
no code implementations • 4 Aug 2021 • Yurui Ren, Yubo Wu, Thomas H. Li, Shan Liu, Ge Li
Pose-guided person image synthesis aims to synthesize person images by transforming reference images into target poses.
no code implementations • 15 Jun 2021 • Sheng Lin, Wei Jiang, Wei Wang, Kaidi Xu, Yanzhi Wang, Shan Liu, Songnan Li
Compressing Deep Neural Network (DNN) models to alleviate the storage and computation requirements is essential for practical applications, especially for resource limited devices.
no code implementations • 26 May 2021 • Wen Gao, Shan Liu, Xiaozhong Xu, Manouchehr Rafie, Yuan Zhang, Igor Curcio
Specifically, we will first provide an overview of the MPEG VCM group including use cases, requirements, processing pipelines, plan for potential VCM standards, followed by the evaluation framework including machine-vision tasks, dataset, evaluation metrics, and anchor generation.
no code implementations • 16 May 2021 • Xiao Wang, Wei Jiang, Wei Wang, Shan Liu, Brian Kulis, Peter Chin
The key idea is to replace the image to be compressed with a substitutional one that outperforms the original one in a desired way.
no code implementations • 12 May 2021 • Xiaozhong Xu, Shan Liu, Zeqiang Li
Learning-based visual data compression and analysis have attracted great interest from both academia and industry recently.
1 code implementation • 15 Mar 2021 • Pranav Kadam, Min Zhang, Shan Liu, C. -C. Jay Kuo
Inspired by the recent PointHop classification method, an unsupervised 3D point cloud registration method, called R-PointHop, is proposed in this work.
no code implementations • 5 Mar 2021 • Yiming Li, Shan Liu, Yu Chen, Yushan Zheng, Sijia Chen, Bin Zhu, Jian Lou
As the successor of H. 265/HEVC, the new versatile video coding standard (H. 266/VVC) can provide up to 50% bitrate saving with the same subjective quality, at the cost of increased decoding complexity.
no code implementations • ICCV 2021 • Bing Li, Chia-Wen Lin, Cheng Zheng, Shan Liu, Junsong Yuan, Bernard Ghanem, C.-C. Jay Kuo
In the second stage, we derive another warping model to refine warping results in less important regions by eliminating serious distortions in shape, disparity and 3D structure.
Vocal Bursts Intensity Prediction Vocal Bursts Valence Prediction
no code implementations • ICCV 2021 • Munan Xu, Yuanqi Chen, Shan Liu, Thomas H. Li, Ge Li
Pose-guided virtual try-on task aims to modify the fashion item based on pose transfer task.
1 code implementation • 10 Dec 2020 • Yuanqi Chen, Ge Li, Cece Jin, Shan Liu, Thomas Li
This issue makes the generator lack the incentive from the discriminator to learn high-frequency content of data, resulting in a significant spectrum discrepancy between generated images and real images.
1 code implementation • 24 Nov 2020 • Qiao Tian, Yi Chen, Zewang Zhang, Heng Lu, LingHui Chen, Lei Xie, Shan Liu
On one hand, we propose to discriminate ground-truth waveform from synthetic one in frequency domain for offering more consistency guarantees instead of only in time domain.
no code implementations • 2 Sep 2020 • Pranav Kadam, Min Zhang, Shan Liu, C. -C. Jay Kuo
An unsupervised point cloud registration method, called salient points analysis (SPA), is proposed in this work.
no code implementations • 2 Sep 2020 • Min Zhang, Pranav Kadam, Shan Liu, C. -C. Jay Kuo
The UFF method exploits statistical correlations of points in a point cloud set to learn shape and point features in a one-pass feedforward manner through a cascaded encoder-decoder architecture.
1 code implementation • 27 Aug 2020 • Yurui Ren, Ge Li, Shan Liu, Thomas H. Li
We show that our framework can spatially transform the inputs in an efficient manner.
1 code implementation • 28 Jul 2020 • Yuanqi Chen, Xiaoming Yu, Shan Liu, Ge Li
Recent studies have shown remarkable success in unsupervised image-to-image translation.
no code implementations • 7 Jul 2020 • Yanghao Li, Bichuan Guo, Jiangtao Wen, Zhen Xia, Shan Liu, Yuxing Han
Denoisers trained with synthetic data often fail to cope with the diversity of unknown noises, giving way to methods that can adapt to existing noise without knowing its ground truth.
no code implementations • 12 May 2020 • Zewang Zhang, Qiao Tian, Heng Lu, Ling-Hui Chen, Shan Liu
This paper investigates how to leverage a DurIAN-based average model to enable a new speaker to have both accurate pronunciation and fluent cross-lingual speaking with very limited monolingual data.
2 code implementations • 9 Feb 2020 • Min Zhang, Yifan Wang, Pranav Kadam, Shan Liu, C. -C. Jay Kuo
The PointHop method was recently proposed by Zhang et al. for 3D point cloud classification with unsupervised feature extraction.
1 code implementation • 8 Nov 2019 • Wei-Hong Lin, Jia-Xing Zhong, Shan Liu, Thomas Li, Ge Li
Generic object detection algorithms have proven their excellent performance in recent years.
no code implementations • 30 Oct 2019 • Munan Xu, Junming Chen, Haiqiang Wang, Shan Liu, Ge Li, Zhiqiang Bai
However, video quality exhibits different characteristics from static image quality due to the existence of temporal masking effects.
1 code implementation • NeurIPS 2019 • Xiaoming Yu, Yuanqi Chen, Thomas Li, Shan Liu, Ge Li
Recent advances of image-to-image translation focus on learning the one-to-many mapping from two aspects: multi-modal translation and multi-domain translation.
1 code implementation • ICCV 2019 • Yurui Ren, Xiaoming Yu, Ruonan Zhang, Thomas H. Li, Shan Liu, Ge Li
Image inpainting techniques have shown significant improvements by using deep neural networks recently.
3 code implementations • 30 Jul 2019 • Min Zhang, Haoxuan You, Pranav Kadam, Shan Liu, C. -C. Jay Kuo
In the attribute building stage, we address the problem of unordered point cloud data using a space partitioning procedure and developing a robust descriptor that characterizes the relationship between a point and its one-hop neighbor in a PointHop unit.
no code implementations • 29 Jul 2019 • Wei Jia, Li Li, Zhu Li, Xiang Zhang, Shan Liu
The block-based coding structure in the hybrid video coding framework inevitably introduces compression artifacts such as blocking, ringing, etc.
no code implementations • 18 Apr 2019 • Wei Yan, Yiting shao, Shan Liu, Thomas H. Li, Zhu Li, Ge Li
Point cloud is a fundamental 3D representation which is widely used in real world applications such as autonomous driving.
1 code implementation • CVPR 2019 • Jia-Xing Zhong, Nannan Li, Weijie Kong, Shan Liu, Thomas H. Li, Ge Li
Remarkably, we obtain the frame-level AUC score of 82. 12% on UCF-Crime.
Anomaly Detection In Surveillance Videos Multiple Instance Learning +3
no code implementations • 6 Dec 2018 • Qiao Tian, Bing Yang, Jing Chen, Benlai Tang, Shan Liu
Firstly, due to the noisy input signal of the model, there is still a gap between the quality of generated and natural waveforms.
no code implementations • 6 Nov 2018 • Weijie Kong, Nannan Li, Shan Liu, Thomas Li, Ge Li
Despite tremendous progress achieved in temporal action detection, state-of-the-art methods still suffer from the sharp performance deterioration when localizing the starting and ending temporal action boundaries.
no code implementations • 26 Jun 2018 • Xiaoming Yu, Zhenqiang Ying, Thomas Li, Shan Liu, Ge Li
Recent advances in image-to-image translation have seen a rise in approaches generating diverse images through a single network.