Neural Texture Extraction and Distribution for Controllable Person Image Synthesis

1 code implementation 13 Apr 2022 Yurui Ren, Xiaoqing Fan, Ge Li, Shan Liu, Thomas H. Li

Our model is trained to predict human images in arbitrary poses, which encourages it to extract disentangled and expressive neural textures representing the appearance of different semantic entities.

Image Generation

PCRP: Unsupervised Point Cloud Object Retrieval and Pose Estimation

no code implementations 16 Feb 2022 Pranav Kadam, Qingyang Zhou, Shan Liu, C.-C. Jay Kuo

An unsupervised point cloud object retrieval and pose estimation method, called PCRP, is proposed in this work.

Point Cloud Registration, Pose Estimation

OctAttention: Octree-Based Large-Scale Contexts Model for Point Cloud Compression

1 code implementation 12 Feb 2022 Chunyang Fu, Ge Li, Rui Song, Wei Gao, Shan Liu

In point cloud compression, sufficient contexts are significant for modeling the point cloud distribution.

GPCO: An Unsupervised Green Point Cloud Odometry Method

no code implementations 8 Dec 2021 Pranav Kadam, Min Zhang, Shan Liu, C.-C. Jay Kuo

GPCO is an unsupervised learning method that predicts object motion by matching features of consecutive point cloud scans.

Visual Odometry

Online Meta Adaptation for Variable-Rate Learned Image Compression

no code implementations 16 Nov 2021 Wei Jiang, Wei Wang, Songnan Li, Shan Liu

This work addresses two major issues of end-to-end learned image compression (LIC) based on deep neural networks: variable-rate learning where separate networks are required to generate compressed images with varying qualities, and the train-test mismatch between differentiable approximate quantization and true hard quantization.

Image Compression, Meta-Learning +2

GSIP: Green Semantic Segmentation of Large-Scale Indoor Point Clouds

no code implementations 24 Sep 2021 Min Zhang, Pranav Kadam, Shan Liu, C.-C. Jay Kuo

It is named GSIP (Green Segmentation of Indoor Point clouds) and its performance is evaluated on a representative large-scale benchmark -- the Stanford 3D Indoor Segmentation (S3DIS) dataset.

Semantic Segmentation

PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering

1 code implementation ICCV 2021 Yurui Ren, Ge Li, Yuanqi Chen, Thomas H. Li, Shan Liu

The proposed model can generate photo-realistic portrait images with accurate movements according to intuitive modifications.

Image Generation, Neural Rendering

Combining Attention with Flow for Person Image Synthesis

no code implementations 4 Aug 2021 Yurui Ren, Yubo Wu, Thomas H. Li, Shan Liu, Ge Li

Pose-guided person image synthesis aims to synthesize person images by transforming reference images into target poses.

Image Generation

Efficient Micro-Structured Weight Unification and Pruning for Neural Network Compression

no code implementations 15 Jun 2021 Sheng Lin, Wei Jiang, Wei Wang, Kaidi Xu, Yanzhi Wang, Shan Liu, Songnan Li

Compressing Deep Neural Network (DNN) models to reduce their storage and computation requirements is essential for practical applications, especially on resource-limited devices.

Neural Network Compression

Recent Standard Development Activities on Video Coding for Machines

no code implementations 26 May 2021 Wen Gao, Shan Liu, Xiaozhong Xu, Manouchehr Rafie, Yuan Zhang, Igor Curcio

Specifically, we will first provide an overview of the MPEG VCM group, including use cases, requirements, processing pipelines, and the plan for potential VCM standards, followed by the evaluation framework, including machine-vision tasks, dataset, evaluation metrics, and anchor generation.

Object Detection

Substitutional Neural Image Compression

no code implementations 16 May 2021 Xiao Wang, Wei Jiang, Wei Wang, Shan Liu, Brian Kulis, Peter Chin

The key idea is to replace the image to be compressed with a substitutional one that outperforms the original one in a desired way.

Image Compression

Tencent Video Dataset (TVD): A Video Dataset for Learning-based Visual Data Compression and Analysis

no code implementations 12 May 2021 Xiaozhong Xu, Shan Liu, Zeqiang Li

Learning-based visual data compression and analysis have attracted great interest from both academia and industry recently.

Data Compression, Object Detection

R-PointHop: A Green, Accurate, and Unsupervised Point Cloud Registration Method

1 code implementation 15 Mar 2021 Pranav Kadam, Min Zhang, Shan Liu, C.-C. Jay Kuo

Inspired by the recent PointHop classification method, an unsupervised 3D point cloud registration method, called R-PointHop, is proposed in this work.

Dimensionality Reduction, Frame +2

An Optimized H.266/VVC Software Decoder On Mobile Platform

no code implementations 5 Mar 2021 Yiming Li, Shan Liu, Yu Chen, Yushan Zheng, Sijia Chen, Bin Zhu, Jian Lou

As the successor to H.265/HEVC, the new versatile video coding standard (H.266/VVC) can provide up to 50% bitrate savings at the same subjective quality, at the cost of increased decoding complexity.

High Quality Disparity Remapping With Two-Stage Warping

no code implementations ICCV 2021 Bing Li, Chia-Wen Lin, Cheng Zheng, Shan Liu, Junsong Yuan, Bernard Ghanem, C.-C. Jay Kuo

In the second stage, we derive another warping model to refine warping results in less important regions by eliminating serious distortions in shape, disparity and 3D structure.

SSD-GAN: Measuring the Realness in the Spatial and Spectral Domains

1 code implementation 10 Dec 2020 Yuanqi Chen, Ge Li, Cece Jin, Shan Liu, Thomas Li

This issue makes the generator lack the incentive from the discriminator to learn high-frequency content of data, resulting in a significant spectrum discrepancy between generated images and real images.

TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis

no code implementations 24 Nov 2020 Qiao Tian, Yi Chen, Zewang Zhang, Heng Lu, LingHui Chen, Lei Xie, Shan Liu

On the one hand, we propose to discriminate the ground-truth waveform from the synthetic one in the frequency domain, rather than only in the time domain, to offer stronger consistency guarantees.

Speech Synthesis

Unsupervised Feedforward Feature (UFF) Learning for Point Cloud Classification and Segmentation

no code implementations 2 Sep 2020 Min Zhang, Pranav Kadam, Shan Liu, C.-C. Jay Kuo

The UFF method exploits statistical correlations of points in a point cloud set to learn shape and point features in a one-pass feedforward manner through a cascaded encoder-decoder architecture.

Classification, General Classification +1

Unsupervised Point Cloud Registration via Salient Points Analysis (SPA)

no code implementations 2 Sep 2020 Pranav Kadam, Min Zhang, Shan Liu, C.-C. Jay Kuo

An unsupervised point cloud registration method, called salient points analysis (SPA), is proposed in this work.

Point Cloud Registration

Learning Model-Blind Temporal Denoisers without Ground Truths

no code implementations 7 Jul 2020 Yanghao Li, Bichuan Guo, Jiangtao Wen, Zhen Xia, Shan Liu, Yuxing Han

Denoisers trained with synthetic data often fail to cope with the diversity of unknown noises, giving way to methods that can adapt to existing noise without knowing its ground truth.

Denoising, Optical Flow Estimation +1

AdaDurIAN: Few-shot Adaptation for Neural Text-to-Speech with DurIAN

no code implementations 12 May 2020 Zewang Zhang, Qiao Tian, Heng Lu, Ling-Hui Chen, Shan Liu

This paper investigates how to leverage a DurIAN-based average model to enable a new speaker to have both accurate pronunciation and fluent cross-lingual speaking with very limited monolingual data.

Few-Shot Learning

PointHop++: A Lightweight Learning Model on Point Sets for 3D Classification

2 code implementations 9 Feb 2020 Min Zhang, Yifan Wang, Pranav Kadam, Shan Liu, C.-C. Jay Kuo

The PointHop method was recently proposed by Zhang et al. for 3D point cloud classification with unsupervised feature extraction.

3D Classification, 3D Point Cloud Classification +2

C3DVQA: Full-Reference Video Quality Assessment with 3D Convolutional Neural Network

no code implementations 30 Oct 2019 Munan Xu, Junming Chen, Haiqiang Wang, Shan Liu, Ge Li, Zhiqiang Bai

However, video quality exhibits different characteristics from static image quality due to the existence of temporal masking effects.

Frame, Video Quality Assessment +2

Multi-mapping Image-to-Image Translation via Learning Disentanglement

1 code implementation NeurIPS 2019 Xiaoming Yu, Yuanqi Chen, Thomas Li, Shan Liu, Ge Li

Recent advances in image-to-image translation focus on learning the one-to-many mapping from two aspects: multi-modal translation and multi-domain translation.

Disentanglement, Image-to-Image Translation +1

PointHop: An Explainable Machine Learning Method for Point Cloud Classification

3 code implementations 30 Jul 2019 Min Zhang, Haoxuan You, Pranav Kadam, Shan Liu, C.-C. Jay Kuo

In the attribute building stage, we address the problem of unordered point cloud data using a space partitioning procedure and developing a robust descriptor that characterizes the relationship between a point and its one-hop neighbor in a PointHop unit.

Classification, General Classification +1

Residual-Guided In-Loop Filter Using Convolution Neural Network

no code implementations 29 Jul 2019 Wei Jia, Li Li, Zhu Li, Xiang Zhang, Shan Liu

The block-based coding structure in the hybrid video coding framework inevitably introduces compression artifacts such as blocking, ringing, etc.


Deep AutoEncoder-based Lossy Geometry Compression for Point Clouds

no code implementations 18 Apr 2019 Wei Yan, Yiting Shao, Shan Liu, Thomas H. Li, Zhu Li, Ge Li

The point cloud is a fundamental 3D representation that is widely used in real-world applications such as autonomous driving.

Autonomous Driving, Image Compression

Generative Adversarial Network based Speaker Adaptation for High Fidelity WaveNet Vocoder

no code implementations 6 Dec 2018 Qiao Tian, Bing Yang, Jing Chen, Benlai Tang, Shan Liu

Firstly, due to the noisy input signal of the model, there is still a gap between the quality of generated and natural waveforms.

BLP -- Boundary Likelihood Pinpointing Networks for Accurate Temporal Action Localization

no code implementations 6 Nov 2018 Weijie Kong, Nannan Li, Shan Liu, Thomas Li, Ge Li

Despite tremendous progress in temporal action detection, state-of-the-art methods still suffer from sharp performance deterioration when localizing the starting and ending temporal action boundaries.

Action Detection, Temporal Action Localization

Multi-Mapping Image-to-Image Translation with Central Biasing Normalization

no code implementations 26 Jun 2018 Xiaoming Yu, Zhenqiang Ying, Thomas Li, Shan Liu, Ge Li

Recent advances in image-to-image translation have seen a rise in approaches generating diverse images through a single network.

Image-to-Image Translation, Translation

