Search Results for author: Pan Gao

Found 23 papers, 16 papers with code

ProxyFormer: Proxy Alignment Assisted Point Cloud Completion with Missing Part Sensitive Transformer

1 code implementation CVPR 2023 Shanshan Li, Pan Gao, Xiaoyang Tan, Mingqiang Wei

Specifically, we fuse information into point proxy via feature and position extractor, and generate features for missing point proxies from the features of existing point proxies.

Point Cloud Completion Position

SSformer: A Lightweight Transformer for Semantic Segmentation

1 code implementation 3 Aug 2022 Wentao Shi, Jing Xu, Pan Gao

It is well believed that Transformer performs better in semantic segmentation compared to convolutional neural networks.

Image Classification Segmentation +1

Patch-Based Deep Autoencoder for Point Cloud Geometry Compression

1 code implementation 18 Oct 2021 Kang You, Pan Gao

Unlike existing point cloud compression networks, which apply feature extraction and reconstruction on the entire point cloud, we divide the point cloud into patches and compress each patch independently.

Point cloud reconstruction

M$^3$Net: Multilevel, Mixed and Multistage Attention Network for Salient Object Detection

1 code implementation 15 Sep 2023 Yao Yuan, Pan Gao, Xiaoyang Tan

To overcome these, we propose the M$^3$Net, i.e., the Multilevel, Mixed and Multistage attention network for Salient Object Detection (SOD).

object-detection RGB Salient Object Detection +1

Degradation-Aware Self-Attention Based Transformer for Blind Image Super-Resolution

1 code implementation 6 Oct 2023 Qingguo Liu, Pan Gao, Kang Han, Ningzhong Liu, Wei Xiang

In particular, we integrate both CNN and Transformer components into the SR network, where we first use the CNN modulated by the degradation information to extract local features, and then employ the degradation-aware Transformer to extract global semantic features.

Blind Super-Resolution Contrastive Learning +2

UperFormer: A Multi-scale Transformer-based Decoder for Semantic Segmentation

1 code implementation 25 Nov 2022 Jing Xu, Wentao Shi, Pan Gao, Zhengwei Wang, Qizhu Li

On the more challenging ADE20K dataset, our best model yields a single-scale mIoU of 50.18, and a multi-scale mIoU of 51.8, which is on-par with the current state-of-art model, while we drastically cut the number of FLOPs by 53.5%.

Object Localization Segmentation +1

IPDAE: Improved Patch-Based Deep Autoencoder for Lossy Point Cloud Geometry Compression

1 code implementation 4 Aug 2022 Kang You, Pan Gao, Qing Li

Point cloud is a crucial representation of 3D contents, which has been widely used in many areas such as virtual reality, mixed reality, autonomous driving, etc.

Autonomous Driving Mixed Reality

Dynamic Local Feature Aggregation for Learning on Point Clouds

1 code implementation 7 Jan 2023 Zihao Li, Pan Gao, Hui Yuan, Ran Wei

Existing point cloud learning methods aggregate features from neighbouring points relying on constructing graph in the spatial domain, which results in feature update for each point based on spatially-fixed neighbours throughout layers.

Point Cloud Classification Position

Exploiting Inductive Bias in Transformer for Point Cloud Classification and Segmentation

1 code implementation 27 Apr 2023 Zihao Li, Pan Gao, Hui Yuan, Ran Wei, Manoranjan Paul

Discovering inter-point connection for efficient high-dimensional feature extraction from point coordinate is a key challenge in processing point cloud.

3D Object Classification 3D Part Segmentation +3

Blind Image Quality Assessment via Transformer Predicted Error Map and Perceptual Quality Token

1 code implementation 16 May 2023 Jinsong Shi, Pan Gao, Aljosa Smolic

Specifically, we firstly generate the predicted error map by pre-training one model consisting of a Transformer encoder and decoder, in which the objective difference between the distorted and the reference images is used as supervision.

Blind Image Quality Assessment No-Reference Image Quality Assessment +1

CAB: Empathetic Dialogue Generation with Cognition, Affection and Behavior

1 code implementation 3 Feb 2023 Pan Gao, Donghong Han, Rui Zhou, Xuejiao Zhang, Zikun Wang

For behavior, we use appropriate dialogue acts to guide the dialogue generation to enhance the empathy expression.

Dialogue Generation

StylePrompter: All Styles Need Is Attention

1 code implementation 30 Jul 2023 Chenyi Zhuang, Pan Gao, Aljosa Smolic

We then prove that StylePrompter lies in a more disentangled $\mathcal{W^+}$ and show the controllability of SMART.

Attribute Image Manipulation

Transformer-based No-Reference Image Quality Assessment via Supervised Contrastive Learning

1 code implementation 12 Dec 2023 Jinsong Shi, Pan Gao, Jie Qin

We first train a model on a large-scale synthetic dataset by SCL (no image subjective score is required) to extract degradation features of images with various distortion types and levels.

Contrastive Learning Inductive Bias +2

Video Frame Interpolation Based on Deformable Kernel Region

1 code implementation 25 Apr 2022 Haoyue Tian, Pan Gao, Xiaojiang Peng

In order to solve this problem, we revisit the deformable convolution for video interpolation, which can break the fixed grid restrictions on the kernel region, making the distribution of reference points more suitable for the shape of the object, and thus warp a more accurate interpolation frame.

Optical Flow Estimation Video Frame Interpolation

Countering Adversarial Examples: Combining Input Transformation and Noisy Training

no code implementations 25 Jun 2021 Cheng Zhang, Pan Gao

Prior work has shown that JPEG compression can combat the drop in classification accuracy on adversarial examples to some extent.

Data Augmentation Quantization

Defending Adversaries Using Unsupervised Feature Clustering VAE

no code implementations ICML Workshop AML 2021 Cheng Zhang, Pan Gao

We propose a modified VAE (variational autoencoder) as a denoiser to remove adversarial perturbations for image classification.

Clustering Image Classification

Self-Ensemling for 3D Point Cloud Domain Adaption

no code implementations 10 Dec 2021 Qing Li, Xiaojiang Peng, Chuan Yan, Pan Gao, Qi Hao

In SEN, a student network is kept in a collaborative manner with supervised learning and self-supervised learning, and a teacher network conducts temporal consistency to learn useful representations and ensure the quality of point clouds reconstruction.

Autonomous Driving Self-Supervised Learning +1

Video-based Smoky Vehicle Detection with A Coarse-to-Fine Framework

no code implementations 8 Jul 2022 Xiaojiang Peng, Xiaomao Fan, Qingyang Wu, Jieyan Zhao, Pan Gao

Moreover, we present a new Coarse-to-fine Deep Smoky vehicle detection (CoDeS) framework for efficient smoky vehicle detection.

MRGAN360: Multi-stage Recurrent Generative Adversarial Network for 360 Degree Image Saliency Prediction

no code implementations 15 Mar 2023 Pan Gao, Xinlang Chen, Rong Quan, Wei Xiang

We employ a recurrent neural network among adjacent prediction stages to model their correlations, and exploit a discriminator at the end of each stage to supervise the output saliency map.

Generative Adversarial Network Saliency Prediction

Video Frame Interpolation with Flow Transformer

no code implementations 30 Jul 2023 Pan Gao, Haoyue Tian, Jie Qin

Specifically, we design a Flow Transformer Block that calculates the temporal self-attention in a matched local area with the guidance of flow, making our framework suitable for interpolating frames with large motion while maintaining reasonably low complexity.

Video Frame Interpolation

Medical Report Generation based on Segment-Enhanced Contrastive Representation Learning

no code implementations 26 Dec 2023 Ruoqing Zhao, Xi Wang, Hongliang Dai, Pan Gao, Piji Li

Automated radiology report generation has the potential to improve radiology reporting and alleviate the workload of radiologists.

Contrastive Learning Image Segmentation +4
