Search Results for author: Weimin Tan

Found 24 papers, 8 papers with code

A Medical Data-Effective Learning Benchmark for Highly Efficient Pre-training of Foundation Models

no code implementations31 Jan 2024 Wenxuan Yang, Weimin Tan, Yuqi Sun, Bo Yan

This paper introduces data-effective learning, aiming to use data in the most impactful way to pre-train foundation models.

Low-latency Space-time Supersampling for Real-time Rendering

1 code implementation18 Dec 2023 Ruian He, Shili Zhou, Yuqi Sun, Ri Cheng, Weimin Tan, Bo Yan

With the rise of real-time rendering and the evolution of display devices, there is a growing demand for post-processing methods that offer high-resolution content in a high frame rate.

Context-Aware Iteration Policy Network for Efficient Optical Flow Estimation

no code implementations12 Dec 2023 Ri Cheng, Ruian He, Xuhao Jiang, Shili Zhou, Weimin Tan, Bo Yan

In this paper, we develop a Context-Aware Iteration Policy Network for efficient optical flow estimation, which determines the optimal number of iterations per sample.

Optical Flow Estimation

Unsupervised Disentangling of Facial Representations with 3D-aware Latent Diffusion Models

1 code implementation15 Sep 2023 Ruian He, Zhen Xing, Weimin Tan, Bo Yan

Second, we propose a novel representation diffusion model (RDM) to disentangle 3D latent into facial identity and expression.

Face Verification Facial Expression Recognition +1

Uncertainty-Guided Spatial Pruning Architecture for Efficient Frame Interpolation

no code implementations31 Jul 2023 Ri Cheng, Xuhao Jiang, Ruian He, Shili Zhou, Weimin Tan, Bo Yan

We can use dynamic spatial pruning method to skip redundant computation, but this method cannot properly identify easy regions in VFI tasks without supervision.

Video Frame Interpolation

DifFSS: Diffusion Model for Few-Shot Semantic Segmentation

1 code implementation3 Jul 2023 Weimin Tan, Siyuan Chen, Bo Yan

Although various few-shot semantic segmentation (FSS) models with different network structures have been proposed, performance improvement has reached a bottleneck.

Few-Shot Semantic Segmentation Image Generation +2

Instruct-NeuralTalker: Editing Audio-Driven Talking Radiance Fields with Instructions

no code implementations19 Jun 2023 Yuqi Sun, Ruian He, Weimin Tan, Bo Yan

Given a short speech video, we first build an efficient talking radiance field, and then apply the latest conditional diffusion model for image editing based on the given instructions and guiding implicit representation optimization towards the editing target.

Talking Face Generation

Learning Survival Distribution with Implicit Survival Function

1 code implementation24 May 2023 Yu Ling, Weimin Tan, Bo Yan

Survival analysis aims at modeling the relationship between covariates and event occurrence with some untracked (censored) samples.

Numerical Integration Survival Analysis

Multi-Modality Deep Network for JPEG Artifacts Reduction

no code implementations4 May 2023 Xuhao Jiang, Weimin Tan, Qing Lin, Chenxi Ma, Bo Yan, Liquan Shen

In recent years, many convolutional neural network-based models are designed for JPEG artifacts reduction, and have achieved notable progress.

Contrastive Learning Image Compression +1

Multi-Modality Deep Network for Extreme Learned Image Compression

no code implementations26 Apr 2023 Xuhao Jiang, Weimin Tan, Tian Tan, Bo Yan, Liquan Shen

Image-based single-modality compression learning approaches have demonstrated exceptionally powerful encoding and decoding capabilities in the past few years , but suffer from blur and severe semantics loss at extremely low bitrates.

Image Compression

Geometry-Aware Reference Synthesis for Multi-View Image Super-Resolution

no code implementations18 Jul 2022 Ri Cheng, Yuqi Sun, Bo Yan, Weimin Tan, Chenxi Ma

To address these problems, we propose the MVSRnet, which uses geometry information to extract sharp details from all LR multi-view to support the SR of the LR input view.

Image Super-Resolution Video Super-Resolution

Learning Parallax Transformer Network for Stereo Image JPEG Artifacts Removal

no code implementations15 Jul 2022 Xuhao Jiang, Weimin Tan, Ri Cheng, Shili Zhou, Bo Yan

Under stereo settings, the performance of image JPEG artifacts removal can be further improved by exploiting the additional information provided by a second view.

Rethinking Super-Resolution as Text-Guided Details Generation

no code implementations14 Jul 2022 Chenxi Ma, Bo Yan, Qing Lin, Weimin Tan, Siming Chen

To enhance the semantic accuracy and the visual quality of the reconstructed image, we explore the multi-modal fusion learning in SISR by proposing a Text-Guided Super-Resolution (TGSR) framework, which can effectively utilize the information from the text and image modalities.

Image Super-Resolution

Perception-Oriented Stereo Image Super-Resolution

no code implementations14 Jul 2022 Chenxi Ma, Bo Yan, Weimin Tan, Xuhao Jiang

Recent studies of deep learning based stereo image super-resolution (StereoSR) have promoted the development of StereoSR.

Disparity Estimation Stereo Image Super-Resolution

Learning Robust Image-Based Rendering on Sparse Scene Geometry via Depth Completion

no code implementations CVPR 2022 Yuqi Sun, Shili Zhou, Ri Cheng, Weimin Tan, Bo Yan, Lang Fu

Specifically, GR stage takes sparse depth map and RGB as input to predict dense depth map by exploiting the correlation between two modals.

Depth Completion

Feature Pyramid Network for Multi-task Affective Analysis

1 code implementation8 Jul 2021 Ruian He, Zhen Xing, Weimin Tan, Bo Yan

Affective Analysis is not a single task, and the valence-arousal value, expression class, and action unit can be predicted at the same time.

Frame and Feature-Context Video Super-Resolution

no code implementations28 Sep 2019 Bo Yan, Chuming Lin, Weimin Tan

For video super-resolution, current state-of-the-art approaches either process multiple low-resolution (LR) frames to produce each output high-resolution (HR) frame separately in a sliding window fashion or recurrently exploit the previously estimated HR frames to super-resolve the following frame.

Video Super-Resolution

RDGAN : Retinex Decomposition Based Adversarial Learning for Low-Light Enhancement

1 code implementation 2019 IEEE International Conference on Multimedia and Expo (ICME) 2019 Junyi Wang, Weimin Tan, Xuejing Niu and Bo Yan

We also present a new RDGAN (Retinex Decomposition based Generative Adversarial Network) loss, which is computed on the decomposed reflectance components of the enhanced an the reference images.

Generative Adversarial Network

Cycle-IR: Deep Cyclic Image Retargeting

1 code implementation9 May 2019 Weimin Tan, Bo Yan, Chumin Lin, Xuejing Niu

Our idea is built on the reverse mapping from the retargeted images to the given input images.

Image Retargeting

Feature Super-Resolution: Make Machine See More Clearly

no code implementations CVPR 2018 Weimin Tan, Bo Yan, Bahetiyaer Bare

In this paper, different from image super-resolution (ISR), we propose a novel super-resolution technique called feature super-resolution (FSR), which aims at enhancing the discriminatory power of small size image in order to provide high recognition precision for machine.

Generative Adversarial Network Image Super-Resolution +1

Understanding the Feedforward Artificial Neural Network Model From the Perspective of Network Flow

no code implementations26 Apr 2017 Dawei Dai, Weimin Tan, Hong Zhan

Experiments for two types of ANN model including multi-layer MLP and CNN verify that the network flow based on class-pathway is a reasonable explanation for ANN models.

Cannot find the paper you are looking for? You can Submit a new open access paper.