Search Results for author: Weimin Tan

Found 24 papers, 8 papers with code

A Medical Data-Effective Learning Benchmark for Highly Efficient Pre-training of Foundation Models

no code implementations • 31 Jan 2024 • Wenxuan Yang, Weimin Tan, Yuqi Sun, Bo Yan

This paper introduces data-effective learning, aiming to use data in the most impactful way to pre-train foundation models.

Paper
Add Code

Low-latency Space-time Supersampling for Real-time Rendering

1 code implementation • 18 Dec 2023 • Ruian He, Shili Zhou, Yuqi Sun, Ri Cheng, Weimin Tan, Bo Yan

With the rise of real-time rendering and the evolution of display devices, there is a growing demand for post-processing methods that offer high-resolution content in a high frame rate.

Paper
Code

Context-Aware Iteration Policy Network for Efficient Optical Flow Estimation

no code implementations • 12 Dec 2023 • Ri Cheng, Ruian He, Xuhao Jiang, Shili Zhou, Weimin Tan, Bo Yan

In this paper, we develop a Context-Aware Iteration Policy Network for efficient optical flow estimation, which determines the optimal number of iterations per sample.

Optical Flow Estimation

Paper
Add Code

Unsupervised Disentangling of Facial Representations with 3D-aware Latent Diffusion Models

1 code implementation • 15 Sep 2023 • Ruian He, Zhen Xing, Weimin Tan, Bo Yan

Second, we propose a novel representation diffusion model (RDM) to disentangle 3D latent into facial identity and expression.

Face Verification Facial Expression Recognition +1

Paper
Code

MVFlow: Deep Optical Flow Estimation of Compressed Videos with Motion Vector Prior

no code implementations • 3 Aug 2023 • Shili Zhou, Xuhao Jiang, Weimin Tan, Ruian He, Bo Yan

Motion vectors, one of the compression information, record the motion of the video frames.

Optical Flow Estimation

Paper
Add Code

SAMFlow: Eliminating Any Fragmentation in Optical Flow with Segment Anything Model

no code implementations • 31 Jul 2023 • Shili Zhou, Ruian He, Weimin Tan, Bo Yan

Optical Flow Estimation aims to find the 2D dense motion field between two frames.

Motion Estimation Optical Flow Estimation

Paper
Add Code

Uncertainty-Guided Spatial Pruning Architecture for Efficient Frame Interpolation

no code implementations • 31 Jul 2023 • Ri Cheng, Xuhao Jiang, Ruian He, Shili Zhou, Weimin Tan, Bo Yan

We can use dynamic spatial pruning method to skip redundant computation, but this method cannot properly identify easy regions in VFI tasks without supervision.

Video Frame Interpolation

Paper
Add Code

DifFSS: Diffusion Model for Few-Shot Semantic Segmentation

1 code implementation • 3 Jul 2023 • Weimin Tan, Siyuan Chen, Bo Yan

Although various few-shot semantic segmentation (FSS) models with different network structures have been proposed, performance improvement has reached a bottleneck.

Ranked #5 on Few-Shot Semantic Segmentation on PASCAL-5i (1-Shot)

Few-Shot Semantic Segmentation Image Generation +2

Paper
Code

Instruct-NeuralTalker: Editing Audio-Driven Talking Radiance Fields with Instructions

no code implementations • 19 Jun 2023 • Yuqi Sun, Ruian He, Weimin Tan, Bo Yan

Given a short speech video, we first build an efficient talking radiance field, and then apply the latest conditional diffusion model for image editing based on the given instructions and guiding implicit representation optimization towards the editing target.

Talking Face Generation

Paper
Add Code

Learning Survival Distribution with Implicit Survival Function

1 code implementation • 24 May 2023 • Yu Ling, Weimin Tan, Bo Yan

Survival analysis aims at modeling the relationship between covariates and event occurrence with some untracked (censored) samples.

Numerical Integration Survival Analysis

Paper
Code

Multi-Modality Deep Network for JPEG Artifacts Reduction

no code implementations • 4 May 2023 • Xuhao Jiang, Weimin Tan, Qing Lin, Chenxi Ma, Bo Yan, Liquan Shen

In recent years, many convolutional neural network-based models are designed for JPEG artifacts reduction, and have achieved notable progress.

Contrastive Learning Image Compression +1

Paper
Add Code

Multi-Modality Deep Network for Extreme Learned Image Compression

no code implementations • 26 Apr 2023 • Xuhao Jiang, Weimin Tan, Tian Tan, Bo Yan, Liquan Shen

Image-based single-modality compression learning approaches have demonstrated exceptionally powerful encoding and decoding capabilities in the past few years , but suffer from blur and severe semantics loss at extremely low bitrates.

Image Compression

Paper
Add Code

Geometry-Aware Reference Synthesis for Multi-View Image Super-Resolution

no code implementations • 18 Jul 2022 • Ri Cheng, Yuqi Sun, Bo Yan, Weimin Tan, Chenxi Ma

To address these problems, we propose the MVSRnet, which uses geometry information to extract sharp details from all LR multi-view to support the SR of the LR input view.

Image Super-Resolution Video Super-Resolution

Paper
Add Code

Learning Parallax Transformer Network for Stereo Image JPEG Artifacts Removal

no code implementations • 15 Jul 2022 • Xuhao Jiang, Weimin Tan, Ri Cheng, Shili Zhou, Bo Yan

Under stereo settings, the performance of image JPEG artifacts removal can be further improved by exploiting the additional information provided by a second view.

Paper
Add Code

Rethinking Super-Resolution as Text-Guided Details Generation

no code implementations • 14 Jul 2022 • Chenxi Ma, Bo Yan, Qing Lin, Weimin Tan, Siming Chen

To enhance the semantic accuracy and the visual quality of the reconstructed image, we explore the multi-modal fusion learning in SISR by proposing a Text-Guided Super-Resolution (TGSR) framework, which can effectively utilize the information from the text and image modalities.

Image Super-Resolution

Paper
Add Code

Perception-Oriented Stereo Image Super-Resolution

no code implementations • 14 Jul 2022 • Chenxi Ma, Bo Yan, Weimin Tan, Xuhao Jiang

Recent studies of deep learning based stereo image super-resolution (StereoSR) have promoted the development of StereoSR.

Disparity Estimation Stereo Image Super-Resolution

Paper
Add Code

Learning Robust Image-Based Rendering on Sparse Scene Geometry via Depth Completion

no code implementations • CVPR 2022 • Yuqi Sun, Shili Zhou, Ri Cheng, Weimin Tan, Bo Yan, Lang Fu

Specifically, GR stage takes sparse depth map and RGB as input to predict dense depth map by exploiting the correlation between two modals.

Depth Completion

Paper
Add Code

Feature Pyramid Network for Multi-task Affective Analysis

1 code implementation • 8 Jul 2021 • Ruian He, Zhen Xing, Weimin Tan, Bo Yan

Affective Analysis is not a single task, and the valence-arousal value, expression class, and action unit can be predicted at the same time.

Paper
Code

Perceptual Variousness Motion Deblurring With Light Global Context Refinement

1 code implementation • ICCV 2021 • Jichun Li, Weimin Tan, Bo Yan

Deep learning algorithms have made significant progress in dynamic scene deblurring.

Deblurring Image Deblurring +1

Paper
Code

Frame and Feature-Context Video Super-Resolution

no code implementations • 28 Sep 2019 • Bo Yan, Chuming Lin, Weimin Tan

For video super-resolution, current state-of-the-art approaches either process multiple low-resolution (LR) frames to produce each output high-resolution (HR) frame separately in a sliding window fashion or recurrently exploit the previously estimated HR frames to super-resolve the following frame.

Video Super-Resolution

Paper
Add Code

RDGAN : Retinex Decomposition Based Adversarial Learning for Low-Light Enhancement

1 code implementation • 2019 IEEE International Conference on Multimedia and Expo (ICME) 2019 • Junyi Wang, Weimin Tan, Xuejing Niu and Bo Yan

We also present a new RDGAN (Retinex Decomposition based Generative Adversarial Network) loss, which is computed on the decomposed reflectance components of the enhanced an the reference images.

Generative Adversarial Network

Paper
Code

Cycle-IR: Deep Cyclic Image Retargeting

1 code implementation • 9 May 2019 • Weimin Tan, Bo Yan, Chumin Lin, Xuejing Niu

Our idea is built on the reverse mapping from the retargeted images to the given input images.

Image Retargeting

Paper
Code

Feature Super-Resolution: Make Machine See More Clearly

no code implementations • CVPR 2018 • Weimin Tan, Bo Yan, Bahetiyaer Bare

In this paper, different from image super-resolution (ISR), we propose a novel super-resolution technique called feature super-resolution (FSR), which aims at enhancing the discriminatory power of small size image in order to provide high recognition precision for machine.

Generative Adversarial Network Image Super-Resolution +1

Paper
Add Code

Understanding the Feedforward Artificial Neural Network Model From the Perspective of Network Flow

no code implementations • 26 Apr 2017 • Dawei Dai, Weimin Tan, Hong Zhan

Experiments for two types of ANN model including multi-layer MLP and CNN verify that the network flow based on class-pathway is a reasonable explanation for ANN models.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.