Search Results for author: Fanyi Wang

Found 13 papers, 6 papers with code

PoseAnimate: Zero-shot high fidelity pose controllable character animation

no code implementations • 21 Apr 2024 • Bingwen Zhu, Fanyi Wang, Tianyi Lu, Peng Liu, Jingwen Su, Jinxiu Liu, Yanhao Zhang, Zuxuan Wu, Yu-Gang Jiang, Guo-Jun Qi

Image-to-video (I2V) generation aims to create a video sequence from a single image, which requires high temporal coherence and visual fidelity with the source image. However, existing approaches suffer from character appearance inconsistency and poor preservation of fine details.

LoopAnimate: Loopable Salient Object Animation

no code implementations • 14 Apr 2024 • Fanyi Wang, Peng Liu, Haotian Hu, Dan Meng, Jingwen Su, Jinjin Xu, Yanhao Zhang, Xiaoming Ren, Zhiwang Zhang

The proposed LoopAnimate extends, for the first time, the single-pass generation length of UNet-based video generation models to 35 frames while maintaining high-quality video generation.

Object Video Generation

ADMap: Anti-disturbance framework for reconstructing online vectorized HD map

1 code implementation • 24 Jan 2024 • Haotian Hu, Fanyi Wang, Yaonong Wang, Laifeng Hu, Jingwei Xu, Zhiwang Zhang

Therefore, this paper proposes the Anti-disturbance Map reconstruction framework (ADMap).

Autonomous Driving

Lightweight high-resolution Subject Matting in the Real World

no code implementations • 12 Dec 2023 • Peng Liu, Fanyi Wang, Jingwen Su, Yanhao Zhang, GuoJun Qi

To alleviate these issues, we propose to construct a saliency object matting dataset HRSOM and a lightweight network PSUNet.

Image Matting, object-detection, +1

BARET : Balanced Attention based Real image Editing driven by Target-text Inversion

no code implementations • 9 Dec 2023 • Yuming Qiao, Fanyi Wang, Jingwen Su, Yanhao Zhang, Yunjie Yu, Siyu Wu, Guo-Jun Qi

Image editing approaches with diffusion models have developed rapidly, yet their applicability is subject to requirements such as specific editing types (e.g., foreground or background object editing, style transfer), multiple conditions (e.g., mask, sketch, caption), and time-consuming fine-tuning of diffusion models.

Image Reconstruction, Style Transfer

u-LLaVA: Unifying Multi-Modal Tasks via Large Language Model

1 code implementation • 9 Nov 2023 • Jinjin Xu, Liwu Xu, Yuzhe Yang, Xiang Li, Fanyi Wang, Yanchun Xie, Yi-Jie Huang, Yaqian Li

Recent advancements in multi-modal large language models (MLLMs) have led to substantial improvements in visual understanding, primarily driven by sophisticated modality alignment strategies.

Instruction Following, Language Modelling, +1

A Machine Vision Method for Correction of Eccentric Error: Based on Adaptive Enhancement Algorithm

no code implementations • 1 Sep 2023 • Fanyi Wang, Pin Cao, Yihui Zhang, Haotian Hu, Yongying Yang

To address the severe defocus blur of the reference crosshair image caused by the imaging characteristics of the aspherical optical element, which may cause correction to fail, an Adaptive Enhancement Algorithm (AEA) is proposed to strengthen the crosshair image.

All-pairs Consistency Learning for Weakly Supervised Semantic Segmentation

1 code implementation • 8 Aug 2023 • Weixuan Sun, Yanhao Zhang, Zhen Qin, Zheyuan Liu, Lin Cheng, Fanyi Wang, Yiran Zhong, Nick Barnes

Given a pair of augmented views, our approach regularizes the activation intensities between them, while also ensuring that the affinity across regions within each view remains consistent.

Ranked #15 on Weakly-Supervised Semantic Segmentation on COCO 2014 val

Object Localization, Weakly supervised Semantic Segmentation, +1

EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object Detection

1 code implementation • 31 Mar 2023 • Haotian Hu, Fanyi Wang, Jingwen Su, Yaonong Wang, Laifeng Hu, Weiye Fang, Jingwei Xu, Zhiwang Zhang

In recent years, great progress has been made in Lift-Splat-Shot-based (LSS-based) 3D object detection methods.

Ranked #1 on 3D Object Detection on nuScenes

3D Object Detection, Depth Estimation, +2

GAM : Gradient Attention Module of Optimization for Point Clouds Analysis

1 code implementation • 19 Mar 2023 • Haotian Hu, Fanyi Wang, Jingwen Su, Hongtao Zhou, Yaonong Wang, Laifeng Hu, Yanhao Zhang, Zhiwang Zhang

In point cloud analysis tasks, the existing local feature aggregation descriptors (LFAD) are unable to fully utilize information in the neighborhood of central points.

VA-GCN: A Vector Attention Graph Convolution Network for learning on Point Clouds

no code implementations • 1 Jun 2021 • Haotian Hu, Fanyi Wang, Huixiao Le

Owing to the development of research on local aggregation operators, dramatic breakthroughs have been made in point cloud analysis models.

3D Classification

A De-raining semantic segmentation network for real-time foreground segmentation

no code implementations • 16 Apr 2021 • Fanyi Wang, Yihui Zhang

According to the control experiments, the MultiScaleSE Block and Asymmetric Skip improve the Foreground Accuracy index to a certain degree compared with SEResNet18 and Symmetric Skip, respectively.

Foreground Segmentation, Real-Time Semantic Segmentation, +1

BAM: A Balanced Attention Mechanism for Single Image Super Resolution

1 code implementation • 15 Apr 2021 • Fanyi Wang, Haotian Hu, Cheng Shen

The results demonstrate that BAM can efficiently improve the networks' performance, and for those that originally include an attention mechanism, substituting BAM further reduces the number of parameters and increases the inference speed.

Image Super-Resolution
