Search Results for author: Chenjie Cao

Found 22 papers, 14 papers with code

SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer

no code implementations4 Apr 2024 Zijie Wu, Chaohui Yu, Yanqin Jiang, Chenjie Cao, Fan Wang, Xiang Bai

Recent advances in 2D/3D generative models enable the generation of dynamic 3D objects from a single-view video.

motion prediction

Repositioning the Subject within Image

1 code implementation30 Jan 2024 Yikai Wang, Chenjie Cao, Ke Fan, Qiaole Dong, YiFan Li, xiangyang xue, Yanwei Fu

Our research reveals that the fundamental sub-tasks of subject repositioning, which include filling the void left by the repositioned subject, reconstructing obscured portions of the subject and blending the subject to be consistent with surrounding areas, can be effectively reformulated as a unified, prompt-guided inpainting task.

Image Generation Image Manipulation

MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo

1 code implementation22 Jan 2024 Chenjie Cao, Xinlin Ren, Yanwei Fu

Recent advancements in learning-based Multi-View Stereo (MVS) methods have prominently featured transformer-based models with attention mechanisms.

3D Reconstruction Depth Estimation +1

Towards Context-Stable and Visual-Consistent Image Inpainting

1 code implementation8 Dec 2023 Yikai Wang, Chenjie Cao, Ke Fan Xiangyang Xue Yanwei Fu

Recent progress in inpainting increasingly relies on generative models, leveraging their strong generation capabilities for addressing large irregular masks.

Image Inpainting

Local Consensus Enhanced Siamese Network with Reciprocal Loss for Two-view Correspondence Learning

no code implementations6 Aug 2023 Linbo Wang, Jing Wu, Xianyong Fang, Zhengyi Liu, Chenjie Cao, Yanwei Fu

First, we propose a Local Feature Consensus (LFC) plugin block to augment the features of existing models.

LeftRefill: Filling Right Canvas based on Left Reference through Generalized Text-to-Image Diffusion Model

3 code implementations19 May 2023 Chenjie Cao, Yunuo Cai, Qiaole Dong, Yikai Wang, Yanwei Fu

As an exemplar, we leverage LeftRefill to address two different challenges: reference-guided inpainting and novel view synthesis, based on the pre-trained StableDiffusion.

Image Inpainting Image Manipulation +2

Rethinking the Multi-view Stereo from the Perspective of Rendering-based Augmentation

no code implementations11 Mar 2023 Chenjie Cao, Xinlin Ren, xiangyang xue, Yanwei Fu

To address these problems, we first apply one of the state-of-the-art learning-based MVS methods, --MVSFormer, to overcome intractable scenarios such as textureless and reflections regions suffered by traditional PatchMatch methods, but it fails in a few large scenes' reconstructions.

Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints

1 code implementation ICCV 2023 Chenjie Cao, Yanwei Fu

Learning robust local image feature matching is a fundamental low-level vision task, which has been widely explored in the past few years.

Pose Estimation Visual Localization

ZITS++: Image Inpainting by Improving the Incremental Transformer on Structural Priors

2 code implementations12 Oct 2022 Chenjie Cao, Qiaole Dong, Yanwei Fu

Specifically, given one corrupt image, we present the Transformer Structure Restorer (TSR) module to restore holistic structural priors at low image resolution, which are further upsampled by Simple Structure Upsampler (SSU) module to higher image resolution.

Image Inpainting

MVSFormer: Multi-View Stereo by Learning Robust Image Features and Temperature-based Depth

1 code implementation4 Aug 2022 Chenjie Cao, Xinlin Ren, Yanwei Fu

In this paper, we propose a pre-trained ViT enhanced MVS network called MVSFormer, which can learn more reliable feature representations benefited by informative priors from ViT.

3D Reconstruction Point Clouds +1

Learning Prior Feature and Attention Enhanced Image Inpainting

1 code implementation3 Aug 2022 Chenjie Cao, Qiaole Dong, Yanwei Fu

To this end, this paper incorporates the pre-training based Masked AutoEncoder (MAE) into the inpainting model, which enjoys richer informative priors to enhance the inpainting process.

Image Inpainting Image Restoration +2

Wavelet Prior Attention Learning in Axial Inpainting Network

no code implementations7 Jun 2022 Chenjie Cao, Chengrong Wang, Yuntao Zhang, Yanwei Fu

Image inpainting is the task of filling masked or unknown regions of an image with visually realistic contents, which has been remarkably improved by Deep Neural Networks (DNNs) recently.

Image Inpainting Semantic Segmentation

Pixel2Mesh++: 3D Mesh Generation and Refinement from Multi-View Images

no code implementations21 Apr 2022 Chao Wen, yinda zhang, Chenjie Cao, Zhuwen Li, xiangyang xue, Yanwei Fu

We study the problem of shape generation in 3D mesh representation from a small number of color images with or without camera poses.

Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding

2 code implementations CVPR 2022 Qiaole Dong, Chenjie Cao, Yanwei Fu

The proposed model restores holistic image structures with a powerful attention-based transformer model in a fixed low-resolution sketch space.

Image Inpainting

The Image Local Autoregressive Transformer

1 code implementation NeurIPS 2021 Chenjie Cao, Yuxin Hong, Xiang Li, Chengrong Wang, Chengming Xu, xiangyang xue, Yanwei Fu

To address these limitations, we propose a novel model -- image Local Autoregressive Transformer (iLAT), to better facilitate the locally guided image synthesis.

Image Generation

Learning a Sketch Tensor Space for Image Inpainting of Man-made Scenes

1 code implementation ICCV 2021 Chenjie Cao, Yanwei Fu

To this end, this paper proposes learning a Sketch Tensor (ST) space for inpainting man-made scenes.

Image Inpainting

SiBert: Enhanced Chinese Pre-trained Language Model with Sentence Insertion

1 code implementation LREC 2020 Jiahao Chen, Chenjie Cao, Xiuyan Jiang

However, some studies show that customized self-supervised tasks for a particular type of downstream task can effectively help the pre-trained model to capture more corresponding knowledge and semantic information.

Cloze Test Language Modelling +3

Learning to Augment Expressions for Few-shot Fine-grained Facial Expression Recognition

no code implementations17 Jan 2020 Wenxuan Wang, Yanwei Fu, Qiang Sun, Tao Chen, Chenjie Cao, Ziqi Zheng, Guoqiang Xu, Han Qiu, Yu-Gang Jiang, xiangyang xue

Considering the phenomenon of uneven data distribution and lack of samples is common in real-world scenarios, we further evaluate several tasks of few-shot expression learning by virtue of our F2ED, which are to recognize the facial expressions given only few training instances.

Facial Expression Recognition Facial Expression Recognition (FER) +1

Multimodal Emotion Recognition for One-Minute-Gradual Emotion Challenge

no code implementations3 May 2018 Ziqi Zheng, Chenjie Cao, Xingwei Chen, Guoqiang Xu

The continuous dimensional emotion modelled by arousal and valence can depict complex changes of emotions.

Multimodal Emotion Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.