Search Results for author: Chenjie Cao

Found 22 papers, 14 papers with code

SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer

no code implementations • 4 Apr 2024 • Zijie Wu, Chaohui Yu, Yanqin Jiang, Chenjie Cao, Fan Wang, Xiang Bai

Recent advances in 2D/3D generative models enable the generation of dynamic 3D objects from a single-view video.

Paper
Add Code

Repositioning the Subject within Image

1 code implementation • 30 Jan 2024 • Yikai Wang, Chenjie Cao, Ke Fan, Qiaole Dong, YiFan Li, xiangyang xue, Yanwei Fu

Our research reveals that the fundamental sub-tasks of subject repositioning, which include filling the void left by the repositioned subject, reconstructing obscured portions of the subject and blending the subject to be consistent with surrounding areas, can be effectively reformulated as a unified, prompt-guided inpainting task.

Image Generation Image Manipulation

Paper
Code

MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo

1 code implementation • 22 Jan 2024 • Chenjie Cao, Xinlin Ren, Yanwei Fu

Recent advancements in learning-based Multi-View Stereo (MVS) methods have prominently featured transformer-based models with attention mechanisms.

Ranked #1 on Point Clouds on Tanks and Temples

3D Reconstruction Depth Estimation +1

Paper
Code

Towards Context-Stable and Visual-Consistent Image Inpainting

1 code implementation • 8 Dec 2023 • Yikai Wang, Chenjie Cao, Ke Fan Xiangyang Xue Yanwei Fu

Recent progress in inpainting increasingly relies on generative models, leveraging their strong generation capabilities for addressing large irregular masks.

Image Inpainting

Paper
Code

Local Consensus Enhanced Siamese Network with Reciprocal Loss for Two-view Correspondence Learning

no code implementations • 6 Aug 2023 • Linbo Wang, Jing Wu, Xianyong Fang, Zhengyi Liu, Chenjie Cao, Yanwei Fu

First, we propose a Local Feature Consensus (LFC) plugin block to augment the features of existing models.

Paper
Add Code

LeftRefill: Filling Right Canvas based on Left Reference through Generalized Text-to-Image Diffusion Model

3 code implementations • 19 May 2023 • Chenjie Cao, Yunuo Cai, Qiaole Dong, Yikai Wang, Yanwei Fu

As an exemplar, we leverage LeftRefill to address two different challenges: reference-guided inpainting and novel view synthesis, based on the pre-trained StableDiffusion.

Image Inpainting Image Manipulation +2

Paper
Code

Rethinking Optical Flow from Geometric Matching Consistent Perspective

1 code implementation • CVPR 2023 • Qiaole Dong, Chenjie Cao, Yanwei Fu

In this paper, we propose a rethinking to previous optical flow estimation.

Geometric Matching Optical Flow Estimation

Paper
Code

Rethinking the Multi-view Stereo from the Perspective of Rendering-based Augmentation

no code implementations • 11 Mar 2023 • Chenjie Cao, Xinlin Ren, xiangyang xue, Yanwei Fu

To address these problems, we first apply one of the state-of-the-art learning-based MVS methods, --MVSFormer, to overcome intractable scenarios such as textureless and reflections regions suffered by traditional PatchMatch methods, but it fails in a few large scenes' reconstructions.

Paper
Add Code

Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints

1 code implementation • ICCV 2023 • Chenjie Cao, Yanwei Fu

Learning robust local image feature matching is a fundamental low-level vision task, which has been widely explored in the past few years.

Pose Estimation Visual Localization

Paper
Code

ZITS++: Image Inpainting by Improving the Incremental Transformer on Structural Priors

2 code implementations • 12 Oct 2022 • Chenjie Cao, Qiaole Dong, Yanwei Fu

Specifically, given one corrupt image, we present the Transformer Structure Restorer (TSR) module to restore holistic structural priors at low image resolution, which are further upsampled by Simple Structure Upsampler (SSU) module to higher image resolution.

Image Inpainting

312

Paper
Code

MVSFormer: Multi-View Stereo by Learning Robust Image Features and Temperature-based Depth

1 code implementation • 4 Aug 2022 • Chenjie Cao, Xinlin Ren, Yanwei Fu

In this paper, we propose a pre-trained ViT enhanced MVS network called MVSFormer, which can learn more reliable feature representations benefited by informative priors from ViT.

Ranked #2 on 3D Reconstruction on DTU

3D Reconstruction Point Clouds +1

167

Paper
Code

Learning Prior Feature and Attention Enhanced Image Inpainting

1 code implementation • 3 Aug 2022 • Chenjie Cao, Qiaole Dong, Yanwei Fu

To this end, this paper incorporates the pre-training based Masked AutoEncoder (MAE) into the inpainting model, which enjoys richer informative priors to enhance the inpainting process.

Image Inpainting Image Restoration +2

Paper
Code

Wavelet Prior Attention Learning in Axial Inpainting Network

no code implementations • 7 Jun 2022 • Chenjie Cao, Chengrong Wang, Yuntao Zhang, Yanwei Fu

Image inpainting is the task of filling masked or unknown regions of an image with visually realistic contents, which has been remarkably improved by Deep Neural Networks (DNNs) recently.

Image Inpainting Semantic Segmentation

Paper
Add Code

Pixel2Mesh++: 3D Mesh Generation and Refinement from Multi-View Images

no code implementations • 21 Apr 2022 • Chao Wen, yinda zhang, Chenjie Cao, Zhuwen Li, xiangyang xue, Yanwei Fu

We study the problem of shape generation in 3D mesh representation from a small number of color images with or without camera poses.

Paper
Add Code

Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding

2 code implementations • CVPR 2022 • Qiaole Dong, Chenjie Cao, Yanwei Fu

The proposed model restores holistic image structures with a powerful attention-based transformer model in a fixed low-resolution sketch space.

Image Inpainting

312

Paper
Code

The Image Local Autoregressive Transformer

1 code implementation • NeurIPS 2021 • Chenjie Cao, Yuxin Hong, Xiang Li, Chengrong Wang, Chengming Xu, xiangyang xue, Yanwei Fu

To address these limitations, we propose a novel model -- image Local Autoregressive Transformer (iLAT), to better facilitate the locally guided image synthesis.

Image Generation

Paper
Code

Learning a Sketch Tensor Space for Image Inpainting of Man-made Scenes

1 code implementation • ICCV 2021 • Chenjie Cao, Yanwei Fu

To this end, this paper proposes learning a Sketch Tensor (ST) space for inpainting man-made scenes.

Image Inpainting

Paper
Code

SiBert: Enhanced Chinese Pre-trained Language Model with Sentence Insertion

1 code implementation • LREC 2020 • Jiahao Chen, Chenjie Cao, Xiuyan Jiang

However, some studies show that customized self-supervised tasks for a particular type of downstream task can effectively help the pre-trained model to capture more corresponding knowledge and semantic information.

Cloze Test Language Modelling +3

Paper
Code

CLUE: A Chinese Language Understanding Evaluation Benchmark

3 code implementations • COLING 2020 • Liang Xu, Hai Hu, Xuanwei Zhang, Lu Li, Chenjie Cao, Yudong Li, Yechen Xu, Kai Sun, Dian Yu, Cong Yu, Yin Tian, Qianqian Dong, Weitang Liu, Bo Shi, Yiming Cui, Junyi Li, Jun Zeng, Rongzhao Wang, Weijian Xie, Yanting Li, Yina Patterson, Zuoyu Tian, Yiwen Zhang, He Zhou, Shaoweihua Liu, Zhe Zhao, Qipeng Zhao, Cong Yue, Xinrui Zhang, Zhengliang Yang, Kyle Richardson, Zhenzhong Lan

The advent of natural language understanding (NLU) benchmarks for English, such as GLUE and SuperGLUE allows new NLU models to be evaluated across a diverse set of tasks.

General Classification Machine Reading Comprehension +4

3,815

Paper
Code

Learning to Augment Expressions for Few-shot Fine-grained Facial Expression Recognition

no code implementations • 17 Jan 2020 • Wenxuan Wang, Yanwei Fu, Qiang Sun, Tao Chen, Chenjie Cao, Ziqi Zheng, Guoqiang Xu, Han Qiu, Yu-Gang Jiang, xiangyang xue

Considering the phenomenon of uneven data distribution and lack of samples is common in real-world scenarios, we further evaluate several tasks of few-shot expression learning by virtue of our F2ED, which are to recognize the facial expressions given only few training instances.

Facial Expression Recognition Facial Expression Recognition (FER) +1

Paper
Add Code

A Fine-Grained Facial Expression Database for End-to-End Multi-Pose Facial Expression Recognition

no code implementations • 25 Jul 2019 • Wenxuan Wang, Qiang Sun, Tao Chen, Chenjie Cao, Ziqi Zheng, Guoqiang Xu, Han Qiu, Yanwei Fu

First, we create a new facial expression dataset of more than 200k images with 119 persons, 4 poses and 54 expressions.

Facial Expression Recognition Facial Expression Recognition (FER) +2

Paper
Add Code

Multimodal Emotion Recognition for One-Minute-Gradual Emotion Challenge

no code implementations • 3 May 2018 • Ziqi Zheng, Chenjie Cao, Xingwei Chen, Guoqiang Xu

The continuous dimensional emotion modelled by arousal and valence can depict complex changes of emotions.

Multimodal Emotion Recognition

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.