UMBRAE: Unified Multimodal Decoding of Brain Signals

no code implementations • 10 Apr 2024 • Weihao Xia, Raoul de Charette, Cengiz Öztireli, Jing-Hao Xue

We address prevailing challenges in brain-powered research, starting from the observation that existing methods rarely recover accurate spatial information and require subject-specific models.

Language Modelling, Large Language Model

DREAM: Visual Decoding from Reversing Human Visual System

no code implementations • 3 Oct 2023 • Weihao Xia, Raoul de Charette, Cengiz Öztireli, Jing-Hao Xue

In this work, we present DREAM, an fMRI-to-image method for reconstructing viewed images from brain activity, grounded in fundamental knowledge of the human visual system.

A Survey on Deep Generative 3D-aware Image Synthesis

1 code implementation • 25 Oct 2022 • Weihao Xia, Jing-Hao Xue

Recent years have seen remarkable progress in deep-learning-powered visual content creation.

3D-Aware Image Synthesis

Modelling Latent Dynamics of StyleGAN using Neural ODEs

1 code implementation • 23 Aug 2022 • Weihao Xia, Yujiu Yang, Jing-Hao Xue

The entire sequence is seen as discrete-time observations of a continuous trajectory of the initial latent code, by considering each latent code as a moving particle and the latent space as a high-dimensional dynamic system.

Video Editing

Learning Quality-aware Dynamic Memory for Video Object Segmentation

1 code implementation • 16 Jul 2022 • Yong Liu, Ran Yu, Fei Yin, Xinyuan Zhao, Wei Zhao, Weihao Xia, Yujiu Yang

However, they mainly focus on better matching between the current frame and the memory frames without explicitly paying attention to the quality of the memory.

Ranked #11 on Semi-Supervised Video Object Segmentation on DAVIS 2016 (using extra training data)

Segmentation, Semantic Segmentation

High-fidelity GAN Inversion with Padding Space

1 code implementation • 21 Mar 2022 • Qingyan Bai, Yinghao Xu, Jiapeng Zhu, Weihao Xia, Yujiu Yang, Yujun Shen

In this work, we propose to involve the padding space of the generator to complement the latent space with spatial information.

Generative Adversarial Network, Image Manipulation

Identity-guided Face Generation with Multi-modal Contour Conditions

no code implementations • 10 Oct 2021 • Qingyan Bai, Weihao Xia, Fei Yin, Yujiu Yang

Concretely, we propose a novel dual-encoder architecture, in which an identity encoder extracts the identity-related feature, while a main encoder obtains the rough contour information and then fuses all the information together.

Face Generation, Image Restoration

Real-time Human-Centric Segmentation for Complex Video Scenes

1 code implementation • 16 Aug 2021 • Ran Yu, Chenyu Tian, Weihao Xia, Xinyuan Zhao, Haoqian Wang, Yujiu Yang

To alleviate this problem, we propose a mechanism named Inner Center Sampling to improve the accuracy of instance segmentation.

Instance Segmentation, Segmentation

PoseDet: Fast Multi-Person Pose Estimation Using Pose Embedding

1 code implementation • 22 Jul 2021 • Chenyu Tian, Ran Yu, Xinyuan Zhao, Weihao Xia, Haoqian Wang, Yujiu Yang

This simple framework achieves an unprecedented speed and a competitive accuracy on the COCO benchmark compared with state-of-the-art methods.

Multi-Person Pose Estimation

Towards Open-World Text-Guided Face Image Generation and Manipulation

2 code implementations • 18 Apr 2021 • Weihao Xia, Yujiu Yang, Jing-Hao Xue, Baoyuan Wu

To be specific, we propose a brand new paradigm of text-guided image generation and manipulation based on the superior characteristics of a pretrained GAN model.

Language Modelling, Semantic Segmentation

GAN Inversion: A Survey

1 code implementation • 14 Jan 2021 • Weihao Xia, Yulun Zhang, Yujiu Yang, Jing-Hao Xue, Bolei Zhou, Ming-Hsuan Yang

GAN inversion aims to invert a given image back into the latent space of a pretrained GAN model, so that the generator can faithfully reconstruct the image from the inverted code.

Image Manipulation, Image Restoration

Controllable Continuous Gaze Redirection

1 code implementation • 9 Oct 2020 • Weihao Xia, Yujiu Yang, Jing-Hao Xue, Wensen Feng

The encoder maps images into a well-disentangled and hierarchically organized latent space.

Attribute, gaze redirection

