Search Results for author: Jingyang Huo

Found 5 papers, 1 paper with code

NeuroPictor: Refining fMRI-to-Image Reconstruction via Multi-individual Pretraining and Multi-level Modulation

no code implementations • 27 Mar 2024 • Jingyang Huo, Yikai Wang, Xuelin Qian, Yun Wang, Chong Li, Jianfeng Feng, Yanwei Fu

Recent fMRI-to-image approaches mainly focused on associating fMRI signals with specific conditions of pre-trained diffusion models.

Image Reconstruction

Intelligent Director: An Automatic Framework for Dynamic Visual Composition using ChatGPT

no code implementations • 24 Feb 2024 • Sixiao Zheng, Jingyang Huo, Yu Wang, Yanwei Fu

We propose an Intelligent Director framework, utilizing LENS to generate descriptions for images and video frames and combining ChatGPT to generate coherent captions while recommending appropriate music names.

Retrieval, Style Transfer

fMRI-PTE: A Large-scale fMRI Pretrained Transformer Encoder for Multi-Subject Brain Activity Decoding

no code implementations • 1 Nov 2023 • Xuelin Qian, Yun Wang, Jingyang Huo, Jianfeng Feng, Yanwei Fu

The exploration of brain activity and its decoding from fMRI data has been a longstanding pursuit, driven by its potential applications in brain-computer interfaces, medical diagnostics, and virtual reality.

Pushing the Limits of 3D Shape Generation at Scale

no code implementations • 20 Jun 2023 • Yu Wang, Xuelin Qian, Jingyang Huo, Tiejun Huang, Bo Zhao, Yanwei Fu

Through the adaptation of the Auto-Regressive model and the utilization of large language models, we have developed a remarkable model with an astounding 3.6 billion trainable parameters, establishing it as the largest 3D shape generation model to date, named Argus-3D.

3D Generation, 3D Shape Generation (+2 more)

GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation

1 code implementation • CVPR 2023 • Jingyang Huo, Qiang Sun, Boyan Jiang, Haitao Lin, Yanwei Fu

Technically, we introduce a two-stage module that combines local slot attention and the CLIP model to produce a geometry-enhanced representation from such input.

Vision and Language Navigation
