Search Results for author: Youjia Zhang

Found 8 papers, 4 papers with code

AMD: Anatomical Motion Diffusion with Interpretable Motion Decomposition and Fusion

no code implementations • 20 Dec 2023 • Beibei Jing, Youjia Zhang, Zikai Song, Junqing Yu, Wei Yang

Generating realistic human motion sequences from text descriptions is a challenging task that requires capturing the rich expressiveness of both natural language and human motion. Recent advances in diffusion models have enabled significant progress in human motion synthesis. However, existing methods struggle to handle text inputs that describe complex or long motions. In this paper, we propose the Adaptable Motion Diffusion (AMD) model, which leverages a Large Language Model (LLM) to parse the input text into a sequence of concise and interpretable anatomical scripts that correspond to the target motion. This process exploits the LLM's ability to provide anatomical guidance for complex motion synthesis. We then devise a two-branch fusion scheme that balances the influence of the input text and the anatomical scripts on the inverse diffusion process, which adaptively ensures the semantic fidelity and diversity of the synthesized motion. Our method can effectively handle texts with complex or long motion descriptions, where existing methods often fail.

Language Modelling, Large Language Model

Optimized View and Geometry Distillation from Multi-view Diffuser

no code implementations • 11 Dec 2023 • Youjia Zhang, Zikai Song, Junqing Yu, Yawei Luo, Wei Yang

We leverage the rendered views from the optimized radiance field as the basis and develop a two-step specialization process of a 2D diffusion model, which is adept at conducting object-specific denoising and generating high-quality multi-view images.

Denoising

Fine-grained Appearance Transfer with Diffusion Models

1 code implementation • 27 Nov 2023 • Yuteng Ye, Guanwen Li, Hang Zhou, Cai Jiale, Junqing Yu, Yawei Luo, Zikai Song, Qilong Xing, Youjia Zhang, Wei Yang

A pivotal aspect of our approach is the strategic use of the predicted $x_0$ space by diffusion models within the latent space of diffusion processes.

Image-to-Image Translation

Progressive Text-to-Image Diffusion with Soft Latent Direction

1 code implementation • 18 Sep 2023 • Yuteng Ye, Jiale Cai, Hang Zhou, Guanwen Li, Youjia Zhang, Zikai Song, Chenxing Gao, Junqing Yu, Wei Yang

In spite of the rapidly evolving landscape of text-to-image generation, the synthesis and manipulation of multiple entities while adhering to specific relational constraints pose enduring challenges.

Language Modelling, Large Language Model, +1

NeMF: Inverse Volume Rendering with Neural Microflake Field

no code implementations • ICCV 2023 • Youjia Zhang, Teng Xu, Junqing Yu, Yuteng Ye, Junle Wang, Yanqing Jing, Jingyi Yu, Wei Yang

Recovering the physical attributes of an object's appearance from its images captured under an unknown illumination is challenging yet essential for photo-realistic rendering.

Dynamic Feature Pruning and Consolidation for Occluded Person Re-Identification

1 code implementation • 27 Nov 2022 • Yuteng Ye, Hang Zhou, Jiale Cai, Chenxing Gao, Youjia Zhang, Junle Wang, Qiang Hu, Junqing Yu, Wei Yang

The framework mainly consists of a sparse encoder, a multi-view feature matching module, and a feature consolidation decoder.

Person Re-Identification

Spatio-channel Attention Blocks for Cross-modal Crowd Counting

1 code implementation • 19 Oct 2022 • Youjia Zhang, Soyun Choi, Sungeun Hong

Crowd counting research has made significant advancements in real-world applications, but it remains a formidable challenge in cross-modal settings.

Crowd Counting
