Magic Clothing: Controllable Garment-Driven Image Synthesis

shinechen1024/magicclothing 15 Apr 2024

We propose Magic Clothing, a latent diffusion model (LDM)-based network architecture for an unexplored garment-driven image synthesis task.

Image Generation

989
0.33 stars / hour

A Light CNN for Deep Face Representation with Noisy Labels

AlfredXiangWu/LightCNN 9 Nov 2015

This paper presents a Light CNN framework to learn a compact embedding on the large-scale face data with massive noisy labels.

Face Identification Face Recognition +2

998
0.33 stars / hour

Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations

Xiaoyao-Li/Ag2Manip 26 Apr 2024

Autonomous robotic systems capable of learning novel manipulation tasks are poised to transform industries from manufacturing to service automation.

Imitation Learning

19
0.33 stars / hour

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

scutzzj/aniportrait 26 Mar 2024

In this study, we propose AniPortrait, a novel framework for generating high-quality animation driven by audio and a reference portrait image.

Face Reenactment

3,726
0.32 stars / hour

OpenAgents: An Open Platform for Language Agents in the Wild

xlang-ai/xlang 16 Oct 2023

Language agents show potential in being capable of utilizing natural language for varied and intricate tasks in diverse environments, particularly when built upon large language models (LLMs).

2D Object Detection

3,512
0.31 stars / hour

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation

instantstyle/instantstyle 3 Apr 2024

Tuning-free diffusion-based models have demonstrated significant potential in the realm of image personalization and customization.

Text-to-Image Generation

1,188
0.31 stars / hour

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

fudan-generative-vision/champ 21 Mar 2024

In this study, we introduce a methodology for human image animation by leveraging a 3D human parametric model within a latent diffusion framework to enhance shape alignment and motion guidance in current human generative techniques.

Animated GIF Generation Image Animation +1

3,151
0.30 stars / hour

NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving

atonderski/neuro-ncap 11 Apr 2024

We present a versatile NeRF-based simulator for testing autonomous driving (AD) software systems, designed with a focus on sensor-realistic closed-loop evaluation and the creation of safety-critical scenarios.

Autonomous Driving

77
0.27 stars / hour

ViTamin: Designing Scalable Vision Models in the Vision-Language Era

beckschen/vitamin 2 Apr 2024

To this end, we introduce ViTamin, a new vision model tailored for VLMs.

102
0.27 stars / hour

SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation

ailab-cvc/seed-x 22 Apr 2024

We hope that our work will inspire future research into what can be achieved by versatile multimodal foundation models in real-world applications.

Image Generation

132
0.25 stars / hour