Trending Research

Magic Clothing: Controllable Garment-Driven Image Synthesis

shinechen1024/magicclothing • • 15 Apr 2024

We propose Magic Clothing, a latent diffusion model (LDM)-based network architecture for an unexplored garment-driven image synthesis task.

Image Generation

998

0.33 stars / hour

Paper
Code

A Light CNN for Deep Face Representation with Noisy Labels

AlfredXiangWu/LightCNN • • 9 Nov 2015

This paper presents a Light CNN framework to learn a compact embedding on the large-scale face data with massive noisy labels.

Ranked #2 on Age-Invariant Face Recognition on CAFR

Face Identification Face Recognition +2

998

0.33 stars / hour

Paper
Code

Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations

Xiaoyao-Li/Ag2Manip • • 26 Apr 2024

Autonomous robotic systems capable of learning novel manipulation tasks are poised to transform industries from manufacturing to service automation.

Imitation Learning

0.33 stars / hour

Paper
Code

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

scutzzj/aniportrait • • 26 Mar 2024

In this study, we propose AniPortrait, a novel framework for generating high-quality animation driven by audio and a reference portrait image.

Face Reenactment

3,738

0.32 stars / hour

Paper
Code

OpenAgents: An Open Platform for Language Agents in the Wild

xlang-ai/xlang • 16 Oct 2023

Language agents show potential in being capable of utilizing natural language for varied and intricate tasks in diverse environments, particularly when built upon large language models (LLMs).

2D Object Detection

3,546

0.31 stars / hour

Paper
Code

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation

instantstyle/instantstyle • • 3 Apr 2024

Tuning-free diffusion-based models have demonstrated significant potential in the realm of image personalization and customization.

Text-to-Image Generation

1,200

0.31 stars / hour

Paper
Code

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

fudan-generative-vision/champ • • 21 Mar 2024

In this study, we introduce a methodology for human image animation by leveraging a 3D human parametric model within a latent diffusion framework to enhance shape alignment and motion guidance in curernt human generative techniques.

Animated GIF Generation Image Animation +1

3,163

0.30 stars / hour

Paper
Code

NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving

atonderski/neuro-ncap • 11 Apr 2024

We present a versatile NeRF-based simulator for testing autonomous driving (AD) software systems, designed with a focus on sensor-realistic closed-loop evaluation and the creation of safety-critical scenarios.

Autonomous Driving