Text-to-image synthesis has recently seen significant progress thanks to large pretrained language models, large-scale training data, and the introduction of scalable model families such as diffusion and autoregressive models.
Ranked #9 on Text-to-Image Generation on COCO
Current Visual Document Understanding (VDU) methods outsource the task of reading text to off-the-shelf Optical Character Recognition (OCR) engines and focus on the understanding task, which operates on the OCR outputs.
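A minimal sketch of the conventional two-stage pipeline this sentence describes, assuming pytesseract as a stand-in off-the-shelf OCR engine; the function names and the trivial "understanding" step are illustrative placeholders, not any specific VDU system.

```python
# Stage 1: outsource reading to an off-the-shelf OCR engine (pytesseract here).
# Stage 2: run the understanding task on the OCR output. The keyword lookup
# below is a toy stand-in for a trained downstream VDU model.
import pytesseract
from PIL import Image

def read_with_ocr(image_path: str) -> str:
    """Extract raw text from a document image via the OCR engine."""
    return pytesseract.image_to_string(Image.open(image_path))

def understand(ocr_text: str) -> dict:
    """Placeholder understanding step operating only on OCR outputs."""
    return {"contains_total": "total" in ocr_text.lower()}

if __name__ == "__main__":
    text = read_with_ocr("receipt.png")  # hypothetical input document
    print(understand(text))
```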
Pre-trained language models have attracted increasing attention in the biomedical domain, inspired by their great success in the general natural language domain.
Ranked #1 on Question Answering on PubMedQA
In this report, we present a fast and accurate object detection method dubbed DAMO-YOLO, which achieves higher performance than the state-of-the-art YOLO series.
Ranked #12 on Real-Time Object Detection on COCO
In this paper, we cast blind face restoration as a code prediction task and demonstrate that a learned discrete codebook prior in a small proxy space largely reduces the uncertainty and ambiguity of the restoration mapping, while providing rich visual atoms for generating high-quality faces.
Ranked #1 on Blind Face Restoration on WIDER
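A toy illustration of the discrete-codebook idea: continuous features are mapped to the nearest entry of a small learned codebook, so restoration reduces to predicting code indices rather than regressing pixels. All shapes, names, and the nearest-neighbor lookup are assumptions for illustration, not the paper's actual implementation.

```python
# Quantize encoder features against a learned codebook: each feature vector
# is replaced by its nearest codebook atom, and the atom's index is the
# "code" to be predicted. Sizes below are assumed toy values.
import torch

def quantize(features: torch.Tensor, codebook: torch.Tensor):
    """features: (N, D) encoder outputs; codebook: (K, D) learned atoms.
    Returns predicted code indices and the quantized (looked-up) features."""
    dists = torch.cdist(features, codebook)  # (N, K) pairwise distances
    indices = dists.argmin(dim=1)            # code prediction per feature
    return indices, codebook[indices]        # rich visual atoms looked up

codebook = torch.randn(1024, 256)            # K=1024 atoms of dim 256 (assumed)
feats = torch.randn(16, 256)                 # degraded-face features (toy)
idx, quantized = quantize(feats, codebook)
print(idx.shape, quantized.shape)            # torch.Size([16]) torch.Size([16, 256])
```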
First, we use synthetic language modeling tasks to understand the gap between state space models (SSMs) and attention.
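One synthetic probe commonly used for this kind of comparison is associative recall; the sketch below generates such a task under assumed conventions (vocabulary size, key-value pairing, query format), which may differ from the paper's exact construction.

```python
# Toy associative-recall generator: emit key-value pairs, then query one key;
# a sequence model is scored on recalling the matching value.
import random

def associative_recall(num_pairs: int = 8, vocab: int = 26):
    """Return (input token sequence, target token) for one task instance."""
    keys = random.sample(range(vocab), num_pairs)
    values = [random.randrange(vocab) for _ in keys]
    sequence = [tok for kv in zip(keys, values) for tok in kv]
    query = random.choice(keys)
    target = values[keys.index(query)]
    return sequence + [query], target

seq, tgt = associative_recall()
print(seq, "->", tgt)
```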
Attention-based models trained on protein sequences have demonstrated remarkable success at classification and generation tasks relevant to artificial-intelligence-driven protein design.
With this multi-dimensional, multi-scale factorization, our MorphMLP block achieves a strong accuracy-computation trade-off.
Ranked #18 on Action Recognition on Something-Something V2 (using extra training data)
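A toy sketch of the factorization idea referenced above: rather than mixing all tokens jointly, a small linear layer mixes tokens along one axis at a time (height, then width, then channels). Layer sizes and the axis ordering are assumptions; this illustrates per-dimension factorized mixing only, not the MorphMLP block itself.

```python
# Factorized token mixing: apply an independent linear mix along each axis
# of a (B, H, W, C) feature map instead of one joint mix over all tokens.
import torch
import torch.nn as nn

class FactorizedMix(nn.Module):
    def __init__(self, h: int, w: int, dim: int):
        super().__init__()
        self.mix_h = nn.Linear(h, h)      # mix tokens along the height axis
        self.mix_w = nn.Linear(w, w)      # mix tokens along the width axis
        self.mix_c = nn.Linear(dim, dim)  # mix along the channel axis

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (B, H, W, C); permute so the target axis is last for nn.Linear
        x = self.mix_h(x.permute(0, 3, 2, 1)).permute(0, 3, 2, 1)  # over H
        x = self.mix_w(x.permute(0, 1, 3, 2)).permute(0, 1, 3, 2)  # over W
        return self.mix_c(x)                                        # over C

x = torch.randn(2, 14, 14, 64)             # (batch, H, W, channels), toy sizes
print(FactorizedMix(14, 14, 64)(x).shape)  # torch.Size([2, 14, 14, 64])
```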