Trending Research

LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression

microsoft/LLMLingua • 19 Mar 2024

The challenge is that information entropy may be a suboptimal compression metric: (i) it only leverages unidirectional context and may fail to capture all essential information needed for prompt compression; (ii) it is not aligned with the prompt compression objective.

GSM8K Language Modelling +3

3,818

0.67 stars / hour

Paper
Code

Graphic Design with Large Multimodal Model

graphic-design-ai/graphist • 22 Apr 2024

One existing practice is Graphic Layout Generation (GLG), which aims to lay out sequential design elements.

0.66 stars / hour

Paper
Code

Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations

facebookresearch/generative-recommenders • 27 Feb 2024

Large-scale recommendation systems are characterized by their reliance on high cardinality, heterogeneous features and the need to handle tens of billions of user actions on a daily basis.

Ranked #1 on Recommendation Systems on MovieLens 20M (HR@10 (full corpus) metric)

Recommendation Systems

200

0.65 stars / hour

Paper
Code

Unrecognizable Yet Identifiable: Image Distortion with Preserved Embeddings

Recognito-Vision/Face-SDK-Android-Demo • 26 Jan 2024

In the realm of security applications, biometric authentication systems play a crucial role, yet one often encounters challenges concerning privacy and security while developing one.

Face Recognition Security Studies

208

0.63 stars / hour

Paper
Code

How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study

macaronlin/llama3-quantization • 22 Apr 2024

This exploration holds the potential to unveil new insights and challenges for low-bit quantization of LLaMA3 and other forthcoming LLMs, especially in addressing the performance degradation problems that arise in LLM compression.

Language Modelling Large Language Model +1

0.62 stars / hour

Paper
Code

GhostFaceNets: Lightweight Face Recognition Model From Cheap Operations

Recognito-Vision/Android-FaceRecognition-FaceLivenessDetection • IEEE Access 2023

The development of deep learning-based biometric models that can be deployed on devices with constrained memory and computational resources has proven to be a significant challenge.

Ranked #1 on Face Recognition on CFP-FF

Face Identification Face Verification +1

208

0.62 stars / hour

Paper
Code

Prompts As Programs: A Structure-Aware Approach to Efficient Compile-Time Prompt Optimization

microsoft/sammo • 2 Apr 2024

We show that SAMMO generalizes previous methods and improves the performance of complex prompts on (1) instruction tuning, (2) RAG pipeline tuning, and (3) prompt compression, across several different LLMs.

135

0.62 stars / hour

Paper
Code

SpaceByte: Towards Deleting Tokenization from Large Language Modeling

kjslag/spacebyte • 22 Apr 2024

Tokenization is widely used in large language models because it significantly improves performance.

Language Modelling

0.61 stars / hour

Paper
Code

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

fudan-generative-vision/champ • 21 Mar 2024

In this study, we introduce a methodology for human image animation by leveraging a 3D human parametric model within a latent diffusion framework to enhance shape alignment and motion guidance in current human generative techniques.

Animated GIF Generation Image Animation +1

3,091

0.61 stars / hour

Paper
Code

Arc2Face: A Foundation Model of Human Faces

Recognito-Vision/NIST-FRVT-Top-1-Face-Recognition • 18 Mar 2024

This paper presents Arc2Face, an identity-conditioned face foundation model, which, given the ArcFace embedding of a person, can generate diverse photo-realistic images with a higher degree of face similarity than existing models.

Ranked #1 on Diffusion Personalization Tuning Free on AgeDB

Diffusion Personalization Tuning Free Face Generation +1

213

0.61 stars / hour

Paper
Code