OSDFace: One-Step Diffusion Model for Face Restoration

jkwang28/osdface 26 Nov 2024

Moreover, existing methods often struggle to generate face images that are harmonious, realistic, and consistent with the subject's identity.

Face Recognition Generative Adversarial Network

98
0.70 stars / hour

The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use

showlab/computer_use_ootb 15 Nov 2024

The recently released model, Claude 3. 5 Computer Use, stands out as the first frontier AI model to offer computer use in public beta as a graphical user interface (GUI) agent.

942
0.59 stars / hour

Comprehensive Competition Mechanism in Palmprint Recognition

Faceplugin-ltd/Palm-Recognition IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY 2023

The traditional competition mechanism focuses solely on selecting the winner of different channels without considering the spatial information of the features.

84
0.59 stars / hour

EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

antgroup/echomimic_v2 15 Nov 2024

Recent work on human animation usually involves audio, pose, or movement maps conditions, thereby achieves vivid animation quality.

Audio-Driven Body Animation Human Animation +1

1,667
0.57 stars / hour
55
0.56 stars / hour

The Well: a Large-Scale Collection of Diverse Physics Simulations for Machine Learning

polymathicai/the_well 30 Nov 2024

Machine learning based surrogate models offer researchers powerful tools for accelerating simulation-based workflows.

426
0.55 stars / hour

MARS: Unleashing the Power of Variance Reduction for Training Large Models

AGI-Arena/MARS 15 Nov 2024

Despite the development of numerous variance reduction algorithms in the past decade aimed at accelerating stochastic optimization in both convex and nonconvex settings, variance reduction has not found widespread success in training deep neural networks or large language models.

Stochastic Optimization

392
0.55 stars / hour

ReFT: Representation Finetuning for Language Models

lqtrung1998/mwp_reft 4 Apr 2024

We define a strong instance of the ReFT family, Low-rank Linear Subspace ReFT (LoReFT), and we identify an ablation of this method that trades some performance for increased efficiency.

Arithmetic Reasoning

239
0.54 stars / hour

Open-Sora Plan: Open-Source Large Video Generation Model

PKU-YuanGroup/ConsisID 28 Nov 2024

We introduce Open-Sora Plan, an open-source project that aims to contribute a large generation model for generating desired high-resolution videos with long durations based on various user inputs.

Video Generation

454
0.53 stars / hour

Scaling Transformers for Low-Bitrate High-Quality Speech Coding

Stability-AI/stable-codec 29 Nov 2024

The tokenization of speech with neural audio codec models is a vital part of modern AI pipelines for the generation or understanding of speech, alone or in a multimodal context.

Quantization

160
0.51 stars / hour