Trending Research

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

apple/corenet • • 22 Apr 2024

To this end, we release OpenELM, a state-of-the-art open language model.

Language Modelling

1,692

17.71 stars / hour

Paper
Code

CyberSecEval 2: A Wide-Ranging Cybersecurity Evaluation Suite for Large Language Models

facebookresearch/purplellama • 19 Apr 2024

We present BenchmarkName, a novel benchmark to quantify LLM security risks and capabilities.

1,817

3.46 stars / hour

Paper
Code

QLoRA: Efficient Finetuning of Quantized LLMs

internlm/xtuner • • NeurIPS 2023

Our best model family, which we name Guanaco, outperforms all previous openly released models on the Vicuna benchmark, reaching 99. 3% of the performance level of ChatGPT while only requiring 24 hours of finetuning on a single GPU.

Chatbot Instruction Following +2

2,092

2.52 stars / hour

Paper
Code

Improving Diffusion Models for Virtual Try-on

yisol/IDM-VTON • • 8 Mar 2024

Finally, we present a customization method using a pair of person-garment images, which significantly improves fidelity and authenticity.

Ranked #1 on Virtual Try-on on VITON-HD

Virtual Try-on

403

2.11 stars / hour

Paper
Code

ID-Animator: Zero-Shot Identity-Preserving Human Video Generation

id-animator/id-animator • 23 Apr 2024

Based on this pipeline, a random face reference training method is further devised to precisely capture the ID-relevant embeddings from reference images, thus improving the fidelity and generalization capacity of our model for ID-specific video generation.

Attribute Video Generation

1.67 stars / hour

Paper
Code

ESRL: Efficient Sampling-based Reinforcement Learning for Sequence Generation

hiyouga/llama-efficient-tuning • • 4 Aug 2023

Applying Reinforcement Learning (RL) to sequence generation models enables the direct optimization of long-term rewards (\textit{e. g.,} BLEU and human feedback), but typically requires large-scale sampling over a space of action sequences.

Abstractive Text Summarization Language Modelling +5

19,592

1.65 stars / hour

Paper
Code

Dynamic Generation of Personalities with Large Language Models

hiyouga/llama-factory • • 10 Apr 2024

We propose a new metric to assess personality generation capability based on this evaluation method.

Personality Generation

19,575

1.64 stars / hour

Paper
Code

AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation

ez-hwh/autocrawler • 19 Apr 2024

We propose AutoCrawler, a two-stage framework that leverages the hierarchical structure of HTML for progressive understanding.

Action Generation

147

1.52 stars / hour

Paper
Code

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

tencentarc/instantmesh • • 10 Apr 2024

We present InstantMesh, a feed-forward framework for instant 3D mesh generation from a single image, featuring state-of-the-art generation quality and significant training scalability.

Image to 3D

1,389

1.32 stars / hour

Paper
Code

SnapKV: LLM Knows What You are Looking for Before Generation

fasterdecoding/snapkv • 22 Apr 2024

Specifically, SnapKV achieves a consistent decoding speed with a 3. 6x increase in generation speed and an 8. 2x enhancement in memory efficiency compared to baseline when processing inputs of 16K tokens.

16k

1.20 stars / hour

Paper
Code