WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct

nlpxucan/wizardlm 18 Aug 2023

Through extensive experiments on two mathematical reasoning benchmarks, namely GSM8k and MATH, we reveal the extraordinary capabilities of our model.

Ranked #49 on Arithmetic Reasoning on GSM8K (using extra training data)

Arithmetic Reasoning GSM8K +2

8,813
0.39 stars / hour

SafeGen: Mitigating Unsafe Content Generation in Text-to-Image Models

letterligo/text-agnostic-governance 10 Apr 2024

The key idea is to eliminate unsafe visual representations from the model regardless of the text input.

85
0.39 stars / hour

Taming Stable Diffusion for Text to 360° Panorama Image Generation

chengzhag/panfusion 11 Apr 2024

Generative models, e. g., Stable Diffusion, have enabled the creation of photorealistic images from text prompts.

Denoising Image Generation

71
0.39 stars / hour

Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences

nianticlabs/mickey 9 Apr 2024

Usually, correspondences are 2D-to-2D and the pose we estimate is defined only up to scale.

277
0.38 stars / hour

Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed

zju3dv/efficientloftr 7 Mar 2024

Furthermore, we find spatial variance exists in LoFTR's fine correlation module, which is adverse to matching accuracy.

3D Reconstruction Image Retrieval

301
0.38 stars / hour

Pre-training Small Base LMs with Fewer Tokens

sanyalsunny111/llm-inheritune 12 Apr 2024

Here we show that smaller LMs trained utilizing some of the layers of GPT2-medium (355M) and GPT-2-large (770M) can effectively match the val loss of their bigger counterparts when trained from scratch for the same number of training steps on OpenWebText dataset with 9B tokens.

Language Modelling

41
0.38 stars / hour

TinyLlama: An Open-Source Small Language Model

Lightning-AI/lit-gpt 4 Jan 2024

We present TinyLlama, a compact 1. 1B language model pretrained on around 1 trillion tokens for approximately 3 epochs.

Computational Efficiency Language Modelling

6,518
0.37 stars / hour

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

showlab/show-1 27 Sep 2023

In this paper, we are the first to propose a hybrid model, dubbed as Show-1, which marries pixel-based and latent-based VDMs for text-to-video generation.

Text-to-Video Generation Video Alignment +1

830
0.37 stars / hour

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

scutzzj/aniportrait 26 Mar 2024

In this study, we propose AniPortrait, a novel framework for generating high-quality animation driven by audio and a reference portrait image.

Face Reenactment

3,513
0.35 stars / hour

WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?

servicenow/browsergym 12 Mar 2024

We study the use of large language model-based agents for interacting with software via web browsers.

Language Modelling Large Language Model

87
0.35 stars / hour