Through extensive experiments on two mathematical reasoning benchmarks, GSM8K and MATH, we demonstrate the strong capabilities of our model.
The key idea is to eliminate unsafe visual representations from the model regardless of the text input.
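One generic way to remove a concept from a model's representations, regardless of the text input, is to project each visual embedding onto the subspace orthogonal to an "unsafe" direction. This is only an illustrative concept-erasure sketch under that assumption, not the paper's actual method; the direction `unsafe_dir` here is randomly generated for demonstration.

```python
import numpy as np

def remove_direction(h, u):
    # Project representation h onto the subspace orthogonal to direction u,
    # deleting the component associated with the unwanted concept.
    # Generic concept-erasure sketch; not the paper's actual procedure.
    u = u / np.linalg.norm(u)
    return h - (h @ u) * u

rng = np.random.default_rng(1)
unsafe_dir = rng.normal(size=8)   # hypothetical "unsafe" concept direction
h = rng.normal(size=8)            # hypothetical visual embedding

h_safe = remove_direction(h, unsafe_dir)
print(abs(h_safe @ unsafe_dir))   # ~0: no remaining component along unsafe_dir
```

After the projection, the edited embedding carries no component along the removed direction, independent of whatever prompt produced it.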
Generative models, e.g., Stable Diffusion, have enabled the creation of photorealistic images from text prompts.
Usually, correspondences are 2D-to-2D and the pose we estimate is defined only up to scale.
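The scale ambiguity follows from the epipolar constraint: 2D-to-2D correspondences only constrain x2ᵀ E x1 = 0 with E = [t]ₓ R, and scaling the translation t leaves E's null space unchanged. A minimal numerical sketch (with an assumed rotation, translation, and 3D point chosen for illustration):

```python
import numpy as np

def skew(t):
    # Cross-product (skew-symmetric) matrix: skew(t) @ v == np.cross(t, v).
    return np.array([[0.0, -t[2], t[1]],
                     [t[2], 0.0, -t[0]],
                     [-t[1], t[0], 0.0]])

# Hypothetical relative pose: rotation about z, plus a translation.
theta = 0.3
R = np.array([[np.cos(theta), -np.sin(theta), 0.0],
              [np.sin(theta),  np.cos(theta), 0.0],
              [0.0, 0.0, 1.0]])
t = np.array([1.0, 0.2, 0.1])

# One 3D point observed by both cameras (normalized image coordinates).
X = np.array([0.5, -0.3, 4.0])
x1 = X / X[2]                      # projection in camera 1
X2 = R @ X + t
x2 = X2 / X2[2]                    # projection in camera 2

E = skew(t) @ R                    # essential matrix
print(abs(x2 @ E @ x1))            # ~0: epipolar constraint satisfied

# Scaling t by any positive factor yields the same constraint, so the
# correspondences determine translation only up to scale.
E_scaled = skew(3.7 * t) @ R
print(abs(x2 @ E_scaled @ x1))     # still ~0
```

Both essential matrices explain the correspondence equally well, which is why metric scale cannot be recovered from 2D-to-2D matches alone.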
Furthermore, we find that spatial variance exists in LoFTR's fine correlation module, which harms matching accuracy.
Here we show that smaller LMs initialized from a subset of the layers of GPT-2-medium (355M) and GPT-2-large (770M) can match the validation loss of their larger counterparts trained from scratch for the same number of training steps on the OpenWebText dataset (9B tokens).
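One simple way to pick which teacher layers seed the smaller model is to take evenly spaced blocks from the pretrained stack. The uniform-spacing rule below is an assumption for illustration, not necessarily the selection scheme used in the work:

```python
def select_layer_indices(n_teacher_layers, n_student_layers):
    # Choose n_student_layers evenly spaced indices from the teacher's stack,
    # always including the first and last block. Uniform spacing is an
    # illustrative assumption, not the paper's confirmed rule.
    if n_student_layers == 1:
        return [0]
    step = (n_teacher_layers - 1) / (n_student_layers - 1)
    return [round(i * step) for i in range(n_student_layers)]

# GPT-2-medium has 24 transformer blocks; a hypothetical 12-layer student:
print(select_layer_indices(24, 12))
```

The selected blocks' weights would then be copied into the student before continuing training on the same token budget as the from-scratch baseline.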
We present TinyLlama, a compact 1.1B language model pretrained on around 1 trillion tokens for approximately 3 epochs.
In this paper, we propose Show-1, to our knowledge the first hybrid model that marries pixel-based and latent-based VDMs for text-to-video generation.
In this study, we propose AniPortrait, a novel framework for generating high-quality animation driven by audio and a reference portrait image.
We study the use of large language model-based agents for interacting with software via web browsers.