QLoRA: Efficient Finetuning of Quantized LLMs

artidoro/qlora 23 May 2023

Our best model family, which we name Guanaco, outperforms all previous openly released models on the Vicuna benchmark, reaching 99.3% of the performance level of ChatGPT while only requiring 24 hours of finetuning on a single GPU.

Chatbot, Instruction Following

4,629
6.91 stars / hour

Gorilla: Large Language Model Connected with Massive APIs

ShishirPatil/gorilla 24 May 2023

Large Language Models (LLMs) have seen an impressive wave of advances recently, with models now excelling in a variety of tasks, such as mathematical reasoning and program synthesis.

Language Modelling, Mathematical Reasoning

2,398
5.87 stars / hour

Large Language Models as Tool Makers

ctlllll/llm-toolmaker 26 May 2023

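Our approach consists of two key phases: 1) tool making: an LLM acts as the tool maker that crafts tools for given tasks, where a tool is implemented as a Python utility function; 2) tool using: an LLM acts as the tool user, which applies the tool built by the tool maker for problem-solving.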

286
4.03 stars / hour

Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold

opengvlab/internchat 18 May 2023

Synthesizing visual content that meets users' needs often requires flexible and precise controllability of the pose, shape, expression, and layout of the generated objects.

Image Manipulation

2,020
3.59 stars / hour

VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks

opengvlab/interngpt 18 May 2023

We hope this model can set a new baseline for generalist vision and language models.

Language Modelling

2,020
3.59 stars / hour

OCR-free Document Understanding Transformer

clovaai/donut 30 Nov 2021

Current Visual Document Understanding (VDU) methods outsource the task of reading text to off-the-shelf Optical Character Recognition (OCR) engines and focus on the understanding task with the OCR outputs.

Optical Character Recognition (OCR)

3,301
3.53 stars / hour

Tree of Thoughts: Deliberate Problem Solving with Large Language Models

kyegomez/tree-of-thoughts 17 May 2023

Language models are increasingly being deployed for general problem solving across a wide range of tasks, but are still confined to token-level, left-to-right decision-making processes during inference.

Decision Making, Language Modelling

2,225
3.03 stars / hour

Generating Sequences With Recurrent Neural Networks

sjvasquez/handwriting-synthesis 4 Aug 2013

This paper shows how Long Short-term Memory recurrent neural networks can be used to generate complex sequences with long-range structure, simply by predicting one data point at a time.

Language Modelling, Text Generation

3,238
2.79 stars / hour

ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation

thu-ml/prolificdreamer 25 May 2023

In this work, we propose to model the 3D parameter as a random variable instead of a constant as in SDS and present variational score distillation (VSD), a principled particle-based variational framework to explain and address the aforementioned issues in text-to-3D generation.

Text to 3D

577
2.65 stars / hour

Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training

Liuhong99/Sophia 23 May 2023

Given the massive cost of language model pre-training, a non-trivial improvement of the optimization algorithm would lead to a material reduction in the time and cost of training.

Language Modelling, Stochastic Optimization

501
2.50 stars / hour