NEFTune: Noisy Embeddings Improve Instruction Finetuning

openaccess-ai-collective/axolotl 9 Oct 2023

We show that language model finetuning can be improved, sometimes dramatically, with a simple augmentation.

Language Modelling

6,342
0.25 stars / hour

DeepSeek-VL: Towards Real-World Vision-Language Understanding

deepseek-ai/deepseek-vl 8 Mar 2024

The DeepSeek-VL family (both 1.3B and 7B models) showcases superior user experiences as a vision-language chatbot in real-world applications, achieving state-of-the-art or competitive performance across a wide range of visual-language benchmarks at the same model size while maintaining robust performance on language-centric benchmarks.

Chatbot, Language Modelling +3

1,752
0.24 stars / hour

Imp: Highly Capable Large Multimodal Models for Mobile Devices

milvlg/imp 20 May 2024

By harnessing the capabilities of large language models (LLMs), recent large multimodal models (LMMs) have shown remarkable versatility in open-world multimodal understanding.

Quantization

141
0.24 stars / hour

Prometheus: Inducing Fine-grained Evaluation Capability in Language Models

stanford-oval/storm 12 Oct 2023

We first construct the Feedback Collection, a new dataset that consists of 1K fine-grained score rubrics, 20K instructions, and 100K responses and language feedback generated by GPT-4.

Language Modelling, Large Language Model

4,482
0.24 stars / hour

AM-RADIO: Agglomerative Vision Foundation Model -- Reduce All Domains Into One

nvlabs/radio 10 Dec 2023

A handful of visual foundation models (VFMs) have recently emerged as the backbones for numerous downstream tasks.

Benchmarking, object-detection +2

426
0.24 stars / hour

VILA: On Pre-training for Visual Language Models

nvlabs/vila 12 Dec 2023

Visual language models (VLMs) rapidly progressed with the recent success of large language models.

In-Context Learning, Language Modelling +2

107
0.24 stars / hour

xFinder: Robust and Pinpoint Answer Extraction for Large Language Models

iaar-shanghai/xfinder 20 May 2024

The continuous advancement of large language models (LLMs) has brought increasing attention to the critical issue of developing fair and reliable methods for evaluating their performance.

40
0.23 stars / hour

Outlier-robust Kalman Filtering through Generalised Bayes

gerdm/weighted-likelihood-filter 9 May 2024

We derive a novel, provably robust, and closed-form Bayesian update rule for online filtering in state-space models in the presence of outliers and misspecified measurement models.

Bayesian Inference, Computational Efficiency +1

48
0.23 stars / hour

MMBench: Is Your Multi-modal Model an All-around Player?

InternLM/opencompass 12 Jul 2023

In response to these challenges, we propose MMBench, a novel multi-modality benchmark.

Visual Question Answering

2,838
0.23 stars / hour

MM-Retinal: Knowledge-Enhanced Foundational Pretraining with Fundus Image-Text Expertise

lxirich/mm-retinal 20 May 2024

Current fundus image analysis models are predominantly built for specific tasks relying on individual datasets.

21
0.22 stars / hour