We show that language model finetuning can be improved, sometimes dramatically, with a simple augmentation.
The DeepSeek-VL family (both 1.3B and 7B models) showcases superior user experiences as a vision-language chatbot in real-world applications, achieving state-of-the-art or competitive performance across a wide range of visual-language benchmarks at the same model size while maintaining robust performance on language-centric benchmarks.
By harnessing the capabilities of large language models (LLMs), recent large multimodal models (LMMs) have shown remarkable versatility in open-world multimodal understanding.
We first construct the Feedback Collection, a new dataset consisting of 1K fine-grained score rubrics, 20K instructions, and 100K responses and accompanying language feedback, both generated by GPT-4.
A handful of visual foundation models (VFMs) have recently emerged as the backbones for numerous downstream tasks.
Visual language models (VLMs) have progressed rapidly with the recent success of large language models.
The continuous advancement of large language models (LLMs) has brought increasing attention to the critical issue of developing fair and reliable methods for evaluating their performance.
We derive a novel, provably robust, and closed-form Bayesian update rule for online filtering in state-space models in the presence of outliers and misspecified measurement models.
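The abstract does not spell out the update rule here, so for readers unfamiliar with the setting, the sketch below shows a conventional Kalman measurement update augmented with a Huber-style downweighting of outlying observations. This is an illustrative assumption about the general technique, not the paper's derived rule, and the function name `robust_kalman_update` and its parameters are hypothetical.

```python
# Minimal sketch of outlier-robust online filtering in a linear-Gaussian
# state-space model. NOT the paper's closed-form rule; it only illustrates
# the setting: downweight observations with large innovations.
import numpy as np

def robust_kalman_update(m, P, y, H, R, c=2.0):
    """One robustified measurement update.

    m, P : prior state mean (n,) and covariance (n, n)
    y    : observation (d,)
    H, R : observation matrix (d, n) and noise covariance (d, d)
    c    : threshold controlling how aggressively outliers are downweighted
    """
    S = H @ P @ H.T + R                        # predictive covariance of y
    r = y - H @ m                              # innovation (residual)
    d2 = float(r @ np.linalg.solve(S, r))      # squared Mahalanobis distance
    w = min(1.0, c / np.sqrt(d2)) if d2 > 0 else 1.0  # Huber-style weight

    # Downweighting the observation is equivalent to inflating its noise.
    S_w = H @ P @ H.T + R / w
    K = P @ H.T @ np.linalg.inv(S_w)           # gain under inflated noise
    return m + K @ r, P - K @ H @ P            # posterior mean, covariance
```

With `w = 1` this reduces to the standard Kalman update; as the innovation grows, `w` shrinks and the observation contributes less to the posterior.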
In response to these challenges, we propose MMBench, a novel multi-modality benchmark.
Current fundus image analysis models are predominantly built for specific tasks and rely on individual datasets.