Trending Research

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

Leeroo-AI/mergoo • 12 Mar 2024

We investigate efficient methods for training Large Language Models (LLMs) to possess capabilities in multiple specialized domains, such as coding, math reasoning and world knowledge.

Ranked #30 on Question Answering on TriviaQA

Arithmetic Reasoning, Code Generation, +6

0.65 stars / hour

Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

google-deepmind/recurrentgemma • 29 Feb 2024

Recurrent neural networks (RNNs) have fast inference and scale efficiently on long sequences, but they are difficult to train and hard to scale.

Language Modelling

435

0.59 stars / hour

Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models

haozheliu-st/t-gate • 3 Apr 2024

This study explores the role of cross-attention during inference in text-conditional diffusion models.

189

0.57 stars / hour

Retrieval-Augmented Generation for AI-Generated Content: A Survey

hymie122/rag-survey • 29 Feb 2024

We first classify RAG foundations according to how the retriever augments the generator, distilling the fundamental abstractions of the augmentation methodologies for various retrievers and generators.

Information Retrieval, Large Language Model, +2

557

0.55 stars / hour

AIOS: LLM Agent Operating System

agiresearch/aios • 25 Mar 2024

Inspired by these challenges, this paper presents AIOS, an LLM agent operating system, which embeds a large language model into the operating system (OS) as the brain of the OS, enabling an operating system "with soul" -- an important step towards AGI.

Language Modelling, Large Language Model, +1

2,340

0.52 stars / hour

SafeGen: Mitigating Unsafe Content Generation in Text-to-Image Models

letterligo/text-agnostic-governance • 10 Apr 2024

The key idea is to eliminate unsafe visual representations from the model regardless of the text input.

0.51 stars / hour

Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences

nianticlabs/mickey • 9 Apr 2024

Usually, correspondences are 2D-to-2D and the pose we estimate is defined only up to scale.

236

0.50 stars / hour

PCToolkit: A Unified Plug-and-Play Prompt Compression Toolkit of Large Language Models

3DAgentWorld/Toolkit-for-Prompt-Compression • 26 Mar 2024

Prompt compression is an innovative method for efficiently condensing input prompts while preserving essential information.

Code Completion, Few-Shot Learning, +2

104

0.49 stars / hour

SchurVINS: Schur Complement-Based Lightweight Visual Inertial Navigation System

bytedance/schurvins • 4 Dec 2023

To this end, we propose a novel filter-based VINS framework named SchurVINS, which could guarantee both high accuracy by building a complete residual model and low computational complexity with Schur complement.

Computational Efficiency

190

0.49 stars / hour

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

fudan-generative-vision/champ • 21 Mar 2024

In this study, we introduce a methodology for human image animation by leveraging a 3D human parametric model within a latent diffusion framework to enhance shape alignment and motion guidance in current human generative techniques.

Animated GIF Generation, Image Animation, +1

2,545

0.49 stars / hour
