Trending Research

Language Model Crossover: Variation through Few-Shot Prompting

carperai/openelm • 23 Feb 2023

The promise of such language model crossover (which is simple to implement and can leverage many different open-source language models) is that it enables a simple mechanism to evolve semantically-rich text representations (with few domain-specific tweaks), and naturally benefits from current progress in language models.

In-Context Learning Language Modelling

540

0.46 stars / hour

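
As a rough illustration of the title above, the following sketch assumes that crossover means listing a few parent texts as few-shot examples and letting a language model continue the pattern to produce a child. The function names, the prompt layout, and the `toy_complete` stand-in for a model are all hypothetical; this is not the carperai/openelm API.

```python
from typing import Callable, List


def build_crossover_prompt(parents: List[str]) -> str:
    """Format the parents as numbered examples and leave one open slot for the child."""
    lines = [f"Example {i + 1}: {p}" for i, p in enumerate(parents)]
    lines.append(f"Example {len(parents) + 1}:")
    return "\n".join(lines)


def crossover(parents: List[str], complete: Callable[[str], str]) -> str:
    """Ask the model to continue the prompt and keep the first line as the child."""
    continuation = complete(build_crossover_prompt(parents)).strip()
    return continuation.splitlines()[0] if continuation else ""


def toy_complete(prompt: str) -> str:
    """Hypothetical placeholder for a real language model call."""
    return " a child sentence\nExample 5: ignored"


if __name__ == "__main__":
    print(build_crossover_prompt(["first parent", "second parent"]))
    print(crossover(["first parent", "second parent"], toy_complete))
```

Any text-completion function can be passed as `complete`, which is what would let the same loop run on different open-source models.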

Moving Object Segmentation: All You Need Is SAM (and Flow)

Jyxarthur/flowsam • 18 Apr 2024

The objective of this paper is motion segmentation -- discovering and segmenting the moving objects in a video.

Motion Segmentation Object +6

165

0.41 stars / hour


Asynchronous Methods for Deep Reinforcement Learning

uvipen/Super-mario-bros-A3C-pytorch • 4 Feb 2016

We propose a conceptually simple and lightweight framework for deep reinforcement learning that uses asynchronous gradient descent for optimization of deep neural network controllers.

Ranked #9 on Atari Games on Atari 2600 Star Gunner

Atari Games reinforcement-learning +1

1,006

0.39 stars / hour

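
To make "asynchronous gradient descent" concrete, here is a toy sketch in which several worker threads update one shared parameter without waiting for each other. It minimizes a simple quadratic rather than training a neural network controller, and the function name, learning rate, and worker count are illustrative assumptions, not taken from the paper or the linked repository.

```python
import threading


def run_async_sgd(num_workers: int = 4, steps: int = 200,
                  lr: float = 0.05, optimum: float = 3.0) -> float:
    """Minimize (w - optimum)^2 with lock-free updates from several threads."""
    shared = {"w": 0.0}  # parameter shared by all workers

    def worker() -> None:
        for _ in range(steps):
            w_local = shared["w"]             # may already be stale when used
            grad = 2.0 * (w_local - optimum)  # gradient of (w - optimum)^2
            shared["w"] -= lr * grad          # applied without synchronizing

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return shared["w"]


if __name__ == "__main__":
    print(run_async_sgd())
```

Because no lock is taken, some updates may use stale values or overwrite each other; on this toy problem the parameter still settles close to the optimum.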

STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases

snap-stanford/stark • 19 Apr 2024

Answering real-world user queries, such as product search, often requires accurate retrieval of information from semi-structured knowledge bases or databases that involve a blend of unstructured (e.g., textual descriptions of products) and structured (e.g., entity relations of products) information.

Benchmarking Retrieval

0.39 stars / hour


GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting

ku-cvlab/gaussiantalker • 24 Apr 2024

A key insight is to encode the 3D Gaussian attributes into a shared implicit feature representation, where it is merged with audio features to manipulate each Gaussian attribute.

Attribute

0.38 stars / hour


PLLaVA: Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning

magic-research/PLLaVA • arXiv 2024

PLLaVA achieves new state-of-the-art performance on modern benchmark datasets for both video question-answer and captioning tasks.

Ranked #1 on Zero-Shot Video Question Answer on TGIF-QA

Dense Captioning Video-based Generative Performance Benchmarking (Consistency) +6

0.38 stars / hour


PromptBench: A Unified Library for Evaluation of Large Language Models

microsoft/promptbench • 13 Dec 2023

The evaluation of large language models (LLMs) is crucial to assess their performance and mitigate potential security risks.

Prompt Engineering

2,047

0.37 stars / hour


TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

Infini-AI-Lab/TriForce • 18 Apr 2024

However, key-value (KV) cache, which is stored to avoid re-computation, has emerged as a critical bottleneck by growing linearly in size with the sequence length.

101

0.37 stars / hour

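
The linear growth of the KV cache mentioned above can be checked with simple arithmetic. The sketch below assumes a standard transformer decoder that stores one key and one value vector per layer, per KV head, per token; the model dimensions used in the example are hypothetical and are not taken from the TriForce paper.

```python
def kv_cache_bytes(seq_len: int, num_layers: int, num_kv_heads: int,
                   head_dim: int, bytes_per_value: int = 2,
                   batch_size: int = 1) -> int:
    """Approximate KV cache size in bytes; the leading 2 counts keys and values."""
    return (2 * batch_size * num_layers * num_kv_heads
            * head_dim * seq_len * bytes_per_value)


if __name__ == "__main__":
    # Hypothetical configuration: 32 layers, 32 KV heads, head_dim 128, 16-bit values.
    for tokens in (4096, 8192, 16384):
        gib = kv_cache_bytes(tokens, 32, 32, 128) / 2**30
        print(f"{tokens:>6} tokens: {gib:.1f} GiB")
```

Under these assumptions each token adds 0.5 MiB, so doubling the sequence length doubles the cache.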

UFO: A UI-Focused Agent for Windows OS Interaction

microsoft/UFO • 8 Feb 2024

We introduce UFO, an innovative UI-Focused agent to fulfill user requests tailored to applications on Windows OS, harnessing the capabilities of GPT-Vision.

Navigate

4,154

0.35 stars / hour


Proximal Policy Optimization Algorithms

uvipen/super-mario-bros-ppo-pytorch • 20 Jul 2017

We propose a new family of policy gradient methods for reinforcement learning, which alternate between sampling data through interaction with the environment, and optimizing a "surrogate" objective function using stochastic gradient ascent.

Ranked #2 on Neural Architecture Search on NATS-Bench Topology, CIFAR-100

Continuous Control Dota 2 +3

1,041

0.35 stars / hour

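
The excerpt above does not spell out the "surrogate" objective. The sketch below assumes the clipped form commonly associated with PPO, in which the probability ratio between the new and old policy is clipped to the interval [1 - eps, 1 + eps] and the more pessimistic of the clipped and unclipped terms is kept. The function name and the default `clip_eps` are illustrative choices, and the code is plain Python rather than the linked repository's implementation.

```python
from typing import Sequence


def clipped_surrogate(ratios: Sequence[float], advantages: Sequence[float],
                      clip_eps: float = 0.2) -> float:
    """Mean of min(r * A, clip(r, 1 - eps, 1 + eps) * A) over the samples."""
    total = 0.0
    for r, a in zip(ratios, advantages):
        clipped_r = min(max(r, 1.0 - clip_eps), 1.0 + clip_eps)
        total += min(r * a, clipped_r * a)  # keep the more pessimistic estimate
    return total / len(ratios)


if __name__ == "__main__":
    # The ratio 1.5 is clipped to 1.2 because the advantage is positive.
    print(clipped_surrogate([1.5, 1.0], [1.0, 2.0]))
```

In training, this quantity would be maximized by stochastic gradient ascent over the policy parameters that determine the ratios.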