Language Model Crossover: Variation through Few-Shot Prompting

carperai/openelm 23 Feb 2023

The promise of such language model crossover (which is simple to implement and can leverage many different open-source language models) is that it enables a simple mechanism to evolve semantically-rich text representations (with few domain-specific tweaks), and naturally benefits from current progress in language models.

In-Context Learning, Language Modelling

540
0.46 stars / hour

Moving Object Segmentation: All You Need Is SAM (and Flow)

Jyxarthur/flowsam 18 Apr 2024

The objective of this paper is motion segmentation -- discovering and segmenting the moving objects in a video.

Motion Segmentation, Object, +6

165
0.41 stars / hour

Asynchronous Methods for Deep Reinforcement Learning

uvipen/Super-mario-bros-A3C-pytorch 4 Feb 2016

We propose a conceptually simple and lightweight framework for deep reinforcement learning that uses asynchronous gradient descent for optimization of deep neural network controllers.

Atari Games, reinforcement-learning, +1

1,006
0.39 stars / hour

STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases

snap-stanford/stark 19 Apr 2024

Answering real-world user queries, such as product search, often requires accurate retrieval of information from semi-structured knowledge bases or databases that involve a blend of unstructured (e.g., textual descriptions of products) and structured (e.g., entity relations of products) information.

Benchmarking, Retrieval

55
0.39 stars / hour

GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting

ku-cvlab/gaussiantalker 24 Apr 2024

A key insight is to encode the 3D Gaussian attributes into a shared implicit feature representation, where it is merged with audio features to manipulate each Gaussian attribute.

Attribute

45
0.38 stars / hour

PromptBench: A Unified Library for Evaluation of Large Language Models

microsoft/promptbench 13 Dec 2023

The evaluation of large language models (LLMs) is crucial to assess their performance and mitigate potential security risks.

Prompt Engineering

2,047
0.37 stars / hour

TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

Infini-AI-Lab/TriForce 18 Apr 2024

However, key-value (KV) cache, which is stored to avoid re-computation, has emerged as a critical bottleneck by growing linearly in size with the sequence length.

101
0.37 stars / hour

UFO: A UI-Focused Agent for Windows OS Interaction

microsoft/UFO 8 Feb 2024

We introduce UFO, an innovative UI-Focused agent to fulfill user requests tailored to applications on Windows OS, harnessing the capabilities of GPT-Vision.

Navigate

4,154
0.35 stars / hour

Proximal Policy Optimization Algorithms

uvipen/super-mario-bros-ppo-pytorch 20 Jul 2017

We propose a new family of policy gradient methods for reinforcement learning, which alternate between sampling data through interaction with the environment, and optimizing a "surrogate" objective function using stochastic gradient ascent.

Continuous Control, Dota 2, +3

1,041
0.35 stars / hour