Trending Research

The Platonic Representation Hypothesis

minyoungg/platonic-rep • 13 May 2024

We argue that representations in AI models, particularly deep networks, are converging.

135 stars

1.07 stars / hour

Paper
Code

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

hvision-nku/storydiffusion • 2 May 2024

This module converts the generated sequence of images into videos with smooth transitions and consistent subjects that are significantly more stable than the modules based on latent spaces only, especially in the context of long video generation.

motion prediction, Story Generation, +1 more

4,632 stars

0.78 stars / hour

Paper
Code

RLHF Workflow: From Reward Modeling to Online RLHF

rlhflow/online-rlhf • 13 May 2024

In this technical report, we present the workflow of Online Iterative Reinforcement Learning from Human Feedback (RLHF), which is widely reported to outperform its offline counterpart by a large margin in the recent large language model (LLM) literature.

Chatbot, Language Modelling, +1 more

0.75 stars / hour

Paper
Code

AgentScope: A Flexible yet Robust Multi-Agent Platform

modelscope/agentscope • 21 Feb 2024

With the rapid advancement of Large Language Models (LLMs), significant progress has been made in multi-agent applications.

Multi-agent Integration

2,295 stars

0.71 stars / hour

Paper
Code

Kolmogorov-Arnold Networks are Radial Basis Function Networks

ZiyaoLi/fast-kan • 10 May 2024

This short paper is a fast proof-of-concept that the third-order B-splines used in Kolmogorov-Arnold Networks (KANs) can be well approximated by Gaussian radial basis functions.

166 stars

0.67 stars / hour

Paper
Code
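
The B-spline claim in the entry above can be checked numerically with a small sketch. It assumes a uniform cubic B-spline centred at 0 and a unit-area Gaussian whose variance is matched to the spline (1/3); this moment-matching choice and the function names are illustrative assumptions, not necessarily the fit used in the paper or in the fast-kan repository.

```python
import math


def cubic_bspline(x: float) -> float:
    """Uniform cubic B-spline centred at 0 with support [-2, 2]."""
    ax = abs(x)
    if ax < 1.0:
        return 2.0 / 3.0 - ax**2 + 0.5 * ax**3
    if ax < 2.0:
        return (2.0 - ax) ** 3 / 6.0
    return 0.0


def gaussian_rbf(x: float, variance: float = 1.0 / 3.0) -> float:
    """Unit-area Gaussian; the default variance equals that of the cubic B-spline.

    Assumption: moment matching is used here only for illustration.
    """
    return math.exp(-x * x / (2.0 * variance)) / math.sqrt(2.0 * math.pi * variance)


def max_abs_error(n_points: int = 801) -> float:
    """Largest pointwise gap between the two curves on an even grid over [-4, 4]."""
    xs = [-4.0 + 8.0 * i / (n_points - 1) for i in range(n_points)]
    return max(abs(cubic_bspline(x) - gaussian_rbf(x)) for x in xs)


if __name__ == "__main__":
    print(f"max |B-spline - Gaussian| on [-4, 4]: {max_abs_error():.4f}")
```

The gap stays small relative to the spline's peak value of 2/3, which is the sense in which a Gaussian can stand in for the spline basis in this sketch.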

UFO: A UI-Focused Agent for Windows OS Interaction

microsoft/UFO • 8 Feb 2024

We introduce UFO, an innovative UI-Focused agent to fulfill user requests tailored to applications on Windows OS, harnessing the capabilities of GPT-Vision.

Navigate

4,859 stars

0.66 stars / hour

Paper
Code

MarkLLM: An Open-Source Toolkit for LLM Watermarking

thu-bpm/markllm • 16 May 2024

The abundance of LLM watermarking algorithms, their intricate mechanisms, and the complex evaluation procedures and perspectives make it hard for researchers and the community to experiment with, understand, and assess the latest advancements.

0.63 stars / hour

Paper
Code

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

deepseek-ai/deepseek-v2 • 7 May 2024

Multi-head Latent Attention (MLA) guarantees efficient inference by significantly compressing the Key-Value (KV) cache into a latent vector, while DeepSeekMoE enables training strong models at an economical cost through sparse computation.

Language Modelling, Reinforcement Learning (RL)

2,097 stars

0.62 stars / hour

Paper
Code
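
The idea of compressing the KV cache into a latent vector, as described in the entry above, can be illustrated with a toy sketch. All sizes, weight names (`W_down`, `W_up_k`, `W_up_v`) and helper functions below are hypothetical, the weights are random, and details of the real model (such as positional encoding handling) are omitted; it only shows that caching one small latent per token can replace caching full keys and values.

```python
import random

# Hypothetical toy sizes, chosen only for illustration.
D_MODEL = 16    # hidden size per token
N_HEADS = 4     # attention heads
D_HEAD = 4      # size of each head's key/value
D_LATENT = 4    # size of the cached latent vector


def matvec(matrix, vector):
    """Multiply a list-of-rows matrix by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def random_matrix(rows, cols, rng):
    """Random weights standing in for learned projections."""
    return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
W_down = random_matrix(D_LATENT, D_MODEL, rng)           # hidden -> latent
W_up_k = random_matrix(N_HEADS * D_HEAD, D_LATENT, rng)  # latent -> keys
W_up_v = random_matrix(N_HEADS * D_HEAD, D_LATENT, rng)  # latent -> values


def cache_token(hidden):
    """Store only the compressed latent for this token."""
    return matvec(W_down, hidden)


def expand_kv(latent):
    """Rebuild the keys and values of all heads from a cached latent."""
    return matvec(W_up_k, latent), matvec(W_up_v, latent)


# Eight random hidden states stand in for a short sequence.
hidden_states = [[rng.gauss(0.0, 1.0) for _ in range(D_MODEL)] for _ in range(8)]
latent_cache = [cache_token(h) for h in hidden_states]
keys, values = expand_kv(latent_cache[0])

# Floats cached per sequence: full keys and values versus latents only.
full_floats = len(hidden_states) * 2 * N_HEADS * D_HEAD
latent_floats = len(hidden_states) * D_LATENT

if __name__ == "__main__":
    print(f"full KV cache: {full_floats} floats, latent cache: {latent_floats} floats")
```

With these toy sizes the latent cache holds 32 floats instead of 256, at the price of re-expanding keys and values when attention is computed.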

GIVT: Generative Infinite-Vocabulary Transformers

google-research/big_vision • 4 Dec 2023

We introduce generative infinite-vocabulary transformers (GIVT) which generate vector sequences with real-valued entries, instead of discrete tokens from a finite vocabulary.

Ranked #13 on Image Generation on ImageNet 256x256

Conditional Image Generation, Decoder, +2 more

1,728 stars

0.59 stars / hour

Paper
Code

Fundus: A Simple-to-Use News Scraper Optimized for High Quality Extractions

flairnlp/fundus • 22 Mar 2024

This paper introduces Fundus, a user-friendly news scraper that enables users to obtain millions of high-quality news articles with just a few lines of code.

0.55 stars / hour

Paper
Code