Smaller But Better: Unifying Layout Generation with Smaller Large Language Models

niceringnode/lggpt 19 Feb 2025

We propose LGGPT, an LLM-based model tailored for unified layout generation.

Layout Generation

137
0.53 stars / hour

R-KV: Redundancy-aware KV Cache Compression for Training-Free Reasoning Models Acceleration

zefan-cai/r-kv 30 May 2025

To address this, we propose Redundancy-aware KV Cache Compression for Reasoning models (R-KV), a novel method specifically targeting redundant tokens in reasoning models.

Mathematical Reasoning

938
0.52 stars / hour

ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks

IBM/itbench-sample-scenarios 7 Feb 2025

Our results show that agents powered by state-of-the-art models resolve only 13. 8% of SRE scenarios, 25. 2% of CISO scenarios, and 0% of FinOps scenarios.

Benchmarking

100
0.50 stars / hour

ManimML: Communicating Machine Learning Architectures with Animation

helblazer811/manimml 29 Jun 2023

A user can take a preexisting neural network architecture and easily write a specification for an animation in ManimML, which will then automatically compose animations for different components of the system into a final animation of the entire neural network.

2,842
0.50 stars / hour

TradingAgents: Multi-Agents LLM Financial Trading Framework

tauricresearch/tradingagents 28 Dec 2024

Significant progress has been made in automated problem-solving using societies of agents powered by large language models (LLMs).

Management

13,713
0.50 stars / hour

Mirage: A Multi-Level Superoptimizer for Tensor Programs

mirage-project/mirage 9 May 2024

We introduce Mirage, the first multi-level superoptimizer for tensor programs.

Navigate

1,537
0.44 stars / hour

Continuous Thought Machines

SakanaAI/continuous-thought-machines 8 May 2025

The CTM has two core innovations: (1) neuron-level temporal processing, where each neuron uses unique weight parameters to process a history of incoming signals; and (2) neural synchronization employed as a latent representation.

Computational Efficiency Question Answering

1,163
0.44 stars / hour

Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting

bytedance/dolphin 20 May 2025

Document image parsing is challenging due to its complexly intertwined elements such as text paragraphs, figures, formulas, and tables.

3,801
0.43 stars / hour

AlphaEvolve: A Learning Framework to Discover Novel Alphas in Quantitative Investment

codelion/openevolve 30 Mar 2021

In this paper, we introduce a new class of alphas to model scalar, vector, and matrix features which possess the strengths of these two existing classes.

AutoML Stock Prediction

3,136
0.41 stars / hour

Skill Expansion and Composition in Parameter Space

ltlhuuu/PSEC 9 Feb 2025

In this paper, we propose Parametric Skill Expansion and Composition (PSEC), a new framework designed to iteratively evolve the agents' capabilities and efficiently address new challenges by maintaining a manageable skill library.

D4RL

58
0.39 stars / hour