olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models

allenai/olmocr 25 Feb 2025

PDF documents have the potential to provide trillions of novel, high-quality tokens for training language models.

Diversity Language Modeling +1

9,749
0.93 stars / hour

Lanpaint: Training-Free Diffusion Inpainting with Exact and Fast Conditional Inference

scraed/LanPaint 5 Feb 2025

Diffusion models generate high-quality images but often lack efficient and universally applicable inpainting capabilities, particularly in community-trained models.

153
0.88 stars / hour

Executable Code Actions Elicit Better LLM Agents

xingyaoww/code-act 1 Feb 2024

LLM agents are typically prompted to produce actions by generating JSON or text in a pre-defined format, which is usually limited by constrained action space (e. g., the scope of pre-defined tools) and restricted flexibility (e. g., inability to compose multiple tools).

Language Modelling Large Language Model

798
0.85 stars / hour

A Survey of Graph Retrieval-Augmented Generation for Customized Large Language Models

deep-polyu/awesome-graphrag 21 Jan 2025

Large language models (LLMs) have demonstrated remarkable capabilities in a wide range of tasks, yet their application to specialized domains remains challenging due to the need for deep expertise.

RAG Text Retrieval

787
0.76 stars / hour

Chain of Draft: Thinking Faster by Writing Less

sileix/chain-of-draft 25 Feb 2025

Large Language Models (LLMs) have demonstrated remarkable performance in solving complex reasoning tasks through mechanisms like Chain-of-Thought (CoT) prompting, which emphasizes verbose, step-by-step reasoning.

148
0.76 stars / hour

HybridFlow: A Flexible and Efficient RLHF Framework

hiyouga/easyr1 28 Sep 2024

Traditional RL can be modeled as a dataflow, where each node represents computation of a neural network (NN) and each edge denotes data dependencies between the NNs.

Large Language Model

1,402
0.70 stars / hour

Residual Kolmogorov-Arnold Network for Enhanced Deep Learning

withray/residualkan 7 Oct 2024

Despite the strong performance in many computer vision tasks, Convolutional Neural Networks (CNNs) can sometimes struggle to efficiently capture long-range, complex non-linear dependencies in deeper layers of the network.

Computational Efficiency Deep Learning

185
0.67 stars / hour

Learning Efficient Online 3D Bin Packing on Packing Configuration Trees

alexfrom0815/Online-3D-BPP-PCT ICLR 2022

PCT is a full-fledged description of the state and action space of bin packing which can support packing policy learning based on deep reinforcement learning (DRL).

3D Bin Packing Deep Reinforcement Learning

449
0.67 stars / hour

Self-rewarding correction for mathematical reasoning

volcengine/verl 26 Feb 2025

We study self-rewarding reasoning large language models (LLMs), which can simultaneously generate step-by-step reasoning and evaluate the correctness of their outputs during the inference time-without external feedback.

Mathematical Reasoning

4,734
0.59 stars / hour