Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting

bytedance/dolphin 20 May 2025

Document image parsing is challenging due to its complexly intertwined elements such as text paragraphs, figures, formulas, and tables.

131
3.80 stars / hour

AlphaEvolve: A Learning Framework to Discover Novel Alphas in Quantitative Investment

codelion/openevolve 30 Mar 2021

In this paper, we introduce a new class of alphas to model scalar, vector, and matrix features which possess the strengths of these two existing classes.

AutoML Stock Prediction

404
3.79 stars / hour

Aligning Anime Video Generation with Human Feedback

bilibili/index-anisora 14 Apr 2025

Existing reward models, designed primarily for real-world videos, fail to capture the unique appearance and consistency requirements of anime.

Video Generation

714
3.67 stars / hour

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

jiuhaichen/blip3o 14 May 2025

Building on our innovative model design, training recipe, and datasets, we develop BLIP3-o, a suite of state-of-the-art unified multimodal models.

Image Generation

750
2.25 stars / hour

Parallel Scaling Law for Language Models

qwenlm/parscale 15 May 2025

We apply $P$ diverse and learnable transformations to the input, execute forward passes of the model in parallel, and dynamically aggregate the $P$ outputs.

271
1.67 stars / hour
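
The mechanism summarized in the Parallel Scaling Law entry above (transform the input $P$ ways, run the model on each stream, aggregate the $P$ outputs with dynamic weights) can be illustrated with a toy sketch. This is not the authors' implementation: the additive `shifts`, the scalar `gate` function, and the name `parallel_forward` are all hypothetical stand-ins chosen for illustration, and the streams are evaluated sequentially here rather than in parallel.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def parallel_forward(model, x, shifts, gate):
    """Toy version of the transform / forward / aggregate pattern.

    model  : callable mapping a list of floats to a list of floats
    x      : input vector (list of floats)
    shifts : P vectors; each is added to x (assumed form of the
             "learnable transformation", purely illustrative)
    gate   : callable mapping one output vector to a scalar score
             (assumed form of the "dynamic aggregation")
    """
    # 1. Apply P different transformations to the same input.
    streams = [[xi + si for xi, si in zip(x, shift)] for shift in shifts]
    # 2. Run the model once per stream (conceptually in parallel).
    outputs = [model(s) for s in streams]
    # 3. Compute aggregation weights from the outputs themselves.
    weights = softmax([gate(o) for o in outputs])
    # 4. Return the weighted sum of the P outputs.
    dim = len(outputs[0])
    return [sum(w * o[j] for w, o in zip(weights, outputs)) for j in range(dim)]


# Usage with P = 2, a model that doubles its input, and a constant gate.
# A constant gate gives uniform weights, so the result is the plain mean
# of the two outputs [2.0, 4.0] and [6.0, 8.0].
doubled = lambda v: [2 * a for a in v]
print(parallel_forward(doubled, [1.0, 2.0], [[0.0, 0.0], [2.0, 2.0]], lambda o: 0.0))
# prints [4.0, 6.0]
```

In a real setting the shifts and the gate would be trained parameters and the $P$ forward passes would be batched; the sketch only shows how the three steps compose.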

Fully Open Source Moxin-7B Technical Report

moxin-org/moxin-llm 8 Dec 2024

Recently, Large Language Models (LLMs) have undergone a significant transformation, marked by a rapid rise in both their popularity and capabilities.

416
1.40 stars / hour

Thinkless: LLM Learns When to Think

vainf/thinkless 19 May 2025

Reasoning Language Models, capable of extended chain-of-thought reasoning, have demonstrated remarkable performance on tasks requiring complex logical inference.

GSM8K Math

58
1.25 stars / hour

Visual Planning: Let's Think Only with Images

yix8/visualplanning 16 May 2025

Recent advancements in Large Language Models (LLMs) and their multimodal extensions (MLLMs) have substantially enhanced machine reasoning across diverse tasks.

Reinforcement Learning +1

84
1.24 stars / hour

Tiny QA Benchmark++: Ultra-Lightweight, Synthetic Multilingual Dataset Generation & Smoke-Tests for Continuous LLM Evaluation

vincentkoc/tiny_qa_benchmark_pp 17 May 2025

Tiny QA Benchmark++ (TQB++) presents an ultra-lightweight, multilingual smoke-test suite designed to give large-language-model (LLM) pipelines a unit-test style safety net dataset that runs in seconds with minimal cost.

Dataset Generation Large Language Model +3

45
1.12 stars / hour

From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery

hkust-knowcomp/awesome-llm-scientific-discovery 19 May 2025

Large Language Models (LLMs) are catalyzing a paradigm shift in scientific discovery, evolving from task-specific automation tools into increasingly autonomous agents and fundamentally redefining research processes and human-AI collaboration.

Navigate scientific discovery +1

44
1.10 stars / hour