Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement

hiyouga/easyr1 9 Mar 2025

Traditional methods for reasoning segmentation rely on supervised fine-tuning with categorical labels and simple descriptions, limiting its out-of-domain generalization and lacking explicit reasoning processes.

Domain Generalization Open Vocabulary Object Detection +6

1,546
0.67 stars / hour

Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

petergriffinjin/search-r1 12 Mar 2025

Efficiently acquiring external knowledge and up-to-date information is essential for effective reasoning and text generation in large language models (LLMs).

Question Answering Reinforcement Learning (RL) +2

1,160
0.65 stars / hour

Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language Models

emcie-co/parlant 5 Mar 2025

We present Attentive Reasoning Queries (ARQs), a novel structured reasoning approach that significantly improves instruction-following in Large Language Models through domain-specialized reasoning blueprints.

Hallucination Instruction Following +1

1,859
0.49 stars / hour

Self-rewarding correction for mathematical reasoning

volcengine/verl 26 Feb 2025

We study self-rewarding reasoning large language models (LLMs), which can simultaneously generate step-by-step reasoning and evaluate the correctness of their outputs during the inference time-without external feedback.

Mathematical Reasoning

5,119
0.62 stars / hour

GENERator: A Long-Context Generative Genomic Foundation Model

generteam/generator 11 Feb 2025

Recent developments in genomic language models have underscored the potential of LLMs in deciphering DNA sequences.

model

375
0.57 stars / hour

Retrieval-Augmented Generation with Hierarchical Knowledge

hhy-huang/HiRAG 13 Mar 2025

Graph-based Retrieval-Augmented Generation (RAG) methods have significantly enhanced the performance of large language models (LLMs) in domain-specific tasks.

Multi-hop Question Answering Question Answering +2

66
0.54 stars / hour

OmniSQL: Synthesizing High-quality Text-to-SQL Data at Scale

RUCKBReasoning/OmniSQL 4 Mar 2025

Text-to-SQL, the task of translating natural language questions into SQL queries, plays a crucial role in enabling non-experts to interact with databases.

Text-To-SQL

104
0.51 stars / hour

Evaluating Self-Supervised Learning for Molecular Graph Embeddings

hansen7/molgrapheval NeurIPS 2023

Graph Self-Supervised Learning (GSSL) provides a robust pathway for acquiring embeddings without expert labelling, a capability that carries profound implications for molecular graphs due to the staggering number of potential molecules and the high cost of obtaining labels.

Self-Supervised Learning

111
0.50 stars / hour

From System 1 to System 2: A Survey of Reasoning Large Language Models

zzli2022/awesome-slow-reason-system 24 Feb 2025

Achieving human-level intelligence requires refining the transition from the fast, intuitive System 1 to the slower, more deliberate System 2 reasoning.

Logical Reasoning

792
0.49 stars / hour

Aggregated Contextual Transformations for High-Resolution Image Inpainting

zyddnys/manga-image-translator 3 Apr 2021

For improving texture synthesis, we enhance the discriminator of AOT-GAN by training it with a tailored mask-prediction task.

Image Inpainting Texture Synthesis +1

6,801
0.44 stars / hour