Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG

asinghcsu/agenticrag-survey 15 Jan 2025

Large Language Models (LLMs) have revolutionized artificial intelligence (AI) by enabling human like text generation and natural language understanding.

Natural Language Understanding RAG +3

144
0.45 stars / hour

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

hjyao00/mulberry 24 Dec 2024

Using CoMCTS, we construct Mulberry-260k, a multimodal dataset with a tree of rich, explicit and well-defined reasoning nodes for each question.

250
0.43 stars / hour

Predicting Human Brain States with Transformer

syf0122/brain_state_pred 11 Dec 2024

The human brain is a complex and highly dynamic system, and our current knowledge of its functional mechanism is still very limited.

Language Modelling Music Generation

64
0.42 stars / hour

Click-Calib: A Robust Extrinsic Calibration Method for Surround-View Systems

lwangvaleo/click_calib 2 Jan 2025

Surround-View System (SVS) is an essential component in Advanced Driver Assistance System (ADAS) and requires precise calibrations.

55
0.38 stars / hour

Lifelong Learning of Large Language Model based Agents: A Roadmap

qianlima-lab/awesome-lifelong-llm-agent 13 Jan 2025

This survey is the first to systematically summarize the potential techniques for incorporating lifelong learning into LLM-based agents.

Incremental Learning Language Modeling +3

102
0.36 stars / hour

Don't Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks

hhhuang/cag 20 Dec 2024

With the advent of large language models (LLMs) featuring significantly extended context windows, this paper proposes an alternative paradigm, cache-augmented generation (CAG) that bypasses real-time retrieval.

RAG Retrieval

849
0.36 stars / hour

Gated Delta Networks: Improving Mamba2 with Delta Rule

sustcsonglin/flash-linear-attention 9 Dec 2024

Linear Transformers have gained attention as efficient alternatives to standard Transformers, but their performance in retrieval and long-context tasks has been limited.

Common Sense Reasoning Language Modeling +3

1,776
0.35 stars / hour

StateFlow: Enhancing LLM Task-Solving through State-Driven Workflows

ag2ai/ag2 17 Mar 2024

In StateFlow, we distinguish between "process grounding" (via state and state transitions) and "sub-task solving" (through actions within a state), enhancing control and interpretability of the task-solving procedure.

Management

1,548
0.34 stars / hour

The GAN is dead; long live the GAN! A Modern GAN Baseline

brownvc/r3gan 9 Jan 2025

There is a widely-spread claim that GANs are difficult to train, and GAN architectures in the literature are littered with empirical tricks.

Image Generation

598
0.34 stars / hour

3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering

limuloo/3DIS 9 Jan 2025

In this paper, we present 3DIS-FLUX, an extension of the 3DIS framework that integrates the FLUX model for enhanced rendering capabilities.

Text-to-Image Generation

177
0.32 stars / hour