A Survey on Large Language Model based Human-Agent Systems

HenryPengZou/Awesome-LLM-Based-Human-Agent-System-Papers 1 May 2025

Recent advances in large language models (LLMs) have sparked growing interest in building fully autonomous agents.

Language Modeling Language Modelling +1

49
0.21 stars / hour

Squeezeformer: An Efficient Transformer for Automatic Speech Recognition

NVIDIA/NeMo 2 Jun 2022

After re-examining the design choices for both the macro and micro-architecture of Conformer, we propose Squeezeformer which consistently outperforms the state-of-the-art ASR models under the same training schemes.

Automatic Speech Recognition Automatic Speech Recognition (ASR)

14,317
0.21 stars / hour

A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency

sihyeong/awesome-llm-inference-engine 3 May 2025

This paper provides a comprehensive evaluation of 25 open-source and commercial inference engines.

32
0.20 stars / hour

Learning Dynamics of LLM Finetuning

joshua-ren/learning_dynamics_llm 15 Jul 2024

Learning dynamics, which describes how the learning of specific training examples influences the model's predictions on other examples, gives us a powerful tool for understanding the behavior of deep learning systems.

Hallucination

108
0.20 stars / hour

The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search

sakanaai/ai-scientist-v2 10 Apr 2025

AI is increasingly playing a pivotal role in transforming how scientific discoveries are made.

scientific discovery

1,098
0.20 stars / hour

Adaptive In-conversation Team Building for Language Model Agents

ag2ai/ag2 29 May 2024

Leveraging multiple large language model (LLM) agents has shown to be a promising approach for tackling complex tasks, while the effective design of multiple agents for a particular application remains an art.

Diversity Language Modeling +3

2,491
0.20 stars / hour

Sequential Models in the Synthetic Data Vault

sdv-dev/SDV 28 Jul 2022

After building the Sequential SDV, we used it to generate synthetic data and compared its quality against an existing, non-sequential generative adversarial network based model called CTGAN.

Generative Adversarial Network

2,805
0.20 stars / hour

Do Large Language Models Need a Content Delivery Network?

lmcache/lmcache 16 Sep 2024

As the use of large language models (LLMs) expands rapidly, so does the range of knowledge needed to supplement various LLM queries.

In-Context Learning

1,009
0.20 stars / hour
2,372
0.20 stars / hour

Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning

SkyworkAI/Skywork-R1V 23 Apr 2025

We present Skywork R1V2, a next-generation multimodal reasoning model and a major leap forward from its predecessor, Skywork R1V.

Multimodal Reasoning reinforcement-learning +1

2,478
0.19 stars / hour