Large Language Model-Based Agents for Software Engineering: A Survey

fudanselab/agent4se-paper-list 4 Sep 2024

The recent advance in Large Language Models (LLMs) has shaped a new paradigm of AI agents, i. e., LLM-based agents.

Language Modelling Large Language Model

200
0.44 stars / hour

LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA

THUDM/LongCite 4 Sep 2024

Though current long-context large language models (LLMs) have demonstrated impressive capacities in answering user questions based on extensive text, the lack of citations in their responses makes user verification difficult, leading to concerns about their trustworthiness due to their potential hallucinations.

Question Answering Sentence

222
0.43 stars / hour

OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer

om-ai-lab/OmAgent 24 Jun 2024

Recent advancements in Large Language Models (LLMs) have expanded their capabilities to multimodal contexts, including comprehensive video understanding.

AI Agent Video Understanding

492
0.42 stars / hour

Enhancing Sample Efficiency and Exploration in Reinforcement Learning through the Integration of Diffusion Models and Proximal Policy Optimization

tiancigao/diffppo 2 Sep 2024

Recent advancements in reinforcement learning (RL) have been fueled by large-scale data and deep neural networks, particularly for high-dimensional and complex tasks.

Diversity Offline RL +1

43
0.40 stars / hour

FaceScore: Benchmarking and Enhancing Face Quality in Human Generation

oppo-mente-lab/facescore 24 Jun 2024

Targeting addressing such an issue, we first assess the face quality of generations from popular pre-trained DMs with the aid of human annotators and then evaluate the alignment between existing metrics with human judgments.

Benchmarking Denoising +2

35
0.37 stars / hour

GeoCalib: Learning Single-image Calibration with Geometric Optimization

cvg/geocalib 10 Sep 2024

This single-image calibration can benefit various downstream applications like image editing and 3D mapping.

3D geometry Visual Localization

120
0.37 stars / hour

Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers

NoviScl/AI-Researcher 6 Sep 2024

Recent advancements in large language models (LLMs) have sparked optimism about their potential to accelerate scientific discovery, with a growing number of works proposing research agents that autonomously generate and validate new ideas.

Experimental Design scientific discovery

163
0.36 stars / hour

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

jishengpeng/wavtokenizer 29 Aug 2024

Despite the reduced number of tokens, WavTokenizer achieves state-of-the-art reconstruction quality with outstanding UTMOS scores and inherently contains richer semantic information.

Language Modelling

652
0.35 stars / hour

Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis

caiyuanhao1998/sax-nerf 7 Mar 2024

X-ray is widely applied for transmission imaging due to its stronger penetration than natural light.

CT Reconstruction Novel View Synthesis

417
0.34 stars / hour

One-Shot Diffusion Mimicker for Handwritten Text Generation

dailenson/one-dm 6 Sep 2024

Extensive experiments demonstrate that our method can successfully generate handwriting scripts with just one sample reference in multiple languages, even outperforming previous methods using over ten samples.

Handwriting generation Text Generation

128
0.34 stars / hour