Search Results

AFlow: Automating Agentic Workflow Generation

1 code implementation14 Oct 2024

Large language models (LLMs) have demonstrated remarkable potential in solving complex tasks across diverse domains, typically by employing agentic workflows that follow detailed instructions and operational sequences.

Code Generation

Data Interpreter: An LLM Agent For Data Science

1 code implementation28 Feb 2024

On InfiAgent-DABench, it achieves a 25% performance boost, raising accuracy from 75. 9% to 94. 9%.

Code Generation Language Modelling +2

MetaGPT: Meta Programming for A Multi-Agent Collaborative Framework

1 code implementation1 Aug 2023

Remarkable progress has been made on automated problem solving through societies of agents based on large language models (LLMs).

Code Generation

SELA: Tree-Search Enhanced LLM Agents for Automated Machine Learning

1 code implementation22 Oct 2024

Automated Machine Learning (AutoML) approaches encompass traditional methods that optimize fixed pipelines for model selection and ensembling, as well as newer LLM-based frameworks that autonomously build pipelines.

AutoML Diversity +1

ChatDev: Communicative Agents for Software Development

1 code implementation16 Jul 2023

Numerous studies used deep learning to improve specific phases in a waterfall model, such as design, coding, and testing.

Decision Making

OpenHands: An Open Platform for AI Software Developers as Generalist Agents

2 code implementations23 Jul 2024

OpenDevin), a platform for the development of powerful and flexible AI agents that interact with the world in similar ways to those of a human developer: by writing code, interacting with a command line, and browsing the web.

Experiential Co-Learning of Software-Developing Agents

1 code implementation28 Dec 2023

Recent advancements in large language models (LLMs) have brought significant changes to various domains, especially through LLM-driven autonomous agents.

Autonomous Agents for Collaborative Task under Information Asymmetry

2 code implementations21 Jun 2024

Together with InfoNav, iAgents organizes human information in a mixed memory to provide agents with accurate and comprehensive information for exchange.

Language Modelling Large Language Model +1

Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web

1 code implementation30 Nov 2023

We show that while existing prompted LMAs (gpt-3. 5-turbo or gpt-4) achieve 94. 0% average success rate on base tasks, their performance degrades to 24. 9% success rate on compositional tasks.

Decision Making Language Modeling +1

Perceive, Reflect, and Plan: Designing LLM Agent for Goal-Directed City Navigation without Instructions

1 code implementation8 Aug 2024

In the absence of navigation instructions, such abilities are vital for the agent to make high-quality decisions in long-range city navigation.

AI Agent Navigate