TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools

mims-harvard/TxAgent 14 Mar 2025

It selects tools based on task objectives and executes structured function calls to solve therapeutic tasks that require clinical reasoning and cross-source validation.

AI Agent Decision Making

320
0.40 stars / hour

TextGrad: Automatic "Differentiation" via Text

zou-group/textgrad 11 Jun 2024

Without modifying the framework, TextGrad improves the zero-shot accuracy of GPT-4o in Google-Proof Question Answering from $51\%$ to $55\%$, yields $20\%$ relative performance gain in optimizing LeetCode-Hard coding problem solutions, improves prompts for reasoning, designs new druglike small molecules with desirable in silico binding, and designs radiation oncology treatment plans with high specificity.

Ranked #4 on on GPQA

Question Answering Specificity

2,287
0.39 stars / hour

HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation

dcdmllm/healthgpt 14 Feb 2025

To effectively learn the HealthGPT, we devise a comprehensive medical domain-specific comprehension and generation dataset called VL-Health.

Language Modeling Language Modelling +1

656
0.39 stars / hour

Docling Technical Report

DS4SD/docling 19 Aug 2024

This technical report introduces Docling, an easy to use, self-contained, MIT-licensed open-source package for PDF document conversion.

25,096
0.38 stars / hour

Why Do Multi-Agent LLM Systems Fail?

multi-agent-systems-failure-taxonomy/MASFT 17 Mar 2025

In this paper, we present the first comprehensive study of MAS challenges.

44
0.37 stars / hour

DEIM: DETR with Improved Matching for Fast Convergence

shihuahuang95/deim 5 Dec 2024

We introduce DEIM, an innovative and efficient training framework designed to accelerate convergence in real-time object detection with Transformer-based architectures (DETR).

 Ranked #1 on Real-Time Object Detection on MS COCO (using extra training data)

Data Augmentation object-detection +1

540
0.37 stars / hour

Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering

xiaomi-research/r1-aqa 14 Mar 2025

Recently, reinforcement learning (RL) has been shown to greatly enhance the reasoning capabilities of large language models (LLMs), and RL-based approaches have been progressively applied to visual multimodal tasks.

Audio Question Answering Question Answering +1

189
0.36 stars / hour

Sample-Efficient Alignment for LLMs

sail-sg/oat 3 Nov 2024

The results demonstrate that SEA achieves highly sample-efficient alignment with oracle's preferences, outperforming recent active exploration methods for LLMs.

Thompson Sampling

267
0.35 stars / hour

XAttention: Block Sparse Attention with Antidiagonal Scoring

mit-han-lab/x-attention 20 Mar 2025

In this paper, we introduce XAttention, a plug-and-play framework that dramatically accelerates long-context inference in Transformers models using sparse attention.

Video Generation Video Understanding

95
0.35 stars / hour

Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging

LZY-the-boys/Twin-Merging 17 Jun 2024

In view of this, we propose Twin-Merging, a method that encompasses two principal stages: (1) modularizing knowledge into shared and exclusive components, with compression to reduce redundancy and enhance efficiency; (2) dynamically merging shared and task-specific knowledge based on the input.

133
0.34 stars / hour