Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

zhentingqi/rstar 12 Aug 2024

This paper introduces rStar, a self-play mutual reasoning approach that significantly improves reasoning capabilities of small language models (SLMs) without fine-tuning or superior models.

GSM8K Math +1

156
1.27 stars / hour

Large Language Model-Based Agents for Software Engineering: A Survey

fudanselab/agent4se-paper-list 4 Sep 2024

The recent advance in Large Language Models (LLMs) has shaped a new paradigm of AI agents, i. e., LLM-based agents.

Language Modelling Large Language Model

174
1.18 stars / hour

Towards a Unified View of Preference Learning for Large Language Models: A Survey

kbsdjames/awesome-llm-preference-learning 4 Sep 2024

Finally, based on our unified perspective, we explore the challenges and future research directions for aligning large language models with human preferences.

109
1.13 stars / hour

optillm

codelion/optillm 26 Jul 2024

Optimizing inference proxy for LLMs

Inference Optimization

256
1.13 stars / hour

Docling Technical Report

DS4SD/docling 19 Aug 2024

This technical report introduces Docling, an easy to use, self-contained, MIT-licensed open-source package for PDF document conversion.

422
1.02 stars / hour

RealisDance: Equip controllable character animation with realistic hands

damo-cv/realisdance 10 Sep 2024

2) The hands generated using the DWPose sequence are blurry and unrealistic.

55
1.01 stars / hour

RobustSAM: Segment Anything Robustly on Degraded Images

robustsam/RobustSAM CVPR 2024

Segment Anything Model (SAM) has emerged as a transformative approach in image segmentation, acclaimed for its robust zero-shot segmentation capabilities and flexible prompting system.

Deblurring Image Dehazing +6

264
0.95 stars / hour

KGRefiner: Knowledge Graph Refinement for Improving Accuracy of Translational Link Prediction Methods

saeedizade/LinkPrediction 27 Jun 2021

The Link Prediction is the task of predicting missing relations between entities of the knowledge graph.

Ranked #2 on Link Prediction on FB15k-237 (training time (s) metric)

Knowledge Graph Embedding Link Prediction

114
0.92 stars / hour

FLUX that Plays Music

feizc/fluxmusic 1 Sep 2024

This paper explores a simple extension of diffusion-based rectified flow Transformers for text-to-music generation, termed as FluxMusic.

Music Generation Text-to-Music Generation

1,388
0.83 stars / hour

One-Shot Diffusion Mimicker for Handwritten Text Generation

dailenson/one-dm 6 Sep 2024

Extensive experiments demonstrate that our method can successfully generate handwriting scripts with just one sample reference in multiple languages, even outperforming previous methods using over ten samples.

Handwriting generation Text Generation

103
0.82 stars / hour