LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

dvlab-research/longlora 21 Sep 2023

LongLoRA adopts LLaMA2 7B from 4k context to 100k, or LLaMA2 70B to 32k on a single 8x A100 machine.

948
3.99 stars / hour

The Rise and Potential of Large Language Model Based Agents: A Survey

woooodyy/llm-agent-paper-list 14 Sep 2023

Many efforts have been made to develop intelligent agents, but they mainly focus on advancement in algorithms or training strategies to enhance specific capabilities or performance on particular tasks.

Language Modelling Large Language Model

2,619
3.55 stars / hour

Communicative Agents for Software Development

openbmb/chatdev 16 Jul 2023

At the core of this paradigm lies ChatDev, a virtual chat-powered software development company that mirrors the established waterfall model, meticulously dividing the development process into four distinct chronological stages: designing, coding, testing, and documenting.

Decision Making

9,202
2.86 stars / hour

ProPainter: Improving Propagation and Transformer for Video Inpainting

sczhou/propainter 7 Sep 2023

We also propose a mask-guided sparse video Transformer, which achieves high efficiency by discarding unnecessary and redundant tokens.

Optical Flow Estimation Video Inpainting

1,577
2.70 stars / hour

Agents: An Open-source Framework for Autonomous Language Agents

aiwaves-cn/agents 14 Sep 2023

Recent advances on large language models (LLMs) enable researchers and developers to build autonomous language agents that can automatically solve various tasks and interact with environments, humans, and other agents using natural language interfaces.

3,181
2.18 stars / hour

NExT-GPT: Any-to-Any Multimodal LLM

NExT-GPT/NExT-GPT 11 Sep 2023

While recently Multimodal Large Language Models (MM-LLMs) have made exciting strides, they mostly fall prey to the limitation of only input-side multimodal understanding, without the ability to produce content in multiple modalities.

1,655
2.07 stars / hour

FreeU: Free Lunch in Diffusion U-Net

ChenyangSi/FreeU 20 Sep 2023

In this paper, we uncover the untapped potential of diffusion U-Net, which serves as a "free lunch" that substantially improves the generation quality on the fly.

Denoising Video Generation

637
1.41 stars / hour

MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation

jiahao000/mosaicfusion 22 Sep 2023

We present MosaicFusion, a simple yet effective diffusion-based data augmentation approach for large vocabulary instance segmentation.

Data Augmentation Instance Segmentation +1

57
1.39 stars / hour

The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A"

lukasberglund/reversal_curse 21 Sep 2023

This shows a failure of logical deduction that we hypothesize is caused by the Reversal Curse.

Data Augmentation

147
1.27 stars / hour

DreamLLM: Synergistic Multimodal Comprehension and Creation

RunpeiDong/DreamLLM 20 Sep 2023

This paper presents DreamLLM, a learning framework that first achieves versatile Multimodal Large Language Models (MLLMs) empowered with frequently overlooked synergy between multimodal comprehension and creation.

167
1.11 stars / hour