Transparent Image Layer Diffusion using Latent Transparency

layerdiffusion/layerdiffusion 27 Feb 2024

We show that latent transparency can be applied to different open source image generators, or be adapted to various conditional control systems to achieve applications like foreground/background-conditioned layer generation, joint layer generation, structural control of layer contents, etc.

1,252
6.30 stars / hour

Intent-based Prompt Calibration: Enhancing prompt optimization with synthetic boundary cases

eladlev/autoprompt 5 Feb 2024

Recent studies have demonstrated the capabilities of LLMs to automatically conduct prompt engineering by employing a meta-prompt that incorporates the outcomes of the last trials and proposes an improved prompt.

Prompt Engineering

1,251
4.00 stars / hour

Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models

lichao-sun/sorareview 27 Feb 2024

Sora is a text-to-video generative AI model, released by OpenAI in February 2024.

Marketing Video Generation

310
2.18 stars / hour

YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

wongkinyiu/yolov9 21 Feb 2024

It can be used to obtain complete information, so that train-from-scratch models can achieve better results than state-of-the-art models pre-trained using large datasets, the comparison results are shown in Figure 1.

object-detection Real-Time Object Detection

6,167
2.15 stars / hour

Datasets for Large Language Models: A Comprehensive Survey

lmmlzn/awesome-llms-datasets 28 Feb 2024

Additionally, a comprehensive review of the existing available dataset resources is also provided, including statistics from 444 datasets, covering 8 language categories and spanning 32 domains.

Language Modelling Large Language Model

243
1.81 stars / hour

Learning to Generate Instruction Tuning Datasets for Zero-Shot Task Adaptation

batsresearch/bonito 28 Feb 2024

Overall, we show that learning with synthetic instruction tuning datasets is an effective way to adapt language models to new domains.

Attribute Extractive Question-Answering +2

192
1.77 stars / hour

MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT

mbzuai-oryx/mobillama 26 Feb 2024

"Bigger the better" has been the predominant trend in recent Large Language Models (LLMs) development.

363
1.53 stars / hour

Training-Free Long-Context Scaling of Large Language Models

hkunlp/chunkllama 27 Feb 2024

The ability of Large Language Models (LLMs) to process and generate coherent text is markedly weakened when the number of input tokens exceeds their pretraining length.

116
1.10 stars / hour

BitNet: Scaling 1-bit Transformers for Large Language Models

Beomi/BitNet-Transformers 17 Oct 2023

The increasing size of large language models has posed challenges for deployment and raised concerns about environmental impact due to high energy consumption.

Language Modelling Quantization

159
1.06 stars / hour

The First Place Solution of WSDM Cup 2024: Leveraging Large Language Models for Conversational Multi-Doc QA

zhangzhao219/wsdm-cup-2024 28 Feb 2024

Conversational multi-doc question answering aims to answer specific questions based on the retrieved documents as well as the contextual conversations.

Natural Language Understanding Question Answering

86
0.95 stars / hour