Chronos: Learning the Language of Time Series

amazon-science/chronos-forecasting 12 Mar 2024

We introduce Chronos, a simple yet effective framework for pretrained probabilistic time series models.

Gaussian Processes, Language Modelling, +2 more

1,256
0.54 stars / hour

AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System

salesforceairesearch/agentlite 23 Feb 2024

We open-source AgentLite, a new AI agent library that simplifies building task-oriented LLM agents by offering a lightweight, user-friendly platform for innovating on LLM agent reasoning, architectures, and applications.

214
0.54 stars / hour

UFO: A UI-Focused Agent for Windows OS Interaction

microsoft/UFO 8 Feb 2024

We introduce UFO, an innovative UI-Focused agent to fulfill user requests tailored to applications on Windows OS, harnessing the capabilities of GPT-Vision.

Navigate

3,338
0.52 stars / hour

Arcee's MergeKit: A Toolkit for Merging Large Language Models

cg123/mergekit 20 Mar 2024

The rapid expansion of the open-source language model landscape presents an opportunity to merge the competencies of these model checkpoints by combining their parameters.

Language Modelling, Multi-Task Learning

2,920
0.52 stars / hour

Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework

ictmcg/make-your-anchor 25 Mar 2024

We adopt a two-stage training strategy for the diffusion model, effectively binding movements with specific appearances.

Denoising

116
0.52 stars / hour

ChainForge: A Visual Toolkit for Prompt Engineering and LLM Hypothesis Testing

ianarawjo/ChainForge 17 Sep 2023

Evaluating outputs of large language models (LLMs) is challenging, requiring making -- and making sense of -- many responses.

Model Selection, Prompt Engineering, +1 more

1,823
0.52 stars / hour

OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models

kongzhecn/omg 16 Mar 2024

We also observe that the initiation denoising timestep for noise blending is the key to identity preservation and layout.

Denoising, Text-to-Image Generation

443
0.47 stars / hour

Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small

openai/transformer-debugger 1 Nov 2022

Research in mechanistic interpretability seeks to explain behaviors of machine learning models in terms of their internal components.

Language Modelling

3,522
0.45 stars / hour

SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series

badripatro/simba 22 Mar 2024

Transformers have widely adopted attention networks for sequence mixing and MLPs for channel mixing, playing a pivotal role in achieving breakthroughs across domains.

Inductive Bias, Time Series, +1 more

71
0.44 stars / hour

When Do We Not Need Larger Vision Models?

bfshi/scaling_on_scales 19 Mar 2024

Our results show that a multi-scale smaller model has comparable learning capacity to a larger model, and pre-training smaller models with S$^2$ can match or even exceed the advantage of larger models.

Depth Estimation

140
0.44 stars / hour