Mamba: Linear-Time Sequence Modeling with Selective State Spaces

state-spaces/mamba 1 Dec 2023

Foundation models, now powering most of the exciting applications in deep learning, are almost universally based on the Transformer architecture and its core attention module.

Language Modelling

2,377
9.80 stars / hour

TaskWeaver: A Code-First Agent Framework

microsoft/taskweaver 29 Nov 2023

TaskWeaver provides support for rich data structures, flexible plugin usage, and dynamic plugin selection, and leverages LLM coding capabilities for complex logic.

Natural Language Understanding

2,052
6.86 stars / hour

Magicoder: Source Code Is All You Need

ise-uiuc/magicoder 4 Dec 2023

Magicoder models are trained on 75K synthetic instruction data using OSS-Instruct, a novel approach to enlightening LLMs with open-source code snippets to generate high-quality instruction data for code.

Code Generation Text-to-Code Generation

685
5.13 stars / hour

Self-conditioned Image Generation via Generating Representations

LTH14/rcg 6 Dec 2023

During generation, RCG samples from such representation distribution using a representation diffusion model (RDM), and employs a pixel generator to craft image pixels conditioned on the sampled representation.

Conditional Image Generation Unconditional Image Generation

148
4.49 stars / hour

Sequential Modeling Enables Scalable Learning for Large Vision Models

ytongbai/LVM 1 Dec 2023

We introduce a novel sequential modeling approach which enables learning a Large Vision Model (LVM) without making use of any linguistic data.

1,199
3.90 stars / hour

HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis

sh-lee-prml/hierspeechpp 21 Nov 2023

Furthermore, we significantly improve the naturalness and speaker similarity of synthetic speech even in zero-shot speech synthesis scenarios.

Speech Synthesis Super-Resolution +2

728
2.77 stars / hour

DeepCache: Accelerating Diffusion Models for Free

horseee/deepcache 1 Dec 2023

Diffusion models have recently gained unprecedented attention in the field of image synthesis due to their remarkable generative capabilities.

Denoising Image Generation

189
2.02 stars / hour

Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians

yuelangx/gaussian-head-avatar 5 Dec 2023

Creating high-fidelity 3D head avatars has always been a research hotspot, but there remains a great challenge under lightweight sparse view setups.

82
1.85 stars / hour

Aligning and Prompting Everything All at Once for Universal Visual Perception

shenyunhang/ape 4 Dec 2023

However, predominant paradigms, driven by casting instance-level tasks as an object-word alignment, bring heavy cross-modality interaction, which is not effective in prompting object detection and visual grounding.

object-detection Object Detection +4

147
1.77 stars / hour

DemoFusion: Democratising High-Resolution Image Generation With No $$$

PRIS-CV/DemoFusion 24 Nov 2023

High-resolution image generation with Generative Artificial Intelligence (GenAI) has immense potential but, due to the enormous capital investment required for training, it is increasingly centralised to a few large corporations, and hidden behind paywalls.

Image Generation

322
1.76 stars / hour