Search Results

DeepSpeed-MoE: Advancing Mixture-of-Experts Inference and Training to Power Next-Generation AI Scale

3 code implementations14 Jan 2022

As the training of giant dense models hits the boundary on the availability and capability of the hardware resources today, Mixture-of-Experts (MoE) models become one of the most promising model architectures due to their significant training cost reduction compared to a quality-equivalent dense model.

Decoder Mixture-of-Experts +1

USP: A Unified Sequence Parallelism Approach for Long Context Generative AI

4 code implementations13 May 2024

Sequence parallelism (SP), which divides the sequence dimension of input tensors across multiple computational devices, is becoming key to unlocking the long-context capabilities of generative AI models.

MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models

1 code implementation18 Oct 2023

For developers and amateurs, it is very difficult to grasp all of these task to satisfy their requirements in music processing, especially considering the huge differences in the representations of music data and the model applicability across platforms among various tasks.

AI Agent Music Classification

EasyPhoto: Your Smart AI Photo Generator

2 code implementations7 Oct 2023

By training a digital doppelganger of a specific user ID using 5 to 20 relevant images, the finetuned model (according to the trained LoRA model) allows for the generation of AI photos using arbitrary templates.

Elucidating the Design Space of Diffusion-Based Generative Models

20 code implementations1 Jun 2022

We argue that the theory and practice of diffusion-based generative models are currently unnecessarily convoluted and seek to remedy the situation by presenting a design space that clearly separates the concrete design choices.

Image Generation

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

9 code implementations4 Jul 2023

We present SDXL, a latent diffusion model for text-to-image synthesis.

Image Generation

SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation

1 code implementation20 Mar 2025

We present Stable Video 4D 2. 0 (SV4D 2. 0), a multi-view video diffusion model for dynamic 3D asset generation.

PyRIT: A Framework for Security Risk Identification and Red Teaming in Generative AI System

1 code implementation1 Oct 2024

To meet this need, we introduce the Python Risk Identification Toolkit (PyRIT), an open-source framework designed to enhance red teaming efforts in GenAI systems.

Red Teaming

Retrieval-Augmented Generation for AI-Generated Content: A Survey

3 code implementations29 Feb 2024

We first classify RAG foundations according to how the retriever augments the generator, distilling the fundamental abstractions of the augmentation methodologies for various retrievers and generators.

Information Retrieval multimodal generation +4

SocratiQ: A Generative AI-Powered Learning Companion for Personalized Education and Broader Accessibility

1 code implementation1 Feb 2025

Traditional educational approaches often struggle to provide personalized and interactive learning experiences on a scale.

Computers and Society