Search Results

DeepSpeed-MoE: Advancing Mixture-of-Experts Inference and Training to Power Next-Generation AI Scale

3 code implementations14 Jan 2022

As the training of giant dense models hits the boundary on the availability and capability of the hardware resources today, Mixture-of-Experts (MoE) models become one of the most promising model architectures due to their significant training cost reduction compared to a quality-equivalent dense model.

Decoder Mixture-of-Experts +1

USP: A Unified Sequence Parallelism Approach for Long Context Generative AI

4 code implementations13 May 2024

Sequence parallelism (SP), which divides the sequence dimension of input tensors across multiple computational devices, is becoming key to unlocking the long-context capabilities of generative AI models.

MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models

1 code implementation18 Oct 2023

For developers and amateurs, it is very difficult to grasp all of these task to satisfy their requirements in music processing, especially considering the huge differences in the representations of music data and the model applicability across platforms among various tasks.

AI Agent Music Classification

EasyPhoto: Your Smart AI Photo Generator

2 code implementations7 Oct 2023

By training a digital doppelganger of a specific user ID using 5 to 20 relevant images, the finetuned model (according to the trained LoRA model) allows for the generation of AI photos using arbitrary templates.

PyRIT: A Framework for Security Risk Identification and Red Teaming in Generative AI System

1 code implementation1 Oct 2024

To meet this need, we introduce the Python Risk Identification Toolkit (PyRIT), an open-source framework designed to enhance red teaming efforts in GenAI systems.

Red Teaming

Retrieval-Augmented Generation for AI-Generated Content: A Survey

3 code implementations29 Feb 2024

We first classify RAG foundations according to how the retriever augments the generator, distilling the fundamental abstractions of the augmentation methodologies for various retrievers and generators.

Information Retrieval multimodal generation +3

SocratiQ: A Generative AI-Powered Learning Companion for Personalized Education and Broader Accessibility

1 code implementation1 Feb 2025

Traditional educational approaches often struggle to provide personalized and interactive learning experiences on a scale.

Computers and Society

Leveraging Speculative Sampling and KV-Cache Optimizations Together for Generative AI using OpenVINO

1 code implementation8 Nov 2023

Inference optimizations are critical for improving user experience and reducing infrastructure costs and power consumption.

Quantization Text Generation

Adversarial Diffusion Distillation

6 code implementations28 Nov 2023

We introduce Adversarial Diffusion Distillation (ADD), a novel training approach that efficiently samples large-scale foundational image diffusion models in just 1-4 steps while maintaining high image quality.

Image Generation

Programming Is Hard -- Or at Least It Used to Be: Educational Opportunities And Challenges of AI Code Generation

1 code implementation2 Dec 2022

The introductory programming sequence has been the focus of much research in computing education.

Code Generation Position