Search Results

HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face

1 code implementation NeurIPS 2023

Solving complicated AI tasks with different domains and modalities is a key step toward artificial general intelligence.

Automatic Machine Learning Model Selection Model Selection +2

torchdistill Meets Hugging Face Libraries for Reproducible, Coding-Free Deep Learning Studies: A Case Study on NLP

1 code implementation26 Oct 2023

Reproducibility in scientific work has been becoming increasingly important in research communities such as machine learning, natural language processing, and computer vision communities due to the rapid development of the research domains supported by recent advances in deep learning.

Image Classification Knowledge Distillation +4

REALM: Retrieval-Augmented Language Model Pre-Training

6 code implementations10 Feb 2020

Language model pre-training has been shown to capture a surprising amount of world knowledge, crucial for NLP tasks such as question answering.

Language Modeling Language Modelling +5

Spacerini: Plug-and-play Search Engines with Pyserini and Hugging Face

1 code implementation28 Feb 2023

We present Spacerini, a tool that integrates the Pyserini toolkit for reproducible information retrieval research with Hugging Face to enable the seamless construction and deployment of interactive search engines.

Information Retrieval Retrieval

Denoising Diffusion Implicit Models

30 code implementations ICLR 2021

Denoising diffusion probabilistic models (DDPMs) have achieved high quality image generation without adversarial training, yet they require simulating a Markov chain for many steps to produce a sample.

Denoising Image Generation

GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration

1 code implementation2 Jun 2023

We discuss how Pyserini - a widely used toolkit for reproducible IR research can be integrated with the Hugging Face ecosystem of open-source AI libraries and artifacts.

Information Retrieval Retrieval

Learning Transferable Visual Models From Natural Language Supervision

82 code implementations26 Feb 2021

State-of-the-art computer vision systems are trained to predict a fixed set of predetermined object categories.

 Ranked #1 on Zero-Shot Learning on COCO-MLT (using extra training data)

Benchmarking Few-Shot Image Classification +20

Linear-time Minimum Bayes Risk Decoding with Reference Aggregation

2 code implementations6 Feb 2024

Minimum Bayes Risk (MBR) decoding is a text generation technique that has been shown to improve the quality of machine translations, but is expensive, even if a sampling-based approximation is used.

Text Generation

Towards Semantic Versioning of Open Pre-trained Language Model Releases on Hugging Face

1 code implementation16 Sep 2024

Given the knowledge gap on current PTLM release practices, our empirical study uses a mixed-methods approach to analyze the releases of 52, 227 PTLMs on the most well-known model registry, HF.

Software Engineering