MEDITRON-70B: Scaling Medical Pretraining for Large Language Models

epfllm/meditron 27 Nov 2023

Large language models (LLMs) can potentially democratize access to medical knowledge.

 Ranked #1 on Multiple Choice Question Answering (MCQA) on MedMCQA (Dev Set (Acc-%) metric)

Conditional Text Generation Multiple Choice Question Answering (MCQA)

929
4.79 stars / hour

TaskWeaver: A Code-First Agent Framework

microsoft/taskweaver 29 Nov 2023

TaskWeaver provides support for rich data structures, flexible plugin usage, and dynamic plugin selection, and leverages LLM coding capabilities for complex logic.

Natural Language Understanding

651
3.45 stars / hour

Improving Sample Quality of Diffusion Models Using Self-Attention Guidance

lllyasviel/fooocus ICCV 2023

Denoising diffusion models (DDMs) have attracted attention for their exceptional generation quality and diversity.

Denoising Image Generation

22,440
2.55 stars / hour

GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation

baaivision/GeoDream 29 Nov 2023

We justify that the refined 3D geometric priors aid in the 3D-aware capability of 2D diffusion priors, which in turn provides superior guidance for the refinement of 3D geometric priors.

Text to 3D

244
2.46 stars / hour

On Bringing Robots Home

notmahi/dobb-e 27 Nov 2023

We use the Stick to collect 13 hours of data in 22 homes of New York City, and train Home Pretrained Representations (HPR).

314
2.18 stars / hour

SeamlessM4T: Massively Multilingual & Multimodal Machine Translation

facebookresearch/seamless_communication 22 Aug 2023

What does it take to create the Babel Fish, a tool that can help individuals translate speech between any two languages?

Automatic Speech Recognition Speech-to-Speech Translation +3

7,420
1.92 stars / hour

LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS

VITA-Group/LightGaussian 28 Nov 2023

Recent advancements in real-time neural rendering using point-based techniques have paved the way for the widespread adoption of 3D representations.

Network Pruning Neural Rendering +2

161
1.42 stars / hour

Qwen Technical Report

QwenLM/Qwen-7B 28 Sep 2023

Large language models (LLMs) have revolutionized the field of artificial intelligence, enabling natural language processing tasks that were previously thought to be exclusive to humans.

Language Modelling Large Language Model +1

6,945
1.13 stars / hour

Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models

qwenlm/qwen-audio 14 Nov 2023

Recently, instruction-following audio-language models have received broad attention for audio interaction with humans.

Instruction Following

543
1.02 stars / hour

YUAN 2.0: A Large Language Model with Localized Filtering-based Attention

ieit-yuan/yuan-2.0 27 Nov 2023

In this work, we develop and release Yuan 2. 0, a series of large language models with parameters ranging from 2. 1 billion to 102. 6 billion.

Code Generation Language Modelling +2

433
0.94 stars / hour