Trending Research

Rethinking Interpretability in the Era of Large Language Models

csinva/imodelsX • 30 Jan 2024

We highlight two emerging research priorities for LLM interpretation: using LLMs to directly analyze new datasets and to generate interactive explanations.

Interpretable Machine Learning

119

0.47 stars / hour

Paper
Code

MemGPT: Towards LLMs as Operating Systems

cpacker/memgpt • 12 Oct 2023

Large language models (LLMs) have revolutionized AI, but are constrained by limited context windows, hindering their utility in tasks like extended conversations and document analysis.

Management

9,747

0.45 stars / hour

Paper
Code

PLAYER*: Enhancing LLM-based Multi-Agent Communication and Interaction in Murder Mystery Games

alickzhu/player • 26 Apr 2024

Recent advancements in Large Language Models (LLMs) have enhanced the efficacy of agent communication and social interactions.

Multiple-choice

0.45 stars / hour

Paper
Code

OpenVoice: Versatile Instant Voice Cloning

myshell-ai/openvoice • 3 Dec 2023

The voice styles are not directly copied from and constrained by the style of the reference speaker.

Voice Cloning

24,654

0.44 stars / hour

Paper
Code

ID-Animator: Zero-Shot Identity-Preserving Human Video Generation

id-animator/id-animator • 23 Apr 2024

Based on this pipeline, a random face reference training method is further devised to precisely capture the ID-relevant embeddings from reference images, thus improving the fidelity and generalization capacity of our model for ID-specific video generation.

Attribute Video Generation

160

0.44 stars / hour

Paper
Code

TensorIR: An Abstraction for Automatic Tensorized Program Optimization

mlc-ai/web-llm • 9 Jul 2022

Finally, we build an end-to-end framework on top of our abstraction to automatically optimize deep learning models for given tensor computation primitives.

BIG-bench Machine Learning

9,650

0.41 stars / hour

Paper
Code

Vision-based 3D occupancy prediction in autonomous driving: a review and outlook

zya3d/awesome-3d-occupancy-prediction • 4 May 2024

In recent years, autonomous driving has garnered escalating attention for its potential to relieve drivers' burdens and improve driving safety.

Autonomous Driving

0.40 stars / hour

Paper
Code

Foundation Models for Video Understanding: A Survey

neelumadan/vifm_survey • 6 May 2024

Additionally, we offer an in-depth performance analysis of these models for the 6 most common video tasks.

Video Understanding

0.39 stars / hour

Paper
Code

emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

alibaba-damo-academy/FunASR • 23 Dec 2023

To the best of our knowledge, emotion2vec is the first universal representation model in various emotion-related tasks, filling a gap in the field.

Self-Supervised Learning, Sentiment Analysis (+1 more)

3,588

0.39 stars / hour

Paper
Code

ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving

JackAILab/ConsistentID • 25 Apr 2024

ConsistentID comprises two key components: a multimodal facial prompt generator that combines facial features, corresponding facial descriptions and the overall facial context to enhance precision in facial details, and an ID-preservation network optimized through the facial attention localization strategy, aimed at preserving ID consistency in facial regions.

463

0.39 stars / hour

Paper
Code