Rethinking Interpretability in the Era of Large Language Models

csinva/imodelsX 30 Jan 2024

We highlight two emerging research priorities for LLM interpretation: using LLMs to directly analyze new datasets and to generate interactive explanations.

Interpretable Machine Learning

119
0.47 stars / hour

MemGPT: Towards LLMs as Operating Systems

cpacker/memgpt 12 Oct 2023

Large language models (LLMs) have revolutionized AI, but are constrained by limited context windows, hindering their utility in tasks like extended conversations and document analysis.

Management

9,747
0.45 stars / hour

PLAYER*: Enhancing LLM-based Multi-Agent Communication and Interaction in Murder Mystery Games

alickzhu/player 26 Apr 2024

Recent advancements in Large Language Models (LLMs) have enhanced the efficacy of agent communication and social interactions.

Multiple-choice

42
0.45 stars / hour

OpenVoice: Versatile Instant Voice Cloning

myshell-ai/openvoice 3 Dec 2023

The voice styles are not directly copied from and constrained by the style of the reference speaker.

Voice Cloning

24,654
0.44 stars / hour

ID-Animator: Zero-Shot Identity-Preserving Human Video Generation

id-animator/id-animator 23 Apr 2024

Based on this pipeline, a random face reference training method is further devised to precisely capture the ID-relevant embeddings from reference images, thus improving the fidelity and generalization capacity of our model for ID-specific video generation.

Attribute Video Generation

160
0.44 stars / hour

TensorIR: An Abstraction for Automatic Tensorized Program Optimization

mlc-ai/web-llm 9 Jul 2022

Finally, we build an end-to-end framework on top of our abstraction to automatically optimize deep learning models for given tensor computation primitives.

BIG-bench Machine Learning

9,650
0.41 stars / hour

Vision-based 3D occupancy prediction in autonomous driving: a review and outlook

zya3d/awesome-3d-occupancy-prediction 4 May 2024

In recent years, autonomous driving has garnered escalating attention for its potential to relieve drivers' burdens and improve driving safety.

Autonomous Driving

43
0.40 stars / hour

Foundation Models for Video Understanding: A Survey

neelumadan/vifm_survey 6 May 2024

Additionally, we offer an in-depth performance analysis of these models for the 6 most common video tasks.

Video Understanding

19
0.39 stars / hour

emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

alibaba-damo-academy/FunASR 23 Dec 2023

To the best of our knowledge, emotion2vec is the first universal representation model in various emotion-related tasks, filling a gap in the field.

Self-Supervised Learning Sentiment Analysis +1

3,588
0.39 stars / hour

ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving

JackAILab/ConsistentID 25 Apr 2024

ConsistentID comprises two key components: a multimodal facial prompt generator that combines facial features, corresponding facial descriptions and the overall facial context to enhance precision in facial details, and an ID-preservation network optimized through the facial attention localization strategy, aimed at preserving ID consistency in facial regions.

463
0.39 stars / hour