Trending Research

MolTC: Towards Molecular Relational Modeling In Language Models

MangoKiller/MolTC • • 6 Feb 2024

Molecular Relational Learning (MRL), aiming to understand interactions between molecular pairs, plays a pivotal role in advancing biochemical research.

Relational Reasoning

190

0.47 stars / hour

Paper
Code

X-LoRA: Mixture of Low-Rank Adapter Experts, a Flexible Framework for Large Language Models with Applications in Protein Mechanics and Molecular Design

ericlbuehler/mistral.rs • 11 Feb 2024

Starting with a set of pre-trained LoRA adapters, our gating strategy uses the hidden states to dynamically mix adapted layers, allowing the resulting X-LoRA model to draw upon different capabilities and create never-before-used deep layer-wise combinations to solve tasks.

graph construction Knowledge Graphs +3

1,417

0.46 stars / hour

Paper
Code

UFO: A UI-Focused Agent for Windows OS Interaction

microsoft/UFO • 8 Feb 2024

We introduce UFO, an innovative UI-Focused agent to fulfill user requests tailored to applications on Windows OS, harnessing the capabilities of GPT-Vision.

Navigate

4,482

0.43 stars / hour

Paper
Code

LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders

mcgill-nlp/llm2vec • • 9 Apr 2024

We outperform encoder-only models by a large margin on word-level tasks and reach a new unsupervised state-of-the-art performance on the Massive Text Embeddings Benchmark (MTEB).

Contrastive Learning Decoder

483

0.42 stars / hour

Paper
Code

InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions

nttmdlab-nlp/instructdoc • 24 Jan 2024

We study the problem of completing various visual document understanding (VDU) tasks, e. g., question answering and information extraction, on real-world documents through human-written instructions.

document understanding Question Answering +1

119

0.41 stars / hour

Paper
Code

CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data

apple/corenet • • 24 Apr 2024

Contrastive learning has emerged as a transformative method for learning effective visual representations through the alignment of image and text embeddings.

Contrastive Learning

6,328

0.41 stars / hour

Paper
Code

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

tencentarc/instantmesh • • 10 Apr 2024

We present InstantMesh, a feed-forward framework for instant 3D mesh generation from a single image, featuring state-of-the-art generation quality and significant training scalability.

Image to 3D

1,844

0.40 stars / hour

Paper
Code

NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment

nvidia/nemo-aligner • • 2 May 2024

However, building efficient tools to perform alignment can be challenging, especially for the largest and most competent LLMs which often contain tens or hundreds of billions of parameters.

241

0.39 stars / hour

Paper
Code

Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding

infini-ai-lab/sequoia • • 19 Feb 2024

This paper introduces Sequoia, a scalable, robust, and hardware-aware algorithm for speculative decoding.

253

0.39 stars / hour

Paper
Code

OpenVoice: Versatile Instant Voice Cloning

myshell-ai/openvoice • • 3 Dec 2023

The voice styles are not directly copied from and constrained by the style of the reference speaker.

Voice Cloning

24,206

0.39 stars / hour

Paper
Code