In this survey, we provide a comprehensive overview of the current research progress on LLMs within the context of continual learning (CL).
In this work, we develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion parameters.
We introduce Groma, a Multimodal Large Language Model (MLLM) with grounded and fine-grained visual perception ability.
Multimodal Large Language Models (MLLMs) have demonstrated notable capabilities in general visual understanding and reasoning tasks.
MultiBooth divides the multi-concept generation process into two phases: a single-concept learning phase and a multi-concept integration phase.
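To make the two-phase split concrete, here is a minimal, hypothetical sketch: a toy PyTorch loop in which each concept embedding is first optimized in isolation (single-concept learning), and the frozen results are then composed into one conditioning sequence (multi-concept integration). The function names, the random stand-in features, and the MSE objective are illustrative assumptions, not MultiBooth's actual implementation.

```python
# Hypothetical sketch of a two-phase recipe; names, targets, and the
# MSE objective are illustrative, not MultiBooth's actual implementation.
import torch
import torch.nn as nn

torch.manual_seed(0)
EMB_DIM = 32

def learn_single_concept(reference_feature, steps=200):
    """Phase 1: optimize one new embedding per concept, in isolation,
    to match features extracted from that concept's reference images."""
    concept_emb = nn.Parameter(torch.randn(EMB_DIM))
    opt = torch.optim.Adam([concept_emb], lr=1e-2)
    for _ in range(steps):
        loss = (concept_emb - reference_feature).pow(2).mean()
        opt.zero_grad()
        loss.backward()
        opt.step()
    return concept_emb.detach()

# Phase 1: each concept is learned independently from its own references
# (random vectors stand in for real image features here).
ref_dog = torch.randn(EMB_DIM)
ref_mug = torch.randn(EMB_DIM)
emb_dog = learn_single_concept(ref_dog)
emb_mug = learn_single_concept(ref_mug)

# Phase 2: integration -- the frozen per-concept embeddings are composed
# into a single conditioning sequence for one generation pass; the real
# method additionally constrains where each concept may appear.
prompt_condition = torch.stack([emb_dog, emb_mug])  # shape (2, EMB_DIM)
print(prompt_condition.shape)
```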
Despite recent progress in long-context language models, it remains unclear how transformer-based models manage to retrieve relevant information from arbitrary locations within a long context.
We study how to apply large language models to write grounded and organized long-form articles from scratch, with breadth and depth comparable to Wikipedia pages.
Molecular Relational Learning (MRL), which aims to understand interactions between pairs of molecules, plays a pivotal role in advancing biochemical research.
We propose AutoCrawler, a two-stage framework that leverages the hierarchical structure of HTML for progressive understanding.
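The snippet below is a minimal sketch of what such a two-stage, hierarchy-guided loop could look like (assuming lxml is installed): stage one descends the DOM one level at a time, so the model only ever inspects a single subtree, and stage two turns the located node into a reusable XPath rule. The llm_pick_subtree stand-in and the exact stage split are assumptions for illustration, not the paper's code.

```python
# Hypothetical two-stage, hierarchy-guided extraction loop in the spirit
# of AutoCrawler; the LLM is mocked with simple text matching.
from lxml import html

PAGE = """
<html><body>
  <div id="header">Site nav</div>
  <div id="product">
    <h1 class="title">Acme Widget</h1>
    <span class="price">$19.99</span>
  </div>
</body></html>
"""

def llm_pick_subtree(children, target):
    """Stand-in for an LLM call: pick the child element most likely to
    contain `target`. Here we simply match on text content."""
    for child in children:
        if target.lower() in child.text_content().lower():
            return child
    return None

def stage1_progressive_descent(root, target):
    """Stage 1: walk the DOM top-down, pruning siblings at each level,
    so the model only ever sees one subtree at a time."""
    node = root
    while True:
        child = llm_pick_subtree(list(node), target)
        if child is None:
            return node
        node = child

def stage2_generate_rule(node):
    """Stage 2: turn the located node into a reusable extraction rule
    (an XPath) that can be replayed on unseen pages of the same site."""
    return node.getroottree().getpath(node)

tree = html.fromstring(PAGE)
node = stage1_progressive_descent(tree, "$19.99")
print(stage2_generate_rule(node), "->", node.text_content().strip())
```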
Our method benefits various applications, including in-the-wild metrology, monocular SLAM, and 3D reconstruction, highlighting the versatility of the Metric3D v2 models as geometric foundation models.
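For context, the released Metric3D v2 checkpoints are advertised as loadable through torch.hub; the snippet below is a minimal sketch of that usage. The hub entry-point name and the inference signature are assumptions based on the project's published interface and should be verified against the repository's README.

```python
# Sketch of loading a Metric3D v2 checkpoint through torch.hub. The hub
# entry point and the model.inference(...) signature are assumptions
# based on the project's published interface; verify against the README.
import torch

model = torch.hub.load('yvanyin/metric3d', 'metric3d_vit_small', pretrain=True)
model.eval()

# Dummy RGB batch (B, 3, H, W); a real pipeline would resize and
# normalize the image to the resolution the checkpoint expects.
rgb = torch.zeros(1, 3, 616, 1064)

with torch.no_grad():
    pred_depth, confidence, output_dict = model.inference({'input': rgb})

print(pred_depth.shape)  # per-pixel metric depth map
```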