Date Understanding

8 papers with code • 0 benchmarks • 0 datasets

This task has no description! Would you like to contribute one?

Most implemented papers

Large Language Models are Zero-Shot Reasoners

kojima-takeshi188/zero_shot_cot 24 May 2022

Pretrained large language models (LLMs) are widely used in many sub-fields of natural language processing (NLP) and generally known as excellent few-shot learners with task-specific exemplars.

Scaling Language Models: Methods, Analysis & Insights from Training Gopher

allenai/dolma NA 2021

Language modelling provides a step towards intelligent communication systems by harnessing large repositories of written human knowledge to better predict and understand the world.

Training Compute-Optimal Large Language Models

karpathy/llama2.c 29 Mar 2022

We investigate the optimal model size and number of tokens for training a transformer language model under a given compute budget.

Dataset and Baseline System for Multi-lingual Extraction and Normalization of Temporal and Numerical Expressions

Microsoft/Recognizers-Text 31 Mar 2023

Temporal and numerical expression understanding is of great importance in many downstream Natural Language Processing (NLP) and Information Retrieval (IR) tasks.

EchoPrompt: Instructing the Model to Rephrase Queries for Improved In-context Learning

rajasekharmekala/query-rephrasing-subtask-cot 16 Sep 2023

On average, EchoPrompt improves the Zero-shot-CoT performance of code-davinci-002 by 5% in numerical tasks and 13% in reading comprehension tasks.

ReGAL: Refactoring Programs to Discover Generalizable Abstractions

esteng/regal_program_learning 29 Jan 2024

While large language models (LLMs) are increasingly being used for program synthesis, they lack the global view needed to develop useful abstractions; they generally predict programs one at a time, often repeating the same functionality.

Understanding the Weakness of Large Language Model Agents within a Complex Android Environment

androidarenaagent/androidarena 9 Feb 2024

These challenges motivate AndroidArena, an environment and benchmark designed to evaluate LLM agents on a modern operating system.