Date Understanding

8 papers with code • 0 benchmarks • 0 datasets

Most implemented papers

Large Language Models are Zero-Shot Reasoners

kojima-takeshi188/zero_shot_cot • 24 May 2022

Pretrained large language models (LLMs) are widely used in many sub-fields of natural language processing (NLP) and generally known as excellent few-shot learners with task-specific exemplars.

Scaling Language Models: Methods, Analysis & Insights from Training Gopher

allenai/dolma • NA 2021

Language modelling provides a step towards intelligent communication systems by harnessing large repositories of written human knowledge to better predict and understand the world.

Training Compute-Optimal Large Language Models

karpathy/llama2.c • 29 Mar 2022

We investigate the optimal model size and number of tokens for training a transformer language model under a given compute budget.

Dataset and Baseline System for Multi-lingual Extraction and Normalization of Temporal and Numerical Expressions

Microsoft/Recognizers-Text • 31 Mar 2023

Temporal and numerical expression understanding is of great importance in many downstream Natural Language Processing (NLP) and Information Retrieval (IR) tasks.

EchoPrompt: Instructing the Model to Rephrase Queries for Improved In-context Learning

rajasekharmekala/query-rephrasing-subtask-cot • 16 Sep 2023

On average, EchoPrompt improves the Zero-shot-CoT performance of code-davinci-002 by 5% in numerical tasks and 13% in reading comprehension tasks.

ReGAL: Refactoring Programs to Discover Generalizable Abstractions

esteng/regal_program_learning • 29 Jan 2024

While large language models (LLMs) are increasingly being used for program synthesis, they lack the global view needed to develop useful abstractions; they generally predict programs one at a time, often repeating the same functionality.

Understanding the Weakness of Large Language Model Agents within a Complex Android Environment

androidarenaagent/androidarena • 9 Feb 2024

These challenges motivate AndroidArena, an environment and benchmark designed to evaluate LLM agents on a modern operating system.
