Math

318 papers with code • 0 benchmarks • 0 datasets

Benchmarks

No evaluation results yet.

Libraries

Use these libraries to find Math models and implementations

lastmile-ai/aiconfig • 3 papers • 857 stars

huggingface/transformers • 2 papers • 126,027 stars

hiyouga/llama-factory • 2 papers • 21,909 stars

lupantech/chameleon-llm • 2 papers • 1,020 stars

See all 6 libraries.

Most implemented papers

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

guidance-ai/guidance • 28 Jan 2022

We explore how generating a chain of thought -- a series of intermediate reasoning steps -- significantly improves the ability of large language models to perform complex reasoning.

GPT-4 Technical Report

openai/evals • Preprint 2023

We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs.

PaLM: Scaling Language Modeling with Pathways

lucidrains/CoCa-pytorch • Google Research 2022

To further our understanding of the impact of scale on few-shot learning, we trained a 540-billion parameter, densely activated, Transformer language model, which we call Pathways Language Model (PaLM).

The Matrix Calculus You Need For Deep Learning

parrt/bookish • 5 Feb 2018

This paper is an attempt to explain all the matrix calculus you need in order to understand the training of deep neural networks.

Full Page Handwriting Recognition via Image to Sequence Extraction

kingyiusuen/image-to-latex • 11 Mar 2021

We present a Neural Network based Handwritten Text Recognition (HTR) model architecture that can be trained to recognize full pages of handwritten or printed text without image segmentation.

AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

vllm-project/vllm • 1 Jun 2023

We propose to search for the optimal per-channel scaling that protects the salient weights by observing the activations, not the weights.

Mistral 7B

mistralai/mistral-src • 10 Oct 2023

We introduce Mistral 7B v0.1, a 7-billion-parameter language model engineered for superior performance and efficiency.

Measuring Mathematical Problem Solving With the MATH Dataset

hendrycks/math • 5 Mar 2021

To facilitate future research and increase accuracy on MATH, we also contribute a large auxiliary pretraining dataset which helps teach models the fundamentals of mathematics.
