Math

525 papers with code • 0 benchmarks • 5 datasets

This task has no description! Would you like to contribute one?

Libraries

Use these libraries to find Math models and implementations
3 papers
760
2 papers
38,657
See all 7 libraries.

Most implemented papers

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

microsoft/guidance 28 Jan 2022

We explore how generating a chain of thought -- a series of intermediate reasoning steps -- significantly improves the ability of large language models to perform complex reasoning.

GPT-4 Technical Report

openai/evals Preprint 2023

We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs.

AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

vllm-project/vllm 1 Jun 2023

We propose Activation-aware Weight Quantization (AWQ), a hardware-friendly approach for LLM low-bit weight-only quantization.

Translating Math Formula Images to LaTeX Sequences Using Deep Neural Networks with Sequence-level Training

tatp22/multidim-positional-encoding 29 Aug 2019

The encoder is a convolutional neural network (CNN) that transforms images into a group of feature maps.

PaLM: Scaling Language Modeling with Pathways

lucidrains/CoCa-pytorch Google Research 2022

To further our understanding of the impact of scale on few-shot learning, we trained a 540-billion parameter, densely activated, Transformer language model, which we call Pathways Language Model PaLM.

Mistral 7B

mistralai/mistral-src 10 Oct 2023

We introduce Mistral 7B v0. 1, a 7-billion-parameter language model engineered for superior performance and efficiency.

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

thudm/chatglm-6b 18 Jun 2024

We introduce ChatGLM, an evolving family of large language models that we have been developing over time.

The Matrix Calculus You Need For Deep Learning

parrt/bookish 5 Feb 2018

This paper is an attempt to explain all the matrix calculus you need in order to understand the training of deep neural networks.

Full Page Handwriting Recognition via Image to Sequence Extraction

kingyiusuen/image-to-latex 11 Mar 2021

We present a Neural Network based Handwritten Text Recognition (HTR) model architecture that can be trained to recognize full pages of handwritten or printed text without image segmentation.

Training Verifiers to Solve Math Word Problems

openai/grade-school-math 27 Oct 2021

State-of-the-art language models can match human performance on many tasks, but they still struggle to robustly perform multi-step mathematical reasoning.