Search Results for author: Ernest Davis

Found 19 papers, 0 papers with code

A Gentle Introduction to Deep Nets and Opportunities for the Future

no code implementations • ACL 2022 • Kenneth Church, Valia Kordoni, Gary Marcus, Ernest Davis, Yanjun Ma, Zeyu Chen

The first half of this tutorial will make deep nets more accessible to a broader audience, following “Deep Nets for Poets” and “A Gentle Introduction to Fine-Tuning.” We will also introduce GFT (general fine tuning), a little language for fine tuning deep nets with short (one line) programs that are as easy to code as regression in statistics packages such as R using glm (general linear models).

Paper
Add Code

Testing GPT-4 with Wolfram Alpha and Code Interpreter plug-ins on math and science problems

no code implementations • 10 Aug 2023 • Ernest Davis, Scott Aaronson

This report describes a test of the large language model GPT-4 with the Wolfram Alpha and the Code Interpreter plug-ins on 105 original problems in science and math, at the high school and college levels, carried out in June-August 2023.

Language Modelling Large Language Model +1

Paper
Add Code

Benchmarks for Automated Commonsense Reasoning: A Survey

no code implementations • 9 Feb 2023 • Ernest Davis

We discuss the gaps in the existing benchmarks and aspects of commonsense reasoning that are not addressed in any existing benchmark.

Common Sense Reasoning

Paper
Add Code

Mathematics, word problems, common sense, and artificial intelligence

no code implementations • 23 Jan 2023 • Ernest Davis

The paper discusses the capacities and limitations of current artificial intelligence (AI) technology to solve word problems that combine elementary knowledge with commonsense reasoning.

Common Sense Reasoning

Paper
Add Code

Limits of an AI program for solving college math problems

no code implementations • 14 Aug 2022 • Ernest Davis

Drori et al. (2022) report that "A neural network solves, explains, and generates university math problems by program synthesis and few-shot learning at human level ... [It] automatically answers 81\% of university-level mathematics problems."

Few-Shot Learning Math +1

Paper
Add Code

A very preliminary analysis of DALL-E 2

no code implementations • 25 Apr 2022 • Gary Marcus, Ernest Davis, Scott Aaronson

The DALL-E 2 system generates original synthetic images corresponding to an input text as caption.

Common Sense Reasoning

Paper
Add Code

Pragmatic constraints and pronoun reference disambiguation: the possible and the impossible

no code implementations • 3 Apr 2022 • Ernest Davis

Pronoun disambiguation in understanding text and discourse often requires the application of both general pragmatic knowledge and context-specific information.

Paper
Add Code

Physical Reasoning in an Open World

no code implementations • 22 Jan 2022 • Zhuoran Zeng, Ernest Davis

Most work on physical reasoning, both in artificial intelligence and in cognitive science, has focused on closed-world reasoning, in which it is assumed that the problem specification specifies all relevant objects and substance, all their relations in an initial situation, and all exogenous events.

valid

Paper
Add Code

The Defeat of the Winograd Schema Challenge

no code implementations • 7 Jan 2022 • Vid Kocijan, Ernest Davis, Thomas Lukasiewicz, Gary Marcus, Leora Morgenstern

The Winograd Schema Challenge - a set of twin sentences involving pronoun reference disambiguation that seem to require the use of commonsense knowledge - was proposed by Hector Levesque in 2011.

Paper
Add Code

Deep Learning and Mathematical Intuition: A Review of (Davies et al. 2021)

no code implementations • 8 Dec 2021 • Ernest Davis

A recent paper by Davies et al (2021) describes how deep learning (DL) technology was used to find plausible hypotheses that have led to two original mathematical results: one in knot theory, one in representation theory.

Paper
Add Code

A Flawed Dataset for Symbolic Equation Verification

no code implementations • 24 May 2021 • Ernest Davis

Arabshahi, Singh, and Anandkumar (2018) propose a method for creating a dataset of symbolic mathematical equations for the tasks of symbolic equation verification and equation completion.

Paper
Add Code

Unanswerable Questions about Images and Texts

no code implementations • 25 Jan 2021 • Ernest Davis

Questions about a text or an image that cannot be answered raise distinctive issues for an AI.

Question Answering Visual Question Answering

Paper
Add Code

The test set for the TransCoder system

no code implementations • 1 Aug 2020 • Ernest Davis

The TransCoder system translates source code between Java, C++, and Python 3.

Paper
Add Code

A Review of Winograd Schema Challenge Datasets and Approaches

no code implementations • 23 Apr 2020 • Vid Kocijan, Thomas Lukasiewicz, Ernest Davis, Gary Marcus, Leora Morgenstern

The Winograd Schema Challenge is both a commonsense reasoning and natural language understanding challenge, introduced as an alternative to the Turing test.

Natural Language Understanding

Paper
Add Code

The Use of Deep Learning for Symbolic Integration: A Review of (Lample and Charton, 2019)

no code implementations • 12 Dec 2019 • Ernest Davis

Lample and Charton (2019) describe a system that uses deep learning technology to compute symbolic, indefinite integrals, and to find symbolic solutions to first- and second-order ordinary differential equations, when the solutions are elementary functions.

Paper
Add Code

Winograd Schemas and Machine Translation

no code implementations • 5 Aug 2016 • Ernest Davis

A Winograd schema is a pair of sentences that differ in a single word and that contain an ambiguous pronoun whose referent is different in the two sentences and requires the use of commonsense knowledge or world knowledge to disambiguate.

Machine Translation Sentence +2

Paper
Add Code

The Scope and Limits of Simulation in Cognitive Models

no code implementations • 16 Jun 2015 • Ernest Davis, Gary Marcus

It has been proposed that human physical reasoning consists largely of running "physics engines in the head" in which the future trajectory of the physical system under consideration is computed precisely using accurate scientific theories.

Paper
Add Code

The Limitations of Standardized Science Tests as Benchmarks for Artificial Intelligence Research: Position Paper

no code implementations • 6 Nov 2014 • Ernest Davis

In particular, standardized tests do not test knowledge that is obvious for people; none of this knowledge can be assumed in AI systems.

Position

Paper
Add Code

The Relevance of Proofs of the Rationality of Probability Theory to Automated Reasoning and Cognitive Models

no code implementations • 4 Oct 2013 • Ernest Davis

A number of well-known theorems, such as Cox's theorem and de Finetti's theorem.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.