1 code implementation • 19 Feb 2025 • Zenan Li, Zhaoyu Li, Wen Tang, Xian Zhang, Yuan YAO, Xujie Si, Fan Yang, Kaiyu Yang, Xiaoxing Ma
Large language models (LLMs) can prove mathematical theorems formally by generating proof steps (\textit{a. k. a.}
no code implementations • 12 Feb 2025 • Andrew Cohen, Andrey Gromov, Kaiyu Yang, Yuandong Tian
In this setting, the representations and the dynamics learned by the model are interpretable.
1 code implementation • 11 Feb 2025 • Yong Lin, Shange Tang, Bohan Lyu, Jiayun Wu, Hongzhou Lin, Kaiyu Yang, Jia Li, Mengzhou Xia, Danqi Chen, Sanjeev Arora, Chi Jin
On the miniF2F benchmark, it achieves a 57. 6% success rate (Pass@32), exceeding the previous best open-source model by 7. 6%.
no code implementations • 20 Dec 2024 • Kaiyu Yang, Gabriel Poesia, Jingxuan He, Wenda Li, Kristin Lauter, Swarat Chaudhuri, Dawn Song
AI for Mathematics (AI4Math) is not only intriguing intellectually but also crucial for AI-driven discovery in science, engineering, and beyond.
1 code implementation • 27 May 2024 • Logan Murphy, Kaiyu Yang, Jialiang Sun, Zhaoyu Li, Anima Anandkumar, Xujie Si
One challenge in Euclidean geometry is that informal proofs rely on diagrams, leaving gaps in texts that are hard to formalize.
2 code implementations • 18 Apr 2024 • Peiyang Song, Kaiyu Yang, Anima Anandkumar
Neural theorem proving combines large language models (LLMs) with proof assistants such as Lean, where the correctness of formal proofs can be rigorously verified, leaving no room for hallucination.
1 code implementation • 15 Apr 2024 • Zhaoyu Li, Jialiang Sun, Logan Murphy, Qidong Su, Zenan Li, Xian Zhang, Kaiyu Yang, Xujie Si
Theorem proving is a fundamental aspect of mathematics, spanning from informal reasoning in natural language to rigorous derivations in formal systems.
1 code implementation • 15 Jan 2024 • Dan Zhang, Ziniu Hu, Sining Zhoubian, Zhengxiao Du, Kaiyu Yang, Zihan Wang, Yisong Yue, Yuxiao Dong, Jie Tang
To bridge these gaps, we introduce SciInstruct, a suite of scientific instructions for training scientific language models capable of college-level scientific reasoning.
3 code implementations • NeurIPS 2023 • Kaiyu Yang, Aidan M. Swope, Alex Gu, Rahul Chalamala, Peiyang Song, Shixing Yu, Saad Godil, Ryan Prenger, Anima Anandkumar
Using this data, we develop ReProver (Retrieval-Augmented Prover): an LLM-based prover augmented with retrieval for selecting premises from a vast math library.
1 code implementation • CVPR 2023 • Alexander Raistrick, Lahav Lipson, Zeyu Ma, Lingjie Mei, Mingzhe Wang, Yiming Zuo, Karhan Kayan, Hongyu Wen, Beining Han, Yihan Wang, Alejandro Newell, Hei Law, Ankit Goyal, Kaiyu Yang, Jia Deng
We introduce Infinigen, a procedural generator of photorealistic 3D scenes of the natural world.
1 code implementation • 25 May 2022 • Kaiyu Yang, Jia Deng, Danqi Chen
In this paper, we present a novel stepwise method, NLProofS (Natural Language Proof Search), which learns to generate relevant steps conditioning on the hypothesis.
no code implementations • 25 Feb 2022 • Yuan Gao, Kaiyu Yang, Yuanlong Chen, Min Liu, Noureddine El Karoui
We establish a general optimization framework for the design of automated bidding agent in dynamic online marketplaces.
2 code implementations • 23 Nov 2021 • Kaiyu Yang, Jia Deng
In this work, we ask how we can build a rule-based system that can reason with natural language input but without the manual construction of rules.
1 code implementation • 10 Mar 2021 • Kaiyu Yang, Jacqueline Yau, Li Fei-Fei, Jia Deng, Olga Russakovsky
In this paper, we explore the effects of face obfuscation on the popular ImageNet challenge visual recognition benchmark.
2 code implementations • NeurIPS 2020 • Ankit Goyal, Kaiyu Yang, Dawei Yang, Jia Deng
The 3D scenes in our dataset come in minimally contrastive pairs: two scenes in a pair are almost identical, but a spatial relation holds in one and fails in the other.
Ranked #1 on
Spatial Relation Recognition
on Rel3D
3 code implementations • NeurIPS 2020 • Kaiyu Yang, Jia Deng
Based on our transition system, we develop a strongly incremental parser.
Ranked #1 on
Constituency Parsing
on CTB5
no code implementations • 16 Dec 2019 • Kaiyu Yang, Klint Qinami, Li Fei-Fei, Jia Deng, Olga Russakovsky
Computer vision technology is being used by many but remains representative of only a few.
1 code implementation • ICCV 2019 • Kaiyu Yang, Olga Russakovsky, Jia Deng
Understanding the spatial relations between objects in images is a surprisingly challenging task.
1 code implementation • 21 May 2019 • Kaiyu Yang, Jia Deng
Proof assistants offer a formalism that resembles human mathematical reasoning, representing theorems in higher-order logic and proofs as high-level tactics.
Ranked #1 on
Automated Theorem Proving
on CoqGym
46 code implementations • 22 Mar 2016 • Alejandro Newell, Kaiyu Yang, Jia Deng
This work introduces a novel convolutional network architecture for the task of human pose estimation.
Ranked #1 on
Pose Estimation
on FLIC Wrists