1 code implementation • ACL 2022 • Zheng Gong, Kun Zhou, Xin Zhao, Jing Sha, Shijin Wang, Ji-Rong Wen
In this paper, we study how to continually pre-train language models to improve their understanding of math problems.
1 code implementation • 10 Mar 2024 • Liyang He, Zhenya Huang, Jiayu Liu, Enhong Chen, Fei Wang, Jing Sha, Shijin Wang
In this paper, we propose an innovative Bit-mask Robust Contrastive knowledge Distillation (BRCD) method, specifically devised for the distillation of semantic hashing models.
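A minimal sketch of the contrastive-distillation idea, not the paper's exact objective: the teacher's hash bits are randomly masked as an augmentation, and an InfoNCE-style loss pulls each student code toward its own masked teacher code. The function name, mask ratio, and temperature here are illustrative assumptions.

import torch
import torch.nn.functional as F

def bitmask_contrastive_distill_loss(student_codes, teacher_codes,
                                     mask_ratio=0.1, tau=0.5):
    # Bit-mask augmentation: randomly zero a fraction of teacher bits.
    mask = (torch.rand_like(teacher_codes) > mask_ratio).float()
    t = F.normalize(teacher_codes * mask, dim=1)
    s = F.normalize(student_codes, dim=1)
    logits = s @ t.t() / tau               # batch-wise similarity matrix
    labels = torch.arange(s.size(0))       # positives sit on the diagonal
    return F.cross_entropy(logits, labels)

# Toy usage with random tensors standing in for real model outputs.
student = torch.randn(8, 64, requires_grad=True)
teacher = torch.sign(torch.randn(8, 64))   # binary teacher hash codes
bitmask_contrastive_distill_loss(student, teacher).backward()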
no code implementations • 19 Jun 2023 • Wayne Xin Zhao, Kun Zhou, Beichen Zhang, Zheng Gong, Zhipeng Chen, Yuanhang Zhou, Ji-Rong Wen, Jing Sha, Shijin Wang, Cong Liu, Guoping Hu
Specifically, we construct a Mixture-of-Experts (MoE) architecture for modeling mathematical text, so as to capture the common mathematical knowledge across tasks.
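As a rough illustration of that design, the sketch below implements a token-level mixture-of-experts feed-forward layer in PyTorch, with a softmax gate mixing dense experts; the expert count, layer sizes, and dense (rather than sparse top-k) routing are assumptions made for brevity, not the paper's configuration.

import torch
import torch.nn as nn

class MoEFeedForward(nn.Module):
    """A softmax gate mixes the outputs of several expert MLPs per token."""
    def __init__(self, d_model=256, d_hidden=512, n_experts=4):
        super().__init__()
        self.gate = nn.Linear(d_model, n_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_hidden), nn.GELU(),
                          nn.Linear(d_hidden, d_model))
            for _ in range(n_experts))

    def forward(self, x):                                  # (batch, seq, d)
        weights = torch.softmax(self.gate(x), dim=-1)      # (batch, seq, E)
        outs = torch.stack([e(x) for e in self.experts], dim=-1)
        return (outs * weights.unsqueeze(2)).sum(dim=-1)   # (batch, seq, d)

print(MoEFeedForward()(torch.randn(2, 16, 256)).shape)     # [2, 16, 256]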
1 code implementation • NeurIPS 2023 • Beichen Zhang, Kun Zhou, Xilin Wei, Wayne Xin Zhao, Jing Sha, Shijin Wang, Ji-Rong Wen
Based on this finding, we propose a new approach, DELI, that deliberates over the reasoning steps with tool interfaces.
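One hedged reading of such a deliberation loop, with hypothetical generate and run_tool callables standing in for the model and the tool interface: draft an answer, let the tool execute the computation, and stop once two consecutive rounds agree. This sketches the general pattern, not DELI itself.

import ast, operator

def deliberate(question, generate, run_tool, max_rounds=3):
    answer = None
    for _ in range(max_rounds):
        draft = generate(question, previous=answer)  # free-form reasoning
        checked = run_tool(draft)                    # tool checks the math
        if checked == answer:                        # answer has stabilized
            return checked
        answer = checked
    return answer

# Toy stand-ins: `generate` emits an arithmetic expression and the "tool"
# evaluates it safely via the ast module (a calculator interface).
OPS = {ast.Add: operator.add, ast.Sub: operator.sub,
       ast.Mult: operator.mul, ast.Div: operator.truediv}

def calc(expr):
    def ev(node):
        if isinstance(node, ast.BinOp):
            return OPS[type(node.op)](ev(node.left), ev(node.right))
        if isinstance(node, ast.Constant):
            return node.value
        raise ValueError("unsupported expression")
    return ev(ast.parse(expr, mode="eval").body)

print(deliberate("What is 12*(3+4)?", lambda q, previous: "12*(3+4)", calc))  # 84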
1 code implementation • 13 Jun 2022 • Wayne Xin Zhao, Kun Zhou, Zheng Gong, Beichen Zhang, Yuanhang Zhou, Jing Sha, Zhigang Chen, Shijin Wang, Cong Liu, Ji-Rong Wen
Considering the complex nature of mathematical texts, we design a novel curriculum pre-training approach for improving the learning of mathematical PLMs, consisting of both basic and advanced courses.
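A minimal sketch of such a basic-to-advanced schedule, with the two course lists and the batch size as assumed placeholders rather than the paper's actual curriculum:

def curriculum_batches(basic_course, advanced_course, batch_size=4):
    # Stream the basic-course examples first, then the advanced ones.
    for course in (basic_course, advanced_course):
        for i in range(0, len(course), batch_size):
            yield course[i:i + batch_size]

# Toy usage: short formulas as the basic course, derivations as advanced.
basic = [f"basic-{i}" for i in range(6)]
advanced = [f"advanced-{i}" for i in range(6)]
for batch in curriculum_batches(basic, advanced):
    print(batch)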