Search Results for author: Zui Chen

Found 9 papers, 5 papers with code

Brain-Inspired Two-Stage Approach: Enhancing Mathematical Reasoning by Imitating Human Thought Processes

no code implementations • 23 Feb 2024 • Yezeng Chen, Zui Chen, Yi Zhou

Although large language models demonstrate emergent abilities in solving math word problems, there is a challenging task in complex multi-step mathematical reasoning tasks.

Math Mathematical Reasoning

Paper
Add Code

An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning

1 code implementation • 23 Feb 2024 • Zui Chen, Yezeng Chen, Jiaqi Han, Zhijie Huang, Ji Qi, Yi Zhou

Large language models (LLMs) are displaying emergent abilities for math reasoning tasks, and there is a growing attention on enhancing the ability of open-source LLMs through supervised fine-tuning (SFT). In this paper, we aim to explore a general data strategy for supervised data to help optimize and expand math reasoning ability. Firstly, we determine the ability boundary of reasoning paths augmentation by identifying these paths' minimal optimal set. Secondly, we validate that different abilities of the model can be cumulatively enhanced by Mix of Minimal Optimal Sets of corresponding types of data, while our models MMOS achieve SOTA performance on series base models under much lower construction costs. Besides, we point out GSM-HARD is not really hard and today's LLMs no longer lack numerical robustness. Also, we provide an Auto Problem Generator for robustness testing and educational applications. Our code and data are publicly available at https://github. com/cyzhh/MMOS.

Ranked #2 on Math Word Problem Solving on ASDiv-A (using extra training data)

Arithmetic Reasoning Math Word Problem Solving

Paper
Code

SEED: Domain-Specific Data Curation With Large Language Models

no code implementations • 1 Oct 2023 • Zui Chen, Lei Cao, Sam Madden, Tim Kraska, Zeyuan Shang, Ju Fan, Nan Tang, Zihui Gu, Chunwei Liu, Michael Cafarella

SEED uses these generated modules to process most of the data records and dynamically decides when the LLM should step in to directly process some individual records, possibly using the data-access modules to retrieve relevant information from the data sources to assist the LLM in solving the task.

Code Generation Imputation +1

Paper
Add Code

RoTaR: Efficient Row-Based Table Representation Learning via Teacher-Student Training

no code implementations • 20 Jun 2023 • Zui Chen, Lei Cao, Sam Madden

In addition to the row-based architecture, we introduce several techniques: cell-aware position embedding, teacher-student training paradigm, and selective backward to improve the performance of RoTaR model.

Position Representation Learning

Paper
Add Code

Lingua Manga: A Generic Large Language Model Centric System for Data Curation

no code implementations • 20 Jun 2023 • Zui Chen, Lei Cao, Sam Madden

Data curation is a wide-ranging area which contains many critical but time-consuming data processing tasks.

Language Modelling Large Language Model

Paper
Add Code

Interleaving Pre-Trained Language Models and Large Language Models for Zero-Shot NL2SQL Generation

1 code implementation • 15 Jun 2023 • Zihui Gu, Ju Fan, Nan Tang, Songyue Zhang, Yuxin Zhang, Zui Chen, Lei Cao, Guoliang Li, Sam Madden, Xiaoyong Du

PLMs can perform well in schema alignment but struggle to achieve complex reasoning, while LLMs is superior in complex reasoning tasks but cannot achieve precise schema alignment.

Paper
Code

DeepStruct: Pretraining of Language Models for Structure Prediction

1 code implementation • Findings (ACL) 2022 • Chenguang Wang, Xiao Liu, Zui Chen, Haoyun Hong, Jie Tang, Dawn Song

We introduce a method for improving the structural understanding abilities of language models.

Ranked #1 on Open Information Extraction on Penn Treebank

coreference-resolution Dialogue State Tracking +11

Paper
Code

Sound2Synth: Interpreting Sound via FM Synthesizer Parameters Estimation

1 code implementation • 6 May 2022 • Zui Chen, Yansen Jing, Shengcheng Yuan, Yifei Xu, Jian Wu, Hang Zhao

Synthesizer is a type of electronic musical instrument that is now widely used in modern music production and sound design.

Audio Classification Audio Signal Processing

Paper
Code

Zero-Shot Information Extraction as a Unified Text-to-Triple Translation

1 code implementation • EMNLP 2021 • Chenguang Wang, Xiao Liu, Zui Chen, Haoyun Hong, Jie Tang, Dawn Song

We cast a suite of information extraction tasks into a text-to-triple translation framework.

Ranked #1 on Open Information Extraction on OIE2016 (using extra training data)

Factual probe Language Modelling +3

105

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.