Search Results for author: Jiaqi Han

Found 15 papers, 7 papers with code

An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning

1 code implementation23 Feb 2024 Zui Chen, Yezeng Chen, Jiaqi Han, Zhijie Huang, Ji Qi, Yi Zhou

Large language models (LLMs) are displaying emergent abilities for math reasoning tasks, and there is a growing attention on enhancing the ability of open-source LLMs through supervised fine-tuning (SFT). In this paper, we aim to explore a general data strategy for supervised data to help optimize and expand math reasoning ability. Firstly, we determine the ability boundary of reasoning paths augmentation by identifying these paths' minimal optimal set. Secondly, we validate that different abilities of the model can be cumulatively enhanced by Mix of Minimal Optimal Sets of corresponding types of data, while our models MMOS achieve SOTA performance on series base models under much lower construction costs. Besides, we point out GSM-HARD is not really hard and today's LLMs no longer lack numerical robustness. Also, we provide an Auto Problem Generator for robustness testing and educational applications. Our code and data are publicly available at https://github. com/cyzhh/MMOS.

Ranked #2 on Math Word Problem Solving on ASDiv-A (using extra training data)

Arithmetic Reasoning Math Word Problem Solving

Equivariant Graph Neural Operator for Modeling 3D Dynamics

no code implementations19 Jan 2024 Minkai Xu, Jiaqi Han, Aaron Lou, Jean Kossaifi, Arvind Ramanathan, Kamyar Azizzadenesheli, Jure Leskovec, Stefano Ermon, Anima Anandkumar

Modeling the complex three-dimensional (3D) dynamics of relational systems is an important problem in the natural sciences, with applications ranging from molecular simulations to particle mechanics.

Operator learning

Crystal Structure Prediction by Joint Equivariant Diffusion

1 code implementation NeurIPS 2023 Rui Jiao, Wenbing Huang, Peijia Lin, Jiaqi Han, Pin Chen, Yutong Lu, Yang Liu

To be specific, DiffCSP jointly generates the lattice and atom coordinates for each crystal by employing a periodic-E(3)-equivariant denoising model, to better model the crystal geometry.

Denoising

Structure-Aware DropEdge Towards Deep Graph Convolutional Networks

no code implementations21 Jun 2023 Jiaqi Han, Wenbing Huang, Yu Rong, Tingyang Xu, Fuchun Sun, Junzhou Huang

Regarding the layer-dependent sampler, we interestingly find that increasingly sampling edges from the bottom layer yields superior performance than the decreasing counterpart as well as DropEdge.

Node Classification

Subequivariant Graph Reinforcement Learning in 3D Environments

1 code implementation30 May 2023 Runfa Chen, Jiaqi Han, Fuchun Sun, Wenbing Huang

Learning a shared policy that guides the locomotion of different agents is of core interest in Reinforcement Learning (RL), which leads to the study of morphology-agnostic RL.

reinforcement-learning Reinforcement Learning (RL) +1

DialogVCS: Robust Natural Language Understanding in Dialogue System Upgrade

no code implementations24 May 2023 Zefan Cai, Xin Zheng, Tianyu Liu, Xu Wang, Haoran Meng, Jiaqi Han, Gang Yuan, Binghuai Lin, Baobao Chang, Yunbo Cao

In the constant updates of the product dialogue systems, we need to retrain the natural language understanding (NLU) model as new data from the real users would be merged into the existent data accumulated in the last updates.

Intent Detection Multi-Label Classification +1

Energy-Motivated Equivariant Pretraining for 3D Molecular Graphs

2 code implementations18 Jul 2022 Rui Jiao, Jiaqi Han, Wenbing Huang, Yu Rong, Yang Liu

Pretraining molecular representation models without labels is fundamental to various applications.

molecular representation

Smoothing Matters: Momentum Transformer for Domain Adaptive Semantic Segmentation

1 code implementation15 Mar 2022 Runfa Chen, Yu Rong, Shangmin Guo, Jiaqi Han, Fuchun Sun, Tingyang Xu, Wenbing Huang

After the great success of Vision Transformer variants (ViTs) in computer vision, it has also demonstrated great potential in domain adaptive semantic segmentation.

Pseudo Label Segmentation +2

Equivariant Graph Mechanics Networks with Constraints

1 code implementation12 Mar 2022 Wenbing Huang, Jiaqi Han, Yu Rong, Tingyang Xu, Fuchun Sun, Junzhou Huang

The core of GMN is that it represents, by generalized coordinates, the forward kinematics information (positions and velocities) of a structural object.

Equivariant Graph Hierarchy-Based Neural Networks

1 code implementation22 Feb 2022 Jiaqi Han, Wenbing Huang, Tingyang Xu, Yu Rong

Equivariant Graph neural Networks (EGNs) are powerful in characterizing the dynamics of multi-body physical systems.

Geometrically Equivariant Graph Neural Networks: A Survey

no code implementations15 Feb 2022 Jiaqi Han, Yu Rong, Tingyang Xu, Wenbing Huang

Many scientific problems require to process data in the form of geometric graphs.

Inductive Bias

Constrained Graph Mechanics Networks

no code implementations ICLR 2022 Wenbing Huang, Jiaqi Han, Yu Rong, Tingyang Xu, Fuchun Sun, Junzhou Huang

In this manner, the geometrical constraints are implicitly and naturally encoded in the forward kinematics.

On Locality in Graph Learning via Graph Neural Network

no code implementations29 Sep 2021 Junwei Su, Jiaqi Han, Chuan Wu

In this paper, we study how the training set in the input graph effects the performance of GNN.

Active Learning Graph Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.