Search Results for author: Jiaxi Li

Found 11 papers, 5 papers with code

HELENE: Hessian Layer-wise Clipping and Gradient Annealing for Accelerating Fine-tuning LLM with Zeroth-order Optimization

no code implementations16 Nov 2024 Huaqin Zhao, Jiaxi Li, Yi Pan, Shizhe Liang, Xiaofeng Yang, Wei Liu, Xiang Li, Fei Dou, Tianming Liu, Jin Lu

Experimental results on RoBERTa-large and OPT-1. 3B across multiple tasks show that HELENE achieves up to a 20x speedup compared to MeZO, with average accuracy improvements of 1. 5%.

parameter-efficient fine-tuning

OWLed: Outlier-weighed Layerwise Pruning for Efficient Autonomous Driving Framework

1 code implementation12 Nov 2024 Jiaxi Li, Lu Yin, Xilu Wang

The integration of Large Language Models (LLMs) into autonomous driving systems offers promising enhancements in environmental understanding and decision-making.

Autonomous Driving Decision Making +1

A Systematic Assessment of OpenAI o1-Preview for Higher Order Thinking in Education

no code implementations11 Oct 2024 Ehsan Latif, Yifan Zhou, Shuchen Guo, Yizhu Gao, Lehong Shi, Matthew Nayaaba, Gyeonggeon Lee, Liang Zhang, Arne Bewersdorff, Luyang Fang, Xiantong Yang, Huaqin Zhao, Hanqi Jiang, Haoran Lu, Jiaxi Li, Jichao Yu, Weihang You, Zhengliang Liu, Vincent Shung Liu, Hui Wang, Zihao Wu, Jin Lu, Fei Dou, Ping Ma, Ninghao Liu, Tianming Liu, Xiaoming Zhai

This study evaluates OpenAI o1-preview's ability to perform higher-order cognitive tasks across 14 dimensions, including critical thinking, systems thinking, computational thinking, design thinking, metacognition, data literacy, creative thinking, abstract reasoning, quantitative reasoning, logical reasoning, analogical reasoning, and scientific reasoning.

Logical Reasoning

Revisiting semi-supervised training objectives for differentiable particle filters

no code implementations2 May 2024 Jiaxi Li, John-Joseph Brady, Xiongjie Chen, Yunpeng Li

Differentiable particle filters combine the flexibility of neural networks with the probabilistic nature of sequential Monte Carlo methods.

Learning Differentiable Particle Filter on the Fly

no code implementations10 Dec 2023 Jiaxi Li, Xiongjie Chen, Yunpeng Li

Differentiable particle filters are an emerging class of sequential Bayesian inference techniques that use neural networks to construct components in state space models.

Bayesian Inference Object Tracking +2

Unraveling Feature Extraction Mechanisms in Neural Networks

1 code implementation25 Oct 2023 Xiaobing Sun, Jiaxi Li, Wei Lu

The underlying mechanism of neural networks in capturing precise knowledge has been the subject of consistent research efforts.

Language Modeling Language Modelling

Decomposed Prompt Tuning via Low-Rank Reparameterization

1 code implementation16 Oct 2023 Yao Xiao, Lu Xu, Jiaxi Li, Wei Lu, XiaoLi Li

While prompt tuning approaches have achieved competitive performance with high efficiency, we observe that they invariably employ the same initialization process, wherein the soft prompt is either randomly initialized or derived from an existing embedding vocabulary.

HRGCN: Heterogeneous Graph-level Anomaly Detection with Hierarchical Relation-augmented Graph Neural Networks

1 code implementation28 Aug 2023 Jiaxi Li, Guansong Pang, Ling Chen, Mohammad-Reza Namazi-Rad

To address the problem, we propose HRGCN, an unsupervised deep heterogeneous graph neural network, to model complex heterogeneous relations between different entities in the system for effectively identifying these anomalous behaviour graphs.

Anomaly Detection Graph Neural Network +1

Contextual Distortion Reveals Constituency: Masked Language Models are Implicit Parsers

1 code implementation1 Jun 2023 Jiaxi Li, Wei Lu

To leverage this knowledge, we propose a novel chart-based method for extracting parse trees from masked language models (LMs) without the need to train separate parsers.

Cannot find the paper you are looking for? You can Submit a new open access paper.