Search Results for author: Mingxuan Li

Found 8 papers, 2 papers with code

AttentionLego: An Open-Source Building Block For Spatially-Scalable Large Language Model Accelerator With Processing-In-Memory Technology

no code implementations21 Jan 2024 Rongqing Cong, Wenyang He, Mingxuan Li, Bangning Luo, Zebin Yang, Yuchao Yang, Ru Huang, Bonan Yan

Large language models (LLMs) with Transformer architectures have become phenomenal in natural language processing, multimodal generative artificial intelligence, and agent-oriented artificial intelligence.

Language Modelling Large Language Model

Interpretability is a Kind of Safety: An Interpreter-based Ensemble for Adversary Defense

no code implementations14 Apr 2023 Jingyuan Wang, Yufan Wu, Mingxuan Li, Xin Lin, Junjie Wu, Chao Li

While having achieved great success in rich real-life applications, deep neural network (DNN) models have long been criticized for their vulnerability to adversarial attacks.

Intelligent detect for substation insulator defects based on CenterMask

no code implementations31 Aug 2022 Bo Ye, Feng Li, Mingxuan Li, Peipei Yan, Huiting Yang, Lihua Wang

Based on the end-to-end learning paradigm, this paper proposes an intelligent detection method for substation insulator defects based on CenterMask.

Defect Detection

Learning Generalizable Behavior via Visual Rewrite Rules

no code implementations9 Dec 2021 Yiheng Xie, Mingxuan Li, Shangqun Yu, Michael Littman

Though deep reinforcement learning agents have achieved unprecedented success in recent years, their learned policies can be brittle, failing to generalize to even slight modifications of their environments or unfamiliar situations.

Descriptive

Deep Compression of Neural Networks for Fault Detection on Tennessee Eastman Chemical Processes

no code implementations18 Jan 2021 Mingxuan Li, Yuanxun Shao

Artificial neural network has achieved the state-of-art performance in fault detection on the Tennessee Eastman process, but it often requires enormous memory to fund its massive parameters.

Clustering Fault Detection +2

Towards Sample Efficient Agents through Algorithmic Alignment

1 code implementation7 Aug 2020 Mingxuan Li, Michael L. Littman

We demonstrate the potential of graph neural network in supporting sample efficient learning by showing that Deep Graph Value Network can outperform unstructured baselines by a large margin in solving the Markov Decision Process (MDP).

Reinforcement Learning (RL)

Are L2 adversarial examples intrinsically different?

no code implementations28 Feb 2020 Mingxuan Li, Jingyuan Wang, Yufan Wu

That is, adversarial examples generated by $L_2$ attacks usually have larger input sensitivity which can be used to identify them efficiently.

Cannot find the paper you are looking for? You can Submit a new open access paper.