Search Results for author: Kai Mei

Found 26 papers, 20 papers with code

Semantic Scheduling for LLM Inference

1 code implementation13 Jun 2025 Wenyue Hua, Dujian Ding, Yile Gu, Yujie Ren, Kai Mei, Minghua Ma, William Yang Wang

Conventional operating system scheduling algorithms are largely content-ignorant, making decisions based on factors such as latency or fairness without considering the actual intents or semantics of processes.

Fairness Management +1

LiteCUA: Computer as MCP Server for Computer-Use Agent on AIOS

3 code implementations24 May 2025 Kai Mei, Xi Zhu, Hang Gao, Shuhang Lin, Yongfeng Zhang

We present AIOS 1. 0, a novel platform designed to advance computer-use agent (CUA) capabilities through environmental contextualization.

Cerebrum (AIOS SDK): A Platform for Agent Development, Deployment, Distribution, and Discovery

3 code implementations14 Mar 2025 Balaji Rama, Kai Mei, Yongfeng Zhang

Autonomous LLM-based agents have emerged as a powerful paradigm for complex task execution, yet the field lacks standardized tools for development, deployment, distribution and discovery of agents.

Management

OmniRouter: Budget and Performance Controllable Multi-LLM Routing

3 code implementations27 Feb 2025 Kai Mei, Wujiang Xu, Shuhang Lin, Yongfeng Zhang

LLM routing is a crucial paradigm that dynamically selects the most suitable large language models from a pool of candidates to process diverse inputs, ensuring optimal resource utilization while maintaining response quality.

AI Agent Mathematical Reasoning +1

iAgent: LLM Agent as a Shield between User and Recommender Systems

1 code implementation20 Feb 2025 Wujiang Xu, Yunxiao Shi, Zujie Liang, Xuying Ning, Kai Mei, Kun Wang, Xi Zhu, Min Xu, Yongfeng Zhang

Traditional recommender systems usually take the user-platform paradigm, where users are directly exposed under the control of the platform's recommendation algorithms.

Recommendation Systems

A-MEM: Agentic Memory for LLM Agents

4 code implementations17 Feb 2025 Wujiang Xu, Kai Mei, Hang Gao, Juntao Tan, Zujie Liang, Yongfeng Zhang

To address this limitation, this paper proposes a novel agentic memory system for LLM agents that can dynamically organize memories in an agentic way.

Large Language Model

Massive Values in Self-Attention Modules are the Key to Contextual Knowledge Understanding

1 code implementation3 Feb 2025 Mingyu Jin, Kai Mei, Wujiang Xu, MingJie Sun, Ruixiang Tang, Mengnan Du, Zirui Liu, Yongfeng Zhang

In this paper, we show that these concentrated massive values consistently emerge in specific regions of attention queries (Q) and keys (K) while not having such patterns in values (V) in various modern transformer-based LLMs (Q, K, and V mean the representations output by the query, key, and value layers respectively).

Quantization

Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents

1 code implementation3 Oct 2024 Hanrong Zhang, Jingyuan Huang, Kai Mei, Yifei Yao, Zhenting Wang, Chenlu Zhan, Hongwei Wang, Yongfeng Zhang

To address this, we introduce Agent Security Bench (ASB), a comprehensive framework designed to formalize, benchmark, and evaluate the attacks and defenses of LLM-based agents, including 10 scenarios (e. g., e-commerce, autonomous driving, finance), 10 agents targeting the scenarios, over 400 tools, 27 different types of attack/defense methods, and 7 evaluation metrics.

Autonomous Driving Backdoor Attack +1

From Commands to Prompts: LLM-based Semantic File System for AIOS

1 code implementation23 Sep 2024 Zeru Shi, Kai Mei, Mingyu Jin, Yongye Su, Chaoji Zuo, Wenyue Hua, Wujiang Xu, Yujie Ren, Zirui Liu, Mengnan Du, Dong Deng, Yongfeng Zhang

Large language models (LLMs) have demonstrated significant potential in the development of intelligent applications and systems such as LLM-based agents and agent operating systems (AIOS).

Management Navigate

AutoFlow: Automated Workflow Generation for Large Language Model Agents

1 code implementation1 Jul 2024 Zelong Li, Shuyuan Xu, Kai Mei, Wenyue Hua, Balaji Rama, Om Raheja, Hao Wang, He Zhu, Yongfeng Zhang

We believe that the automatic generation and interpretation of workflows in natural language represent a promising paradigm for solving complex tasks, particularly with the rapid development of LLMs.

AI Agent Language Modeling +1

AIOS Compiler: LLM as Interpreter for Natural Language Programming and Flow Programming of AI Agents

3 code implementations11 May 2024 Shuyuan Xu, Zelong Li, Kai Mei, Yongfeng Zhang

However, the inherent vagueness, ambiguity, and verbosity of natural language pose significant challenges in developing an interpreter that can accurately understand the programming logic and execute instructions written in natural language.

Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?

1 code implementation10 Apr 2024 Mingyu Jin, Qinkai Yu, Jingyuan Huang, Qingcheng Zeng, Zhenting Wang, Wenyue Hua, Haiyan Zhao, Kai Mei, Yanda Meng, Kaize Ding, Fan Yang, Mengnan Du, Yongfeng Zhang

In this paper, we explore the hypothesis that LLMs process concepts of varying complexities in different layers, introducing the idea of ``Concept Depth'' to suggest that more complex concepts are typically acquired in deeper layers.

AIOS: LLM Agent Operating System

2 code implementations25 Mar 2024 Kai Mei, Xi Zhu, Wujiang Xu, Wenyue Hua, Mingyu Jin, Zelong Li, Shuyuan Xu, Ruosong Ye, Yingqiang Ge, Yongfeng Zhang

This AIOS kernel provides fundamental services (e. g., scheduling, context management, memory management, storage management, access control) and efficient management of resources (e. g., LLM and external tools) for runtime agents.

AI Agent Language Modelling +2

What if LLMs Have Different World Views: Simulating Alien Civilizations with LLM-based Agents

1 code implementation20 Feb 2024 Zhaoqian Xue, Mingyu Jin, Beichen Wang, Suiyuan Zhu, Kai Mei, Hua Tang, Wenyue Hua, Mengnan Du, Yongfeng Zhang

This study introduces "CosmoAgent," an innovative artificial intelligence system that utilizes Large Language Models (LLMs) to simulate complex interactions between human and extraterrestrial civilizations.

Decision Making

LightLM: A Lightweight Deep and Narrow Language Model for Generative Recommendation

1 code implementation26 Oct 2023 Kai Mei, Yongfeng Zhang

LightLM tackles the issue by introducing a light-weight deep and narrow Transformer architecture, which is specifically tailored for direct generation of recommendation items.

Hallucination Language Modeling +1

NOTABLE: Transferable Backdoor Attacks Against Prompt-based NLP Models

1 code implementation28 May 2023 Kai Mei, Zheng Li, Zhenting Wang, Yang Zhang, Shiqing Ma

Such attacks can be easily affected by retraining on downstream tasks and with different prompting strategies, limiting the transferability of backdoor attacks.

OpenAGI: When LLM Meets Domain Experts

2 code implementations NeurIPS 2023 Yingqiang Ge, Wenyue Hua, Kai Mei, Jianchao Ji, Juntao Tan, Shuyuan Xu, Zelong Li, Yongfeng Zhang

This capability is vital for Artificial Intelligence (AI) and should be embedded in comprehensive AI Agents, enabling them to harness expert models for complex task-solving towards Artificial General Intelligence (AGI).

Benchmarking Natural Language Queries

UNICORN: A Unified Backdoor Trigger Inversion Framework

1 code implementation5 Apr 2023 Zhenting Wang, Kai Mei, Juan Zhai, Shiqing Ma

Then, it proposes a unified framework to invert backdoor triggers based on the formalization of triggers and the identified inner behaviors of backdoor models from our analysis.

Backdoor Attack

Theoretical Analysis of Deep Neural Networks in Physical Layer Communication

no code implementations21 Feb 2022 Jun Liu, Haitao Zhao, Dongtang Ma, Kai Mei, Jibo Wei

In this paper, we aim to quantitatively analyze why DNNs can achieve comparable performance in the physical layer comparing with traditional techniques, and also drive their cost in terms of computational complexity.

Intelligent Communication

A Low Complexity Learning-based Channel Estimation for OFDM Systems with Online Training

no code implementations14 Jul 2021 Kai Mei, Jun Liu, Xiaoying Zhang, Kuo Cao, Nandana Rajatheva, Jibo Wei

Besides, a training data construction approach utilizing least square (LS) estimation results is proposed so that the training data can be collected during the data transmission.

BIG-bench Machine Learning

Opening the Black Box of Deep Neural Networks in Physical Layer Communication

no code implementations2 Jun 2021 Jun Liu, Haitao Zhao, Dongtang Ma, Kai Mei, Jibo Wei

Deep Neural Network (DNN)-based physical layer techniques are attracting considerable interest due to their potential to enhance communication systems.

Fine Timing and Frequency Synchronization for MIMO-OFDM: An Extreme Learning Approach

no code implementations17 Jul 2020 Jun Liu, Kai Mei, Xiaochen Zhang, Des McLernon, Dongtang Ma, Jibo Wei, Syed Ali Raza Zaidi

Multiple-input multiple-output orthogonal frequency-division multiplexing (MIMO-OFDM) is a key technology component in the evolution towards cognitive radio (CR) in next-generation communication in which the accuracy of timing and frequency synchronization significantly impacts the overall system performance.

BIG-bench Machine Learning

Performance Analysis on Machine Learning-Based Channel Estimation

no code implementations10 Nov 2019 Kai Mei, Jun Liu, Xiaochen Zhang, Nandana Rajatheva, Jibo Wei

In this situation, our analysis results can be applied to assess the performance and support the design of machine learning-based channel estimation.

BIG-bench Machine Learning

Deep Neural Network Aided Scenario Identification in Wireless Multi-path Fading Channels

no code implementations23 Nov 2018 Jun Liu, Kai Mei, Dongtang Ma, Jibo Wei

This letter illustrates our preliminary works in deep nerual network (DNN) for wireless communication scenario identification in wireless multi-path fading channels.

Cannot find the paper you are looking for? You can Submit a new open access paper.