Search Results for author: Yubin Xia

Found 6 papers, 4 papers with code

PowerInfer-2: Fast Large Language Model Inference on a Smartphone

1 code implementation 10 Jun 2024 Zhenliang Xue, Yixin Song, Zeyu Mi, Le Chen, Yubin Xia, Haibo Chen

This paper introduces PowerInfer-2, a framework designed for high-speed inference of Large Language Models (LLMs) on smartphones, particularly effective for models whose sizes exceed the device's memory capacity.

Language Modelling, Large Language Model

PatentGPT: A Large Language Model for Intellectual Property

no code implementations 28 Apr 2024 Zilong Bai, Ruiji Zhang, Linqing Chen, Qijun Cai, Yuan Zhong, Cong Wang, Yan Fang, Jie Fang, Jing Sun, Weikuan Wang, Lizhi Zhou, Haoran Hua, Tian Qiu, Chaochao Wang, Cheng Sun, Jianping Lu, Yixin Wang, Yubin Xia, Meng Hu, Haowen Liu, Peng Xu, Licong Xu, Fu Bian, Xiaolong Gu, Lisha Zhang, Weilei Wang, Changyang Tu

In recent years, large language models (LLMs) have attracted significant attention due to their exceptional performance across a multitude of natural language processing tasks, and have been widely applied in various fields.

Language Modelling, Large Language Model

Scalable Memory Protection in the PENGLAI Enclave

1 code implementation OSDI 2021 Erhu Feng, Xu Lu, Dong Du, Bicheng Yang, Xueqiang Jiang, Yubin Xia, Binyu Zang, Haibo Chen

Upon these two primitives, our system can scale to thousands of concurrent enclaves with high resource utilization and eliminate the high-cost initialization of secure memory using fork-style enclave creation without weakening the security guarantees.

Characterizing Serverless Platforms with ServerlessBench

1 code implementation SoCC 2020 Tianyi Yu, Qingyuan Liu, Dong Du, Yubin Xia, Binyu Zang, Haibo Chen

This, however, also presents new challenges including how to efficiently design high-performance serverless platforms and how to efficiently program on the platforms.

Occlum: Secure and Efficient Multitasking Inside a Single Enclave of Intel SGX

7 code implementations 21 Jan 2020 Youren Shen, Hongliang Tian, Yu Chen, Kang Chen, Runji Wang, Yi Xu, Yubin Xia

SFI is a software instrumentation technique for sandboxing untrusted modules (called domains).

Operating Systems, Hardware Architecture, Cryptography and Security
