Search Results for author: Zijian Hu

Found 13 papers, 4 papers with code

Fox-1 Technical Report

no code implementations8 Nov 2024 Zijian Hu, Jipeng Zhang, Rui Pan, Zhaozhuo Xu, Shanshan Han, Han Jin, Alay Dilipbhai Shah, Dimitris Stripelis, Yuhang Yao, Salman Avestimehr, Chaoyang He, Tong Zhang

Aiming to improve the pre-training efficiency, Fox-1-1. 6B model introduces a novel 3-stage data curriculum across all the training data with 2K-8K sequence length.

2k 8k +1

Alopex: A Computational Framework for Enabling On-Device Function Calls with LLMs

no code implementations7 Nov 2024 Yide Ran, Zhaozhuo Xu, Yuhang Yao, Zijian Hu, Shanshan Han, Han Jin, Alay Dilipbhai Shah, Jipeng Zhang, Dimitris Stripelis, Tong Zhang, Salman Avestimehr, Chaoyang He

The rapid advancement of Large Language Models (LLMs) has led to their increased integration into mobile devices for personalized assistance, which enables LLMs to call external API functions to enhance their performance.

TensorOpera Router: A Multi-Model Router for Efficient LLM Inference

no code implementations22 Aug 2024 Dimitris Stripelis, Zijian Hu, Jipeng Zhang, Zhaozhuo Xu, Alay Dilipbhai Shah, Han Jin, Yuhang Yao, Salman Avestimehr, Chaoyang He

With the rapid growth of Large Language Models (LLMs) across various domains, numerous new LLMs have emerged, each possessing domain-specific expertise.

ScaleLLM: A Resource-Frugal LLM Serving Framework by Optimizing End-to-End Efficiency

no code implementations23 Jul 2024 Yuhang Yao, Han Jin, Alay Dilipbhai Shah, Shanshan Han, Zijian Hu, Yide Ran, Dimitris Stripelis, Zhaozhuo Xu, Salman Avestimehr, Chaoyang He

Large language models (LLMs) have surged in popularity and are extensively used in commercial applications, where the efficiency of model serving is crucial for the user experience.

TorchOpera: A Compound AI System for LLM Safety

no code implementations16 Jun 2024 Shanshan Han, Zijian Hu, Alay Dilipbhai Shah, Han Jin, Yuhang Yao, Dimitris Stripelis, Zhaozhuo Xu, Chaoyang He

We introduce TorchOpera, a compound AI system for enhancing the safety and quality of prompts and responses for Large Language Models.

FedSecurity: Benchmarking Attacks and Defenses in Federated Learning and Federated LLMs

1 code implementation8 Jun 2023 Shanshan Han, Baturalp Buyukates, Zijian Hu, Han Jin, Weizhao Jin, Lichao Sun, Xiaoyang Wang, Wenxuan Wu, Chulin Xie, Yuhang Yao, Kai Zhang, Qifan Zhang, Yuhui Zhang, Carlee Joe-Wong, Salman Avestimehr, Chaoyang He

This paper introduces FedSecurity, an end-to-end benchmark that serves as a supplementary component of the FedML library for simulating adversarial attacks and corresponding defense mechanisms in Federated Learning (FL).

Benchmarking Federated Learning

Demonstration-guided Deep Reinforcement Learning for Coordinated Ramp Metering and Perimeter Control in Large Scale Networks

no code implementations4 Mar 2023 Zijian Hu, Wei Ma

This study considers two representative control approaches: ramp metering for freeways and perimeter control for homogeneous urban roads, and we aim to develop a deep reinforcement learning (DRL)-based coordinated control framework for large-scale networks.

Deep Reinforcement Learning

Heterogeneous Line Graph Transformer for Math Word Problems

no code implementations11 Aug 2022 Zijian Hu, Meng Jiang

We originally planned to employ existing models but realized that they processed a math word problem as a sequence or a homogeneous graph of tokens.

Math Representation Learning +1

SimMER: Simple Maximization of Entropy and Rank for Self-supervised Representation Learning

no code implementations29 Sep 2021 Zhengyu Yang, Zijian Hu, Xuefeng Hu, Ram Nevatia

With both entropy and rank maximization, our method surpasses the state-of-the-art on CIFAR-10 and Mini-ImageNet under the standard linear evaluation protocol.

Contrastive Learning Linear evaluation +2

SimPLE: Similar Pseudo Label Exploitation for Semi-Supervised Classification

1 code implementation CVPR 2021 Zijian Hu, Zhengyu Yang, Xuefeng Hu, Ram Nevatia

Combining the Pair Loss with the techniques developed by the MixMatch family, our proposed SimPLE algorithm shows significant performance gains over previous algorithms on CIFAR-100 and Mini-ImageNet, and is on par with the state-of-the-art methods on CIFAR-10 and SVHN.

Classification General Classification +3

Cannot find the paper you are looking for? You can Submit a new open access paper.