Search Results for author: Zebin Yang

Found 9 papers, 4 papers with code

ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding

no code implementations21 Feb 2024 Shuzhang Zhong, Zebin Yang, Meng Li, Ruihao Gong, Runsheng Wang, Ru Huang

Additionally, it introduces a dynamic token tree generation algorithm to balance the computation and parallelism of the verification phase in real-time and maximize the overall efficiency across different batch sizes, sequence lengths, and tasks, etc.

AttentionLego: An Open-Source Building Block For Spatially-Scalable Large Language Model Accelerator With Processing-In-Memory Technology

no code implementations21 Jan 2024 Rongqing Cong, Wenyang He, Mingxuan Li, Bangning Luo, Zebin Yang, Yuchao Yang, Ru Huang, Bonan Yan

Large language models (LLMs) with Transformer architectures have become phenomenal in natural language processing, multimodal generative artificial intelligence, and agent-oriented artificial intelligence.

Language Modelling Large Language Model

PiML Toolbox for Interpretable Machine Learning Model Development and Diagnostics

1 code implementation7 May 2023 Agus Sudjianto, Aijun Zhang, Zebin Yang, Yu Su, Ningzhou Zeng

PiML (read $\pi$-ML, /`pai`em`el/) is an integrated and open-access Python toolbox for interpretable machine learning model development and model diagnostics.

Fairness Interpretable Machine Learning

Explainable Recommendation Systems by Generalized Additive Models with Manifest and Latent Interactions

no code implementations15 Dec 2020 Yifeng Guo, Yu Su, Zebin Yang, Aijun Zhang

In this paper, we propose the explainable recommendation systems based on a generalized additive model with manifest and latent interactions (GAMMLI).

Additive models Collaborative Filtering +2

Unwrapping The Black Box of Deep ReLU Networks: Interpretability, Diagnostics, and Simplification

1 code implementation8 Nov 2020 Agus Sudjianto, William Knauth, Rahul Singh, Zebin Yang, Aijun Zhang

We propose the local linear profile plot and other visualization methods for interpretation and diagnostics, and an effective merging strategy for network simplification.

Hyperparameter Optimization via Sequential Uniform Designs

2 code implementations8 Sep 2020 Zebin Yang, Aijun Zhang

Hyperparameter optimization (HPO) plays a central role in the automated machine learning (AutoML).

Hyperparameter Optimization

GAMI-Net: An Explainable Neural Network based on Generalized Additive Models with Structured Interactions

2 code implementations16 Mar 2020 Zebin Yang, Aijun Zhang, Agus Sudjianto

The lack of interpretability is an inevitable problem when using neural network models in real applications.

Additive models

Enhancing Explainability of Neural Networks through Architecture Constraints

no code implementations12 Jan 2019 Zebin Yang, Aijun Zhang, Agus Sudjianto

It leads to an explainable neural network (xNN) with the superior balance between prediction performance and model interpretability.

Cannot find the paper you are looking for? You can Submit a new open access paper.