Search Results for author: Shan Huang

Found 21 papers, 6 papers with code

Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective

no code implementations • 6 Oct 2024 • Jinhao Li, Jiaming Xu, Shan Huang, Yonghua Chen, Wen Li, Jun Liu, Yaoxiu Lian, Jiayi Pan, Li Ding, Hao Zhou, Yu Wang, Guohao Dai

We compare the performance of the same optimization methods across different hardware platforms, the performance across different hardware platforms, and the performance of different methods on the same hardware platform.

Language Modeling Language Modelling +3

MARCA: Mamba Accelerator with ReConfigurable Architecture

no code implementations • 16 Sep 2024 • Jinhao Li, Shan Huang, Jiaming Xu, Jun Liu, Li Ding, Ningyi Xu, Guohao Dai

We propose an intra-operation buffer management strategy to maximize input data sharing for linear operations within operations, and an inter-operation strategy for element-wise operations between operations.

Mamba Management

"The Strength of Weak Ties" Varies Across Viral Channels

no code implementations • 7 Aug 2024 • Shan Huang, Yuan Yuan, Yi Ji

Furthermore, the "strength of weak ties" is only evident in BC, not in DM where weaker ties do not transmit significantly more novel information.

Stephanie: Step-by-Step Dialogues for Mimicking Human Interactions in Social Conversations

no code implementations • 4 Jul 2024 • Hao Yang, Hongyuan Lu, Xinhua Zeng, Yang Liu, Xiang Zhang, Haoran Yang, Yumeng Zhang, Shan Huang, Yiran Wei, Wai Lam

In the rapidly evolving field of natural language processing, dialogue systems primarily employ a single-step dialogue paradigm.

Chatbot

Fast and Efficient 2-bit LLM Inference on GPU: 2/4/16-bit in a Weight Matrix with Asynchronous Dequantization

no code implementations • 28 Nov 2023 • Jinhao Li, Jiaming Xu, Shiyao Li, Shan Huang, Jun Liu, Yaoxiu Lian, Guohao Dai

To tackle these challenges and enable fast and efficient LLM inference on GPUs, we propose the following techniques in this paper.

Quantization

Estimating Effects of Long-Term Treatments

no code implementations • 16 Aug 2023 • Shan Huang, Chen Wang, Yuan Yuan, Jinglong Zhao, Brocco, Zhang

Estimating the effects of long-term treatments through A/B testing is challenging.

TransMRSR: Transformer-based Self-Distilled Generative Prior for Brain MRI Super-Resolution

1 code implementation • 11 Jun 2023 • Shan Huang, Xiaohong Liu, Tao Tan, Menghan Hu, Xiaoer Wei, TingLi Chen, Bin Sheng

We perform a generative task to encapsulate diverse priors into a generative network (GAN), which is the decoder sub-module of the deep non-local feature capture part, in the first stage.

Decoder Image Reconstruction +1

A Novel Interpretable and Generalizable Re-synchronization Model for Cued Speech based on a Multi-Cuer Corpus

1 code implementation • 5 Jun 2023 • Lufei Gao, Shan Huang, Li Liu

Cued Speech (CS) is a multi-modal visual coding system combining lip reading with several hand cues at the phonetic level to make the spoken language visible to the hearing impaired.

Lip Reading

Federated Learning for Metaverse: A Survey

no code implementations • 23 Mar 2023 • Yao Chen, Shan Huang, Wensheng Gan, Gengsen Huang, Yongdong Wu

In this paper, we review some of the early advances of FL4M, which will be a research direction with unlimited development potential.

Edge-computing Federated Learning +2

TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities

3 code implementations • 13 Dec 2022 • Zhe Zhao, Yudong Li, Cheng Hou, Jing Zhao, Rong Tian, Weijie Liu, Yiren Chen, Ningyuan Sun, Haoyan Liu, Weiquan Mao, Han Guo, Weigang Guo, Taiqiang Wu, Tao Zhu, Wenhang Shi, Chen Chen, Shan Huang, Sihong Chen, Liqun Liu, Feifei Li, Xiaoshuai Chen, Xingwu Sun, Zhanhui Kang, Xiaoyong Du, Linlin Shen, Kimmo Yan

The proposed pre-training models of different modalities are showing a rising trend of homogeneity in their model structures, which brings the opportunity to implement different pre-training models within a uniform framework.

Decoder

Contrastive Credibility Propagation for Reliable Semi-Supervised Learning

1 code implementation • 17 Nov 2022 • Brody Kutt, Pralay Ramteke, Xavier Mignot, Pamela Toman, Nandini Ramanan, Sujit Rokka Chhetri, Shan Huang, Min Du, William Hewlett

CCP unifies semi-supervised learning and noisy label learning for the goal of reliably outperforming a supervised baseline in any data scenario.

Pseudo Label

Dynamic optical contrast imaging for real-time delineation of tumor resection margins using head and neck cancer as a model

no code implementations • 15 Feb 2022 • Yong Hu, Shan Huang, Albert Y. Han, Seong Moon, Jeffrey F. Krane, Oscar Stafsudd, Warren Grundfest, Maie A. St. John

Complete surgical resection of the tumor for head and neck squamous cell carcinoma (HNSCC) remains challenging, given the devastating side effects of aggressive surgery and the anatomic proximity to vital structures.

Specificity

WebUAV-3M: A Benchmark for Unveiling the Power of Million-Scale Deep UAV Tracking

1 code implementation • 19 Jan 2022 • Chunhui Zhang, Guanjie Huang, Li Liu, Shan Huang, Yinan Yang, Xiang Wan, Shiming Ge, Dacheng Tao

In this work, we propose WebUAV-3M, the largest public UAV tracking benchmark to date, to facilitate both the development and evaluation of deep UAV trackers.

Factor Representation and Decision Making in Stock Markets Using Deep Reinforcement Learning

no code implementations • 3 Aug 2021 • Zhaolu Dong, Shan Huang, Simiao Ma, Yining Qian

Deep reinforcement learning is a branch of unsupervised learning in which an agent learns to act based on environment state in order to maximize its total reward.

Decision Making Deep Reinforcement Learning +3

Two Stochastic Control Problems In Capital Structure and Portfolio Choice

no code implementations • 5 Jul 2021 • Shan Huang

In the first problem, we derive a stochastic control model to optimize banks' dividend and recapitalization policies and calibrate that to a sample of U.S. banks in the situation where we model banks' true accounting asset values as partially observed variables due to the opaqueness in banks' assets.

Vocal Bursts Valence Prediction

Emotions in Online Content Diffusion

no code implementations • 17 Nov 2020 • Yifan Yu, Shan Huang, Yuchen Liu, Yong Tan

We apply a partial-linear instrumental variable approach with a double machine learning framework to causally identify the impact of the negative discrete emotions on online content diffusion.

Marketing

Predicting nucleation near the spinodal in the Ising model using machine learning

no code implementations • 20 Apr 2020 • Shan Huang, William Klein, Harvey Gould

We use a Convolutional Neural Network (CNN) and two logistic regression models to predict the probability of nucleation in the two-dimensional Ising model.

BIG-bench Machine Learning regression

A Study of Pyramid Structure for Code Correction

no code implementations • 28 Jan 2020 • Shan Huang, Xiao Zhou, Sang Chin

We demonstrate the implementations of pyramid encoders in both multi-layer GRU and Transformer for seq2seq tasks.

Software Engineering
