Search Results for author: Lei Su

Found 12 papers, 5 papers with code

Comprehensive Performance Evaluation of YOLOv11, YOLOv10, YOLOv9, YOLOv8 and YOLOv5 on Object Detection of Power Equipment

no code implementations28 Nov 2024 Zijian He, Kang Wang, Tian Fang, Lei Su, Rui Chen, Xihong Fei

With the rapid development of global industrial production, the demand for reliability in power equipment has been continuously increasing.

object-detection Object Detection

Topology-aware Preemptive Scheduling for Co-located LLM Workloads

1 code implementation18 Nov 2024 Ping Zhang, Lei Su, Jinjie Yang, Xin Chen

Hosting diverse large language model workloads in a unified resource pool through co-location is cost-effective.

Language Modelling Large Language Model +1

INT-FlashAttention: Enabling Flash Attention for INT8 Quantization

1 code implementation25 Sep 2024 Shimao Chen, Zirui Liu, Zhiying Wu, Ce Zheng, Peizhuang Cong, Zihan Jiang, Yuhan Wu, Lei Su, Tong Yang

As the foundation of large language models (LLMs), self-attention module faces the challenge of quadratic time and memory complexity with respect to sequence length.

Quantization

ISO: Overlap of Computation and Communication within Seqenence For LLM Inference

no code implementations4 Sep 2024 Bin Xiao, Lei Su

In the realm of Large Language Model (LLM) inference, the inherent structure of transformer models coupled with the multi-GPU tensor parallelism strategy leads to a sequential execution of computation and communication.

Language Modelling Large Language Model

Boosting Lossless Speculative Decoding via Feature Sampling and Partial Alignment Distillation

no code implementations28 Aug 2024 Lujun Gui, Bin Xiao, Lei Su, WeiPeng Chen

In this paper, we reassess these approaches and propose FSPAD (Feature Sampling and Partial Alignment Distillation for Lossless Speculative Decoding), which introduces two straightforward and effective components within the existing framework to boost lossless speculative decoding.

Knowledge Distillation Language Modelling +3

Neural Graph Matching for Video Retrieval in Large-Scale Video-driven E-commerce

no code implementations1 Aug 2024 Houye Ji, Ye Tang, Zhaoxin Chen, Lixi Deng, Jun Hu, Lei Su

In this paper, we first leverage the dual graph to model the co-existing of user-video and user-item interactions in video-driven e-commerce and innovatively reduce user preference understanding to a graph matching problem.

Graph Matching Retrieval +1

M^3:Manipulation Mask Manufacturer for Arbitrary-Scale Super-Resolution Mask

no code implementations4 Jul 2024 Xinyu Yang, Xiaochen Ma, Xuekang Zhu, Bo Du, Lei Su, Bingkui Tong, Zeyu Lei, Jizhe Zhou

Additionally, we created the Manipulation Mask Manufacturer Dataset (MMMD), a dataset that covers a wide range of manipulation techniques.

Change Detection Image Forensics +3

Clover: Regressive Lightweight Speculative Decoding with Sequential Knowledge

1 code implementation1 May 2024 Bin Xiao, Chunan Shi, Xiaonan Nie, Fan Yang, Xiangwei Deng, Lei Su, WeiPeng Chen, Bin Cui

Consequently, the GPU spends most of its time on memory transfer instead of computation.

Light Propagation Prediction through Multimode Optical Fibers with a Deep Neural Network

no code implementations6 Dec 2018 Pengfei Fan, Liang Deng, Lei Su

This work demonstrates a computational method for predicting the light propagation through a single multimode fiber using a deep neural network.

SSIM

Cannot find the paper you are looking for? You can Submit a new open access paper.