Search Results for author: Bo Bai

Found 20 papers, 6 papers with code

Efficient Prompt Compression with Evaluator Heads for Long-Context Transformer Inference

no code implementations22 Jan 2025 Weizhi Fei, Xueyan Niu, Guoqing Xie, Yingqing Liu, Bo Bai, Wei Han

We identify specific attention heads in transformer-based LLMs, which we designate as evaluator heads, that are capable of selecting tokens in long inputs that are most significant for inference.

Towards Faster Graph Partitioning via Pre-training and Inductive Inference

1 code implementation1 Sep 2024 Meng Qin, Chaorui Zhang, Yu Gao, Yibin Ding, Weipeng Jiang, Weixi Zhang, Wei Han, Bo Bai

Graph partitioning (GP) is a classic problem that divides the node set of a graph into densely-connected blocks.

Graph Learning graph partitioning

A Mean Field Ansatz for Zero-Shot Weight Transfer

no code implementations16 Aug 2024 Xingyuan Chen, Wenwei Kuang, Lei Deng, Wei Han, Bo Bai, Goncalo dos Reis

Specifically, we propose the row-column (RC) ansatz under the mean field point of view, which describes the measure structure of the weights in the neural network (NN) and admits a close measure dynamic.

Retrieval Meets Reasoning: Dynamic In-Context Editing for Long-Text Understanding

no code implementations18 Jun 2024 Weizhi Fei, Xueyan Niu, Guoqing Xie, Yanhua Zhang, Bo Bai, Lei Deng, Wei Han

Current Large Language Models (LLMs) face inherent limitations due to their pre-defined context lengths, which impede their capacity for multi-hop reasoning within extensive textual contexts.

Information Retrieval knowledge editing +2

Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory

no code implementations14 May 2024 Xueyan Niu, Bo Bai, Lei Deng, Wei Han

In particular, the energy function in modern continuous Hopfield networks serves as an explanation for the attention mechanism, which we approximate with a distance-based energy function.

Memorization

Extreme Video Compression with Pre-trained Diffusion Models

1 code implementation14 Feb 2024 Bohan Li, Yiming Liu, Xueyan Niu, Bo Bai, Lei Deng, Deniz Gündüz

The results showcase the potential of exploiting the temporal relations in video data using generative models.

Decoder Image Compression +1

Extending Context Window of Large Language Models via Semantic Compression

1 code implementation15 Dec 2023 Weizhi Fei, Xueyan Niu, Pingyi Zhou, Lu Hou, Bo Bai, Lei Deng, Wei Han

Transformer-based Large Language Models (LLMs) often impose limitations on the length of the text input to ensure the generation of fluent and relevant responses.

Few-Shot Learning Information Retrieval +4

High Perceptual Quality Wireless Image Delivery with Denoising Diffusion Models

1 code implementation27 Sep 2023 Selim F. Yilmaz, Xueyan Niu, Bo Bai, Wei Han, Lei Deng, Deniz Gunduz

We consider the image transmission problem over a noisy wireless channel via deep learning-based joint source-channel coding (DeepJSCC) along with a denoising diffusion probabilistic model (DDPM) at the receiver.

Denoising

A Constrained BA Algorithm for Rate-Distortion and Distortion-Rate Functions

no code implementations4 May 2023 Lingyi Chen, Shitong Wu, Wenhao Ye, Huihui Wu, Wenyi Zhang, Hao Wu, Bo Bai

The Blahut-Arimoto (BA) algorithm has played a fundamental role in the numerical computation of rate-distortion (RD) functions.

Trading off Quality for Efficiency of Community Detection: An Inductive Method across Graphs

no code implementations29 Sep 2022 Meng Qin, Chaorui Zhang, Bo Bai, Gong Zhang, Dit-yan Yeung

The trained model is then directly generalized to new unseen graphs for online CD without additional optimization, where a better trade-off between quality and efficiency can be achieved.

Combinatorial Optimization Community Detection

GPPF: A General Perception Pre-training Framework via Sparsely Activated Multi-Task Learning

no code implementations3 Aug 2022 Benyuan Sun, Jin Dai, Zihao Liang, Congying Liu, Yi Yang, Bo Bai

SIMT lays the foundation of pre-training with large-scale multi-task multi-domain datasets and is proved essential for stable training in our GPPF experiments.

Multi-Task Learning

A Tensor-BTD-based Modulation for Massive Unsourced Random Access

no code implementations5 Dec 2021 Zhenting Luan, Yuchi Wu, Shansuo Liang, Liping Zhang, Wei Han, Bo Bai

In this letter, we propose a novel tensor-based modulation scheme for massive unsourced random access.

Tensor Decomposition

PartialFed: Cross-Domain Personalized Federated Learning via Partial Initialization

no code implementations NeurIPS 2021 Benyuan Sun, Hongxing Huo, Yi Yang, Bo Bai

The superiority of our algorithm is proved by demonstrating the new state-of-the-art results on cross-domain federated classification and detection.

medical image detection Personalized Federated Learning +1

Branch and Bound in Mixed Integer Linear Programming Problems: A Survey of Techniques and Trends

no code implementations5 Nov 2021 Lingying Huang, Xiaomeng Chen, Wei Huo, Jiazheng Wang, Fan Zhang, Bo Bai, Ling Shi

In order to improve the speed of B&B algorithms, learning techniques have been introduced in this algorithm recently.

Variable Selection

Trading Quality for Efficiency of Graph Partitioning: An Inductive Method across Graphs

no code implementations29 Sep 2021 Meng Qin, Chaorui Zhang, Bo Bai, Gong Zhang, Dit-yan Yeung

IGP is also a generic framework that can capture the permutation invariant partitioning ground-truth of historical snapshots in the offline training and tackle the online GP on graphs with non-fixed number of nodes and clusters.

Combinatorial Optimization Graph Neural Network +1

A Convergent Semi-Proximal Alternating Direction Method of Multipliers for Recovering Internet Traffics from Link Measurements

no code implementations5 Feb 2021 Zhenyu Ming, Liping Zhang, Hao Wu, Yanwei Xu, Mayank Bakshi, Bo Bai, Gong Zhang

Our model can be divided into a series of subproblems, which only relate to the traffics in a certain individual time interval.

Optimization and Control

Lower Bound on the Optimal Access Bandwidth of ($K+2,K,2$)-MDS Array Code with Degraded Read Friendly

no code implementations4 Feb 2021 Ting-Yi Wu, Yunghsiang S. Han, Zhengrui Li, Bo Bai, Gong Zhang, Liang Chen, Xiang Wu

Accessing the data in the failed disk (degraded read) with low latency is crucial for an erasure-coded storage system.

Information Theory Information Theory

Asymmetric Gained Deep Image Compression With Continuous Rate Adaptation

1 code implementation CVPR 2021 Ze Cui, Jing Wang, Shangyin Gao, Bo Bai, Tiansheng Guo, Yihui Feng

With the development of deep learning techniques, the combination of deep learning with image compression has drawn lots of attention.

Image Compression MS-SSIM +2

GCN-GAN: A Non-linear Temporal Link Prediction Model for Weighted Dynamic Networks

1 code implementation26 Jan 2019 Kai Lei, Meng Qin, Bo Bai, Gong Zhang, Min Yang

Different from conventional techniques of temporal link prediction that ignore the potential non-linear characteristics and the informative link weights in the dynamic network, we introduce a novel non-linear model GCN-GAN to tackle the challenging temporal link prediction task of weighted dynamic networks.

Generative Adversarial Network Link Prediction +1

Cannot find the paper you are looking for? You can Submit a new open access paper.