no code implementations • 22 Jan 2025 • Weizhi Fei, Xueyan Niu, Guoqing Xie, Yingqing Liu, Bo Bai, Wei Han
We identify specific attention heads in transformer-based LLMs, which we designate as evaluator heads, that are capable of selecting tokens in long inputs that are most significant for inference.
1 code implementation • 1 Sep 2024 • Meng Qin, Chaorui Zhang, Yu Gao, Yibin Ding, Weipeng Jiang, Weixi Zhang, Wei Han, Bo Bai
Graph partitioning (GP) is a classic problem that divides the node set of a graph into densely-connected blocks.
no code implementations • 16 Aug 2024 • Xingyuan Chen, Wenwei Kuang, Lei Deng, Wei Han, Bo Bai, Goncalo dos Reis
Specifically, we propose the row-column (RC) ansatz under the mean field point of view, which describes the measure structure of the weights in the neural network (NN) and admits a close measure dynamic.
no code implementations • 18 Jun 2024 • Weizhi Fei, Xueyan Niu, Guoqing Xie, Yanhua Zhang, Bo Bai, Lei Deng, Wei Han
Current Large Language Models (LLMs) face inherent limitations due to their pre-defined context lengths, which impede their capacity for multi-hop reasoning within extensive textual contexts.
no code implementations • 14 May 2024 • Xueyan Niu, Bo Bai, Lei Deng, Wei Han
In particular, the energy function in modern continuous Hopfield networks serves as an explanation for the attention mechanism, which we approximate with a distance-based energy function.
1 code implementation • 14 Feb 2024 • Bohan Li, Yiming Liu, Xueyan Niu, Bo Bai, Lei Deng, Deniz Gündüz
The results showcase the potential of exploiting the temporal relations in video data using generative models.
1 code implementation • 15 Dec 2023 • Weizhi Fei, Xueyan Niu, Pingyi Zhou, Lu Hou, Bo Bai, Lei Deng, Wei Han
Transformer-based Large Language Models (LLMs) often impose limitations on the length of the text input to ensure the generation of fluent and relevant responses.
1 code implementation • 27 Sep 2023 • Selim F. Yilmaz, Xueyan Niu, Bo Bai, Wei Han, Lei Deng, Deniz Gunduz
We consider the image transmission problem over a noisy wireless channel via deep learning-based joint source-channel coding (DeepJSCC) along with a denoising diffusion probabilistic model (DDPM) at the receiver.
no code implementations • 4 May 2023 • Lingyi Chen, Shitong Wu, Wenhao Ye, Huihui Wu, Wenyi Zhang, Hao Wu, Bo Bai
The Blahut-Arimoto (BA) algorithm has played a fundamental role in the numerical computation of rate-distortion (RD) functions.
no code implementations • 29 Sep 2022 • Meng Qin, Chaorui Zhang, Bo Bai, Gong Zhang, Dit-yan Yeung
The trained model is then directly generalized to new unseen graphs for online CD without additional optimization, where a better trade-off between quality and efficiency can be achieved.
no code implementations • 3 Aug 2022 • Benyuan Sun, Jin Dai, Zihao Liang, Congying Liu, Yi Yang, Bo Bai
SIMT lays the foundation of pre-training with large-scale multi-task multi-domain datasets and is proved essential for stable training in our GPPF experiments.
no code implementations • 5 Dec 2021 • Zhenting Luan, Yuchi Wu, Shansuo Liang, Liping Zhang, Wei Han, Bo Bai
In this letter, we propose a novel tensor-based modulation scheme for massive unsourced random access.
no code implementations • NeurIPS 2021 • Benyuan Sun, Hongxing Huo, Yi Yang, Bo Bai
The superiority of our algorithm is proved by demonstrating the new state-of-the-art results on cross-domain federated classification and detection.
no code implementations • 29 Nov 2021 • Zhenting Luan, Zhenyu Ming, Yuchi Wu, Wei Han, Xiang Chen, Bo Bai, Liping Zhang
We also develop a novel subcarrier recovery method for the proposed model.
no code implementations • 5 Nov 2021 • Lingying Huang, Xiaomeng Chen, Wei Huo, Jiazheng Wang, Fan Zhang, Bo Bai, Ling Shi
In order to improve the speed of B&B algorithms, learning techniques have been introduced in this algorithm recently.
no code implementations • 29 Sep 2021 • Meng Qin, Chaorui Zhang, Bo Bai, Gong Zhang, Dit-yan Yeung
IGP is also a generic framework that can capture the permutation invariant partitioning ground-truth of historical snapshots in the offline training and tackle the online GP on graphs with non-fixed number of nodes and clusters.
no code implementations • 5 Feb 2021 • Zhenyu Ming, Liping Zhang, Hao Wu, Yanwei Xu, Mayank Bakshi, Bo Bai, Gong Zhang
Our model can be divided into a series of subproblems, which only relate to the traffics in a certain individual time interval.
Optimization and Control
no code implementations • 4 Feb 2021 • Ting-Yi Wu, Yunghsiang S. Han, Zhengrui Li, Bo Bai, Gong Zhang, Liang Chen, Xiang Wu
Accessing the data in the failed disk (degraded read) with low latency is crucial for an erasure-coded storage system.
Information Theory Information Theory
1 code implementation • CVPR 2021 • Ze Cui, Jing Wang, Shangyin Gao, Bo Bai, Tiansheng Guo, Yihui Feng
With the development of deep learning techniques, the combination of deep learning with image compression has drawn lots of attention.
1 code implementation • 26 Jan 2019 • Kai Lei, Meng Qin, Bo Bai, Gong Zhang, Min Yang
Different from conventional techniques of temporal link prediction that ignore the potential non-linear characteristics and the informative link weights in the dynamic network, we introduce a novel non-linear model GCN-GAN to tackle the challenging temporal link prediction task of weighted dynamic networks.