1 code implementation • 23 Oct 2023 • Tong Zheng, Bei Li, Huiwen Bao, Weiqiao Shan, Tong Xiao, Jingbo Zhu
The design choices in Transformer feed-forward neural networks have resulted in significant computational and parameter overhead.
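The overhead claim can be made concrete with a back-of-the-envelope parameter count. The sketch below compares a standard Transformer FFN sublayer against the attention projections at the common "base" configuration (d_model=512, d_ff=2048); these dimensions are illustrative assumptions, not values taken from the paper.

```python
# Minimal sketch (assumed Transformer-base dimensions, not from the paper):
# the FFN sublayer alone holds roughly twice the parameters of the four
# attention projection matrices, which is the overhead the abstract refers to.

def ffn_params(d_model: int, d_ff: int) -> int:
    """Two linear maps, W1 (d_model x d_ff) and W2 (d_ff x d_model), plus biases."""
    return d_model * d_ff + d_ff + d_ff * d_model + d_model

def attention_params(d_model: int) -> int:
    """Q, K, V, and output projections, each d_model x d_model, plus biases."""
    return 4 * (d_model * d_model + d_model)

if __name__ == "__main__":
    d_model, d_ff = 512, 2048  # assumed base-configuration defaults
    print("FFN params:      ", ffn_params(d_model, d_ff))
    print("Attention params:", attention_params(d_model))
```

At these sizes the FFN contributes about 2.1M parameters per layer versus about 1.05M for the attention projections, so design choices in the FFN directly drive model size.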
Ranked #23 on Machine Translation on WMT2014 English-German
1 code implementation • 20 Dec 2022 • Tong Zheng, Bei Li, Huiwen Bao, Tong Xiao, Jingbo Zhu
In this paper, we propose a novel architecture, the Enhanced Interactive Transformer (EIT), to address the issue of head degradation in self-attention mechanisms.
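"Head degradation" here refers to attention heads collapsing into near-identical, redundant patterns. One common way to diagnose it is to compare heads' attention distributions pairwise; the toy sketch below does this with cosine similarity over hand-made attention rows (the head values are hypothetical, and this diagnostic is an assumed illustration, not the EIT method itself).

```python
import math

def cosine(u, v):
    """Cosine similarity between two flattened attention distributions."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

# Hypothetical attention weights of three heads over 4 positions.
head_a = [0.7, 0.1, 0.1, 0.1]
head_b = [0.68, 0.12, 0.1, 0.1]  # nearly identical to head_a -> redundant
head_c = [0.1, 0.1, 0.1, 0.7]    # attends elsewhere -> diverse head

print("a vs b:", round(cosine(head_a, head_b), 3))  # close to 1: degraded pair
print("a vs c:", round(cosine(head_a, head_c), 3))  # much lower: useful diversity
```

A similarity near 1 across many head pairs means the model is paying for heads that carry little distinct information, which is the failure mode EIT is designed to counter.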