Search Results for author: Shengyi Gao

Found 2 papers, 2 papers with code

AVSegFormer: Audio-Visual Segmentation with Transformer

1 code implementation3 Jul 2023 Shengyi Gao, Zhe Chen, Guo Chen, Wenhai Wang, Tong Lu

In this paper, we propose AVSegFormer, a novel framework for AVS tasks that leverages the transformer architecture.

Decoder Scene Understanding +1

Champion Solution for the WSDM2023 Toloka VQA Challenge

1 code implementation22 Jan 2023 Shengyi Gao, Zhe Chen, Guo Chen, Wenhai Wang, Tong Lu

In this report, we present our champion solution to the WSDM2023 Toloka Visual Question Answering (VQA) Challenge.

Question Answering Visual Grounding +1

Cannot find the paper you are looking for? You can Submit a new open access paper.