Search Results for author: Shengyi Gao

AVSegFormer: Audio-Visual Segmentation with Transformer

In this paper, we propose AVSegFormer, a novel framework for AVS tasks that leverages the transformer architecture.

Paper
Code

In this report, we present our champion solution to the WSDM2023 Toloka Visual Question Answering (VQA) Challenge.

1,120

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.