Search Results for author: Michael Kidd

Found 2 papers, 2 papers with code

AISFormer: Amodal Instance Segmentation with Transformer

1 code implementation12 Oct 2022 Minh Tran, Khoa Vo, Kashu Yamazaki, Arthur Fernandes, Michael Kidd, Ngan Le

AISFormer explicitly models the complex coherence between occluder, visible, amodal, and invisible masks within an object's regions of interest by treating them as learnable queries.

Amodal Instance Segmentation Decoder +2

VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning

1 code implementation26 Jun 2022 Kashu Yamazaki, Sang Truong, Khoa Vo, Michael Kidd, Chase Rainwater, Khoa Luu, Ngan Le

In this paper, we leverage the human perceiving process, that involves vision and language interaction, to generate a coherent paragraph description of untrimmed videos.

Contrastive Learning Video Captioning

Cannot find the paper you are looking for? You can Submit a new open access paper.