1 code implementation • 6 Oct 2022 • Fuwen Tan, Fatemeh Saleh, Brais Martinez
This hints at view sampling being one of the performance bottlenecks for SSL on low-capacity networks.
no code implementations • 16 Jun 2022 • Fatemeh Saleh, Fuwen Tan, Adrian Bulat, Georgios Tzimiropoulos, Brais Martinez
Video self-supervised learning (SSL) suffers from added challenges: video datasets are typically not as large as image datasets, the compute required is an order of magnitude larger, and the number of spurious patterns the optimizer has to sift through is multiplied severalfold.
1 code implementation • 6 May 2022 • Junting Pan, Adrian Bulat, Fuwen Tan, Xiatian Zhu, Lukasz Dudziak, Hongsheng Li, Georgios Tzimiropoulos, Brais Martinez
In this work, pushing further along this under-studied direction, we introduce EdgeViTs, a new family of light-weight ViTs that, for the first time, enable attention-based vision models to compete with the best light-weight CNNs in the tradeoff between accuracy and on-device efficiency.
1 code implementation • ICCV 2021 • Fuwen Tan, Jiangbo Yuan, Vicente Ordonez
Instance-level image retrieval is the task of searching in a large database for images that match an object in a query image.
Ranked #3 on Image Retrieval on RParis (Medium)
1 code implementation • 16 Jan 2020 • Paola Cascante-Bonilla, Fuwen Tan, Yanjun Qi, Vicente Ordonez
Pseudo-labeling assigns pseudo-labels to samples in the unlabeled set using a model trained on the labeled samples together with any previously pseudo-labeled samples, and repeats this process iteratively in a self-training cycle.
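The self-training cycle described above can be sketched in a few lines. This is only an illustrative toy, not the paper's method: the nearest-centroid model on 1-D points, the `min_margin` confidence threshold, the `rounds` limit, and all function names are assumptions introduced for the example.

```python
from statistics import mean


def fit_centroids(points, labels):
    """Train a toy nearest-centroid model: one mean value per class."""
    classes = sorted(set(labels))
    return {c: mean(p for p, l in zip(points, labels) if l == c) for c in classes}


def predict(centroids, x):
    """Return (label, margin) for x.

    margin is the distance to the runner-up centroid minus the distance to
    the closest one; it serves as a crude confidence score (assumption).
    Requires at least two classes.
    """
    ranked = sorted(centroids, key=lambda c: abs(x - centroids[c]))
    best, second = ranked[0], ranked[1]
    margin = abs(x - centroids[second]) - abs(x - centroids[best])
    return best, margin


def pseudo_label(labeled_x, labeled_y, unlabeled_x, rounds=3, min_margin=1.0):
    """Iteratively pseudo-label unlabeled points (hypothetical helper)."""
    pseudo_x, pseudo_y = [], []
    remaining = list(unlabeled_x)
    for _ in range(rounds):
        # Retrain on labeled samples plus everything pseudo-labeled so far.
        model = fit_centroids(labeled_x + pseudo_x, labeled_y + pseudo_y)
        still_unlabeled = []
        for x in remaining:
            label, margin = predict(model, x)
            if margin >= min_margin:
                pseudo_x.append(x)
                pseudo_y.append(label)
            else:
                still_unlabeled.append(x)
        if len(still_unlabeled) == len(remaining):
            break  # no new pseudo-labels this round, so stop early
        remaining = still_unlabeled
    # Final model trained on labeled and pseudo-labeled samples together.
    model = fit_centroids(labeled_x + pseudo_x, labeled_y + pseudo_y)
    return model, list(zip(pseudo_x, pseudo_y)), remaining
```

For instance, calling `pseudo_label([0.0, 10.0], ["a", "b"], [2.0, 3.0, 9.0], min_margin=4.2)` labels `2.0` and `9.0` in the first round; the point `3.0` only clears the threshold in the second round, after the centroids have shifted, which is the effect of retraining on previously pseudo-labeled samples.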
1 code implementation • NeurIPS 2019 • Fuwen Tan, Paola Cascante-Bonilla, Xiaoxiao Guo, Hui Wu, Song Feng, Vicente Ordonez
We show that using multiple rounds of natural language queries as input can be surprisingly effective for finding arbitrarily specific images of complex scenes.
3 code implementations • CVPR 2019 • Fuwen Tan, Song Feng, Vicente Ordonez
In this paper, we propose Text2Scene, a model that generates various forms of compositional scene representations from natural language descriptions.
no code implementations • 4 Jun 2017 • Fuwen Tan, Crispin Bernier, Benjamin Cohen, Vicente Ordonez, Connelly Barnes
Image compositing is a method used to generate realistic yet fake imagery by inserting contents from one image into another.