Search Results for author: Fuwen Tan

Found 8 papers, 6 papers with code

iBoot: Image-bootstrapped Self-Supervised Video Representation Learning

no code implementations16 Jun 2022 Fatemeh Saleh, Fuwen Tan, Adrian Bulat, Georgios Tzimiropoulos, Brais Martinez

Video self-supervised learning (SSL) suffers from added challenges: video datasets are typically not as large as image datasets, compute is an order of magnitude larger, and the amount of spurious patterns the optimizer has to sieve through is multiplied several fold.

Data Augmentation Representation Learning +1

EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers

1 code implementation6 May 2022 Junting Pan, Adrian Bulat, Fuwen Tan, Xiatian Zhu, Lukasz Dudziak, Hongsheng Li, Georgios Tzimiropoulos, Brais Martinez

In this work, pushing further along this under-studied direction we introduce EdgeViTs, a new family of light-weight ViTs that, for the first time, enable attention-based vision models to compete with the best light-weight CNNs in the tradeoff between accuracy and on-device efficiency.

Instance-level Image Retrieval using Reranking Transformers

1 code implementation ICCV 2021 Fuwen Tan, Jiangbo Yuan, Vicente Ordonez

Instance-level image retrieval is the task of searching in a large database for images that match an object in a query image.

Image Retrieval Retrieval

Curriculum Labeling: Revisiting Pseudo-Labeling for Semi-Supervised Learning

1 code implementation16 Jan 2020 Paola Cascante-Bonilla, Fuwen Tan, Yanjun Qi, Vicente Ordonez

Pseudo-labeling works by applying pseudo-labels to samples in the unlabeled set by using a model trained on the combination of the labeled samples and any previously pseudo-labeled samples, and iteratively repeating this process in a self-training cycle.

Image Classification

Drill-down: Interactive Retrieval of Complex Scenes using Natural Language Queries

1 code implementation NeurIPS 2019 Fuwen Tan, Paola Cascante-Bonilla, Xiaoxiao Guo, Hui Wu, Song Feng, Vicente Ordonez

We show that using multiple rounds of natural language queries as input can be surprisingly effective to find arbitrarily specific images of complex scenes.

Image Retrieval Natural Language Queries +1

Text2Scene: Generating Compositional Scenes from Textual Descriptions

3 code implementations CVPR 2019 Fuwen Tan, Song Feng, Vicente Ordonez

In this paper, we propose Text2Scene, a model that generates various forms of compositional scene representations from natural language descriptions.

Where and Who? Automatic Semantic-Aware Person Composition

no code implementations4 Jun 2017 Fuwen Tan, Crispin Bernier, Benjamin Cohen, Vicente Ordonez, Connelly Barnes

Image compositing is a method used to generate realistic yet fake imagery by inserting contents from one image to another.

Cannot find the paper you are looking for? You can Submit a new open access paper.