The "Roaring 20s" of visual recognition began with the introduction of Vision Transformers (ViTs), which quickly superseded ConvNets as the state-of-the-art image classification model.
Ranked #3 on Domain Generalization on ImageNet-Sketch (using extra training data)
In this work we introduce a new optimisation method called SAGA in the spirit of SAG, SDCA, MISO and SVRG, a set of recently proposed incremental gradient algorithms with fast linear convergence rates.
We describe efforts to adapt the Tesseract open source OCR engine for multiple scripts and languages.
In this paper, we consider the problem of detecting and recommending such missing behaviors, a task that we call code sophistication.
Deepfake defense requires research not only into detection but also into generation methods.
Ranked #1 on Face Swapping on FaceForensics++