The "Roaring 20s" of visual recognition began with the introduction of Vision Transformers (ViTs), which quickly superseded ConvNets as the state-of-the-art image classification model.
Ranked #3 on
Domain Generalization
on ImageNet-Sketch
(using extra training data)
In this work we introduce a new optimisation method called SAGA in the spirit of SAG, SDCA, MISO and SVRG, a set of recently proposed incremental gradient algorithms with fast linear convergence rates.
Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems.
We describe efforts to adapt the Tesseract open source OCR engine for multiple scripts and languages.
In this paper, we consider the problem of detecting and recommending such missing behaviors, a task that we call code sophistication.
Deepfake defense not only requires the research of detection but also requires the efforts of generation methods.
Ranked #1 on
Face Swapping
on FaceForensics++