no code implementations • 6 Aug 2024 • Pavel Suma, Giorgos Kordopatis-Zilos, Ahmet Iscen, Giorgos Tolias
The proposed model uses a transformer-based architecture designed to estimate image-to-image similarity by capturing interactions within and across images based on their local descriptors.
2 code implementations • CVPR 2024 • Mathilde Caron, Ahmet Iscen, Alireza Fathi, Cordelia Schmid
In this paper, we address web-scale visual entity recognition, specifically the task of mapping a given query image to one of the 6 million existing entities in Wikipedia.
no code implementations • 2 Mar 2024 • Ziniu Hu, Ahmet Iscen, Aashi Jain, Thomas Kipf, Yisong Yue, David A. Ross, Cordelia Schmid, Alireza Fathi
SceneCraft first models a scene graph as a blueprint, detailing the spatial relationships among assets in the scene.
no code implementations • 12 Jun 2023 • Ahmet Iscen, Mathilde Caron, Alireza Fathi, Cordelia Schmid
Contrastive image-text models such as CLIP form the building blocks of many state-of-the-art systems.
Ranked #3 on Fine-Grained Image Recognition on OVEN
no code implementations • CVPR 2023 • Ahmet Iscen, Alireza Fathi, Cordelia Schmid
Retrieval augmented models are becoming increasingly popular for computer vision tasks after their recent success in NLP problems.
Ranked #1 on Image Classification on WebVision-1000 (using extra training data)
1 code implementation • CVPR 2023 • Ziniu Hu, Ahmet Iscen, Chen Sun, ZiRui Wang, Kai-Wei Chang, Yizhou Sun, Cordelia Schmid, David A. Ross, Alireza Fathi
REVEAL consists of four key components: the memory, the encoder, the retriever and the generator.
Ranked #9 on Visual Question Answering (VQA) on OK-VQA
no code implementations • 10 Oct 2022 • Ahmet Iscen, Thomas Bird, Mathilde Caron, Alireza Fathi, Cordelia Schmid
We study class-incremental learning, a training setup in which new classes of data are observed over time for the model to learn from.
1 code implementation • CVPR 2022 • Ahmet Iscen, Jack Valmadre, Anurag Arnab, Cordelia Schmid
Recent advances in deep learning have relied on large, labelled datasets to train high-capacity models.
3 code implementations • 12 Apr 2021 • Ahmet Iscen, André Araujo, Boqing Gong, Cordelia Schmid
An effective and simple approach to long-tailed visual recognition is to learn feature representations and a classifier separately, with instance and class-balanced sampling, respectively.
Ranked #13 on Long-tail Learning on iNaturalist 2018
no code implementations • ECCV 2020 • Ahmet Iscen, Jeffrey Zhang, Svetlana Lazebnik, Cordelia Schmid
We assume that the model is updated incrementally for new classes as new data becomes available sequentially. This requires adapting the previously stored feature vectors to the updated feature space without having access to the corresponding original training images.
1 code implementation • ECCV 2020 • Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, Ondrej Chum, Cordelia Schmid
In this work we consider the problem of learning a classifier from noisy labels when a few clean labeled examples are given.
1 code implementation • CVPR 2019 • Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, Ondrej Chum
In this work, we employ a transductive label propagation method that is based on the manifold assumption to make predictions on the entire dataset and use these predictions to generate pseudo-labels for the unlabeled data and train a deep neural network.
no code implementations • ECCV 2018 • Ahmet Iscen, Ondrej Chum
This work addresses approximate nearest neighbor search applied in the domain of large-scale image retrieval.
no code implementations • 23 Jul 2018 • Ahmet Iscen, Yannis Avrithis, Giorgos Tolias, Teddy Furon, Ondrej Chum
State of the art image retrieval performance is achieved with CNN features and manifold ranking using a k-NN similarity graph that is pre-computed off-line.
2 code implementations • CVPR 2018 • Filip Radenović, Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, Ondřej Chum
In particular, annotation errors, the size of the dataset, and the level of challenge are addressed: new annotation for both datasets is created with an extra attention to the reliability of the ground truth.
1 code implementation • CVPR 2018 • Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, Ondrej Chum
Positive examples are distant points on a single manifold, while negative examples are nearby points on different manifolds.
no code implementations • 14 Sep 2017 • Oriane Siméoni, Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, Ondrej Chum
Eliminating the impact of the clutter on the image descriptor increases the chance of retrieving relevant images and prevents topic drift due to actually retrieving the clutter in the case of query expansion.
no code implementations • 21 Apr 2017 • Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, Teddy Furon, Ondrej Chum
Location recognition is commonly treated as visual instance retrieval on "street view" imagery.
1 code implementation • CVPR 2018 • Ahmet Iscen, Yannis Avrithis, Giorgos Tolias, Teddy Furon, Ondrej Chum
This makes the Euclidean nearest neighbor search biased for this task.
3 code implementations • CVPR 2017 • Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, Teddy Furon, Ondrej Chum
The diffusion is carried out on descriptors of overlapping image regions rather than on a global image descriptor like in previous approaches.
no code implementations • CVPR 2016 • Ahmet Iscen, Michael Rabbat, Teddy Furon
Experiments with standard image search benchmarks, including the Yahoo100M dataset comprising 100 million images, show that our method gives comparable (and sometimes superior) accuracy compared to exhaustive search while requiring only 10% of the vector operations and memory.
no code implementations • 10 Dec 2014 • Ahmet Iscen, Teddy Furon, Vincent Gripon, Michael Rabbat, Hervé Jégou
We study an indexing architecture to store and search in a database of high-dimensional vectors from the perspective of statistical signal processing and decision theory.
no code implementations • 29 Oct 2014 • Ahmet Iscen, Giorgos Tolias, Philippe-Henri Gosselin, Hervé Jégou
Our results show that the regular dense detector is outperformed by other methods in most situations, leading us to improve the state of the art in comparable setups on standard retrieval and fined-grain benchmarks.
no code implementations • 3 Jan 2014 • Ahmet Iscen, Eren Golge, Ilker Sarac, Pinar Duygulu
We introduce ConceptVision, a method that aims for high accuracy in categorizing large number of scenes, while keeping the model relatively simpler and efficient for scalability.
no code implementations • 3 Jan 2014 • Ahmet Iscen, Anil Armagan, Pinar Duygulu
Unusual events are important as being possible indicators of undesired consequences.