Search Results for author: Thomas Serre

Found 46 papers, 18 papers with code

Neural representation of action sequences: how far can a simple snippet-matching model take us?

no code implementations NeurIPS 2013 Cheston Tan, Jedediah M. Singer, Thomas Serre, David Sheinberg, Tomaso Poggio

The macaque Superior Temporal Sulcus (STS) is a brain area that receives and integrates inputs from both the ventral and dorsal visual processing streams (thought to specialize in form and motion processing respectively).

STS

Neuronal Synchrony in Complex-Valued Deep Networks

no code implementations20 Dec 2013 David P. Reichert, Thomas Serre

Here, we show how aspects of spike timing, long hypothesized to play a crucial role in cortical information processing, could be incorporated into deep networks to build richer, versatile representations.

Cooking in the kitchen: Recognizing and Segmenting Human Activities in Videos

no code implementations25 Aug 2015 Hilde Kuehne, Juergen Gall, Thomas Serre

Through extensive system evaluations, we demonstrate that combining compact video representations based on Fisher Vectors with HMM-based modeling yields very significant gains in accuracy and when properly trained with sufficient training samples, structured temporal models outperform unstructured bag-of-word types of models by a large margin on the tested performance metric.

Action Recognition Temporal Action Localization

How Deep is the Feature Analysis underlying Rapid Visual Categorization?

no code implementations NeurIPS 2016 Sven Eberhardt, Jonah Cader, Thomas Serre

We found that recognition accuracy increases with higher stages of visual processing (higher level stages indeed outperforming human participants on the same task) but that human decisions agree best with predictions from intermediate stages.

Object Recognition

What are the visual features underlying human versus machine vision?

no code implementations10 Jan 2017 Drew Linsley, Sven Eberhardt, Tarun Sharma, Pankaj Gupta, Thomas Serre

Our study demonstrates that the narrowing gap between the object recognition accuracy of human observers and DCNs obscures distinct visual strategies used by each to achieve this performance.

Object Object Recognition

Same-different problems strain convolutional neural networks

no code implementations9 Feb 2018 Matthew Ricci, Junkyung Kim, Thomas Serre

The robust and efficient recognition of visual relations in images is a hallmark of biological vision.

Memorization Visual Reasoning

Learning long-range spatial dependencies with horizontal gated-recurrent units

1 code implementation NeurIPS 2018 Drew Linsley, Junkyung Kim, Vijay Veerabadran, Thomas Serre

As a prime example, convolutional neural networks, a type of feedforward neural networks, are now approaching -- and sometimes even surpassing -- human accuracy on a variety of visual recognition tasks.

Contour Detection

Learning what and where to attend

1 code implementation22 May 2018 Drew Linsley, Dan Shiebler, Sven Eberhardt, Thomas Serre

Most recent gains in visual recognition have originated from the inclusion of attention mechanisms in deep convolutional networks (DCNs).

Image Categorization Object Recognition

Robust pose tracking with a joint model of appearance and shape

no code implementations28 Jun 2018 Yuliang Guo, Lakshmi Narasimhan Govindarajan, Benjamin Kimia, Thomas Serre

We present a novel approach for estimating the 2D pose of an articulated object with an application to automated video analysis of small laboratory animals.

Pose Tracking

Learning what and where to attend with humans in the loop

no code implementations ICLR 2019 Drew Linsley, Dan Shiebler, Sven Eberhardt, Thomas Serre

Most recent gains in visual recognition have originated from the inclusion of attention mechanisms in deep convolutional networks (DCNs).

Image Categorization Object Recognition

Disentangling neural mechanisms for perceptual grouping

no code implementations ICLR 2020 Junkyung Kim, Drew Linsley, Kalpit Thakkar, Thomas Serre

Forming perceptual groups and individuating objects in visual scenes is an essential step towards visual intelligence.

Object

Stable and expressive recurrent vision models

1 code implementation NeurIPS 2020 Drew Linsley, Alekh Karkada Ashok, Lakshmi Narasimhan Govindarajan, Rex Liu, Thomas Serre

We posit that the effectiveness of recurrent vision models is bottlenecked by the standard algorithm used for training them, "back-propagation through time" (BPTT), which has O(N) memory-complexity for training an N step model.

Panoptic Segmentation

Go with the Flow: Adaptive Control for Neural ODEs

no code implementations ICLR 2021 Mathieu Chalvidal, Matthew Ricci, Rufin VanRullen, Thomas Serre

Despite their elegant formulation and lightweight memory cost, neural ordinary differential equations (NODEs) suffer from known representational limitations.

Image Reconstruction Representation Learning

How Good is your Explanation? Algorithmic Stability Measures to Assess the Quality of Explanations for Deep Neural Networks

no code implementations7 Sep 2020 Thomas Fel, David Vigouroux, Rémi Cadène, Thomas Serre

A plethora of methods have been proposed to explain how deep neural networks reach their decisions but comparatively, little effort has been made to ensure that the explanations produced by these methods are objectively relevant.

Image Classification

Recurrent neural circuits for contour detection

no code implementations ICLR 2020 Drew Linsley, Junkyung Kim, Alekh Ashok, Thomas Serre

We introduce a deep recurrent neural network architecture that approximates visual cortical circuits.

Contour Detection

KuraNet: Systems of Coupled Oscillators that Learn to Synchronize

1 code implementation6 May 2021 Matthew Ricci, Minju Jung, Yuwei Zhang, Mathieu Chalvidal, Aneri Soni, Thomas Serre

Here, we present a single approach to both of these problems in the form of "KuraNet", a deep-learning-based system of coupled oscillators that can learn to synchronize across a distribution of disordered network conditions.

Tracking Without Re-recognition in Humans and Machines

no code implementations NeurIPS 2021 Drew Linsley, Girik Malik, Junkyung Kim, Lakshmi N Govindarajan, Ennio Mingolla, Thomas Serre

For this, we introduce PathTracker, a synthetic visual challenge that asks human observers and machines to track a target object in the midst of identical-looking "distractor" objects.

Decision Making Object +2

Understanding the computational demands underlying visual reasoning

no code implementations8 Aug 2021 Mohit Vaishnav, Remi Cadene, Andrea Alamia, Drew Linsley, Rufin VanRullen, Thomas Serre

Our analysis reveals a novel taxonomy of visual reasoning tasks, which can be primarily explained by both the type of relations (same-different vs. spatial-relation judgments) and the number of relations used to compose the underlying rules.

Visual Reasoning

The Challenge of Appearance-Free Object Tracking with Feedforward Neural Networks

no code implementations30 Sep 2021 Girik Malik, Drew Linsley, Thomas Serre, Ennio Mingolla

Here, we introduce $\textit{PathTracker}$, a visual challenge inspired by cognitive psychology, which tests the ability of observers to learn to track objects solely by their motion.

Object Object Recognition +1

What I Cannot Predict, I Do Not Understand: A Human-Centered Evaluation Framework for Explainability Methods

1 code implementation6 Dec 2021 Julien Colin, Thomas Fel, Remi Cadene, Thomas Serre

A multitude of explainability methods and associated fidelity performance metrics have been proposed to help better understand how modern AI systems make decisions.

Meta-Reinforcement Learning with Self-Modifying Networks

no code implementations4 Feb 2022 Mathieu Chalvidal, Thomas Serre, Rufin VanRullen

Deep Reinforcement Learning has demonstrated the potential of neural networks tuned with gradient descent for solving complex tasks in well-delimited environments.

Meta Reinforcement Learning One-Shot Learning +2

Diversity vs. Recognizability: Human-like generalization in one-shot generative models

2 code implementations20 May 2022 Victor Boutin, Lakshya Singhal, Xavier Thomas, Thomas Serre

Robust generalization to new concepts has long remained a distinctive feature of human intelligence.

Disentanglement

GAMR: A Guided Attention Model for (visual) Reasoning

1 code implementation10 Jun 2022 Mohit Vaishnav, Thomas Serre

Humans continue to outperform modern AI systems in their ability to flexibly parse and understand complex visual scenes.

Visual Reasoning Zero-shot Generalization

A Benchmark for Compositional Visual Reasoning

1 code implementation11 Jun 2022 Aimen Zerroug, Mohit Vaishnav, Julien Colin, Sebastian Musslick, Thomas Serre

Overall, we hope that our challenge will spur interest in the development of neural architectures that can learn to harness compositionality toward more efficient learning.

Visual Reasoning

Harmonizing the object recognition strategies of deep neural networks with humans

3 code implementations8 Nov 2022 Thomas Fel, Ivan Felipe, Drew Linsley, Thomas Serre

Across 84 different DNNs trained on ImageNet and three independent datasets measuring the where and the how of human visual strategies for object recognition on those images, we find a systematic trade-off between DNN categorization accuracy and alignment with human visual strategies for object recognition.

Object Object Recognition

CRAFT: Concept Recursive Activation FacTorization for Explainability

1 code implementation CVPR 2023 Thomas Fel, Agustin Picard, Louis Bethune, Thibaut Boissin, David Vigouroux, Julien Colin, Rémi Cadène, Thomas Serre

However, recent research has exposed the limited practical value of these methods, attributed in part to their narrow focus on the most prominent regions of an image -- revealing "where" the model looks, but failing to elucidate "what" the model sees in those areas.

Break It Down: Evidence for Structural Compositionality in Neural Networks

1 code implementation NeurIPS 2023 Michael A. Lepori, Thomas Serre, Ellie Pavlick

Though modern neural networks have achieved impressive performance in both vision and language tasks, we know little about the functions that they implement.

Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines?

1 code implementation27 Jan 2023 Victor Boutin, Thomas Fel, Lakshya Singhal, Rishav Mukherji, Akash Nagaraj, Julien Colin, Thomas Serre

An important milestone for AI is the development of algorithms that can produce drawings that are indistinguishable from those of humans.

Adversarial alignment: Breaking the trade-off between the strength of an attack and its relevance to human perception

no code implementations5 Jun 2023 Drew Linsley, Pinyuan Feng, Thibaut Boissin, Alekh Karkada Ashok, Thomas Fel, Stephanie Olaiya, Thomas Serre

Harmonized DNNs achieve the best of both worlds and experience attacks that are detectable and affect features that humans find diagnostic for recognition, meaning that attacks on these models are more likely to be rendered ineffective by inducing similar effects on human perception.

Adversarial Attack Adversarial Robustness +2

Unlocking Feature Visualization for Deeper Networks with MAgnitude Constrained Optimization

1 code implementation11 Jun 2023 Thomas Fel, Thibaut Boissin, Victor Boutin, Agustin Picard, Paul Novello, Julien Colin, Drew Linsley, Tom Rousseau, Rémi Cadène, Laurent Gardes, Thomas Serre

However, its widespread adoption has been limited due to a reliance on tricks to generate interpretable images, and corresponding challenges in scaling it to deeper neural networks.

Saliency strikes back: How filtering out high frequencies improves white-box explanations

no code implementations18 Jul 2023 Sabine Muzellec, Thomas Fel, Victor Boutin, Léo Andéol, Rufin VanRullen, Thomas Serre

Attribution methods correspond to a class of explainability methods (XAI) that aim to assess how individual inputs contribute to a model's decision-making process.

Computational Efficiency Decision Making

NeuroSurgeon: A Toolkit for Subnetwork Analysis

1 code implementation1 Sep 2023 Michael A. Lepori, Ellie Pavlick, Thomas Serre

Despite recent advances in the field of explainability, much remains unknown about the algorithms that neural networks learn to represent.

Fixing the problems of deep neural networks will require better training data and learning algorithms

no code implementations26 Sep 2023 Drew Linsley, Thomas Serre

Bowers and colleagues argue that DNNs are poor models of biological vision because they often learn to rival human accuracy by relying on strategies that differ markedly from those of humans.

Uncovering Intermediate Variables in Transformers using Circuit Probing

1 code implementation7 Nov 2023 Michael A. Lepori, Thomas Serre, Ellie Pavlick

We apply this method to models trained on simple arithmetic tasks, demonstrating its effectiveness at (1) deciphering the algorithms that models have learned, (2) revealing modular structure within a model, and (3) tracking the development of circuits over training.

Language Modelling Sentence

Categorizing the Visual Environment and Analyzing the Visual Attention of Dogs

no code implementations20 Nov 2023 Shreyas Sundara Raman, Madeline H. Pelgrim, Daphna Buchsbaum, Thomas Serre

The MaskRCNN, with eye tracking apparatus, serves as an end to end model for automatically classifying the visual fixations of dogs.

Cannot find the paper you are looking for? You can Submit a new open access paper.