Search Results for author: Yann Lecun

Found 135 papers, 76 papers with code

EgoPet: Egomotion and Interaction Data from an Animal's Perspective

no code implementations • 15 Apr 2024 • Amir Bar, Arya Bakhtiar, Danny Tran, Antonio Loquercio, Jathushan Rajasegaran, Yann Lecun, Amir Globerson, Trevor Darrell

Animals perceive the world to plan their actions and interact with other agents to accomplish complex tasks, demonstrating capabilities that are still unmatched by AI systems.

Learning and Leveraging World Models in Visual Representation Learning

no code implementations • 1 Mar 2024 • Quentin Garrido, Mahmoud Assran, Nicolas Ballas, Adrien Bardes, Laurent Najman, Yann Lecun

Joint-Embedding Predictive Architecture (JEPA) has emerged as a promising self-supervised approach that learns by leveraging a world model.

Representation Learning

Learning by Reconstruction Produces Uninformative Features For Perception

no code implementations • 17 Feb 2024 • Randall Balestriero, Yann Lecun

Despite interpretability of the reconstruction and generation, we identify a misalignment between learning by reconstruction, and learning for perception.

Denoising Representation Learning

Revisiting Feature Prediction for Learning Visual Representations from Video

1 code implementation • arXiv preprint 2024 • Adrien Bardes, Quentin Garrido, Jean Ponce, Xinlei Chen, Michael Rabbat, Yann Lecun, Mahmoud Assran, Nicolas Ballas

This paper explores feature prediction as a stand-alone objective for unsupervised learning from video and introduces V-JEPA, a collection of vision models trained solely using a feature prediction objective, without the use of pretrained image encoders, text, negative examples, reconstruction, or other sources of supervision.

2,345

G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering

1 code implementation • 12 Feb 2024 • Xiaoxin He, Yijun Tian, Yifei Sun, Nitesh V. Chawla, Thomas Laurent, Yann Lecun, Xavier Bresson, Bryan Hooi

Given a graph with textual attributes, we enable users to "chat with their graph": that is, to ask questions about the graph using a conversational interface.

Common Sense Reasoning, Graph Classification, +4

142

Fast and Exact Enumeration of Deep Networks Partitions Regions

no code implementations • 20 Jan 2024 • Randall Balestriero, Yann Lecun

One fruitful formulation of Deep Networks (DNs) enabling their theoretical study and providing practical guidelines to practitioners relies on Piecewise Affine Splines.

Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs

1 code implementation • 11 Jan 2024 • Shengbang Tong, Zhuang Liu, Yuexiang Zhai, Yi Ma, Yann Lecun, Saining Xie

To understand the roots of these errors, we explore the gap between the visual embedding space of CLIP and vision-only self-supervised learning.

Representation Learning, Self-Supervised Learning, +1

198

Gradient-based Planning with World Models

no code implementations • 28 Dec 2023 • Jyothir S V, Siddhartha Jalagam, Yann Lecun, Vlad Sobal

The enduring challenge in the field of artificial intelligence has been the control of systems to achieve desired behaviours.

Model Predictive Control

GAIA: a benchmark for General AI Assistants

1 code implementation • 21 Nov 2023 • Grégoire Mialon, Clémentine Fourrier, Craig Swift, Thomas Wolf, Yann Lecun, Thomas Scialom

GAIA's philosophy departs from the current trend in AI benchmarks suggesting to target tasks that are ever more difficult for humans.

Philosophy

URLOST: Unsupervised Representation Learning without Stationarity or Topology

no code implementations • 6 Oct 2023 • Zeyu Yun, Juexiao Zhang, Bruno Olshausen, Yann Lecun, Yubei Chen

Unsupervised representation learning has seen tremendous progress but is constrained by its reliance on data modality-specific stationarity and topology, a limitation not found in biological intelligence systems.

Representation Learning

Stochastic positional embeddings improve masked image modeling

no code implementations • 31 Jul 2023 • Amir Bar, Florian Bordes, Assaf Shocher, Mahmoud Assran, Pascal Vincent, Nicolas Ballas, Trevor Darrell, Amir Globerson, Yann Lecun

Masked Image Modeling (MIM) is a promising self-supervised learning approach that enables learning from unlabeled images.

Language Modelling, Masked Language Modeling, +3

MC-JEPA: A Joint-Embedding Predictive Architecture for Self-Supervised Learning of Motion and Content Features

no code implementations • 24 Jul 2023 • Adrien Bardes, Jean Ponce, Yann Lecun

Self-supervised learning of visual representations has been focusing on learning content features, which do not capture object motion or location, and focus on identifying and differentiating objects in images and videos.

Optical Flow Estimation, Self-Supervised Learning, +1

Self-Supervised Learning with Lie Symmetries for Partial Differential Equations

1 code implementation • NeurIPS 2023 • Grégoire Mialon, Quentin Garrido, Hannah Lawrence, Danyal Rehman, Yann Lecun, Bobak T. Kiani

Machine learning for differential equations paves the way for computationally efficient alternatives to numerical solvers, with potentially broad impacts in science and engineering.

Representation Learning, Self-Supervised Learning

Variance-Covariance Regularization Improves Representation Learning

no code implementations • 23 Jun 2023 • Jiachen Zhu, Katrina Evtimova, Yubei Chen, Ravid Shwartz-Ziv, Yann Lecun

In summary, VCReg offers a universally applicable regularization framework that significantly advances transfer learning and highlights the connection between gradient starvation, neural collapse, and feature transferability.

Long-tail Learning, Representation Learning, +2

Introduction to Latent Variable Energy-Based Models: A Path Towards Autonomous Machine Intelligence

no code implementations • 5 Jun 2023 • Anna Dawid, Yann Lecun

Current automated systems have crucial limitations that need to be addressed before artificial intelligence can reach human-like levels and bring new technological revolutions.

Self-Driving Cars

Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning

3 code implementations • 31 May 2023 • Xiaoxin He, Xavier Bresson, Thomas Laurent, Adam Perold, Yann Lecun, Bryan Hooi

With the advent of powerful large language models (LLMs) such as GPT or Llama2, which demonstrate an ability to reason and to utilize general knowledge, there is a growing need for techniques which combine the textual modelling abilities of LLMs with the structural learning capabilities of GNNs.

Ranked #2 on Node Property Prediction on ogbn-arxiv (using extra training data)

Decision Making, General Knowledge, +4

171

Reverse Engineering Self-Supervised Learning

1 code implementation • NeurIPS 2023 • Ido Ben-Shaul, Ravid Shwartz-Ziv, Tomer Galanti, Shai Dekel, Yann Lecun

Self-supervised learning (SSL) is a powerful tool in machine learning, but understanding the learned representations and their underlying mechanisms remains a challenge.

Clustering, Representation Learning, +1

2,743

A Cookbook of Self-Supervised Learning

no code implementations • 24 Apr 2023 • Randall Balestriero, Mark Ibrahim, Vlad Sobal, Ari Morcos, Shashank Shekhar, Tom Goldstein, Florian Bordes, Adrien Bardes, Gregoire Mialon, Yuandong Tian, Avi Schwarzschild, Andrew Gordon Wilson, Jonas Geiping, Quentin Garrido, Pierre Fernandez, Amir Bar, Hamed Pirsiavash, Yann Lecun, Micah Goldblum

Self-supervised learning, dubbed the dark matter of intelligence, is a promising path to advance machine learning.

Navigate, Self-Supervised Learning

To Compress or Not to Compress- Self-Supervised Learning and Information Theory: A Review

no code implementations • 19 Apr 2023 • Ravid Shwartz-Ziv, Yann Lecun

Information theory, and notably the information bottleneck principle, has been pivotal in shaping deep neural networks.

Self-Supervised Learning

EMP-SSL: Towards Self-Supervised Learning in One Training Epoch

2 code implementations • 8 Apr 2023 • Shengbang Tong, Yubei Chen, Yi Ma, Yann Lecun

Recently, self-supervised learning (SSL) has achieved tremendous success in learning image representation.

Quantization, Self-Supervised Learning

214

Active Self-Supervised Learning: A Few Low-Cost Relationships Are All You Need

1 code implementation • ICCV 2023 • Vivien Cabannes, Leon Bottou, Yann Lecun, Randall Balestriero

Third, it provides a proper active learning framework yielding low-cost solutions to annotate datasets, arguably bridging the gap between theory and practice of active learning that is based on simple-to-answer-by-non-experts queries of semantic relationships between inputs.

Active Learning, Self-Supervised Learning

An Information-Theoretic Perspective on Variance-Invariance-Covariance Regularization

no code implementations • 1 Mar 2023 • Ravid Shwartz-Ziv, Randall Balestriero, Kenji Kawaguchi, Tim G. J. Rudner, Yann Lecun

In this paper, we provide an information-theoretic perspective on Variance-Invariance-Covariance Regularization (VICReg) for self-supervised learning.

Self-Supervised Learning, Transfer Learning

Augmented Language Models: a Survey

1 code implementation • 15 Feb 2023 • Grégoire Mialon, Roberto Dessì, Maria Lomeli, Christoforos Nalmpantis, Ram Pasunuru, Roberta Raileanu, Baptiste Rozière, Timo Schick, Jane Dwivedi-Yu, Asli Celikyilmaz, Edouard Grave, Yann Lecun, Thomas Scialom

This survey reviews works in which language models (LMs) are augmented with reasoning skills and the ability to use tools.

Language Modelling

273

Self-supervised learning of Split Invariant Equivariant representations

1 code implementation • 14 Feb 2023 • Quentin Garrido, Laurent Najman, Yann Lecun

We hope that both our introduced dataset and approach will enable learning richer representations without supervision in more complex scenarios.

Self-Supervised Learning

The SSL Interplay: Augmentations, Inductive Bias, and Generalization

no code implementations • 6 Feb 2023 • Vivien Cabannes, Bobak T. Kiani, Randall Balestriero, Yann Lecun, Alberto Bietti

Self-supervised learning (SSL) has emerged as a powerful framework to learn representations from raw data without supervision.

Data Augmentation, Inductive Bias, +1

Blockwise Self-Supervised Learning at Scale

1 code implementation • 3 Feb 2023 • Shoaib Ahmed Siddiqui, David Krueger, Yann Lecun, Stéphane Deny

Current state-of-the-art deep networks are all powered by backpropagation.

Self-Supervised Learning

Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture

3 code implementations • CVPR 2023 • Mahmoud Assran, Quentin Duval, Ishan Misra, Piotr Bojanowski, Pascal Vincent, Michael Rabbat, Yann Lecun, Nicolas Ballas

This paper demonstrates an approach for learning highly semantic image representations without relying on hand-crafted data-augmentations.

Depth Estimation, Depth Prediction, +2

2,743

A Generalization of ViT/MLP-Mixer to Graphs

3 code implementations • 27 Dec 2022 • Xiaoxin He, Bryan Hooi, Thomas Laurent, Adam Perold, Yann Lecun, Xavier Bresson

First, they capture long-range dependency and mitigate the issue of over-squashing as demonstrated on Long Range Graph Benchmark and TreeNeighbourMatch datasets.

Ranked #1 on Graph Regression on Peptides-struct

Graph Classification, Graph Regression, +1

136

Joint Embedding Predictive Architectures Focus on Slow Features

1 code implementation • 20 Nov 2022 • Vlad Sobal, Jyothir S V, Siddhartha Jalagam, Nicolas Carion, Kyunghyun Cho, Yann Lecun

Many common methods for learning a world model for pixel-based environments use generative architectures trained with pixel-level reconstruction objectives.

POLICE: Provably Optimal Linear Constraint Enforcement for Deep Neural Networks

1 code implementation • 2 Nov 2022 • Randall Balestriero, Yann Lecun

In this paper we propose the first provable affine constraint enforcement method for DNNs that only requires minimal changes to a given DNN's forward pass, that is computationally friendly, and that leaves the optimization of the DNN's parameters unconstrained, i.e., standard gradient-based methods can be employed.

Unsupervised Learning of Structured Representations via Closed-Loop Transcription

1 code implementation • 30 Oct 2022 • Shengbang Tong, Xili Dai, Yubei Chen, Mingyang Li, Zengyi Li, Brent Yi, Yann Lecun, Yi Ma

This paper proposes an unsupervised method for learning a unified representation that serves both discriminative and generative purposes.

Toward Next-Generation Artificial Intelligence: Catalyzing the NeuroAI Revolution

no code implementations • 15 Oct 2022 • Anthony Zador, Sean Escola, Blake Richards, Bence Ölveczky, Yoshua Bengio, Kwabena Boahen, Matthew Botvinick, Dmitri Chklovskii, Anne Churchland, Claudia Clopath, James DiCarlo, Surya Ganguli, Jeff Hawkins, Konrad Koerding, Alexei Koulakov, Yann Lecun, Timothy Lillicrap, Adam Marblestone, Bruno Olshausen, Alexandre Pouget, Cristina Savin, Terrence Sejnowski, Eero Simoncelli, Sara Solla, David Sussillo, Andreas S. Tolias, Doris Tsao

Neuroscience has long been an essential driver of progress in artificial intelligence (AI).

VoLTA: Vision-Language Transformer with Weakly-Supervised Local-Feature Alignment

1 code implementation • 9 Oct 2022 • Shraman Pramanick, Li Jing, Sayan Nag, Jiachen Zhu, Hardik Shah, Yann Lecun, Rama Chellappa

Extensive experiments on a wide range of vision- and vision-language downstream tasks demonstrate the effectiveness of VoLTA on fine-grained applications without compromising the coarse-grained downstream performance, often outperforming methods using significantly more caption and box annotations.

object-detection, Object Detection, +2

RankMe: Assessing the downstream performance of pretrained self-supervised representations by their rank

no code implementations • 5 Oct 2022 • Quentin Garrido, Randall Balestriero, Laurent Najman, Yann Lecun

Joint-Embedding Self Supervised Learning (JE-SSL) has seen a rapid development, with the emergence of many method variations but only few principled guidelines that would help practitioners to successfully deploy them.

Self-Supervised Learning

VICRegL: Self-Supervised Learning of Local Visual Features

3 code implementations • 4 Oct 2022 • Adrien Bardes, Jean Ponce, Yann Lecun

Most recent self-supervised methods for learning image representations focus on either producing a global feature with invariance properties, or producing a set of local features.

Segmentation, Self-Supervised Learning

2,743

Minimalistic Unsupervised Learning with the Sparse Manifold Transform

no code implementations • 30 Sep 2022 • Yubei Chen, Zeyu Yun, Yi Ma, Bruno Olshausen, Yann Lecun

Though there remains a small performance gap between our simple constructive model and SOTA methods, the evidence points to this as a promising direction for achieving a principled and white-box approach to unsupervised learning.

Ranked #1 on Unsupervised MNIST on MNIST

Self-Supervised Learning, Sparse Representation-based Classification, +3

Joint Embedding Self-Supervised Learning in the Kernel Regime

no code implementations • 29 Sep 2022 • Bobak T. Kiani, Randall Balestriero, Yubei Chen, Seth Lloyd, Yann Lecun

The fundamental goal of self-supervised learning (SSL) is to produce useful representations of data without access to any labels for classifying the data.

Self-Supervised Learning

Variance Covariance Regularization Enforces Pairwise Independence in Self-Supervised Representations

no code implementations • 29 Sep 2022 • Grégoire Mialon, Randall Balestriero, Yann Lecun

Self-Supervised Learning (SSL) methods such as VICReg, Barlow Twins or W-MSE avoid collapse of their joint embedding architectures by constraining or regularizing the covariance matrix of their projector's output.

Domain Generalization, Self-Supervised Learning

Light-weight probing of unsupervised representations for Reinforcement Learning

1 code implementation • 25 Aug 2022 • Wancong Zhang, Anthony GX-Chen, Vlad Sobal, Yann Lecun, Nicolas Carion

Unsupervised visual representation learning offers the opportunity to leverage large corpora of unlabeled trajectories to form useful visual representations, which can benefit the training of reinforcement learning (RL) algorithms.

reinforcement-learning, Reinforcement Learning (RL), +2

What Do We Maximize in Self-Supervised Learning?

no code implementations • 20 Jul 2022 • Ravid Shwartz-Ziv, Randall Balestriero, Yann Lecun

In this paper, we examine self-supervised learning methods, particularly VICReg, to provide an information-theoretical understanding of their construction.

Self-Supervised Learning, Transfer Learning

TiCo: Transformation Invariance and Covariance Contrast for Self-Supervised Visual Representation Learning

2 code implementations • 21 Jun 2022 • Jiachen Zhu, Rafael M. Moraes, Serkan Karakulak, Vlad Sobol, Alfredo Canziani, Yann Lecun

Similar to other recent self-supervised learning methods, our method is based on maximizing the agreement among embeddings of different distorted versions of the same image, which pushes the encoder to produce transformation invariant representations.

Representation Learning, Self-Supervised Learning

2,743

Bag of Image Patch Embedding Behind the Success of Self-Supervised Learning

no code implementations • 17 Jun 2022 • Yubei Chen, Adrien Bardes, Zengyi Li, Yann Lecun

Even with 32x32 patch representation, BagSSL achieves 62% top-1 linear probing accuracy on ImageNet.

Representation Learning, Self-Supervised Learning

Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone

1 code implementation • NeurIPS 2022 • Zi-Yi Dou, Aishwarya Kamath, Zhe Gan, Pengchuan Zhang, JianFeng Wang, Linjie Li, Zicheng Liu, Ce Liu, Yann Lecun, Nanyun Peng, Jianfeng Gao, Lijuan Wang

Vision-language (VL) pre-training has recently received considerable attention.

Ranked #1 on Phrase Grounding on Flickr30k Entities Dev

Described Object Detection, Image Captioning, +5

123

Masked Siamese ConvNets

no code implementations • 15 Jun 2022 • Li Jing, Jiachen Zhu, Yann Lecun

Self-supervised learning has shown superior performances over supervised methods on various vision benchmarks.

Image Classification, Inductive Bias, +4

On the duality between contrastive and non-contrastive self-supervised learning

no code implementations • 3 Jun 2022 • Quentin Garrido, Yubei Chen, Adrien Bardes, Laurent Najman, Yann Lecun

Recent approaches in self-supervised learning of image representations can be categorized into different families of methods and, in particular, can be divided into contrastive and non-contrastive approaches.

Self-Supervised Learning

Contrastive and Non-Contrastive Self-Supervised Learning Recover Global and Local Spectral Embedding Methods

no code implementations • 23 May 2022 • Randall Balestriero, Yann Lecun

Self-Supervised Learning (SSL) surmises that inputs and pairwise positive relationships are enough to learn meaningful representations.

Self-Supervised Learning

Pre-Train Your Loss: Easy Bayesian Transfer Learning with Informative Priors

1 code implementation • 20 May 2022 • Ravid Shwartz-Ziv, Micah Goldblum, Hossein Souri, Sanyam Kapoor, Chen Zhu, Yann Lecun, Andrew Gordon Wilson

Deep learning is increasingly moving towards a transfer learning paradigm whereby large foundation models are fine-tuned on downstream tasks, starting from an initialization learned on the source task.

Transfer Learning

107

The Effects of Regularization and Data Augmentation are Class Dependent

no code implementations • 7 Apr 2022 • Randall Balestriero, Leon Bottou, Yann Lecun

The optimal amount of DA or weight decay found from cross-validation leads to disastrous model performance on some classes, e.g., on ImageNet with a ResNet-50, the "barn spider" classification test accuracy falls from 68% to 46% only by introducing random crop DA during training.

Data Augmentation

projUNN: efficient method for training deep networks with unitary matrices

1 code implementation • 10 Mar 2022 • Bobak Kiani, Randall Balestriero, Yann Lecun, Seth Lloyd

In learning with recurrent or very deep feed-forward networks, employing unitary matrices in each layer can be very effective at maintaining long-range stability.

A Data-Augmentation Is Worth A Thousand Samples: Exact Quantification From Analytical Augmented Sample Moments

no code implementations • 16 Feb 2022 • Randall Balestriero, Ishan Misra, Yann Lecun

We show that for a training loss to be stable under DA sampling, the model's saliency map (gradient of the loss with respect to the model's input) must align with the smallest eigenvector of the sample variance under the considered DA augmentation, hinting at a possible explanation on why models tend to shift their focus from edges to textures.

Data Augmentation

Neural Manifold Clustering and Embedding

1 code implementation • 24 Jan 2022 • Zengyi Li, Yubei Chen, Yann Lecun, Friedrich T. Sommer

We argue that achieving manifold clustering with neural networks requires two essential ingredients: a domain-specific constraint that ensures the identification of the manifolds, and a learning algorithm for embedding each manifold to a linear subspace in the feature space.

Clustering, Data Augmentation, +2

Sparse Coding with Multi-Layer Decoders using Variance Regularization

1 code implementation • 16 Dec 2021 • Katrina Evtimova, Yann Lecun

Sparse coding with an $l_1$ penalty and a learned linear dictionary requires regularization of the dictionary to prevent a collapse in the $l_1$ norms of the codes.

Denoising

Learning in High Dimension Always Amounts to Extrapolation

no code implementations • 18 Oct 2021 • Randall Balestriero, Jerome Pesenti, Yann Lecun

The notion of interpolation and extrapolation is fundamental in various fields from deep learning to function approximation.

Vocal Bursts Intensity Prediction

Understanding Dimensional Collapse in Contrastive Self-supervised Learning

1 code implementation • ICLR 2022 • Li Jing, Pascal Vincent, Yann Lecun, Yuandong Tian

It has been shown that non-contrastive methods suffer from a lesser collapse problem of a different nature: dimensional collapse, whereby the embedding vectors end up spanning a lower-dimensional subspace instead of the entire available embedding space.

Contrastive Learning, Learning Theory, +2

Decoupled Contrastive Learning

4 code implementations • 13 Oct 2021 • Chun-Hsiao Yeh, Cheng-Yao Hong, Yen-Chi Hsu, Tyng-Luh Liu, Yubei Chen, Yann Lecun

Further, DCL can be combined with the SOTA contrastive learning method, NNCLR, to achieve 72.3% ImageNet-1K top-1 accuracy with 512 batch size in 400 epochs, which represents a new SOTA in contrastive learning.

Contrastive Learning, Self-Supervised Learning

2,743

Compact and Optimal Deep Learning with Recurrent Parameter Generators

1 code implementation • 15 Jul 2021 • Jiayun Wang, Yubei Chen, Stella X. Yu, Brian Cheung, Yann Lecun

We propose a drastically different approach to compact and optimal deep learning: We decouple the Degrees of freedom (DoF) and the actual number of parameters of a model, optimize a small DoF with predefined random linear constraints for a large model of arbitrary architecture, in one-stage end-to-end learning.

Ranked #97 on Image Classification on ObjectNet (using extra training data)

Image Classification, Model Compression

VICReg: Variance-Invariance-Covariance Regularization for Self-Supervised Learning

6 code implementations • NeurIPS 2021 • Adrien Bardes, Jean Ponce, Yann Lecun

Recent self-supervised methods for image representation learning are based on maximizing the agreement between embedding vectors from different views of the same image.

Ranked #40 on Semi-Supervised Image Classification on ImageNet - 1% labeled data

Representation Learning, Self-Supervised Image Classification, +2

2,743

MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding

3 code implementations • 26 Apr 2021 • Aishwarya Kamath, Mannat Singh, Yann Lecun, Gabriel Synnaeve, Ishan Misra, Nicolas Carion

We also investigate the utility of our model as an object detector on a given label set when fine-tuned in a few-shot setting.

Ranked #1 on Visual Question Answering (VQA) on CLEVR-Humans

Generalized Referring Expression Comprehension, Phrase Grounding, +9

1,290

Transformer visualization via dictionary learning: contextualized embedding as a linear superposition of transformer factors

1 code implementation • NAACL (DeeLIO) 2021 • Zeyu Yun, Yubei Chen, Bruno A Olshausen, Yann Lecun

Transformer networks have revolutionized NLP representation learning since they were introduced.

Dictionary Learning, Representation Learning, +1

Barlow Twins: Self-Supervised Learning via Redundancy Reduction

24 code implementations • 4 Mar 2021 • Jure Zbontar, Li Jing, Ishan Misra, Yann Lecun, Stéphane Deny

This causes the embedding vectors of distorted versions of a sample to be similar, while minimizing the redundancy between the components of these vectors.

Ranked #11 on Image Classification on Places205

General Classification, Object Detection, +3

3,229

MDETR - Modulated Detection for End-to-End Multi-Modal Understanding

1 code implementation • ICCV 2021 • Aishwarya Kamath, Mannat Singh, Yann Lecun, Gabriel Synnaeve, Ishan Misra, Nicolas Carion

We also investigate the utility of our model as an object detector on a given label set when fine-tuned in a few-shot setting.

Ranked #2 on Referring Expression Comprehension on Talk2Car (using extra training data)

Phrase Grounding, Question Answering, +3

938

Neural Potts Model

no code implementations • 1 Jan 2021 • Tom Sercu, Robert Verkuil, Joshua Meier, Brandon Amos, Zeming Lin, Caroline Chen, Jason Liu, Yann Lecun, Alexander Rives

We propose the Neural Potts Model objective as an amortized optimization problem.

Implicit Rank-Minimizing Autoencoder

3 code implementations • NeurIPS 2020 • Li Jing, Jure Zbontar, Yann Lecun

An important component of autoencoders is the method by which the information capacity of the latent representation is minimized or limited.

Image Generation, Representation Learning, +1

Inspirational Adversarial Image Generation

1 code implementation • 17 Jun 2019 • Baptiste Rozière, Morgane Riviere, Olivier Teytaud, Jérémy Rapin, Yann Lecun, Camille Couprie

We design a simple optimization method to find the optimal latent parameters corresponding to the closest generation to any input inspirational image.

Image Generation

1,597

The role of over-parametrization in generalization of neural networks

1 code implementation • ICLR 2019 • Behnam Neyshabur, Zhiyuan Li, Srinadh Bhojanapalli, Yann Lecun, Nathan Srebro

Despite existing work on ensuring generalization of neural networks in terms of scale sensitive complexity measures, such as norms, margin and sharpness, these complexity measures do not offer an explanation of why neural networks generalize better with over-parametrization.

Unsupervised Image Matching and Object Discovery as Optimization

1 code implementation • CVPR 2019 • Huy V. Vo, Francis Bach, Minsu Cho, Kai Han, Yann Lecun, Patrick Perez, Jean Ponce

Learning with complete or partial supervision is powerful but relies on ever-growing human annotation efforts.

Ranked #2 on Single-object colocalization on Object Discovery

Object, Object Discovery, +2

Learning about an exponential amount of conditional distributions

1 code implementation • NeurIPS 2019 • Mohamed Ishmael Belghazi, Maxime Oquab, Yann Lecun, David Lopez-Paz

We introduce the Neural Conditioner (NC), a self-supervised machine able to learn about all the conditional distributions of a random vector $X$.

General Classification

Model-Predictive Policy Learning with Uncertainty Regularization for Driving in Dense Traffic

1 code implementation • ICLR 2019 • Mikael Henaff, Alfredo Canziani, Yann Lecun

Learning a policy using only observational data is challenging because the distribution of states it induces at execution time may differ from the distribution observed during training.

Rolling Shutter Correction

196

A Spectral Regularizer for Unsupervised Disentanglement

no code implementations • 4 Dec 2018 • Aditya Ramesh, Youngduck Choi, Yann Lecun

A generative model with a disentangled representation allows for independent control over different aspects of the output.

Disentanglement

GLoMo: Unsupervised Learning of Transferable Relational Graphs

no code implementations • NeurIPS 2018 • Zhilin Yang, Jake Zhao, Bhuwan Dhingra, Kaiming He, William W. Cohen, Ruslan R. Salakhutdinov, Yann Lecun

We also show that the learned graphs are generic enough to be transferred to different embeddings on which the graphs have not been trained (including GloVe embeddings, ELMo embeddings, and task-specific RNN hidden units), or embedding-free units such as image pixels.

Image Classification, Natural Language Inference, +4

Adversarially-Trained Normalized Noisy-Feature Auto-Encoder for Text Generation

no code implementations • 10 Nov 2018 • Xiang Zhang, Yann Lecun

An ATNNFAE consists of an auto-encoder where the internal code is normalized on the unit sphere and corrupted by additive noise.

Text Generation

Learning with Reflective Likelihoods

no code implementations • 27 Sep 2018 • Adji B. Dieng, Kyunghyun Cho, David M. Blei, Yann Lecun

Furthermore, the reflective likelihood objective prevents posterior collapse when used to train stochastic auto-encoders with amortized inference.

Attribute

Comparing Dynamics: Deep Neural Networks versus Glassy Systems

no code implementations • ICML 2018 • Marco Baity-Jesi, Levent Sagun, Mario Geiger, Stefano Spigler, Gerard Ben Arous, Chiara Cammarota, Yann Lecun, Matthieu Wyart, Giulio Biroli

We analyze numerically the training dynamics of deep neural networks (DNN) by using methods developed in statistical physics of glassy systems.

GLoMo: Unsupervisedly Learned Relational Graphs as Transferable Representations

1 code implementation • 14 Jun 2018 • Zhilin Yang, Jake Zhao, Bhuwan Dhingra, Kaiming He, William W. Cohen, Ruslan Salakhutdinov, Yann Lecun

Image Classification, Natural Language Inference, +4

Backpropagation for Implicit Spectral Densities

1 code implementation • 1 Jun 2018 • Aditya Ramesh, Yann Lecun

We introduce a tool that allows us to do this even when the likelihood is not explicitly set, by instead using the implicit likelihood of the model.

Towards Understanding the Role of Over-Parametrization in Generalization of Neural Networks

2 code implementations • 30 May 2018 • Behnam Neyshabur, Zhiyuan Li, Srinadh Bhojanapalli, Yann Lecun, Nathan Srebro

DeSIGN: Design Inspiration from Generative Networks

1 code implementation • 3 Apr 2018 • Othman Sbai, Mohamed Elhoseiny, Antoine Bordes, Yann Lecun, Camille Couprie

Can an algorithm create original and compelling fashion designs to serve as an inspirational assistant?

Image Generation Retrieval

Paper
Code

Predicting Future Instance Segmentation by Forecasting Convolutional Features

1 code implementation • ECCV 2018 • Pauline Luc, Camille Couprie, Yann Lecun, Jakob Verbeek

We apply the "detection head" of Mask R-CNN on the predicted features to produce the instance segmentation of future frames.

Instance Segmentation Optical Flow Estimation +3

Paper
Code

Byte-Level Recursive Convolutional Auto-Encoder for Text

1 code implementation • ICLR 2018 • Xiang Zhang, Yann Lecun

The proposed model is a multi-stage deep convolutional encoder-decoder framework using residual connections, containing up to 160 parameterized layers.

Text Generation

Paper
Code

Prediction Under Uncertainty with Error Encoding Networks

no code implementations • ICLR 2018 • Mikael Henaff, Junbo Zhao, Yann Lecun

In this work we introduce a new framework for performing temporal predictions in the presence of uncertainty.

Video Prediction

Paper
Add Code

A Closer Look at Spatiotemporal Convolutions for Action Recognition

20 code implementations • CVPR 2018 • Du Tran, Heng Wang, Lorenzo Torresani, Jamie Ray, Yann Lecun, Manohar Paluri

In this paper we discuss several forms of spatiotemporal convolutions for video analysis and study their effects on action recognition.

Ranked #3 on Action Recognition on Sports-1M

Action Classification Action Recognition +1

9,286

Paper
Code

Prediction Under Uncertainty with Error-Encoding Networks

2 code implementations • 14 Nov 2017 • Mikael Henaff, Junbo Zhao, Yann Lecun

In this work we introduce a new framework for performing temporal predictions in the presence of uncertainty.

Video Prediction

Paper
Code

A hierarchical loss and its problems when classifying non-hierarchically

no code implementations • 1 Sep 2017 • Cinna Wu, Mark Tygert, Yann Lecun

We define a metric that, inter alia, can penalize failure to distinguish between a sheepdog and a skyscraper more than failure to distinguish between a sheepdog and a poodle.

General Classification

Paper
Add Code

Which Encoding is the Best for Text Classification in Chinese, English, Japanese and Korean?

3 code implementations • 8 Aug 2017 • Xiang Zhang, Yann Lecun

This article offers an empirical study on the different ways of encoding Chinese, Japanese, Korean (CJK) and English languages for text classification.

General Classification Text Classification

2,909

Paper
Code

Adversarially Regularized Autoencoders

6 code implementations • 13 Jun 2017 • Jake Zhao, Yoon Kim, Kelly Zhang, Alexander M. Rush, Yann Lecun

This adversarially regularized autoencoder (ARAE) allows us to generate natural textual outputs as well as perform manipulations in the latent space to induce change in the output space.

Representation Learning Style Transfer

401

Paper
Code

Model-Based Planning with Discrete and Continuous Actions

1 code implementation • 19 May 2017 • Mikael Henaff, William F. Whitney, Yann Lecun

Action planning using learned and differentiable forward models of the world is a general approach which has a number of desirable properties, including improved sample complexity over model-free RL methods, reuse of learned models across different tasks, and the ability to perform efficient gradient-based optimization in continuous action spaces.

Paper
Code

Predicting Deeper into the Future of Semantic Segmentation

2 code implementations • ICCV 2017 • Pauline Luc, Natalia Neverova, Camille Couprie, Jakob Verbeek, Yann Lecun

The ability to predict and therefore to anticipate the future is an important attribute of intelligence.

Attribute Autonomous Driving +5

Paper
Code

Tunable Efficient Unitary Neural Networks (EUNN) and their application to RNNs

4 code implementations • ICML 2017 • Li Jing, Yichen Shen, Tena Dubček, John Peurifoy, Scott Skirlo, Yann Lecun, Max Tegmark, Marin Soljačić

Using unitary (instead of general) matrices in artificial neural networks (ANNs) is a promising way to solve the gradient explosion/vanishing problem, as well as to enable ANNs to learn long-term correlations in the data.

Permuted-MNIST

Paper
Code

Tracking the World State with Recurrent Entity Networks

5 code implementations • 12 Dec 2016 • Mikael Henaff, Jason Weston, Arthur Szlam, Antoine Bordes, Yann Lecun

The EntNet sets a new state-of-the-art on the bAbI tasks, and is the first method to solve all the tasks in the 10k training examples setting.

Ranked #5 on Procedural Text Understanding on ProPara

Procedural Text Understanding Question Answering

1,755

Paper
Code

Disentangling factors of variation in deep representation using adversarial training

no code implementations • NeurIPS 2016 • Michael F. Mathieu, Junbo Jake Zhao, Aditya Ramesh, Pablo Sprechmann, Yann Lecun

The only available source of supervision during the training process comes from our ability to distinguish among different observations belonging to the same category.

Paper
Add Code

Geometric deep learning: going beyond Euclidean data

no code implementations • 24 Nov 2016 • Michael M. Bronstein, Joan Bruna, Yann Lecun, Arthur Szlam, Pierre Vandergheynst

In many applications, such geometric data are large and complex (in the case of social networks, on the scale of billions), and are natural targets for machine learning techniques.

Paper
Add Code

Eigenvalues of the Hessian in Deep Learning: Singularity and Beyond

no code implementations • 22 Nov 2016 • Levent Sagun, Leon Bottou, Yann Lecun

We look at the eigenvalues of the Hessian of a loss function before and after training.

Paper
Add Code

Disentangling factors of variation in deep representations using adversarial training

3 code implementations • 10 Nov 2016 • Michael Mathieu, Junbo Zhao, Pablo Sprechmann, Aditya Ramesh, Yann Lecun

During training, the only available source of supervision comes from our ability to distinguish among different observations belonging to the same class.

Disentanglement

Paper
Code

Entropy-SGD: Biasing Gradient Descent Into Wide Valleys

2 code implementations • 6 Nov 2016 • Pratik Chaudhari, Anna Choromanska, Stefano Soatto, Yann Lecun, Carlo Baldassi, Christian Borgs, Jennifer Chayes, Levent Sagun, Riccardo Zecchina

This paper proposes a new optimization algorithm called Entropy-SGD for training deep neural networks that is motivated by the local geometry of the energy landscape.

Paper
Code

Energy-based Generative Adversarial Network

3 code implementations • 11 Sep 2016 • Junbo Zhao, Michael Mathieu, Yann Lecun

We introduce the "Energy-based Generative Adversarial Network" model (EBGAN) which views the discriminator as an energy function that attributes low energies to the regions near the data manifold and higher energies to other regions.

Generative Adversarial Network

15,701

Paper
Code

Very Deep Convolutional Networks for Text Classification

24 code implementations • EACL 2017 • Alexis Conneau, Holger Schwenk, Loïc Barrault, Yann Lecun

The dominant approaches for many NLP tasks are recurrent neural networks, in particular LSTMs, and convolutional neural networks.

Ranked #17 on Text Classification on AG News

General Classification Text Classification

504

Paper
Code

What is the Best Feature Learning Procedure in Hierarchical Recognition Architectures?

no code implementations • 5 Jun 2016 • Kevin Jarrett, Koray Kavukcuoglu, Karol Gregor, Yann Lecun

We also introduce a new single phase supervised learning procedure that places an L1 penalty on the output state of each layer of the network.

Object Recognition Unsupervised Pre-training

Paper
Add Code

Recurrent Orthogonal Networks and Long-Memory Tasks

1 code implementation • 22 Feb 2016 • Mikael Henaff, Arthur Szlam, Yann Lecun

Although RNNs have been shown to be powerful tools for processing sequential data, finding architectures or optimization strategies that allow them to model very long term dependencies is still an active area of research.

Paper
Code

Universal halting times in optimization and machine learning

no code implementations • 19 Nov 2015 • Levent Sagun, Thomas Trogdon, Yann Lecun

Given an algorithm, which we take to be both the optimization routine and the form of the random landscape, the fluctuations of the halting time follow a distribution that, after centering and scaling, remains unchanged even when the distribution on the landscape is changed.

BIG-bench Machine Learning

Paper
Add Code

Super-Resolution with Deep Convolutional Sufficient Statistics

1 code implementation • 18 Nov 2015 • Joan Bruna, Pablo Sprechmann, Yann Lecun

Inverse problems in image and audio, and super-resolution in particular, can be seen as high-dimensional structured prediction problems, where the goal is to characterize the conditional distribution of a high-resolution output given its low-resolution corrupted observation.

Bandwidth Extension Image Super-Resolution +1

Paper
Code

Deep multi-scale video prediction beyond mean square error

5 code implementations • 17 Nov 2015 • Michael Mathieu, Camille Couprie, Yann Lecun

Learning to predict future images from a video sequence involves the construction of an internal representation that models the image evolution accurately, and therefore, to some degree, its content and dynamics.

Optical Flow Estimation Video Prediction

729

Paper
Code

Binary embeddings with structured hashed projections

no code implementations • 16 Nov 2015 • Anna Choromanska, Krzysztof Choromanski, Mariusz Bojarski, Tony Jebara, Sanjiv Kumar, Yann Lecun

We prove several theoretical results showing that projections via various structured matrices followed by nonlinear mappings accurately preserve the angular distance between input high-dimensional vectors.

LEMMA

Paper
Add Code

Universum Prescription: Regularization using Unlabeled Data

no code implementations • 11 Nov 2015 • Xiang Zhang, Yann Lecun

This paper shows that simply prescribing "none of the above" labels to unlabeled data has a beneficial regularization effect to supervised learning.

Ranked #158 on Image Classification on CIFAR-10

Image Classification

Paper
Add Code

Stereo Matching by Training a Convolutional Neural Network to Compare Image Patches

2 code implementations • 20 Oct 2015 • Jure Žbontar, Yann Lecun

We approach the problem by learning a similarity measure on small image patches using a convolutional neural network.

Binary Classification Stereo Matching +1

694

Paper
Code

Very Deep Multilingual Convolutional Neural Networks for LVCSR

no code implementations • 29 Sep 2015 • Tom Sercu, Christian Puhrsch, Brian Kingsbury, Yann Lecun

However, CNNs in LVCSR have not kept pace with recent advances in other domains where deeper neural networks provide superior performance.

Ranked #17 on Speech Recognition on Switchboard + Hub500

Speech Recognition

Paper
Add Code

Character-level Convolutional Networks for Text Classification

30 code implementations • NeurIPS 2015 • Xiang Zhang, Junbo Zhao, Yann Lecun

This article offers an empirical exploration on the use of character-level convolutional networks (ConvNets) for text classification.

Ranked #16 on Sentiment Analysis on Yelp Fine-grained classification

General Classification Sentiment Analysis +1

4,299

Paper
Code

Deep Convolutional Networks on Graph-Structured Data

3 code implementations • 16 Jun 2015 • Mikael Henaff, Joan Bruna, Yann Lecun

Deep Learning's recent successes have mostly relied on Convolutional Networks, which exploit fundamental statistical properties of images, sounds and video data: the local stationarity and multi-scale compositional structure, that allows expressing long range interactions in terms of shorter, localized interactions.

General Classification

1,319

Paper
Code

Learning to Linearize Under Uncertainty

no code implementations • NeurIPS 2015 • Ross Goroshin, Michael Mathieu, Yann Lecun

Training deep feature hierarchies to solve supervised learning tasks has achieved state of the art performance on many problems in computer vision.

Paper
Add Code

Stacked What-Where Auto-encoders

2 code implementations • 8 Jun 2015 • Junbo Zhao, Michael Mathieu, Ross Goroshin, Yann Lecun

The objective function includes reconstruction terms that induce the hidden states in the Deconvnet to be similar to those of the Convnet.

Ranked #10 on Semi-Supervised Image Classification on STL-10, 1000 Labels

Semi-Supervised Image Classification

Paper
Code

Unsupervised Feature Learning from Temporal Data

no code implementations • 9 Apr 2015 • Ross Goroshin, Joan Bruna, Jonathan Tompson, David Eigen, Yann Lecun

Current state-of-the-art classification and detection algorithms rely on supervised training.

General Classification Metric Learning

Paper
Add Code

A mathematical motivation for complex-valued convolutional networks

no code implementations • 11 Mar 2015 • Joan Bruna, Soumith Chintala, Yann Lecun, Serkan Piantino, Arthur Szlam, Mark Tygert

Courtesy of the exact correspondence, the remarkably rich and rigorous body of mathematical analysis for wavelets applies directly to (complex-valued) convnets.

Paper
Add Code

Text Understanding from Scratch

3 code implementations • 5 Feb 2015 • Xiang Zhang, Yann Lecun

This article demonstrates that we can apply deep learning to text understanding from character-level inputs all the way up to abstract text concepts, using temporal convolutional networks (ConvNets).

General Classification Sentiment Analysis

847

Paper
Code

Fast Convolutional Nets With fbfft: A GPU Performance Evaluation

2 code implementations • 24 Dec 2014 • Nicolas Vasilache, Jeff Johnson, Michael Mathieu, Soumith Chintala, Serkan Piantino, Yann Lecun

We examine the performance profile of Convolutional Neural Network training on the current generation of NVIDIA Graphics Processing Units.

1,065

Paper
Code

Audio Source Separation with Discriminative Scattering Networks

no code implementations • 22 Dec 2014 • Pablo Sprechmann, Joan Bruna, Yann Lecun

In this report we describe an ongoing line of research for solving single-channel source separation problems.

Audio Source Separation

Paper
Add Code

Deep learning with Elastic Averaging SGD

10 code implementations • NeurIPS 2015 • Sixin Zhang, Anna Choromanska, Yann Lecun

We empirically demonstrate that in the deep learning setting, due to the existence of many local optima, allowing more exploration can lead to improved performance.

Image Classification Stochastic Optimization

645

Paper
Code

Explorations on high dimensional landscapes

no code implementations • 20 Dec 2014 • Levent Sagun, V. Ugur Guney, Gerard Ben Arous, Yann Lecun

Finding minima of a real valued non-convex function over a high dimensional space is a major challenge in science.

Vocal Bursts Intensity Prediction

Paper
Add Code

Unsupervised Learning of Spatiotemporally Coherent Metrics

no code implementations • ICCV 2015 • Ross Goroshin, Joan Bruna, Jonathan Tompson, David Eigen, Yann Lecun

Current state-of-the-art classification and detection algorithms rely on supervised training.

General Classification Metric Learning

Paper
Add Code

The Loss Surfaces of Multilayer Networks

1 code implementation • 30 Nov 2014 • Anna Choromanska, Mikael Henaff, Michael Mathieu, Gérard Ben Arous, Yann Lecun

We show that for large-size decoupled networks the lowest critical values of the random loss function form a layered structure and they are located in a well-defined band lower-bounded by the global minimum.

Paper
Code

Efficient Object Localization Using Convolutional Networks

2 code implementations • CVPR 2015 • Jonathan Tompson, Ross Goroshin, Arjun Jain, Yann Lecun, Christopher Bregler

Recent state-of-the-art performance on human-body pose estimation has been achieved with Deep Convolutional Networks (ConvNets).

Ranked #42 on Pose Estimation on MPII Human Pose

Object Object Localization +2

Paper
Code

Differentially- and non-differentially-private random decision trees

no code implementations • 26 Oct 2014 • Mariusz Bojarski, Anna Choromanska, Krzysztof Choromanski, Yann Lecun

We consider supervised learning with random decision trees, where the tree construction is completely random.

Paper
Add Code

MoDeep: A Deep Learning Framework Using Motion Features for Human Pose Estimation

no code implementations • 28 Sep 2014 • Arjun Jain, Jonathan Tompson, Yann Lecun, Christoph Bregler

In this work, we propose a novel and efficient method for articulated human pose estimation in videos using a convolutional network architecture, which incorporates both color and motion features.

2D Human Pose Estimation Pose Estimation

Paper
Add Code

Computing the Stereo Matching Cost with a Convolutional Neural Network

1 code implementation • CVPR 2015 • Jure Žbontar, Yann Lecun

We present a method for extracting depth information from a rectified image pair.

Stereo Matching Stereo Matching Hand

Paper
Code

Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation

1 code implementation • NeurIPS 2014 • Jonathan Tompson, Arjun Jain, Yann Lecun, Christoph Bregler

This paper proposes a new hybrid architecture that consists of a deep Convolutional Network and a Markov Random Field.

Pose Estimation

Paper
Code

Fast Approximation of Rotations and Hessians matrices

no code implementations • 29 Apr 2014 • Michael Mathieu, Yann Lecun

A new method to represent and approximate rotation matrices is introduced.

Paper
Add Code

Exploiting Linear Structure Within Convolutional Networks for Efficient Evaluation

no code implementations • NeurIPS 2014 • Emily Denton, Wojciech Zaremba, Joan Bruna, Yann Lecun, Rob Fergus

We present techniques for speeding up the test-time evaluation of large convolutional networks, designed for object recognition tasks.

Object Recognition

Paper
Add Code

OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks

4 code implementations • 21 Dec 2013 • Pierre Sermanet, David Eigen, Xiang Zhang, Michael Mathieu, Rob Fergus, Yann Lecun

This integrated framework is the winner of the localization task of the ImageNet Large Scale Visual Recognition Challenge 2013 (ILSVRC2013) and obtained very competitive results for the detection and classification tasks.

General Classification Image Classification +2

2,673

Paper
Code

Spectral Networks and Locally Connected Networks on Graphs

4 code implementations • 21 Dec 2013 • Joan Bruna, Wojciech Zaremba, Arthur Szlam, Yann Lecun

Convolutional Neural Networks are extremely efficient architectures in image and audio recognition tasks, thanks to their ability to exploit the local translational invariance of signal classes over their domain.

Clustering Translation

1,319

Paper
Code

Fast Training of Convolutional Networks through FFTs

no code implementations • 20 Dec 2013 • Michael Mathieu, Mikael Henaff, Yann Lecun

Convolutional networks are one of the most widely employed architectures in computer vision and machine learning.

Paper
Add Code

Understanding Deep Architectures using a Recursive Convolutional Network

no code implementations • 6 Dec 2013 • David Eigen, Jason Rolfe, Rob Fergus, Yann Lecun

A key challenge in designing convolutional network models is sizing them appropriately.

Paper
Add Code

Signal Recovery from Pooling Representations

no code implementations • 16 Nov 2013 • Joan Bruna, Arthur Szlam, Yann Lecun

In this work we compute lower Lipschitz bounds of $\ell_p$ pooling operators for $p=1, 2, \infty$ as well as $\ell_p$ pooling operators preceded by half-rectification layers.

regression