no code implementations • 28 Oct 2024 • Jan Witowski, Ken Zeng, Joseph Cappadona, Jailan Elayoubi, Elena Diana Chiru, Nancy Chan, Young-Joon Kang, Frederick Howard, Irina Ostrovnaya, Carlos Fernandez-Granda, Freya Schnabel, Ugur Ozerdem, Kangning Liu, Zoe Steinsnyder, Nitya Thakore, Mohammad Sadic, Frank Yeung, Elisa Liu, Theodore Hill, Benjamin Swett, Danielle Rigau, Andrew Clayburn, Valerie Speirs, Marcus Vetter, Lina Sojak, Simone Muenst Soysal, Daniel Baumhoer, Khalil Choucair, Yu Zong, Lina Daoud, Anas Saad, Waleed Abdulsattar, Rafic Beydoun, Jia-Wern Pan, Haslina Makmur, Soo-Hwang Teo, Linda Ma Pak, Victor Angel, Dovile Zilenaite-Petrulaitiene, Arvydas Laurinavicius, Natalie Klar, Brian D. Piening, Carlo Bifulco, Sun-Young Jun, Jae Pak Yi, Su Hyun Lim, Adam Brufsky, Francisco J. Esteva, Lajos Pusztai, Yann Lecun, Krzysztof J. Geras
Treatment selection in breast cancer is guided by molecular subtypes and clinical characteristics.
no code implementations • 20 Aug 2024 • Alex N. Wang, Christopher Hoang, Yuwen Xiong, Yann Lecun, Mengye Ren
Self-supervised learning has driven significant progress in learning from single-subject, iconic images.
no code implementations • 25 Jul 2024 • Vlad Sobal, Mark Ibrahim, Randall Balestriero, Vivien Cabannes, Diane Bouchacourt, Pietro Astolfi, Kyunghyun Cho, Yann Lecun
Based on this observation, we revise the standard contrastive loss to explicitly encode how a sample relates to others.
1 code implementation • 27 Jun 2024 • Colin White, Samuel Dooley, Manley Roberts, Arka Pal, Ben Feuer, Siddhartha Jain, Ravid Shwartz-Ziv, Neel Jain, Khalid Saifullah, Siddartha Naidu, Chinmay Hegde, Yann Lecun, Tom Goldstein, Willie Neiswanger, Micah Goldblum
In this work, we introduce a new benchmark for LLMs designed to be immune to both test set contamination and the pitfalls of LLM judging and human crowdsourcing.
1 code implementation • 24 Jun 2024 • Shengbang Tong, Ellis Brown, Penghao Wu, Sanghyun Woo, Manoj Middepogu, Sai Charitha Akula, Jihan Yang, Shusheng Yang, Adithya Iyer, Xichen Pan, Austin Wang, Rob Fergus, Yann Lecun, Saining Xie
We introduce Cambrian-1, a family of multimodal LLMs (MLLMs) designed with a vision-centric approach.
no code implementations • 17 Jun 2024 • Ravid Shwartz-Ziv, Micah Goldblum, Arpit Bansal, C. Bayan Bruss, Yann Lecun, Andrew Gordon Wilson
Our findings indicate that: (1) standard optimizers find minima where the model can only fit training sets with significantly fewer samples than it has parameters; (2) convolutional networks are more parameter-efficient than MLPs and ViTs, even on randomly labeled data; (3) while stochastic training is thought to have a regularizing effect, SGD actually finds minima that fit more training data than full-batch gradient descent; (4) the difference in capacity to fit correctly labeled and incorrectly labeled samples can be predictive of generalization; (5) ReLU activation functions result in finding minima that fit more data despite being designed to avoid vanishing and exploding gradients in deep architectures.
no code implementations • 13 Jun 2024 • Rylan Schaeffer, Victor Lecomte, Dhruv Bhandarkar Pai, Andres Carranza, Berivan Isik, Alyssa Unell, Mikail Khona, Thomas Yerxa, Yann Lecun, SueYeon Chung, Andrey Gromov, Ravid Shwartz-Ziv, Sanmi Koyejo
We then leverage tools from information theory to show that such embeddings maximize a well-known lower bound on mutual information between views, thereby connecting the geometric perspective of MMCR to the information-theoretic perspective commonly discussed in MVSSL.
no code implementations • 28 May 2024 • Nicklas Hansen, Jyothir S V, Vlad Sobal, Yann Lecun, Xiaolong Wang, Hao Su
Whole-body control for humanoids is challenging due to the high-dimensional nature of the problem, coupled with the inherent instability of a bipedal morphology.
no code implementations • 17 May 2024 • Adrien Basdevant, Camille François, Victor Storchan, Kevin Bankston, Ayah Bdeir, Brian Behlendorf, Merouane Debbah, Sayash Kapoor, Yann Lecun, Mark Surman, Helen King-Turvey, Nathan Lambert, Stefano Maffulli, Nik Marda, Govind Shivkumar, Justine Tunney
Over the past year, there has been a robust debate about the benefits and risks of open sourcing foundation models.
no code implementations • 16 May 2024 • Yuexiang Zhai, Hao Bai, Zipeng Lin, Jiayi Pan, Shengbang Tong, Yifei Zhou, Alane Suhr, Saining Xie, Yann Lecun, Yi Ma, Sergey Levine
Finally, our framework uses these task rewards to fine-tune the entire VLM with RL.
1 code implementation • 8 May 2024 • Ori Press, Ravid Shwartz-Ziv, Yann Lecun, Matthias Bethge
After many steps of optimization, EM makes the model embed test images far away from the embeddings of training images, which results in a degradation of accuracy.
no code implementations • 2 May 2024 • Théo Moutakanni, Piotr Bojanowski, Guillaume Chassagnon, Céline Hudelot, Armand Joulin, Yann Lecun, Matthew Muckley, Maxime Oquab, Marie-Pierre Revel, Maria Vakalopoulou
AI Foundation models are gaining traction in various applications, including medical fields like radiology.
no code implementations • 15 Apr 2024 • Amir Bar, Arya Bakhtiar, Danny Tran, Antonio Loquercio, Jathushan Rajasegaran, Yann Lecun, Amir Globerson, Trevor Darrell
Animals perceive the world to plan their actions and interact with other agents to accomplish complex tasks, demonstrating capabilities that are still unmatched by AI systems.
no code implementations • 1 Mar 2024 • Quentin Garrido, Mahmoud Assran, Nicolas Ballas, Adrien Bardes, Laurent Najman, Yann Lecun
Joint-Embedding Predictive Architecture (JEPA) has emerged as a promising self-supervised approach that learns by leveraging a world model.
no code implementations • 17 Feb 2024 • Randall Balestriero, Yann Lecun
Despite interpretability of the reconstruction and generation, we identify a misalignment between learning by reconstruction, and learning for perception.
1 code implementation • arXiv preprint 2024 • Adrien Bardes, Quentin Garrido, Jean Ponce, Xinlei Chen, Michael Rabbat, Yann Lecun, Mahmoud Assran, Nicolas Ballas
This paper explores feature prediction as a stand-alone objective for unsupervised learning from video and introduces V-JEPA, a collection of vision models trained solely using a feature prediction objective, without the use of pretrained image encoders, text, negative examples, reconstruction, or other sources of supervision.
1 code implementation • 12 Feb 2024 • Xiaoxin He, Yijun Tian, Yifei Sun, Nitesh V. Chawla, Thomas Laurent, Yann Lecun, Xavier Bresson, Bryan Hooi
Given a graph with textual attributes, we enable users to `chat with their graph': that is, to ask questions about the graph using a conversational interface.
no code implementations • 20 Jan 2024 • Randall Balestriero, Yann Lecun
One fruitful formulation of Deep Networks (DNs) enabling their theoretical study and providing practical guidelines to practitioners relies on Piecewise Affine Splines.
1 code implementation • CVPR 2024 • Shengbang Tong, Zhuang Liu, Yuexiang Zhai, Yi Ma, Yann Lecun, Saining Xie
To understand the roots of these errors, we explore the gap between the visual embedding space of CLIP and vision-only self-supervised learning.
no code implementations • 28 Dec 2023 • Jyothir S V, Siddhartha Jalagam, Yann Lecun, Vlad Sobal
The enduring challenge in the field of artificial intelligence has been the control of systems to achieve desired behaviours.
1 code implementation • 21 Nov 2023 • Grégoire Mialon, Clémentine Fourrier, Craig Swift, Thomas Wolf, Yann Lecun, Thomas Scialom
GAIA's philosophy departs from the current trend in AI benchmarks suggesting to target tasks that are ever more difficult for humans.
no code implementations • 6 Oct 2023 • Zeyu Yun, Juexiao Zhang, Bruno Olshausen, Yann Lecun, Yubei Chen
Unsupervised representation learning has seen tremendous progress but is constrained by its reliance on data modality-specific stationarity and topology, a limitation not found in biological intelligence systems.
1 code implementation • 31 Jul 2023 • Amir Bar, Florian Bordes, Assaf Shocher, Mahmoud Assran, Pascal Vincent, Nicolas Ballas, Trevor Darrell, Amir Globerson, Yann Lecun
Masked Image Modeling (MIM) is a promising self-supervised learning approach that enables learning from unlabeled images.
no code implementations • 24 Jul 2023 • Adrien Bardes, Jean Ponce, Yann Lecun
Self-supervised learning of visual representations has been focusing on learning content features, which do not capture object motion or location, and focus on identifying and differentiating objects in images and videos.
1 code implementation • NeurIPS 2023 • Grégoire Mialon, Quentin Garrido, Hannah Lawrence, Danyal Rehman, Yann Lecun, Bobak T. Kiani
Machine learning for differential equations paves the way for computationally efficient alternatives to numerical solvers, with potentially broad impacts in science and engineering.
no code implementations • 23 Jun 2023 • Jiachen Zhu, Katrina Evtimova, Yubei Chen, Ravid Shwartz-Ziv, Yann Lecun
In summary, VCReg offers a universally applicable regularization framework that significantly advances transfer learning and highlights the connection between gradient starvation, neural collapse, and feature transferability.
no code implementations • 5 Jun 2023 • Anna Dawid, Yann Lecun
Current automated systems have crucial limitations that need to be addressed before artificial intelligence can reach human-like levels and bring new technological revolutions.
3 code implementations • 31 May 2023 • Xiaoxin He, Xavier Bresson, Thomas Laurent, Adam Perold, Yann Lecun, Bryan Hooi
With the advent of powerful large language models (LLMs) such as GPT or Llama2, which demonstrate an ability to reason and to utilize general knowledge, there is a growing need for techniques which combine the textual modelling abilities of LLMs with the structural learning capabilities of GNNs.
Ranked #2 on Node Property Prediction on ogbn-arxiv (using extra training data)
1 code implementation • NeurIPS 2023 • Ido Ben-Shaul, Ravid Shwartz-Ziv, Tomer Galanti, Shai Dekel, Yann Lecun
Self-supervised learning (SSL) is a powerful tool in machine learning, but understanding the learned representations and their underlying mechanisms remains a challenge.
no code implementations • 24 Apr 2023 • Randall Balestriero, Mark Ibrahim, Vlad Sobal, Ari Morcos, Shashank Shekhar, Tom Goldstein, Florian Bordes, Adrien Bardes, Gregoire Mialon, Yuandong Tian, Avi Schwarzschild, Andrew Gordon Wilson, Jonas Geiping, Quentin Garrido, Pierre Fernandez, Amir Bar, Hamed Pirsiavash, Yann Lecun, Micah Goldblum
Self-supervised learning, dubbed the dark matter of intelligence, is a promising path to advance machine learning.
no code implementations • 19 Apr 2023 • Ravid Shwartz-Ziv, Yann Lecun
Information theory, and notably the information bottleneck principle, has been pivotal in shaping deep neural networks.
3 code implementations • 8 Apr 2023 • Shengbang Tong, Yubei Chen, Yi Ma, Yann Lecun
Recently, self-supervised learning (SSL) has achieved tremendous success in learning image representation.
no code implementations • ICCV 2023 • Vivien Cabannes, Leon Bottou, Yann Lecun, Randall Balestriero
Third, it provides a proper active learning framework yielding low-cost solutions to annotate datasets, arguably bringing the gap between theory and practice of active learning that is based on simple-to-answer-by-non-experts queries of semantic relationships between inputs.
no code implementations • 1 Mar 2023 • Ravid Shwartz-Ziv, Randall Balestriero, Kenji Kawaguchi, Tim G. J. Rudner, Yann Lecun
Variance-Invariance-Covariance Regularization (VICReg) is a self-supervised learning (SSL) method that has shown promising results on a variety of tasks.
1 code implementation • 15 Feb 2023 • Grégoire Mialon, Roberto Dessì, Maria Lomeli, Christoforos Nalmpantis, Ram Pasunuru, Roberta Raileanu, Baptiste Rozière, Timo Schick, Jane Dwivedi-Yu, Asli Celikyilmaz, Edouard Grave, Yann Lecun, Thomas Scialom
This survey reviews works in which language models (LMs) are augmented with reasoning skills and the ability to use tools.
1 code implementation • 14 Feb 2023 • Quentin Garrido, Laurent Najman, Yann Lecun
We hope that both our introduced dataset and approach will enable learning richer representations without supervision in more complex scenarios.
no code implementations • 6 Feb 2023 • Vivien Cabannes, Bobak T. Kiani, Randall Balestriero, Yann Lecun, Alberto Bietti
Self-supervised learning (SSL) has emerged as a powerful framework to learn representations from raw data without supervision.
1 code implementation • 3 Feb 2023 • Shoaib Ahmed Siddiqui, David Krueger, Yann Lecun, Stéphane Deny
Current state-of-the-art deep networks are all powered by backpropagation.
3 code implementations • CVPR 2023 • Mahmoud Assran, Quentin Duval, Ishan Misra, Piotr Bojanowski, Pascal Vincent, Michael Rabbat, Yann Lecun, Nicolas Ballas
This paper demonstrates an approach for learning highly semantic image representations without relying on hand-crafted data-augmentations.
3 code implementations • 27 Dec 2022 • Xiaoxin He, Bryan Hooi, Thomas Laurent, Adam Perold, Yann Lecun, Xavier Bresson
First, they capture long-range dependency and mitigate the issue of over-squashing as demonstrated on Long Range Graph Benchmark and TreeNeighbourMatch datasets.
Ranked #2 on Graph Regression on Peptides-struct
1 code implementation • 20 Nov 2022 • Vlad Sobal, Jyothir S V, Siddhartha Jalagam, Nicolas Carion, Kyunghyun Cho, Yann Lecun
Many common methods for learning a world model for pixel-based environments use generative architectures trained with pixel-level reconstruction objectives.
1 code implementation • 2 Nov 2022 • Randall Balestriero, Yann Lecun
In this paper we propose the first provable affine constraint enforcement method for DNNs that only requires minimal changes into a given DNN's forward-pass, that is computationally friendly, and that leaves the optimization of the DNN's parameter to be unconstrained, i. e. standard gradient-based method can be employed.
1 code implementation • 30 Oct 2022 • Shengbang Tong, Xili Dai, Yubei Chen, Mingyang Li, Zengyi Li, Brent Yi, Yann Lecun, Yi Ma
This paper proposes an unsupervised method for learning a unified representation that serves both discriminative and generative purposes.
no code implementations • 15 Oct 2022 • Anthony Zador, Sean Escola, Blake Richards, Bence Ölveczky, Yoshua Bengio, Kwabena Boahen, Matthew Botvinick, Dmitri Chklovskii, Anne Churchland, Claudia Clopath, James DiCarlo, Surya Ganguli, Jeff Hawkins, Konrad Koerding, Alexei Koulakov, Yann Lecun, Timothy Lillicrap, Adam Marblestone, Bruno Olshausen, Alexandre Pouget, Cristina Savin, Terrence Sejnowski, Eero Simoncelli, Sara Solla, David Sussillo, Andreas S. Tolias, Doris Tsao
Neuroscience has long been an essential driver of progress in artificial intelligence (AI).
1 code implementation • 9 Oct 2022 • Shraman Pramanick, Li Jing, Sayan Nag, Jiachen Zhu, Hardik Shah, Yann Lecun, Rama Chellappa
Extensive experiments on a wide range of vision- and vision-language downstream tasks demonstrate the effectiveness of VoLTA on fine-grained applications without compromising the coarse-grained downstream performance, often outperforming methods using significantly more caption and box annotations.
no code implementations • 5 Oct 2022 • Quentin Garrido, Randall Balestriero, Laurent Najman, Yann Lecun
Joint-Embedding Self Supervised Learning (JE-SSL) has seen a rapid development, with the emergence of many method variations but only few principled guidelines that would help practitioners to successfully deploy them.
3 code implementations • 4 Oct 2022 • Adrien Bardes, Jean Ponce, Yann Lecun
Most recent self-supervised methods for learning image representations focus on either producing a global feature with invariance properties, or producing a set of local features.
no code implementations • 30 Sep 2022 • Yubei Chen, Zeyu Yun, Yi Ma, Bruno Olshausen, Yann Lecun
Though there remains a small performance gap between our simple constructive model and SOTA methods, the evidence points to this as a promising direction for achieving a principled and white-box approach to unsupervised learning.
Ranked #1 on Unsupervised MNIST on MNIST
Self-Supervised Learning Sparse Representation-based Classification +3
no code implementations • 29 Sep 2022 • Grégoire Mialon, Randall Balestriero, Yann Lecun
Self-Supervised Learning (SSL) methods such as VICReg, Barlow Twins or W-MSE avoid collapse of their joint embedding architectures by constraining or regularizing the covariance matrix of their projector's output.
no code implementations • 29 Sep 2022 • Bobak T. Kiani, Randall Balestriero, Yubei Chen, Seth Lloyd, Yann Lecun
The fundamental goal of self-supervised learning (SSL) is to produce useful representations of data without access to any labels for classifying the data.
1 code implementation • 25 Aug 2022 • Wancong Zhang, Anthony GX-Chen, Vlad Sobal, Yann Lecun, Nicolas Carion
Unsupervised visual representation learning offers the opportunity to leverage large corpora of unlabeled trajectories to form useful visual representations, which can benefit the training of reinforcement learning (RL) algorithms.
no code implementations • 20 Jul 2022 • Ravid Shwartz-Ziv, Randall Balestriero, Yann Lecun
In this paper, we examine self-supervised learning methods, particularly VICReg, to provide an information-theoretical understanding of their construction.
2 code implementations • 21 Jun 2022 • Jiachen Zhu, Rafael M. Moraes, Serkan Karakulak, Vlad Sobol, Alfredo Canziani, Yann Lecun
Similar to other recent self-supervised learning methods, our method is based on maximizing the agreement among embeddings of different distorted versions of the same image, which pushes the encoder to produce transformation invariant representations.
no code implementations • 17 Jun 2022 • Yubei Chen, Adrien Bardes, Zengyi Li, Yann Lecun
Even with 32x32 patch representation, BagSSL achieves 62% top-1 linear probing accuracy on ImageNet.
1 code implementation • NeurIPS 2022 • Zi-Yi Dou, Aishwarya Kamath, Zhe Gan, Pengchuan Zhang, JianFeng Wang, Linjie Li, Zicheng Liu, Ce Liu, Yann Lecun, Nanyun Peng, Jianfeng Gao, Lijuan Wang
Vision-language (VL) pre-training has recently received considerable attention.
Ranked #1 on Phrase Grounding on Flickr30k Entities Dev
no code implementations • 15 Jun 2022 • Li Jing, Jiachen Zhu, Yann Lecun
Self-supervised learning has shown superior performances over supervised methods on various vision benchmarks.
no code implementations • 3 Jun 2022 • Quentin Garrido, Yubei Chen, Adrien Bardes, Laurent Najman, Yann Lecun
Recent approaches in self-supervised learning of image representations can be categorized into different families of methods and, in particular, can be divided into contrastive and non-contrastive approaches.
no code implementations • 23 May 2022 • Randall Balestriero, Yann Lecun
Self-Supervised Learning (SSL) surmises that inputs and pairwise positive relationships are enough to learn meaningful representations.
1 code implementation • 20 May 2022 • Ravid Shwartz-Ziv, Micah Goldblum, Hossein Souri, Sanyam Kapoor, Chen Zhu, Yann Lecun, Andrew Gordon Wilson
Deep learning is increasingly moving towards a transfer learning paradigm whereby large foundation models are fine-tuned on downstream tasks, starting from an initialization learned on the source task.
no code implementations • 7 Apr 2022 • Randall Balestriero, Leon Bottou, Yann Lecun
The optimal amount of DA or weight decay found from cross-validation leads to disastrous model performances on some classes e. g. on Imagenet with a resnet50, the "barn spider" classification test accuracy falls from $68\%$ to $46\%$ only by introducing random crop DA during training.
1 code implementation • 10 Mar 2022 • Bobak Kiani, Randall Balestriero, Yann Lecun, Seth Lloyd
In learning with recurrent or very deep feed-forward networks, employing unitary matrices in each layer can be very effective at maintaining long-range stability.
no code implementations • 16 Feb 2022 • Randall Balestriero, Ishan Misra, Yann Lecun
We show that for a training loss to be stable under DA sampling, the model's saliency map (gradient of the loss with respect to the model's input) must align with the smallest eigenvector of the sample variance under the considered DA augmentation, hinting at a possible explanation on why models tend to shift their focus from edges to textures.
1 code implementation • 24 Jan 2022 • Zengyi Li, Yubei Chen, Yann Lecun, Friedrich T. Sommer
We argue that achieving manifold clustering with neural networks requires two essential ingredients: a domain-specific constraint that ensures the identification of the manifolds, and a learning algorithm for embedding each manifold to a linear subspace in the feature space.
1 code implementation • 16 Dec 2021 • Katrina Evtimova, Yann Lecun
Sparse coding with an $l_1$ penalty and a learned linear dictionary requires regularization of the dictionary to prevent a collapse in the $l_1$ norms of the codes.
no code implementations • 18 Oct 2021 • Randall Balestriero, Jerome Pesenti, Yann Lecun
The notion of interpolation and extrapolation is fundamental in various fields from deep learning to function approximation.
1 code implementation • ICLR 2022 • Li Jing, Pascal Vincent, Yann Lecun, Yuandong Tian
It has been shown that non-contrastive methods suffer from a lesser collapse problem of a different nature: dimensional collapse, whereby the embedding vectors end up spanning a lower-dimensional subspace instead of the entire available embedding space.
4 code implementations • 13 Oct 2021 • Chun-Hsiao Yeh, Cheng-Yao Hong, Yen-Chi Hsu, Tyng-Luh Liu, Yubei Chen, Yann Lecun
Further, DCL can be combined with the SOTA contrastive learning method, NNCLR, to achieve 72. 3% ImageNet-1K top-1 accuracy with 512 batch size in 400 epochs, which represents a new SOTA in contrastive learning.
1 code implementation • 15 Jul 2021 • Jiayun Wang, Yubei Chen, Stella X. Yu, Brian Cheung, Yann Lecun
We propose a drastically different approach to compact and optimal deep learning: We decouple the Degrees of freedom (DoF) and the actual number of parameters of a model, optimize a small DoF with predefined random linear constraints for a large model of arbitrary architecture, in one-stage end-to-end learning.
Ranked #97 on Image Classification on ObjectNet (using extra training data)
6 code implementations • NeurIPS 2021 • Adrien Bardes, Jean Ponce, Yann Lecun
Recent self-supervised methods for image representation learning are based on maximizing the agreement between embedding vectors from different views of the same image.
Representation Learning Self-Supervised Image Classification +2
5 code implementations • 26 Apr 2021 • Aishwarya Kamath, Mannat Singh, Yann Lecun, Gabriel Synnaeve, Ishan Misra, Nicolas Carion
We also investigate the utility of our model as an object detector on a given label set when fine-tuned in a few-shot setting.
Ranked #1 on Visual Question Answering (VQA) on CLEVR-Humans
Generalized Referring Expression Comprehension Phrase Grounding +9
1 code implementation • NAACL (DeeLIO) 2021 • Zeyu Yun, Yubei Chen, Bruno A Olshausen, Yann Lecun
Transformer networks have revolutionized NLP representation learning since they were introduced.
24 code implementations • 4 Mar 2021 • Jure Zbontar, Li Jing, Ishan Misra, Yann Lecun, Stéphane Deny
This causes the embedding vectors of distorted versions of a sample to be similar, while minimizing the redundancy between the components of these vectors.
Ranked #11 on Image Classification on Places205
1 code implementation • ICCV 2021 • Aishwarya Kamath, Mannat Singh, Yann Lecun, Gabriel Synnaeve, Ishan Misra, Nicolas Carion
We also investigate the utility of our model as an object detector on a given label set when fine-tuned in a few-shot setting.
Ranked #2 on Referring Expression Comprehension on Talk2Car (using extra training data)
no code implementations • 1 Jan 2021 • Tom Sercu, Robert Verkuil, Joshua Meier, Brandon Amos, Zeming Lin, Caroline Chen, Jason Liu, Yann Lecun, Alexander Rives
We propose the Neural Potts Model objective as an amortized optimization problem.
3 code implementations • NeurIPS 2020 • Li Jing, Jure Zbontar, Yann Lecun
An important component of autoencoders is the method by which the information capacity of the latent representation is minimized or limited.
1 code implementation • 17 Jun 2019 • Baptiste Rozière, Morgane Riviere, Olivier Teytaud, Jérémy Rapin, Yann Lecun, Camille Couprie
We design a simple optimization method to find the optimal latent parameters corresponding to the closest generation to any input inspirational image.
1 code implementation • ICLR 2019 • Behnam Neyshabur, Zhiyuan Li, Srinadh Bhojanapalli, Yann Lecun, Nathan Srebro
Despite existing work on ensuring generalization of neural networks in terms of scale sensitive complexity measures, such as norms, margin and sharpness, these complexity measures do not offer an explanation of why neural networks generalize better with over-parametrization.
1 code implementation • CVPR 2019 • Huy V. Vo, Francis Bach, Minsu Cho, Kai Han, Yann Lecun, Patrick Perez, Jean Ponce
Learning with complete or partial supervision is powerful but relies on ever-growing human annotation efforts.
Ranked #2 on Single-object colocalization on Object Discovery
1 code implementation • NeurIPS 2019 • Mohamed Ishmael Belghazi, Maxime Oquab, Yann Lecun, David Lopez-Paz
We introduce the Neural Conditioner (NC), a self-supervised machine able to learn about all the conditional distributions of a random vector $X$.
1 code implementation • ICLR 2019 • Mikael Henaff, Alfredo Canziani, Yann Lecun
Learning a policy using only observational data is challenging because the distribution of states it induces at execution time may differ from the distribution observed during training.
no code implementations • 4 Dec 2018 • Aditya Ramesh, Youngduck Choi, Yann Lecun
A generative model with a disentangled representation allows for independent control over different aspects of the output.
no code implementations • NeurIPS 2018 • Zhilin Yang, Jake Zhao, Bhuwan Dhingra, Kaiming He, William W. Cohen, Ruslan R. Salakhutdinov, Yann Lecun
We also show that the learned graphs are generic enough to be transferred to different embeddings on which the graphs have not been trained (including GloVe embeddings, ELMo embeddings, and task-specific RNN hidden units), or embedding-free units such as image pixels.
no code implementations • 10 Nov 2018 • Xiang Zhang, Yann Lecun
An ATNNFAE consists of an auto-encoder where the internal code is normalized on the unit sphere and corrupted by additive noise.
no code implementations • 27 Sep 2018 • Adji B. Dieng, Kyunghyun Cho, David M. Blei, Yann Lecun
Furthermore, the reflective likelihood objective prevents posterior collapse when used to train stochastic auto-encoders with amortized inference.
no code implementations • ICML 2018 • Marco Baity-Jesi, Levent Sagun, Mario Geiger, Stefano Spigler, Gerard Ben Arous, Chiara Cammarota, Yann Lecun, Matthieu Wyart, Giulio Biroli
We analyze numerically the training dynamics of deep neural networks (DNN) by using methods developed in statistical physics of glassy systems.
1 code implementation • 14 Jun 2018 • Zhilin Yang, Jake Zhao, Bhuwan Dhingra, Kaiming He, William W. Cohen, Ruslan Salakhutdinov, Yann Lecun
We also show that the learned graphs are generic enough to be transferred to different embeddings on which the graphs have not been trained (including GloVe embeddings, ELMo embeddings, and task-specific RNN hidden unit), or embedding-free units such as image pixels.
1 code implementation • 1 Jun 2018 • Aditya Ramesh, Yann Lecun
We introduce a tool that allows us to do this even when the likelihood is not explicitly set, by instead using the implicit likelihood of the model.
2 code implementations • 30 May 2018 • Behnam Neyshabur, Zhiyuan Li, Srinadh Bhojanapalli, Yann Lecun, Nathan Srebro
Despite existing work on ensuring generalization of neural networks in terms of scale sensitive complexity measures, such as norms, margin and sharpness, these complexity measures do not offer an explanation of why neural networks generalize better with over-parametrization.
1 code implementation • 3 Apr 2018 • Othman Sbai, Mohamed Elhoseiny, Antoine Bordes, Yann Lecun, Camille Couprie
Can an algorithm create original and compelling fashion designs to serve as an inspirational assistant?
1 code implementation • ECCV 2018 • Pauline Luc, Camille Couprie, Yann Lecun, Jakob Verbeek
We apply the "detection head'" of Mask R-CNN on the predicted features to produce the instance segmentation of future frames.
1 code implementation • ICLR 2018 • Xiang Zhang, Yann Lecun
The proposed model is a multi-stage deep convolutional encoder-decoder framework using residual connections, containing up to 160 parameterized layers.
no code implementations • ICLR 2018 • Mikael Henaff, Junbo Zhao, Yann Lecun
In this work we introduce a new framework for performing temporal predictions in the presence of uncertainty.
21 code implementations • CVPR 2018 • Du Tran, Heng Wang, Lorenzo Torresani, Jamie Ray, Yann Lecun, Manohar Paluri
In this paper we discuss several forms of spatiotemporal convolutions for video analysis and study their effects on action recognition.
Ranked #3 on Action Recognition on Sports-1M
2 code implementations • 14 Nov 2017 • Mikael Henaff, Junbo Zhao, Yann Lecun
In this work we introduce a new framework for performing temporal predictions in the presence of uncertainty.
no code implementations • 1 Sep 2017 • Cinna Wu, Mark Tygert, Yann Lecun
We define a metric that, inter alia, can penalize failure to distinguish between a sheepdog and a skyscraper more than failure to distinguish between a sheepdog and a poodle.
3 code implementations • 8 Aug 2017 • Xiang Zhang, Yann Lecun
This article offers an empirical study on the different ways of encoding Chinese, Japanese, Korean (CJK) and English languages for text classification.
6 code implementations • 13 Jun 2017 • Jake Zhao, Yoon Kim, Kelly Zhang, Alexander M. Rush, Yann Lecun
This adversarially regularized autoencoder (ARAE) allows us to generate natural textual outputs as well as perform manipulations in the latent space to induce change in the output space.
1 code implementation • 19 May 2017 • Mikael Henaff, William F. Whitney, Yann Lecun
Action planning using learned and differentiable forward models of the world is a general approach which has a number of desirable properties, including improved sample complexity over model-free RL methods, reuse of learned models across different tasks, and the ability to perform efficient gradient-based optimization in continuous action spaces.
2 code implementations • ICCV 2017 • Pauline Luc, Natalia Neverova, Camille Couprie, Jakob Verbeek, Yann Lecun
The ability to predict and therefore to anticipate the future is an important attribute of intelligence.
4 code implementations • ICML 2017 • Li Jing, Yichen Shen, Tena Dubček, John Peurifoy, Scott Skirlo, Yann Lecun, Max Tegmark, Marin Soljačić
Using unitary (instead of general) matrices in artificial neural networks (ANNs) is a promising way to solve the gradient explosion/vanishing problem, as well as to enable ANNs to learn long-term correlations in the data.
5 code implementations • 12 Dec 2016 • Mikael Henaff, Jason Weston, Arthur Szlam, Antoine Bordes, Yann Lecun
The EntNet sets a new state-of-the-art on the bAbI tasks, and is the first method to solve all the tasks in the 10k training examples setting.
Ranked #5 on Procedural Text Understanding on ProPara
no code implementations • NeurIPS 2016 • Michael F. Mathieu, Junbo Jake Zhao, Junbo Zhao, Aditya Ramesh, Pablo Sprechmann, Yann Lecun
The only available source of supervision during the training process comes from our ability to distinguish among different observations belonging to the same category.
no code implementations • 24 Nov 2016 • Michael M. Bronstein, Joan Bruna, Yann Lecun, Arthur Szlam, Pierre Vandergheynst
In many applications, such geometric data are large and complex (in the case of social networks, on the scale of billions), and are natural targets for machine learning techniques.
no code implementations • 22 Nov 2016 • Levent Sagun, Leon Bottou, Yann Lecun
We look at the eigenvalues of the Hessian of a loss function before and after training.
3 code implementations • 10 Nov 2016 • Michael Mathieu, Junbo Zhao, Pablo Sprechmann, Aditya Ramesh, Yann Lecun
During training, the only available source of supervision comes from our ability to distinguish among different observations belonging to the same class.
2 code implementations • 6 Nov 2016 • Pratik Chaudhari, Anna Choromanska, Stefano Soatto, Yann Lecun, Carlo Baldassi, Christian Borgs, Jennifer Chayes, Levent Sagun, Riccardo Zecchina
This paper proposes a new optimization algorithm called Entropy-SGD for training deep neural networks that is motivated by the local geometry of the energy landscape.
3 code implementations • 11 Sep 2016 • Junbo Zhao, Michael Mathieu, Yann Lecun
We introduce the "Energy-based Generative Adversarial Network" model (EBGAN) which views the discriminator as an energy function that attributes low energies to the regions near the data manifold and higher energies to other regions.
23 code implementations • EACL 2017 • Alexis Conneau, Holger Schwenk, Loïc Barrault, Yann Lecun
The dominant approach for many NLP tasks are recurrent neural networks, in particular LSTMs, and convolutional neural networks.
Ranked #17 on Text Classification on AG News
no code implementations • 5 Jun 2016 • Kevin Jarrett, Koray Kvukcuoglu, Karol Gregor, Yann Lecun
We also introduce a new single phase supervised learning procedure that places an L1 penalty on the output state of each layer of the network.
1 code implementation • 22 Feb 2016 • Mikael Henaff, Arthur Szlam, Yann Lecun
Although RNNs have been shown to be powerful tools for processing sequential data, finding architectures or optimization strategies that allow them to model very long term dependencies is still an active area of research.
no code implementations • 19 Nov 2015 • Levent Sagun, Thomas Trogdon, Yann Lecun
Given an algorithm, which we take to be both the optimization routine and the form of the random landscape, the fluctuations of the halting time follow a distribution that, after centering and scaling, remains unchanged even when the distribution on the landscape is changed.
1 code implementation • 18 Nov 2015 • Joan Bruna, Pablo Sprechmann, Yann Lecun
Inverse problems in image and audio, and super-resolution in particular, can be seen as high-dimensional structured prediction problems, where the goal is to characterize the conditional distribution of a high-resolution output given its low-resolution corrupted observation.
5 code implementations • 17 Nov 2015 • Michael Mathieu, Camille Couprie, Yann Lecun
Learning to predict future images from a video sequence involves the construction of an internal representation that models the image evolution accurately, and therefore, to some degree, its content and dynamics.
no code implementations • 16 Nov 2015 • Anna Choromanska, Krzysztof Choromanski, Mariusz Bojarski, Tony Jebara, Sanjiv Kumar, Yann Lecun
We prove several theoretical results showing that projections via various structured matrices followed by nonlinear mappings accurately preserve the angular distance between input high-dimensional vectors.
no code implementations • 11 Nov 2015 • Xiang Zhang, Yann Lecun
This paper shows that simply prescribing "none of the above" labels to unlabeled data has a beneficial regularization effect to supervised learning.
Ranked #164 on Image Classification on CIFAR-10
2 code implementations • 20 Oct 2015 • Jure Žbontar, Yann Lecun
We approach the problem by learning a similarity measure on small image patches using a convolutional neural network.
no code implementations • 29 Sep 2015 • Tom Sercu, Christian Puhrsch, Brian Kingsbury, Yann Lecun
However, CNNs in LVCSR have not kept pace with recent advances in other domains where deeper neural networks provide superior performance.
Ranked #17 on Speech Recognition on Switchboard + Hub500
30 code implementations • NeurIPS 2015 • Xiang Zhang, Junbo Zhao, Yann Lecun
This article offers an empirical exploration on the use of character-level convolutional networks (ConvNets) for text classification.
Ranked #16 on Sentiment Analysis on Yelp Fine-grained classification
3 code implementations • 16 Jun 2015 • Mikael Henaff, Joan Bruna, Yann Lecun
Deep Learning's recent successes have mostly relied on Convolutional Networks, which exploit fundamental statistical properties of images, sounds and video data: the local stationarity and multi-scale compositional structure, that allows expressing long range interactions in terms of shorter, localized interactions.
no code implementations • NeurIPS 2015 • Ross Goroshin, Michael Mathieu, Yann Lecun
Training deep feature hierarchies to solve supervised learning tasks has achieved state of the art performance on many problems in computer vision.
2 code implementations • 8 Jun 2015 • Junbo Zhao, Michael Mathieu, Ross Goroshin, Yann Lecun
The objective function includes reconstruction terms that induce the hidden states in the Deconvnet to be similar to those of the Convnet.
no code implementations • 9 Apr 2015 • Ross Goroshin, Joan Bruna, Jonathan Tompson, David Eigen, Yann Lecun
Current state-of-the-art classification and detection algorithms rely on supervised training.
no code implementations • 11 Mar 2015 • Joan Bruna, Soumith Chintala, Yann Lecun, Serkan Piantino, Arthur Szlam, Mark Tygert
Courtesy of the exact correspondence, the remarkably rich and rigorous body of mathematical analysis for wavelets applies directly to (complex-valued) convnets.
3 code implementations • 5 Feb 2015 • Xiang Zhang, Yann Lecun
This article demontrates that we can apply deep learning to text understanding from character-level inputs all the way up to abstract text concepts, using temporal convolutional networks (ConvNets).
2 code implementations • 24 Dec 2014 • Nicolas Vasilache, Jeff Johnson, Michael Mathieu, Soumith Chintala, Serkan Piantino, Yann Lecun
We examine the performance profile of Convolutional Neural Network training on the current generation of NVIDIA Graphics Processing Units.
no code implementations • 22 Dec 2014 • Pablo Sprechmann, Joan Bruna, Yann Lecun
In this report we describe an ongoing line of research for solving single-channel source separation problems.
no code implementations • 20 Dec 2014 • Levent Sagun, V. Ugur Guney, Gerard Ben Arous, Yann Lecun
Finding minima of a real valued non-convex function over a high dimensional space is a major challenge in science.
10 code implementations • NeurIPS 2015 • Sixin Zhang, Anna Choromanska, Yann Lecun
We empirically demonstrate that in the deep learning setting, due to the existence of many local optima, allowing more exploration can lead to the improved performance.
no code implementations • ICCV 2015 • Ross Goroshin, Joan Bruna, Jonathan Tompson, David Eigen, Yann Lecun
Current state-of-the-art classification and detection algorithms rely on supervised training.
1 code implementation • 30 Nov 2014 • Anna Choromanska, Mikael Henaff, Michael Mathieu, Gérard Ben Arous, Yann Lecun
We show that for large-size decoupled networks the lowest critical values of the random loss function form a layered structure and they are located in a well-defined band lower-bounded by the global minimum.
2 code implementations • CVPR 2015 • Jonathan Tompson, Ross Goroshin, Arjun Jain, Yann Lecun, Christopher Bregler
Recent state-of-the-art performance on human-body pose estimation has been achieved with Deep Convolutional Networks (ConvNets).
Ranked #42 on Pose Estimation on MPII Human Pose
no code implementations • 26 Oct 2014 • Mariusz Bojarski, Anna Choromanska, Krzysztof Choromanski, Yann Lecun
We consider supervised learning with random decision trees, where the tree construction is completely random.
no code implementations • 28 Sep 2014 • Arjun Jain, Jonathan Tompson, Yann Lecun, Christoph Bregler
In this work, we propose a novel and efficient method for articulated human pose estimation in videos using a convolutional network architecture, which incorporates both color and motion features.
1 code implementation • CVPR 2015 • Jure Žbontar, Yann Lecun
We present a method for extracting depth information from a rectified image pair.
1 code implementation • NeurIPS 2014 • Jonathan Tompson, Arjun Jain, Yann Lecun, Christoph Bregler
This paper proposes a new hybrid architecture that consists of a deep Convolutional Network and a Markov Random Field.
no code implementations • 29 Apr 2014 • Michael Mathieu, Yann Lecun
A new method to represent and approximate rotation matrices is introduced.
no code implementations • NeurIPS 2014 • Emily Denton, Wojciech Zaremba, Joan Bruna, Yann Lecun, Rob Fergus
We present techniques for speeding up the test-time evaluation of large convolutional networks, designed for object recognition tasks.
4 code implementations • 21 Dec 2013 • Pierre Sermanet, David Eigen, Xiang Zhang, Michael Mathieu, Rob Fergus, Yann Lecun
This integrated framework is the winner of the localization task of the ImageNet Large Scale Visual Recognition Challenge 2013 (ILSVRC2013) and obtained very competitive results for the detection and classifications tasks.
4 code implementations • 21 Dec 2013 • Joan Bruna, Wojciech Zaremba, Arthur Szlam, Yann Lecun
Convolutional Neural Networks are extremely efficient architectures in image and audio recognition tasks, thanks to their ability to exploit the local translational invariance of signal classes over their domain.
no code implementations • 20 Dec 2013 • Michael Mathieu, Mikael Henaff, Yann Lecun
Convolutional networks are one of the most widely employed architectures in computer vision and machine learning.
no code implementations • 6 Dec 2013 • David Eigen, Jason Rolfe, Rob Fergus, Yann Lecun
A key challenge in designing convolutional network models is sizing them appropriately.
no code implementations • 16 Nov 2013 • Joan Bruna, Arthur Szlam, Yann Lecun
In this work we compute lower Lipschitz bounds of $\ell_p$ pooling operators for $p=1, 2, \infty$ as well as $\ell_p$ pooling operators preceded by half-rectification layers.
1 code implementation • ICML'13: Proceedings of the 30th International Conference on International Conference on Machine Learning - Volume 28 2013 • Li Wan, Matthew Zeiler, Sixin Zhang, Yann Lecun, Rob Fergus
When training with Dropout, a randomly selected subset of activations are set to zero within each layer.
Ranked #6 on Image Classification on MNIST
no code implementations • 16 Jan 2013 • Tom Schaul, Yann Lecun
Recent work has established an empirically successful framework for adapting learning rates for stochastic gradient descent (SGD).
no code implementations • CVPR 2013 • Pierre Sermanet, Koray Kavukcuoglu, Soumith Chintala, Yann Lecun
Pedestrian detection is a problem of considerable practical interest.
no code implementations • 6 Jun 2012 • Tom Schaul, Sixin Zhang, Yann Lecun
The performance of stochastic gradient descent (SGD) depends critically on how learning rates are tuned and decreased over time.
2 code implementations • 18 Apr 2012 • Pierre Sermanet, Soumith Chintala, Yann Lecun
We classify digits of real-world house numbers using convolutional neural networks (ConvNets).