Search Results for author: Dilip Krishnan

Found 40 papers, 24 papers with code

Learning Vision from Models Rivals Learning Vision from Data

1 code implementation 28 Dec 2023 Yonglong Tian, Lijie Fan, KaiFeng Chen, Dina Katabi, Dilip Krishnan, Phillip Isola

We introduce SynCLR, a novel approach for learning visual representations exclusively from synthetic images and synthetic captions, without any real data.

Contrastive Learning Image Captioning +3

Scaling Laws of Synthetic Images for Model Training ... for Now

1 code implementation 7 Dec 2023 Lijie Fan, KaiFeng Chen, Dilip Krishnan, Dina Katabi, Phillip Isola, Yonglong Tian

Our findings also suggest that scaling synthetic data can be particularly effective in scenarios such as: (1) when there is a limited supply of real images for a supervised problem (e.g., fewer than 0.5 million images in ImageNet), (2) when the evaluation dataset diverges significantly from the training data, indicating the out-of-distribution scenario, or (3) when synthetic data is used in conjunction with real images, as demonstrated in the training of CLIP models.

Leveraging Unpaired Data for Vision-Language Generative Models via Cycle Consistency

no code implementations 5 Oct 2023 Tianhong Li, Sangnie Bhardwaj, Yonglong Tian, Han Zhang, Jarred Barber, Dina Katabi, Guillaume Lajoie, Huiwen Chang, Dilip Krishnan

We demonstrate image generation and captioning performance on par with state-of-the-art text-to-image and image-to-text models with orders of magnitude fewer (only 3M) paired image-text data.

Text-to-Image Generation
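
The cycle-consistency mechanism named above lends itself to a compact sketch: round trips through the two generators on unpaired data must return to their starting points. The generator interfaces below (t2i over text embeddings, i2t over images) are assumptions for illustration, not the paper's API.

```python
import torch
import torch.nn.functional as F

def cycle_losses(t2i, i2t, image, text_emb):
    # Hypothetical interfaces: t2i maps a text embedding to an image,
    # i2t maps an image to a text embedding. Neither call needs a paired
    # (image, text) example; each modality supervises its own round trip.
    l_image = F.mse_loss(t2i(i2t(image)), image)        # image -> text -> image
    l_text = F.mse_loss(i2t(t2i(text_emb)), text_emb)   # text -> image -> text
    return l_image + l_text
```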

Substance or Style: What Does Your Image Embedding Know?

no code implementations 10 Jul 2023 Cyrus Rashtchian, Charles Herrmann, Chun-Sung Ferng, Ayan Chakrabarti, Dilip Krishnan, Deqing Sun, Da-Cheng Juan, Andrew Tomkins

We find that image-text models (CLIP and ALIGN) are better at recognizing new examples of style transfer than masking-based models (CAN and MAE).

Style Transfer

Improving CLIP Training with Language Rewrites

1 code implementation NeurIPS 2023 Lijie Fan, Dilip Krishnan, Phillip Isola, Dina Katabi, Yonglong Tian

During training, LaCLIP randomly selects either the original texts or the rewritten versions as text augmentations for each image.

In-Context Learning Sentence
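
The snippet above states the augmentation rule directly, so a sketch is short; the function name is illustrative, and the rewrites are assumed to come from an LLM as the paper's title suggests.

```python
import random

def sample_caption(original, rewrites):
    # LaCLIP-style text augmentation: for each image, randomly use either
    # the original caption or one of its (LLM-generated) rewrites.
    return random.choice([original] + rewrites)
```

The sampled caption then stands in for the fixed caption in an otherwise standard CLIP contrastive batch.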

Steerable Equivariant Representation Learning

no code implementations 22 Feb 2023 Sangnie Bhardwaj, Willie McClinton, Tongzhou Wang, Guillaume Lajoie, Chen Sun, Phillip Isola, Dilip Krishnan

In this paper, we propose a method of learning representations that are instead equivariant to data augmentations.

Image Retrieval object-detection +5
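
As a rough illustration of equivariance (rather than invariance) to augmentations, one can penalize the mismatch between encoding an augmented input and "steering" the encoding of the original input; the steering map below is a hypothetical learned module, not the paper's construction.

```python
import torch.nn.functional as F

def equivariance_loss(encoder, steer, augment, x):
    # Invariance would ask encoder(augment(x)) ~= encoder(x). Equivariance
    # instead asks the augmentation to act predictably in feature space via
    # a (hypothetical) steering map conditioned on that augmentation.
    return F.mse_loss(encoder(augment(x)), steer(encoder(x)))
```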

Muse: Text-To-Image Generation via Masked Generative Transformers

4 code implementations 2 Jan 2023 Huiwen Chang, Han Zhang, Jarred Barber, AJ Maschinot, Jose Lezama, Lu Jiang, Ming-Hsuan Yang, Kevin Murphy, William T. Freeman, Michael Rubinstein, Yuanzhen Li, Dilip Krishnan

Compared to pixel-space diffusion models, such as Imagen and DALL-E 2, Muse is significantly more efficient due to the use of discrete tokens and requiring fewer sampling iterations; compared to autoregressive models, such as Parti, Muse is more efficient due to the use of parallel decoding.

 Ranked #1 on Text-to-Image Generation on MS-COCO (FID metric)

Language Modelling Large Language Model +1
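
A minimal sketch of the parallel-decoding idea credited above for Muse's efficiency, in the style of MaskGIT-like iterative decoding: predict all masked discrete tokens at once, commit only the most confident, and repeat for a few passes. Muse's actual masking schedule and sampler differ, and the model signature here is an assumption.

```python
import torch

@torch.no_grad()
def parallel_decode(model, tokens, mask_id, steps=8):
    # tokens: (L,) sequence of discrete image tokens, some equal to mask_id.
    # Each pass fills a batch of masked positions simultaneously, which is
    # why it needs far fewer forward passes than autoregressive sampling.
    tokens = tokens.clone()
    for step in range(steps):
        masked = tokens == mask_id
        n = int(masked.sum())
        if n == 0:
            break
        logits = model(tokens.unsqueeze(0))[0]   # assumed (1, L) -> (1, L, V)
        conf, pred = logits.softmax(-1).max(-1)  # per-position confidence
        conf[~masked] = -1.0                     # compete only over masked slots
        k = max(1, n // (steps - step))          # reveal a fraction per pass
        commit = conf.topk(k).indices
        tokens[commit] = pred[commit]
    return tokens
```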

A simple, efficient and scalable contrastive masked autoencoder for learning visual representations

1 code implementation 30 Oct 2022 Shlok Mishra, Joshua Robinson, Huiwen Chang, David Jacobs, Aaron Sarna, Aaron Maschinot, Dilip Krishnan

Our framework is a minimal and conceptually clean synthesis of (C) contrastive learning, (A) masked autoencoders, and (N) the noise prediction approach used in diffusion models.

Contrastive Learning Self-Supervised Learning +1
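
A compressed sketch of how the three named ingredients might combine into one objective. The encoder/decoder interfaces are assumptions and per-patch masking details are omitted; this illustrates the synthesis, not the paper's implementation.

```python
import torch
import torch.nn.functional as F

def info_nce(z1, z2, t=0.1):
    # Standard InfoNCE between matched rows of two embedding batches.
    z1, z2 = F.normalize(z1, dim=-1), F.normalize(z2, dim=-1)
    return F.cross_entropy(z1 @ z2.t() / t, torch.arange(len(z1)))

def can_objective(encoder, decoder, x1, x2, sigma=0.1):
    # Assumed interfaces: encoder masks patches of a (noised) view and
    # returns a pooled feature plus decoder state; decoder returns a
    # full-resolution reconstruction and a noise estimate.
    noise1 = sigma * torch.randn_like(x1)
    z1, state1 = encoder(x1 + noise1)
    z2, _ = encoder(x2 + sigma * torch.randn_like(x2))
    recon, noise_hat = decoder(state1)
    return (info_nce(z1, z2)                  # (C)ontrastive across views
            + F.mse_loss(recon, x1)           # (A)utoencoding of the clean view
            + F.mse_loss(noise_hat, noise1))  # (N)oise prediction
```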

Simplified Transfer Learning for Chest Radiography Models Using Less Data

1 code implementation Radiology 2022 Andrew B. Sellergren, Christina Chen, Zaid Nabulsi, Yuanzhen Li, Aaron Maschinot, Aaron Sarna, Jenny Huang, Charles Lau, Sreenivasa Raju Kalidindi, Mozziyar Etemadi, Florencia Garcia-Vicente, David Melnick, Yun Liu, Krish Eswaran, Daniel Tse, Neeral Beladia, Dilip Krishnan, Shravya Shetty

Supervised contrastive learning enabled performance comparable to state-of-the-art deep learning models on multiple clinical tasks using as few as 45 images; it is a promising method for predictive modeling with small data sets and for predicting outcomes in shifting patient populations.

Contrastive Learning Transfer Learning

Object-Aware Cropping for Self-Supervised Learning

1 code implementation 1 Dec 2021 Shlok Mishra, Anshul Shah, Ankan Bansal, Abhyuday Jagannatha, Janit Anjaria, Abhishek Sharma, David Jacobs, Dilip Krishnan

This assumption is mostly satisfied in datasets such as ImageNet where there is a large, centered object, which is highly likely to be present in random crops of the full image.

Data Augmentation Object +3

Pyramid Adversarial Training Improves ViT Performance

1 code implementation CVPR 2022 Charles Herrmann, Kyle Sargent, Lu Jiang, Ramin Zabih, Huiwen Chang, Ce Liu, Dilip Krishnan, Deqing Sun

In this work, we present pyramid adversarial training (PyramidAT), a simple and effective technique to improve ViT's overall performance.

Ranked #9 on Domain Generalization on ImageNet-C (using extra training data)

Adversarial Attack Data Augmentation +2

CSI: Contrastive Data Stratification for Interaction Prediction and its Application to Compound-Protein Interaction Prediction

no code implementations 18 Nov 2021 Apurva Kalia, Dilip Krishnan, Soha Hassoun

Accurately predicting the likelihood of interaction between two objects (compound-protein sequence, user-item, author-paper, etc.) ...

Contrastive Learning

Unsupervised Disentanglement without Autoencoding: Pitfalls and Future Directions

1 code implementation 14 Aug 2021 Andrea Burns, Aaron Sarna, Dilip Krishnan, Aaron Maschinot

Disentangled visual representations have largely been studied with generative models such as Variational AutoEncoders (VAEs).

Contrastive Learning Disentanglement

Understanding Invariance via Feedforward Inversion of Discriminatively Trained Classifiers

no code implementations 15 Mar 2021 Piotr Teterwak, Chiyuan Zhang, Dilip Krishnan, Michael C. Mozer

We use our reconstruction model as a tool for exploring the nature of representations, including: the influence of model architecture and training objectives (specifically robust losses), the forms of invariance that networks achieve, representational differences between correctly and incorrectly classified images, and the effects of manipulating logits and images.

What Makes for Good Views for Contrastive Learning?

1 code implementation NeurIPS 2020 Yonglong Tian, Chen Sun, Ben Poole, Dilip Krishnan, Cordelia Schmid, Phillip Isola

Contrastive learning between multiple views of the data has recently achieved state of the art performance in the field of self-supervised representation learning.

Contrastive Learning Data Augmentation +8

Supervised Contrastive Learning

23 code implementations NeurIPS 2020 Prannay Khosla, Piotr Teterwak, Chen Wang, Aaron Sarna, Yonglong Tian, Phillip Isola, Aaron Maschinot, Ce Liu, Dilip Krishnan

Contrastive learning applied to self-supervised representation learning has seen a resurgence in recent years, leading to state of the art performance in the unsupervised training of deep image models.

Class Incremental Learning Contrastive Learning +4
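
The supervised contrastive (SupCon) loss itself is compact enough to sketch from its widely used formulation: every sample sharing the anchor's label is treated as a positive. Features are assumed L2-normalized; the projection head and two-view batching are omitted.

```python
import torch
import torch.nn.functional as F

def supcon_loss(features, labels, t=0.1):
    # features: (N, D) L2-normalized embeddings; labels: (N,) class ids.
    sim = features @ features.t() / t
    sim = sim - sim.max(dim=1, keepdim=True).values.detach()  # stability
    not_self = 1.0 - torch.eye(len(labels))
    pos = (labels.unsqueeze(0) == labels.unsqueeze(1)).float() * not_self
    log_prob = sim - torch.log((torch.exp(sim) * not_self).sum(1, keepdim=True))
    # Average log-probability over each anchor's positives, then negate.
    return -((pos * log_prob).sum(1) / pos.sum(1).clamp(min=1)).mean()
```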

Rethinking Few-Shot Image Classification: a Good Embedding Is All You Need?

3 code implementations ECCV 2020 Yonglong Tian, Yue Wang, Dilip Krishnan, Joshua B. Tenenbaum, Phillip Isola

The focus of recent meta-learning research has been on the development of learning algorithms that can quickly adapt to test time tasks with limited data and low computational cost.

Few-Shot Image Classification Few-Shot Learning +1

Adversarial Robustness through Local Linearization

no code implementations NeurIPS 2019 Chongli Qin, James Martens, Sven Gowal, Dilip Krishnan, Krishnamurthy Dvijotham, Alhussein Fawzi, Soham De, Robert Stanforth, Pushmeet Kohli

Using this regularizer, we exceed current state of the art and achieve 47% adversarial accuracy for ImageNet with l-infinity adversarial perturbations of radius 4/255 under an untargeted, strong, white-box attack.

Adversarial Defense Adversarial Robustness

Contrastive Multiview Coding

8 code implementations ECCV 2020 Yonglong Tian, Dilip Krishnan, Phillip Isola

We analyze key properties of the approach that make it work, finding that the contrastive loss outperforms a popular alternative based on cross-view prediction, and that the more views we learn from, the better the resulting representation captures underlying scene semantics.

Contrastive Learning Self-Supervised Action Recognition +1
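
A simplified rendering of the multiview contrastive objective: matched samples across any two views are positives, everything else in the batch is a negative, and the loss is accumulated over view pairs (the paper's "core view" and "full graph" pairings differ in which pairs are included).

```python
import itertools
import torch
import torch.nn.functional as F

def multiview_nce(views, t=0.07):
    # views: list of (N, D) embedding batches, one per view, row-aligned.
    views = [F.normalize(z, dim=-1) for z in views]
    labels = torch.arange(len(views[0]))
    loss = sum(F.cross_entropy(za @ zb.t() / t, labels)
               for za, zb in itertools.permutations(views, 2))
    return loss / max(1, len(views) * (len(views) - 1))
```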

A Closed-Form Learned Pooling for Deep Classification Networks

no code implementations 10 Jun 2019 Vighnesh Birodkar, Hossein Mobahi, Dilip Krishnan, Samy Bengio

This operator can learn a strict superset of what can be learned by average pooling or convolutions.

Classification Foveation +2
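
To see how a learned operator can strictly generalize average pooling, consider a pooling layer with learnable spatial weights: initialized uniformly it reproduces average pooling exactly, and training can then move it anywhere in that superset. This only illustrates the claim; it is not the paper's closed-form construction.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class LearnedPool2d(nn.Module):
    def __init__(self, k):
        super().__init__()
        # Uniform init == average pooling; gradients can then specialize it.
        self.w = nn.Parameter(torch.full((1, 1, k, k), 1.0 / (k * k)))
        self.k = k

    def forward(self, x):                       # x: (B, C, H, W), H,W % k == 0
        b, c, h, w = x.shape
        out = F.conv2d(x.reshape(b * c, 1, h, w), self.w, stride=self.k)
        return out.reshape(b, c, h // self.k, w // self.k)
```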

Predicting the Generalization Gap in Deep Networks with Margin Distributions

2 code implementations ICLR 2019 Yiding Jiang, Dilip Krishnan, Hossein Mobahi, Samy Bengio

In this paper, we propose such a measure, and conduct extensive empirical studies on how well it can predict the generalization gap.
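
A toy rendering of the proposed measure's shape: summarize each trained network's margin distribution with a few statistics, then regress those signatures against measured generalization gaps across a pool of networks. The normalized, multi-layer margins used in the paper are omitted here.

```python
import numpy as np

def margin_signature(margins, qs=(0.25, 0.5, 0.75)):
    # margins: per-example classification margins of one trained network.
    return np.quantile(np.asarray(margins), qs)

# e.g., with scikit-learn (hypothetical variable names):
#   gap_model = LinearRegression().fit(
#       [margin_signature(m) for m in per_network_margins], measured_gaps)
```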

Synthesizing Normalized Faces from Facial Identity Features

1 code implementation CVPR 2017 Forrester Cole, David Belanger, Dilip Krishnan, Aaron Sarna, Inbar Mosseri, William T. Freeman

We present a method for synthesizing a frontal, neutral-expression image of a person's face given an input face photograph.

Domain Separation Networks

5 code implementations NeurIPS 2016 Konstantinos Bousmalis, George Trigeorgis, Nathan Silberman, Dilip Krishnan, Dumitru Erhan

However, by focusing only on creating a mapping or shared representation between the two domains, they ignore the individual characteristics of each domain.

Domain Generalization Unsupervised Domain Adaptation
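
The critique above implies the remedy: model a shared space alongside a per-domain private space. A schematic of that split, with losses simplified from the usual domain-separation recipe rather than taken verbatim from the paper:

```python
import torch
import torch.nn.functional as F

def separation_terms(shared_enc, private_enc, decoder, x):
    # One domain's terms: reconstruct from shared + private codes, and push
    # the two codes toward orthogonality so the private code carries the
    # domain-specific factors the shared space should ignore.
    s, p = shared_enc(x), private_enc(x)
    l_recon = F.mse_loss(decoder(torch.cat([s, p], dim=-1)), x)
    l_diff = (F.normalize(s, dim=-1) * F.normalize(p, dim=-1)).sum(-1).pow(2).mean()
    return l_recon + l_diff
```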

Learning Ordinal Relationships for Mid-Level Vision

no code implementations ICCV 2015 Daniel Zoran, Phillip Isola, Dilip Krishnan, William T. Freeman

We demonstrate that this framework works well on two important mid-level vision tasks: intrinsic image decomposition and depth from an RGB image.

Depth Estimation Intrinsic Image Decomposition

Learning visual groups from co-occurrences in space and time

2 code implementations 21 Nov 2015 Phillip Isola, Daniel Zoran, Dilip Krishnan, Edward H. Adelson

We propose a self-supervised framework that learns to group visual entities based on their rate of co-occurrence in space and time.

Binary Classification
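
The "Binary Classification" tag hints at the training signal: predict whether two visual entities co-occur in space or time, with pairs drawn from the same image or moment as positives and random pairs as negatives. The encoder and head below are assumed shapes for illustration, not the paper's architecture.

```python
import torch
import torch.nn as nn

class CooccurrenceScorer(nn.Module):
    def __init__(self, encoder, dim):
        super().__init__()
        self.encoder = encoder               # assumed: entity -> (B, dim)
        self.head = nn.Linear(2 * dim, 1)    # logit: "do these co-occur?"

    def forward(self, a, b):
        return self.head(torch.cat([self.encoder(a), self.encoder(b)], dim=-1))
```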

Reflection Removal Using Ghosting Cues

no code implementations CVPR 2015 YiChang Shih, Dilip Krishnan, Fredo Durand, William T. Freeman

For single-pane windows, ghosting cues arise from shifted reflections on the two surfaces of the glass pane.

Reflection Removal

Shape and Illumination from Shading using the Generic Viewpoint Assumption

no code implementations NeurIPS 2014 Daniel Zoran, Dilip Krishnan, José Bento, Bill Freeman

The Generic Viewpoint Assumption (GVA) states that the position of the viewer or the light in a scene is not special.

Blind Deconvolution with Non-local Sparsity Reweighting

no code implementations 16 Nov 2013 Dilip Krishnan, Joan Bruna, Rob Fergus

Blind deconvolution has made significant progress in the past decade.

Fast Image Deconvolution using Hyper-Laplacian Priors

no code implementations NeurIPS 2009 Dilip Krishnan, Rob Fergus

In this paper we describe a deconvolution approach that is several orders of magnitude faster than existing techniques that use hyper-Laplacian priors.

Deblurring Denoising +2
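
The speed the abstract claims comes from half-quadratic splitting: auxiliary gradient variables get a cheap per-pixel update, and the image update becomes a closed-form solve that is diagonal in the Fourier domain. The sketch below substitutes an alpha=1 soft threshold for the w-step to stay short and assumes periodic boundaries; the paper handles hyper-Laplacian exponents alpha<1 with an analytic or lookup-table w-step.

```python
import numpy as np

def psf2otf(f, shape):
    # Zero-pad filter f, circularly center it at the origin, and FFT it.
    pad = np.zeros(shape)
    fh, fw = f.shape
    pad[:fh, :fw] = f
    pad = np.roll(pad, (-(fh // 2), -(fw // 2)), axis=(0, 1))
    return np.fft.fft2(pad)

def deconv_hq(y, k, lam=2000.0, iters=15):
    # Alternate (i) per-pixel shrinkage of gradient auxiliaries w and
    # (ii) a Fourier-diagonal quadratic solve for the image x, doubling
    # the penalty weight beta each iteration (continuation).
    fx, fy = np.array([[1.0, -1.0]]), np.array([[1.0], [-1.0]])
    K, Fx, Fy = (psf2otf(f, y.shape) for f in (k, fx, fy))
    Ky = np.conj(K) * np.fft.fft2(y)
    grad_energy = np.abs(Fx) ** 2 + np.abs(Fy) ** 2
    x, beta = y.copy(), 1.0
    for _ in range(iters):
        gx = np.real(np.fft.ifft2(Fx * np.fft.fft2(x)))
        gy = np.real(np.fft.ifft2(Fy * np.fft.fft2(x)))
        # w-step: soft threshold (alpha=1 stand-in for the hyper-Laplacian).
        wx = np.sign(gx) * np.maximum(np.abs(gx) - 1.0 / beta, 0.0)
        wy = np.sign(gy) * np.maximum(np.abs(gy) - 1.0 / beta, 0.0)
        # x-step: closed-form quadratic solve in the Fourier domain.
        num = lam * Ky + beta * (np.conj(Fx) * np.fft.fft2(wx)
                                 + np.conj(Fy) * np.fft.fft2(wy))
        den = lam * np.abs(K) ** 2 + beta * grad_energy
        x = np.real(np.fft.ifft2(num / den))
        beta *= 2.0
    return x
```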
