Self-Supervised Learning

Contrastive Predictive Coding v2 (CPC v2) is a self-supervised learning approach that builds upon the original CPC with several improvements. These improvements include:

  • Model capacity - The third residual stack of ResNet-101 (originally 23 blocks with 1024-dimensional feature maps and 256-dimensional bottleneck layers) is expanded to 46 blocks with 4096-dimensional feature maps and 512-dimensional bottleneck layers, yielding ResNet-161.

  • Layer Normalization - The authors find that batch normalization harms CPC's downstream performance. They hypothesize this is because batch normalization allows large models to find a trivial solution to CPC: it introduces a dependency between patches (through the batch statistics) that can be exploited to bypass the constraints on the receptive field. They therefore replace batch normalization with layer normalization.

  • Predicting lengths and directions - Patches are predicted using context from both spatial directions, rather than only from the context spatially underneath.

  • Patch-based Augmentation - Utilising "color dropping", which randomly drops two of the three color channels in each patch, as well as random horizontal flips.
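
The batch-normalization issue above can be illustrated numerically. The sketch below (a toy illustration, not the paper's code; the function names and shapes are assumptions) treats each row as one patch's feature vector: layer normalization uses only that row's own statistics, while batch normalization pools statistics across rows, so every patch's output depends on the other patches in the batch - exactly the cross-patch dependency the authors argue a large model can exploit.

```python
import numpy as np

def layer_norm(x: np.ndarray, eps: float = 1e-5) -> np.ndarray:
    """Normalise each patch (row) with its own mean/variance:
    no information flows between patches."""
    mu = x.mean(axis=1, keepdims=True)
    var = x.var(axis=1, keepdims=True)
    return (x - mu) / np.sqrt(var + eps)

def batch_norm(x: np.ndarray, eps: float = 1e-5) -> np.ndarray:
    """Normalise each feature with statistics pooled over the batch:
    every patch's output depends on every other patch."""
    mu = x.mean(axis=0, keepdims=True)
    var = x.var(axis=0, keepdims=True)
    return (x - mu) / np.sqrt(var + eps)

# Four patches, eight features each (arbitrary toy sizes).
x = np.random.default_rng(1).normal(size=(4, 8))
x2 = x.copy()
x2[0] += 10.0  # perturb a single patch
```

Perturbing patch 0 leaves the layer-norm outputs of patches 1-3 unchanged, but shifts their batch-norm outputs through the shared batch statistics.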
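
The patch-based augmentations can be sketched as follows (a minimal illustration with assumed function names and an assumed (H, W, 3) patch layout, not the paper's implementation):

```python
import numpy as np

def color_drop(patch: np.ndarray, rng: np.random.Generator) -> np.ndarray:
    """Randomly drop two of the three colour channels of one patch,
    keeping a single randomly chosen channel."""
    out = np.zeros_like(patch)
    keep = rng.integers(0, 3)          # channel index to keep
    out[..., keep] = patch[..., keep]  # the other two channels stay zero
    return out

def random_horizontal_flip(patch: np.ndarray, rng: np.random.Generator,
                           p: float = 0.5) -> np.ndarray:
    """Flip the patch left-right with probability p."""
    return patch[:, ::-1, :] if rng.random() < p else patch

rng = np.random.default_rng(0)
patch = rng.random((64, 64, 3))  # toy patch, values in [0, 1)
augmented = random_horizontal_flip(color_drop(patch, rng), rng)
```

Because the dropped channels differ from patch to patch, the network cannot rely on low-level colour statistics shared across neighbouring patches.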

Consistent with prior results, this new architecture delivers better downstream performance.

Source: Data-Efficient Image Recognition with Contrastive Predictive Coding
