Search Results for author: Josh Mcdermott

Found 9 papers, 4 papers with code

The Sound of Pixels

2 code implementations ECCV 2018 Hang Zhao, Chuang Gan, Andrew Rouditchenko, Carl Vondrick, Josh Mcdermott, Antonio Torralba

We introduce PixelPlayer, a system that, by leveraging large amounts of unlabeled videos, learns to locate image regions which produce sounds and separate the input sounds into a set of components that represents the sound from each pixel.

Metamers of neural networks reveal divergence from human perceptual systems

1 code implementation NeurIPS 2019 Jenelle Feather, Alex Durango, Ray Gonzalez, Josh Mcdermott

Although model metamers from early network layers were recognizable to humans, those from deeper layers were not.

Self-Supervised Audio-Visual Co-Segmentation

no code implementations18 Apr 2019 Andrew Rouditchenko, Hang Zhao, Chuang Gan, Josh Mcdermott, Antonio Torralba

Segmenting objects in images and separating sound sources in audio are challenging tasks, in part because traditional approaches require large amounts of labeled data.

Image Segmentation Segmentation +1

Probing emergent geometry in speech models via replica theory

no code implementations28 May 2019 Suchismita Padhy, Jenelle Feather, Cory Stephenson, Oguz Elibol, Hanlin Tang, Josh Mcdermott, SueYeon Chung

The success of deep neural networks in visual tasks have motivated recent theoretical and empirical work to understand how these networks operate.

speech-recognition Speech Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.