Search Results for author: Joseph Roth

Found 7 papers, 3 papers with code

Modeling Uncertainty with Hedged Instance Embeddings

no code implementations • ICLR 2019 • Seong Joon Oh, Kevin P. Murphy, Jiyan Pan, Joseph Roth, Florian Schroff, Andrew C. Gallagher

Instance embeddings are an efficient and versatile image representation that facilitates applications like recognition, verification, retrieval, and clustering.

Clustering Metric Learning +1

Paper
Add Code

AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection

1 code implementation • 5 Jan 2019 • Joseph Roth, Sourish Chaudhuri, Ondrej Klejch, Radhika Marvin, Andrew Gallagher, Liat Kaver, Sharadh Ramaswamy, Arkadiusz Stopczynski, Cordelia Schmid, Zhonghua Xi, Caroline Pantofaru

The dataset contains temporally labeled face tracks in video, where each face instance is labeled as speaking or not, and whether the speech is audible.

Audio-Visual Active Speaker Detection speaker-diarization +2

Paper
Code

Modeling Uncertainty with Hedged Instance Embedding

1 code implementation • 30 Sep 2018 • Seong Joon Oh, Kevin Murphy, Jiyan Pan, Joseph Roth, Florian Schroff, Andrew Gallagher

Instance embeddings are an efficient and versatile image representation that facilitates applications like recognition, verification, retrieval, and clustering.

Clustering Metric Learning +1

Paper
Code

AVA-Speech: A Densely Labeled Dataset of Speech Activity in Movies

1 code implementation • 2 Aug 2018 • Sourish Chaudhuri, Joseph Roth, Daniel P. W. Ellis, Andrew Gallagher, Liat Kaver, Radhika Marvin, Caroline Pantofaru, Nathan Reale, Loretta Guarino Reid, Kevin Wilson, Zhonghua Xi

Speech activity detection (or endpointing) is an important processing step for applications such as speech recognition, language identification and speaker diarization.

Sound Audio and Speech Processing

Paper
Code

Monocular Video-Based Trailer Coupler Detection Using Multiplexer Convolutional Neural Network

no code implementations • ICCV 2017 • Yousef Atoum, Joseph Roth, Michael Bliss, Wende Zhang, Xiaoming Liu

This paper presents an automated monocular-camera-based computer vision system for autonomous self-backing-up a vehicle towards a trailer, by continuously estimating the 3D trailer coupler position and feeding it to the vehicle control system, until the alignment of the tow hitch with the trailers coupler.

Paper
Add Code

Adaptive 3D Face Reconstruction From Unconstrained Photo Collections

no code implementations • CVPR 2016 • Joseph Roth, Yiying Tong, Xiaoming Liu

Given a collection of "in-the-wild" face images captured under a variety of unknown pose, expression, and illumination conditions, this paper presents a method for reconstructing a 3D face surface model of an individual along with albedo information.

3D Face Reconstruction

Paper
Add Code

Unconstrained 3D Face Reconstruction

no code implementations • CVPR 2015 • Joseph Roth, Yiying Tong, Xiaoming Liu

Second, by leveraging emerging face alignment techniques and our novel normal field-based Laplace editing, a combination of landmark constraints and photometric stereo-based normals drives our surface reconstruction.

3D Face Reconstruction Face Alignment +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.