Search Results for author: Ariel Ephrat

Found 11 papers, 6 papers with code

Teaching CLIP to Count to Ten

1 code implementation ICCV 2023 Roni Paiss, Ariel Ephrat, Omer Tov, Shiran Zada, Inbar Mosseri, Michal Irani, Tali Dekel

Our counting loss is deployed over automatically-created counterfactual examples, each consisting of an image and a caption containing an incorrect object count.

counterfactual Image Retrieval +4

SpeedNet: Learning the Speediness in Videos

1 code implementation CVPR 2020 Sagie Benaim, Ariel Ephrat, Oran Lang, Inbar Mosseri, William T. Freeman, Michael Rubinstein, Michal Irani, Tali Dekel

We demonstrate how those learned features can boost the performance of self-supervised action recognition, and can be used for video retrieval.

Binary Classification Retrieval +2

Neural separation of observed and unobserved distributions

1 code implementation ICLR 2019 Tavi Halperin, Ariel Ephrat, Yedid Hoshen

In this work, we introduce a new method---Neural Egg Separation---to tackle the scenario of extracting a signal from an unobserved distribution additively mixed with a signal from an observed distribution.

Speaker Separation

Neural Separation of Observed and Unobserved Distribution

1 code implementation ICML'19 2018 Tavi Halperin, Ariel Ephrat, Yedid Hoshen

In this work, we introduce a new method---Neural Egg Separation---to tackle the scenario of extracting a signal from an unobserved distribution additively mixed with a signal from an observed distribution.

Dynamic Temporal Alignment of Speech to Lips

1 code implementation19 Aug 2018 Tavi Halperin, Ariel Ephrat, Shmuel Peleg

This alignment is based on deep audio-visual features, mapping the lips video and the speech signal to a shared representation.

Constrained Lip-synchronization Video Alignment

Looking to Listen at the Cocktail Party: A Speaker-Independent Audio-Visual Model for Speech Separation

5 code implementations10 Apr 2018 Ariel Ephrat, Inbar Mosseri, Oran Lang, Tali Dekel, Kevin Wilson, Avinatan Hassidim, William T. Freeman, Michael Rubinstein

Solving this task using only audio as input is extremely challenging and does not provide an association of the separated speech signals with speakers in the video.

Speech Separation

Seeing Through Noise: Visually Driven Speaker Separation and Enhancement

no code implementations22 Aug 2017 Aviv Gabbay, Ariel Ephrat, Tavi Halperin, Shmuel Peleg

Isolating the voice of a specific person while filtering out other voices or background noises is challenging when video is shot in noisy environments.

Speaker Separation

Improved Speech Reconstruction from Silent Video

no code implementations1 Aug 2017 Ariel Ephrat, Tavi Halperin, Shmuel Peleg

Speechreading is the task of inferring phonetic information from visually observed articulatory facial movements, and is a notoriously difficult task for humans to perform.

Vid2speech: Speech Reconstruction from Silent Video

no code implementations2 Jan 2017 Ariel Ephrat, Shmuel Peleg

Speechreading is a notoriously difficult task for humans to perform.

Compact CNN for Indexing Egocentric Videos

no code implementations28 Apr 2015 Yair Poleg, Ariel Ephrat, Shmuel Peleg, Chetan Arora

Furthermore, our CNN is able to recognize whether a video is egocentric or not with 99. 2% accuracy, up by 24% from current state-of-the-art.

Activity Recognition Optical Flow Estimation

Cannot find the paper you are looking for? You can Submit a new open access paper.