1 code implementation • ACL 2022 • Santiago Castro, Ruoyao Wang, Pingxuan Huang, Ian Stewart, Oana Ignat, Nan Liu, Jonathan C. Stroud, Rada Mihalcea
We propose fill-in-the-blanks as a video understanding evaluation framework and introduce FIBER -- a novel dataset consisting of 28,000 videos and descriptions in support of this evaluation framework.
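As an illustration of how a fill-in-the-blank video evaluation could be scored, the sketch below masks one word in a video description and checks a model's prediction against a set of acceptable answers. The field names, the "_____" blank marker, and the exact-match metric are illustrative assumptions, not FIBER's actual schema or official metric.

```python
# Minimal sketch of fill-in-the-blank scoring (illustrative only; the data
# layout and exact-match metric are assumptions, not FIBER's actual format).

def normalize(token: str) -> str:
    """Lowercase and strip surrounding whitespace/punctuation for comparison."""
    return token.strip().strip(".,!?").lower()

def blank_filling_accuracy(examples, predict):
    """examples: dicts with a masked 'caption' (containing '_____') and a list
    of acceptable 'answers'; predict: callable (video_id, caption) -> token."""
    correct, total = 0, 0
    for ex in examples:
        pred = predict(ex["video_id"], ex["caption"])
        if normalize(pred) in {normalize(a) for a in ex["answers"]}:
            correct += 1
        total += 1
    return correct / max(total, 1)

# Example usage with a dummy predictor:
examples = [
    {"video_id": "v0001",
     "caption": "A person slices a _____ on a cutting board.",
     "answers": ["tomato", "vegetable"]},
]
print(blank_filling_accuracy(examples, lambda vid, cap: "tomato"))  # 1.0
```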
no code implementations • 29 Jul 2020 • Jonathan C. Stroud, Zhichao Lu, Chen Sun, Jia Deng, Rahul Sukthankar, Cordelia Schmid, David A. Ross
Based on this observation, we propose to use text as a supervisory signal for learning video representations.
no code implementations • 4 Dec 2019 • Jonathan C. Stroud, Ryan McCaffrey, Rada Mihalcea, Jia Deng, Olga Russakovsky
Temporal grounding entails establishing a correspondence between natural language event descriptions and their visual depictions.
1 code implementation • 19 Dec 2018 • Jonathan C. Stroud, David A. Ross, Chen Sun, Jia Deng, Rahul Sukthankar
State-of-the-art methods for video action recognition commonly use an ensemble of two networks: the spatial stream, which takes RGB frames as input, and the temporal stream, which takes optical flow as input.
Ranked #11 on Action Recognition on AVA v2.1
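For context on the two-stream setup described in this entry, below is a minimal sketch of generic late fusion between a spatial (RGB) stream and a temporal (optical-flow) stream. The backbone choice, softmax averaging, and weighting parameter are assumptions for illustration, not the method proposed in this paper.

```python
import torch
import torch.nn as nn

class TwoStreamEnsemble(nn.Module):
    """Generic late fusion of a spatial (RGB) and a temporal (flow) stream."""
    def __init__(self, spatial_net: nn.Module, temporal_net: nn.Module, alpha: float = 0.5):
        super().__init__()
        self.spatial_net = spatial_net    # consumes RGB frames
        self.temporal_net = temporal_net  # consumes stacked optical-flow fields
        self.alpha = alpha                # weight on the spatial stream

    def forward(self, rgb: torch.Tensor, flow: torch.Tensor) -> torch.Tensor:
        # Average class probabilities from the two streams (late fusion).
        p_spatial = self.spatial_net(rgb).softmax(dim=-1)
        p_temporal = self.temporal_net(flow).softmax(dim=-1)
        return self.alpha * p_spatial + (1.0 - self.alpha) * p_temporal

# Dummy usage with toy "backbones" (flattened inputs, 10 action classes):
spatial = nn.Sequential(nn.Flatten(), nn.LazyLinear(10))
temporal = nn.Sequential(nn.Flatten(), nn.LazyLinear(10))
model = TwoStreamEnsemble(spatial, temporal)
probs = model(torch.randn(2, 3, 224, 224), torch.randn(2, 10, 224, 224))
```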
no code implementations • CVPR 2017 • Zehuan Yuan, Jonathan C. Stroud, Tong Lu, Jia Deng
We pose action localization as a structured prediction over arbitrary-length temporal windows, where each window is scored as the sum of frame-wise classification scores.
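To make the window-scoring idea concrete, the sketch below scores every temporal window of arbitrary length as the sum of its frame-wise classification scores (computed with prefix sums) and returns the best window for a single action class. This brute-force maximization is an illustrative stand-in, not the paper's structured-prediction inference procedure.

```python
import numpy as np

def best_window(frame_scores: np.ndarray):
    """frame_scores: (T,) per-frame scores for one action class.
    Returns (start, end, score) of the highest-scoring window [start, end)."""
    T = len(frame_scores)
    prefix = np.concatenate([[0.0], np.cumsum(frame_scores)])  # prefix[i] = sum of first i frames
    best = (0, 1, float(frame_scores[0]))
    for start in range(T):
        for end in range(start + 1, T + 1):
            score = prefix[end] - prefix[start]  # sum of scores in [start, end)
            if score > best[2]:
                best = (start, end, score)
    return best

# Example: frames 2-4 carry positive evidence for the action.
scores = np.array([-0.5, -0.2, 1.0, 0.8, 0.6, -0.3])
print(best_window(scores))  # (2, 5, 2.4)
```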