Search Results for author: Laura Sevilla-Lara

Found 22 papers, 7 papers with code

One-Shot Open Affordance Learning with Foundation Models

no code implementations · 29 Nov 2023 · Gen Li, Deqing Sun, Laura Sevilla-Lara, Varun Jampani

We introduce One-shot Open Affordance Learning (OOAL), where a model is trained with just one example per base object category, but is expected to identify novel objects and affordances.

Efficient Pre-training for Localized Instruction Generation of Videos

no code implementations · 27 Nov 2023 · Anil Batra, Davide Moltisanti, Laura Sevilla-Lara, Marcus Rohrbach, Frank Keller

Understanding such videos is challenging, involving the precise localization of steps and the generation of textual instructions.

Watt For What: Rethinking Deep Learning's Energy-Performance Relationship

no code implementations · 10 Oct 2023 · Shreyank N Gowda, Xinyue Hao, Gen Li, Laura Sevilla-Lara, Shashank Narayana Gowda

Deep learning models have revolutionized various fields, from image recognition to natural language processing, by achieving unprecedented levels of accuracy.

Telling Stories for Common Sense Zero-Shot Action Recognition

1 code implementation · 29 Sep 2023 · Shreyank N Gowda, Laura Sevilla-Lara

The textual narratives forge connections between seen and unseen classes, overcoming the bottleneck of labeled data that has long impeded advancements in this exciting domain.

Action Recognition · Common Sense Reasoning +5

LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding

no code implementations · CVPR 2023 · Gen Li, Varun Jampani, Deqing Sun, Laura Sevilla-Lara

A key step to acquire this skill is to identify what part of the object affords each action, which is called affordance grounding.

Object

Capturing Temporal Information in a Single Frame: Channel Sampling Strategies for Action Recognition

1 code implementation · 25 Jan 2022 · Kiyoon Kim, Shreyank N Gowda, Oisin Mac Aodha, Laura Sevilla-Lara

We address the problem of capturing temporal information for video classification in 2D networks, without increasing their computational cost.

Action Recognition · Optical Flow Estimation +2
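The channel-sampling idea behind this paper can be sketched as follows: pack grayscale copies of neighbouring frames into the R, G, and B channels of a single image, so an unmodified 2D network sees temporal information at no extra compute. The three-frame window, the averaging-based `grayscale`, and the per-channel assignment below are illustrative assumptions, not the paper's exact sampling strategies.

```python
import numpy as np

def grayscale(frame):
    """Collapse an RGB frame to a single channel by averaging."""
    return frame.mean(axis=-1)

def channel_sample(frames):
    """Pack three neighbouring grayscale frames into the three channels
    of one image, so a standard 2D CNN input carries motion cues."""
    assert len(frames) == 3
    return np.stack([grayscale(f) for f in frames], axis=-1)

# Three dummy 8x8 RGB frames from a short clip
clip = [np.full((8, 8, 3), v, dtype=np.float32) for v in (0.0, 0.5, 1.0)]
img = channel_sample(clip)
print(img.shape)  # (8, 8, 3): same input shape a 2D network already expects
```

The resulting tensor has exactly the shape of a single RGB frame, which is why this kind of sampling adds no computational cost to the 2D backbone.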

A New Split for Evaluating True Zero-Shot Action Recognition

1 code implementation · 27 Jul 2021 · Shreyank N Gowda, Laura Sevilla-Lara, Kiyoon Kim, Frank Keller, Marcus Rohrbach

We benchmark several recent approaches on the proposed True Zero-Shot (TruZe) Split for UCF101 and HMDB51, with zero-shot and generalized zero-shot evaluation.

Few-Shot Action Recognition · Few Shot Action Recognition +2

Adaptive Prototype Learning and Allocation for Few-Shot Segmentation

2 code implementations · CVPR 2021 · Gen Li, Varun Jampani, Laura Sevilla-Lara, Deqing Sun, Jonghyun Kim, Joongkyu Kim

By integrating the SGC and GPA together, we propose the Adaptive Superpixel-guided Network (ASGNet), which is a lightweight model and adapts to object scale and shape variation.

Clustering · Few-Shot Semantic Segmentation +1

CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition

no code implementations · 18 Jan 2021 · Shreyank N Gowda, Laura Sevilla-Lara, Frank Keller, Marcus Rohrbach

The problem can be seen as learning a function which generalizes well to instances of unseen classes without losing discrimination between classes.

Action Recognition · Clustering +4

SMART Frame Selection for Action Recognition

no code implementations · 19 Dec 2020 · Shreyank N Gowda, Marcus Rohrbach, Laura Sevilla-Lara

In this work, however, we focus on the more standard short, trimmed action recognition problem.

Action Recognition

Proceedings of the ICLR Workshop on Computer Vision for Agriculture (CV4A) 2020

no code implementations · 23 Apr 2020 · Yannis Kalantidis, Laura Sevilla-Lara, Ernest Mwebaze, Dina Machuve, Hamed Alemohammad, David Guerena

The workshop was held in conjunction with the International Conference on Learning Representations (ICLR) 2020.

Only Time Can Tell: Discovering Temporal Data for Temporal Modeling

no code implementations · 19 Jul 2019 · Laura Sevilla-Lara, Shengxin Zha, Zhicheng Yan, Vedanuj Goswami, Matt Feiszli, Lorenzo Torresani

However, in current video datasets it has been observed that action classes can often be recognized without any temporal information from a single frame of video.

Benchmarking · Motion Estimation +1

FASTER Recurrent Networks for Efficient Video Classification

no code implementations · 10 Jun 2019 · Linchao Zhu, Laura Sevilla-Lara, Du Tran, Matt Feiszli, Yi Yang, Heng Wang

FASTER aims to leverage the redundancy between neighboring clips and reduce the computational cost by learning to aggregate the predictions from models of different complexities.

Action Classification · Action Recognition +3
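The aggregation idea described in the FASTER abstract can be illustrated with a toy sketch: run an expensive model on a few clips, a cheap model on the redundant neighbouring clips, and combine the clip-level predictions into a video-level one. The stand-in models, the period of 4, and the plain averaging are all assumptions for illustration; the actual framework learns the aggregation rather than averaging.

```python
import numpy as np

def heavy_model(clip):
    # Stand-in for an expensive spatiotemporal network.
    return np.array([0.7, 0.3])

def light_model(clip):
    # Stand-in for a cheap model exploiting redundancy with nearby clips.
    return np.array([0.6, 0.4])

def faster_aggregate(clips, period=4):
    """Run the heavy model on every `period`-th clip and the light model
    on the rest, then combine clip predictions into a video prediction."""
    preds = [heavy_model(c) if i % period == 0 else light_model(c)
             for i, c in enumerate(clips)]
    return np.mean(preds, axis=0)

video = [None] * 8  # 8 placeholder clips
print(faster_aggregate(video))  # [0.625 0.375]
```

With a period of 4 over 8 clips, only 2 clips pay the heavy-model cost, which is where the computational saving comes from.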

On the Integration of Optical Flow and Action Recognition

no code implementations · 22 Dec 2017 · Laura Sevilla-Lara, Yiyi Liao, Fatma Guney, Varun Jampani, Andreas Geiger, Michael J. Black

Here we take a deeper look at the combination of flow and action recognition, and investigate why optical flow is helpful, what makes a flow method good for action recognition, and how we can make it better.

Action Recognition · Optical Flow Estimation +1

Optical Flow in Mostly Rigid Scenes

no code implementations · CVPR 2017 · Jonas Wulff, Laura Sevilla-Lara, Michael J. Black

Existing algorithms typically focus on either recovering motion and structure under the assumption of a purely static world or optical flow for general unconstrained scenes.

Motion Estimation · Optical Flow Estimation
