no code implementations • 15 Aug 2023 • Yuya Yoshikawa, Yutaro Shigeto, Masashi Shimbo, Akikazu Takeuchi
The Meta Video Dataset (MetaVD) provides annotated relations between action classes in major datasets for human action recognition in videos.
1 code implementation • CVPR 2023 • Yutaro Shigeto, Masashi Shimbo, Yuya Yoshikawa, Akikazu Takeuchi
Barlow Twins and VICReg are self-supervised representation learning models that use regularizers to decorrelate features.
1 code implementation • Computer Vision and Image Understanding 2021 • Yuya Yoshikawa, Yutaro Shigeto, Akikazu Takeuchi
To realize this solution, we constructed a meta video dataset, referred to as MetaVD, from existing datasets for human action recognition.
no code implementations • LREC 2020 • Yutaro Shigeto, Yuya Yoshikawa, Jiaqing Lin, Akikazu Takeuchi
Each caption in our dataset describes a video in the form of "who does what and where."
1 code implementation • 12 Apr 2018 • Yuya Yoshikawa, Jiaqing Lin, Akikazu Takeuchi
We introduce STAIR Actions, a new large-scale video dataset for human action recognition.
1 code implementation • ACL 2017 • Yuya Yoshikawa, Yutaro Shigeto, Akikazu Takeuchi
In recent years, the automatic generation of image descriptions (captions), that is, image captioning, has attracted a great deal of attention.