1 code implementation • Computer Vision and Image Understanding 2021 • Yuya Yoshikawa, Yutaro Shigeto, Akikazu Takeuchi
To realize this solution, we constructed a meta video dataset from the existing datasets for human action recognition, referred to as MetaVD.
1 code implementation • CVPR 2023 • Yutaro Shigeto, Masashi Shimbo, Yuya Yoshikawa, Akikazu Takeuchi
Barlow Twins and VICReg are self-supervised representation learning models that use regularizers to decorrelate features.
no code implementations • 11 Jun 2018 • Yutaro Shigeto, Masashi Shimbo, Yuji Matsumoto
This paper proposes an inexpensive way to learn an effective dissimilarity function to be used for $k$-nearest neighbor ($k$-NN) classification.
1 code implementation • ACL 2017 • Yuya Yoshikawa, Yutaro Shigeto, Akikazu Takeuchi
In recent years, automatic generation of image descriptions (captions), that is, image captioning, has attracted a great deal of attention.
no code implementations • 3 Jul 2015 • Yutaro Shigeto, Ikumi Suzuki, Kazuo Hara, Masashi Shimbo, Yuji Matsumoto
This paper discusses the effect of hubness in zero-shot learning, when ridge regression is used to find a mapping between the example space to the label space.
no code implementations • LREC 2020 • Yutaro Shigeto, Yuya Yoshikawa, Jiaqing Lin, Akikazu Takeuchi
Each caption in our dataset describes a video in the form of "who does what and where."
no code implementations • 15 Aug 2023 • Yuya Yoshikawa, Yutaro Shigeto, Masashi Shimbo, Akikazu Takeuchi
The Meta Video Dataset (MetaVD) provides annotated relations between action classes in major datasets for human action recognition in videos.
1 code implementation • 13 Mar 2024 • Yuta Mukobara, Yutaro Shigeto, Masashi Shimbo
We explore loss functions for fact verification in the FEVER shared task.