Search Results for author: Toru Tamaki

Found 23 papers, 4 papers with code

Multi-model learning by sequential reading of untrimmed videos for action recognition

no code implementations26 Jan 2024 Kodai Kamiya, Toru Tamaki

We propose a new method for learning videos by aggregating multiple models by sequentially extracting video clips from untrimmed video.

Action Recognition Federated Learning

Joint learning of images and videos with a single Vision Transformer

no code implementations21 Aug 2023 Shuki Shimizu, Toru Tamaki

In this study, we propose a method for jointly learning of images and videos using a single model.

Action Recognition

Object-ABN: Learning to Generate Sharp Attention Maps for Action Recognition

no code implementations27 Jul 2022 Tomoya Nitta, Tsubasa Hirakawa, Hironobu Fujiyoshi, Toru Tamaki

Experimental results with UCF101 and SSv2 shows that the generated maps by the proposed method are much clearer qualitatively and quantitatively than those of the original ABN.

Action Recognition Instance Segmentation +2

Performance Evaluation of Action Recognition Models on Low Quality Videos

no code implementations19 Apr 2022 Aoi Otani, Ryota Hashiguchi, Kazuki Omi, Norishige Fukushima, Toru Tamaki

In general, action recognition models are trained on high-quality videos, hence it is not known how the model performance degrades when tested on low-quality videos, and how much the quality of training videos affects the performance.

Action Recognition

Vision Transformer with Cross-attention by Temporal Shift for Efficient Action Recognition

no code implementations1 Apr 2022 Ryota Hashiguchi, Toru Tamaki

We also investigate other variants in which the proposed MSCA is used along the patch dimension of ViT, instead of the head dimension.

Action Recognition

Improving the Reliability of Semantic Segmentation of Medical Images by Uncertainty Modeling with Bayesian Deep Networks and Curriculum Learning

1 code implementation26 Aug 2021 Sora Iwamoto, Bisser Raytchev, Toru Tamaki, Kazufumi Kaneda

In this paper we propose a novel method which leverages the uncertainty measures provided by Bayesian deep networks through curriculum learning so that the uncertainty estimates are fed back to the system to resample the training data more densely in areas where uncertainty is high.

Segmentation Semantic Segmentation

An Entropy Clustering Approach for Assessing Visual Question Difficulty

1 code implementation12 Apr 2020 Kento Terao, Toru Tamaki, Bisser Raytchev, Kazufumi Kaneda, Shun'ichi Satoh

Then we use state-of-the-art methods to determine the accuracy and the entropy of the answer distributions for each cluster.

Clustering Question Answering +1

Rephrasing visual questions by specifying the entropy of the answer distribution

no code implementations10 Apr 2020 Kento Terao, Toru Tamaki, Bisser Raytchev, Kazufumi Kaneda, Shun'ichi Satoh

The proposed model rephrases a source question given with an image so that the rephrased question has the ambiguity (or entropy) specified by users.

Question Answering Visual Question Answering

On-line non-overlapping camera calibration net

no code implementations19 Feb 2020 Zhao Fangda, Toru Tamaki, Takio Kurita, Bisser Raytchev, Kazufumi Kaneda

First, successive images are fed to a PoseNet-based network to obtain ego-motion of cameras between frames.

Camera Calibration Pose Estimation

Semantic segmentation of trajectories with improved agent models for pedestrian behavior analysis

no code implementations12 Dec 2019 Toru Tamaki, Daisuke Ogawa, Bisser Raytchev, Kazufumi Kaneda

In this paper, we propose a method for semantic segmentation of pedestrian trajectories based on pedestrian behavior models, or agents.

Semantic Segmentation Trajectory Modeling

Meaning guided video captioning

1 code implementation12 Dec 2019 Rushi J. Babariya, Toru Tamaki

Current video captioning approaches often suffer from problems of missing objects in the video to be described, while generating captions semantically similar with ground truth sentences.

object-detection Object Detection +1

Improved Activity Forecasting for Generating Trajectories

no code implementations12 Dec 2019 Daisuke Ogawa, Toru Tamaki, Tsubasa Hirakawa, Bisser Raytchev, Kazufumi Kaneda, Ken Yoda

An efficient inverse reinforcement learning for generating trajectories is proposed based of 2D and 3D activity forecasting.

reinforcement-learning Reinforcement Learning (RL)

Biomedical Image Segmentation by Retina-like Sequential Attention Mechanism Using Only A Few Training Images

no code implementations27 Sep 2019 Shohei Hayashi, Bisser Raytchev, Toru Tamaki, Kazufumi Kaneda

In this paper we propose a novel deep learning-based algorithm for biomedical image segmentation which uses a sequential attention mechanism able to shift the focus of attention across the image in a selective way, allowing subareas which are more difficult to classify to be processed at increased resolution.

Ensemble Learning Image Segmentation +2

Survey on Vision-based Path Prediction

no code implementations1 Nov 2018 Tsubasa Hirakawa, Takayoshi Yamashita, Toru Tamaki, Hironobu Fujiyoshi

Because path prediction as a task of computer vision uses video as input, various information used for prediction, such as the environment surrounding the target and the internal state of the target, need to be estimated from the video in addition to predicting paths.

Semantic segmentation of trajectories with agent models

no code implementations27 Feb 2018 Daisuke Ogawa, Toru Tamaki, Bisser Raytchev, Kazufumi Kaneda

Next, using learned behavior models and a hidden Markov model, we segment a trajectory into semantic segments.

Clustering General Classification +1

Development of a Real-time Colorectal Tumor Classification System for Narrow-band Imaging zoom-videoendoscopy

no code implementations15 Dec 2016 Tsubasa Hirakawa, Toru Tamaki, Bisser Raytchev, Kazufumi Kaneda, Tetsushi Koide, Shigeto Yoshida, Hiroshi Mieno, Shinji Tanaka

A computer-aided diagnosis (CAD) system that provides an objective measure to endoscopists during colorectal endoscopic examinations would be of great value.

General Classification

Domain Adaptation with L2 constraints for classifying images from different endoscope systems

no code implementations8 Nov 2016 Toru Tamaki, Shoji Sonoyama, Takio Kurita, Tsubasa Hirakawa, Bisser Raytchev, Kazufumi Kaneda, Tetsushi Koide, Shigeto Yoshida, Hiroshi Mieno, Shinji Tanaka, Kazuaki Chayama

This paper proposes a method for domain adaptation that extends the maximum margin domain transfer (MMDT) proposed by Hoffman et al., by introducing L2 distance constraints between samples of different domains; thus, our method is denoted as MMDTL2.

Domain Adaptation

Computer-Aided Colorectal Tumor Classification in NBI Endoscopy Using CNN Features

no code implementations24 Aug 2016 Toru Tamaki, Shoji Sonoyama, Tsubasa Hirakawa, Bisser Raytchev, Kazufumi Kaneda, Tetsushi Koide, Shigeto Yoshida, Hiroshi Mieno, Shinji Tanaka

In this paper we report results for recognizing colorectal NBI endoscopic images by using features extracted from convolutional neural network (CNN).

General Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.