no code implementations • 14 Mar 2023 • Maorong Wang, Ling Xiao, Toshihiko Yamasaki
Online knowledge distillation (KD) has received increasing attention in recent years.
1 code implementation • 9 Feb 2023 • Dichao Liu, Toshihiko Yamasaki, Yu Wang, Kenji Mase, Jien Kato
Experimental results on the Statefarm Distracted Driver Detection Dataset and AUC Distracted Driver Dataset show that the proposed approach is highly effective for recognizing distracted driving behaviors from photos: (1) the teacher network's accuracy surpasses the previous best accuracy; (2) the student network achieves very high accuracy with only 0. 42M parameters (around 55% of the previous most lightweight model).
no code implementations • 27 Dec 2022 • Ling Xiao, Toshihiko Yamasaki
In branch 2, we first propose a multi-level attention module to extract a more discriminative representation under the guidance of a specific attribute.
no code implementations • 27 Dec 2022 • Ling Xiao, Toshihiko Yamasaki
In this paper, we propose a general color distortion prediction task forcing the baseline to recognize low-level image information to learn more discriminative representation for fashion compatibility prediction.
1 code implementation • 11 Oct 2022 • Jianbo Wang, Huan Yang, Jianlong Fu, Toshihiko Yamasaki, Baining Guo
Such a design usually destroys the spatial information of the input images and fails to transfer fine-grained style patterns into style transfer results.
no code implementations • 6 Sep 2022 • Koki Mukai, Soichiro Kumano, Toshihiko Yamasaki
In order to improve the robustness against out-of-distribution data, we present a frequency-based data augmentation technique that replaces the frequency components with other images of the same class.
no code implementations • 16 Aug 2022 • Koyu Mizutani, Haruki Mitarai, Kakeru Miyazaki, Ryugo Shimamura, Soichiro Kumano, Toshihiko Yamasaki
The ground motion prediction equation is commonly used to predict the seismic intensity distribution.
no code implementations • 25 Jun 2022 • Ling Xiao, Toshihiko Yamasaki
Then, we propose a self-adaptive triplet loss (SATL), where the DS of the outfit is considered.
no code implementations • 29 May 2022 • Soichiro Kumano, Hiroshi Kera, Toshihiko Yamasaki
Adversarial attacks have only focused on changing the predictions of the classifier, but their danger greatly depends on how the class is mistaken.
1 code implementation • 26 May 2022 • Lang Huang, Shan You, Mingkai Zheng, Fei Wang, Chen Qian, Toshihiko Yamasaki
We present an efficient approach for Masked Image Modeling (MIM) with hierarchical Vision Transformers (ViTs), allowing the hierarchical ViTs to discard masked patches and operate only on the visible ones.
1 code implementation • CVPR 2022 • Kaede Shiohara, Toshihiko Yamasaki
In this paper, we present novel synthetic training data called self-blended images (SBIs) to detect deepfakes.
1 code implementation • CVPR 2022 • Lang Huang, Shan You, Mingkai Zheng, Fei Wang, Chen Qian, Toshihiko Yamasaki
In this paper, we present a new approach, Learning Where to Learn (LEWEL), to adaptively aggregate spatial information of features, so that the projected embeddings could be exactly aligned and thus guide the feature learning better.
no code implementations • 1 Nov 2021 • Tetsu Kasanishi, Xueting Wang, Toshihiko Yamasaki
Graph Neural Networks (GNNs) are deep learning models that take graph data as inputs, and they are applied to various tasks such as traffic prediction and molecular property prediction.
no code implementations • 10 Jun 2021 • Pengyu Xie, Xin Xu, Zheng Wang, Toshihiko Yamasaki
NHAC consists of a graph trimming module and a node re-sampling module.
1 code implementation • 5 Jan 2021 • Naoto Inoue, Toshihiko Yamasaki
To overcome this challenge, we present SynShadow, a novel large-scale synthetic shadow/shadow-free/matte image triplets dataset and a pipeline to synthesize it.
no code implementations • 22 Dec 2020 • Jun Ikeda, Hiroyuki Seshime, Xueting Wang, Toshihiko Yamasaki
With expansion of the video advertising market, research to predict the effects of video advertising is getting more attention.
1 code implementation • 7 Dec 2020 • Soichiro Kumano, Hiroshi Kera, Toshihiko Yamasaki
In this paper, we address the question of whether there can be fooling images with no characteristic pattern of natural objects locally or globally.
1 code implementation • 3 Dec 2020 • Hiya Roy, Subhajit Chaudhury, Toshihiko Yamasaki, Tatsuaki Hashimoto
To alleviate these problems, we investigate if it is possible to obtain better performance by training the networks using frequency domain information (Discrete Fourier Transform) along with the spatial domain information.
1 code implementation • 23 Nov 2020 • Zheng Wang, Xin Yuan, Toshihiko Yamasaki, Yutian Lin, Xin Xu, Wenjun Zeng
In essence, current re-ID overemphasizes the importance of retrieval but underemphasizes that of verification, \textit{i. e.}, all returned images are considered as the target.
1 code implementation • 29 Oct 2020 • Li Tao, Xueting Wang, Toshihiko Yamasaki
It is convenient to treat PCL as a standard training strategy and apply it to many other works in self-supervised video feature learning.
Ranked #10 on
Self-supervised Video Retrieval
on UCF101
1 code implementation • 2 Oct 2020 • Nobukatsu Kajiura, Satoshi Kosugi, Xueting Wang, Toshihiko Yamasaki
In this study, we address image retargeting, which is a task that adjusts input images to arbitrary sizes.
2 code implementations • 6 Aug 2020 • Li Tao, Xueting Wang, Toshihiko Yamasaki
With the proposed Inter-Intra Contrastive (IIC) framework, we can train spatio-temporal convolutional networks to learn video representations.
Ranked #10 on
Self-supervised Video Retrieval
on HMDB51
no code implementations • 5 Jul 2020 • Lijie Wang, Xueting Wang, Toshihiko Yamasaki
The spread of social networking services has created an increasing demand for selecting, editing, and generating impressive images.
3 code implementations • 21 Jun 2020 • Li Tao, Xueting Wang, Toshihiko Yamasaki
In this paper, we propose a fast but effective way to extract motion features from videos utilizing residual frames as the input data in 3D ConvNets.
1 code implementation • 14 Mar 2020 • Subhajit Chaudhury, Toshihiko Yamasaki
In this paper, we study the generalization properties of neural networks under input perturbations and show that minimal training data corruption by a few pixel modifications can cause drastic overfitting.
no code implementations • 3 Mar 2020 • Priyanto Hidayatullah, Xueting Wang, Toshihiko Yamasaki, Tati L. E. R. Mengko, Rinaldi Munir, Anggraini Barlian, Eros Sukmawati, Supraptono Supraptono
This study proposes an architecture, called DeepSperm, that solves the aforementioned challenges and is more accurate and faster than state-of-the-art architectures.
3 code implementations • 16 Jan 2020 • Li Tao, Xueting Wang, Toshihiko Yamasaki
Further analysis indicates that better motion features can be extracted using residual frames with 3D ConvNets, and our residual-frame-input path is a good supplement for existing RGB-frame-input models.
no code implementations • 15 Jan 2020 • Sourav Mishra, Subhajit Chaudhury, Hideaki Imaizumi, Toshihiko Yamasaki
This paper aims to evaluate the suitability of current deep learning methods for clinical workflow especially by focusing on dermatology.
no code implementations • 12 Jan 2020 • Yiyan Chen, Li Tao, Xueting Wang, Toshihiko Yamasaki
For each subtask, the manager is trained to set a subgoal only by a task-level binary label, which requires much fewer labels than conventional approaches.
Hierarchical Reinforcement Learning
reinforcement-learning
+2
no code implementations • 17 Dec 2019 • Satoshi Kosugi, Toshihiko Yamasaki
This paper tackles unpaired image enhancement, a task of learning a mapping function which transforms input images into enhanced images in the absence of input-output image pairs.
1 code implementation • 16 Dec 2019 • Ryosuke Furuta, Naoto Inoue, Toshihiko Yamasaki
However, the applications of deep reinforcement learning (RL) for image processing are still limited.
no code implementations • ICCV 2019 • Satoshi Kosugi, Toshihiko Yamasaki, Kiyoharu Aizawa
Weakly supervised object detection (WSOD), where a detector is trained with only image-level annotations, is attracting more and more attention.
no code implementations • 14 Apr 2019 • Hiya Roy, Subhajit Chaudhury, Toshihiko Yamasaki, Danielle DeLatte, Makiko Ohtake, Tatsuaki Hashimoto
Image restoration is a technique that reconstructs a feasible estimate of the original image from the noisy observation.
no code implementations • 26 Mar 2019 • Sourav Mishra, Toshihiko Yamasaki, Hideaki Imaizumi
Our paper introduces an efficient combination of established techniques to improve classifier performance, in terms of accuracy and training time.
1 code implementation • 10 Nov 2018 • Ryosuke Furuta, Naoto Inoue, Toshihiko Yamasaki
This paper tackles a new problem setting: reinforcement learning with pixel-wise rewards (pixelRL) for image processing.
no code implementations • CVPR 2018 • Daiki Ikami, Toshihiko Yamasaki, Kiyoharu Aizawa
We propose a local optimization method, which is widely applicable to graph-based clustering cost functions.
no code implementations • CVPR 2018 • Daiki Ikami, Toshihiko Yamasaki, Kiyoharu Aizawa
M-estimator using iteratively reweighted least squares (IRLS) is one of the best-known methods for robust estimation.
1 code implementation • CVPR 2018 • Daiki Tanaka, Daiki Ikami, Toshihiko Yamasaki, Kiyoharu Aizawa
Deep neural networks (DNNs) trained on large-scale datasets have exhibited significant performance in image classification.
Ranked #36 on
Image Classification
on Clothing1M
3 code implementations • CVPR 2018 • Naoto Inoue, Ryosuke Furuta, Toshihiko Yamasaki, Kiyoharu Aizawa
Can we detect common objects in a variety of image domains without instance-level annotations?
Ranked #4 on
Weakly Supervised Object Detection
on Watercolor2k
(using extra training data)
5 code implementations • 23 Mar 2018 • Toru Ogawa, Atsushi Otsubo, Rei Narita, Yusuke Matsui, Toshihiko Yamasaki, Kiyoharu Aizawa
We annotated an existing image dataset of comics and created the largest annotation dataset, named Manga109-annotations.
no code implementations • 11 Feb 2018 • Sourav Mishra, Toshihiko Yamasaki, Hideaki Imaizumi
This paper introduces a deep-learning based efficient classifier for common dermatological conditions, aimed at people without easy access to skin specialists.
1 code implementation • 12 Sep 2017 • Yusuke Matsui, Keisuke Ogaki, Toshihiko Yamasaki, Kiyoharu Aizawa
Data clustering is a fundamental operation in data analysis.
no code implementations • CVPR 2017 • Daiki Ikami, Toshihiko Yamasaki, Kiyoharu Aizawa
We propose the residual expansion (RE) algorithm: a global (or near-global) optimization method for nonconvex least squares problems.
no code implementations • 21 Apr 2017 • Yusuke Matsui, Toshihiko Yamasaki, Kiyoharu Aizawa
In this paper, we propose a product quantization table (PQTable); a fast search method for product-quantized codes via hash-tables.
no code implementations • CVPR 2016 • Keisuke Midorikawa, Toshihiko Yamasaki, Kiyoharu Aizawa
We propose a model that represents various isotropic reflectance functions by using the principal components of items in a dataset, and formulate the uncalibrated photometric stereo as a regression problem.
no code implementations • ICCV 2015 • Yusuke Matsui, Toshihiko Yamasaki, Kiyoharu Aizawa
We propose the product quantization table (PQTable), a product quantization-based hash table that is fast and requires neither parameter tuning nor training steps.
no code implementations • 15 Oct 2015 • Yusuke Matsui, Kota Ito, Yuji Aramaki, Toshihiko Yamasaki, Kiyoharu Aizawa
From the experiments, we verified that: (1) the retrieval accuracy of the proposed method is higher than those of previous methods; (2) the proposed method can localize an object instance with reasonable runtime and accuracy; and (3) sketch querying is useful for manga search.