1 code implementation • 17 Jun 2022 • Zelong Zeng, Fan Yang, Zheng Wang, Shin'ichi Satoh
Most deep metric learning (DML) methods employ a strategy that forces all positive samples to be close in the embedding space while keeping them away from negative ones.
1 code implementation • 10 Jun 2022 • Liang Liao, WenYi Chen, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh
Specifically, based on the two discoveries of local spatial similarity and adjacent temporal correspondence of the sequential image data, we propose a novel Target-Domain driven pseudo label Diffusion (TDo-Dif) scheme.
no code implementations • 22 May 2022 • Zelong Zeng, Zheng Wang, Fan Yang, Shin'ichi Satoh
The large variation of viewpoint and irrelevant content around the target always hinder accurate image retrieval and its subsequent tasks.
1 code implementation • CVPR 2022 • Zhixiang Wang, Xiang Ji, Jia-Bin Huang, Shin'ichi Satoh, Xiao Zhou, Yinqiang Zheng
In this paper, we investigate using rolling shutter with a global reset feature (RSGR) to restore clean global shutter (GS) videos.
no code implementations • CVPR 2022 • Mayu Otani, Riku Togashi, Yuta Nakashima, Esa Rahtu, Janne Heikkilä, Shin'ichi Satoh
OC-cost computes the cost of correcting detections to ground truths as a measure of accuracy.
1 code implementation • 29 Oct 2021 • Nobukatsu Kajiura, Hong Liu, Shin'ichi Satoh
This framework consists of three key components, i. e., a pseudo-edge generator, a pseudo-map generator, and an uncertainty-aware refinement module.
no code implementations • 11 May 2021 • Riku Togashi, Masahiro Kato, Mayu Otani, Tetsuya Sakai, Shin'ichi Satoh
However, such methods have two main drawbacks particularly in large-scale applications; (1) the pairwise approach is severely inefficient due to the quadratic computational cost; and (2) even recent model-based samplers (e. g. IRGAN) cannot achieve practical efficiency due to the training of an extra model.
no code implementations • 19 Jan 2021 • Riku Togashi, Masahiro Kato, Mayu Otani, Shin'ichi Satoh
Learning from implicit user feedback is challenging as we can only observe positive samples but never access negative ones.
no code implementations • CVPR 2021 • Liang Liao, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh
In this paper, we introduce coherence priors between the semantics and textures which make it possible to concentrate on completing separate textures in a semantic-wise manner.
2 code implementations • 10 Nov 2020 • Riku Togashi, Mayu Otani, Shin'ichi Satoh
Solving cold-start problems is indispensable to provide meaningful recommendation results for new users and items.
no code implementations • 12 Aug 2020 • Yuting Liu, Zheng Wang, Miaojing Shi, Shin'ichi Satoh, Qijun Zhao, Hongyu Yang
We formulate the mutual transformations between the outputs of regression- and detection-based models as two scene-agnostic transformers which enable knowledge distillation between the two models.
no code implementations • ECCV 2020 • Liang Liao, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh
Completing a corrupted image with correct structures and reasonable textures for a mixed scene remains an elusive challenge.
no code implementations • 14 Jun 2019 • Changhee Han, Leonardo Rundo, Kohei Murao, Zoltán Ádám Milacski, Kazuki Umemoto, Evis Sala, Hideki Nakayama, Shin'ichi Satoh
Unsupervised learning can discover various unseen diseases, relying on large-scale unannotated medical images of healthy subjects.
no code implementations • 24 May 2019 • Zheng Wang, Zhixiang Wang, Yinqiang Zheng, Yang Wu, Wen-Jun Zeng, Shin'ichi Satoh
An efficient and effective person re-identification (ReID) system relieves the users from painful and boring video watching and accelerates the process of video analysis.
no code implementations • 13 May 2019 • Ziling Huang, Zheng Wang, Chung-Chi Tsai, Shin'ichi Satoh, Chia-Wen Lin
To gain the superiority of deep learning models, we treat a group as multiple persons and transfer the domain of a labeled ReID dataset to a G-ReID target dataset style to learn single representations.
no code implementations • 11 May 2019 • Zelong Zeng, Zhixiang Wang, Zheng Wang, Yinqiang Zheng, Yung-Yu Chuang, Shin'ichi Satoh
To demonstrate the illumination issue and to evaluate our model, we construct two large-scale simulated datasets with a wide range of illumination variations.
no code implementations • 29 Mar 2019 • Changhee Han, Kohei Murao, Shin'ichi Satoh, Hideki Nakayama
Convolutional Neural Network (CNN)-based accurate prediction typically requires large-scale annotated training data.
no code implementations • 26 Feb 2019 • Changhee Han, Kohei Murao, Tomoyuki Noguchi, Yusuke Kawata, Fumiya Uchiyama, Leonardo Rundo, Hideki Nakayama, Shin'ichi Satoh
Accurate Computer-Assisted Diagnosis, associated with proper data wrangling, can alleviate the risk of overlooking the diagnosis in a clinical environment.
2 code implementations • 27 Nov 2018 • Fan Yang, Ryota Hinami, Yusuke Matsui, Steven Ly, Shin'ichi Satoh
Diffusion is commonly used as a ranking or re-ranking method in retrieval tasks to achieve higher retrieval performance, and has attracted lots of attention in recent years.
Ranked #1 on
Image Retrieval
on Par6k
1 code implementation • 12 Aug 2018 • Yusuke Matsui, Ryota Hinami, Shin'ichi Satoh
Owing to the linear layout, the data structure can be dynamically adjusted after new items are added, maintaining the fast speed of the system.
no code implementations • 2 Jul 2018 • Yaman Kumar, Mayank Aggarwal, Pratham Nawal, Shin'ichi Satoh, Rajiv Ratn Shah, Roger Zimmerman
Recently, research has started venturing into generating (audio) speech from silent video sequences but there have been no developments thus far in dealing with divergent views and poses of a speaker.
Sound Audio and Speech Processing
no code implementations • 6 Feb 2018 • Yuki Nagai, Yusuke Uchida, Shigeyuki Sakazawa, Shin'ichi Satoh
In this paper, we propose a digital watermarking technology for ownership authorization of deep neural networks.
no code implementations • 27 Dec 2017 • Sang Phan, Gustav Eje Henter, Yusuke Miyao, Shin'ichi Satoh
First we show that, by replacing model samples with ground-truth sentences, RL training can be seen as a form of weighted cross-entropy loss, giving a fast, RL-based pre-training algorithm.
no code implementations • EMNLP 2018 • Ryota Hinami, Shin'ichi Satoh
The proposed method can retrieve and localize objects specified by a textual query from one million images in only 0. 5 seconds with high precision.
no code implementations • 26 Sep 2017 • Ryota Hinami, Yusuke Matsui, Shin'ichi Satoh
Second, to help users specify spatial relationships among objects in an intuitive way, we propose recommendation techniques of spatial relationships.
no code implementations • ICCV 2017 • Ryota Hinami, Tao Mei, Shin'ichi Satoh
Although convolutional neural networks (CNNs) have achieved promising results in learning such concepts, it remains an open question as to how to effectively use CNNs for abnormal event detection, mainly due to the environment-dependent nature of the anomaly detection.
1 code implementation • 15 Jan 2017 • Yusuke Uchida, Yuki Nagai, Shigeyuki Sakazawa, Shin'ichi Satoh
Secondly, we propose a general framework to embed a watermark into model parameters using a parameter regularizer.
no code implementations • 20 Oct 2016 • Yusuke Uchida, Shigeyuki Sakazawa, Shin'ichi Satoh
In this paper, we propose a stand-alone mobile visual search system based on binary features and the bag-of-visual words framework.
no code implementations • 27 Sep 2016 • Yusuke Uchida, Shigeyuki Sakazawa, Shin'ichi Satoh
Recently, the Fisher vector representation of local features has attracted much attention because of its effectiveness in both image classification and image retrieval.
3 code implementations • 29 Apr 2016 • Amaia Salvador, Xavier Giro-i-Nieto, Ferran Marques, Shin'ichi Satoh
This work explores the suitability for instance retrieval of image- and region-wise representations pooled from an object detection CNN such as Faster R-CNN.
no code implementations • NeurIPS 2011 • Nobuyuki Morioka, Shin'ichi Satoh
Sparse coding, a method of explaining sensory data with as few dictionary bases as possible, has attracted much attention in computer vision.