Search Results for author: Shin'ichi Satoh

Found 45 papers, 20 papers with code

Generalized Lasso based Approximation of Sparse Coding for Visual Recognition

no code implementations • NeurIPS 2011 • Nobuyuki Morioka, Shin'ichi Satoh

Sparse coding, a method of explaining sensory data with as few dictionary bases as possible, has attracted much attention in computer vision.

Object Recognition

Paper
Add Code

Faster R-CNN Features for Instance Search

3 code implementations • 29 Apr 2016 • Amaia Salvador, Xavier Giro-i-Nieto, Ferran Marques, Shin'ichi Satoh

This work explores the suitability for instance retrieval of image- and region-wise representations pooled from an object detection CNN such as Faster R-CNN.

Instance Search object-detection +3

216

Paper
Code

Image Retrieval with Fisher Vectors of Binary Features

no code implementations • 27 Sep 2016 • Yusuke Uchida, Shigeyuki Sakazawa, Shin'ichi Satoh

Recently, the Fisher vector representation of local features has attracted much attention because of its effectiveness in both image classification and image retrieval.

Classification General Classification +3

Paper
Add Code

Adaptive Substring Extraction and Modified Local NBNN Scoring for Binary Feature-based Local Mobile Visual Search without False Positives

no code implementations • 20 Oct 2016 • Yusuke Uchida, Shigeyuki Sakazawa, Shin'ichi Satoh

In this paper, we propose a stand-alone mobile visual search system based on binary features and the bag-of-visual words framework.

Image Retrieval Retrieval

Paper
Add Code

Embedding Watermarks into Deep Neural Networks

1 code implementation • 15 Jan 2017 • Yusuke Uchida, Yuki Nagai, Shigeyuki Sakazawa, Shin'ichi Satoh

Secondly, we propose a general framework to embed a watermark into model parameters using a parameter regularizer.

115

Paper
Code

Joint Detection and Recounting of Abnormal Events by Learning Deep Generic Knowledge

no code implementations • ICCV 2017 • Ryota Hinami, Tao Mei, Shin'ichi Satoh

Although convolutional neural networks (CNNs) have achieved promising results in learning such concepts, it remains an open question as to how to effectively use CNNs for abnormal event detection, mainly due to the environment-dependent nature of the anomaly detection.

Anomaly Detection Event Detection +1

Paper
Add Code

Region-Based Image Retrieval Revisited

no code implementations • 26 Sep 2017 • Ryota Hinami, Yusuke Matsui, Shin'ichi Satoh

Second, to help users specify spatial relationships among objects in an intuitive way, we propose recommendation techniques of spatial relationships.

Attribute Image Retrieval +3

Paper
Add Code

Discriminative Learning of Open-Vocabulary Object Retrieval and Localization by Negative Phrase Augmentation

no code implementations • EMNLP 2018 • Ryota Hinami, Shin'ichi Satoh

The proposed method can retrieve and localize objects specified by a textual query from one million images in only 0. 5 seconds with high precision.

Object object-detection +2

Paper
Add Code

Consensus-based Sequence Training for Video Captioning

no code implementations • 27 Dec 2017 • Sang Phan, Gustav Eje Henter, Yusuke Miyao, Shin'ichi Satoh

First we show that, by replacing model samples with ground-truth sentences, RL training can be seen as a form of weighted cross-entropy loss, giving a fast, RL-based pre-training algorithm.

Reinforcement Learning (RL) Video Captioning

Paper
Add Code

Digital Watermarking for Deep Neural Networks

no code implementations • 6 Feb 2018 • Yuki Nagai, Yusuke Uchida, Shigeyuki Sakazawa, Shin'ichi Satoh

In this paper, we propose a digital watermarking technology for ownership authorization of deep neural networks.

Paper
Add Code

Harnessing AI for Speech Reconstruction using Multi-view Silent Video Feed

no code implementations • 2 Jul 2018 • Yaman Kumar, Mayank Aggarwal, Pratham Nawal, Shin'ichi Satoh, Rajiv Ratn Shah, Roger Zimmerman

Recently, research has started venturing into generating (audio) speech from silent video sequences but there have been no developments thus far in dealing with divergent views and poses of a speaker.

Sound Audio and Speech Processing

Paper
Add Code

Reconfigurable Inverted Index

1 code implementation • 12 Aug 2018 • Yusuke Matsui, Ryota Hinami, Shin'ichi Satoh

Owing to the linear layout, the data structure can be dynamically adjusted after new items are added, maintaining the fast speed of the system.

152

Paper
Code

Efficient Image Retrieval via Decoupling Diffusion into Online and Offline Processing

2 code implementations • 27 Nov 2018 • Fan Yang, Ryota Hinami, Yusuke Matsui, Steven Ly, Shin'ichi Satoh

Diffusion is commonly used as a ranking or re-ranking method in retrieval tasks to achieve higher retrieval performance, and has attracted lots of attention in recent years.

Ranked #1 on Image Retrieval on Par6k

Image Retrieval Re-Ranking +1

220

Paper
Code

Learning More with Less: Conditional PGGAN-based Data Augmentation for Brain Metastases Detection Using Highly-Rough Annotation on MR Images

no code implementations • 26 Feb 2019 • Changhee Han, Kohei Murao, Tomoyuki Noguchi, Yusuke Kawata, Fumiya Uchiyama, Leonardo Rundo, Hideki Nakayama, Shin'ichi Satoh

Accurate Computer-Assisted Diagnosis, associated with proper data wrangling, can alleviate the risk of overlooking the diagnosis in a clinical environment.

Data Augmentation

Paper
Add Code

Learning More with Less: GAN-based Medical Image Augmentation

no code implementations • 29 Mar 2019 • Changhee Han, Kohei Murao, Shin'ichi Satoh, Hideki Nakayama

Convolutional Neural Network (CNN)-based accurate prediction typically requires large-scale annotated training data.

Image Augmentation object-detection +1

Paper
Add Code

Illumination-Adaptive Person Re-identification

no code implementations • 11 May 2019 • Zelong Zeng, Zhixiang Wang, Zheng Wang, Yinqiang Zheng, Yung-Yu Chuang, Shin'ichi Satoh

To demonstrate the illumination issue and to evaluate our model, we construct two large-scale simulated datasets with a wide range of illumination variations.

Disentanglement Person Re-Identification +2

Paper
Add Code

DotSCN: Group Re-identification via Domain-Transferred Single and Couple Representation Learning

no code implementations • 13 May 2019 • Ziling Huang, Zheng Wang, Chung-Chi Tsai, Shin'ichi Satoh, Chia-Wen Lin

To gain the superiority of deep learning models, we treat a group as multiple persons and transfer the domain of a labeled ReID dataset to a G-ReID target dataset style to learn single representations.

Person Re-Identification Representation Learning

Paper
Add Code

Beyond Intra-modality: A Survey of Heterogeneous Person Re-identification

no code implementations • 24 May 2019 • Zheng Wang, Zhixiang Wang, Yinqiang Zheng, Yang Wu, Wen-Jun Zeng, Shin'ichi Satoh

An efficient and effective person re-identification (ReID) system relieves the users from painful and boring video watching and accelerates the process of video analysis.

Person Re-Identification

Paper
Add Code

GAN-based Multiple Adjacent Brain MRI Slice Reconstruction for Unsupervised Alzheimer's Disease Diagnosis

no code implementations • 14 Jun 2019 • Changhee Han, Leonardo Rundo, Kohei Murao, Zoltán Ádám Milacski, Kazuki Umemoto, Evis Sala, Hideki Nakayama, Shin'ichi Satoh

Unsupervised learning can discover various unseen diseases, relying on large-scale unannotated medical images of healthy subjects.

Generative Adversarial Network Unsupervised Anomaly Detection

Paper
Add Code

Guidance and Evaluation: Semantic-Aware Image Inpainting for Mixed Scenes

no code implementations • ECCV 2020 • Liang Liao, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh

Completing a corrupted image with correct structures and reasonable textures for a mixed scene remains an elusive challenge.

Image Inpainting Semantic Segmentation +1

Paper
Add Code

Towards Unsupervised Crowd Counting via Regression-Detection Bi-knowledge Transfer

no code implementations • 12 Aug 2020 • Yuting Liu, Zheng Wang, Miaojing Shi, Shin'ichi Satoh, Qijun Zhao, Hongyu Yang

We formulate the mutual transformations between the outputs of regression- and detection-based models as two scene-agnostic transformers which enable knowledge distillation between the two models.

Crowd Counting Knowledge Distillation +3

Paper
Add Code

Alleviating Cold-Start Problems in Recommendation through Pseudo-Labelling over Knowledge Graph

2 code implementations • 10 Nov 2020 • Riku Togashi, Mayu Otani, Shin'ichi Satoh

Solving cold-start problems is indispensable to provide meaningful recommendation results for new users and items.

Paper
Code

Image Inpainting Guided by Coherence Priors of Semantics and Textures

no code implementations • CVPR 2021 • Liang Liao, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh

In this paper, we introduce coherence priors between the semantics and textures which make it possible to concentrate on completing separate textures in a semantic-wise manner.

Image Inpainting Semantic Segmentation

Paper
Add Code

Density-Ratio Based Personalised Ranking from Implicit Feedback

no code implementations • 19 Jan 2021 • Riku Togashi, Masahiro Kato, Mayu Otani, Shin'ichi Satoh

Learning from implicit user feedback is challenging as we can only observe positive samples but never access negative ones.

Density Ratio Estimation

Paper
Add Code

Scalable Personalised Item Ranking through Parametric Density Estimation

no code implementations • 11 May 2021 • Riku Togashi, Masahiro Kato, Mayu Otani, Tetsuya Sakai, Shin'ichi Satoh

However, such methods have two main drawbacks particularly in large-scale applications; (1) the pairwise approach is severely inefficient due to the quadratic computational cost; and (2) even recent model-based samplers (e. g. IRGAN) cannot achieve practical efficiency due to the training of an extra model.

Density Estimation Learning-To-Rank

Paper
Add Code

Improving Camouflaged Object Detection with the Uncertainty of Pseudo-edge Labels

1 code implementation • 29 Oct 2021 • Nobukatsu Kajiura, Hong Liu, Shin'ichi Satoh

This framework consists of three key components, i. e., a pseudo-edge generator, a pseudo-map generator, and an uncertainty-aware refinement module.

object-detection Object Detection

Paper
Code

Optimal Correction Cost for Object Detection Evaluation

1 code implementation • CVPR 2022 • Mayu Otani, Riku Togashi, Yuta Nakashima, Esa Rahtu, Janne Heikkilä, Shin'ichi Satoh

OC-cost computes the cost of correcting detections to ground truths as a measure of accuracy.

Object object-detection +2

Paper
Code

Neural Global Shutter: Learn to Restore Video from a Rolling Shutter Camera with Global Reset Feature

1 code implementation • CVPR 2022 • Zhixiang Wang, Xiang Ji, Jia-Bin Huang, Shin'ichi Satoh, Xiao Zhou, Yinqiang Zheng

In this paper, we investigate using rolling shutter with a global reset feature (RSGR) to restore clean global shutter (GS) videos.

Image-to-Image Translation Motion Estimation

Paper
Code

Geo-Localization via Ground-to-Satellite Cross-View Image Retrieval

1 code implementation • 22 May 2022 • Zelong Zeng, Zheng Wang, Fan Yang, Shin'ichi Satoh

The large variation of viewpoint and irrelevant content around the target always hinder accurate image retrieval and its subsequent tasks.

Image Retrieval Representation Learning +1

Paper
Code

Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion

1 code implementation • 10 Jun 2022 • Liang Liao, WenYi Chen, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh

Specifically, based on the two discoveries of local spatial similarity and adjacent temporal correspondence of the sequential image data, we propose a novel Target-Domain driven pseudo label Diffusion (TDo-Dif) scheme.

Autonomous Driving Pseudo Label +4

Paper
Code

Improving Generalization of Metric Learning via Listwise Self-distillation

1 code implementation • 17 Jun 2022 • Zelong Zeng, Fan Yang, Zheng Wang, Shin'ichi Satoh

Most deep metric learning (DML) methods employ a strategy that forces all positive samples to be close in the embedding space while keeping them away from negative ones.

Metric Learning

Paper
Code

Reference-Guided Texture and Structure Inference for Image Inpainting

1 code implementation • 29 Jul 2022 • Taorong Liu, Liang Liao, Zheng Wang, Shin'ichi Satoh

Existing learning-based image inpainting methods are still in challenge when facing complex semantic environments and diverse hole patterns.

Image Inpainting

Paper
Code

Physical Adversarial Attack meets Computer Vision: A Decade Survey

1 code implementation • 30 Sep 2022 • Hui Wei, Hao Tang, Xuemei Jia, Zhixiang Wang, Hanxun Yu, Zhubo Li, Shin'ichi Satoh, Luc van Gool, Zheng Wang

Building upon this foundation, we uncover the pervasive role of artifacts carrying adversarial perturbations in the physical world.

Adversarial Attack Medical Diagnosis

Paper
Code

Multiple Object Tracking from appearance by hierarchically clustering tracklets

1 code implementation • 7 Oct 2022 • Andreu Girbau, Ferran Marqués, Shin'ichi Satoh

Current approaches in Multiple Object Tracking (MOT) rely on the spatio-temporal coherence between detections combined with object appearance to match objects from consecutive frames.

Ranked #10 on Multi-Object Tracking on MOT17

Clustering Multi-Object Tracking +2

Paper
Code

Self-distillation with Online Diffusion on Batch Manifolds Improves Deep Metric Learning

1 code implementation • 14 Nov 2022 • Zelong Zeng, Fan Yang, Hong Liu, Shin'ichi Satoh

However, this type of method normally ignores the crucial knowledge hidden in the data (e. g., intra-class information variation), which is harmful to the generalization of the trained model.

Metric Learning

Paper
Code

HOTCOLD Block: Fooling Thermal Infrared Detectors with a Novel Wearable Design

1 code implementation • 12 Dec 2022 • Hui Wei, Zhixiang Wang, Xuemei Jia, Yinqiang Zheng, Hao Tang, Shin'ichi Satoh, Zheng Wang

Adversarial attacks on thermal infrared imaging expose the risk of related applications.

Adversarial Attack

Paper
Code

Single Image Deblurring with Row-dependent Blur Magnitude

no code implementations • ICCV 2023 • Xiang Ji, Zhixiang Wang, Shin'ichi Satoh, Yinqiang Zheng

Image degradation often occurs during fast camera or object movements, regardless of the exposure modes: global shutter (GS) or rolling shutter (RS).

Deblurring Image Deblurring

Paper
Add Code

DisCO: Portrait Distortion Correction with Perspective-Aware 3D GANs

no code implementations • 23 Feb 2023 • Zhixiang Wang, Yu-Lun Liu, Jia-Bin Huang, Shin'ichi Satoh, Sizhuo Ma, Gurunandan Krishnan, Jian Wang

Close-up facial images captured at short distances often suffer from perspective distortion, resulting in exaggerated facial features and unnatural/unattractive appearances.

Scheduling

Paper
Add Code

Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation

no code implementations • CVPR 2023 • Mayu Otani, Riku Togashi, Yu Sawai, Ryosuke Ishigami, Yuta Nakashima, Esa Rahtu, Janne Heikkilä, Shin'ichi Satoh

Human evaluation is critical for validating the performance of text-to-image generative models, as this highly cognitive process requires deep comprehension of text and images.

Text-to-Image Generation

Paper
Add Code

Certified Zeroth-order Black-Box Defense with Robust UNet Denoiser

no code implementations • 13 Apr 2023 • Astha Verma, Siddhesh Bangar, A V Subramanyam, Naman Lal, Rajiv Ratn Shah, Shin'ichi Satoh

However, these methods suffer from high model variance with low performance on high-dimensional datasets due to the ineffective design of the denoiser and are limited in their utilization of ZO techniques.

Image Reconstruction

Paper
Add Code

TransRef: Multi-Scale Reference Embedding Transformer for Reference-Guided Image Inpainting

1 code implementation • 20 Jun 2023 • Liang Liao, Taorong Liu, Delin Chen, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh

For precise utilization of the reference features for guidance, a reference-patch alignment (Ref-PA) module is proposed to align the patch features of the reference and corrupted images and harmonize their style differences, while a reference-patch transformer (Ref-PT) module is proposed to refine the embedded reference feature.

Image Inpainting Image Restoration

Paper
Code

Beyond Domain Gap: Exploiting Subjectivity in Sketch-Based Person Retrieval

1 code implementation • 15 Sep 2023 • Kejun Lin, Zhixiang Wang, Zheng Wang, Yinqiang Zheng, Shin'ichi Satoh

2) Multi-perspective and multi-style.

Person Re-Identification Person Retrieval +1

Paper
Code

Contributing Dimension Structure of Deep Feature for Coreset Selection

1 code implementation • 29 Jan 2024 • Zhijing Wan, Zhixiang Wang, Yuran Wang, Zheng Wang, Hongyuan Zhu, Shin'ichi Satoh

Existing methods typically measure both the representation and diversity of data based on similarity metrics, such as L2-norm.

Paper
Code

The Effects of Short Video-Sharing Services on Video Copy Detection

no code implementations • 26 Mar 2024 • rintaro yanagi, Yamato Okamoto, Shuhei Yokoo, Shin'ichi Satoh

From the experimental results focusing on segment-level and video-level situations, we can see that three effects: "Segment-level VCD in short video-sharing services is more difficult than those in general video-sharing services", "Video-level VCD in short video-sharing services is easier than those in general video-sharing services", "The video alignment component mainly suppress the detection performance in short video-sharing services".

Copy Detection Video Alignment

Paper
Add Code

RanLayNet: A Dataset for Document Layout Detection used for Domain Adaptation and Generalization

1 code implementation • 15 Apr 2024 • Avinash Anand, Raj Jaiswal, Mohit Gupta, Siddhesh S Bangar, Pijush Bhuyan, Naman Lal, Rajeev Singh, Ritika Jha, Rajiv Ratn Shah, Shin'ichi Satoh

To solve this problem, domain adaptation approaches have been developed that use a small quantity of labeled data to adjust the model to the target domain.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.