no code implementations • 30 Jul 2024 • Zhixiang Wang, Baiang Li, Jian Wang, Yu-Lun Liu, Jinwei Gu, Yung-Yu Chuang, Shin'ichi Satoh
This paper introduces an innovative approach for image matting that redefines the traditional regression-based task as a generative modeling challenge.
no code implementations • 27 May 2024 • Jian Zhao, Lei Jin, Jianshu Li, Zheng Zhu, Yinglei Teng, Jiaojiao Zhao, Sadaf Gulshad, Zheng Wang, Bo Zhao, Xiangbo Shu, Yunchao Wei, Xuecheng Nie, Xiaojie Jin, Xiaodan Liang, Shin'ichi Satoh, Yandong Guo, Cewu Lu, Junliang Xing, Jane Shen Shengmei
The SkatingVerse Workshop & Challenge aims to encourage research in developing novel and accurate methods for human action understanding.
no code implementations • 16 Apr 2024 • Avinash Anand, Raj Jaiswal, Pijush Bhuyan, Mohit Gupta, Siddhesh Bangar, Md. Modassir Imam, Rajiv Ratn Shah, Shin'ichi Satoh
Our proposed approach achieves an IOU of 0. 96 and an OCR Accuracy of 78%, showcasing a remarkable improvement of approximately 25% in the OCR Accuracy compared to the previous Table Transformer approach.
1 code implementation • 15 Apr 2024 • Avinash Anand, Raj Jaiswal, Mohit Gupta, Siddhesh S Bangar, Pijush Bhuyan, Naman Lal, Rajeev Singh, Ritika Jha, Rajiv Ratn Shah, Shin'ichi Satoh
To solve this problem, domain adaptation approaches have been developed that use a small quantity of labeled data to adjust the model to the target domain.
no code implementations • 26 Mar 2024 • rintaro yanagi, Yamato Okamoto, Shuhei Yokoo, Shin'ichi Satoh
From the experimental results focusing on segment-level and video-level situations, we can see that three effects: "Segment-level VCD in short video-sharing services is more difficult than those in general video-sharing services", "Video-level VCD in short video-sharing services is easier than those in general video-sharing services", "The video alignment component mainly suppress the detection performance in short video-sharing services".
1 code implementation • 29 Jan 2024 • Zhijing Wan, Zhixiang Wang, Yuran Wang, Zheng Wang, Hongyuan Zhu, Shin'ichi Satoh
Existing methods typically measure both the representation and diversity of data based on similarity metrics, such as L2-norm.
1 code implementation • 15 Sep 2023 • Kejun Lin, Zhixiang Wang, Zheng Wang, Yinqiang Zheng, Shin'ichi Satoh
2) Multi-perspective and multi-style.
1 code implementation • 20 Jun 2023 • Liang Liao, Taorong Liu, Delin Chen, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh
For precise utilization of the reference features for guidance, a reference-patch alignment (Ref-PA) module is proposed to align the patch features of the reference and corrupted images and harmonize their style differences, while a reference-patch transformer (Ref-PT) module is proposed to refine the embedded reference feature.
no code implementations • 13 Apr 2023 • Astha Verma, A V Subramanyam, Siddhesh Bangar, Naman Lal, Rajiv Ratn Shah, Shin'ichi Satoh
However, these methods suffer from high model variance with low performance on high-dimensional datasets due to the ineffective design of the denoiser and are limited in their utilization of ZO techniques.
no code implementations • CVPR 2023 • Mayu Otani, Riku Togashi, Yu Sawai, Ryosuke Ishigami, Yuta Nakashima, Esa Rahtu, Janne Heikkilä, Shin'ichi Satoh
Human evaluation is critical for validating the performance of text-to-image generative models, as this highly cognitive process requires deep comprehension of text and images.
no code implementations • 23 Feb 2023 • Zhixiang Wang, Yu-Lun Liu, Jia-Bin Huang, Shin'ichi Satoh, Sizhuo Ma, Gurunandan Krishnan, Jian Wang
Close-up facial images captured at short distances often suffer from perspective distortion, resulting in exaggerated facial features and unnatural/unattractive appearances.
no code implementations • ICCV 2023 • Xiang Ji, Zhixiang Wang, Shin'ichi Satoh, Yinqiang Zheng
Image degradation often occurs during fast camera or object movements, regardless of the exposure modes: global shutter (GS) or rolling shutter (RS).
1 code implementation • 12 Dec 2022 • Hui Wei, Zhixiang Wang, Xuemei Jia, Yinqiang Zheng, Hao Tang, Shin'ichi Satoh, Zheng Wang
Adversarial attacks on thermal infrared imaging expose the risk of related applications.
1 code implementation • 14 Nov 2022 • Zelong Zeng, Fan Yang, Hong Liu, Shin'ichi Satoh
However, this type of method normally ignores the crucial knowledge hidden in the data (e. g., intra-class information variation), which is harmful to the generalization of the trained model.
1 code implementation • 7 Oct 2022 • Andreu Girbau, Ferran Marqués, Shin'ichi Satoh
Current approaches in Multiple Object Tracking (MOT) rely on the spatio-temporal coherence between detections combined with object appearance to match objects from consecutive frames.
Ranked #13 on Multi-Object Tracking on MOT17
1 code implementation • 30 Sep 2022 • Hui Wei, Hao Tang, Xuemei Jia, Zhixiang Wang, Hanxun Yu, Zhubo Li, Shin'ichi Satoh, Luc van Gool, Zheng Wang
Building upon this foundation, we uncover the pervasive role of artifacts carrying adversarial perturbations in the physical world.
1 code implementation • 29 Jul 2022 • Taorong Liu, Liang Liao, Zheng Wang, Shin'ichi Satoh
Existing learning-based image inpainting methods are still in challenge when facing complex semantic environments and diverse hole patterns.
1 code implementation • 17 Jun 2022 • Zelong Zeng, Fan Yang, Zheng Wang, Shin'ichi Satoh
Most deep metric learning (DML) methods employ a strategy that forces all positive samples to be close in the embedding space while keeping them away from negative ones.
1 code implementation • 10 Jun 2022 • Liang Liao, WenYi Chen, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh
Specifically, based on the two discoveries of local spatial similarity and adjacent temporal correspondence of the sequential image data, we propose a novel Target-Domain driven pseudo label Diffusion (TDo-Dif) scheme.
1 code implementation • 22 May 2022 • Zelong Zeng, Zheng Wang, Fan Yang, Shin'ichi Satoh
The large variation of viewpoint and irrelevant content around the target always hinder accurate image retrieval and its subsequent tasks.
1 code implementation • CVPR 2022 • Zhixiang Wang, Xiang Ji, Jia-Bin Huang, Shin'ichi Satoh, Xiao Zhou, Yinqiang Zheng
In this paper, we investigate using rolling shutter with a global reset feature (RSGR) to restore clean global shutter (GS) videos.
1 code implementation • CVPR 2022 • Mayu Otani, Riku Togashi, Yuta Nakashima, Esa Rahtu, Janne Heikkilä, Shin'ichi Satoh
OC-cost computes the cost of correcting detections to ground truths as a measure of accuracy.
1 code implementation • 29 Oct 2021 • Nobukatsu Kajiura, Hong Liu, Shin'ichi Satoh
This framework consists of three key components, i. e., a pseudo-edge generator, a pseudo-map generator, and an uncertainty-aware refinement module.
no code implementations • 11 May 2021 • Riku Togashi, Masahiro Kato, Mayu Otani, Tetsuya Sakai, Shin'ichi Satoh
However, such methods have two main drawbacks particularly in large-scale applications; (1) the pairwise approach is severely inefficient due to the quadratic computational cost; and (2) even recent model-based samplers (e. g. IRGAN) cannot achieve practical efficiency due to the training of an extra model.
no code implementations • 19 Jan 2021 • Riku Togashi, Masahiro Kato, Mayu Otani, Shin'ichi Satoh
Learning from implicit user feedback is challenging as we can only observe positive samples but never access negative ones.
no code implementations • CVPR 2021 • Liang Liao, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh
In this paper, we introduce coherence priors between the semantics and textures which make it possible to concentrate on completing separate textures in a semantic-wise manner.
2 code implementations • 10 Nov 2020 • Riku Togashi, Mayu Otani, Shin'ichi Satoh
Solving cold-start problems is indispensable to provide meaningful recommendation results for new users and items.
no code implementations • 12 Aug 2020 • Yuting Liu, Zheng Wang, Miaojing Shi, Shin'ichi Satoh, Qijun Zhao, Hongyu Yang
We formulate the mutual transformations between the outputs of regression- and detection-based models as two scene-agnostic transformers which enable knowledge distillation between the two models.
no code implementations • ECCV 2020 • Liang Liao, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh
Completing a corrupted image with correct structures and reasonable textures for a mixed scene remains an elusive challenge.
no code implementations • 14 Jun 2019 • Changhee Han, Leonardo Rundo, Kohei Murao, Zoltán Ádám Milacski, Kazuki Umemoto, Evis Sala, Hideki Nakayama, Shin'ichi Satoh
Unsupervised learning can discover various unseen diseases, relying on large-scale unannotated medical images of healthy subjects.
Generative Adversarial Network Unsupervised Anomaly Detection
no code implementations • 24 May 2019 • Zheng Wang, Zhixiang Wang, Yinqiang Zheng, Yang Wu, Wen-Jun Zeng, Shin'ichi Satoh
An efficient and effective person re-identification (ReID) system relieves the users from painful and boring video watching and accelerates the process of video analysis.
no code implementations • 13 May 2019 • Ziling Huang, Zheng Wang, Chung-Chi Tsai, Shin'ichi Satoh, Chia-Wen Lin
To gain the superiority of deep learning models, we treat a group as multiple persons and transfer the domain of a labeled ReID dataset to a G-ReID target dataset style to learn single representations.
no code implementations • 11 May 2019 • Zelong Zeng, Zhixiang Wang, Zheng Wang, Yinqiang Zheng, Yung-Yu Chuang, Shin'ichi Satoh
To demonstrate the illumination issue and to evaluate our model, we construct two large-scale simulated datasets with a wide range of illumination variations.
no code implementations • 29 Mar 2019 • Changhee Han, Kohei Murao, Shin'ichi Satoh, Hideki Nakayama
Convolutional Neural Network (CNN)-based accurate prediction typically requires large-scale annotated training data.
no code implementations • 26 Feb 2019 • Changhee Han, Kohei Murao, Tomoyuki Noguchi, Yusuke Kawata, Fumiya Uchiyama, Leonardo Rundo, Hideki Nakayama, Shin'ichi Satoh
Accurate Computer-Assisted Diagnosis, associated with proper data wrangling, can alleviate the risk of overlooking the diagnosis in a clinical environment.
2 code implementations • 27 Nov 2018 • Fan Yang, Ryota Hinami, Yusuke Matsui, Steven Ly, Shin'ichi Satoh
Diffusion is commonly used as a ranking or re-ranking method in retrieval tasks to achieve higher retrieval performance, and has attracted lots of attention in recent years.
Ranked #1 on Image Retrieval on Par6k
1 code implementation • 12 Aug 2018 • Yusuke Matsui, Ryota Hinami, Shin'ichi Satoh
Owing to the linear layout, the data structure can be dynamically adjusted after new items are added, maintaining the fast speed of the system.
no code implementations • 2 Jul 2018 • Yaman Kumar, Mayank Aggarwal, Pratham Nawal, Shin'ichi Satoh, Rajiv Ratn Shah, Roger Zimmerman
Recently, research has started venturing into generating (audio) speech from silent video sequences but there have been no developments thus far in dealing with divergent views and poses of a speaker.
Sound Audio and Speech Processing
no code implementations • 6 Feb 2018 • Yuki Nagai, Yusuke Uchida, Shigeyuki Sakazawa, Shin'ichi Satoh
In this paper, we propose a digital watermarking technology for ownership authorization of deep neural networks.
no code implementations • 27 Dec 2017 • Sang Phan, Gustav Eje Henter, Yusuke Miyao, Shin'ichi Satoh
First we show that, by replacing model samples with ground-truth sentences, RL training can be seen as a form of weighted cross-entropy loss, giving a fast, RL-based pre-training algorithm.
no code implementations • EMNLP 2018 • Ryota Hinami, Shin'ichi Satoh
The proposed method can retrieve and localize objects specified by a textual query from one million images in only 0. 5 seconds with high precision.
no code implementations • ICCV 2017 • Ryota Hinami, Tao Mei, Shin'ichi Satoh
Although convolutional neural networks (CNNs) have achieved promising results in learning such concepts, it remains an open question as to how to effectively use CNNs for abnormal event detection, mainly due to the environment-dependent nature of the anomaly detection.
no code implementations • 26 Sep 2017 • Ryota Hinami, Yusuke Matsui, Shin'ichi Satoh
Second, to help users specify spatial relationships among objects in an intuitive way, we propose recommendation techniques of spatial relationships.
1 code implementation • 15 Jan 2017 • Yusuke Uchida, Yuki Nagai, Shigeyuki Sakazawa, Shin'ichi Satoh
Secondly, we propose a general framework to embed a watermark into model parameters using a parameter regularizer.
no code implementations • 20 Oct 2016 • Yusuke Uchida, Shigeyuki Sakazawa, Shin'ichi Satoh
In this paper, we propose a stand-alone mobile visual search system based on binary features and the bag-of-visual words framework.
no code implementations • 27 Sep 2016 • Yusuke Uchida, Shigeyuki Sakazawa, Shin'ichi Satoh
Recently, the Fisher vector representation of local features has attracted much attention because of its effectiveness in both image classification and image retrieval.
3 code implementations • 29 Apr 2016 • Amaia Salvador, Xavier Giro-i-Nieto, Ferran Marques, Shin'ichi Satoh
This work explores the suitability for instance retrieval of image- and region-wise representations pooled from an object detection CNN such as Faster R-CNN.
no code implementations • NeurIPS 2011 • Nobuyuki Morioka, Shin'ichi Satoh
Sparse coding, a method of explaining sensory data with as few dictionary bases as possible, has attracted much attention in computer vision.