1 code implementation • IEEE Access 2022 • Masato Fujitake, Akihiro Sugimoto
In this paper, we enhance features element-wisely before the object candidate region detection, proposing Video Sparse Transformer with Attention-guided Memory (VSTAM).
Ranked #1 on
Video Object Detection
on ImageNet VID
no code implementations • CVPR 2022 • Duc Minh Vo, Hong Chen, Akihiro Sugimoto, Hideki Nakayama
We propose an end-to-end Novel Object Captioning with Retrieved vocabulary from External Knowledge method (NOC-REK), which simultaneously learns vocabulary retrieval and caption generation, successfully describing novel objects outside of the training dataset.
1 code implementation • 16 Mar 2022 • Khoa Vo, Kashu Yamazaki, Sang Truong, Minh-Triet Tran, Akihiro Sugimoto, Ngan Le
Temporal action proposal generation (TAPG) aims to estimate temporal intervals of actions in untrimmed videos, which is a challenging yet plays an important role in many tasks of video analysis and understanding.
no code implementations • 16 Mar 2022 • Duc Minh Vo, Akihiro Sugimoto, Hideki Nakayama
We push forward neural network compression research by exploiting a novel challenging task of large-scale conditional generative adversarial networks (GANs) compression.
no code implementations • 17 Jul 2021 • Viet-Khoa Vo-Ho, Ngan Le, Kashu Yamazaki, Akihiro Sugimoto, Minh-Triet Tran
Temporal action proposal generation is an essential and challenging task that aims at localizing temporal intervals containing human actions in untrimmed videos.
2 code implementations • Computer Vision and Image Understanding 2019 • Trung-Nghia Le, Tam V. Nguyen, Zhongliang Nie, Minh-Triet Tran, Akihiro Sugimoto
Different from existing networks for segmentation, our proposed network possesses the second branch for classification to predict the probability of containing camouflaged object(s) in an image, which is then fused into the main branch for segmentation to boost up the segmentation accuracy.
Ranked #9 on
Camouflaged Object Segmentation
on CAMO
no code implementations • ECCV 2020 • Zuzana Kukelova, Cenek Albl, Akihiro Sugimoto, Konrad Schindler, Tomas Pajdla
The internal geometry of most modern consumer cameras is not adequately described by the perspective projection.
no code implementations • CVPR 2020 • Hayato Onizuka, Zehra Hayirci, Diego Thomas, Akihiro Sugimoto, Hideaki Uchiyama, Rin-ichiro Taniguchi
In this paper, we propose the tetrahedral outer shell volumetric truncated signed distance function (TetraTSDF) model for the human body, and its corresponding part connection network (PCN) for 3D human body shape regression.
no code implementations • 19 Nov 2019 • Duc Minh Vo, Akihiro Sugimoto
The semantic content feature and the style representation feature are then concatenated adaptively and fed into the decoder to generate style-transferred (stylized) images.
no code implementations • ECCV 2020 • Duc Minh Vo, Akihiro Sugimoto
We also use individual relation separately to predict from the initial bounding-boxes relation-units for all the relations in the input text.
no code implementations • 30 Dec 2018 • Zuzana Kukelova, Cenek Albl, Akihiro Sugimoto, Tomas Pajdla
Our best 6-point solver, based on the new alternation technique, shows an identical or even better performance than the state-of-the-art R6P solver and is two orders of magnitude faster.
no code implementations • 4 Jul 2018 • Trung-Nghia Le, Akihiro Sugimoto
In addition, to tackle the task of VSSIS, we augment the DAVIS-2017 benchmark dataset by assigning semantic ground-truth for salient instance labels, obtaining SEmantic Salient Instance Video (SESIV) dataset.
1 code implementation • SPIE Journal of Electronic Imaging 2017 • Jiří Borovec, Jan Kybic, Akihiro Sugimoto
Region growing is a classical image segmentation method based on hierarchical region aggregation using local similarity rules.
no code implementations • 4 Aug 2017 • Trung-Nghia Le, Akihiro Sugimoto
Detecting salient objects from a video requires exploiting both spatial and temporal knowledge included in the video.
no code implementations • 4 Aug 2017 • Trung-Nghia Le, Akihiro Sugimoto
STCRF is our extension of CRF to the temporal domain and describes the relationships among neighboring regions both in a frame and over frames.