2 code implementations • 28 Mar 2024 • Nobuhiro Ueda, Hideko Habe, Yoko Matsui, Akishige Yuguchi, Seiya Kawano, Yasutomo Kawanishi, Sadao Kurohashi, Koichiro Yoshino
Understanding expressions that refer to the physical world is crucial for human-assisting systems in the real world, such as robots that must perform the actions users expect.
1 code implementation • 26 Mar 2024 • Shun Inadumi, Seiya Kawano, Akishige Yuguchi, Yasutomo Kawanishi, Koichiro Yoshino
Such ambiguities in questions are often clarified by the contexts in conversational situations, such as joint attention with a user or user gaze information.
no code implementations • 18 Mar 2024 • Vijay John, Yasutomo Kawanishi
In this paper, we propose a novel learning framework, where the weak labels are first used to train a multi-view video-based base model, which is subsequently used for downstream frame-level perception tasks.
no code implementations • 20 Oct 2023 • Daiju Kanaoka, Motoharu Sonogashira, Hakaru Tamukoh, Yasutomo Kawanishi
DietNeRF is an extension of NeRF that aims to achieve this task from only a few images by introducing a new loss function for unknown viewpoints with no input images.
no code implementations • 18th International Conference on Machine Vision and Applications (MVA) 2023 • Da Huo, Marc A. Kastner, TingWei Liu, Yasutomo Kawanishi, Takatsugu Hirayama, Takahiro Komamizu, Ichiro Ide
Object detection is the task of detecting objects in an image.
Ranked #4 on Small Object Detection on SOD4SB Public Test (using extra training data)
1 code implementation • 18 Jul 2023 • Yuki Kondo, Norimichi Ukita, Takayuki Yamaguchi, Hao-Yu Hou, Mu-Yi Shen, Chia-Chi Hsu, En-Ming Huang, Yu-Chen Huang, Yu-Cheng Xia, Chien-Yao Wang, Chun-Yi Lee, Da Huo, Marc A. Kastner, TingWei Liu, Yasutomo Kawanishi, Takatsugu Hirayama, Takahiro Komamizu, Ichiro Ide, Yosuke Shinya, Xinyao Liu, Guang Liang, Syusuke Yasui
Small Object Detection (SOD) is an important machine vision topic because (i) a variety of real-world applications require object detection for distant objects and (ii) SOD is a challenging task due to the noisy, blurred, and less-informative image appearances of small objects.
Ranked #2 on Small Object Detection on SOD4SB Public Test (using extra training data)
no code implementations • ICCV 2023 • Shu Nakamura, Yasutomo Kawanishi, Shohei Nobuhara, Ko Nishino
The first is the introduction of a first-of-its-kind large-scale dataset for pointing recognition and direction estimation, which we refer to as the DP Dataset.
no code implementations • 6 Mar 2023 • Chihaya Matsuhira, Marc A. Kastner, Takahiro Komamizu, Takatsugu Hirayama, Keisuke Doman, Yasutomo Kawanishi, Ichiro Ide
Furthermore, in some multimodal retrieval tasks, we confirm that the proposed pronunciation encoder enhances the performance of the text encoder and that the pronunciation encoder handles nonsense words in a more phonetic manner than the text encoder.
no code implementations • 20 Oct 2022 • Vijay John, Yasutomo Kawanishi
In the framework, a novel deep latent embedding framework, termed the AVTNet, is proposed to learn multiple latent embeddings.
1 code implementation • 27 Jun 2021 • Hitoshi Nishimura, Satoshi Komorita, Yasutomo Kawanishi, Hiroshi Murase
To maintain tracking accuracy, we introduce robust interest point selection within human regions and a tracking termination metric calculated from the distribution of the interest points.
1 code implementation • 18 Sep 2019 • Hitoshi Nishimura, Kazuyuki Tasaka, Yasutomo Kawanishi, Hiroshi Murase
The accurate human tracking result using PAF helps multi-frame-based action recognition.