no code implementations • 15 Dec 2024 • Naoki Wake, Atsushi Kanehira, Daichi Saito, Jun Takamatsu, Kazuhiro Sasabuchi, Hideki Koike, Katsushi Ikeuchi
Multi-step dexterous manipulation is a fundamental skill in household scenarios, yet remains an underexplored area in robotics.
no code implementations • 25 Feb 2024 • Yasheng Sun, Wenqing Chu, Hang Zhou, Kaisiyuan Wang, Hideki Koike
In this paper, we propose AVI-Talking, an Audio-Visual Instruction system for expressive Talking face generation.
no code implementations • 14 Feb 2023 • Yasheng Sun, Qianyi Wu, Hang Zhou, Kaisiyuan Wang, Tianshu Hu, Chen-Chieh Liao, Shio Miyafuji, Ziwei Liu, Hideki Koike
Creating photo-realistic versions of people's sketched portraits is useful for various entertainment purposes.
no code implementations • 9 Dec 2022 • Yasheng Sun, Hang Zhou, Kaisiyuan Wang, Qianyi Wu, Zhibin Hong, Jingtuo Liu, Errui Ding, Jingdong Wang, Ziwei Liu, Hideki Koike
This requires masking a large percentage of the original image and seamlessly inpainting it with the aid of audio and reference frames.
1 code implementation • 8 Apr 2021 • Christopher Mitcheltree, Hideki Koike
Learning to program an audio production VST synthesizer is a time-consuming process, usually obtained through inefficient trial and error and only mastered after years of experience.
1 code implementation • 27 Feb 2021 • Naoki Wake, Daichi Saito, Kazuhiro Sasabuchi, Hideki Koike, Katsushi Ikeuchi
These findings highlight the significance of object affordance in multimodal robot teaching, regardless of whether real objects are present in the images.
1 code implementation • 5 Feb 2021 • Christopher Mitcheltree, Hideki Koike
Learning to program an audio production VST plugin is a time-consuming process, usually obtained through inefficient trial and error and only mastered after extensive user experience.
no code implementations • 15 Jan 2020 • Dong-Hyun Hwang, Suntae Kim, Nicolas Monet, Hideki Koike, Soonmin Bae
We present MoVNect, a lightweight deep neural network to capture 3D human pose using a single RGB camera.