no code implementations • 5 Dec 2024 • Mithun Parab, Pranay Lendave, Jiyoung Kim, Thi Quynh Dan Nguyen, Palash Ingle
In image-assisted minimally invasive surgeries (MIS), understanding surgical scenes is vital for real-time feedback to surgeons, skill evaluation, and improving outcomes through collaborative human-robot procedures.
no code implementations • 4 Dec 2024 • Siyoon Jin, Jisu Nam, Jiyoung Kim, Dahyun Chung, Yeong-Seok Kim, Joonhyung Park, Heonjeong Chu, Seungryong Kim
Recent tuning-free approaches address this limitation by transferring local appearance from the exemplar image to the synthesized image through implicit cross-image matching in the augmented self-attention mechanism of pre-trained diffusion models.
1 code implementation • 28 Mar 2024 • Seyeon Kim, Siyoon Jin, JiHye Park, Kihong Kim, Jiyoung Kim, Jisu Nam, Seungryong Kim
AToM excels in capturing subtle lip movements by leveraging an audio attention mechanism.
no code implementations • 12 Dec 2023 • Jiyoung Kim, Kyuhong Shim, Insu Lee, Byonghyo Shim
In this paper, we propose a novel USS framework called Expand-and-Quantize Unsupervised Semantic Segmentation (EQUSS), which combines the benefits of high-dimensional spaces for better clustering and product quantization for effective information compression.
Ranked #4 on
Unsupervised Semantic Segmentation
on Potsdam-3
no code implementations • 25 Apr 2023 • Kyuhong Shim, Jiyoung Kim, Gusang Lee, Byonghyo Shim
Monocular depth estimation is very challenging because clues to the exact depth are incomplete in a single RGB image.
no code implementations • 6 Sep 2022 • Yongjun Ahn, jinhong Kim, Seungnyun Kim, Kyuhong Shim, Jiyoung Kim, Sangtae Kim, Byonghyo Shim
Beamforming technique realized by the multiple-input-multiple-output (MIMO) antenna arrays has been widely used to compensate for the severe path loss in the millimeter wave (mmWave) bands.