Search Results for author: Sanguk Park

Found 6 papers, 2 papers with code

MMTF: Multi-Modal Temporal Fusion for Commonsense Video Question Answering

no code implementations • Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, 2023 2023 • Mobeen Ahmad, Geonwoo Park, Dongchan Park, Sanguk Park

To address this, we propose a novel vision-text fusion module that learns the temporal context of the video and question.

Ranked #8 on Video Question Answering on AGQA 2.0 balanced (Average Accuracy metric)

counterfactual Question Answering +1

Paper
Add Code

Query-Dependent Video Representation for Moment Retrieval and Highlight Detection

1 code implementation • CVPR 2023 • WonJun Moon, Sangeek Hyun, Sanguk Park, Dongchan Park, Jae-Pil Heo

As we observe the insignificant role of a given query in transformer architectures, our encoding module starts with cross-attention layers to explicitly inject the context of text query into video representation.

Ranked #2 on Highlight Detection on TvSum

Highlight Detection Moment Retrieval +4

167

Paper
Code

Technical Report for CVPR 2022 LOVEU AQTC Challenge

1 code implementation • 29 Jun 2022 • Hyeonyu Kim, Jongeun Kim, Jeonghun Kang, Sanguk Park, Dongchan Park, Taehwan Kim

This technical report presents the 2nd winning model for AQTC, a task newly introduced in CVPR 2022 LOng-form VidEo Understanding (LOVEU) challenges.

Video Understanding

Paper
Code

Cardiac Segmentation on CT Images through Shape-Aware Contour Attentions

no code implementations • 27 May 2021 • Sanguk Park, Minyoung Chung

Cardiac segmentation of atriums, ventricles, and myocardium in computed tomography (CT) images is an important first-line task for presymptomatic cardiovascular disease diagnosis.

Cardiac Segmentation Computed Tomography (CT) +5

Paper
Add Code

Individual Tooth Detection and Identification from Dental Panoramic X-Ray Images via Point-wise Localization and Distance Regularization

no code implementations • 12 Apr 2020 • Minyoung Chung, Jusang Lee, Sanguk Park, Minkyung Lee, Chae Eun Lee, Jeongjin Lee, Yeong-Gil Shin

The accuracy of identification achieved a precision of 0. 997 and recall value of 0. 972.

regression

Paper
Add Code

Pose-Aware Instance Segmentation Framework from Cone Beam CT Images for Tooth Segmentation

no code implementations • 6 Feb 2020 • Minyoung Chung, Minkyung Lee, Jioh Hong, Sanguk Park, Jusang Lee, Jingyu Lee, Jeongjin Lee, Yeong-Gil Shin

The primary significance of the proposed method is two-fold: 1) an introduction of pose-aware VOI realignment followed by a robust tooth detection and 2) a metal-robust CNN framework for accurate tooth segmentation.

Distance regression Image Augmentation +6

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.