Search Results for author: Dohwan Ko

Found 4 papers, 4 papers with code

Large Language Models are Temporal and Causal Reasoners for Video Question Answering

1 code implementation • 24 Oct 2023 • Dohwan Ko, Ji Soo Lee, Wooyoung Kang, Byungseok Roh, Hyunwoo J. Kim

We observe that LLMs provide effective priors by exploiting linguistic shortcuts for temporal and causal reasoning in Video Question Answering (VideoQA).

Tasks: Natural Language Understanding, Question Answering, +2
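To make the "linguistic shortcuts" observation concrete, here is a minimal, hypothetical question-only probe (not the paper's code): if an LLM answers multiple-choice VideoQA questions above chance without seeing any video, its answers are driven by linguistic priors in the question text alone. The model choice and prompt format below are illustrative assumptions.

```python
# Hypothetical question-only probe: answer a VideoQA question with no
# video input, to expose linguistic shortcuts. Model is a small stand-in;
# the paper studies much larger LLMs.
from transformers import pipeline

qa = pipeline("text-generation", model="gpt2")  # illustrative stand-in LLM

question = (
    "Why did the boy pick up the ball?\n"
    "(a) to throw it  (b) to eat it  (c) to paint it"
)
prompt = f"Answer the question without any video context.\n{question}\nAnswer:"

# If accuracy over a whole dataset stays well above chance, the model is
# exploiting priors in the question text rather than visual evidence.
print(qa(prompt, max_new_tokens=5)[0]["generated_text"])
```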

MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models

1 code implementation • CVPR 2023 • Dohwan Ko, Joonmyung Choi, Hyeong Kyu Choi, Kyoung-Woon On, Byungseok Roh, Hyunwoo J. Kim

We propose MEta Loss TRansformer (MELTR), a plug-in module that automatically and non-linearly combines various loss functions to aid learning of the target task via auxiliary learning.

Tasks: Auxiliary Learning, Multimodal Sentiment Analysis, +10
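The sketch below is a toy illustration (not the released MELTR code) of the core idea: treat each auxiliary loss value as a token and let a small Transformer produce a single, non-linearly combined training objective. All dimensions are illustrative, and MELTR's bi-level meta-optimization of the combiner itself is omitted.

```python
import torch
import torch.nn as nn

class ToyLossCombiner(nn.Module):
    """Toy stand-in for MELTR: non-linearly combine K auxiliary loss
    values into one scalar objective by treating them as tokens."""
    def __init__(self, dim: int = 32, nhead: int = 4):
        super().__init__()
        self.embed = nn.Linear(1, dim)  # embed each scalar loss as a token
        layer = nn.TransformerEncoderLayer(d_model=dim, nhead=nhead,
                                           batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=1)
        self.head = nn.Linear(dim, 1)   # map tokens back to scalars

    def forward(self, losses: torch.Tensor) -> torch.Tensor:
        # losses: (K,) vector of per-objective loss values
        x = self.embed(losses.unsqueeze(-1)).unsqueeze(0)  # (1, K, dim)
        return self.head(self.encoder(x)).sum()            # combined scalar

# Dummy auxiliary losses standing in for real pretraining objectives.
loss_a = torch.tensor(0.7, requires_grad=True)
loss_b = torch.tensor(1.3, requires_grad=True)
loss_c = torch.tensor(0.4, requires_grad=True)

combined = ToyLossCombiner()(torch.stack([loss_a, loss_b, loss_c]))
combined.backward()  # gradients flow to every auxiliary loss
print(loss_a.grad, loss_b.grad, loss_c.grad)
```

Because the combination is learned rather than a fixed weighted sum, the relative contribution of each auxiliary objective can vary with the current loss values.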

Video-Text Representation Learning via Differentiable Weak Temporal Alignment

1 code implementation • CVPR 2022 • Dohwan Ko, Joonmyung Choi, Juyeon Ko, Shinyeong Noh, Kyoung-Woon On, Eun-Sol Kim, Hyunwoo J. Kim

In this paper, we propose VT-TWINS (Video-Text Temporally Weak Alignment-based Contrastive Learning), a novel multi-modal self-supervised framework that captures significant information from noisy and weakly correlated video-text data using a variant of Dynamic Time Warping (DTW).

Tasks: Contrastive Learning, Dynamic Time Warping, +1
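As a rough illustration of a differentiable DTW, the sketch below implements the standard soft-DTW relaxation (Cuturi & Blondel, 2017), where the hard minimum in the DTW recursion is replaced by a smooth log-sum-exp. VT-TWINS builds its weak-alignment variant on ideas like this; the feature shapes and cosine-distance cost here are assumptions for illustration, not the paper's formulation.

```python
import torch
import torch.nn.functional as F

def soft_min(values, gamma):
    # Smooth minimum: -gamma * log(sum(exp(-v / gamma))); as gamma -> 0
    # this approaches the hard min used in classic DTW.
    v = torch.stack(values)
    return -gamma * torch.logsumexp(-v / gamma, dim=0)

def soft_dtw(cost, gamma=0.1):
    """Vanilla soft-DTW over an (n, m) pairwise cost matrix."""
    n, m = cost.shape
    inf = torch.tensor(float("inf"))
    R = [[inf] * (m + 1) for _ in range(n + 1)]  # DP table of alignment costs
    R[0][0] = torch.tensor(0.0)
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            R[i][j] = cost[i - 1, j - 1] + soft_min(
                [R[i - 1][j - 1], R[i - 1][j], R[i][j - 1]], gamma)
    return R[n][m]

# Illustrative clip/sentence features; shapes are assumptions.
video = torch.randn(8, 64, requires_grad=True)  # 8 clip embeddings
text = torch.randn(5, 64)                       # 5 sentence embeddings
cost = 1 - F.cosine_similarity(video.unsqueeze(1), text.unsqueeze(0), dim=-1)

loss = soft_dtw(cost)
loss.backward()  # smooth relaxation lets gradients reach the encoders
```

Because the recursion is differentiable end to end, an alignment score like this can serve directly as a contrastive objective between video and text encoders.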
