Search Results for author: Jinwoo Ahn

Found 6 papers, 3 papers with code

Fine-Grained Open-Vocabulary Object Recognition via User-Guided Segmentation

no code implementations23 Nov 2024 Jinwoo Ahn, Hyeokjoon Kwon, Hwiyeon Yoo

Recent advent of vision-based foundation models has enabled efficient and high-quality object detection at ease.

Object object-detection +3

Visual Contexts Clarify Ambiguous Expressions: A Benchmark Dataset

1 code implementation21 Nov 2024 Heejeong Nam, Jinwoo Ahn

Our work aims to delve deeper into the ability of models to understand indirect communication and seek to contribute to the development of models capable of more refined and human-like interactions.

Question Answering Visual Grounding +2

Solution for SMART-101 Challenge of CVPR Multi-modal Algorithmic Reasoning Task 2024

no code implementations10 Jun 2024 Jinwoo Ahn, Junhyeok Park, Min-Jun Kim, Kang-Hyeon Kim, So-Yeong Sohn, Yun-Ji Lee, Du-Seong Chang, Yu-Jung Heo, Eun-Sol Kim

Second, due to the nature of puzzle images, which often contain various geometric visual patterns, we utilize an object detection algorithm to ensure these patterns are not overlooked in the captioning process.

Language Modelling object-detection +3

Recursive Chain-of-Feedback Prevents Performance Degradation from Redundant Prompting

no code implementations5 Feb 2024 Jinwoo Ahn, Kyuseung Shin

Our preliminary results show that majority of questions that LLMs fail to respond correctly can be answered using R-CoF without any sample data outlining the logical process.

Compositional Video Understanding with Spatiotemporal Structure-based Transformers

1 code implementation CVPR 2024 Hoyeoung Yun, Jinwoo Ahn, Minseo Kim, Eun-Sol Kim

In this paper we suggest a new novel method to understand complex semantic structures through long video inputs.

Video Understanding

Cannot find the paper you are looking for? You can Submit a new open access paper.