no code implementations • 13 Feb 2025 • Jonathan Roberts, Mohammad Reza Taesiri, Ansh Sharma, Akash Gupta, Samuel Roberts, Ioana Croitoru, Simion-Vlad Bogolin, Jialu Tang, Florian Langer, Vyas Raina, Vatsal Raina, Hanyi Xiong, Vishaal Udandarao, Jingyi Lu, Shiyang Chen, Sam Purkis, Tianshuo Yan, Wenye Lin, Gyungin Shin, Qiaochu Yang, Anh Totti Nguyen, David I. Atkinson, Aaditya Baranwal, Alexandru Coca, Mikah Dang, Sebastian Dziadzio, Jakob D. Kunz, Kaiqu Liang, Alexander Lo, Brian Pulfer, Steven Walton, Charig Yang, Kai Han, Samuel Albanie
Large Multimodal Models (LMMs) exhibit major shortfalls when interpreting images and, by some measures, have poorer spatial cognition than small children or animals.
no code implementations • 27 Sep 2024 • Dylan Li, Gyungin Shin
Unsupervised instance segmentation aims to segment distinct object instances in an image without relying on human-labeled data.
1 code implementation • 1 Aug 2024 • Ragav Sachdeva, Gyungin Shin, Andrew Zisserman
Enabling engagement of manga by visually impaired individuals presents a significant challenge due to its inherently visual nature.
1 code implementation • 13 Jun 2023 • Gyungin Shin, Weidi Xie, Samuel Albanie
In this paper, we propose to meet this challenge through the novel task of automatic table verification (AutoTV), in which the objective is to verify the accuracy of numerical data in tables by cross-referencing cited sources.
1 code implementation • 27 Apr 2023 • Gyungin Shin, Samuel Albanie, Weidi Xie
Segmentation is a core computer vision competency, with applications spanning a broad range of scientifically and economically valuable domains.
1 code implementation • 22 Sep 2022 • Gyungin Shin, Weidi Xie, Samuel Albanie
Our method, termed NamedMask, begins by using CLIP to construct category-specific archives of images.
2 code implementations • 14 Jun 2022 • Gyungin Shin, Weidi Xie, Samuel Albanie
Semantic segmentation has a broad range of applications, but its real-world impact has been significantly limited by the prohibitive annotation costs necessary to enable deployment.
1 code implementation • 23 Mar 2022 • Gyungin Shin, Samuel Albanie, Weidi Xie
In this paper, we tackle the challenging task of unsupervised salient object detection (SOD) by leveraging spectral clustering on self-supervised features.
Ranked #1 on
Unsupervised Saliency Detection
on ECSSD
2 code implementations • 13 Apr 2021 • Gyungin Shin, Weidi Xie, Samuel Albanie
A central challenge for the task of semantic segmentation is the prohibitive cost of obtaining dense pixel-level annotations to supervise model training.