no code implementations • 14 Jul 2023 • Zixin Guo, Tzu-Jui Julius Wang, Selen Pehlivan, Abduljalil Radman, Jorma Laaksonen
To further reduce the amount of supervision, we propose Prompts-in-The-Loop (PiTL), which elicits knowledge from large language models (LLMs) to describe images.
no code implementations • 18 Aug 2020 • Tzu-Jui Julius Wang, Selen Pehlivan, Jorma Laaksonen
Recent scene graph generation (SGG) models have shown their capability of capturing the most frequent relations among visual entities.