Search Results for author: Zixin Guo

Found 3 papers, 1 papers with code

EyeFormer: Predicting Personalized Scanpaths with Transformer-Guided Reinforcement Learning

no code implementations15 Apr 2024 Yue Jiang, Zixin Guo, Hamed Rezazadegan Tavakoli, Luis A. Leiva, Antti Oulasvirta

From a visual perception perspective, modern graphical user interfaces (GUIs) comprise a complex graphics-rich two-dimensional visuospatial arrangement of text, images, and interactive objects such as buttons and menus.

reinforcement-learning

PiTL: Cross-modal Retrieval with Weakly-supervised Vision-language Pre-training via Prompting

no code implementations14 Jul 2023 Zixin Guo, Tzu-Jui Julius Wang, Selen Pehlivan, Abduljalil Radman, Jorma Laaksonen

To further reduce the amount of supervision, we propose Prompts-in-The-Loop (PiTL) that prompts knowledge from large language models (LLMs) to describe images.

Cross-Modal Retrieval Object +1

CLIP4IDC: CLIP for Image Difference Captioning

1 code implementation1 Jun 2022 Zixin Guo, Tzu-Jui Julius Wang, Jorma Laaksonen

Different from directly fine-tuning CLIP to generate sentences, we introduce an adaptation training process to adapt CLIP's visual encoder to capture and align differences in image pairs based on the textual descriptions.

Domain Adaptation Image Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.