Search Results for author: Fawaz Sammani

Found 6 papers, 5 papers with code

Uni-NLX: Unifying Textual Explanations for Vision and Vision-Language Tasks

2 code implementations17 Aug 2023 Fawaz Sammani, Nikos Deligiannis

In this work, we propose Uni-NLX, a unified framework that consolidates all NLE tasks into a single and compact multi-task model using a unified training objective of text generation.

Question Answering Text Generation +2

Visualizing and Understanding Contrastive Learning

1 code implementation20 Jun 2022 Fawaz Sammani, Boris Joukovsky, Nikos Deligiannis

Contrastive learning has revolutionized the field of computer vision, learning rich representations from unlabeled data, which generalize well to diverse vision tasks.

Contrastive Learning Data Augmentation +2

NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks

1 code implementation CVPR 2022 Fawaz Sammani, Tanmoy Mukherjee, Nikos Deligiannis

Current NLE models explain the decision-making process of a vision or vision-language model (a. k. a., task model), e. g., a VQA model, via a language model (a. k. a., explanation model), e. g., GPT.

Decision Making Explainable artificial intelligence +4

Show, Edit and Tell: A Framework for Editing Image Captions

1 code implementation CVPR 2020 Fawaz Sammani, Luke Melas-Kyriazi

Specifically, our caption-editing model consisting of two sub-modules: (1) EditNet, a language module with an adaptive copy mechanism (Copy-LSTM) and a Selective Copy Memory Attention mechanism (SCMA), and (2) DCNet, an LSTM-based denoising auto-encoder.

Denoising Image Captioning +1

HOW IMPORTANT ARE NETWORK WEIGHTS? TO WHAT EXTENT DO THEY NEED AN UPDATE?

no code implementations ICLR 2020 Fawaz Sammani, Mahmoud Elsayed, Abdelsalam Hamdi

We wish to show that starting from the third epoch, freezing weights which have no informative gradient and are less likely to be changed during training, results in a very slight drop in the overall accuracy (and in sometimes better).

Image Captioning

Look and Modify: Modification Networks for Image Captioning

1 code implementation7 Sep 2019 Fawaz Sammani, Mahmoud Elsayed

Attention-based neural encoder-decoder frameworks have been widely used for image captioning.

Image Captioning object-detection +1

Cannot find the paper you are looking for? You can Submit a new open access paper.