Search Results for author: Roy Ganz

Found 14 papers, 7 papers with code

GRAM: Global Reasoning for Multi-Page VQA

no code implementations • 7 Jan 2024 • Tsachi Blau, Sharon Fogel, Roi Ronen, Alona Golts, Roy Ganz, Elad Ben Avraham, Aviad Aberdam, Shahar Tsiper, Ron Litman

The increasing use of transformer-based large language models brings forward the challenge of processing long sequences.

Question Answering, Visual Question Answering

CLIPAG: Towards Generator-Free Text-to-Image Generation

no code implementations • 29 Jun 2023 • Roy Ganz, Michael Elad

Perceptually Aligned Gradients (PAG) refer to an intriguing property observed in robust image classification models, wherein their input gradients align with human perception and pose semantic meanings.

Image Classification, Text-to-Image Generation

FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions

1 code implementation • 28 May 2023 • Noam Rotstein, David Bensaid, Shaked Brody, Roy Ganz, Ron Kimmel

Our proposed method, FuseCap, fuses the outputs of such vision experts with the original captions using a large language model (LLM), yielding comprehensive image descriptions.

Ranked #1 on Image Captioning on COCO Captions (CLIPScore metric)

Attribute, Image Captioning, +5

Classifier Robustness Enhancement Via Test-Time Transformation

1 code implementation • 27 Mar 2023 • Tsachi Blau, Roy Ganz, Chaim Baskin, Michael Elad, Alex Bronstein

We show that the proposed method achieves state-of-the-art results and validate our claim through extensive experiments on a variety of defense methods, classifier architectures, and datasets.

Adversarial Attack

Towards Models that Can See and Read

no code implementations • ICCV 2023 • Roy Ganz, Oren Nuriel, Aviad Aberdam, Yair Kittenplon, Shai Mazor, Ron Litman

Visual Question Answering (VQA) and Image Captioning (CAP), which are among the most popular vision-language tasks, have analogous scene-text versions that require reasoning from the text in the image.

Image Captioning, Question Answering, +1

Enhancing Diffusion-Based Image Synthesis with Robust Classifier Guidance

1 code implementation • 18 Aug 2022 • Bahjat Kawar, Roy Ganz, Michael Elad

In order to obtain class-conditional generation, it was suggested to guide the diffusion process by gradients from a time-dependent classifier.

Denoising, Image Generation

Do Perceptually Aligned Gradients Imply Adversarial Robustness?

1 code implementation • 22 Jul 2022 • Roy Ganz, Bahjat Kawar, Michael Elad

In this work, we focus on this trait and test whether Perceptually Aligned Gradients imply Robustness.

Adversarial Robustness, Image Classification

Threat Model-Agnostic Adversarial Defense using Diffusion Models

1 code implementation • 17 Jul 2022 • Tsachi Blau, Roy Ganz, Bahjat Kawar, Alex Bronstein, Michael Elad

Deep Neural Networks (DNNs) are highly sensitive to imperceptible malicious perturbations, known as adversarial attacks.

Adversarial Defense, Denoising

Improved Image Generation via Sparsity

no code implementations • 29 Sep 2021 • Roy Ganz, Michael Elad

The interest of the deep learning community in image synthesis has grown massively in recent years.

Image Generation

BIGRoC: Boosting Image Generation via a Robust Classifier

1 code implementation • 8 Aug 2021 • Roy Ganz, Michael Elad

The interest of the machine learning community in image synthesis has grown significantly in recent years, with the introduction of a wide range of deep generative models and means for training them.

Image Generation

Improved Image Generation via Sparse Modeling

no code implementations • 1 Apr 2021 • Roy Ganz, Michael Elad

The interest of the deep learning community in image synthesis has grown massively in recent years.

Image Generation
