Search Results for author: Oren Nuriel

Found 7 papers, 2 papers with code

Enhancing Vision-Language Pre-training with Rich Supervisions

no code implementations • 5 Mar 2024 • Yuan Gao, Kunyu Shi, Pengkai Zhu, Edouard Belval, Oren Nuriel, Srikar Appalaraju, Shabnam Ghadar, Vijay Mahadevan, Zhuowen Tu, Stefano Soatto

We propose Strongly Supervised pre-training with ScreenShots (S4) - a novel pre-training paradigm for Vision-Language Models using data from large-scale web screenshot rendering.

Table Detection

Question Aware Vision Transformer for Multimodal Reasoning

no code implementations • 8 Feb 2024 • Roy Ganz, Yair Kittenplon, Aviad Aberdam, Elad Ben Avraham, Oren Nuriel, Shai Mazor, Ron Litman

This integration results in dynamic visual features that focus on the image aspects relevant to the posed question.

Language Modelling Large Language Model +1

CLIPTER: Looking at the Bigger Picture in Scene Text Recognition

no code implementations • ICCV 2023 • Aviad Aberdam, David Bensaïd, Alona Golts, Roy Ganz, Oren Nuriel, Royee Tichauer, Shai Mazor, Ron Litman

Reading text in real-world scenarios often requires understanding the context surrounding it, especially when dealing with poor-quality text.

Language Modelling Scene Text Recognition

Towards Models that Can See and Read

no code implementations • ICCV 2023 • Roy Ganz, Oren Nuriel, Aviad Aberdam, Yair Kittenplon, Shai Mazor, Ron Litman

Visual Question Answering (VQA) and Image Captioning (CAP), which are among the most popular vision-language tasks, have analogous scene-text versions that require reasoning from the text in the image.

Image Captioning Question Answering +1

Out-of-Vocabulary Challenge Report

no code implementations • 14 Sep 2022 • Sergi Garcia-Bordils, Andrés Mafla, Ali Furkan Biten, Oren Nuriel, Aviad Aberdam, Shai Mazor, Ron Litman, Dimosthenis Karatzas

This paper presents the final results of the Out-Of-Vocabulary 2022 (OOV) challenge.

Optical Character Recognition (OCR) +1

TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers

1 code implementation • 9 May 2021 • Oren Nuriel, Sharon Fogel, Ron Litman

However, in some cases, their decisions are based on unintended information, leading to high performance on standard benchmarks but also to a lack of generalization to challenging testing conditions and unintuitive failures.

Handwritten Text Recognition Scene Text Recognition

Permuted AdaIN: Reducing the Bias Towards Global Statistics in Image Classification

1 code implementation • CVPR 2021 • Oren Nuriel, Sagie Benaim, Lior Wolf

In the setting of robustness, our method improves on both ImageNet-C and CIFAR-100-C for multiple architectures.

Ranked #40 on Synthetic-to-Real Translation on GTAV-to-Cityscapes Labels

Domain Generalization General Classification +4

