Search Results for author: Silvia Cascianelli

Found 17 papers, 6 papers with code

VATr++: Choose Your Words Wisely for Handwritten Text Generation

no code implementations16 Feb 2024 Bram Vanherle, Vittorio Pippi, Silvia Cascianelli, Nick Michiels, Frank Van Reeth, Rita Cucchiara

Styled Handwritten Text Generation (HTG) has received significant attention in recent years, propelled by the success of learning-based solutions employing GANs, Transformers, and, preliminarily, Diffusion Models.

Benchmarking Text Generation

HWD: A Novel Evaluation Score for Styled Handwritten Text Generation

1 code implementation31 Oct 2023 Vittorio Pippi, Fabio Quattrini, Silvia Cascianelli, Rita Cucchiara

Through extensive experimental evaluation on different word-level and line-level datasets of handwritten text images, we demonstrate the suitability of the proposed HWD as a score for Styled HTG.

Image Generation Perceptual Distance +1

Volumetric Fast Fourier Convolution for Detecting Ink on the Carbonized Herculaneum Papyri

1 code implementation9 Aug 2023 Fabio Quattrini, Vittorio Pippi, Silvia Cascianelli, Rita Cucchiara

Recent advancements in Digital Document Restoration (DDR) have led to significant breakthroughs in analyzing highly damaged written artifacts.

How to Choose Pretrained Handwriting Recognition Models for Single Writer Fine-Tuning

no code implementations4 May 2023 Vittorio Pippi, Silvia Cascianelli, Christopher Kermorvant, Rita Cucchiara

Recent advancements in Deep Learning-based Handwritten Text Recognition (HTR) have led to models with remarkable performance on both modern and historical manuscripts in large benchmark datasets.

Handwriting Recognition Handwritten Text Recognition +2

Evaluating Synthetic Pre-Training for Handwriting Processing Tasks

no code implementations4 Apr 2023 Vittorio Pippi, Silvia Cascianelli, Lorenzo Baraldi, Rita Cucchiara

In this work, we explore massive pre-training on synthetic word images for enhancing the performance on four benchmark downstream handwriting analysis tasks.

Retrieval

Handwritten Text Generation from Visual Archetypes

1 code implementation CVPR 2023 Vittorio Pippi, Silvia Cascianelli, Rita Cucchiara

Generating synthetic images of handwritten text in a writer-specific style is a challenging task, especially in the case of unseen styles and new words, and even more when these latter contain characters that are rarely encountered during training.

Text Generation

Embodied Agents for Efficient Exploration and Smart Scene Description

no code implementations17 Jan 2023 Roberto Bigazzi, Marcella Cornia, Silvia Cascianelli, Lorenzo Baraldi, Rita Cucchiara

The development of embodied agents that can communicate with humans in natural language has gained increasing interest over the last years, as it facilitates the diffusion of robotic platforms in human-populated environments.

Efficient Exploration Image Captioning +1

Boosting Modern and Historical Handwritten Text Recognition with Deformable Convolutions

no code implementations17 Aug 2022 Silvia Cascianelli, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara

Handwritten Text Recognition (HTR) in free-layout pages is a challenging image understanding task that can provide a relevant boost to the digitization of handwritten documents and reuse of their content.

Handwritten Text Recognition HTR

The LAM Dataset: A Novel Benchmark for Line-Level Handwritten Text Recognition

no code implementations16 Aug 2022 Silvia Cascianelli, Vittorio Pippi, Martin Maarand, Marcella Cornia, Lorenzo Baraldi, Christopher Kermorvant, Rita Cucchiara

With the aim of fostering the research on this topic, in this paper we present the Ludovico Antonio Muratori (LAM) dataset, a large line-level HTR dataset of Italian ancient manuscripts edited by a single author over 60 years.

Handwritten Text Recognition HTR

Embodied Navigation at the Art Gallery

no code implementations19 Apr 2022 Roberto Bigazzi, Federico Landi, Silvia Cascianelli, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara

This feature is challenging for occupancy-based agents which are usually trained in crowded domestic environments with plenty of occupancy information.

Navigate PointGoal Navigation

Spot the Difference: A Novel Task for Embodied Agents in Changing Environments

no code implementations18 Apr 2022 Federico Landi, Roberto Bigazzi, Marcella Cornia, Silvia Cascianelli, Lorenzo Baraldi, Rita Cucchiara

To make a step towards this setting, we propose Spot the Difference: a novel task for Embodied AI where the agent has access to an outdated map of the environment and needs to recover the correct layout in a fixed time budget.

CaMEL: Mean Teacher Learning for Image Captioning

1 code implementation21 Feb 2022 Manuele Barraco, Matteo Stefanini, Marcella Cornia, Silvia Cascianelli, Lorenzo Baraldi, Rita Cucchiara

Describing images in natural language is a fundamental step towards the automatic modeling of connections between the visual and textual modalities.

Image Captioning Knowledge Distillation

Focus on Impact: Indoor Exploration with Intrinsic Motivation

1 code implementation14 Sep 2021 Roberto Bigazzi, Federico Landi, Silvia Cascianelli, Lorenzo Baraldi, Marcella Cornia, Rita Cucchiara

The proposed exploration approach outperforms DRL-based competitors relying on intrinsic rewards and surpasses the agents trained with a dense extrinsic reward computed with the environment layouts.

From Show to Tell: A Survey on Deep Learning-based Image Captioning

no code implementations14 Jul 2021 Matteo Stefanini, Marcella Cornia, Lorenzo Baraldi, Silvia Cascianelli, Giuseppe Fiameni, Rita Cucchiara

Starting from 2015 the task has generally been addressed with pipelines composed of a visual encoder and a language model for text generation.

Image Captioning Language Modelling +1

The Role of the Input in Natural Language Video Description

no code implementations9 Feb 2021 Silvia Cascianelli, Gabriele Costante, Alessandro Devo, Thomas A. Ciarfuglia, Paolo Valigi, Mario L. Fravolini

Natural Language Video Description (NLVD) has recently received strong interest in the Computer Vision, Natural Language Processing (NLP), Multimedia, and Autonomous Robotics communities.

Data Augmentation Video Description

Explore and Explain: Self-supervised Navigation and Recounting

no code implementations14 Jul 2020 Roberto Bigazzi, Federico Landi, Marcella Cornia, Silvia Cascianelli, Lorenzo Baraldi, Rita Cucchiara

In this paper, we devise a novel embodied setting in which an agent needs to explore a previously unknown environment while recounting what it sees during the path.

Navigate

Cannot find the paper you are looking for? You can Submit a new open access paper.