no code implementations • 16 Feb 2024 • Bram Vanherle, Vittorio Pippi, Silvia Cascianelli, Nick Michiels, Frank Van Reeth, Rita Cucchiara
Styled Handwritten Text Generation (HTG) has received significant attention in recent years, propelled by the success of learning-based solutions employing GANs, Transformers, and, preliminarily, Diffusion Models.
1 code implementation • 31 Oct 2023 • Vittorio Pippi, Fabio Quattrini, Silvia Cascianelli, Rita Cucchiara
Through extensive experimental evaluation on different word-level and line-level datasets of handwritten text images, we demonstrate the suitability of the proposed HWD as a score for Styled HTG.
1 code implementation • 9 Aug 2023 • Fabio Quattrini, Vittorio Pippi, Silvia Cascianelli, Rita Cucchiara
Recent advancements in Digital Document Restoration (DDR) have led to significant breakthroughs in analyzing highly damaged written artifacts.
no code implementations • 4 May 2023 • Vittorio Pippi, Silvia Cascianelli, Christopher Kermorvant, Rita Cucchiara
Recent advancements in Deep Learning-based Handwritten Text Recognition (HTR) have led to models with remarkable performance on both modern and historical manuscripts in large benchmark datasets.
no code implementations • 4 Apr 2023 • Vittorio Pippi, Silvia Cascianelli, Lorenzo Baraldi, Rita Cucchiara
In this work, we explore massive pre-training on synthetic word images for enhancing the performance on four benchmark downstream handwriting analysis tasks.
1 code implementation • CVPR 2023 • Vittorio Pippi, Silvia Cascianelli, Rita Cucchiara
Generating synthetic images of handwritten text in a writer-specific style is a challenging task, especially in the case of unseen styles and new words, and even more when these latter contain characters that are rarely encountered during training.
no code implementations • 17 Jan 2023 • Roberto Bigazzi, Marcella Cornia, Silvia Cascianelli, Lorenzo Baraldi, Rita Cucchiara
The development of embodied agents that can communicate with humans in natural language has gained increasing interest over the last years, as it facilitates the diffusion of robotic platforms in human-populated environments.
no code implementations • 17 Aug 2022 • Silvia Cascianelli, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara
Handwritten Text Recognition (HTR) in free-layout pages is a challenging image understanding task that can provide a relevant boost to the digitization of handwritten documents and reuse of their content.
no code implementations • 16 Aug 2022 • Silvia Cascianelli, Vittorio Pippi, Martin Maarand, Marcella Cornia, Lorenzo Baraldi, Christopher Kermorvant, Rita Cucchiara
With the aim of fostering the research on this topic, in this paper we present the Ludovico Antonio Muratori (LAM) dataset, a large line-level HTR dataset of Italian ancient manuscripts edited by a single author over 60 years.
no code implementations • 19 Apr 2022 • Roberto Bigazzi, Federico Landi, Silvia Cascianelli, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara
This feature is challenging for occupancy-based agents which are usually trained in crowded domestic environments with plenty of occupancy information.
no code implementations • 18 Apr 2022 • Federico Landi, Roberto Bigazzi, Marcella Cornia, Silvia Cascianelli, Lorenzo Baraldi, Rita Cucchiara
To make a step towards this setting, we propose Spot the Difference: a novel task for Embodied AI where the agent has access to an outdated map of the environment and needs to recover the correct layout in a fixed time budget.
1 code implementation • 21 Feb 2022 • Manuele Barraco, Matteo Stefanini, Marcella Cornia, Silvia Cascianelli, Lorenzo Baraldi, Rita Cucchiara
Describing images in natural language is a fundamental step towards the automatic modeling of connections between the visual and textual modalities.
1 code implementation • 14 Sep 2021 • Roberto Bigazzi, Federico Landi, Silvia Cascianelli, Lorenzo Baraldi, Marcella Cornia, Rita Cucchiara
The proposed exploration approach outperforms DRL-based competitors relying on intrinsic rewards and surpasses the agents trained with a dense extrinsic reward computed with the environment layouts.
no code implementations • 14 Jul 2021 • Matteo Stefanini, Marcella Cornia, Lorenzo Baraldi, Silvia Cascianelli, Giuseppe Fiameni, Rita Cucchiara
Starting from 2015 the task has generally been addressed with pipelines composed of a visual encoder and a language model for text generation.
1 code implementation • 12 May 2021 • Roberto Bigazzi, Federico Landi, Marcella Cornia, Silvia Cascianelli, Lorenzo Baraldi, Rita Cucchiara
In this work, we detail how to transfer the knowledge acquired in simulation into the real world.
no code implementations • 9 Feb 2021 • Silvia Cascianelli, Gabriele Costante, Alessandro Devo, Thomas A. Ciarfuglia, Paolo Valigi, Mario L. Fravolini
Natural Language Video Description (NLVD) has recently received strong interest in the Computer Vision, Natural Language Processing (NLP), Multimedia, and Autonomous Robotics communities.
no code implementations • 14 Jul 2020 • Roberto Bigazzi, Federico Landi, Marcella Cornia, Silvia Cascianelli, Lorenzo Baraldi, Rita Cucchiara
In this paper, we devise a novel embodied setting in which an agent needs to explore a previously unknown environment while recounting what it sees during the path.