no code implementations • EACL (WASSA) 2021 • Tommaso Fornaciari, Federico Bianchi, Debora Nozza, Dirk Hovy
The paper describes the MilaNLP team’s submission (Bocconi University, Milan) in the WASSA 2021 Shared Task on Empathy Detection and Emotion Classification.
1 code implementation • NAACL (WOAH) 2022 • Debora Nozza, Federico Bianchi, Giuseppe Attanasio
Online hate speech is a dangerous phenomenon that can (and should) be promptly counteracted properly.
1 code implementation • SemEval (NAACL) 2022 • Giuseppe Attanasio, Debora Nozza, Federico Bianchi
In this paper, we describe the system proposed by the MilaNLP team for the Multimedia Automatic Misogyny Identification (MAMI) challenge.
no code implementations • WASSA (ACL) 2022 • Federico Bianchi, Debora Nozza, Dirk Hovy
Detecting emotion in text allows social and computational scientists to study how people behave and react to online events.
no code implementations • ECNLP (ACL) 2022 • Patrick John Chia, Jacopo Tagliabue, Federico Bianchi, Ciro Greco, Diogo Goncalves
Product discovery is a crucial component for online shopping.
1 code implementation • LTEDI (ACL) 2022 • Debora Nozza, Federico Bianchi, Anne Lauscher, Dirk Hovy
Current language technology is ubiquitous and directly influences individuals’ lives worldwide.
no code implementations • BigScience (ACL) 2022 • Debora Nozza, Federico Bianchi, Dirk Hovy
We hope to open a discussion on the best methodologies to handle social bias testing in language models.
no code implementations • EACL (WASSA) 2021 • Sotiris Lamprinidis, Federico Bianchi, Daniel Hardt, Dirk Hovy
While emotions are universal aspects of human psychology, they are expressed differently across different languages and cultures.
1 code implementation • EACL (WASSA) 2021 • Federico Bianchi, Debora Nozza, Dirk Hovy
While sentiment analysis is a popular task to understand people’s reactions online, we often need more nuanced information: is the post negative because the user is angry or sad?
1 code implementation • 18 Dec 2022 • Rishav Hada, Amir Ebrahimi Fard, Sarah Shugars, Federico Bianchi, Patricia Rossini, Dirk Hovy, Rebekah Tromble, Nava Tintarev
We find that the diversity scores for both Fragmentation and Representation are lower for immigration than for DST.
1 code implementation • Scientific Reports 2022 • Patrick John Chia, Giuseppe Attanasio, Federico Bianchi, Silvia Terragni, Ana Rita Magalhães, Diogo Goncalves, Ciro Greco, Jacopo Tagliabue
The steady rise of online shopping goes hand in hand with the development of increasingly complex ML and NLP models.
1 code implementation • 8 Nov 2022 • Anne Lauscher, Federico Bianchi, Samuel Bowman, Dirk Hovy
Our results show that PLMs do encode these sociodemographics, and that this knowledge is sometimes spread across the layers of some of the tested PLMs.
no code implementations • 7 Nov 2022 • Federico Bianchi, Pratyusha Kalluri, Esin Durmus, Faisal Ladhak, Myra Cheng, Debora Nozza, Tatsunori Hashimoto, Dan Jurafsky, James Zou, Aylin Caliskan
Machine learning models are now able to convert user-written text descriptions into naturalistic images.
no code implementations • 28 Oct 2022 • Federico Bianchi, Stefanie Anja Hills, Patricia Rossini, Dirk Hovy, Rebekah Tromble, Nava Tintarev
Well-annotated data is a prerequisite for good Natural Language Processing models.
1 code implementation • 26 Oct 2022 • Tommaso Fornaciari, Dirk Hovy, Federico Bianchi
The most common ways to explore latent document dimensions are topic models and clustering methods.
1 code implementation • 20 Oct 2022 • Paul Röttger, Debora Nozza, Federico Bianchi, Dirk Hovy
More data is needed, but annotating hateful content is expensive, time-consuming and potentially harmful to annotators.
no code implementations • 13 Oct 2022 • Giuseppe Attanasio, Debora Nozza, Federico Bianchi, Dirk Hovy
Language is constantly changing and evolving, leaving language models to quickly become outdated, both factually and linguistically.
no code implementations • 4 Oct 2022 • Mert Yuksekgonul, Federico Bianchi, Pratyusha Kalluri, Dan Jurafsky, James Zou
ARO consists of Visual Genome Attribution, to test the understanding of objects' properties; Visual Genome Relation, to test for relational understanding; and COCO & Flickr30k-Order, to test for order sensitivity.
no code implementations • 10 Aug 2022 • Federico Bianchi, Stefano Speziali, Andrea Marini, Massimiliano Proietti, Lorenzo Menculini, Alberto Garinei, Gabriele Bellani, Marcello Marconi
In this work, we describe in detail how Deep Learning and Computer Vision can help to detect fault events of the AirTender system, an aftermarket motorcycle damping system component.
1 code implementation • 12 Jul 2022 • Jacopo Tagliabue, Federico Bianchi, Tobias Schnabel, Giuseppe Attanasio, Ciro Greco, Gabriel de Souza P. Moreira, Patrick John Chia
Much of the complexity of Recommender Systems (RSs) comes from the fact that they are used as part of more complex applications and affect user experience through a varied range of user interfaces.
1 code implementation • 8 Apr 2022 • Patrick John Chia, Giuseppe Attanasio, Federico Bianchi, Silvia Terragni, Ana Rita Magalhães, Diogo Goncalves, Ciro Greco, Jacopo Tagliabue
The steady rise of online shopping goes hand in hand with the development of increasingly complex ML and NLP models.
no code implementations • 5 Apr 2022 • Patrick John Chia, Jacopo Tagliabue, Federico Bianchi, Ciro Greco, Diogo Goncalves
Product discovery is a crucial component for online shopping.
1 code implementation • 26 Jan 2022 • Federico Bianchi, Vincenzo Cutrona, Dirk Hovy
Twitter data have become essential to Natural Language Processing (NLP) and social science research, driving various scientific discoveries in recent years.
2 code implementations • 18 Nov 2021 • Patrick John Chia, Jacopo Tagliabue, Federico Bianchi, Chloe He, Brian Ko
As with most Machine Learning systems, recommender systems are typically evaluated through performance metrics computed over held-out data points.
1 code implementation • nlppower (ACL) 2022 • Federico Bianchi, Debora Nozza, Dirk Hovy
We introduce language invariant properties: i. e., properties that should not change when we transform text, and how they can be used to quantitatively evaluate the robustness of transformation algorithms.
1 code implementation • EMNLP 2021 • Federico Bianchi, Marco Marelli, Paolo Nicoli, Matteo Palmonari
Understanding differences of viewpoints across corpora is a fundamental task for computational social sciences.
1 code implementation • 19 Aug 2021 • Federico Bianchi, Giuseppe Attanasio, Raphael Pisoni, Silvia Terragni, Gabriele Sarti, Sri Lakshmi
CLIP (Contrastive Language-Image Pre-training) is a very recent multi-modal model that jointly learns representations of images and texts.
1 code implementation • NAACL 2021 • Debora Nozza, Federico Bianchi, Dirk Hovy
Our results show that 4. 3{\%} of the time, language models complete a sentence with a hurtful word.
Ranked #1 on
Hurtful Sentence Completion
on HONEST
1 code implementation • NAACL 2021 • Federico Bianchi, Jacopo Tagliabue, Bingqing Yu
We present Query2Prod2Vec, a model that grounds lexical representations for product search in product embeddings: in our model, meaning is a mapping between words and a latent space of products in a digital shop.
1 code implementation • Cognitive Science 2021 • Giovanni Cassani, Federico Bianchi, Marco Marelli
In this study, we use temporally aligned word embeddings and a large diachronic corpus of English to quantify language change in a data-driven, scalable way, which is grounded in language use.
3 code implementations • 19 Apr 2021 • Jacopo Tagliabue, Ciro Greco, Jean-Francis Roy, Bingqing Yu, Patrick John Chia, Federico Bianchi, Giovanni Cassani
The 2021 SIGIR workshop on eCommerce is hosting the Coveo Data Challenge for "In-session prediction for purchase intent and recommendations".
no code implementations • NAACL 2021 • Federico Bianchi, Ciro Greco, Jacopo Tagliabue
We investigate grounded language learning through real-world data, by modelling a teacher-learner dynamics through the natural interactions occurring between users and search engines; in particular, we explore the emergence of semantic generalization from unsupervised dense representations outside of synthetic environments.
1 code implementation • 2 Apr 2021 • Federico Bianchi, Jacopo Tagliabue, Bingqing Yu
We present Query2Prod2Vec, a model that grounds lexical representations for product search in product embeddings: in our model, meaning is a mapping between words and a latent space of products in a digital shop.
no code implementations • EACL 2021 • Tommaso Fornaciari, Federico Bianchi, Massimo Poesio, Dirk Hovy
In most cases, however, the target texts{'} preceding context is not considered.
1 code implementation • ACL (ECNLP) 2021 • Federico Bianchi, Bingqing Yu, Jacopo Tagliabue
Word embeddings (e. g., word2vec) have been applied successfully to eCommerce products through~\textit{prod2vec}.
1 code implementation • International Semantic Web Conference (ISWC) 2020 • Vincenzo Cutrona, Federico Bianchi, Ernesto Jimenez-Ruiz, Matteo Palmonari
Table annotation is a key task to improve querying the Web and support the Knowledge Graph population from legacy sources (tables).
no code implementations • 20 Jul 2020 • Federico Bianchi, Jacopo Tagliabue, Bingqing Yu, Luca Bigon, Ciro Greco
This paper addresses the challenge of leveraging multiple embedding spaces for multi-shop personalization, proving that zero-shot inference is possible by transferring shopping intent from one website to another without manual intervention.
no code implementations • ACL 2020 • Dirk Hovy, Federico Bianchi, Tommaso Fornaciari
The main goal of machine translation has been to convey the correct content.
no code implementations • 30 Apr 2020 • Federico Bianchi, Gaetano Rossiello, Luca Costabello, Matteo Palmonari, Pasquale Minervini
Knowledge graph embeddings are now a widely adopted approach to knowledge representation in which entities and relationships are embedded in vector spaces.
2 code implementations • EACL 2021 • Federico Bianchi, Silvia Terragni, Dirk Hovy, Debora Nozza, Elisabetta Fersini
They all cover the same content, but the linguistic differences make it impossible to use traditional, bag-of-word-based topic models.
1 code implementation • 13 Apr 2020 • Federico Bianchi, Valerio Di Carlo, Paolo Nicoli, Matteo Palmonari
In this paper, we present a general framework to support cross-corpora language studies with word embeddings, where embeddings generated from different corpora can be compared to find correspondences and differences in meaning across the corpora.
3 code implementations • ACL 2021 • Federico Bianchi, Silvia Terragni, Dirk Hovy
Topic models extract groups of words from documents, whose interpretation as a topic hopefully allows for a better understanding of the data.
no code implementations • 11 Mar 2020 • Bingqing Yu, Jacopo Tagliabue, Ciro Greco, Federico Bianchi
We address the problem of personalizing query completion in a digital commerce setting, in which the bounce rate is typically high and recurring users are rare.
no code implementations • 5 Mar 2020 • Debora Nozza, Federico Bianchi, Dirk Hovy
Driven by the potential of BERT models, the NLP community has started to investigate and generate an abundant number of BERT models that are trained on a particular language, and tested on a specific data domain and task.
1 code implementation • 5 Jun 2019 • Valerio Di Carlo, Federico Bianchi, Matteo Palmonari
Temporal word embeddings have been proposed to support the analysis of word meaning shifts during time and to study the evolution of languages.
no code implementations • 11 Apr 2019 • Adriano Macarone Palmieri, Egor Kovlakov, Federico Bianchi, Dmitry Yudin, Stanislav Straupe, Jacob Biamonte, Sergei Kulik
We compared the neural network state reconstruction protocol with a protocol treating SPAM errors by process tomography, as well as to a SPAM-agnostic protocol with idealized measurements.
2 code implementations • 9 Nov 2018 • Monireh Ebrahimi, Md. Kamruzzaman Sarker, Federico Bianchi, Ning Xie, Derek Doran, Pascal Hitzler
Semantic Web knowledge representation standards, and in particular RDF and OWL, often come endowed with a formal semantics which is considered to be of fundamental importance for the field.