no code implementations • NAACL (SIGTYP) 2022 • Temuulen Khishigsuren, Gábor Bella, Thomas Brochhagen, Daariimaa Marav, Fausto Giunchiglia, Khuyagbaatar Batsuren
Metonymy is regarded by most linguists as a universal cognitive phenomenon, especially since the emergence of the theory of conceptual mappings.
no code implementations • EMNLP 2021 • Yonghao Liu, Renchu Guan, Fausto Giunchiglia, Yanchun Liang, Xiaoyue Feng
Text classification is a fundamental task with broad applications in natural language processing.
no code implementations • GWC 2016 • Abed Alhakim Freihat, Fausto Giunchiglia, Biswanath Dutta
WordNet represents polysemous terms by capturing the different meanings of these terms at the lexical level, but without giving emphasis on the polysemy types such terms belong to.
no code implementations • LREC 2022 • Nandu Chandran Nair, Rajendran S. Velayuthan, Yamini Chandrashekar, Gábor Bella, Fausto Giunchiglia
We introduce the IndoUKC, a new multilingual lexical database comprised of eighteen Indian languages, with a focus on formally capturing words and word meanings specific to Indian languages and cultures.
1 code implementation • Findings (ACL) 2022 • Yang Chi, Fausto Giunchiglia, Daqian Shi, Xiaolei Diao, Chuntao Li, Hao Xu
In addition, powered by the knowledge of radical systems in ZiNet, this paper introduces glyph similarity measurement between ancient Chinese characters, which could capture similar glyph pairs that are potentially related in origins or semantics.
1 code implementation • ACL (SIGMORPHON) 2021 • Khuyagbaatar Batsuren, Gábor Bella, Fausto Giunchiglia
Large-scale morphological databases provide essential input to a wide range of NLP applications.
no code implementations • EACL (DravidianLangTech) 2021 • Nandu Chandran Nair, Maria-chiara Giangregorio, Fausto Giunchiglia
Quality of a product is the degree to which a product meets the customer’s expectation, which must also be valid for the case of lexical semantic resources.
no code implementations • GWC 2019 • Khuyagbaatar Batsuren, Amarsanaa Ganbold, Altangerel Chagnaa, Fausto Giunchiglia
The manual evaluation of the resource estimated its quality at 96.4%.
1 code implementation • 20 Jul 2024 • Yonghao Liu, Mengyu Li, Ximing Li, Lan Huang, Fausto Giunchiglia, Yanchun Liang, Xiaoyue Feng, Renchu Guan
Node classification is an essential problem in graph learning.
no code implementations • 7 Jul 2024 • Daqian Shi, Xiaoyue Li, Fausto Giunchiglia
A common solution to the semantic heterogeneity problem is to perform knowledge graph (KG) extension exploiting the information encoded in one or more candidate KGs, where the alignment between the reference KG and candidate KGs is considered the critical procedure.
no code implementations • 21 May 2024 • Yonghao Liu, Mengyu Li, Di Liang, Ximing Li, Fausto Giunchiglia, Lan Huang, Xiaoyue Feng, Renchu Guan
By incorporating relevant visual information and leveraging linguistic knowledge, our approach bridges the gap between language and vision, leading to improved understanding and inference capabilities in NLI tasks.
no code implementations • 19 May 2024 • Mengyu Li, Yonghao Liu, Fausto Giunchiglia, Xiaoyue Feng, Renchu Guan
Compared with the previous learning paradigm of pre-training and fine-tuning by cross entropy loss, the recently proposed supervised contrastive learning approach has received tremendous attention due to its powerful feature learning capability and robustness.
no code implementations • 2 May 2024 • Gertraud Koch, Gábor Bella, Paula Helm, Fausto Giunchiglia
Language technology is a complex and emerging field that presents challenges for co-design interventions due to enfolding in assemblages of global scale and diverse sites and its knowledge intensity.
no code implementations • 12 Apr 2024 • Maria Kasinidou, Styliani Kleanthous, Matteo Busso, Marcelo Rodas, Jahna Otterbacher, Fausto Giunchiglia
With the surge in data-centric AI and its increasing capabilities, AI applications have become a part of our everyday lives.
no code implementations • 29 Mar 2024 • Abed Alhakim Freihat, Hadi Khalilia, Gábor Bella, Fausto Giunchiglia
High-quality WordNets are crucial for achieving high-quality results in NLP applications that rely on such resources.
no code implementations • 22 Jan 2024 • Fausto Giunchiglia, Mayukh Bagchi, Subhashis Das
Knowledge Organization (KO) and Knowledge Representation (KR) have been the two mainstream methodologies of knowledge modelling in the Information Science community and the Artificial Intelligence community, respectively.
1 code implementation • 25 Dec 2023 • Rui Song, Fausto Giunchiglia, Yingji Li, Mingjie Tian, Hao Xu
However, these methods rely on unlabeled samples provided by the target domains, which renders the model ineffective when the target domain is agnostic.
no code implementations • 12 Dec 2023 • Fausto Giunchiglia, Mayukh Bagchi
Knowledge Representation (KR) and facet-analytical Knowledge Organization (KO) have been the two most prominent methodologies of data and knowledge modelling in the Artificial Intelligence community and the Information Science community, respectively.
no code implementations • 21 Nov 2023 • Mattia Fumagalli, Marco Boffo, Daqian Shi, Mayukh Bagchi, Fausto Giunchiglia
One of the significant barriers to the training of statistical models on knowledge graphs is the difficulty that scientists have in finding the best input data to address their prediction goal.
no code implementations • 24 Aug 2023 • Hadi Khalilia, Gábor Bella, Abed Alhakim Freihat, Shandy Darma, Fausto Giunchiglia
The method is verified through two large-scale case studies on kinship terminology, a domain known to be diverse across languages and cultures: one case study deals with seven Arabic dialects, while the other one with three Indonesian languages.
no code implementations • 26 Jul 2023 • Fausto Giunchiglia, Mayukh Bagchi, Xiaolei Diao
Recent work in Machine Learning and Computer Vision has highlighted the presence of various types of systematic flaws inside ground truth object recognition benchmark datasets.
no code implementations • 25 Jul 2023 • Paula Helm, Gábor Bella, Gertraud Koch, Fausto Giunchiglia
It is well known that AI-based language technology -- large language models, machine translation systems, multilingual dictionaries, and corpora -- is currently limited to 2 to 3 percent of the world's most widely spoken and/or financially and politically best supported languages.
no code implementations • 25 Jul 2023 • Gábor Bella, Paula Helm, Gertraud Koch, Fausto Giunchiglia
It is a well-known fact that current AI-based language technology -- language models, machine translation systems, multilingual dictionaries and corpora -- focuses on the world's 2-3% most widely spoken languages.
no code implementations • 1 Jul 2023 • Rui Song, Fausto Giunchiglia, Yingji Li, Hao Xu
Although large-scale pre-trained language models have achieved striking results in text classification, recent work has raised concerns about the challenge of shortcut learning.
no code implementations • 10 May 2023 • Simone Bocca, Alessio Zamboni, Gabor Bella, Yamini Chandrashekar, Mayukh Bagchi, Gabriel Kuper, Paolo Bouquet, Fausto Giunchiglia
When building a new application we are increasingly confronted with the need of reusing and integrating pre-existing knowledge.
no code implementations • 9 May 2023 • Luca Erculiani, Andrea Bontempelli, Andrea Passerini, Fausto Giunchiglia
We achieve this goal by implementing an algorithm which, for any object, recursively recognizes its visual genus and its visual differentia.
no code implementations • 18 Apr 2023 • Fausto Giunchiglia, Xiaolei Diao, Mayukh Bagchi
Data quality is critical for multimedia tasks, while various types of systematic flaws are found in image benchmark datasets, as discussed in recent work.
no code implementations • 16 Apr 2023 • Daqian Shi, Fausto Giunchiglia
Thus, the entity type (etype) recognition task is proposed to deal with such heterogeneity, aiming to infer the class of entities and etypes by exploiting the information encoded in ontologies.
no code implementations • 27 Feb 2023 • Mattia Fumagalli, Daqian Shi, Fausto Giunchiglia
The main goal of this paper is to evaluate knowledge base schemas, modeled as a set of entity types, each such type being associated with a set of properties, according to their focus.
no code implementations • 22 Jan 2023 • Fausto Giunchiglia, Gabor Bella, Nandu Chandran Nair, Yang Chi, Hao Xu
In today's multilingual lexical databases, the majority of the world's languages are under-represented.
no code implementations • 13 Dec 2022 • Fausto Giunchiglia, Mayukh Bagchi, Xiaolei Diao
We discuss two kinds of semantics relevant to Computer Vision (CV) systems - Visual Semantics and Lexical Semantics.
no code implementations • 28 Sep 2022 • Fausto Giunchiglia, Simone Bocca, Mattia Fumagalli, Mayukh Bagchi, Alessio Zamboni
The intuition is that data will be treated differently based on their popularity: the more a certain set of data have been reused, the more they will be reused and the less they will be changed across reuses, thus decreasing the overall data preprocessing costs, while increasing backward compatibility and future sharing.
no code implementations • 13 Jul 2022 • Mattia Fumagalli, Marco Boffo, Daqian Shi, Mayukh Bagchi, Fausto Giunchiglia
In this paper, we describe the LiveSchema initiative, namely a gateway that offers a family of services to easily access, analyze, transform and exploit knowledge graph schemas, with the main goal of facilitating the reuse of these resources in machine learning use cases.
no code implementations • 3 Jul 2022 • Fausto Giunchiglia, Mayukh Bagchi
Semantic Heterogeneity is conventionally understood as the existence of variance in the representation of a target reality when modelled, by independent parties, in different databases, schemas and/or data.
1 code implementation • NAACL (SIGMORPHON) 2022 • Khuyagbaatar Batsuren, Gábor Bella, Aryaman Arora, Viktor Martinović, Kyle Gorman, Zdeněk Žabokrtský, Amarsanaa Ganbold, Šárka Dohnalová, Magda Ševčíková, Kateřina Pelegrinová, Fausto Giunchiglia, Ryan Cotterell, Ekaterina Vylomova
The SIGMORPHON 2022 shared task on morpheme segmentation challenged systems to decompose a word into a sequence of morphemes and covered most types of morphology: compounds, derivations, and inflections.
Ranked #8 on Morpheme Segmentation on UniMorph 4.0
1 code implementation • 31 May 2022 • Andrea Bontempelli, Stefano Teso, Katya Tentori, Fausto Giunchiglia, Andrea Passerini
We propose ProtoPDebug, an effective concept-level debugger for ProtoPNets in which a human supervisor, guided by the model's explanations, supplies feedback in the form of what part-prototypes must be forgotten or kept, and the model is fine-tuned to align with this supervision.
no code implementations • 10 May 2022 • Andrea Bontempelli, Marcelo Rodas Britez, Xiaoyue Li, Haonan Zhao, Luca Erculiani, Stefano Teso, Andrea Passerini, Fausto Giunchiglia
We focus on the development of AIs which live in lifelong symbiosis with a human.
no code implementations • LREC 2022 • Khuyagbaatar Batsuren, Omer Goldman, Salam Khalifa, Nizar Habash, Witold Kieraś, Gábor Bella, Brian Leonard, Garrett Nicolai, Kyle Gorman, Yustinus Ghanggo Ate, Maria Ryskina, Sabrina J. Mielke, Elena Budianskaya, Charbel El-Khaissi, Tiago Pimentel, Michael Gasser, William Lane, Mohit Raj, Matt Coler, Jaime Rafael Montoya Samame, Delio Siticonatzi Camaiteri, Benoît Sagot, Esaú Zumaeta Rojas, Didier López Francis, Arturo Oncevay, Juan López Bautista, Gema Celeste Silva Villegas, Lucas Torroba Hennigen, Adam Ek, David Guriel, Peter Dirix, Jean-Philippe Bernardy, Andrey Scherbakov, Aziyana Bayyr-ool, Antonios Anastasopoulos, Roberto Zariquiey, Karina Sheifer, Sofya Ganieva, Hilaria Cruz, Ritván Karahóǧa, Stella Markantonatou, George Pavlidis, Matvey Plugaryov, Elena Klyachko, Ali Salehi, Candy Angulo, Jatayu Baxi, Andrew Krizhanovsky, Natalia Krizhanovskaya, Elizabeth Salesky, Clara Vania, Sardana Ivanova, Jennifer White, Rowan Hall Maudslay, Josef Valvoda, Ran Zmigrod, Paula Czarnowska, Irene Nikkarinen, Aelita Salchak, Brijesh Bhatt, Christopher Straughn, Zoey Liu, Jonathan North Washington, Yuval Pinter, Duygu Ataman, Marcin Wolinski, Totok Suhardijanto, Anna Yablonskaya, Niklas Stoehr, Hossep Dolatian, Zahroh Nuriah, Shyam Ratan, Francis M. Tyers, Edoardo M. Ponti, Grant Aiton, Aryaman Arora, Richard J. Hatcher, Ritesh Kumar, Jeremiah Young, Daria Rodionova, Anastasia Yemelina, Taras Andrushko, Igor Marchenko, Polina Mashkovtseva, Alexandra Serova, Emily Prud'hommeaux, Maria Nepomniashchaya, Fausto Giunchiglia, Eleanor Chodroff, Mans Hulden, Miikka Silfverberg, Arya D. McCarthy, David Yarowsky, Ryan Cotterell, Reut Tsarfaty, Ekaterina Vylomova
The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation and a type-level resource of annotated data in diverse languages realizing that schema.
1 code implementation • LREC 2022 • Temuulen Khishigsuren, Gábor Bella, Khuyagbaatar Batsuren, Abed Alhakim Freihat, Nandu Chandran Nair, Amarsanaa Ganbold, Hadi Khalilia, Yamini Chandrashekar, Fausto Giunchiglia
We capture the phenomenon of diversity through the notions of lexical gap and language-specific word and use a systematic method to infer gaps semi-automatically on a large scale.
no code implementations • ACL 2022 • Gábor Bella, Erdenebileg Byambadorj, Yamini Chandrashekar, Khuyagbaatar Batsuren, Danish Ashgar Cheema, Fausto Giunchiglia
The Universal Knowledge Core (UKC) is a large multilingual lexical database with a focus on language diversity and covering over a thousand languages.
no code implementations • 17 Feb 2022 • Fausto Giunchiglia, Mayukh Bagchi, Xiaolei Diao
Recent work in Machine Learning and Computer Vision has provided evidence of systematic design flaws in the development of major object recognition benchmark datasets.
no code implementations • 20 Dec 2021 • Fausto Giunchiglia, Mayukh Bagchi
We base our work on the teleosemantic modelling of concepts as abilities implementing the distinct functions of recognition and classification.
no code implementations • 23 Sep 2021 • Andrea Bontempelli, Fausto Giunchiglia, Andrea Passerini, Stefano Teso
In this paper, we tackle interactive debugging of "gray-box" concept-based models (CBMs).
no code implementations • 18 Aug 2021 • Fausto Giunchiglia, Marcelo Rodas Britez, Andrea Bontempelli, Xiaoyue Li
The representation of the personal context is complex and essential to improve the help machines can give to humans for making sense of the world, and the help humans can give to machines to improve their efficiency.
1 code implementation • NeurIPS 2021 • Stefano Teso, Andrea Bontempelli, Fausto Giunchiglia, Andrea Passerini
We tackle sequential learning under label noise in applications where a human supervisor can be queried to relabel suspicious examples.
no code implementations • 19 May 2021 • Fausto Giunchiglia, Simone Bocca, Mattia Fumagalli, Mayukh Bagchi, Alessio Zamboni
When building a new application we are more and more confronted with the need of reusing and integrating pre-existing knowledge, e.g., ontologies, schemas, data of any kind, from multiple sources.
no code implementations • 19 May 2021 • Fausto Giunchiglia, Mayukh Bagchi
We assume that substances in the world are represented by two types of concepts, namely substance concepts and classification concepts, the former instrumental to (visual) perception, the latter to (language based) classification.
no code implementations • 19 May 2021 • Fausto Giunchiglia, Alessio Zamboni, Mayukh Bagchi, Simone Bocca
We propose a novel approach to the problem of semantic heterogeneity where data are organized into a set of stratified and independent representation layers, namely: conceptual (where a set of unique alinguistic identifiers are connected inside a graph codifying their meaning), language (where sets of synonyms, possibly from multiple languages, annotate concepts), knowledge (in the form of a graph where nodes are entity types and links are properties), and data (in the form of a graph of entities populating the previous knowledge graph).
no code implementations • 26 Apr 2021 • Fausto Giunchiglia, Luca Erculiani, Andrea Passerini
In this paper we provide a theory and an algorithm for how to build substance concepts which are in a one-to-one correspondence with classification concepts, thus paving the way to the seamless integration between natural language descriptions and visual perception.
no code implementations • 12 Apr 2021 • Fausto Giunchiglia, Jahna Otterbacher, Styliani Kleanthous, Khuyagbaatar Batsuren, Veronika Bogin, Tsvi Kuflik, Avital Shulner Tal
As the role of algorithmic systems and processes increases in society, so does the risk of bias, which can result in discrimination against individuals and social groups.
no code implementations • 3 Apr 2021 • Rui Song, Fausto Giunchiglia, Ke Zhao, Hao Xu
The complexity and non-Euclidean structure of graph data hinder the development of data augmentation methods similar to those in computer vision.
1 code implementation • 27 Mar 2021 • Andrea Bontempelli, Fausto Giunchiglia, Andrea Passerini, Stefano Teso
Motivated by this, we introduce TRCKD, a novel approach that combines automated drift detection and adaptation with an interactive stage in which the user is asked to disambiguate between different kinds of KD.
no code implementations • COLING 2020 • Gábor Bella, Linda Gremes, Fausto Giunchiglia
We set out to uncover the unique grammatical properties of an important yet so far under-researched type of natural language text: that of short labels typically found within structured datasets.
no code implementations • 19 Nov 2020 • Qiang Shen, Stefano Teso, Wanyi Zhang, Hao Xu, Fausto Giunchiglia
Second, existing models typically assume that context is objective, whereas in most applications context is best viewed from the user's perspective.
1 code implementation • 2 Nov 2020 • Andrea Bontempelli, Stefano Teso, Fausto Giunchiglia, Andrea Passerini
The ability to learn from human supervision is fundamental for personal assistants and other interactive applications of AI.
no code implementations • LREC 2020 • Gábor Bella, Fiona McNeill, Rody Gorman, Caoimhin O Donnaile, Kirsty MacDonald, Yamini Chandrashekar, Abed Alhakim Freihat, Fausto Giunchiglia
We present a new wordnet resource for Scottish Gaelic, a Celtic minority language spoken by about 60,000 speakers, most of whom live in Northwestern Scotland.
1 code implementation • 6 Dec 2019 • Luca Erculiani, Fausto Giunchiglia, Andrea Passerini
We present a framework capable of tackling the problem of continual object recognition in a setting which resembles that under which humans see and learn.
1 code implementation • ACL 2019 • Khuyagbaatar Batsuren, Gabor Bella, Fausto Giunchiglia
This paper introduces CogNet, a new, large-scale lexical database that provides cognates (words of common origin and meaning) across languages.
no code implementations • SEMEVAL 2017 • Mohammed R. H. Qwaider, Abed Alhakim Freihat, Fausto Giunchiglia
In this paper we present the TrentoTeam system, which participated in Task 3 at SemEval-2017 (Nakov et al., 2017). We concentrated our work on applying Grice Maxims (used in many state-of-the-art machine learning applications (Vogel et al., 2013; Kheirabadi and Aghagolzadeh, 2012; Dale and Reiter, 1995; Franke, 2011)) to ranking the answers to a question by answer relevancy. In particular, we created a ranker system based on relevancy scores assigned by three main components: named entity recognition, similarity score, and sentiment analysis. Our system obtained results comparable to those of machine learning systems.
no code implementations • 22 Nov 2016 • Xixun Lin, Yanchun Liang, Fausto Giunchiglia, Xiaoyue Feng, Renchu Guan
In this paper, we study the problem of how to better embed entities and relations of knowledge bases into different low-dimensional spaces by taking full advantage of the additional semantics of relation paths, and we propose a compositional learning model of relation path embedding (RPE).