Word Sense Induction

19 papers with code • 1 benchmarks • 1 datasets

Word sense induction (WSI) is widely known as the “unsupervised version” of WSD. The problem states as: Given a target word (e.g., “cold”) and a collection of sentences (e.g., “I caught a cold”, “The weather is cold”) that use the word, cluster the sentences according to their different senses/meanings. We do not need to know the sense/meaning of each cluster, but sentences inside a cluster should have used the target words with the same sense.

Description from NLP Progress

Benchmarks

Add a Result

These leaderboards are used to track progress in Word Sense Induction

Trend	Dataset	Best Model	Paper	Code	Compare
	SemEval 2010 WSI	BERT+DP			See all

Datasets

BRWAC

Latest papers with no code

Most implemented Social Latest No code

The LSCD Benchmark: a Testbed for Diachronic Word Meaning Tasks

no code yet • 29 Mar 2024

The repository reflects the task's modularity by allowing model evaluation for WiC, WSI and LSCD.

Paper
Add Code

Word Sense Induction with Knowledge Distillation from BERT

no code yet • 20 Apr 2023

This paper proposes a two-stage method to distill multiple word senses from a pre-trained language model (BERT) by using attention over the senses of a word in a context and transferring this sense information to fit multi-sense embeddings in a skip-gram-like framework.

Paper
Add Code

Word Sense Induction with Hierarchical Clustering and Mutual Information Maximization

no code yet • 11 Oct 2022

In this paper, we propose a novel unsupervised method based on hierarchical clustering and invariant information clustering (IIC).

Paper
Add Code

Towards Automatic Construction of Filipino WordNet: Word Sense Induction and Synset Induction Using Sentence Embeddings

no code yet • 7 Apr 2022

The resulting sense inventory and synonym sets can be used in automatically creating a wordnet.

Paper
Add Code

Topological Data Analysis for Word Sense Disambiguation

no code yet • 1 Mar 2022

We develop and test a novel unsupervised algorithm for word sense induction and disambiguation which uses topological data analysis.

Paper
Add Code

Large Scale Substitution-based Word Sense Induction

no code yet • ACL 2022

We present a word-sense induction method based on pre-trained masked language models (MLMs), which can cheaply scale to large vocabularies and large corpora.

Paper
Add Code

BOS at SemEval-2020 Task 1: Word Sense Induction via Lexical Substitution for Lexical Semantic Change Detection

no code yet • SEMEVAL 2020

The first solution performs word sense induction (WSI) first, then makes the decision based on the induced word senses.

Paper
Add Code

Topology of Word Embeddings: Singularities Reflect Polysemy

no code yet • Joint Conference on Lexical and Computational Semantics 2020

We argue that we should, more accurately, expect them to live on a pinched manifold: a singular quotient of a manifold obtained by identifying some of its points.

Paper
Add Code

Combining Neural Language Models for WordSense Induction

no code yet • 23 Jun 2020

Word sense induction (WSI) is the problem of grouping occurrences of an ambiguous word according to the expressed sense of this word.

Paper
Add Code

A Comparative Study of Lexical Substitution Approaches based on Neural Language Models

no code yet • 29 May 2020

Lexical substitution in context is an extremely powerful technology that can be used as a backbone of various NLP applications, such as word sense induction, lexical relation extraction, data augmentation, etc.

Paper
Add Code

Word Sense Induction

Benchmarks Add a Result

Datasets

Latest papers with no code

Content

Benchmarks

Add a Result