TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Sentiment Analysis	MR	TM-Glove	Accuracy	77.51	# 13
Text Classification	R52	TM-Glove	Accuracy	89.14	# 8
Text Classification	R8	TM-Glove	Accuracy	97.50	# 11
Text Classification	TREC-6	TM-Glove	Error	9.96	# 19

Enhancing Interpretable Clauses Semantically using Pretrained Word Representation

EMNLP (BlackboxNLP) 2021 · Rohan Kumar Yadav, Lei Jiao, Ole-Christoffer Granmo, Morten Goodwin

The Tsetlin Machine (TM) is an interpretable pattern recognition algorithm based on propositional logic that has demonstrated competitive performance in many Natural Language Processing (NLP) tasks, including sentiment analysis, text classification, and Word Sense Disambiguation. To obtain human-level interpretability, the legacy TM employs Boolean input features such as bag-of-words (BOW). However, the BOW representation makes it difficult to use pre-trained information such as word2vec and GloVe word representations. This restriction has constrained the performance of TM compared to deep neural networks (DNNs) in NLP. To reduce the performance gap, we propose a novel way of using pre-trained word representations for TM: semantically related words are extracted from the pre-trained word representations and used as input features to the TM. The approach significantly enhances the performance and interpretability of TM. Our experiments show that the accuracy of the proposed approach is significantly higher than that of the previous BOW-based TM, reaching the level of DNN-based models.
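
The idea of feeding semantically related words to the TM as Boolean features can be illustrated with a small sketch. It is not the authors' implementation: the three-dimensional vectors, the vocabulary, the helper names (`related_words`, `boolean_features`), and the `k` and `threshold` parameters are all hypothetical stand-ins, chosen only to show how a bag-of-words vector might be enriched with the nearest neighbors of each input word under cosine similarity.

```python
import math

# Toy vectors standing in for pre-trained embeddings. The values are
# hypothetical and are NOT taken from GloVe or word2vec.
EMBEDDINGS = {
    "good":  [0.9, 0.1, 0.0],
    "great": [0.8, 0.2, 0.1],
    "nice":  [0.7, 0.3, 0.0],
    "bad":   [-0.8, 0.1, 0.2],
    "awful": [-0.9, 0.2, 0.1],
    "movie": [0.0, 0.9, 0.3],
    "film":  [0.1, 0.8, 0.4],
}
VOCAB = sorted(EMBEDDINGS)  # fixed feature order for the Boolean vector


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def related_words(word, k=1, threshold=0.9):
    """Return up to k vocabulary words most similar to `word`.

    Only neighbors with cosine similarity >= threshold are kept
    (both k and threshold are assumed values for this sketch).
    """
    if word not in EMBEDDINGS:
        return []
    scored = [
        (cosine(EMBEDDINGS[word], EMBEDDINGS[other]), other)
        for other in VOCAB
        if other != word
    ]
    scored.sort(reverse=True)
    return [other for score, other in scored[:k] if score >= threshold]


def boolean_features(tokens, k=1, threshold=0.9):
    """Build a Boolean vector over VOCAB from the tokens and their neighbors."""
    active = set()
    for token in tokens:
        if token in EMBEDDINGS:
            active.add(token)
            active.update(related_words(token, k, threshold))
    return [1 if word in active else 0 for word in VOCAB]


if __name__ == "__main__":
    # VOCAB order: awful, bad, film, good, great, movie, nice
    print(boolean_features(["good", "movie"]))  # → [0, 0, 1, 1, 1, 1, 0]
```

A plain BOW encoding of the same two tokens would set only the `good` and `movie` positions. Here `great` and `film` are also switched on, so the input stays Boolean while carrying some of the similarity structure of the embedding space.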

Code

cair/TsetlinMachine

449

cair/pyTsetlinMachine

121

cair/PyTsetlinMachineCUDA

cair/pyTsetlinMachineParallel

cair/pyTsetlinMachineMT

Tasks

Sentiment Analysis

Text Classification

Word Sense Disambiguation

Datasets

Reuters-21578

Results from the Paper

Ranked #8 on Text Classification on R52

Methods

GloVe
