Text classification is the task of assigning a sentence or document an appropriate category. The categories depend on the chosen dataset and can range from topics.
( Image credit: Text Classification Algorithms: A Survey )
|TREND||DATASET||BEST METHOD||PAPER TITLE||PAPER||CODE||COMPARE|
The results show that the text classification models trained under our proposed framework outperform traditional models significantly in terms of fairness, and also slightly in terms of classification performance.
In this paper, we seek to improve the faithfulness of attention-based explanations for text classification.
Each transaction is accompanied by a text description provided by the customer to describe the products being picked up and delivered which can be used to classify the transaction.
To enhance the generalization ability of PanGu-$\alpha$, we collect 1. 1TB high-quality Chinese data from a wide range of domains to pretrain the model.
Ranked #1 on Reading Comprehension (Zero-Shot) on CMRC 2018
CLOZE (MULTI-CHOICES) (FEW-SHOT) CLOZE (MULTI-CHOICES) (ONE-SHOT) CLOZE (MULTI-CHOICES) (ZERO-SHOT) COMMON SENSE REASONING (FEW-SHOT) COMMON SENSE REASONING (ONE-SHOT) COMMON SENSE REASONING (ZERO-SHOT) DIALOGUE GENERATION FEW-SHOT IMAGE CLASSIFICATION NATURAL LANGUAGE INFERENCE NATURAL LANGUAGE INFERENCE (FEW-SHOT) NATURAL LANGUAGE INFERENCE (ONE-SHOT) NATURAL LANGUAGE INFERENCE (ZERO-SHOT) NATURAL LANGUAGE UNDERSTANDING QUESTION ANSWERING READING COMPREHENSION READING COMPREHENSION (FEW-SHOT) READING COMPREHENSION (ONE-SHOT) READING COMPREHENSION (ZERO-SHOT) TEXT CLASSIFICATION TEXT SUMMARIZATION
We show that an informational approach to representing the meaning of a text has offered a way to effectively predict the scientific impact of research papers.
Query categorization is an essential part of query intent understanding in e-commerce search.
This is because the nearest neighbor to the noised input is likely to be the original input.
Our analyses reveal a significant drop in performance when testing neural models on out-of-domain data and non-English languages that may be mitigated using diverse training data.