News Classification
27 papers with code • 3 benchmarks • 11 datasets
Datasets
Latest papers
Improving Black-box Robustness with In-Context Rewriting
Most techniques for improving OOD robustness are not applicable to settings where the model is effectively a black box, such as when the weights are frozen, retraining is costly, or the model is leveraged via an API.
Accuracy of TextFooler black box adversarial attacks on 01 loss sign activation neural network ensemble
We ask the following question in this study: are 01 loss sign activation neural networks hard to deceive with a popular black box text adversarial attack program called TextFooler?
InterpretCC: Conditional Computation for Inherently Interpretable Neural Networks
Real-world interpretability for neural networks is a tradeoff between three concerns: 1) it requires humans to trust the explanation approximation (e. g. post-hoc approaches), 2) it compromises the understandability of the explanation (e. g. automatically identified feature masks), and 3) it compromises the model performance (e. g. decision trees).
Benchmarking Multilabel Topic Classification in the Kyrgyz Language
Kyrgyz is a very underrepresented language in terms of modern natural language processing resources.
A Dataset and Strong Baselines for Classification of Czech News Texts
Pre-trained models for Czech Natural Language Processing are often evaluated on purely linguistic tasks (POS tagging, parsing, NER) and relatively simple classification tasks such as sentiment classification or article classification from a single news source.
MasakhaNEWS: News Topic Classification for African languages
Furthermore, we explore several alternatives to full fine-tuning of language models that are better suited for zero-shot and few-shot learning such as cross-lingual parameter-efficient fine-tuning (like MAD-X), pattern exploiting training (PET), prompting language models (like ChatGPT), and prompt-free sentence transformer fine-tuning (SetFit and Cohere Embedding API).
Classifying the Ideological Orientation of User-Submitted Texts in Social Media
With the long-term goal of understanding how language is used and evolves within online communities, this work explores the application of natural language processing techniques to classify text articles according to their ideological orientation (i. e., conservative or liberal).
Wild-Time: A Benchmark of in-the-Wild Distribution Shift over Time
Temporal shifts -- distribution shifts arising from the passage of time -- often occur gradually and have the additional structure of timestamp metadata.
Multiverse: Multilingual Evidence for Fake News Detection
In this work, we propose Multiverse -- a new feature based on multilingual evidence that can be used for fake news detection and improve existing approaches.
Astock: A New Dataset and Automated Stock Trading based on Stock-specific News Analyzing Model
In addition, we propose a self-supervised learning strategy based on SRLP to enhance the out-of-distribution generalization performance of our system.