Search Results for author: John X. Morris

Found 11 papers, 10 papers with code

Nomic Embed: Training a Reproducible Long Context Text Embedder

1 code implementation2 Feb 2024 Zach Nussbaum, John X. Morris, Brandon Duderstadt, Andriy Mulyar

This technical report describes the training of nomic-embed-text-v1, the first fully reproducible, open-source, open-weights, open-data, 8192 context length English text embedding model that outperforms both OpenAI Ada-002 and OpenAI text-embedding-3-small on short and long-context tasks.

Language Model Inversion

2 code implementations22 Nov 2023 John X. Morris, Wenting Zhao, Justin T. Chiu, Vitaly Shmatikov, Alexander M. Rush

We consider the problem of language model inversion and show that next-token probabilities contain a surprising amount of information about the preceding text.

Language Modelling

Text Embeddings Reveal (Almost) As Much As Text

1 code implementation10 Oct 2023 John X. Morris, Volodymyr Kuleshov, Vitaly Shmatikov, Alexander M. Rush

How much private information do text embeddings reveal about the original text?

Unsupervised Text Deidentification

1 code implementation20 Oct 2022 John X. Morris, Justin T. Chiu, Ramin Zabih, Alexander M. Rush

We propose an unsupervised deidentification method that masks words that leak personally-identifying information.

Named Entity Recognition Named Entity Recognition (NER)

Second-Order NLP Adversarial Examples

1 code implementation5 Oct 2020 John X. Morris

In these methods, a valid adversarial example fools the model being attacked, and is determined to be semantically or syntactically valid by a second model.

Adversarial Attack Semantic Similarity +3

Searching for a Search Method: Benchmarking Search Algorithms for Generating NLP Adversarial Examples

2 code implementations EMNLP (BlackboxNLP) 2020 Jin Yong Yoo, John X. Morris, Eli Lifland, Yanjun Qi

We study the behavior of several black-box search algorithms used for generating adversarial examples for natural language processing (NLP) tasks.

Adversarial Text Benchmarking +1

TextAttack: A Framework for Adversarial Attacks, Data Augmentation, and Adversarial Training in NLP

2 code implementations EMNLP 2020 John X. Morris, Eli Lifland, Jin Yong Yoo, Jake Grigsby, Di Jin, Yanjun Qi

TextAttack also includes data augmentation and adversarial training modules for using components of adversarial attacks to improve model accuracy and robustness.

Adversarial Text Data Augmentation +3

Cannot find the paper you are looking for? You can Submit a new open access paper.