Search Results for author: Avi Schwarzschild

Found 26 papers, 15 papers with code

Rethinking LLM Memorization through the Lens of Adversarial Compression

no code implementations • 23 Apr 2024 • Avi Schwarzschild, Zhili Feng, Pratyush Maini, Zachary C. Lipton, J. Zico Kolter

We outline the limitations of existing notions of memorization and show how the ACR overcomes these challenges by (i) offering an adversarial view to measuring memorization, especially for monitoring unlearning and compliance; and (ii) allowing for the flexibility to measure memorization for arbitrary strings at a reasonably low compute.

Paper
Add Code

Forcing Diffuse Distributions out of Language Models

1 code implementation • 16 Apr 2024 • Yiming Zhang, Avi Schwarzschild, Nicholas Carlini, Zico Kolter, Daphne Ippolito

Despite being trained specifically to follow user instructions, today's language models perform poorly when instructed to produce random outputs.

Language Modelling valid

Paper
Code

Benchmarking ChatGPT on Algorithmic Reasoning

1 code implementation • 4 Apr 2024 • Sean McLeish, Avi Schwarzschild, Tom Goldstein

We evaluate ChatGPT's ability to solve algorithm problems from the CLRS benchmark suite that is designed for GNNs.

Benchmarking

Paper
Code

Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text

1 code implementation • 22 Jan 2024 • Abhimanyu Hans, Avi Schwarzschild, Valeriia Cherepanova, Hamid Kazemi, Aniruddha Saha, Micah Goldblum, Jonas Geiping, Tom Goldstein

Detecting text generated by modern large language models is thought to be hard, as both LLMs and humans can exhibit a wide range of complex behaviors.

156

Paper
Code

TOFU: A Task of Fictitious Unlearning for LLMs

no code implementations • 11 Jan 2024 • Pratyush Maini, Zhili Feng, Avi Schwarzschild, Zachary C. Lipton, J. Zico Kolter

Large language models trained on massive corpora of data from the web can memorize and reproduce sensitive or private data raising both legal and ethical concerns.

Paper
Add Code

Effective Backdoor Mitigation Depends on the Pre-training Objective

no code implementations • 25 Nov 2023 • Sahil Verma, Gantavya Bhatt, Avi Schwarzschild, Soumye Singhal, Arnav Mohanty Das, Chirag Shah, John P Dickerson, Jeff Bilmes

In this work, we demonstrate that the efficacy of CleanCLIP in mitigating backdoors is highly dependent on the particular objective used during model pre-training.

Paper
Add Code

NEFTune: Noisy Embeddings Improve Instruction Finetuning

3 code implementations • 9 Oct 2023 • Neel Jain, Ping-Yeh Chiang, Yuxin Wen, John Kirchenbauer, Hong-Min Chu, Gowthami Somepalli, Brian R. Bartoldson, Bhavya Kailkhura, Avi Schwarzschild, Aniruddha Saha, Micah Goldblum, Jonas Geiping, Tom Goldstein

We show that language model finetuning can be improved, sometimes dramatically, with a simple augmentation.

Language Modelling

5,707

Paper
Code

Baseline Defenses for Adversarial Attacks Against Aligned Language Models

1 code implementation • 1 Sep 2023 • Neel Jain, Avi Schwarzschild, Yuxin Wen, Gowthami Somepalli, John Kirchenbauer, Ping-Yeh Chiang, Micah Goldblum, Aniruddha Saha, Jonas Geiping, Tom Goldstein

We find that the weakness of existing discrete optimizers for text, combined with the relatively high costs of optimization, makes standard adaptive attacks more challenging for LLMs.

Paper
Code

A Cookbook of Self-Supervised Learning

no code implementations • 24 Apr 2023 • Randall Balestriero, Mark Ibrahim, Vlad Sobal, Ari Morcos, Shashank Shekhar, Tom Goldstein, Florian Bordes, Adrien Bardes, Gregoire Mialon, Yuandong Tian, Avi Schwarzschild, Andrew Gordon Wilson, Jonas Geiping, Quentin Garrido, Pierre Fernandez, Amir Bar, Hamed Pirsiavash, Yann Lecun, Micah Goldblum

Self-supervised learning, dubbed the dark matter of intelligence, is a promising path to advance machine learning.

Navigate Self-Supervised Learning

Paper
Add Code

Reckoning with the Disagreement Problem: Explanation Consensus as a Training Objective

no code implementations • 23 Mar 2023 • Avi Schwarzschild, Max Cembalest, Karthik Rao, Keegan Hines, John Dickerson

We observe on three datasets that we can train a model with this loss term to improve explanation consensus on unseen data, and see improved consensus between explainers other than those used in the loss term.

Paper
Add Code

Neural Auctions Compromise Bidder Information

1 code implementation • 28 Feb 2023 • Alex Stein, Avi Schwarzschild, Michael Curry, Tom Goldstein, John Dickerson

It has been shown that neural networks can be used to approximate optimal mechanisms while satisfying the constraints that an auction be strategyproof and individually rational.

Paper
Code

Universal Guidance for Diffusion Models

1 code implementation • 14 Feb 2023 • Arpit Bansal, Hong-Min Chu, Avi Schwarzschild, Soumyadip Sengupta, Micah Goldblum, Jonas Geiping, Tom Goldstein

Typical diffusion models are trained to accept a particular form of conditioning, most commonly text, and cannot be conditioned on other modalities without retraining.

Face Recognition object-detection +1

409

Paper
Code

Transfer Learning with Deep Tabular Models

1 code implementation • 30 Jun 2022 • Roman Levin, Valeriia Cherepanova, Avi Schwarzschild, Arpit Bansal, C. Bayan Bruss, Tom Goldstein, Andrew Gordon Wilson, Micah Goldblum

In this work, we demonstrate that upstream data gives tabular neural networks a decisive advantage over widely used GBDT models.

Medical Diagnosis Transfer Learning

Paper
Code

End-to-end Algorithm Synthesis with Recurrent Networks: Logical Extrapolation Without Overthinking

1 code implementation • 11 Feb 2022 • Arpit Bansal, Avi Schwarzschild, Eitan Borgnia, Zeyad Emam, Furong Huang, Micah Goldblum, Tom Goldstein

Algorithmic extrapolation can be achieved through recurrent systems, which can be iterated many times to solve difficult reasoning problems.

Logical Reasoning

Paper
Code

Thinking Deeper With Recurrent Networks: Logical Extrapolation Without Overthinking

no code implementations • 29 Sep 2021 • Arpit Bansal, Avi Schwarzschild, Eitan Borgnia, Zeyad Emam, Furong Huang, Micah Goldblum, Tom Goldstein

Classical machine learning systems perform best when they are trained and tested on the same distribution, and they lack a mechanism to increase model power after training is complete.

Paper
Add Code

Datasets for Studying Generalization from Easy to Hard Examples

1 code implementation • 13 Aug 2021 • Avi Schwarzschild, Eitan Borgnia, Arjun Gupta, Arpit Bansal, Zeyad Emam, Furong Huang, Micah Goldblum, Tom Goldstein

We describe new datasets for studying generalization from easy to hard examples.

Paper
Code

MetaBalance: High-Performance Neural Networks for Class-Imbalanced Data

no code implementations • 17 Jun 2021 • Arpit Bansal, Micah Goldblum, Valeriia Cherepanova, Avi Schwarzschild, C. Bayan Bruss, Tom Goldstein

Class-imbalanced data, in which some classes contain far more samples than others, is ubiquitous in real-world applications.

Fraud Detection Image Classification +3

Paper
Add Code

Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks

1 code implementation • NeurIPS 2021 • Avi Schwarzschild, Eitan Borgnia, Arjun Gupta, Furong Huang, Uzi Vishkin, Micah Goldblum, Tom Goldstein

In this work, we show that recurrent networks trained to solve simple problems with few recurrent steps can indeed solve much more complex problems simply by performing additional recurrences during inference.

Paper
Code

SAINT: Improved Neural Networks for Tabular Data via Row Attention and Contrastive Pre-Training

7 code implementations • 2 Jun 2021 • Gowthami Somepalli, Micah Goldblum, Avi Schwarzschild, C. Bayan Bruss, Tom Goldstein

We devise a hybrid deep learning approach to solving tabular data problems.

Fraud Detection Insurance Prediction

1,234

Paper
Code

The Uncanny Similarity of Recurrence and Depth

1 code implementation • ICLR 2022 • Avi Schwarzschild, Arjun Gupta, Amin Ghiasi, Micah Goldblum, Tom Goldstein

It is widely believed that deep neural networks contain layer specialization, wherein neural networks extract hierarchical features representing edges and patterns in shallow layers and complete objects in deeper layers.

Image Classification

Paper
Code

Just How Toxic is Data Poisoning? A Benchmark for Backdoor and Data Poisoning Attacks

no code implementations • 1 Jan 2021 • Avi Schwarzschild, Micah Goldblum, Arjun Gupta, John P Dickerson, Tom Goldstein

Data poisoning and backdoor attacks manipulate training data in order to cause models to fail during inference.

Data Poisoning Experimental Design

Paper
Add Code

Dataset Security for Machine Learning: Data Poisoning, Backdoor Attacks, and Defenses

no code implementations • 18 Dec 2020 • Micah Goldblum, Dimitris Tsipras, Chulin Xie, Xinyun Chen, Avi Schwarzschild, Dawn Song, Aleksander Madry, Bo Li, Tom Goldstein

As machine learning systems grow in scale, so do their training data requirements, forcing practitioners to automate and outsource the curation of training data in order to achieve state-of-the-art performance.

BIG-bench Machine Learning Data Poisoning

Paper
Add Code

Just How Toxic is Data Poisoning? A Unified Benchmark for Backdoor and Data Poisoning Attacks

2 code implementations • 22 Jun 2020 • Avi Schwarzschild, Micah Goldblum, Arjun Gupta, John P. Dickerson, Tom Goldstein

Data poisoning and backdoor attacks manipulate training data in order to cause models to fail during inference.

Data Poisoning

144

Paper
Code

Headless Horseman: Adversarial Attacks on Transfer Learning Models

no code implementations • 20 Apr 2020 • Ahmed Abdelkader, Michael J. Curry, Liam Fowl, Tom Goldstein, Avi Schwarzschild, Manli Shu, Christoph Studer, Chen Zhu

We first demonstrate successful transfer attacks against a victim network using \textit{only} its feature extractor.

Adversarial Attack Transfer Learning

Paper
Add Code

Adversarial Attacks on Machine Learning Systems for High-Frequency Trading

no code implementations • 21 Feb 2020 • Micah Goldblum, Avi Schwarzschild, Ankit B. Patel, Tom Goldstein

Algorithmic trading systems are often completely automated, and deep learning is increasingly receiving attention in this domain.

Algorithmic Trading BIG-bench Machine Learning +1

Paper
Add Code

Truth or Backpropaganda? An Empirical Investigation of Deep Learning Theory

1 code implementation • ICLR 2020 • Micah Goldblum, Jonas Geiping, Avi Schwarzschild, Michael Moeller, Tom Goldstein

We empirically evaluate common assumptions about neural networks that are widely held by practitioners and theorists alike.

Learning Theory

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.