no code implementations • 1 Apr 2024 • Wilson Wu, John X. Morris, Lionel Levine
Do transformers "think ahead" during inference at a given position?
no code implementations • 18 Nov 2023 • Wilson Wu
Our goal is to learn a DFA representation of the oracle that preserves the information in which the oracle is confident.
no code implementations • 2 Oct 2019 • Lakshya Jain, Wilson Wu, Steven Chen, Uyeong Jang, Varun Chandrasekaran, Sanjit Seshia, Somesh Jha
In this paper, we explore semantic adversarial examples (SAEs), in which an attacker creates perturbations in the semantic space representing the environment that produces input for the ML model.