Search Results for author: Alex Mallen

Found 6 papers, 5 papers with code

Neural Networks Learn Statistics of Increasing Complexity

1 code implementation6 Feb 2024 Nora Belrose, Quintin Pope, Lucia Quirke, Alex Mallen, Xiaoli Fern

The distributional simplicity bias (DSB) posits that neural networks learn low-order moments of the data distribution first, before moving on to higher-order correlations.

Eliciting Latent Knowledge from Quirky Language Models

1 code implementation2 Dec 2023 Alex Mallen, Madeline Brumley, Julia Kharchenko, Nora Belrose

Eliciting Latent Knowledge (ELK) aims to find patterns in a capable neural network's activations that robustly track the true state of the world, especially in hard-to-verify cases where the model's output is untrusted.

Anomaly Detection Math

Representation Engineering: A Top-Down Approach to AI Transparency

1 code implementation2 Oct 2023 Andy Zou, Long Phan, Sarah Chen, James Campbell, Phillip Guo, Richard Ren, Alexander Pan, Xuwang Yin, Mantas Mazeika, Ann-Kathrin Dombrowski, Shashwat Goel, Nathaniel Li, Michael J. Byun, Zifan Wang, Alex Mallen, Steven Basart, Sanmi Koyejo, Dawn Song, Matt Fredrikson, J. Zico Kolter, Dan Hendrycks

In this paper, we identify and characterize the emerging area of representation engineering (RepE), an approach to enhancing the transparency of AI systems that draws on insights from cognitive neuroscience.

Question Answering

When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories

1 code implementation20 Dec 2022 Alex Mallen, Akari Asai, Victor Zhong, Rajarshi Das, Daniel Khashabi, Hannaneh Hajishirzi

Despite their impressive performance on diverse tasks, large language models (LMs) still struggle with tasks requiring rich world knowledge, implying the limitations of relying solely on their parameters to encode a wealth of world knowledge.

Knowledge Probing Memorization +2

Koopman-theoretic Approach for Identification of Exogenous Anomalies in Nonstationary Time-series Data

1 code implementation18 Sep 2022 Alex Mallen, Christoph A. Keller, J. Nathan Kutz

In many scenarios, it is necessary to monitor a complex system via a time-series of observations and determine when anomalous exogenous events have occurred so that relevant actions can be taken.

Time Series Time Series Analysis

Deep Probabilistic Koopman: Long-term time-series forecasting under periodic uncertainties

no code implementations10 Jun 2021 Alex Mallen, Henning Lange, J. Nathan Kutz

Probabilistic forecasting of complex phenomena is paramount to various scientific disciplines and applications.

Time Series Time Series Forecasting

Cannot find the paper you are looking for? You can Submit a new open access paper.