Search Results for author: Jack Chen

Found 6 papers, 2 papers with code

Alignment faking in large language models

1 code implementation18 Dec 2024 Ryan Greenblatt, Carson Denison, Benjamin Wright, Fabien Roger, Monte MacDiarmid, Sam Marks, Johannes Treutlein, Tim Belonax, Jack Chen, David Duvenaud, Akbir Khan, Julian Michael, Sören Mindermann, Ethan Perez, Linda Petrini, Jonathan Uesato, Jared Kaplan, Buck Shlegeris, Samuel R. Bowman, Evan Hubinger

Explaining this gap, in almost all cases where the model complies with a harmful query from a free user, we observe explicit alignment-faking reasoning, with the model stating it is strategically answering harmful queries in training to preserve its preferred harmlessness behavior out of training.

Large Language Model

BugBlitz-AI: An Intelligent QA Assistant

no code implementations17 May 2024 Yi Yao, Jun Wang, Yabai Hu, LiFeng Wang, Yi Zhou, Jack Chen, Xuming Gai, Zhenming Wang, Wenjun Liu

The evolution of software testing from manual to automated methods has significantly influenced quality assurance (QA) practices.

software testing

Bolt: Bridging the Gap between Auto-tuners and Hardware-native Performance

no code implementations25 Oct 2021 Jiarong Xing, Leyuan Wang, Shang Zhang, Jack Chen, Ang Chen, Yibo Zhu

Today's auto-tuners (e. g., AutoTVM, Ansor) generate efficient tensor programs by navigating a large search space to identify effective implementations, but they do so with opaque hardware details.

Experimental Demonstration of Learned Time-Domain Digital Back-Propagation

no code implementations23 Dec 2019 Eric Sillekens, Wenting Yi, Daniel Semrau, Alessandro Ottino, Boris Karanov, Sujie Zhou, Kevin Law, Jack Chen, Domanic Lavery, Lidia Galdino, Polina Bayvel, Robert I. Killey

We present the first experimental demonstration of learned time-domain digital back-propagation (DBP), in 64-GBd dual-polarization 64-QAM signal transmission over 1014 km.

Adversarial Explanations for Understanding Image Classification Decisions and Improved Neural Network Robustness

1 code implementation7 Jun 2019 Walt Woods, Jack Chen, Christof Teuscher

For sensitive problems, such as medical imaging or fraud detection, Neural Network (NN) adoption has been slow due to concerns about their reliability, leading to a number of algorithms for explaining their decisions.

Adversarial Defense Fraud Detection +5

Cannot find the paper you are looking for? You can Submit a new open access paper.