Search Results for author: Sankaran Vaidyanathan

Found 4 papers, 1 papers with code

Adaptive Circuit Behavior and Generalization in Mechanistic Interpretability

no code implementations25 Nov 2024 Jatin Nainani, Sankaran Vaidyanathan, AJ Yeung, Kartik Gupta, David Jensen

For instance, it is unclear whether the models generalization results from reusing the same circuit components, the components behaving differently, or the use of entirely different components.

Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges

1 code implementation18 Jun 2024 Aman Singh Thakur, Kartik Choudhary, Venkat Srinik Ramayapally, Sankaran Vaidyanathan, Dieuwke Hupkes

Lastly, our research rediscovers the importance of using alignment metrics beyond simple percent alignment, showing that judges with high percent agreement can still assign vastly different scores.

TriviaQA

Automated Discovery of Functional Actual Causes in Complex Environments

no code implementations16 Apr 2024 Caleb Chuck, Sankaran Vaidyanathan, Stephen Giguere, Amy Zhang, David Jensen, Scott Niekum

This paper introduces functional actual cause (FAC), a framework that uses context-specific independencies in the environment to restrict the set of actual causes.

Attribute Reinforcement Learning (RL)

Hypergraph Clustering: A Modularity Maximization Approach

no code implementations28 Dec 2018 Tarun Kumar, Sankaran Vaidyanathan, Harini Ananthapadmanabhan, Srinivasan Parthasarathy, Balaraman Ravindran

Clustering on hypergraphs has been garnering increased attention with potential applications in network analysis, VLSI design and computer vision, among others.

Clustering

Cannot find the paper you are looking for? You can Submit a new open access paper.