Search Results for author: Zachary Yahn

Found 6 papers, 5 papers with code

Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable

1 code implementation1 Mar 2025 Tiansheng Huang, Sihao Hu, Fatih Ilhan, Selim Furkan Tekin, Zachary Yahn, Yichang Xu, Ling Liu

While safety alignment has been extensively studied for LLM, there is still a large research gap for Large Reasoning Models (LRMs) that equip with improved reasoning capability.

Language Modeling Language Modelling +2

Multi-Agent Reinforcement Learning with Focal Diversity Optimization

1 code implementation6 Feb 2025 Selim Furkan Tekin, Fatih Ilhan, Tiansheng Huang, Sihao Hu, Zachary Yahn, Ling Liu

First, we develop an agent-fusion framework for encouraging multiple LLM based agents to collaborate in producing the final inference output for each LLM query.

Diversity Multi-agent Reinforcement Learning +3

Rapid Automated Mapping of Clouds on Titan With Instance Segmentation

1 code implementation8 Jan 2025 Zachary Yahn, Douglas M Trent, Ethan Duncan, Benoît Seignovert, John Santerre, Conor Nixon

Despite widespread adoption of deep learning models to address a variety of computer vision tasks, planetary science has yet to see extensive utilization of such tools to address its unique problems.

Instance Segmentation Semantic Segmentation +1

$H^3$Fusion: Helpful, Harmless, Honest Fusion of Aligned LLMs

1 code implementation26 Nov 2024 Selim Furkan Tekin, Fatih Ilhan, Tiansheng Huang, Sihao Hu, Zachary Yahn, Ling Liu

The former penalizes the selection errors of the expert-router, and the latter mediates the expert weights drifting during fine-tuning and dynamically adjusts the fusion behavior of the resulting model by canalizing the activations on the experts.

Mixture-of-Experts

Feature Extraction and Classification from Planetary Science Datasets enabled by Machine Learning

no code implementations26 Oct 2023 Conor Nixon, Zachary Yahn, Ethan Duncan, Ian Neidel, Alyssa Mills, Benoît Seignovert, Andrew Larsen, Kathryn Gansler, Charles Liles, Catherine Walker, Douglas Trent, John Santerre

In a different application, we applied the Mask R-CNN to recognize clouds on Titan, again through updated training followed by testing against new data, with a precision of 95% over 369 images.

Transfer Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.