no code implementations • 2 Apr 2025 • DongHyun Choi, Lucas Spangher, Chris Hidey, Peter Grabowski, Ramy Eskander
Transformer-based Large Language Models, which suffer from high computational costs, advance so quickly that techniques proposed to streamline earlier iterations are not guaranteed to benefit more modern models.
no code implementations • 29 Mar 2025 • Nicholas Roth, Christopher Hidey, Lucas Spangher, William F. Arnold, Chang Ye, Nick Masiewicki, Jinoo Baek, Peter Grabowski, Eugene Ie
In this paper, we propose a novel factored agent architecture designed to overcome the limitations of traditional single-agent systems in agentic AI.
no code implementations • 28 Oct 2024 • Lucas Spangher, Tianle Li, William F. Arnold, Nick Masiewicki, Xerxes Dotiwalla, Rama Parusmathi, Peter Grabowski, Eugene Ie, Dan Gruhl
There exists an extremely wide array of LLM benchmarking tasks, whereas oftentimes a single number is the most actionable for decision-making, especially by non-experts.
no code implementations • 14 Oct 2024 • Dhruva Chayapathy, Tavis Siebert, Lucas Spangher, Akshata Kishore Moharir, Om Manoj Patil, Cristina Rea
Machine Learning guided data augmentation may support the development of technologies in the physical sciences, such as nuclear fusion tokamaks.
no code implementations • 29 May 2024 • Pierre Harvey Richemond, Yunhao Tang, Daniel Guo, Daniele Calandriello, Mohammad Gheshlaghi Azar, Rafael Rafailov, Bernardo Avila Pires, Eugene Tarassov, Lucas Spangher, Will Ellsworth, Aliaksei Severyn, Jonathan Mallinson, Lior Shani, Gil Shamir, Rishabh Joshi, Tianqi Liu, Remi Munos, Bilal Piot
The canonical element of such datasets is for instance an LLM's response to a user's prompt followed by a user's feedback such as a thumbs-up/down.
2 code implementations • 16 Dec 2023 • Doseok Jang, Larry Yan, Lucas Spangher, Costas Spanos
Reinforcement learning (RL) is a powerful tool for optimal control that has found great success in Atari games, the game of Go, robotic control, and building optimization.
no code implementations • 3 Dec 2023 • William F Arnold, Lucas Spangher, Christina Rea
Grid decarbonization for climate change requires dispatchable carbon-free energy like nuclear fusion.
no code implementations • 27 Nov 2022 • Hari Prasanna Das, Yu-Wen Lin, Utkarsha Agwan, Lucas Spangher, Alex Devonport, Yu Yang, Jan Drgona, Adrian Chong, Stefano Schiavon, Costas J. Spanos
In this work, we review the ways in which machine learning has been leveraged to make buildings smart and energy-efficient.
no code implementations • 13 Oct 2022 • Doseok Jang, Larry Yan, Lucas Spangher, Costas J. Spanos
We develop the first application of Personalized Federated Hypernetworks (PFH) to Reinforcement Learning (RL).
Multi-agent Reinforcement Learning
Personalized Federated Learning
+3
no code implementations • 11 Nov 2021 • William Arnold, Tarang Srivastava, Lucas Spangher, Utkarsha Agwan, Costas Spanos
Optimizing prices for energy demand response requires a flexible controller with ability to navigate complex environments.
no code implementations • 14 Aug 2021 • Doseok Jang, Lucas Spangher, Manan Khattar, Utkarsha Agwan, Selvaprabuh Nadarajah, Costas Spanos
Our team is proposing to run a full-scale energy demand response experiment in an office building.
no code implementations • 29 Apr 2021 • Doseok Jang, Lucas Spangher, Manan Khattar, Utkarsha Agwan, Costas Spanos
Our team is proposing to run a full-scale energy demand response experiment in an office building.