no code implementations • 11 Dec 2023 • Sangwon Hyun, Mingyu Guo, M. Ali Babar
Through experiments conducted with three prominent LLMs, we have confirmed that the METAL framework effectively evaluates essential quality attributes (QAs) on primary LLM tasks and reveals the quality risks in LLMs.
2 code implementations • 25 Aug 2020 • Sangwon Hyun, Mattias Rolf Cape, Francois Ribalet, Jacob Bien
The ocean is filled with microscopic algae called phytoplankton, which together are responsible for as much photosynthesis as all plants on land combined.
1 code implementation • 11 Jun 2016 • Sangwon Hyun, Max G'Sell, Ryan J. Tibshirani
Leveraging a sequential characterization of the generalized lasso solution path from Tibshirani & Taylor (2011), and recent advances in post-selection inference from Lee et al. (2016) and Tibshirani et al. (2016), we develop exact hypothesis tests and confidence intervals for linear contrasts of the underlying mean vector, conditioned on any model selection event along the path (assuming Gaussian errors in the observations).
Methodology