no code implementations • 16 Feb 2024 • Dylan Zhang, Justin Wang, Francois Charton
We investigate the trade-off between the number of instructions the model is trained on and the number of training samples provided for each instruction and observe that the diversity of the instruction set determines generalization.
no code implementations • 23 Jan 2024 • Dylan Zhang, Curt Tigges, Zory Zhang, Stella Biderman, Maxim Raginsky, Talia Ringer
The framework includes a representation that captures the general \textit{syntax} of structural recursion, coupled with two different frameworks for understanding their \textit{semantics} -- one that is more natural from a programming languages perspective and one that helps bridge that perspective with a mechanistic understanding of the underlying transformer architecture.
no code implementations • 11 Sep 2023 • Dylan Zhang, Xuchao Zhang, Chetan Bansal, Pedro Las-Casas, Rodrigo Fonseca, Saravan Rajmohan
Major cloud providers have employed advanced AI-based solutions like large language models to aid humans in identifying the root causes of cloud incidents.
no code implementations • CUHK Course IERG5350 2020 • Dylan Zhang, Xiaotong LIN
Quantitative trading strategies play an important role in stock trading, and reinforcement learning (RL) has been increasingly applied to trading activities in recent years.