Search Results for author: Dylan Zhang

Found 4 papers, 0 papers with code

Instruction Diversity Drives Generalization To Unseen Tasks

no code implementations • 16 Feb 2024 • Dylan Zhang, Justin Wang, Francois Charton

We investigate the trade-off between the number of instructions the model is trained on and the number of training samples provided for each instruction and observe that the diversity of the instruction set determines generalization.

Language Modelling Large Language Model

Paper
Add Code

Transformer-Based Models Are Not Yet Perfect At Learning to Emulate Structural Recursion

no code implementations • 23 Jan 2024 • Dylan Zhang, Curt Tigges, Zory Zhang, Stella Biderman, Maxim Raginsky, Talia Ringer

The framework includes a representation that captures the general \textit{syntax} of structural recursion, coupled with two different frameworks for understanding their \textit{semantics} -- one that is more natural from a programming languages perspective and one that helps bridge that perspective with a mechanistic understanding of the underlying transformer architecture.

Paper
Add Code

PACE-LM: Prompting and Augmentation for Calibrated Confidence Estimation with GPT-4 in Cloud Incident Root Cause Analysis

no code implementations • 11 Sep 2023 • Dylan Zhang, Xuchao Zhang, Chetan Bansal, Pedro Las-Casas, Rodrigo Fonseca, Saravan Rajmohan

Major cloud providers have employed advanced AI-based solutions like large language models to aid humans in identifying the root causes of cloud incidents.

Decision Making

Paper
Add Code

Optimization of Multi-Factor Model in Quantitative Trading Based On Reinforcement Learning

no code implementations • CUHK Course IERG5350 2020 • Dylan Zhang, Xiaotong LIN

Quantitative trading strategies play an important role in stock trading, and reinforcement learning (RL) has been increasingly applied to trading activities in recent years.

Decision Making reinforcement-learning +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.