no code implementations • 24 Mar 2023 • A. B. Siddique, M. H. Maqbool, Kshitija Taywade, Hassan Foroosh
In this work, we propose a novel framework, P-ToD, to personalize task-oriented dialog systems capable of adapting to a wide range of user profiles in an unsupervised fashion using a zero-shot generalizable reward function.
no code implementations • 3 Jan 2022 • Kshitija Taywade, Brent Harrison, Judy Goldsmith
We found that, using our proposed method, agents swiftly change their course of action in response to changes in demand, and that they engage in collusive behavior in many simulations.
no code implementations • 1 Jan 2022 • Kshitija Taywade, Brent Harrison, Adib Bagh
We investigate the use of a multi-agent multi-armed bandit (MA-MAB) setting for modeling repeated Cournot oligopoly games, where the firms acting as agents choose from the set of arms representing production quantity (a discrete value).
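As a rough illustration of the setting described in this entry, the sketch below has each firm run an independent bandit over a discrete set of production quantities and receive its Cournot profit as the reward. Everything specific here is an assumption made for illustration and is not taken from the paper: the linear inverse demand `P = max(0, a - b*Q)`, the constant unit cost, the epsilon-greedy learner (swapped in as a simple stand-in; the excerpt does not say which bandit algorithms the authors use), and the hypothetical names `cournot_profits`, `EpsilonGreedyFirm`, and `simulate`.

```python
import random


def cournot_profits(quantities, a=100.0, b=1.0, cost=10.0):
    """Per-firm profit under an ASSUMED linear inverse demand P = max(0, a - b*Q)."""
    price = max(0.0, a - b * sum(quantities))
    return [(price - cost) * q for q in quantities]


class EpsilonGreedyFirm:
    """Hypothetical firm agent: one arm per discrete production quantity."""

    def __init__(self, arms, epsilon=0.1, rng=None):
        self.arms = list(arms)
        self.epsilon = epsilon
        self.rng = rng or random.Random()
        self.counts = [0] * len(self.arms)
        self.values = [0.0] * len(self.arms)  # running mean profit per arm

    def best_arm(self):
        # Index of the arm with the highest estimated mean profit.
        return max(range(len(self.arms)), key=lambda i: self.values[i])

    def choose(self):
        # Explore with probability epsilon, otherwise exploit the best estimate.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(len(self.arms))
        return self.best_arm()

    def update(self, arm, reward):
        # Incremental update of the sample mean for the pulled arm.
        self.counts[arm] += 1
        self.values[arm] += (reward - self.values[arm]) / self.counts[arm]


def simulate(n_firms=2, arms=range(0, 51, 5), rounds=5000, seed=0):
    """Repeated game: every round each firm picks a quantity and observes its own profit."""
    master = random.Random(seed)
    firms = [
        EpsilonGreedyFirm(arms, rng=random.Random(master.randrange(2**32)))
        for _ in range(n_firms)
    ]
    for _ in range(rounds):
        picks = [firm.choose() for firm in firms]
        quantities = [firm.arms[i] for firm, i in zip(firms, picks)]
        for firm, i, profit in zip(firms, picks, cournot_profits(quantities)):
            firm.update(i, profit)
    # Report each firm's currently preferred quantity.
    return [firm.arms[firm.best_arm()] for firm in firms]


if __name__ == "__main__":
    print(simulate())
```

Each firm observes only its own profit, so the other firms' learning makes the reward for a given arm non-stationary from any single firm's point of view; the plain sample-mean update used above ignores that and is kept only for brevity.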
no code implementations • 3 May 2020 • Kshitija Taywade, Judy Goldsmith, Brent Harrison
Along with the conventional stable matching case, where agents have strictly ordered preferences, we check the applicability of our approach to stable matching with incomplete lists and ties.