Search Results for author: Kshitija Taywade

Found 4 papers, 0 papers with code

Personalizing Task-oriented Dialog Systems via Zero-shot Generalizable Reward Function

no code implementations24 Mar 2023 A. B. Siddique, M. H. Maqbool, Kshitija Taywade, Hassan Foroosh

In this work, we propose a novel framework, P-ToD, to personalize task-oriented dialog systems capable of adapting to a wide range of user profiles in an unsupervised fashion using a zero-shot generalizable reward function.

Using Non-Stationary Bandits for Learning in Repeated Cournot Games with Non-Stationary Demand

no code implementations3 Jan 2022 Kshitija Taywade, Brent Harrison, Judy Goldsmith

We found that using our proposed method, agents are able to swiftly change their course of action according to the changes in demand, and they also engage in collusive behavior in many simulations.

Efficient Exploration

Modelling Cournot Games as Multi-agent Multi-armed Bandits

no code implementations1 Jan 2022 Kshitija Taywade, Brent Harrison, Adib Bagh

We investigate the use of a multi-agent multi-armed bandit (MA-MAB) setting for modeling repeated Cournot oligopoly games, where the firms acting as agents choose from the set of arms representing production quantity (a discrete value).

Multi-Armed Bandits

Multi-agent Reinforcement Learning for Decentralized Stable Matching

no code implementations3 May 2020 Kshitija Taywade, Judy Goldsmith, Brent Harrison

Along with conventional stable matching case where agents have strictly ordered preferences, we check the applicability of our approach for stable matching with incomplete lists and ties.

Fairness Multi-agent Reinforcement Learning +2

Cannot find the paper you are looking for? You can Submit a new open access paper.