no code implementations • 14 Nov 2019 • Aditya Narayan Ravi, Pranav Poduval, Dr. Sharayu Moharir
We model the recommendation system as a bandit that seeks to maximize its reward by pulling arms whose rewards are unknown.
Multi-Armed Bandits • Recommendation Systems
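
The bandit framing in the abstract can be illustrated with a minimal sketch. It uses epsilon-greedy, a standard baseline strategy, and is not the authors' algorithm (the listing does not describe one). It assumes each arm is an item whose reward is a Bernoulli click with a fixed probability hidden from the agent. The function name `run_epsilon_greedy`, its parameters, and the click probabilities are all hypothetical and chosen only for illustration.

```python
import random


def run_epsilon_greedy(true_means, steps=5000, epsilon=0.1, seed=0):
    """Simulate an epsilon-greedy agent on Bernoulli arms.

    true_means: hypothetical click probabilities, hidden from the agent.
    Returns (estimated means, pull counts, total reward).
    """
    rng = random.Random(seed)  # local generator keeps runs reproducible
    n_arms = len(true_means)
    counts = [0] * n_arms       # number of times each arm was pulled
    estimates = [0.0] * n_arms  # running mean reward observed per arm
    total_reward = 0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest estimate so far.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # The environment draws a 0/1 reward from the arm's hidden mean.
        reward = 1 if rng.random() < true_means[arm] else 0

        # Incremental update of the running mean for the pulled arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return estimates, counts, total_reward


if __name__ == "__main__":
    # Three hypothetical items with assumed click probabilities.
    estimates, counts, total = run_epsilon_greedy([0.2, 0.5, 0.8])
    print("estimated means:", [round(e, 3) for e in estimates])
    print("pull counts:", counts)
    print("total reward:", total)
```

The `epsilon` parameter sets the trade-off between trying arms at random to learn their rewards and repeatedly pulling the arm that currently looks best. Other strategies, such as UCB or Thompson sampling, would replace only the arm-selection step.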