Search Results for author: Mohammad Pedramfar

Found 7 papers, 2 papers with code

From Linear to Linearizable Optimization: A Novel Framework with Applications to Stationary and Non-stationary DR-submodular Optimization

no code implementations • 27 Apr 2024 • Mohammad Pedramfar, Vaneet Aggarwal

This paper introduces the notion of upper linearizable/quadratizable functions, a class that extends concavity and DR-submodularity in various settings, including monotone and non-monotone cases over different convex sets.

Paper
Add Code

Unified Projection-Free Algorithms for Adversarial DR-Submodular Optimization

1 code implementation • 15 Mar 2024 • Mohammad Pedramfar, Yididiya Y. Nadew, Christopher J. Quinn, Vaneet Aggarwal

This paper introduces unified projection-free Frank-Wolfe type algorithms for adversarial continuous DR-submodular optimization, spanning scenarios such as full information and (semi-)bandit feedback, monotone and non-monotone functions, different constraints, and types of stochastic queries.

Paper
Code

A Generalized Approach to Online Convex Optimization

no code implementations • 13 Feb 2024 • Mohammad Pedramfar, Vaneet Aggarwal

We also show that any such algorithm that requires full-information feedback may be transformed to an algorithm with semi-bandit feedback with comparable regret bound.

Paper
Add Code

Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement Learning

no code implementations • NeurIPS 2023 • Ahmadreza Moradipari, Mohammad Pedramfar, Modjtaba Shokrian Zini, Vaneet Aggarwal

In this paper, we prove the first Bayesian regret bounds for Thompson Sampling in reinforcement learning in a multitude of settings.

reinforcement-learning Thompson Sampling

Paper
Add Code

A Unified Approach for Maximizing Continuous DR-submodular Functions

no code implementations • NeurIPS 2023 • Mohammad Pedramfar, Christopher John Quinn, Vaneet Aggarwal

This paper presents a unified approach for maximizing continuous DR-submodular functions that encompasses a range of settings and oracle access types.

Paper
Add Code

Stochastic Submodular Bandits with Delayed Composite Anonymous Bandit Feedback

no code implementations • 23 Mar 2023 • Mohammad Pedramfar, Vaneet Aggarwal

This paper investigates the problem of combinatorial multiarmed bandits with stochastic submodular (in expectation) rewards and full-bandit delayed feedback, where the delayed feedback is assumed to be composite and anonymous.

Paper
Add Code

Coagent Networks Revisited

1 code implementation • 28 Jan 2020 • Modjtaba Shokrian Zini, Mohammad Pedramfar, Matthew Riemer, Ahmadreza Moradipari, Miao Liu

Coagent networks formalize the concept of arbitrary networks of stochastic agents that collaborate to take actions in a reinforcement learning environment.

Hierarchical Reinforcement Learning reinforcement-learning

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.