Search Results for author: Mohammad Pedramfar

Found 6 papers, 1 papers with code

Unified Projection-Free Algorithms for Adversarial DR-Submodular Optimization

no code implementations15 Mar 2024 Mohammad Pedramfar, Yididiya Y. Nadew, Christopher J. Quinn, Vaneet Aggarwal

This paper introduces unified projection-free Frank-Wolfe type algorithms for adversarial continuous DR-submodular optimization, spanning scenarios such as full information and (semi-)bandit feedback, monotone and non-monotone functions, different constraints, and types of stochastic queries.

A Generalized Approach to Online Convex Optimization

no code implementations13 Feb 2024 Mohammad Pedramfar, Vaneet Aggarwal

We also show that any such algorithm that requires full-information feedback may be transformed to an algorithm with semi-bandit feedback with comparable regret bound.

A Unified Approach for Maximizing Continuous DR-submodular Functions

no code implementations NeurIPS 2023 Mohammad Pedramfar, Christopher John Quinn, Vaneet Aggarwal

This paper presents a unified approach for maximizing continuous DR-submodular functions that encompasses a range of settings and oracle access types.

Stochastic Submodular Bandits with Delayed Composite Anonymous Bandit Feedback

no code implementations23 Mar 2023 Mohammad Pedramfar, Vaneet Aggarwal

This paper investigates the problem of combinatorial multiarmed bandits with stochastic submodular (in expectation) rewards and full-bandit delayed feedback, where the delayed feedback is assumed to be composite and anonymous.

Coagent Networks Revisited

1 code implementation28 Jan 2020 Modjtaba Shokrian Zini, Mohammad Pedramfar, Matthew Riemer, Ahmadreza Moradipari, Miao Liu

Coagent networks formalize the concept of arbitrary networks of stochastic agents that collaborate to take actions in a reinforcement learning environment.

Hierarchical Reinforcement Learning reinforcement-learning

Cannot find the paper you are looking for? You can Submit a new open access paper.