no code implementations • 16 Mar 2023 • Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf
We introduce a Reinforcement Learning Psychotherapy AI Companion that generates topic recommendations for therapists based on patient responses.
no code implementations • 21 Feb 2023 • Baihan Lin, Stefan Zecevic, Djallel Bouneffouf, Guillermo Cecchi
We present the TherapyView, a demonstration system to help therapists visualize the dynamic contents of past treatment sessions, enabled by the state-of-the-art neural topic modeling techniques to analyze the topical tendencies of various psychiatric conditions and deep learning-based image generation engine to provide a visual summary.
no code implementations • 2 Feb 2023 • Baihan Lin, Djallel Bouneffouf, Irina Rish
The field of compositional generalization is currently experiencing a renaissance in AI, as novel problem settings and algorithms motivated by various practical applications are being introduced, building on top of the classical compositional generalization problem.
no code implementations • 28 Oct 2022 • Qinyi Chen, Negin Golrezaei, Djallel Bouneffouf
We formalize this by characterizing a per-round regret lower bound, where the regret is measured against a strong (dynamic) benchmark.
1 code implementation • 27 Oct 2022 • Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf
As a predictive measure of the treatment outcome in psychotherapy, the working alliance measures the agreement of the patient and the therapist in terms of their bond, task and goal.
no code implementations • 8 Sep 2022 • Djallel Bouneffouf, Charu C. Aggarwal
In recent years, the Neurosymbolic framework has attracted a lot of attention in various applications, from recommender systems and information retrieval to healthcare and finance.
no code implementations • 27 Aug 2022 • Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf
We propose a recommendation system that suggests treatment strategies to a therapist during the psychotherapy session in real-time.
no code implementations • 22 Aug 2022 • Tsuyoshi Idé, Keerthiram Murugesan, Djallel Bouneffouf, Naoki Abe
The proposed framework is designed to accommodate any number of feature vectors in the form of multi-mode tensor, thereby enabling to capture the heterogeneity that may exist over user preferences, products, and campaign strategies in a unified manner.
no code implementations • 13 Apr 2022 • Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi, Ravi Tejwani
In this work, we compare different neural topic modeling methods in learning the topical propensities of different psychiatric conditions from the psychotherapy session transcripts parsed from speech recordings.
no code implementations • 12 Apr 2022 • Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf
The therapeutic working alliance is an important predictor of the outcome of the psychotherapy treatment.
1 code implementation • 30 Jun 2021 • Baihan Lin, Djallel Bouneffouf
In light of the COVID-19 pandemic, it is an open challenge and critical practical problem to find a optimal way to dynamically prescribe the best policies that balance both the governmental resources and epidemic control in different countries and regions.
no code implementations • ICML Workshop AutoML 2021 • Akihiro Kishimoto, Djallel Bouneffouf, Radu Marinescu, Parikshit Ram, Ambrish Rawat, Martin Wistuba, Paulito Pedregosa Palmes, Adi Botea
Optimizing a machine learning (ML) pipeline has been an important topic of AI and ML.
1 code implementation • 15 Mar 2021 • Jonathan P. Epperlein, Roman Overko, Sergiy Zhuk, Christopher King, Djallel Bouneffouf, Andrew Cullen, Robert Shorten
In some cases, the environment is not influenced by the actions of the RL agent, in which case the problem can be modeled as a contextual multi-armed bandit and lightweight myopic algorithms can be employed.
no code implementations • 4 Jan 2021 • Djallel Bouneffouf
The Multi-armed bandit offer the advantage to learn and exploit the already learnt knowledge at the same time.
1 code implementation • 22 Oct 2020 • Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi
Unlike traditional time series, the action sequences of human decision making usually involve many cognitive processes such as beliefs, desires, intentions, and theory of mind, i. e., what others are thinking.
no code implementations • 15 Oct 2020 • Djallel Bouneffouf, Raphaël Féraud, Sohini Upadhyay, Yasaman Khazaeni, Irina Rish
In this paper, we analyze and extend an online learning framework known as Context-Attentive Bandit, motivated by various practical applications, from medical diagnosis to dialog systems, where due to observation costs only a small subset of a potentially large number of context variables can be observed at each iteration;however, the agent has a freedom to choose which variables to observe.
no code implementations • 29 Sep 2020 • Pu Zhao, Parikshit Ram, Songtao Lu, Yuguang Yao, Djallel Bouneffouf, Xue Lin, Sijia Liu
The resulting scheme for meta-learning a UAP generator (i) has better performance (50% higher ASR) than baselines such as Projected Gradient Descent, (ii) has better performance (37% faster) than the vanilla L2O and MAML frameworks (when applicable), and (iii) is able to simultaneously handle UAP generation for different victim models and image data sources.
no code implementations • 21 Jul 2020 • Djallel Bouneffouf
Spectral clustering has shown a superior performance in analyzing the cluster structure.
no code implementations • 17 Jul 2020 • Djallel Bouneffouf
Dirichlet-multinomial (DMN) distribution is commonly used to model over-dispersion in count data.
no code implementations • 13 Jul 2020 • Djallel Bouneffouf, Sohini Upadhyay, Yasaman Khazaeni
We consider a novel variant of the contextual bandit problem (i. e., the multi-armed bandit with side-information, or context, available to a decision-maker) where the reward associated with each context-based decision may not always be observed("missing rewards").
no code implementations • 26 Jun 2020 • Djallel Bouneffouf
We consider a novel variant of the contextual bandit problem (i. e., the multi-armed bandit with side-information, or context, available to a decision-maker) where the context used at each decision may be corrupted ("useless context").
no code implementations • 17 Jun 2020 • Parikshit Ram, Sijia Liu, Deepak Vijaykeerthi, Dakuo Wang, Djallel Bouneffouf, Greg Bramble, Horst Samulowitz, Alexander G. Gray
The CASH problem has been widely studied in the context of automated configurations of machine learning (ML) pipelines and various solvers and toolkits are available.
1 code implementation • 9 Jun 2020 • Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi
As an important psychological and social experiment, the Iterated Prisoner's Dilemma (IPD) treats the choice to cooperate or defect as an atomic action.
1 code implementation • 10 May 2020 • Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf, Jenna Reinen, Irina Rish
Artificial behavioral agents are often evaluated based on their consistent behaviors and performance to take sequential actions in an environment to maximize some notion of cumulative reward.
no code implementations • 4 May 2020 • Djallel Bouneffouf, Emmanuelle Claeys
We study here the problem of learning the exploration exploitation trade-off in the contextual bandit problem with linear reward function setting.
no code implementations • 22 Oct 2019 • Charu Aggarwal, Djallel Bouneffouf, Horst Samulowitz, Beat Buesser, Thanh Hoang, Udayan Khurana, Sijia Liu, Tejaswini Pedapati, Parikshit Ram, Ambrish Rawat, Martin Wistuba, Alexander Gray
Data science is labor-intensive and human experts are scarce but heavily involved in every aspect of it.
no code implementations • NeurIPS Workshop Neuro_AI 2019 • Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf, Jenna Reinen, Irina Rish
Drawing an inspiration from behavioral studies of human decision making, we propose here a general parametric framework for a reinforcement learning problem, which extends the standard Q-learning approach to incorporate a two-stream framework of reward processing with biases biologically associated with several neurological and psychiatric conditions, including Parkinson's and Alzheimer's diseases, attention-deficit/hyperactivity disorder (ADHD), addiction, and chronic pain.
1 code implementation • 21 Jun 2019 • Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf, Jenna Reinen, Irina Rish
Drawing an inspiration from behavioral studies of human decision making, we propose here a more general and flexible parametric framework for reinforcement learning that extends standard Q-learning to a two-stream model for processing positive and negative rewards, and allows to incorporate a wide range of reward-processing biases -- an important component of human decision making which can help us better understand a wide spectrum of multi-agent interactions in complex real-world socioeconomic systems, as well as various neuropsychiatric conditions associated with disruptions in normal reward processing.
1 code implementation • 21 Jun 2019 • Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi
Drawing an inspiration from behavioral studies of human decision making, we propose here a general parametric framework for a reinforcement learning problem, which extends the standard Q-learning approach to incorporate a two-stream framework of reward processing with biases biologically associated with several neurological and psychiatric conditions, including Parkinson's and Alzheimer's diseases, attention-deficit/hyperactivity disorder (ADHD), addiction, and chronic pain.
no code implementations • 31 May 2019 • Djallel Bouneffouf, Srinivasan Parthasarathy, Horst Samulowitz, Martin Wistub
We consider the stochastic multi-armed bandit problem and the contextual bandit problem with historical observations and pre-clustered arms.
no code implementations • 1 May 2019 • Sijia Liu, Parikshit Ram, Deepak Vijaykeerthy, Djallel Bouneffouf, Gregory Bramble, Horst Samulowitz, Dakuo Wang, Andrew Conn, Alexander Gray
We study the AutoML problem of automatically configuring machine learning pipelines by jointly selecting algorithms and their appropriate hyper-parameters for all steps in supervised learning pipelines.
no code implementations • 2 Apr 2019 • Djallel Bouneffouf, Irina Rish
In recent years, multi-armed bandit (MAB) framework has attracted a lot of attention in various applications, from recommender systems and information retrieval to healthcare and finance, due to its stellar performance combined with certain attractive properties, such as learning from less feedback.
no code implementations • 21 Sep 2018 • Ritesh Noothigattu, Djallel Bouneffouf, Nicholas Mattei, Rachita Chandra, Piyush Madan, Kush Varshney, Murray Campbell, Moninder Singh, Francesca Rossi
To ensure that agents behave in ways aligned with the values of the societies in which they operate, we must develop techniques that allow these agents to not only maximize their reward in an environment, but also to learn and follow the implicit constraints of society.
Multi-Objective Reinforcement Learning
reinforcement-learning
no code implementations • 15 Sep 2018 • Avinash Balakrishnan, Djallel Bouneffouf, Nicholas Mattei, Francesca Rossi
To define this agent, we propose to adopt a novel extension to the classical contextual multi-armed bandit setting and we provide a new algorithm called Behavior Constrained Thompson Sampling (BCTS) that allows for online learning while obeying exogenous constraints.
1 code implementation • 24 Jun 2018 • Anna Choromanska, Benjamin Cowen, Sadhana Kumaravel, Ronny Luss, Mattia Rigotti, Irina Rish, Brian Kingsbury, Paolo DiAchille, Viatcheslav Gurev, Ravi Tejwani, Djallel Bouneffouf
Despite significant recent advances in deep neural networks, training them remains a challenge due to the highly non-convex nature of the objective function.
1 code implementation • 3 Feb 2018 • Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi, Irina Rish
Our experiments on a variety of datasets, and both in stationary and non-stationary environments of several kinds demonstrate clear advantages of the proposed adaptive representation learning over the standard contextual bandit based on "raw" input contexts.
no code implementations • 17 Nov 2017 • Matthew Riemer, Tim Klinger, Djallel Bouneffouf, Michele Franceschini
Given the recent success of Deep Learning applied to a variety of single tasks, it is natural to consider more human-realistic settings.
no code implementations • 7 Jun 2017 • Djallel Bouneffouf, Irina Rish, Guillermo A. Cecchi
Drawing an inspiration from behavioral studies of human decision making, we propose here a general parametric framework for multi-armed bandit problem, which extends the standard Thompson Sampling approach to incorporate reward processing biases associated with several neurological and psychiatric conditions, including Parkinson's and Alzheimer's diseases, attention-deficit/hyperactivity disorder (ADHD), addiction, and chronic pain.
no code implementations • 10 May 2017 • Djallel Bouneffouf, Irina Rish, Guillermo A. Cecchi, Raphael Feraud
We consider a novel formulation of the multi-armed bandit model, which we call the contextual bandit with restricted context, where only a limited number of features can be accessed by the learner at every iteration.
no code implementations • 28 Aug 2015 • Djallel Bouneffouf, Raphaël Feraud
We consider a variant of the multi-armed bandit model, which we call multi-armed bandit problem with known trend, where the gambler knows the shape of the reward function of each arm but not its distribution.
no code implementations • 29 Sep 2014 • Robin Allesiardo, Raphael Feraud, Djallel Bouneffouf
This paper presents a new contextual bandit algorithm, NeuralBandit, which does not need hypothesis on stationarity of contexts and rewards.
no code implementations • 29 Sep 2014 • Djallel Bouneffouf
In this sense, we propose to study the freshness of the user's content in CARS through the bandit problem.
no code implementations • 10 Aug 2014 • Djallel Bouneffouf
Active learning strategies respond to the costly labelling task in a supervised classification by selecting the most useful unlabelled examples in training a predictive model.
no code implementations • 10 Aug 2014 • Djallel Bouneffouf
Mobile Context-Aware Recommender Systems can be naturally modelled as an exploration/exploitation trade-off (exr/exp) problem, where the system has to choose between maximizing its expected rewards dealing with its current knowledge (exploitation) and learning more about the unknown user's preferences to improve its knowledge (exploration).
no code implementations • 9 Feb 2014 • Djallel Bouneffouf
We introduce in this paper an algorithm named Contextuel-E-Greedy that tackles the dynamicity of the user's content.
no code implementations • 10 May 2013 • Djallel Bouneffouf
We present Exponentiated Gradient LINUCB, an algorithm for con-textual multi-armed bandits.
no code implementations • 10 Mar 2013 • Djallel Bouneffouf
Ubiquitous information access becomes more and more important nowadays and research is aimed at making it adapted to users.