Search Results for author: Djallel Bouneffouf

Found 53 papers, 10 papers with code

Interpolating Item and User Fairness in Multi-Sided Recommendations

no code implementations12 Jun 2023 Qinyi Chen, Jason Cheuk Nam Liang, Negin Golrezaei, Djallel Bouneffouf

Motivated by this, we formulate a novel fair recommendation framework, called Problem (FAIR), that not only maximizes the platform's revenue, but also accommodates varying fairness considerations from the perspectives of items and users.

Fairness Recommendation Systems

Utterance Classification with Logical Neural Network: Explainable AI for Mental Disorder Diagnosis

no code implementations6 Jun 2023 Yeldar Toleubay, Don Joven Agravante, Daiki Kimura, Baihan Lin, Djallel Bouneffouf, Michiaki Tatsubori

The proposed system addresses the lack of explainability of current Neural Network models and provides a more trustworthy solution for mental disorder diagnosis.

Towards Healthy AI: Large Language Models Need Therapists Too

no code implementations2 Apr 2023 Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi, Kush R. Varshney

By incorporating psychotherapy and reinforcement learning techniques, the framework enables AI chatbots to learn and adapt to human preferences and values in a safe and ethical way, contributing to the development of a more human-centric and responsible AI.

Chatbot

Psychotherapy AI Companion with Reinforcement Learning Recommendations and Interpretable Policy Dynamics

no code implementations16 Mar 2023 Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf

We introduce a Reinforcement Learning Psychotherapy AI Companion that generates topic recommendations for therapists based on patient responses.

reinforcement-learning Reinforcement Learning (RL)

TherapyView: Visualizing Therapy Sessions with Temporal Topic Modeling and AI-Generated Arts

no code implementations21 Feb 2023 Baihan Lin, Stefan Zecevic, Djallel Bouneffouf, Guillermo Cecchi

We present the TherapyView, a demonstration system to help therapists visualize the dynamic contents of past treatment sessions, enabled by the state-of-the-art neural topic modeling techniques to analyze the topical tendencies of various psychiatric conditions and deep learning-based image generation engine to provide a visual summary.

Image Generation Time Series +1

A Survey on Compositional Generalization in Applications

no code implementations2 Feb 2023 Baihan Lin, Djallel Bouneffouf, Irina Rish

The field of compositional generalization is currently experiencing a renaissance in AI, as novel problem settings and algorithms motivated by various practical applications are being introduced, building on top of the classical compositional generalization problem.

Non-Stationary Bandits with Auto-Regressive Temporal Dependency

no code implementations NeurIPS 2023 Qinyi Chen, Negin Golrezaei, Djallel Bouneffouf

Traditional multi-armed bandit (MAB) frameworks, predominantly examined under stochastic or adversarial settings, often overlook the temporal dynamics inherent in many real-world applications such as recommendation systems and online advertising.

Marketing Recommendation Systems +1

Working Alliance Transformer for Psychotherapy Dialogue Classification

1 code implementation27 Oct 2022 Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf

As a predictive measure of the treatment outcome in psychotherapy, the working alliance measures the agreement of the patient and the therapist in terms of their bond, task and goal.

Classification

Survey on Applications of Neurosymbolic Artificial Intelligence

no code implementations8 Sep 2022 Djallel Bouneffouf, Charu C. Aggarwal

In recent years, the Neurosymbolic framework has attracted a lot of attention in various applications, from recommender systems and information retrieval to healthcare and finance.

Information Retrieval Recommendation Systems +1

Targeted Advertising on Social Networks Using Online Variational Tensor Regression

no code implementations22 Aug 2022 Tsuyoshi Idé, Keerthiram Murugesan, Djallel Bouneffouf, Naoki Abe

The proposed framework is designed to accommodate any number of feature vectors in the form of multi-mode tensor, thereby enabling to capture the heterogeneity that may exist over user preferences, products, and campaign strategies in a unified manner.

Marketing regression

Neural Topic Modeling of Psychotherapy Sessions

no code implementations13 Apr 2022 Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi, Ravi Tejwani

In this work, we compare different neural topic modeling methods in learning the topical propensities of different psychiatric conditions from the psychotherapy session transcripts parsed from speech recordings.

Time Series Time Series Analysis

Deep Annotation of Therapeutic Working Alliance in Psychotherapy

no code implementations12 Apr 2022 Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf

The therapeutic working alliance is an important predictor of the outcome of the psychotherapy treatment.

Optimal Epidemic Control as a Contextual Combinatorial Bandit with Budget

1 code implementation30 Jun 2021 Baihan Lin, Djallel Bouneffouf

In light of the COVID-19 pandemic, it is an open challenge and critical practical problem to find a optimal way to dynamically prescribe the best policies that balance both the governmental resources and epidemic control in different countries and regions.

Reinforcement Learning with Algorithms from Probabilistic Structure Estimation

1 code implementation15 Mar 2021 Jonathan P. Epperlein, Roman Overko, Sergiy Zhuk, Christopher King, Djallel Bouneffouf, Andrew Cullen, Robert Shorten

In some cases, the environment is not influenced by the actions of the RL agent, in which case the problem can be modeled as a contextual multi-armed bandit and lightweight myopic algorithms can be employed.

reinforcement-learning Reinforcement Learning (RL)

Etat de l'art sur l'application des bandits multi-bras

no code implementations4 Jan 2021 Djallel Bouneffouf

The Multi-armed bandit offer the advantage to learn and exploit the already learnt knowledge at the same time.

Thompson Sampling

Predicting human decision making in psychological tasks with recurrent neural networks

1 code implementation22 Oct 2020 Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi

Unlike traditional time series, the action sequences of human decision making usually involve many cognitive processes such as beliefs, desires, intentions, and theory of mind, i. e., what others are thinking.

Decision Making Time Series +1

Double-Linear Thompson Sampling for Context-Attentive Bandits

no code implementations15 Oct 2020 Djallel Bouneffouf, Raphaël Féraud, Sohini Upadhyay, Yasaman Khazaeni, Irina Rish

In this paper, we analyze and extend an online learning framework known as Context-Attentive Bandit, motivated by various practical applications, from medical diagnosis to dialog systems, where due to observation costs only a small subset of a potentially large number of context variables can be observed at each iteration;however, the agent has a freedom to choose which variables to observe.

Medical Diagnosis Thompson Sampling

Learning to Generate Image Source-Agnostic Universal Adversarial Perturbations

no code implementations29 Sep 2020 Pu Zhao, Parikshit Ram, Songtao Lu, Yuguang Yao, Djallel Bouneffouf, Xue Lin, Sijia Liu

The resulting scheme for meta-learning a UAP generator (i) has better performance (50% higher ASR) than baselines such as Projected Gradient Descent, (ii) has better performance (37% faster) than the vanilla L2O and MAML frameworks (when applicable), and (iii) is able to simultaneously handle UAP generation for different victim models and image data sources.

Adversarial Attack Bilevel Optimization +1

Spectral Clustering using Eigenspectrum Shape Based Nystrom Sampling

no code implementations21 Jul 2020 Djallel Bouneffouf

Spectral clustering has shown a superior performance in analyzing the cluster structure.

Clustering

Computing the Dirichlet-Multinomial Log-Likelihood Function

no code implementations17 Jul 2020 Djallel Bouneffouf

Dirichlet-multinomial (DMN) distribution is commonly used to model over-dispersion in count data.

Contextual Bandit with Missing Rewards

no code implementations13 Jul 2020 Djallel Bouneffouf, Sohini Upadhyay, Yasaman Khazaeni

We consider a novel variant of the contextual bandit problem (i. e., the multi-armed bandit with side-information, or context, available to a decision-maker) where the reward associated with each context-based decision may not always be observed("missing rewards").

Clustering

Online learning with Corrupted context: Corrupted Contextual Bandits

no code implementations26 Jun 2020 Djallel Bouneffouf

We consider a novel variant of the contextual bandit problem (i. e., the multi-armed bandit with side-information, or context, available to a decision-maker) where the context used at each decision may be corrupted ("useless context").

Multi-Armed Bandits

Solving Constrained CASH Problems with ADMM

no code implementations17 Jun 2020 Parikshit Ram, Sijia Liu, Deepak Vijaykeerthi, Dakuo Wang, Djallel Bouneffouf, Greg Bramble, Horst Samulowitz, Alexander G. Gray

The CASH problem has been widely studied in the context of automated configurations of machine learning (ML) pipelines and various solvers and toolkits are available.

BIG-bench Machine Learning Fairness

Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior

1 code implementation9 Jun 2020 Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi

As an important psychological and social experiment, the Iterated Prisoner's Dilemma (IPD) treats the choice to cooperate or defect as an atomic action.

Multi-Armed Bandits reinforcement-learning +1

Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL

1 code implementation10 May 2020 Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf, Jenna Reinen, Irina Rish

Artificial behavioral agents are often evaluated based on their consistent behaviors and performance to take sequential actions in an environment to maximize some notion of cumulative reward.

Decision Making Multi-Armed Bandits +1

Hyper-parameter Tuning for the Contextual Bandit

no code implementations4 May 2020 Djallel Bouneffouf, Emmanuelle Claeys

We study here the problem of learning the exploration exploitation trade-off in the contextual bandit problem with linear reward function setting.

Reinforcement Learning Models of Human Behavior: Reward Processing in Mental Disorders

no code implementations NeurIPS Workshop Neuro_AI 2019 Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf, Jenna Reinen, Irina Rish

Drawing an inspiration from behavioral studies of human decision making, we propose here a general parametric framework for a reinforcement learning problem, which extends the standard Q-learning approach to incorporate a two-stream framework of reward processing with biases biologically associated with several neurological and psychiatric conditions, including Parkinson's and Alzheimer's diseases, attention-deficit/hyperactivity disorder (ADHD), addiction, and chronic pain.

Decision Making Q-Learning +3

Split Q Learning: Reinforcement Learning with Two-Stream Rewards

1 code implementation21 Jun 2019 Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi

Drawing an inspiration from behavioral studies of human decision making, we propose here a general parametric framework for a reinforcement learning problem, which extends the standard Q-learning approach to incorporate a two-stream framework of reward processing with biases biologically associated with several neurological and psychiatric conditions, including Parkinson's and Alzheimer's diseases, attention-deficit/hyperactivity disorder (ADHD), addiction, and chronic pain.

Decision Making Q-Learning +4

A Story of Two Streams: Reinforcement Learning Models from Human Behavior and Neuropsychiatry

1 code implementation21 Jun 2019 Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf, Jenna Reinen, Irina Rish

Drawing an inspiration from behavioral studies of human decision making, we propose here a more general and flexible parametric framework for reinforcement learning that extends standard Q-learning to a two-stream model for processing positive and negative rewards, and allows to incorporate a wide range of reward-processing biases -- an important component of human decision making which can help us better understand a wide spectrum of multi-agent interactions in complex real-world socioeconomic systems, as well as various neuropsychiatric conditions associated with disruptions in normal reward processing.

Decision Making Q-Learning +2

Optimal Exploitation of Clustering and History Information in Multi-Armed Bandit

no code implementations31 May 2019 Djallel Bouneffouf, Srinivasan Parthasarathy, Horst Samulowitz, Martin Wistub

We consider the stochastic multi-armed bandit problem and the contextual bandit problem with historical observations and pre-clustered arms.

Clustering

An ADMM Based Framework for AutoML Pipeline Configuration

no code implementations1 May 2019 Sijia Liu, Parikshit Ram, Deepak Vijaykeerthy, Djallel Bouneffouf, Gregory Bramble, Horst Samulowitz, Dakuo Wang, Andrew Conn, Alexander Gray

We study the AutoML problem of automatically configuring machine learning pipelines by jointly selecting algorithms and their appropriate hyper-parameters for all steps in supervised learning pipelines.

AutoML Binary Classification

A Survey on Practical Applications of Multi-Armed and Contextual Bandits

no code implementations2 Apr 2019 Djallel Bouneffouf, Irina Rish

In recent years, multi-armed bandit (MAB) framework has attracted a lot of attention in various applications, from recommender systems and information retrieval to healthcare and finance, due to its stellar performance combined with certain attractive properties, such as learning from less feedback.

Information Retrieval Multi-Armed Bandits +2

Interpretable Multi-Objective Reinforcement Learning through Policy Orchestration

no code implementations21 Sep 2018 Ritesh Noothigattu, Djallel Bouneffouf, Nicholas Mattei, Rachita Chandra, Piyush Madan, Kush Varshney, Murray Campbell, Moninder Singh, Francesca Rossi

To ensure that agents behave in ways aligned with the values of the societies in which they operate, we must develop techniques that allow these agents to not only maximize their reward in an environment, but also to learn and follow the implicit constraints of society.

Multi-Objective Reinforcement Learning reinforcement-learning

Incorporating Behavioral Constraints in Online AI Systems

no code implementations15 Sep 2018 Avinash Balakrishnan, Djallel Bouneffouf, Nicholas Mattei, Francesca Rossi

To define this agent, we propose to adopt a novel extension to the classical contextual multi-armed bandit setting and we provide a new algorithm called Behavior Constrained Thompson Sampling (BCTS) that allows for online learning while obeying exogenous constraints.

Thompson Sampling

Beyond Backprop: Online Alternating Minimization with Auxiliary Variables

1 code implementation24 Jun 2018 Anna Choromanska, Benjamin Cowen, Sadhana Kumaravel, Ronny Luss, Mattia Rigotti, Irina Rish, Brian Kingsbury, Paolo DiAchille, Viatcheslav Gurev, Ravi Tejwani, Djallel Bouneffouf

Despite significant recent advances in deep neural networks, training them remains a challenge due to the highly non-convex nature of the objective function.

Contextual Bandit with Adaptive Feature Extraction

1 code implementation3 Feb 2018 Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi, Irina Rish

Our experiments on a variety of datasets, and both in stationary and non-stationary environments of several kinds demonstrate clear advantages of the proposed adaptive representation learning over the standard contextual bandit based on "raw" input contexts.

Clustering Decision Making +2

Scalable Recollections for Continual Lifelong Learning

no code implementations17 Nov 2017 Matthew Riemer, Tim Klinger, Djallel Bouneffouf, Michele Franceschini

Given the recent success of Deep Learning applied to a variety of single tasks, it is natural to consider more human-realistic settings.

Bandit Models of Human Behavior: Reward Processing in Mental Disorders

no code implementations7 Jun 2017 Djallel Bouneffouf, Irina Rish, Guillermo A. Cecchi

Drawing an inspiration from behavioral studies of human decision making, we propose here a general parametric framework for multi-armed bandit problem, which extends the standard Thompson Sampling approach to incorporate reward processing biases associated with several neurological and psychiatric conditions, including Parkinson's and Alzheimer's diseases, attention-deficit/hyperactivity disorder (ADHD), addiction, and chronic pain.

Decision Making Thompson Sampling

Context Attentive Bandits: Contextual Bandit with Restricted Context

no code implementations10 May 2017 Djallel Bouneffouf, Irina Rish, Guillermo A. Cecchi, Raphael Feraud

We consider a novel formulation of the multi-armed bandit model, which we call the contextual bandit with restricted context, where only a limited number of features can be accessed by the learner at every iteration.

Recommendation Systems Thompson Sampling

Multi-armed Bandit Problem with Known Trend

no code implementations28 Aug 2015 Djallel Bouneffouf, Raphaël Feraud

We consider a variant of the multi-armed bandit model, which we call multi-armed bandit problem with known trend, where the gambler knows the shape of the reward function of each arm but not its distribution.

Active Learning

Freshness-Aware Thompson Sampling

no code implementations29 Sep 2014 Djallel Bouneffouf

In this sense, we propose to study the freshness of the user's content in CARS through the bandit problem.

Recommendation Systems Thompson Sampling

A Neural Networks Committee for the Contextual Bandit Problem

no code implementations29 Sep 2014 Robin Allesiardo, Raphael Feraud, Djallel Bouneffouf

This paper presents a new contextual bandit algorithm, NeuralBandit, which does not need hypothesis on stationarity of contexts and rewards.

Exponentiated Gradient Exploration for Active Learning

no code implementations10 Aug 2014 Djallel Bouneffouf

Active learning strategies respond to the costly labelling task in a supervised classification by selecting the most useful unlabelled examples in training a predictive model.

Active Learning General Classification

R-UCB: a Contextual Bandit Algorithm for Risk-Aware Recommender Systems

no code implementations10 Aug 2014 Djallel Bouneffouf

Mobile Context-Aware Recommender Systems can be naturally modelled as an exploration/exploitation trade-off (exr/exp) problem, where the system has to choose between maximizing its expected rewards dealing with its current knowledge (exploitation) and learning more about the unknown user's preferences to improve its knowledge (exploration).

Recommendation Systems

Recommandation mobile, sensible au contexte de contenus évolutifs: Contextuel-E-Greedy

no code implementations9 Feb 2014 Djallel Bouneffouf

We introduce in this paper an algorithm named Contextuel-E-Greedy that tackles the dynamicity of the user's content.

Exponentiated Gradient LINUCB for Contextual Multi-Armed Bandits

no code implementations10 May 2013 Djallel Bouneffouf

We present Exponentiated Gradient LINUCB, an algorithm for con-textual multi-armed bandits.

Multi-Armed Bandits

Hybrid Q-Learning Applied to Ubiquitous recommender system

no code implementations10 Mar 2013 Djallel Bouneffouf

Ubiquitous information access becomes more and more important nowadays and research is aimed at making it adapted to users.

Q-Learning Recommendation Systems +1

Cannot find the paper you are looking for? You can Submit a new open access paper.