Search Results for author: Djallel Bouneffouf

Found 53 papers, 10 papers with code

Contextual Moral Value Alignment Through Context-Based Aggregation

no code implementations • 19 Mar 2024 • Pierre Dognin, Jesus Rios, Ronny Luss, Inkit Padhi, Matthew D Riemer, Miao Liu, Prasanna Sattigeri, Manish Nagireddy, Kush R. Varshney, Djallel Bouneffouf

Developing value-aligned AI agents is a complex undertaking and an ongoing challenge in the field of AI.

Paper
Add Code

Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

no code implementations • 9 Mar 2024 • Swapnaja Achintalwar, Adriana Alvarado Garcia, Ateret Anaby-Tavor, Ioana Baldini, Sara E. Berger, Bishwaranjan Bhattacharjee, Djallel Bouneffouf, Subhajit Chaudhury, Pin-Yu Chen, Lamogha Chiazor, Elizabeth M. Daly, Rogério Abreu de Paula, Pierre Dognin, Eitan Farchi, Soumya Ghosh, Michael Hind, Raya Horesh, George Kour, Ja Young Lee, Erik Miehling, Keerthiram Murugesan, Manish Nagireddy, Inkit Padhi, David Piorkowski, Ambrish Rawat, Orna Raz, Prasanna Sattigeri, Hendrik Strobelt, Sarathkrishna Swaminathan, Christoph Tillmann, Aashka Trivedi, Kush R. Varshney, Dennis Wei, Shalisha Witherspooon, Marcel Zalmanovici

Large language models (LLMs) are susceptible to a variety of risks, from non-faithful output to biased and toxic generations.

Paper
Add Code

COMPASS: Computational Mapping of Patient-Therapist Alliance Strategies with Language Modeling

no code implementations • 22 Feb 2024 • Baihan Lin, Djallel Bouneffouf, Yulia Landa, Rachel Jespersen, Cheryl Corcoran, Guillermo Cecchi

The therapeutic working alliance is a critical factor in predicting the success of psychotherapy treatment.

Language Modelling

Paper
Add Code

Interpolating Item and User Fairness in Multi-Sided Recommendations

no code implementations • 12 Jun 2023 • Qinyi Chen, Jason Cheuk Nam Liang, Negin Golrezaei, Djallel Bouneffouf

Motivated by this, we formulate a novel fair recommendation framework, called Problem (FAIR), that not only maximizes the platform's revenue, but also accommodates varying fairness considerations from the perspectives of items and users.

Fairness Recommendation Systems

Paper
Add Code

Utterance Classification with Logical Neural Network: Explainable AI for Mental Disorder Diagnosis

no code implementations • 6 Jun 2023 • Yeldar Toleubay, Don Joven Agravante, Daiki Kimura, Baihan Lin, Djallel Bouneffouf, Michiaki Tatsubori

The proposed system addresses the lack of explainability of current Neural Network models and provides a more trustworthy solution for mental disorder diagnosis.

Paper
Add Code

Towards Healthy AI: Large Language Models Need Therapists Too

no code implementations • 2 Apr 2023 • Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi, Kush R. Varshney

By incorporating psychotherapy and reinforcement learning techniques, the framework enables AI chatbots to learn and adapt to human preferences and values in a safe and ethical way, contributing to the development of a more human-centric and responsible AI.

Chatbot

Paper
Add Code

Psychotherapy AI Companion with Reinforcement Learning Recommendations and Interpretable Policy Dynamics

no code implementations • 16 Mar 2023 • Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf

We introduce a Reinforcement Learning Psychotherapy AI Companion that generates topic recommendations for therapists based on patient responses.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

TherapyView: Visualizing Therapy Sessions with Temporal Topic Modeling and AI-Generated Arts

no code implementations • 21 Feb 2023 • Baihan Lin, Stefan Zecevic, Djallel Bouneffouf, Guillermo Cecchi

We present the TherapyView, a demonstration system to help therapists visualize the dynamic contents of past treatment sessions, enabled by the state-of-the-art neural topic modeling techniques to analyze the topical tendencies of various psychiatric conditions and deep learning-based image generation engine to provide a visual summary.

Image Generation Time Series +1

Paper
Add Code

A Survey on Compositional Generalization in Applications

no code implementations • 2 Feb 2023 • Baihan Lin, Djallel Bouneffouf, Irina Rish

The field of compositional generalization is currently experiencing a renaissance in AI, as novel problem settings and algorithms motivated by various practical applications are being introduced, building on top of the classical compositional generalization problem.

Paper
Add Code

Non-Stationary Bandits with Auto-Regressive Temporal Dependency

no code implementations • NeurIPS 2023 • Qinyi Chen, Negin Golrezaei, Djallel Bouneffouf

Traditional multi-armed bandit (MAB) frameworks, predominantly examined under stochastic or adversarial settings, often overlook the temporal dynamics inherent in many real-world applications such as recommendation systems and online advertising.

Marketing Recommendation Systems +1

Paper
Add Code

Working Alliance Transformer for Psychotherapy Dialogue Classification

1 code implementation • 27 Oct 2022 • Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf

As a predictive measure of the treatment outcome in psychotherapy, the working alliance measures the agreement of the patient and the therapist in terms of their bond, task and goal.

Classification

Paper
Code

Survey on Applications of Neurosymbolic Artificial Intelligence

no code implementations • 8 Sep 2022 • Djallel Bouneffouf, Charu C. Aggarwal

In recent years, the Neurosymbolic framework has attracted a lot of attention in various applications, from recommender systems and information retrieval to healthcare and finance.

Information Retrieval Recommendation Systems +1

Paper
Add Code

SupervisorBot: NLP-Annotated Real-Time Recommendations of Psychotherapy Treatment Strategies with Deep Reinforcement Learning

no code implementations • 27 Aug 2022 • Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf

We propose a recommendation system that suggests treatment strategies to a therapist during the psychotherapy session in real-time.

Reinforcement Learning (RL) Sentence

Paper
Add Code

Targeted Advertising on Social Networks Using Online Variational Tensor Regression

no code implementations • 22 Aug 2022 • Tsuyoshi Idé, Keerthiram Murugesan, Djallel Bouneffouf, Naoki Abe

The proposed framework is designed to accommodate any number of feature vectors in the form of multi-mode tensor, thereby enabling to capture the heterogeneity that may exist over user preferences, products, and campaign strategies in a unified manner.

Marketing regression

Paper
Add Code

Neural Topic Modeling of Psychotherapy Sessions

no code implementations • 13 Apr 2022 • Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi, Ravi Tejwani

In this work, we compare different neural topic modeling methods in learning the topical propensities of different psychiatric conditions from the psychotherapy session transcripts parsed from speech recordings.

Time Series Time Series Analysis

Paper
Add Code

Deep Annotation of Therapeutic Working Alliance in Psychotherapy

no code implementations • 12 Apr 2022 • Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf

The therapeutic working alliance is an important predictor of the outcome of the psychotherapy treatment.

Paper
Add Code

Optimal Epidemic Control as a Contextual Combinatorial Bandit with Budget

1 code implementation • 30 Jun 2021 • Baihan Lin, Djallel Bouneffouf

In light of the COVID-19 pandemic, it is an open challenge and critical practical problem to find a optimal way to dynamically prescribe the best policies that balance both the governmental resources and epidemic control in different countries and regions.

Paper
Code

Bandit Limited Discrepancy Search and Application to Machine Learning Pipeline Optimization

no code implementations • ICML Workshop AutoML 2021 • Akihiro Kishimoto, Djallel Bouneffouf, Radu Marinescu, Parikshit Ram, Ambrish Rawat, Martin Wistuba, Paulito Pedregosa Palmes, Adi Botea

Optimizing a machine learning (ML) pipeline has been an important topic of AI and ML.

BIG-bench Machine Learning

Paper
Add Code

Reinforcement Learning with Algorithms from Probabilistic Structure Estimation

1 code implementation • 15 Mar 2021 • Jonathan P. Epperlein, Roman Overko, Sergiy Zhuk, Christopher King, Djallel Bouneffouf, Andrew Cullen, Robert Shorten

In some cases, the environment is not influenced by the actions of the RL agent, in which case the problem can be modeled as a contextual multi-armed bandit and lightweight myopic algorithms can be employed.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Etat de l'art sur l'application des bandits multi-bras

no code implementations • 4 Jan 2021 • Djallel Bouneffouf

The Multi-armed bandit offer the advantage to learn and exploit the already learnt knowledge at the same time.

Thompson Sampling

Paper
Add Code

Predicting human decision making in psychological tasks with recurrent neural networks

1 code implementation • 22 Oct 2020 • Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi

Unlike traditional time series, the action sequences of human decision making usually involve many cognitive processes such as beliefs, desires, intentions, and theory of mind, i. e., what others are thinking.

Decision Making Time Series +1

Paper
Code

Double-Linear Thompson Sampling for Context-Attentive Bandits

no code implementations • 15 Oct 2020 • Djallel Bouneffouf, Raphaël Féraud, Sohini Upadhyay, Yasaman Khazaeni, Irina Rish

In this paper, we analyze and extend an online learning framework known as Context-Attentive Bandit, motivated by various practical applications, from medical diagnosis to dialog systems, where due to observation costs only a small subset of a potentially large number of context variables can be observed at each iteration;however, the agent has a freedom to choose which variables to observe.

Medical Diagnosis Thompson Sampling

Paper
Add Code

Learning to Generate Image Source-Agnostic Universal Adversarial Perturbations

no code implementations • 29 Sep 2020 • Pu Zhao, Parikshit Ram, Songtao Lu, Yuguang Yao, Djallel Bouneffouf, Xue Lin, Sijia Liu

The resulting scheme for meta-learning a UAP generator (i) has better performance (50% higher ASR) than baselines such as Projected Gradient Descent, (ii) has better performance (37% faster) than the vanilla L2O and MAML frameworks (when applicable), and (iii) is able to simultaneously handle UAP generation for different victim models and image data sources.

Adversarial Attack Bilevel Optimization +1

Paper
Add Code

Spectral Clustering using Eigenspectrum Shape Based Nystrom Sampling

no code implementations • 21 Jul 2020 • Djallel Bouneffouf

Spectral clustering has shown a superior performance in analyzing the cluster structure.

Clustering

Paper
Add Code

Computing the Dirichlet-Multinomial Log-Likelihood Function

no code implementations • 17 Jul 2020 • Djallel Bouneffouf

Dirichlet-multinomial (DMN) distribution is commonly used to model over-dispersion in count data.

Paper
Add Code

Contextual Bandit with Missing Rewards

no code implementations • 13 Jul 2020 • Djallel Bouneffouf, Sohini Upadhyay, Yasaman Khazaeni

We consider a novel variant of the contextual bandit problem (i. e., the multi-armed bandit with side-information, or context, available to a decision-maker) where the reward associated with each context-based decision may not always be observed("missing rewards").

Clustering

Paper
Add Code

Online learning with Corrupted context: Corrupted Contextual Bandits

no code implementations • 26 Jun 2020 • Djallel Bouneffouf

We consider a novel variant of the contextual bandit problem (i. e., the multi-armed bandit with side-information, or context, available to a decision-maker) where the context used at each decision may be corrupted ("useless context").

Multi-Armed Bandits

Paper
Add Code

Solving Constrained CASH Problems with ADMM

no code implementations • 17 Jun 2020 • Parikshit Ram, Sijia Liu, Deepak Vijaykeerthi, Dakuo Wang, Djallel Bouneffouf, Greg Bramble, Horst Samulowitz, Alexander G. Gray

The CASH problem has been widely studied in the context of automated configurations of machine learning (ML) pipelines and various solvers and toolkits are available.

BIG-bench Machine Learning Fairness

Paper
Add Code

Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior

1 code implementation • 9 Jun 2020 • Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi

As an important psychological and social experiment, the Iterated Prisoner's Dilemma (IPD) treats the choice to cooperate or defect as an atomic action.

Multi-Armed Bandits reinforcement-learning +1

Paper
Code

Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL

1 code implementation • 10 May 2020 • Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf, Jenna Reinen, Irina Rish

Artificial behavioral agents are often evaluated based on their consistent behaviors and performance to take sequential actions in an environment to maximize some notion of cumulative reward.

Decision Making Multi-Armed Bandits +1

Paper
Code

Hyper-parameter Tuning for the Contextual Bandit

no code implementations • 4 May 2020 • Djallel Bouneffouf, Emmanuelle Claeys

We study here the problem of learning the exploration exploitation trade-off in the contextual bandit problem with linear reward function setting.

Paper
Add Code

How can AI Automate End-to-End Data Science?

no code implementations • 22 Oct 2019 • Charu Aggarwal, Djallel Bouneffouf, Horst Samulowitz, Beat Buesser, Thanh Hoang, Udayan Khurana, Sijia Liu, Tejaswini Pedapati, Parikshit Ram, Ambrish Rawat, Martin Wistuba, Alexander Gray

Data science is labor-intensive and human experts are scarce but heavily involved in every aspect of it.

Paper
Add Code

Reinforcement Learning Models of Human Behavior: Reward Processing in Mental Disorders

no code implementations • NeurIPS Workshop Neuro_AI 2019 • Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf, Jenna Reinen, Irina Rish

Drawing an inspiration from behavioral studies of human decision making, we propose here a general parametric framework for a reinforcement learning problem, which extends the standard Q-learning approach to incorporate a two-stream framework of reward processing with biases biologically associated with several neurological and psychiatric conditions, including Parkinson's and Alzheimer's diseases, attention-deficit/hyperactivity disorder (ADHD), addiction, and chronic pain.

Decision Making Q-Learning +3

Paper
Add Code

Split Q Learning: Reinforcement Learning with Two-Stream Rewards

1 code implementation • 21 Jun 2019 • Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi

Decision Making Q-Learning +4

Paper
Code

A Story of Two Streams: Reinforcement Learning Models from Human Behavior and Neuropsychiatry

1 code implementation • 21 Jun 2019 • Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf, Jenna Reinen, Irina Rish

Drawing an inspiration from behavioral studies of human decision making, we propose here a more general and flexible parametric framework for reinforcement learning that extends standard Q-learning to a two-stream model for processing positive and negative rewards, and allows to incorporate a wide range of reward-processing biases -- an important component of human decision making which can help us better understand a wide spectrum of multi-agent interactions in complex real-world socioeconomic systems, as well as various neuropsychiatric conditions associated with disruptions in normal reward processing.

Decision Making Q-Learning +2

Paper
Code

Optimal Exploitation of Clustering and History Information in Multi-Armed Bandit

no code implementations • 31 May 2019 • Djallel Bouneffouf, Srinivasan Parthasarathy, Horst Samulowitz, Martin Wistub

We consider the stochastic multi-armed bandit problem and the contextual bandit problem with historical observations and pre-clustered arms.

Clustering

Paper
Add Code

An ADMM Based Framework for AutoML Pipeline Configuration

no code implementations • 1 May 2019 • Sijia Liu, Parikshit Ram, Deepak Vijaykeerthy, Djallel Bouneffouf, Gregory Bramble, Horst Samulowitz, Dakuo Wang, Andrew Conn, Alexander Gray

We study the AutoML problem of automatically configuring machine learning pipelines by jointly selecting algorithms and their appropriate hyper-parameters for all steps in supervised learning pipelines.

AutoML Binary Classification

Paper
Add Code

A Survey on Practical Applications of Multi-Armed and Contextual Bandits

no code implementations • 2 Apr 2019 • Djallel Bouneffouf, Irina Rish

In recent years, multi-armed bandit (MAB) framework has attracted a lot of attention in various applications, from recommender systems and information retrieval to healthcare and finance, due to its stellar performance combined with certain attractive properties, such as learning from less feedback.

Information Retrieval Multi-Armed Bandits +2

Paper
Add Code

Interpretable Multi-Objective Reinforcement Learning through Policy Orchestration

no code implementations • 21 Sep 2018 • Ritesh Noothigattu, Djallel Bouneffouf, Nicholas Mattei, Rachita Chandra, Piyush Madan, Kush Varshney, Murray Campbell, Moninder Singh, Francesca Rossi

To ensure that agents behave in ways aligned with the values of the societies in which they operate, we must develop techniques that allow these agents to not only maximize their reward in an environment, but also to learn and follow the implicit constraints of society.

Multi-Objective Reinforcement Learning reinforcement-learning

Paper
Add Code

Incorporating Behavioral Constraints in Online AI Systems

no code implementations • 15 Sep 2018 • Avinash Balakrishnan, Djallel Bouneffouf, Nicholas Mattei, Francesca Rossi

To define this agent, we propose to adopt a novel extension to the classical contextual multi-armed bandit setting and we provide a new algorithm called Behavior Constrained Thompson Sampling (BCTS) that allows for online learning while obeying exogenous constraints.

Thompson Sampling

Paper
Add Code

Beyond Backprop: Online Alternating Minimization with Auxiliary Variables

1 code implementation • 24 Jun 2018 • Anna Choromanska, Benjamin Cowen, Sadhana Kumaravel, Ronny Luss, Mattia Rigotti, Irina Rish, Brian Kingsbury, Paolo DiAchille, Viatcheslav Gurev, Ravi Tejwani, Djallel Bouneffouf

Despite significant recent advances in deep neural networks, training them remains a challenge due to the highly non-convex nature of the objective function.

Paper
Code

Contextual Bandit with Adaptive Feature Extraction

1 code implementation • 3 Feb 2018 • Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi, Irina Rish

Our experiments on a variety of datasets, and both in stationary and non-stationary environments of several kinds demonstrate clear advantages of the proposed adaptive representation learning over the standard contextual bandit based on "raw" input contexts.

Clustering Decision Making +2

Paper
Code

Scalable Recollections for Continual Lifelong Learning

no code implementations • 17 Nov 2017 • Matthew Riemer, Tim Klinger, Djallel Bouneffouf, Michele Franceschini

Given the recent success of Deep Learning applied to a variety of single tasks, it is natural to consider more human-realistic settings.

Paper
Add Code

Bandit Models of Human Behavior: Reward Processing in Mental Disorders

no code implementations • 7 Jun 2017 • Djallel Bouneffouf, Irina Rish, Guillermo A. Cecchi

Drawing an inspiration from behavioral studies of human decision making, we propose here a general parametric framework for multi-armed bandit problem, which extends the standard Thompson Sampling approach to incorporate reward processing biases associated with several neurological and psychiatric conditions, including Parkinson's and Alzheimer's diseases, attention-deficit/hyperactivity disorder (ADHD), addiction, and chronic pain.

Decision Making Thompson Sampling

Paper
Add Code

Context Attentive Bandits: Contextual Bandit with Restricted Context

no code implementations • 10 May 2017 • Djallel Bouneffouf, Irina Rish, Guillermo A. Cecchi, Raphael Feraud

We consider a novel formulation of the multi-armed bandit model, which we call the contextual bandit with restricted context, where only a limited number of features can be accessed by the learner at every iteration.

Recommendation Systems Thompson Sampling

Paper
Add Code

Multi-armed Bandit Problem with Known Trend

no code implementations • 28 Aug 2015 • Djallel Bouneffouf, Raphaël Feraud

We consider a variant of the multi-armed bandit model, which we call multi-armed bandit problem with known trend, where the gambler knows the shape of the reward function of each arm but not its distribution.

Active Learning

Paper
Add Code

Freshness-Aware Thompson Sampling

no code implementations • 29 Sep 2014 • Djallel Bouneffouf

In this sense, we propose to study the freshness of the user's content in CARS through the bandit problem.

Recommendation Systems Thompson Sampling

Paper
Add Code

A Neural Networks Committee for the Contextual Bandit Problem

no code implementations • 29 Sep 2014 • Robin Allesiardo, Raphael Feraud, Djallel Bouneffouf

This paper presents a new contextual bandit algorithm, NeuralBandit, which does not need hypothesis on stationarity of contexts and rewards.

Paper
Add Code

Exponentiated Gradient Exploration for Active Learning

no code implementations • 10 Aug 2014 • Djallel Bouneffouf

Active learning strategies respond to the costly labelling task in a supervised classification by selecting the most useful unlabelled examples in training a predictive model.

Active Learning General Classification

Paper
Add Code

R-UCB: a Contextual Bandit Algorithm for Risk-Aware Recommender Systems

no code implementations • 10 Aug 2014 • Djallel Bouneffouf

Mobile Context-Aware Recommender Systems can be naturally modelled as an exploration/exploitation trade-off (exr/exp) problem, where the system has to choose between maximizing its expected rewards dealing with its current knowledge (exploitation) and learning more about the unknown user's preferences to improve its knowledge (exploration).

Recommendation Systems

Paper
Add Code

Recommandation mobile, sensible au contexte de contenus évolutifs: Contextuel-E-Greedy

no code implementations • 9 Feb 2014 • Djallel Bouneffouf

We introduce in this paper an algorithm named Contextuel-E-Greedy that tackles the dynamicity of the user's content.

Paper
Add Code

Exponentiated Gradient LINUCB for Contextual Multi-Armed Bandits

no code implementations • 10 May 2013 • Djallel Bouneffouf

We present Exponentiated Gradient LINUCB, an algorithm for con-textual multi-armed bandits.

Multi-Armed Bandits

Paper
Add Code

Hybrid Q-Learning Applied to Ubiquitous recommender system

no code implementations • 10 Mar 2013 • Djallel Bouneffouf

Ubiquitous information access becomes more and more important nowadays and research is aimed at making it adapted to users.

Q-Learning Recommendation Systems +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.