Search Results for author: Scott Sanner

Found 33 papers, 12 papers with code

Sample-efficient Iterative Lower Bound Optimization of Deep Reactive Policies for Planning in Continuous MDPs

no code implementations • 23 Mar 2022 • Siow Meng Low, Akshat Kumar, Scott Sanner

This novel formulation of DRP learning as iterative lower bound optimization (ILBO) is particularly appealing because (i) each step is structurally easier to optimize than the overall objective, (ii) it guarantees a monotonically improving objective under certain theoretical conditions, and (iii) it reuses samples between iterations thus lowering sample complexity.
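The mechanics of iterative lower bound optimization can be illustrated with a toy sketch (this is not the paper's DRP objective, and the surrogate below is an assumed proximal-style bound chosen only for illustration): repeatedly maximize a surrogate that lower-bounds the objective and touches it at the current iterate, so each step can only improve the true objective.

```python
def f(x):
    # Toy objective to maximize (the paper's objective is a DRP value, not this).
    return -(x - 3.0) ** 2

def ilbo_step(x_t, c=2.0):
    # Surrogate g(x | x_t) = f(x) - c * (x - x_t)**2 lower-bounds f and is
    # tight at x_t; for this quadratic f its maximizer has a closed form.
    return (3.0 + c * x_t) / (1.0 + c)

x, vals = 0.0, []
for _ in range(30):
    vals.append(f(x))
    x = ilbo_step(x)
vals.append(f(x))
# Monotone improvement: each surrogate maximization can only increase f.
assert all(b >= a for a, b in zip(vals, vals[1:]))
```

The monotonicity property (iii) falls out directly: the surrogate equals f at the current iterate and never exceeds f elsewhere, so any surrogate improvement is an objective improvement.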

TransCAM: Transformer Attention-based CAM Refinement for Weakly Supervised Semantic Segmentation

1 code implementation • 14 Mar 2022 • Ruiwen Li, Zheda Mai, Chiheb Trabelsi, Zhibo Zhang, Jongseong Jang, Scott Sanner

In this paper, we propose TransCAM, a Conformer-based solution to WSSS that explicitly leverages the attention weights from the transformer branch of the Conformer to refine the CAM generated from the CNN branch.

Weakly-Supervised Semantic Segmentation
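The core refinement step can be sketched as a matrix product of attention weights over flattened CAM maps; the shapes and the uniform random inputs here are illustrative placeholders, not the Conformer's actual tensors.

```python
import numpy as np

rng = np.random.default_rng(0)
H = W = 14              # spatial grid of CAM tokens (illustrative size)
C = 21                  # number of foreground classes (illustrative)
N = H * W

cam = rng.random((N, C))       # CNN-branch CAM, flattened to N tokens x C classes
attn = rng.random((N, N))      # transformer attention, e.g. averaged over heads
attn /= attn.sum(axis=1, keepdims=True)   # row-stochastic: each row sums to 1

refined = attn @ cam                       # propagate class scores along attention affinities
refined_maps = refined.T.reshape(C, H, W)  # back to per-class spatial maps
```

Each token's refined class score becomes an attention-weighted average of all tokens' CNN scores, which is how attention affinities can spread activation beyond the most discriminative regions.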

Unintended Bias in Language Model-driven Conversational Recommendation

no code implementations • 17 Jan 2022 • Tianshu Shen, Jiaru Li, Mohamed Reda Bouadjenek, Zheda Mai, Scott Sanner

Conversational Recommendation Systems (CRSs) have recently started to leverage pretrained language models (LMs) such as BERT for their ability to semantically interpret a wide range of preference statement variations.

Language Modelling • Pretrained Language Models +1

ExCon: Explanation-driven Supervised Contrastive Learning for Image Classification

1 code implementation • 28 Nov 2021 • Zhibo Zhang, Jongseong Jang, Chiheb Trabelsi, Ruiwen Li, Scott Sanner, Yeonjeong Jeong, Dongsub Shim

Contrastive learning has led to substantial improvements in the quality of learned embedding representations for tasks such as image classification.

Adversarial Robustness • Classification +2

Multi-axis Attentive Prediction for Sparse Event Data: An Application to Crime Prediction

1 code implementation • 5 Oct 2021 • Yi Sui, Ga Wu, Scott Sanner

We additionally introduce a novel Frobenius norm-based contrastive learning objective to improve latent representational generalization. Empirically, we validate MAPSED on two publicly accessible urban crime datasets for spatiotemporal sparse event prediction, where MAPSED outperforms both classical and state-of-the-art deep learning models.

Contrastive Learning • Crime Prediction

Planning with Learned Binarized Neural Networks Benchmarks for MaxSAT Evaluation 2021

no code implementations • 2 Aug 2021 • Buser Say, Scott Sanner, Jo Devriendt, Jakob Nordström, Peter J. Stuckey

This document provides a brief introduction to the learned automated planning problem where the state transition function takes the form of a binarized neural network (BNN), presents a general MaxSAT encoding for this problem, and describes the four benchmark domains submitted to MaxSAT Evaluation 2021: Navigation, Inventory Control, System Administrator, and Cellda.

RAPTOR: End-to-end Risk-Aware MDP Planning and Policy Learning by Backpropagation

no code implementations • 14 Jun 2021 • Noah Patton, Jihwan Jeong, Michael Gimelfarb, Scott Sanner

The direct optimization of this empirical objective in an end-to-end manner is called the risk-averse straight-line plan, which commits to a sequence of actions in advance and can be sub-optimal in highly stochastic domains.

Risk-Aware Transfer in Reinforcement Learning using Successor Features

no code implementations • NeurIPS 2021 • Michael Gimelfarb, André Barreto, Scott Sanner, Chi-Guhn Lee

Sample efficiency and risk-awareness are central to the development of practical reinforcement learning (RL) for complex decision-making.

Decision Making • reinforcement-learning +1

Supervised Contrastive Replay: Revisiting the Nearest Class Mean Classifier in Online Class-Incremental Continual Learning

1 code implementation • 22 Mar 2021 • Zheda Mai, Ruiwen Li, Hyunwoo Kim, Scott Sanner

Online class-incremental continual learning (CL) studies the problem of learning new classes continually from an online non-stationary data stream, intending to adapt to new data while mitigating catastrophic forgetting.

Continual Learning • online learning
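The nearest class mean (NCM) classifier revisited here is simple enough to sketch in a few lines: classify an embedding by its closest per-class mean. The 2-D embeddings and class means below are hypothetical stand-ins for the means a continual learner would compute from replayed exemplars.

```python
import numpy as np

def ncm_predict(x, class_means):
    # Assign x to the class whose (embedding-space) mean is nearest in L2 distance.
    classes = sorted(class_means)
    dists = [np.linalg.norm(x - class_means[c]) for c in classes]
    return classes[int(np.argmin(dists))]

# Per-class means, e.g. computed from replayed exemplar embeddings (toy values).
means = {0: np.array([0.0, 0.0]), 1: np.array([5.0, 5.0])}
print(ncm_predict(np.array([4.2, 4.8]), means))  # → 1
```

Because predictions depend only on class means rather than a trained output layer, the classifier sidesteps the bias toward recently seen classes that a softmax head accumulates in class-incremental training.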

Online Continual Learning in Image Classification: An Empirical Survey

1 code implementation • 25 Jan 2021 • Zheda Mai, Ruiwen Li, Jihwan Jeong, David Quispe, Hyunwoo Kim, Scott Sanner

To better understand the relative advantages of various approaches and the settings where they work best, this survey aims to (1) compare state-of-the-art methods such as MIR, iCARL, and GDumb and determine which works best at different experimental settings; (2) determine if the best class-incremental methods are also competitive in the domain-incremental setting; (3) evaluate the performance of seven simple but effective tricks, such as the "review" trick and the nearest class mean (NCM) classifier, to assess their relative impact.

Classification • Continual Learning +2

Attentive Autoencoders for Multifaceted Preference Learning in One-class Collaborative Filtering

no code implementations • 24 Oct 2020 • Zheda Mai, Ga Wu, Kai Luo, Scott Sanner

In order to capture multifaceted user preferences, existing recommender systems either increase the encoding complexity or extend the latent representation dimension.

Collaborative Filtering • Recommendation Systems

Online Class-Incremental Continual Learning with Adversarial Shapley Value

1 code implementation • 31 Aug 2020 • Dongsub Shim, Zheda Mai, Jihwan Jeong, Scott Sanner, Hyunwoo Kim, Jongseong Jang

As image-based deep learning becomes pervasive on every device, from cell phones to smart watches, there is a growing need to develop methods that continually learn from data while minimizing memory footprint and power consumption.

Continual Learning

Noise Contrastive Estimation for Autoencoding-based One-Class Collaborative Filtering

no code implementations • 3 Aug 2020 • Jin Peng Zhou, Ga Wu, Zheda Mai, Scott Sanner

One-class collaborative filtering (OC-CF) is a common class of recommendation problem where only the positive class is explicitly observed (e.g., purchases, clicks).

Collaborative Filtering

Batch-level Experience Replay with Review for Continual Learning

1 code implementation • 11 Jul 2020 • Zheda Mai, Hyunwoo Kim, Jihwan Jeong, Scott Sanner

Continual learning is a branch of deep learning that seeks to strike a balance between learning stability and plasticity.

Continual Learning

ε-BMC: A Bayesian Ensemble Approach to Epsilon-Greedy Exploration in Model-Free Reinforcement Learning

1 code implementation • 2 Jul 2020 • Michael Gimelfarb, Scott Sanner, Chi-Guhn Lee

Resolving the exploration-exploitation trade-off remains a fundamental problem in the design and implementation of reinforcement learning (RL) algorithms.
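For reference, the ε-greedy rule whose exploration rate ε-BMC adapts looks like the sketch below; the paper's contribution is choosing epsilon from a Bayesian ensemble posterior, which this minimal version does not implement.

```python
import random

def epsilon_greedy(q_values, epsilon):
    # With probability epsilon, explore uniformly at random;
    # otherwise exploit the action with the highest Q-value.
    if random.random() < epsilon:
        return random.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])

print(epsilon_greedy([0.1, 0.9, 0.3], epsilon=0.0))  # → 1 (pure exploitation)
```

With epsilon fixed, the schedule must be hand-tuned per domain; adapting it from data is precisely what motivates the Bayesian ensemble treatment.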

Bayesian Experience Reuse for Learning from Multiple Demonstrators

no code implementations • 10 Jun 2020 • Michael Gimelfarb, Scott Sanner, Chi-Guhn Lee

We demonstrate the effectiveness of this approach for static optimization of smooth functions, and transfer learning in a high-dimensional supply chain problem with cost uncertainty.

Transfer Learning

Contextual Policy Transfer in Reinforcement Learning Domains via Deep Mixtures-of-Experts

no code implementations • 29 Feb 2020 • Michael Gimelfarb, Scott Sanner, Chi-Guhn Lee

In this paper, we assume knowledge of estimated source task dynamics and policies, and common sub-goals but different dynamics.

OpenAI Gym • Q-Learning +1

Optimizing Search API Queries for Twitter Topic Classifiers Using a Maximum Set Coverage Approach

no code implementations • 23 Apr 2019 • Kasra Safari, Scott Sanner

Thus, it is critically important to query the Twitter API relative to the intended topical classifier in a way that minimizes the amount of negatively classified data retrieved.

Reward Potentials for Planning with Learned Neural Network Transition Models

no code implementations • 19 Apr 2019 • Buser Say, Scott Sanner, Sylvie Thiébaux

We then strengthen the linear relaxation of the underlying MILP model by introducing constraints to bound the reward function based on the precomputed reward potentials.

Scalable Planning with Deep Neural Network Learned Transition Models

no code implementations • 5 Apr 2019 • Ga Wu, Buser Say, Scott Sanner

But there remains one major problem for the task of control -- how can we plan with deep network learned transition models without resorting to Monte Carlo Tree Search and other black-box transition model techniques that ignore model structure and do not easily extend to mixed discrete and continuous domains?

Reinforcement Learning with Multiple Experts: A Bayesian Model Combination Approach

no code implementations • NeurIPS 2018 • Michael Gimelfarb, Scott Sanner, Chi-Guhn Lee

Potential based reward shaping is a powerful technique for accelerating convergence of reinforcement learning algorithms.

reinforcement-learning
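The potential-based shaping term the paper builds on has a standard closed form (Ng et al., 1999); the weighted combination of expert potentials below is a hypothetical stand-in for the paper's Bayesian model combination, with assumed weights rather than a learned posterior.

```python
def shaped_reward(r, s, s_next, potential, gamma=0.99):
    # F(s, s') = gamma * Phi(s') - Phi(s); adding F to the reward
    # preserves the optimal policy of the underlying MDP.
    return r + gamma * potential(s_next) - potential(s)

def combined_potential(experts, weights):
    # Hypothetical posterior-weighted mixture of expert potential functions.
    return lambda s: sum(w * phi(s) for w, phi in zip(weights, experts))

phi = combined_potential([lambda s: s, lambda s: 2.0 * s], [0.5, 0.5])
print(shaped_reward(0.0, 1.0, 2.0, phi, gamma=1.0))  # → 1.5
```

Because any potential function yields policy-invariant shaping, weighting multiple expert potentials is safe even when some experts are unreliable; the open question the paper addresses is how to weight them from experience.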

Compact and Efficient Encodings for Planning in Factored State and Action Spaces with Learned Binarized Neural Network Transition Models

no code implementations • 26 Nov 2018 • Buser Say, Scott Sanner

In this paper, we leverage the efficiency of Binarized Neural Networks (BNNs) to learn complex state transition models of planning domains with discretized factored state and action spaces.

Aesthetic Features for Personalized Photo Recommendation

no code implementations • 31 Aug 2018 • Yu Qing Zhou, Ga Wu, Scott Sanner, Putra Manggala

Many photography websites such as Flickr, 500px, Unsplash, and Adobe Behance are used by amateur and professional photography enthusiasts.

Collaborative Filtering • Image Retrieval +1

Conditional Inference in Pre-trained Variational Autoencoders via Cross-coding

1 code implementation • ICLR 2019 • Ga Wu, Justin Domke, Scott Sanner

Variational Autoencoders (VAEs) are a popular generative model, but one in which conditional inference can be challenging.

Scalable Planning with Tensorflow for Hybrid Nonlinear Domains

no code implementations • NeurIPS 2017 • Ga Wu, Buser Say, Scott Sanner

Given recent deep learning results that demonstrate the ability to effectively optimize high-dimensional non-convex functions with gradient descent optimization on GPUs, we ask in this paper whether symbolic gradient optimization tools such as Tensorflow can be effective for planning in hybrid (mixed discrete and continuous) nonlinear domains with high-dimensional state and action spaces.

Stochastic Planning and Lifted Inference

no code implementations • 4 Jan 2017 • Roni Khardon, Scott Sanner

Lifted probabilistic inference (Poole, 2003) and symbolic dynamic programming for lifted stochastic planning (Boutilier et al., 2001) were introduced around the same time as algorithmic efforts to use abstraction in stochastic systems.

Decision Making

Expecting to be HIP: Hawkes Intensity Processes for Social Media Popularity

1 code implementation • 19 Feb 2016 • Marian-Andrei Rizoiu, Lexing Xie, Scott Sanner, Manuel Cebrian, Honglin Yu, Pascal Van Hentenryck

Modeling and predicting the popularity of online content is a significant problem for the practice of information dissemination, advertising, and consumption.

Social and Information Networks
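A minimal discrete-time sketch of a Hawkes-style self-exciting intensity conveys the modeling idea: current popularity is exogenous input plus a power-law-weighted sum of past popularity. The parameter names and values below are illustrative assumptions, not fitted HIP parameters.

```python
def hip_series(s, mu=1.0, C=0.5, c=1.0, theta=0.5):
    # xi[t] = mu * s[t] + C * sum_{k<t} xi[k] * (t - k + c)^(-(1 + theta))
    # s: exogenous input series (e.g. promotion); xi: resulting popularity.
    xi = []
    for t in range(len(s)):
        endo = C * sum(xi[k] * (t - k + c) ** (-(1.0 + theta)) for k in range(t))
        xi.append(mu * s[t] + endo)
    return xi

series = hip_series([1.0, 0.0, 0.0, 0.0])  # single exogenous impulse at t = 0
```

A single impulse of attention produces a decaying tail of endogenously generated popularity, which is the self-exciting behavior the model uses to explain and forecast content popularity.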

Bounded Approximate Symbolic Dynamic Programming for Hybrid MDPs

no code implementations • 26 Sep 2013 • Luis Gustavo Vianna, Scott Sanner, Leliane Nunes de Barros

Recent advances in symbolic dynamic programming (SDP) combined with the extended algebraic decision diagram (XADD) data structure have provided exact solutions for mixed discrete and continuous (hybrid) MDPs with piecewise linear dynamics and continuous actions.

Symbolic Dynamic Programming for Continuous State and Observation POMDPs

no code implementations • NeurIPS 2012 • Zahra Zamani, Scott Sanner, Pascal Poupart, Kristian Kersting

In recent years, point-based value iteration methods have proven to be extremely effective techniques for finding (approximately) optimal dynamic programming solutions to POMDPs when an initial set of belief states is known.

Decision Making

Gaussian Process Preference Elicitation

no code implementations • NeurIPS 2010 • Shengbo Guo, Scott Sanner, Edwin V. Bonilla

Bayesian approaches to preference elicitation (PE) are particularly attractive due to their ability to explicitly model uncertainty in users' latent utility functions.
