Search Results for author: Pascal Poupart

Found 72 papers, 28 papers with code

Quantifying and Improving Transferability in Domain Generalization

2 code implementations • NeurIPS 2021 • Guojun Zhang, Han Zhao, YaoLiang Yu, Pascal Poupart

We then prove that our transferability can be estimated with enough samples and give a new upper bound for the target error based on our transferability.

Domain Generalization Out-of-Distribution Generalization

1,331

Paper
Code

SPFlow: An Easy and Extensible Library for Deep Probabilistic Learning using Sum-Product Networks

1 code implementation • 11 Jan 2019 • Alejandro Molina, Antonio Vergari, Karl Stelzner, Robert Peharz, Pranav Subramani, Nicola Di Mauro, Pascal Poupart, Kristian Kersting

We introduce SPFlow, an open-source Python library providing a simple interface to inference, learning and manipulation routines for deep and tractable probabilistic models called Sum-Product Networks (SPNs).

281

Paper
Code

Time2Vec: Learning a Vector Representation of Time

6 code implementations • 11 Jul 2019 • Seyed Mehran Kazemi, Rishab Goel, Sepehr Eghbali, Janahan Ramanan, Jaspreet Sahota, Sanjay Thakur, Stella Wu, Cathal Smyth, Pascal Poupart, Marcus Brubaker

Time is an important feature in many applications involving events that occur synchronously and/or asynchronously.

238

Paper
Code

Matrix Nets: A New Deep Architecture for Object Detection

2 code implementations • 13 Aug 2019 • Abdullah Rashwan, Agastya Kalra, Pascal Poupart

We present Matrix Nets (xNets), a new deep architecture for object detection.

Ranked #97 on Object Detection on COCO test-dev

Object object-detection +1

171

Paper
Code

MatrixNets: A New Scale and Aspect Ratio Aware Architecture for Object Detection

1 code implementation • 9 Jan 2020 • Abdullah Rashwan, Rishav Agarwal, Agastya Kalra, Pascal Poupart

We present MatrixNets (xNets), a new deep architecture for object detection.

Object object-detection +1

171

Paper
Code

Diachronic Embedding for Temporal Knowledge Graph Completion

2 code implementations • 6 Jul 2019 • Rishab Goel, Seyed Mehran Kazemi, Marcus Brubaker, Pascal Poupart

In this paper, we build novel models for temporal KG completion through equipping static models with a diachronic entity embedding function which provides the characteristics of entities at any point in time.

Temporal Knowledge Graph Completion

Paper
Code

Variational Attention for Sequence-to-Sequence Models

2 code implementations • COLING 2018 • Hareesh Bahuleyan, Lili Mou, Olga Vechtomova, Pascal Poupart

The variational encoder-decoder (VED) encodes source information as a set of random variables using a neural network, which in turn is decoded into target data using another neural network.

Paper
Code

Unsupervised Video Object Segmentation for Deep Reinforcement Learning

1 code implementation • NeurIPS 2018 • Vik Goel, Jameson Weng, Pascal Poupart

The detection of moving objects is done in an unsupervised way by exploiting structure from motion.

Atari Games Decision Making +6

Paper
Code

Learning Dynamic Belief Graphs to Generalize on Text-Based Games

1 code implementation • NeurIPS 2020 • Ashutosh Adhikari, Xingdi Yuan, Marc-Alexandre Côté, Mikuláš Zelinka, Marc-Antoine Rondeau, Romain Laroche, Pascal Poupart, Jian Tang, Adam Trischler, William L. Hamilton

Playing text-based games requires skills in processing natural language and sequential decision making.

Decision Making Knowledge Graphs +2

Paper
Code

FedFormer: Contextual Federation with Attention in Reinforcement Learning

1 code implementation • 27 May 2022 • Liam Hebert, Lukasz Golab, Pascal Poupart, Robin Cohen

We evaluate our methods on the Meta-World environment and find that our approach yields significant improvements over FedAvg and non-federated Soft Actor-Critic single-agent methods.

Federated Learning reinforcement-learning +1

Paper
Code

Multi Type Mean Field Reinforcement Learning

1 code implementation • 6 Feb 2020 • Sriram Ganapathi Subramanian, Pascal Poupart, Matthew E. Taylor, Nidhi Hegde

We consider two different kinds of mean field environments: a) Games where agents belong to predefined types that are known a priori and b) Games where the type of each agent is unknown and therefore must be learned based on observations.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Code

NTS-NOTEARS: Learning Nonparametric DBNs With Prior Knowledge

1 code implementation • 9 Sep 2021 • Xiangyu Sun, Oliver Schulte, Guiliang Liu, Pascal Poupart

We describe NTS-NOTEARS, a score-based structure learning method for time-series data to learn dynamic Bayesian networks (DBNs) that captures nonlinear, lagged (inter-slice) and instantaneous (intra-slice) relations among variables.

Time Series Time Series Analysis

Paper
Code

Progressive Memory Banks for Incremental Domain Adaptation

1 code implementation • ICLR 2020 • Nabiha Asghar, Lili Mou, Kira A. Selby, Kevin D. Pantasdo, Pascal Poupart, Xin Jiang

The memory bank provides a natural way of IDA: when adapting our model to a new domain, we progressively add new slots to the memory bank, which increases the number of parameters, and thus the model capacity.

Domain Adaptation

Paper
Code

Online Structure Learning for Sum-Product Networks with Gaussian Leaves

1 code implementation • 19 Jan 2017 • Wilson Hsu, Agastya Kalra, Pascal Poupart

Sum-product networks have recently emerged as an attractive representation due to their dual view as a special type of deep neural network with clear semantics and a special type of probabilistic graphical model for which inference is always tractable.

valid

Paper
Code

Benchmarking Constraint Inference in Inverse Reinforcement Learning

2 code implementations • 20 Jun 2022 • Guiliang Liu, Yudong Luo, Ashish Gaurav, Kasra Rezaee, Pascal Poupart

When deploying Reinforcement Learning (RL) agents into a physical system, we must ensure that these agents are well aware of the underlying constraints.

Autonomous Driving Benchmarking +2

Paper
Code

Unsupervised Multilingual Alignment using Wasserstein Barycenter

1 code implementation • 28 Jan 2020 • Xin Lian, Kshitij Jain, Jakub Truszkowski, Pascal Poupart, Yao-Liang Yu

We study unsupervised multilingual alignment, the problem of finding word-to-word translations between multiple languages without using any parallel data.

Ranked #1 on Word Alignment on en-es

Translation Unsupervised Machine Translation +1

Paper
Code

A Sober Look at LLMs for Material Discovery: Are They Actually Good for Bayesian Optimization Over Molecules?

1 code implementation • 7 Feb 2024 • Agustinus Kristiadi, Felix Strieth-Kalthoff, Marta Skreta, Pascal Poupart, Alán Aspuru-Guzik, Geoff Pleiss

Bayesian optimization (BO) is an essential part of such workflows, enabling scientists to leverage prior domain knowledge into efficient exploration of a large molecular space.

Bayesian Optimization Efficient Exploration

Paper
Code

Partially Observable Mean Field Reinforcement Learning

1 code implementation • 31 Dec 2020 • Sriram Ganapathi Subramanian, Matthew E. Taylor, Mark Crowley, Pascal Poupart

Traditional multi-agent reinforcement learning algorithms are not scalable to environments with more than a few agents, since these algorithms are exponential in the number of agents.

Multi-agent Reinforcement Learning Q-Learning Multiagent Systems

Paper
Code

WatClaimCheck: A new Dataset for Claim Entailment and Inference

1 code implementation • ACL 2022 • Kashif Khan, Ruizhe Wang, Pascal Poupart

We contribute a new dataset for the task of automated fact checking and an evaluation of state of the art algorithms.

Fact Checking Passage Retrieval +1

Paper
Code

Order-Planning Neural Text Generation From Structured Data

1 code implementation • 1 Sep 2017 • Lei Sha, Lili Mou, Tianyu Liu, Pascal Poupart, Sujian Li, Baobao Chang, Zhifang Sui

Generating texts from structured data (e. g., a table) is important for various natural language processing tasks such as question answering and dialog systems.

Question Answering Table-to-Text Generation

Paper
Code

Newton-type Methods for Minimax Optimization

1 code implementation • 25 Jun 2020 • Guojun Zhang, Kaiwen Wu, Pascal Poupart, Yao-Liang Yu

We prove their local convergence at strict local minimax points, which are surrogates of global solutions.

Reinforcement Learning (RL) Vocal Bursts Type Prediction

Paper
Code

On Improving Deep Reinforcement Learning for POMDPs

1 code implementation • 26 Apr 2017 • Pengfei Zhu, Xin Li, Pascal Poupart, Guanghui Miao

Deep Reinforcement Learning (RL) recently emerged as one of the most competitive approaches for learning in sequential decision making problems with fully observable environments, e. g., computer Go.

Atari Games Decision Making +4

Paper
Code

Learning Functions on Multiple Sets using Multi-Set Transformers

1 code implementation • 30 Jun 2022 • Kira Selby, Ahmad Rashid, Ivan Kobyzev, Mehdi Rezagholizadeh, Pascal Poupart

We propose a general deep architecture for learning functions on multiple permutation-invariant sets.

Paper
Code

Robust One Round Federated Learning with Predictive Space Bayesian Inference

1 code implementation • 20 Jun 2022 • Mohsin Hasan, Zehao Zhang, Kaiyang Guo, Mahdi Karami, Guojun Zhang, Xi Chen, Pascal Poupart

In contrast, our method performs the aggregation on the predictive posteriors, which are typically easier to approximate owing to the low-dimensionality of the output space.

Bayesian Inference Federated Learning

Paper
Code

On Improving Deep Reinforcement Learning for POMDPs

no code implementations • 17 Apr 2018 • Pengfei Zhu, Xin Li, Pascal Poupart, Guanghui Miao

Deep Reinforcement Learning (RL) recently emerged as one of the most competitive approaches for learning in sequential decision making problems with fully observable environments, e. g., computer Go.

Atari Games Decision Making +4

Paper
Add Code

Why Do Neural Dialog Systems Generate Short and Meaningless Replies? A Comparison between Dialog and Translation

no code implementations • 6 Dec 2017 • Bolin Wei, Shuai Lu, Lili Mou, Hao Zhou, Pascal Poupart, Ge Li, Zhi Jin

This paper addresses the question: Why do neural dialog systems generate short and meaningless replies?

Machine Translation Translation

Paper
Add Code

Affective Neural Response Generation

no code implementations • 12 Sep 2017 • Nabiha Asghar, Pascal Poupart, Jesse Hoey, Xin Jiang, Lili Mou

Existing neural conversational models process natural language primarily on a lexico-syntactic level, thereby ignoring one of the most crucial components of human-to-human dialogue: its affective content.

Response Generation Word Embeddings

Paper
Add Code

Deep Active Learning for Dialogue Generation

no code implementations • SEMEVAL 2017 • Nabiha Asghar, Pascal Poupart, Xin Jiang, Hang Li

We propose an online, end-to-end, neural generative conversational model for open-domain dialogue.

Active Learning Dialogue Generation +1

Paper
Add Code

Generative Mixture of Networks

no code implementations • 10 Feb 2017 • Ershad Banijamali, Ali Ghodsi, Pascal Poupart

The model consists of K networks that are trained together to learn the underlying distribution of a given data set.

Clustering

Paper
Add Code

Discovering Conversational Dependencies between Messages in Dialogs

no code implementations • 8 Dec 2016 • Wenchao Du, Pascal Poupart, Wei Xu

We investigate the task of inferring conversational dependencies between messages in one-on-one online chat, which has become one of the most popular forms of customer service.

Paper
Add Code

Online and Distributed learning of Gaussian mixture models by Bayesian Moment Matching

no code implementations • 19 Sep 2016 • Priyank Jaini, Pascal Poupart

The Gaussian mixture model is a classic technique for clustering and data modeling that is used in numerous applications.

Clustering

Paper
Add Code

A Unified Approach for Learning the Parameters of Sum-Product Networks

no code implementations • NeurIPS 2016 • Han Zhao, Pascal Poupart, Geoff Gordon

We present a unified approach for learning the parameters of Sum-Product networks (SPNs).

Paper
Add Code

Dynamic Sum Product Networks for Tractable Inference on Sequence Data (Extended Version)

no code implementations • 13 Nov 2015 • Mazen Melibari, Pascal Poupart, Prashant Doshi, George Trimponias

Since SPNs represent distributions over a fixed set of variables only, we propose dynamic sum product networks (DSPNs) as a generalization of SPNs for sequence data of varying length.

Paper
Add Code

On the Relationship between Sum-Product Networks and Bayesian Networks

no code implementations • 6 Jan 2015 • Han Zhao, Mazen Melibari, Pascal Poupart

We conclude the paper with some discussion of the implications of the proof and establish a connection between the depth of an SPN and a lower bound of the tree-width of its corresponding BN.

Paper
Add Code

Self-Adaptive Hierarchical Sentence Model

1 code implementation • 20 Apr 2015 • Han Zhao, Zhengdong Lu, Pascal Poupart

The ability to accurately model a sentence at varying stages (e. g., word-phrase-sentence) plays a central role in natural language processing.

Ranked #5 on Subjectivity Analysis on SUBJ

General Classification Sentence +1

Paper
Code

A Sober Look at Spectral Learning

no code implementations • 18 Jun 2014 • Han Zhao, Pascal Poupart

In contrast, maximum likelihood estimates may get trapped in local optima due to the non-convex nature of the likelihood function of latent variable models.

Paper
Add Code

Exploiting Structure in Weighted Model Counting Approaches to Probabilistic Inference

no code implementations • 16 Jan 2014 • Wei Li, Pascal Poupart, Peter van Beek

Previous studies have demonstrated that encoding a Bayesian network into a SAT formula and then performing weighted model counting using a backtracking search algorithm can be an effective method for exact inference.

Paper
Add Code

Overfitting at SemEval-2016 Task 3: Detecting Semantically Similar Questions in Community Question Answering Forums with Word Embeddings

no code implementations • SEMEVAL 2016 • Hujie Wang, Pascal Poupart

Community Question Answering Question Similarity +2

Paper
Add Code

Online Structure Learning for Feed-Forward and Recurrent Sum-Product Networks

no code implementations • NeurIPS 2018 • Agastya Kalra, Abdullah Rashwan, Wei-Shou Hsu, Pascal Poupart, Prashant Doshi, Georgios Trimponias

valid

Paper
Add Code

Deep Homogeneous Mixture Models: Representation, Separation, and Approximation

no code implementations • NeurIPS 2018 • Priyank Jaini, Pascal Poupart, Yao-Liang Yu

At their core, many unsupervised learning models provide a compact representation of homogeneous density mixtures, but their similarities and differences are not always clearly understood.

Density Estimation

Paper
Add Code

Monte-Carlo Tree Search for Constrained POMDPs

no code implementations • NeurIPS 2018 • Jongmin Lee, Geon-Hyeong Kim, Pascal Poupart, Kee-Eung Kim

In this paper, we present CC-POMCP (Cost-Constrained POMCP), an online MCTS algorithm for large CPOMDPs that leverages the optimization of LP-induced parameters and only requires a black-box simulator of the environment.

Decision Making

Paper
Add Code

Online Bayesian Moment Matching for Topic Modeling with Unknown Number of Topics

no code implementations • NeurIPS 2016 • Wei-Shou Hsu, Pascal Poupart

When the number of topics (or latent groups) is unknown, the Hierarchical Dirichlet Process (HDP) provides an elegant non-parametric extension; however, it is a complex model and it is difficult to incorporate prior knowledge since the distribution over topics is implicit.

Paper
Add Code

Symbolic Dynamic Programming for Continuous State and Observation POMDPs

no code implementations • NeurIPS 2012 • Zahra Zamani, Scott Sanner, Pascal Poupart, Kristian Kersting

In recent years, point- based value iteration methods have proven to be extremely effective techniques for ﬁnding (approximately) optimal dynamic programming solutions to POMDPs when an initial set of belief states is known.

Decision Making

Paper
Add Code

Cost-Sensitive Exploration in Bayesian Reinforcement Learning

no code implementations • NeurIPS 2012 • Dongho Kim, Kee-Eung Kim, Pascal Poupart

In this paper, we consider Bayesian reinforcement learning (BRL) where actions incur costs in addition to rewards, and thus exploration has to be constrained in terms of the expected total cost while learning to maximize the expected long-term total reward.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Automated Refinement of Bayes Networks' Parameters based on Test Ordering Constraints

no code implementations • NeurIPS 2011 • Omar Z. Khan, Pascal Poupart, John-Mark M. Agosta

We demonstrate that consistency with an expert's test selection leads to non-convex constraints on the model parameters.

Paper
Add Code

Representation Learning for Dynamic Graphs: A Survey

no code implementations • 27 May 2019 • Seyed Mehran Kazemi, Rishab Goel, Kshitij Jain, Ivan Kobyzev, Akshay Sethi, Peter Forsyth, Pascal Poupart

Graphs arise naturally in many real-world applications including social networks, recommender systems, ontologies, biology, and computational finance.

Knowledge Graphs Recommendation Systems +1

Paper
Add Code

Comparing EM with GD in Mixture Models of Two Components

1 code implementation • 8 Jul 2019 • Guojun Zhang, Pascal Poupart, George Trimponias

In the case of mixtures of Bernoullis, we find that there exist one-cluster regions that are stable for GD and therefore trap GD, but those regions are unstable for EM, allowing EM to escape.

Vocal Bursts Valence Prediction

Paper
Code

Batch norm with entropic regularization turns deterministic autoencoders into generative models

no code implementations • 25 Feb 2020 • Amur Ghose, Abdullah Rashwan, Pascal Poupart

The variational autoencoder is a well defined deep generative model that utilizes an encoder-decoder framework where an encoding neural network outputs a non-deterministic code for reconstructing an input.

Paper
Add Code

Optimality and Stability in Non-Convex Smooth Games

no code implementations • 27 Feb 2020 • Guojun Zhang, Pascal Poupart, Yao-Liang Yu

Convergence to a saddle point for convex-concave functions has been studied for decades, while recent years has seen a surge of interest in non-convex (zero-sum) smooth games, motivated by their recent wide applications.

Paper
Add Code

Generating Emotionally Aligned Responses in Dialogues using Affect Control Theory

no code implementations • 7 Mar 2020 • Nabiha Asghar, Ivan Kobyzev, Jesse Hoey, Pascal Poupart, Muhammad Bilal Sheikh

State-of-the-art neural dialogue systems excel at syntactic and semantic modelling of language, but often have a hard time establishing emotional alignment with the human interactant during a conversation.

Dialogue Generation

Paper
Add Code

A Positivstellensatz for Conditional SAGE Signomials

no code implementations • 8 Mar 2020 • Allen Houze Wang, Priyank Jaini, Yao-Liang Yu, Pascal Poupart

Recently, the conditional SAGE certificate has been proposed as a sufficient condition for signomial positivity over a convex set.

Paper
Add Code

Online Bayesian Moment Matching based SAT Solver Heuristics

no code implementations • ICML 2020 • Haonan Duan, Saeed Nejati, George Trimponias, Pascal Poupart, Vijay Ganesh

Our solvers out-perform the baselines by solving 12 more instances from the SAT competition 2018 application benchmark and are %40 faster on average in solving hard cryptographic instances.

Paper
Add Code

Learning Agent Representations for Ice Hockey

no code implementations • NeurIPS 2020 • Guiliang Liu, Oliver Schulte, Pascal Poupart, Mike Rudd, Mehrsan Javan

This paper develops a new approach for agent representations, based on a Markov game model, that is tailored towards applications in professional ice hockey.

Sports Analytics

Paper
Add Code

Prediction by Anticipation: An Action-Conditional Prediction Method based on Interaction Learning

no code implementations • ICCV 2021 • Ershad Banijamali, Mohsen Rohani, Elmira Amirloo, Jun Luo, Pascal Poupart

In autonomous driving (AD), accurately predicting changes in the environment can effectively improve safety and comfort.

Autonomous Driving

Paper
Add Code

Self-Supervised Simultaneous Multi-Step Prediction of Road Dynamics and Cost Map

no code implementations • CVPR 2021 • Elmira Amirloo, Mohsen Rohani, Ershad Banijamali, Jun Luo, Pascal Poupart

While supervised learning is widely used for perception modules in conventional autonomous driving solutions, scalability is hindered by the huge amount of data labeling needed.

Autonomous Driving Motion Planning

Paper
Add Code

Robust Embeddings Via Distributions

no code implementations • 17 Apr 2021 • Kira A. Selby, Yinong Wang, Ruizhe Wang, Peyman Passban, Ahmad Rashid, Mehdi Rezagholizadeh, Pascal Poupart

Despite recent monumental advances in the field, many Natural Language Processing (NLP) models still struggle to perform adequately on noisy domains.

Paper
Add Code

RAIL-KD: RAndom Intermediate Layer Mapping for Knowledge Distillation

no code implementations • Findings (NAACL) 2022 • Md Akmal Haidar, Nithin Anchuri, Mehdi Rezagholizadeh, Abbas Ghaddar, Philippe Langlais, Pascal Poupart

To address these problems, we propose a RAndom Intermediate Layer Knowledge Distillation (RAIL-KD) approach in which, intermediate layers from the teacher model are selected randomly to be distilled into the intermediate layers of the student model.

Knowledge Distillation

Paper
Add Code

Distributional Reinforcement Learning with Monotonic Splines

no code implementations • ICLR 2022 • Yudong Luo, Guiliang Liu, Haonan Duan, Oliver Schulte, Pascal Poupart

Distributional Reinforcement Learning (RL) differs from traditional RL by estimating the distribution over returns to capture the intrinsic uncertainty of MDPs.

Distributional Reinforcement Learning reinforcement-learning +1

Paper
Add Code

Learning Object-Oriented Dynamics for Planning from Text

no code implementations • ICLR 2022 • Guiliang Liu, Ashutosh Adhikari, Amir-Massoud Farahmand, Pascal Poupart

The advancement of dynamics models enables model-based planning in complex environments.

Decision Making Object +1

Paper
Add Code

Learning Tree Interpretation from Object Representation for Deep Reinforcement Learning

no code implementations • NeurIPS 2021 • Guiliang Liu, Xiangyu Sun, Oliver Schulte, Pascal Poupart

We propose a Represent And Mimic (RAMi) framework for training 1) an identifiable latent representation to capture the independent factors of variation for the objects and 2) a mimic tree that extracts the causal impact of the latent features on DRL action values.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Physics Constrained Flow Neural Network for Short-Timescale Predictions in Data Communications Networks

no code implementations • 23 Dec 2021 • Xiangle Cheng, James He, Shihan Xiao, Yingxue Zhang, Zhitang Chen, Pascal Poupart, FengLin Li

Machine learning is gaining growing momentum in various recent models for the dynamic analysis of information flows in data communications networks.

Self-Supervised Learning

Paper
Add Code

CILDA: Contrastive Data Augmentation using Intermediate Layer Knowledge Distillation

no code implementations • COLING 2022 • Md Akmal Haidar, Mehdi Rezagholizadeh, Abbas Ghaddar, Khalil Bibi, Philippe Langlais, Pascal Poupart

Knowledge distillation (KD) is an efficient framework for compressing large-scale pre-trained language models.

Contrastive Learning Data Augmentation +1

Paper
Add Code

Do we need Label Regularization to Fine-tune Pre-trained Language Models?

no code implementations • 25 May 2022 • Ivan Kobyzev, Aref Jafari, Mehdi Rezagholizadeh, Tianda Li, Alan Do-Omri, Peng Lu, Pascal Poupart, Ali Ghodsi

Knowledge Distillation (KD) is a prominent neural model compression technique that heavily relies on teacher network predictions to guide the training of a student model.

Knowledge Distillation Model Compression

Paper
Add Code

Learning Soft Constraints From Constrained Expert Demonstrations

no code implementations • 2 Jun 2022 • Ashish Gaurav, Kasra Rezaee, Guiliang Liu, Pascal Poupart

We consider the setting where the reward function is given, and the constraints are unknown, and propose a method that is able to recover these constraints satisfactorily from the expert data.

Paper
Add Code

Federated Bayesian Neural Regression: A Scalable Global Federated Gaussian Process

no code implementations • 13 Jun 2022 • Haolin Yu, Kaiyang Guo, Mahdi Karami, Xi Chen, Guojun Zhang, Pascal Poupart

We present Federated Bayesian Neural Regression (FedBNR), an algorithm that learns a scalable stand-alone global federated GP that respects clients' privacy.

Federated Learning Knowledge Distillation +1

Paper
Add Code

Label Alignment Regularization for Distribution Shift

no code implementations • 27 Nov 2022 • Ehsan Imani, Guojun Zhang, Runjia Li, Jun Luo, Pascal Poupart, Philip H. S. Torr, Yangchen Pan

Recent work has highlighted the label alignment property (LAP) in supervised learning, where the vector of all labels in the dataset is mostly in the span of the top few singular vectors of the data matrix.

Representation Learning Sentiment Analysis +1

Paper
Add Code

Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization

no code implementations • 12 Dec 2022 • Aref Jafari, Ivan Kobyzev, Mehdi Rezagholizadeh, Pascal Poupart, Ali Ghodsi

Knowledge Distillation (KD) has been extensively used for natural language understanding (NLU) tasks to improve a small model's (a student) generalization by transferring the knowledge from a larger model (a teacher).

Knowledge Distillation Natural Language Understanding

Paper
Add Code

Attribute Controlled Dialogue Prompting

no code implementations • 11 Jul 2023 • Runcheng Liu, Ahmad Rashid, Ivan Kobyzev, Mehdi Rezagholizadeh, Pascal Poupart

Prompt-tuning has become an increasingly popular parameter-efficient method for adapting large pretrained language models to downstream tasks.

Attribute Dialogue Generation

Paper
Add Code

Preventing Arbitrarily High Confidence on Far-Away Data in Point-Estimated Discriminative Neural Networks

1 code implementation • 7 Nov 2023 • Ahmad Rashid, Serena Hacker, Guojun Zhang, Agustinus Kristiadi, Pascal Poupart

For instance, ReLU networks - a popular class of neural network architectures - have been shown to almost always yield high confidence predictions when the test data are far away from the training set, even when they are trained with OOD data.

Paper
Code

Calibrated One Round Federated Learning with Bayesian Inference in the Predictive Space

2 code implementations • 15 Dec 2023 • Mohsin Hasan, Guojun Zhang, Kaiyang Guo, Xi Chen, Pascal Poupart

To improve scalability for larger models, one common Bayesian approach is to approximate the global predictive posterior by multiplying local predictive posteriors.

Bayesian Inference Federated Learning

Paper
Code

Why Online Reinforcement Learning is Causal

no code implementations • 7 Mar 2024 • Oliver Schulte, Pascal Poupart

Our main argument is that in online learning, conditional probabilities are causal, and therefore offline RL is the setting where causal learning has the most potential to make a difference.

counterfactual Offline RL +2

Paper
Add Code

A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization

no code implementations • 17 Mar 2024 • Yudong Luo, Yangchen Pan, Han Wang, Philip Torr, Pascal Poupart

Reinforcement learning algorithms utilizing policy gradients (PG) to optimize Conditional Value at Risk (CVaR) face significant challenges with sample inefficiency, hindering their practical applications.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.