Search Results for author: Lawrence Carin

Found 225 papers, 64 papers with code

What Makes Good In-Context Examples for GPT-3?

3 code implementations • 17 Jan 2021 • Jiachang Liu, Dinghan Shen, Yizhe Zhang, Bill Dolan, Lawrence Carin, Weizhu Chen

Inspired by the recent success of leveraging a retrieval module to augment large-scale neural network models, we propose to retrieve examples that are semantically similar to a test sample to formulate its corresponding prompt.

Few-Shot Learning Natural Language Understanding +4

10,348

Use HiResCAM instead of Grad-CAM for faithful explanations of convolutional neural networks

2 code implementations • 17 Nov 2020 • Rachel Lea Draelos, Lawrence Carin

Explanation methods facilitate the development of models that learn meaningful concepts and avoid exploiting spurious correlations.

General Classification Image Classification

9,412

Improving Zero-shot Voice Style Transfer via Disentangled Representation Learning

1 code implementation • ICLR 2021 • Siyang Yuan, Pengyu Cheng, Ruiyi Zhang, Weituo Hao, Zhe Gan, Lawrence Carin

Voice style transfer, also called voice conversion, seeks to modify one speaker's voice to generate speech as if it came from another (target) speaker.

Representation Learning Style Transfer +1

500

Certified Adversarial Robustness with Additive Noise

3 code implementations • NeurIPS 2019 • Bai Li, Changyou Chen, Wenlin Wang, Lawrence Carin

The existence of adversarial data examples has drawn significant attention in the deep-learning community; such data are seemingly minimally perturbed relative to the original data, but lead to very different outputs from a deep-learning algorithm.

Adversarial Attack Adversarial Robustness

351

Joint Embedding of Words and Labels for Text Classification

2 code implementations • ACL 2018 • Guoyin Wang, Chunyuan Li, Wenlin Wang, Yizhe Zhang, Dinghan Shen, Xinyuan Zhang, Ricardo Henao, Lawrence Carin

Word embeddings are effective intermediate representations for capturing semantic regularities between words, when learning the representations of text sequences.

Ranked #11 on Text Classification on DBpedia

General Classification Sentiment Analysis +2

323

Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms

2 code implementations • ACL 2018 • Dinghan Shen, Guoyin Wang, Wenlin Wang, Martin Renqiang Min, Qinliang Su, Yizhe Zhang, Chunyuan Li, Ricardo Henao, Lawrence Carin

Many deep learning architectures have been proposed to model the compositionality in text sequences, requiring a substantial number of parameters and expensive computations.

Ranked #1 on Named Entity Recognition (NER) on CoNLL 2000

Document Classification General Classification +4

284

CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information

2 code implementations • ICML 2020 • Pengyu Cheng, Weituo Hao, Shuyang Dai, Jiachang Liu, Zhe Gan, Lawrence Carin

In this paper, we propose a novel Contrastive Log-ratio Upper Bound (CLUB) of mutual information.

Domain Adaptation

276

StoryGAN: A Sequential Conditional GAN for Story Visualization

1 code implementation • CVPR 2019 • Yitong Li, Zhe Gan, Yelong Shen, Jingjing Liu, Yu Cheng, Yuexin Wu, Lawrence Carin, David Carlson, Jianfeng Gao

We therefore propose a new story-to-image-sequence generation model, StoryGAN, based on the sequential conditional GAN framework.

Sentence Story Visualization +1

231

Cyclical Annealing Schedule: A Simple Approach to Mitigating KL Vanishing

2 code implementations • NAACL 2019 • Hao Fu, Chunyuan Li, Xiaodong Liu, Jianfeng Gao, Asli Celikyilmaz, Lawrence Carin

Variational autoencoders (VAEs) with an auto-regressive decoder have been applied for many natural language processing (NLP) tasks.

Language Modelling Response Generation +1

172

Deconvolutional Paragraph Representation Learning

4 code implementations • NeurIPS 2017 • Yizhe Zhang, Dinghan Shen, Guoyin Wang, Zhe Gan, Ricardo Henao, Lawrence Carin

Learning latent representations from long text sequences is an important first step in many natural language processing applications.

General Classification Representation Learning +1

151

Graph Optimal Transport for Cross-Domain Alignment

1 code implementation • ICML 2020 • Liqun Chen, Zhe Gan, Yu Cheng, Linjie Li, Lawrence Carin, Jingjing Liu

In GOT, cross-domain alignment is formulated as a graph matching problem, by representing entities in a dynamically-constructed graph.

Graph Matching Image Captioning +8

149

ALICE: Towards Understanding Adversarial Learning for Joint Distribution Matching

5 code implementations • NeurIPS 2017 • Chunyuan Li, Hao Liu, Changyou Chen, Yunchen Pu, Liqun Chen, Ricardo Henao, Lawrence Carin

We investigate the non-identifiability issues associated with bidirectional adversarial training for joint distribution matching.

Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-training

1 code implementation • CVPR 2020 • Weituo Hao, Chunyuan Li, Xiujun Li, Lawrence Carin, Jianfeng Gao

By training on a large amount of image-text-action triplets in a self-supervised learning manner, the pre-trained model provides generic representations of visual environments and language instructions.

Ranked #1 on Visual Navigation on Help, Anna! (HANNA)

Navigate Self-Supervised Learning +2

Simpler, Faster, Stronger: Breaking The log-K Curse On Contrastive Learners With FlatNCE

1 code implementation • 2 Jul 2021 • Junya Chen, Zhe Gan, Xuan Li, Qing Guo, Liqun Chen, Shuyang Gao, Tagyoung Chung, Yi Xu, Belinda Zeng, Wenlian Lu, Fan Li, Lawrence Carin, Chenyang Tao

InfoNCE-based contrastive representation learners, such as SimCLR, have been tremendously successful in recent years.

Contrastive Learning

Adversarial Feature Matching for Text Generation

1 code implementation • ICML 2017 • Yizhe Zhang, Zhe Gan, Kai Fan, Zhi Chen, Ricardo Henao, Dinghan Shen, Lawrence Carin

We propose a framework for generating realistic text via adversarial training.

Generative Adversarial Network Text Generation

Semantic Compositional Networks for Visual Captioning

1 code implementation • CVPR 2017 • Zhe Gan, Chuang Gan, Xiaodong He, Yunchen Pu, Kenneth Tran, Jianfeng Gao, Lawrence Carin, Li Deng

The degree to which each member of the ensemble is used to generate an image caption is tied to the image-dependent probability of the corresponding tag.

Image Captioning Semantic Composition +1

Gromov-Wasserstein Learning for Graph Matching and Node Embedding

2 code implementations • 17 Jan 2019 • Hongteng Xu, Dixin Luo, Hongyuan Zha, Lawrence Carin

A novel Gromov-Wasserstein learning framework is proposed to jointly match (align) graphs and learn embedding vectors for the associated graph nodes.

Graph Matching

Triangle Generative Adversarial Networks

1 code implementation • NeurIPS 2017 • Zhe Gan, Liqun Chen, Wei-Yao Wang, Yunchen Pu, Yizhe Zhang, Hao Liu, Chunyuan Li, Lawrence Carin

The generators are designed to learn the two-way conditional distributions between the two domains, while the discriminators implicitly define a ternary discriminative function, which is trained to distinguish real data pairs and two kinds of fake data pairs.

Attribute Generative Adversarial Network +3

On Leveraging Pretrained GANs for Generation with Limited Data

1 code implementation • ICML 2020 • Miaoyun Zhao, Yulai Cong, Lawrence Carin

Demonstrated by natural-image generation, we reveal that low-level filters (those close to observations) of both the generator and discriminator of pretrained GANs can be transferred to facilitate generation in a perceptually-distinct target domain with limited training data.

Image Generation Transfer Learning

GAN Memory with No Forgetting

1 code implementation • NeurIPS 2020 • Yulai Cong, Miaoyun Zhao, Jianqiao Li, Sijia Wang, Lawrence Carin

As a fundamental issue in lifelong learning, catastrophic forgetting is directly caused by inaccessible historical data; accordingly, if the data (information) were memorized perfectly, no forgetting should be expected.

Dynamic Clustering via Asymptotics of the Dependent Dirichlet Process Mixture

1 code implementation • NeurIPS 2013 • Trevor Campbell, Miao Liu, Brian Kulis, Jonathan P. How, Lawrence Carin

This paper presents a novel algorithm, based upon the dependent Dirichlet process mixture model (DDPMM), for clustering batch-sequential data containing an unknown number of evolving clusters.

Clustering

Deep Temporal Sigmoid Belief Networks for Sequence Modeling

1 code implementation • NeurIPS 2015 • Zhe Gan, Chunyuan Li, Ricardo Henao, David Carlson, Lawrence Carin

Deep dynamic generative models are developed to learn sequential dependencies in time-series data.

Time Series Time Series Analysis

Learning Compressed Sentence Representations for On-Device Text Processing

1 code implementation • ACL 2019 • Dinghan Shen, Pengyu Cheng, Dhanasekar Sundararaman, Xinyuan Zhang, Qian Yang, Meng Tang, Asli Celikyilmaz, Lawrence Carin

Vector representations of sentences, trained on massive text corpora, are widely used as generic sentence embeddings across a variety of NLP problems.

Retrieval Sentence +1

Learning Autoencoders with Relational Regularization

2 code implementations • ICML 2020 • Hongteng Xu, Dixin Luo, Ricardo Henao, Svati Shah, Lawrence Carin

A new algorithmic framework is proposed for learning autoencoders of data distributions.

Multi-View Learning

Scalable Gromov-Wasserstein Learning for Graph Partitioning and Matching

1 code implementation • NeurIPS 2019 • Hongteng Xu, Dixin Luo, Lawrence Carin

Using this concept, we extend our method to multi-graph partitioning and matching by learning a Gromov-Wasserstein barycenter graph for multiple observed graphs; the barycenter graph plays the role of the disconnected graph, and since it is learned, so is the clustering.

Clustering Graph Matching +1

Adversarial Time-to-Event Modeling

4 code implementations • ICML 2018 • Paidamoyo Chapfuwa, Chenyang Tao, Chunyuan Li, Courtney Page, Benjamin Goldstein, Lawrence Carin, Ricardo Henao

Modern health data science applications leverage abundant molecular and electronic health data, providing opportunities for machine learning to build statistical models to support clinical practice.

Survival Analysis

Towards Amortized Ranking-Critical Training for Collaborative Filtering

1 code implementation • 10 Jun 2019 • Sam Lobel, Chunyuan Li, Jianfeng Gao, Lawrence Carin

In this paper we investigate new methods for training collaborative filtering models based on actor-critic reinforcement learning, to directly optimize the non-differentiable quality metrics of interest.

Ranked #4 on Recommendation Systems on Million Song Dataset

Collaborative Filtering Learning-To-Rank +1

RaCT: Toward Amortized Ranking-Critical Training For Collaborative Filtering

1 code implementation • ICLR 2020 • Sam Lobel, Chunyuan Li, Jianfeng Gao, Lawrence Carin

We investigate new methods for training collaborative filtering models based on actor-critic reinforcement learning, to more directly maximize ranking-based objective functions.

Collaborative Filtering Learning-To-Rank +2

Enhancing Cross-task Black-Box Transferability of Adversarial Examples with Dispersion Reduction

2 code implementations • CVPR 2020 • Yantao Lu, Yunhan Jia, Jian-Yu Wang, Bai Li, Weiheng Chai, Lawrence Carin, Senem Velipasalar

Neural networks are known to be vulnerable to carefully crafted adversarial examples, and these malicious samples often transfer, i.e., they remain adversarial even against other models.

Adversarial Attack Image Classification +5

Learning Graphons via Structured Gromov-Wasserstein Barycenters

1 code implementation • 10 Dec 2020 • Hongteng Xu, Dixin Luo, Lawrence Carin, Hongyuan Zha

Accordingly, given a set of graphs generated by an underlying graphon, we learn the corresponding step function as the Gromov-Wasserstein barycenter of the given graphs.

LEMMA

Understanding and Accelerating Particle-Based Variational Inference

1 code implementation • 4 Jul 2018 • Chang Liu, Jingwei Zhuo, Pengyu Cheng, Ruiyi Zhang, Jun Zhu, Lawrence Carin

Particle-based variational inference methods (ParVIs) have gained attention in the Bayesian inference literature, for their capacity to yield flexible and accurate approximations.

Bayesian Inference Variational Inference

Machine-Learning-Based Multiple Abnormality Prediction with Large-Scale Chest Computed Tomography Volumes

1 code implementation • 12 Feb 2020 • Rachel Lea Draelos, David Dov, Maciej A. Mazurowski, Joseph Y. Lo, Ricardo Henao, Geoffrey D. Rubin, Lawrence Carin

This model reached a classification performance of AUROC greater than 0.90 for 18 abnormalities, with an average AUROC of 0.773 for all 83 abnormalities, demonstrating the feasibility of learning from unfiltered whole volume CT data.

BIG-bench Machine Learning Computed Tomography (CT) +1

JointGAN: Multi-Domain Joint Distribution Learning with Generative Adversarial Nets

2 code implementations • ICML 2018 • Yunchen Pu, Shuyang Dai, Zhe Gan, Wei-Yao Wang, Guoyin Wang, Yizhe Zhang, Ricardo Henao, Lawrence Carin

Distinct from most existing approaches, which only learn conditional distributions, the proposed model aims to learn a joint distribution of multiple random variables (domains).

Generative Adversarial Network

Symmetric Variational Autoencoder and Connections to Adversarial Learning

2 code implementations • 6 Sep 2017 • Liqun Chen, Shuyang Dai, Yunchen Pu, Chunyuan Li, Qinliang Su, Lawrence Carin

A new form of the variational autoencoder (VAE) is proposed, based on the symmetric Kullback-Leibler divergence.

Revisiting the Softmax Bellman Operator: New Benefits and New Perspective

2 code implementations • 2 Dec 2018 • Zhao Song, Ronald E. Parr, Lawrence Carin

The impact of softmax on the value function itself in reinforcement learning (RL) is often viewed as problematic because it leads to sub-optimal value (or Q) functions and interferes with the contraction properties of the Bellman operator.

Atari Games Q-Learning +1

Bridging the Gap between Stochastic Gradient MCMC and Stochastic Optimization

1 code implementation • 25 Dec 2015 • Changyou Chen, David Carlson, Zhe Gan, Chunyuan Li, Lawrence Carin

Stochastic gradient Markov chain Monte Carlo (SG-MCMC) methods are Bayesian analogs to popular stochastic optimization methods; however, this connection is not well studied.

Stochastic Optimization

Efficient Feature Transformations for Discriminative and Generative Continual Learning

1 code implementation • CVPR 2021 • Vinay Kumar Verma, Kevin J Liang, Nikhil Mehta, Piyush Rai, Lawrence Carin

However, the growth in the number of additional parameters of many of these types of methods can be computationally expensive at larger scales, at times prohibitively so.

Continual Learning

Variational Gaussian Copula Inference

1 code implementation • 19 Jun 2015 • Shaobo Han, Xuejun Liao, David B. Dunson, Lawrence Carin

We utilize copulas to constitute a unified framework for constructing and optimizing variational proposals in hierarchical Bayesian models.

Adversarial Learning of a Sampler Based on an Unnormalized Distribution

3 code implementations • 3 Jan 2019 • Chunyuan Li, Ke Bai, Jianqiao Li, Guoyin Wang, Changyou Chen, Lawrence Carin

We investigate adversarial learning in the case when only an unnormalized form of the density can be accessed, rather than samples.

Q-Learning

Survival Function Matching for Calibrated Time-to-Event Predictions

1 code implementation • 21 May 2019 • Paidamoyo Chapfuwa, Chenyang Tao, Lawrence Carin, Ricardo Henao

We present a survival function estimator for probabilistic predictions in time-to-event models, based on a neural network model for draws from the distribution of event times, without explicit assumptions on the form of the distribution.

Survival Cluster Analysis

1 code implementation • 29 Feb 2020 • Paidamoyo Chapfuwa, Chunyuan Li, Nikhil Mehta, Lawrence Carin, Ricardo Henao

As a result, there is an unmet need in survival analysis for identifying subpopulations with distinct risk profiles, while jointly accounting for accurate individualized time-to-event predictions.

Survival Analysis

Supercharging Imbalanced Data Learning With Energy-based Contrastive Representation Transfer

1 code implementation • NeurIPS 2021 • Zidi Xiu, Junya Chen, Ricardo Henao, Benjamin Goldstein, Lawrence Carin, Chenyang Tao

Dealing with severe class imbalance poses a major challenge for real-world applications, especially when the accurate classification and generalization of minority classes is of primary interest.

Inductive Bias Transfer Learning

Tight Mutual Information Estimation With Contrastive Fenchel-Legendre Optimization

1 code implementation • 2 Jul 2021 • Qing Guo, Junya Chen, Dong Wang, Yuewei Yang, Xinwei Deng, Lawrence Carin, Fan Li, Jing Huang, Chenyang Tao

Successful applications of InfoNCE and its variants have popularized the use of contrastive variational mutual information (MI) estimators in machine learning.

Mutual Information Estimation

Ouroboros: On Accelerating Training of Transformer-Based Language Models

1 code implementation • NeurIPS 2019 • Qian Yang, Zhouyuan Huo, Wenlin Wang, Heng Huang, Lawrence Carin

Model parallelism is required if a model is too large to fit in a single computing device.

Language Modelling Machine Translation +2

Dynamic Embedding on Textual Networks via a Gaussian Process

1 code implementation • 5 Oct 2019 • Pengyu Cheng, Yitong Li, Xinyuan Zhang, Liqun Cheng, David Carlson, Lawrence Carin

The relative importance of global versus local structure for the embeddings is learned automatically.

Link Prediction Network Embedding +1

Enabling Counterfactual Survival Analysis with Balanced Representations

1 code implementation • 14 Jun 2020 • Paidamoyo Chapfuwa, Serge Assaad, Shuxi Zeng, Michael J. Pencina, Lawrence Carin, Ricardo Henao

Balanced representation learning methods have been applied successfully to counterfactual inference from observational data.

counterfactual Counterfactual Inference +2

Reconsidering Generative Objectives For Counterfactual Reasoning

1 code implementation • NeurIPS 2020 • Danni Lu, Chenyang Tao, Junya Chen, Fan Li, Feng Guo, Lawrence Carin

As a step towards more flexible, scalable and accurate ITE estimation, we present a novel generative Bayesian estimation framework that integrates representation learning, adversarial matching and causal estimation.

Causal Inference counterfactual +2

Students Need More Attention: BERT-based Attention Model for Small Data with Application to Automatic Patient Message Triage

1 code implementation • 22 Jun 2020 • Shijing Si, Rui Wang, Jedrek Wosik, Hao Zhang, David Dov, Guoyin Wang, Ricardo Henao, Lawrence Carin

Small and imbalanced datasets commonly seen in healthcare represent a challenge when training classifiers based on deep learning models.

Learning Structural Weight Uncertainty for Sequential Decision-Making

1 code implementation • 30 Dec 2017 • Ruiyi Zhang, Chunyuan Li, Changyou Chen, Lawrence Carin

Learning probability distributions on the weights of neural networks (NNs) has recently proven beneficial in many applications.

Decision Making Multi-Armed Bandits +1

Kernel-Based Approaches for Sequence Modeling: Connections to Neural Methods

1 code implementation • NeurIPS 2019 • Kevin J Liang, Guoyin Wang, Yitong Li, Ricardo Henao, Lawrence Carin

We investigate time-dependent data analysis from the perspective of recurrent kernel machines, from which models with hidden units and gated memory cells arise naturally.

NASH: Toward End-to-End Neural Architecture for Generative Semantic Hashing

1 code implementation • ACL 2018 • Dinghan Shen, Qinliang Su, Paidamoyo Chapfuwa, Wenlin Wang, Guoyin Wang, Lawrence Carin, Ricardo Henao

Semantic hashing has become a powerful paradigm for fast similarity search in many information retrieval systems.

Information Retrieval Retrieval +1

Alternating Minimization Algorithm with Automatic Relevance Determination for Transmission Tomography under Poisson Noise

1 code implementation • 29 Dec 2014 • Yan Kaganovsky, Shaobo Han, Soysal Degirmenci, David G. Politte, David J. Brady, Joseph A. O'Sullivan, Lawrence Carin

We propose a globally convergent alternating minimization (AM) algorithm for image reconstruction in transmission tomography, which extends automatic relevance determination (ARD) to Poisson noise models with Beer's law.

Image Reconstruction

Adversarial Text Generation via Feature-Mover's Distance

1 code implementation • NeurIPS 2018 • Liqun Chen, Shuyang Dai, Chenyang Tao, Dinghan Shen, Zhe Gan, Haichao Zhang, Yizhe Zhang, Lawrence Carin

However, the discrete nature of text hinders the application of GAN to text-generation tasks.

Adversarial Text Style Transfer +1

Towards Practical Lottery Ticket Hypothesis for Adversarial Training

1 code implementation • 6 Mar 2020 • Bai Li, Shiqi Wang, Yunhan Jia, Yantao Lu, Zhenyu Zhong, Lawrence Carin, Suman Jana

Recent research has proposed the lottery ticket hypothesis, suggesting that for a deep neural network, there exist trainable sub-networks performing equally or better than the original model with commensurate training steps.

Double Robust Representation Learning for Counterfactual Prediction

1 code implementation • 15 Oct 2020 • Shuxi Zeng, Serge Assaad, Chenyang Tao, Shounak Datta, Lawrence Carin, Fan Li

Causal inference, or counterfactual prediction, is central to decision making in healthcare, policy and social sciences.

Causal Inference counterfactual +2

GO Gradient for Expectation-Based Objectives

1 code implementation • ICLR 2019 • Yulai Cong, Miaoyun Zhao, Ke Bai, Lawrence Carin

Within many machine learning algorithms, a fundamental problem concerns efficient calculation of an unbiased gradient with respect to parameters $\boldsymbol{\gamma}$ for expectation-based objectives $\mathbb{E}_{q_{\boldsymbol{\gamma}}(\boldsymbol{y})} [f(\boldsymbol{y})]$.

On Fenchel Mini-Max Learning

1 code implementation • NeurIPS 2019 • Chenyang Tao, Liqun Chen, Shuyang Dai, Junya Chen, Ke Bai, Dong Wang, Jianfeng Feng, Wenlian Lu, Georgiy Bobashev, Lawrence Carin

Inference, estimation, sampling and likelihood evaluation are four primary goals of probabilistic modeling.

Density Estimation

Explainable multiple abnormality classification of chest CT volumes

2 code implementations • 24 Nov 2021 • Rachel Lea Draelos, Lawrence Carin

We introduce the challenging new task of explainable multiple abnormality classification in volumetric medical images, in which a model must indicate the regions used to predict each abnormality.

Classification Multiple Instance Learning +1

Improving Textual Network Learning with Variational Homophilic Embeddings

1 code implementation • NeurIPS 2019 • Wenlin Wang, Chenyang Tao, Zhe Gan, Guoyin Wang, Liqun Chen, Xinyuan Zhang, Ruiyi Zhang, Qian Yang, Ricardo Henao, Lawrence Carin

This paper considers a novel variational formulation of network embeddings, with special focus on textual networks.

Network Embedding

Integrating Task Specific Information into Pretrained Language Models for Low Resource Fine Tuning

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Rui Wang, Shijing Si, Guoyin Wang, Lei Zhang, Lawrence Carin, Ricardo Henao

Pretrained Language Models (PLMs) have improved the performance of natural language understanding in recent years.

Natural Language Understanding

Negative Binomial Process Count and Mixture Modeling

1 code implementation • 15 Sep 2012 • Mingyuan Zhou, Lawrence Carin

A gamma process is employed to model the rate measure of a Poisson process, whose normalization provides a random probability measure for mixture modeling and whose marginalization leads to an NB process for count modeling.

Bayesian Inference

GO Hessian for Expectation-Based Objectives

1 code implementation • 16 Jun 2020 • Yulai Cong, Miaoyun Zhao, Jianqiao Li, Junya Chen, Lawrence Carin

An unbiased low-variance gradient estimator, termed GO gradient, was proposed recently for expectation-based objectives $\mathbb{E}_{q_{\boldsymbol{\gamma}}(\boldsymbol{y})} [f(\boldsymbol{y})]$, where the random variable (RV) $\boldsymbol{y}$ may be drawn from a stochastic computation graph with continuous (non-reparameterizable) internal nodes and continuous/discrete leaves.

Gradient Importance Learning for Incomplete Observations

1 code implementation • ICLR 2022 • Qitong Gao, Dong Wang, Joshua D. Amason, Siyang Yuan, Chenyang Tao, Ricardo Henao, Majda Hadziahmetovic, Lawrence Carin, Miroslav Pajic

Though recent works have developed methods that can generate estimates (or imputations) of the missing entries in a dataset to facilitate downstream analysis, most depend on assumptions that may not align with real-world applications and could suffer from poor performance in subsequent tasks such as classification.

Imputation Reinforcement Learning (RL) +2

Diffusion Maps for Textual Network Embedding

no code implementations • NeurIPS 2018 • Xinyuan Zhang, Yitong Li, Dinghan Shen, Lawrence Carin

Textual network embedding leverages rich text information associated with the network to learn low-dimensional vectorial representations of vertices.

General Classification Link Prediction +1

Nonlocal Low-Rank Tensor Factor Analysis for Image Restoration

no code implementations • CVPR 2018 • Xinyuan Zhang, Xin Yuan, Lawrence Carin

Low-rank signal modeling has been widely leveraged to capture non-local correlation in image processing applications.

Compressive Sensing Image Reconstruction +1

Topic Compositional Neural Language Model

no code implementations • 28 Dec 2017 • Wenlin Wang, Zhe Gan, Wenqi Wang, Dinghan Shen, Jiaji Huang, Wei Ping, Sanjeev Satheesh, Lawrence Carin

The TCNLM learns the global semantic coherence of a document via a neural topic model, and the probability of each learned latent topic is further used to build a Mixture-of-Experts (MoE) language model, where each expert (corresponding to one topic) is a recurrent neural network (RNN) that accounts for learning the local structure of a word sequence.

Language Modelling

Superposition-Assisted Stochastic Optimization for Hawkes Processes

no code implementations • 13 Feb 2018 • Hongteng Xu, Xu Chen, Lawrence Carin

We consider the learning of multi-agent Hawkes processes, a model containing multiple Hawkes processes with shared endogenous impact functions and different exogenous intensities.

Sequential Recommendation Stochastic Optimization

Benefits from Superposed Hawkes Processes

no code implementations • 14 Oct 2017 • Hongteng Xu, Dixin Luo, Xu Chen, Lawrence Carin

The superposition of Hawkes processes is demonstrated to be beneficial for tightening the upper bound of excess risk under certain conditions, and we show the feasibility of the benefit in typical situations.

Point Processes Recommendation Systems

Continuous-Time Flows for Efficient Inference and Density Estimation

no code implementations • ICML 2018 • Changyou Chen, Chunyuan Li, Liqun Chen, Wenlin Wang, Yunchen Pu, Lawrence Carin

Distinct from normalizing flows and GANs, CTFs can be adopted to achieve the above two goals in one framework, with theoretical guarantees.

Density Estimation

Learning Registered Point Processes from Idiosyncratic Observations

no code implementations • ICML 2018 • Hongteng Xu, Lawrence Carin, Hongyuan Zha

A parametric point process model is developed, with modeling based on the assumption that sequential observations often share latent phenomena, while also possessing idiosyncratic effects.

Point Processes

Multi-Label Learning from Medical Plain Text with Convolutional Residual Models

no code implementations • 15 Jan 2018 • Xinyuan Zhang, Ricardo Henao, Zhe Gan, Yitong Li, Lawrence Carin

Since diagnoses are typically correlated, a deep residual network is employed on top of the CNN encoder, to capture label (diagnosis) dependencies and incorporate information directly from the encoded sentence vector.

General Classification Multi-Label Classification +4

Stochastic Gradient Monomial Gamma Sampler

no code implementations • ICML 2017 • Yizhe Zhang, Changyou Chen, Zhe Gan, Ricardo Henao, Lawrence Carin

A framework is proposed to improve the sampling efficiency of stochastic gradient MCMC, based on Hamiltonian Monte Carlo.

Towards Unifying Hamiltonian Monte Carlo and Slice Sampling

no code implementations • NeurIPS 2016 • Yizhe Zhang, Xiangyu Wang, Changyou Chen, Ricardo Henao, Kai Fan, Lawrence Carin

We unify slice sampling and Hamiltonian Monte Carlo (HMC) sampling, demonstrating their connection via the Hamilton-Jacobi equation from Hamiltonian mechanics.

On Connecting Stochastic Gradient MCMC and Differential Privacy

no code implementations • 25 Dec 2017 • Bai Li, Changyou Chen, Hao Liu, Lawrence Carin

Significant success has been realized recently on applying machine learning to real-world applications.

BIG-bench Machine Learning

Deconvolutional Latent-Variable Model for Text Sequence Matching

no code implementations • 21 Sep 2017 • Dinghan Shen, Yizhe Zhang, Ricardo Henao, Qinliang Su, Lawrence Carin

A latent-variable model is introduced for text matching, inferring sentence representations by jointly optimizing generative and discriminative objectives.

Sentence Text Matching

Zero-Shot Learning via Class-Conditioned Deep Generative Models

no code implementations • 15 Nov 2017 • Wenlin Wang, Yunchen Pu, Vinay Kumar Verma, Kai Fan, Yizhe Zhang, Changyou Chen, Piyush Rai, Lawrence Carin

We present a deep generative model for learning to predict classes not seen at training time.

Few-Shot Learning Zero-Shot Learning

Adversarial Symmetric Variational Autoencoder

no code implementations • NeurIPS 2017 • Yunchen Pu, Wei-Yao Wang, Ricardo Henao, Liqun Chen, Zhe Gan, Chunyuan Li, Lawrence Carin

A new form of variational autoencoder (VAE) is developed, in which the joint distribution of data and codes is considered in two (symmetric) forms: (i) from observed data fed through the encoder to yield codes, and (ii) from latent codes drawn from a simple prior and propagated through the decoder to manifest data.

VAE Learning via Stein Variational Gradient Descent

no code implementations • NeurIPS 2017 • Yunchen Pu, Zhe Gan, Ricardo Henao, Chunyuan Li, Shaobo Han, Lawrence Carin

A new method for learning variational autoencoders (VAEs) is developed, based on Stein variational gradient descent.

Adaptive Feature Abstraction for Translating Video to Text

no code implementations • 23 Nov 2016 • Yunchen Pu, Martin Renqiang Min, Zhe Gan, Lawrence Carin

Previous models for video captioning often use the output from a specific layer of a Convolutional Neural Network (CNN) as video features.

Video Captioning

An inner-loop free solution to inverse problems using deep neural networks

no code implementations • NeurIPS 2017 • Qi Wei, Kai Fan, Lawrence Carin, Katherine A. Heller

For matrix inversion in the second sub-problem, we learn a convolutional neural network to approximate the matrix inversion, i.e., the inverse mapping is learned by feeding the input through the learned forward network.

Denoising

Paper
Add Code

Learning Context-Sensitive Convolutional Filters for Text Processing

no code implementations • EMNLP 2018 • Dinghan Shen, Martin Renqiang Min, Yitong Li, Lawrence Carin

The role of the meta network is to abstract the contextual information of a sentence or document into a set of input-aware filters.

Ranked #13 on Text Classification on DBpedia

Paraphrase Identification Sentence +2

Paper
Add Code

A Probabilistic Framework for Nonlinearities in Stochastic Neural Networks

no code implementations • NeurIPS 2017 • Qinliang Su, Xuejun Liao, Lawrence Carin

We present a probabilistic framework for nonlinearities, based on doubly truncated Gaussian distributions.

Paper
Add Code

A Convergence Analysis for A Class of Practical Variance-Reduction Stochastic Gradient MCMC

no code implementations • 4 Sep 2017 • Changyou Chen, Wenlin Wang, Yizhe Zhang, Qinliang Su, Lawrence Carin

However, there has been little theoretical analysis of the impact of minibatch size on the algorithm's convergence rate.

Stochastic Optimization

Paper
Add Code

Learning Generic Sentence Representations Using Convolutional Neural Networks

no code implementations • EMNLP 2017 • Zhe Gan, Yunchen Pu, Ricardo Henao, Chunyuan Li, Xiaodong He, Lawrence Carin

We propose a new encoder-decoder approach to learn distributed sentence representations that are applicable to multiple purposes.

Sentence

Paper
Add Code

Deep Generative Models for Relational Data with Side Information

no code implementations • ICML 2017 • Changwei Hu, Piyush Rai, Lawrence Carin

Moreover, inference cost scales in the number of edges, which is attractive for massive but sparse networks.

Data Augmentation Link Prediction

Paper
Add Code

Scalable Bayesian Learning of Recurrent Neural Networks for Language Modeling

no code implementations • ACL 2017 • Zhe Gan, Chunyuan Li, Changyou Chen, Yunchen Pu, Qinliang Su, Lawrence Carin

Recurrent neural networks (RNNs) have shown promising performance for language modeling.

Language Modelling Stochastic Optimization

Paper
Add Code

Tensor-Dictionary Learning with Deep Kruskal-Factor Analysis

no code implementations • 8 Dec 2016 • Andrew Stevens, Yunchen Pu, Yannan Sun, Greg Spell, Lawrence Carin

A multi-way factor analysis model is introduced for tensor-variate data of any order.

Denoising Dictionary Learning +2

Paper
Add Code

Compressive Sensing via Convolutional Factor Analysis

no code implementations • 11 Jan 2017 • Xin Yuan, Yunchen Pu, Lawrence Carin

During reconstruction and testing, we project the upper layer dictionary to the data level and only a single layer deconvolution is required.

Compressive Sensing General Classification

Paper
Add Code

Nonlinear Statistical Learning with Truncated Gaussian Graphical Models

no code implementations • 2 Jun 2016 • Qinliang Su, Xuejun Liao, Changyou Chen, Lawrence Carin

We introduce the truncated Gaussian graphical model (TGGM) as a novel framework for designing statistical models for nonlinear learning.

General Classification

Paper
Add Code

Unsupervised Learning with Truncated Gaussian Graphical Models

no code implementations • 15 Nov 2016 • Qinliang Su, Xuejun Liao, Chunyuan Li, Zhe Gan, Lawrence Carin

Gaussian graphical models (GGMs) are widely used for statistical modeling, because of ease of inference and the ubiquitous use of the normal distribution in practical approximations.

Unsupervised Pre-training

Paper
Add Code

Earliness-Aware Deep Convolutional Networks for Early Time Series Classification

no code implementations • 14 Nov 2016 • Wenlin Wang, Changyou Chen, Wenqi Wang, Piyush Rai, Lawrence Carin

Unlike most existing methods for early classification of time series data, that are designed to solve this problem under the assumption of the availability of a good set of pre-defined (often hand-crafted) features, our framework can jointly perform feature learning (by learning a deep hierarchy of shapelets capturing the salient characteristics in each time series), along with a dynamic truncation model to help our deep feature learning architecture focus on the early parts of each time series.

Classification Early Classification +4

Paper
Add Code

On the Convergence of Stochastic Gradient MCMC Algorithms with High-Order Integrators

no code implementations • NeurIPS 2015 • Changyou Chen, Nan Ding, Lawrence Carin

Our theoretical results show faster convergence rates and more accurate invariant measures for SG-MCMCs with higher-order integrators.

Paper
Add Code

Stochastic Gradient MCMC with Stale Gradients

no code implementations • NeurIPS 2016 • Changyou Chen, Nan Ding, Chunyuan Li, Yizhe Zhang, Lawrence Carin

In this paper we develop theory to show that while the bias and MSE of an SG-MCMC algorithm depend on the staleness of stochastic gradients, its estimation variance (relative to the expected estimate, based on a prescribed number of samples) is independent of it.

Paper
Add Code

Variational Autoencoder for Deep Learning of Images, Labels and Captions

no code implementations • NeurIPS 2016 • Yunchen Pu, Zhe Gan, Ricardo Henao, Xin Yuan, Chunyuan Li, Andrew Stevens, Lawrence Carin

A novel variational autoencoder is developed to model images, as well as associated labels or captions.

Paper
Add Code

Factored Temporal Sigmoid Belief Networks for Sequence Learning

no code implementations • 22 May 2016 • Jiaming Song, Zhe Gan, Lawrence Carin

Deep conditional generative models are developed to simultaneously learn the temporal dependencies of multiple sequences.

General Classification

Paper
Add Code

Classification and Reconstruction of High-Dimensional Signals from Low-Dimensional Features in the Presence of Side Information

no code implementations • 1 Dec 2014 • Francesco Renna, Liming Wang, Xin Yuan, Jianbo Yang, Galen Reeves, Robert Calderbank, Lawrence Carin, Miguel R. D. Rodrigues

These conditions, which are reminiscent of the well-known Slepian-Wolf and Wyner-Ziv conditions, are a function of the number of linear features extracted from the signal of interest, the number of linear features extracted from the side information signal, and the geometry of these signals and their interplay.

General Classification

Paper
Add Code

Spectrally Grouped Total Variation Reconstruction for Scatter Imaging Using ADMM

no code implementations • 29 Jan 2016 • Ikenna Odinaka, Yan Kaganovsky, Joel A. Greenberg, Mehadi Hassan, David G. Politte, Joseph A. O'Sullivan, Lawrence Carin, David J. Brady

We pursue an optimization transfer approach where convex decompositions are used to lift the problem such that all hyper-voxels can be updated in parallel and in closed-form.

Image Reconstruction

Paper
Add Code

Joint System and Algorithm Design for Computationally Efficient Fan Beam Coded Aperture X-ray Coherent Scatter Imaging

no code implementations • 29 Jan 2016 • Ikenna Odinaka, Joseph A. O'Sullivan, David G. Politte, Kenneth P. MacCabe, Yan Kaganovsky, Joel A. Greenberg, Manu Lakshmanan, Kalyani Krishnamurthy, Anuj Kapadia, Lawrence Carin, David J. Brady

In x-ray coherent scatter tomography, tomographic measurements of the forward scatter distribution are used to infer scatter densities within a volume.

Paper
Add Code

Preconditioned Stochastic Gradient Langevin Dynamics for Deep Neural Networks

no code implementations • 23 Dec 2015 • Chunyuan Li, Changyou Chen, David Carlson, Lawrence Carin

PyTorch implementations of Bayes By Backprop, MC Dropout, SGLD, the Local Reparametrization Trick, KF-Laplace and more

Paper
Add Code

High-Order Stochastic Gradient Thermostats for Bayesian Learning of Deep Models

no code implementations • 23 Dec 2015 • Chunyuan Li, Changyou Chen, Kai Fan, Lawrence Carin

Stochastic gradient MCMC algorithms (SG-MCMC) are a family of diffusion-based sampling methods for large-scale Bayesian learning.

Vocal Bursts Intensity Prediction

Paper
Add Code
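
For orientation, the simplest member of the SG-MCMC family named in the snippet above is stochastic gradient Langevin dynamics (SGLD). The sketch below is a generic illustration in plain Python, not the high-order thermostat method of the listed paper. The toy model (inferring a Gaussian mean), the function name `sgld_gaussian_mean`, and all hyperparameters are assumptions chosen for the example.

```python
import math
import random


def sgld_gaussian_mean(data, n_steps=2000, batch_size=50, step=1e-4,
                       prior_var=100.0, seed=0):
    """Draw approximate posterior samples of a Gaussian mean with SGLD.

    Assumed toy model: x_i ~ N(theta, 1) with prior theta ~ N(0, prior_var).
    """
    rng = random.Random(seed)
    n = len(data)
    theta = 0.0
    samples = []
    for _ in range(n_steps):
        batch = rng.sample(data, batch_size)
        # Gradient of the log prior.
        grad_prior = -theta / prior_var
        # Minibatch estimate of the log-likelihood gradient, rescaled to the full data set.
        grad_lik = (n / batch_size) * sum(x - theta for x in batch)
        # Langevin update: half-step along the gradient plus N(0, step) noise.
        theta += 0.5 * step * (grad_prior + grad_lik) + rng.gauss(0.0, math.sqrt(step))
        samples.append(theta)
    return samples


# Synthetic data with true mean 2.0 (an assumption for the demo).
data_rng = random.Random(1)
data = [data_rng.gauss(2.0, 1.0) for _ in range(1000)]
samples = sgld_gaussian_mean(data)
kept = samples[500:]  # discard burn-in
posterior_mean = sum(kept) / len(kept)
```

With a small fixed step size the chain is only approximately distributed according to the posterior; the average of the retained samples should nonetheless land close to the empirical mean of `data`.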

A Deep Generative Deconvolutional Image Model

no code implementations • 23 Dec 2015 • Yunchen Pu, Xin Yuan, Andrew Stevens, Chunyuan Li, Lawrence Carin

A deep generative model is developed for representation and analysis of images, based on a hierarchical convolutional dictionary-learning framework.

Dictionary Learning Image Generation

Paper
Add Code

Learning a Hybrid Architecture for Sequence Regression and Annotation

no code implementations • 16 Dec 2015 • Yizhe Zhang, Ricardo Henao, Lawrence Carin, Jianling Zhong, Alexander J. Hartemink

When learning a hidden Markov model (HMM), sequential observations can often be complemented by real-valued summary response variables generated from the path of hidden states.

regression

Paper
Add Code

Stick-Breaking Policy Learning in Dec-POMDPs

no code implementations • 1 May 2015 • Miao Liu, Christopher Amato, Xuejun Liao, Lawrence Carin, Jonathan P. How

Expectation maximization (EM) has recently been shown to be an efficient algorithm for learning finite-state controllers (FSCs) in large decentralized POMDPs (Dec-POMDPs).

Paper
Add Code

Scalable Bayesian Non-Negative Tensor Factorization for Massive Count Data

no code implementations • 18 Aug 2015 • Changwei Hu, Piyush Rai, Changyou Chen, Matthew Harding, Lawrence Carin

We present a Bayesian non-negative tensor factorization model for count-valued tensor data, and develop scalable inference algorithms (both batch and online) for dealing with massive tensors.

Paper
Add Code

Zero-Truncated Poisson Tensor Factorization for Massive Binary Tensors

no code implementations • 18 Aug 2015 • Changwei Hu, Piyush Rai, Lawrence Carin

We present a scalable Bayesian model for low-rank factorization of massive tensors with binary observations.

Bayesian Inference

Paper
Add Code

Non-Gaussian Discriminative Factor Models via the Max-Margin Rank-Likelihood

no code implementations • 28 Apr 2015 • Xin Yuan, Ricardo Henao, Ephraim L. Tsalik, Raymond J. Langley, Lawrence Carin

A Bayesian model based on the ranks of the data is proposed.

Paper
Add Code

A Generative Model for Deep Convolutional Learning

no code implementations • 15 Apr 2015 • Yunchen Pu, Xin Yuan, Lawrence Carin

A generative model is developed for deep (multi-layered) convolutional dictionary learning.

Dictionary Learning General Classification

Paper
Add Code

Compressive Hyperspectral Imaging with Side Information

no code implementations • 22 Feb 2015 • Xin Yuan, Tsung-Han Tsai, Ruoyu Zhu, Patrick Llull, David Brady, Lawrence Carin

By using RGB images as side information of the compressive sensing system, the proposed approach is extended to learn a coupled dictionary from the joint dataset of the compressed measurements and the corresponding RGB images, to improve reconstruction quality.

Compressive Sensing

Paper
Add Code

Generative Deep Deconvolutional Learning

no code implementations • 18 Dec 2014 • Yunchen Pu, Xin Yuan, Lawrence Carin

A generative Bayesian model is developed for deep (multi-layer) convolutional dictionary learning.

Dictionary Learning General Classification

Paper
Add Code

Tree-Structure Bayesian Compressive Sensing for Video

no code implementations • 12 Oct 2014 • Xin Yuan, Patrick Llull, David J. Brady, Lawrence Carin

A Bayesian compressive sensing framework is developed for video reconstruction based on the color coded aperture compressive temporal imaging (CACTI) system.

Compressive Sensing Video Reconstruction

Paper
Add Code

Low-Cost Compressive Sensing for Color Video and Depth

no code implementations • CVPR 2014 • Xin Yuan, Patrick Llull, Xuejun Liao, Jianbo Yang, Guillermo Sapiro, David J. Brady, Lawrence Carin

A simple and inexpensive (low-power and low-bandwidth) modification is made to a conventional off-the-shelf color video camera, from which we recover multiple color frames for each of the original measured frames, and each of the recovered frames can be focused at a different depth.

Compressive Sensing Translation

Paper
Add Code

Multiscale Shrinkage and Lévy Processes

no code implementations • 11 Jan 2014 • Xin Yuan, Vinayak Rao, Shaobo Han, Lawrence Carin

The method we consider in detail, and for which numerical results are presented, is based on increments of a gamma process.

Bayesian Inference Compressive Sensing +1

Paper
Add Code

Adaptive Temporal Compressive Sensing for Video

no code implementations • 14 Feb 2013 • Xin Yuan, Jianbo Yang, Patrick Llull, Xuejun Liao, Guillermo Sapiro, David J. Brady, Lawrence Carin

This paper introduces the concept of adaptive temporal compressive sensing (CS) for video.

Compressive Sensing

Paper
Add Code

Generalized Bregman Divergence and Gradient of Mutual Information for Vector Poisson Channels

no code implementations • 28 Jan 2013 • Liming Wang, Miguel Rodrigues, Lawrence Carin

We investigate connections between information-theoretic and estimation-theoretic quantities in vector Poisson channel models.

Compressive Sensing Document Classification

Paper
Add Code

Policy Optimization as Wasserstein Gradient Flows

no code implementations • ICML 2018 • Ruiyi Zhang, Changyou Chen, Chunyuan Li, Lawrence Carin

Policy optimization is a core component of reinforcement learning (RL), and most existing RL methods directly optimize parameters of a policy based on maximizing the expected total reward, or its surrogate.

Reinforcement Learning (RL)

Paper
Add Code

Improved Semantic-Aware Network Embedding with Fine-Grained Word Alignment

no code implementations • EMNLP 2018 • Dinghan Shen, Xinyuan Zhang, Ricardo Henao, Lawrence Carin

Network embeddings, which learn low-dimensional representations for each vertex in a large-scale network, have received considerable attention in recent years.

Link Prediction Network Embedding +1

Paper
Add Code

Stochastic Particle-Optimization Sampling and the Non-Asymptotic Convergence Theory

no code implementations • 5 Sep 2018 • Jianyi Zhang, Ruiyi Zhang, Lawrence Carin, Changyou Chen

Particle-optimization-based sampling (POS) is a recently developed effective sampling technique that interactively updates a set of particles.

POS

Paper
Add Code

Predicting Smoking Events with a Time-Varying Semi-Parametric Hawkes Process Model

no code implementations • 5 Sep 2018 • Matthew Engelhard, Hongteng Xu, Lawrence Carin, Jason A Oliver, Matthew Hallyburton, F Joseph McClernon

Health risks from cigarette smoking -- the leading cause of preventable death in the United States -- can be substantially reduced by quitting.

Paper
Add Code

Distilled Wasserstein Learning for Word Embedding and Topic Modeling

no code implementations • NeurIPS 2018 • Hongteng Xu, Wenlin Wang, Wei Liu, Lawrence Carin

When learning the topic model, we leverage a distilled underlying distance matrix to update the topic distributions and smoothly calculate the corresponding optimal transports.

Mortality Prediction Word Embeddings

Paper
Add Code

Sequence Generation with Guider Network

no code implementations • 2 Nov 2018 • Ruiyi Zhang, Changyou Chen, Zhe Gan, Wenlin Wang, Liqun Chen, Dinghan Shen, Guoyin Wang, Lawrence Carin

Sequence generation with reinforcement learning (RL) has received significant attention recently.

Reinforcement Learning (RL)

Paper
Add Code

Generative Adversarial Network Training is a Continual Learning Problem

no code implementations • ICLR 2019 • Kevin J Liang, Chunyuan Li, Guoyin Wang, Lawrence Carin

We hypothesize that this is at least in part due to the evolution of the generator distribution and the catastrophic forgetting tendency of neural networks, which leads to the discriminator losing the ability to remember synthesized samples from previous instantiations of the generator.

Continual Learning Generative Adversarial Network +1

Paper
Add Code

Scalable Model Selection for Belief Networks

no code implementations • NeurIPS 2017 • Zhao Song, Yusuke Muraoka, Ryohei Fujimaki, Lawrence Carin

We propose a scalable algorithm for model selection in sigmoid belief networks (SBNs), based on the factorized asymptotic Bayesian (FAB) framework.

Model Selection

Paper
Add Code

Targeting EEG/LFP Synchrony with Neural Nets

no code implementations • NeurIPS 2017 • Yitong Li, Michael Murias, Samantha Major, Geraldine Dawson, Kafui Dzirasa, Lawrence Carin, David E. Carlson

We consider the analysis of Electroencephalography (EEG) and Local Field Potential (LFP) datasets, which are “big” in terms of the size of recorded data but rarely have sufficient labels required to train complex models (e.g., conventional deep learning methods).

EEG

Paper
Add Code

Cross-Spectral Factor Analysis

no code implementations • NeurIPS 2017 • Neil Gallagher, Kyle R. Ulrich, Austin Talbot, Kafui Dzirasa, Lawrence Carin, David E. Carlson

To facilitate understanding of network-level synchronization between brain regions, we introduce a novel model of multisite low-frequency neural recordings, such as local field potentials (LFPs) and electroencephalograms (EEGs).

Paper
Add Code

Linear Feature Encoding for Reinforcement Learning

no code implementations • NeurIPS 2016 • Zhao Song, Ronald E. Parr, Xuejun Liao, Lawrence Carin

We then develop a supervised linear feature encoding method that is motivated by insights from linear value function approximation theory, as well as empirical successes from deep RL.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

GP Kernels for Cross-Spectrum Analysis

no code implementations • NeurIPS 2015 • Kyle R. Ulrich, David E. Carlson, Kafui Dzirasa, Lawrence Carin

An illustrative and motivating example of a multi-task problem is multi-region electrophysiological time-series data, where experimentalists are interested in both power and phase coherence between channels.

Gaussian Processes Time Series +1

Paper
Add Code

Deep Poisson Factor Modeling

no code implementations • NeurIPS 2015 • Ricardo Henao, Zhe Gan, James Lu, Lawrence Carin

We propose a new deep architecture for topic modeling, based on Poisson Factor Analysis (PFA) modules.

Topic Models

Paper
Add Code

Preconditioned Spectral Descent for Deep Learning

no code implementations • NeurIPS 2015 • David E. Carlson, Edo Collins, Ya-Ping Hsieh, Lawrence Carin, Volkan Cevher

These challenges include, but are not limited to, the non-convexity of learning objectives and estimating the quantities needed for optimization algorithms, such as gradients.

Paper
Add Code

Large-Scale Bayesian Multi-Label Learning via Topic-Based Label Embeddings

no code implementations • NeurIPS 2015 • Piyush Rai, Changwei Hu, Ricardo Henao, Lawrence Carin

We present a scalable Bayesian multi-label learning model based on learning low-dimensional label embeddings.

Data Augmentation Multi-Label Learning

Paper
Add Code

Bayesian Nonlinear Support Vector Machines and Discriminative Factor Modeling

no code implementations • NeurIPS 2014 • Ricardo Henao, Xin Yuan, Lawrence Carin

A new Bayesian formulation is developed for nonlinear support vector machines (SVMs), based on a Gaussian process and with the SVM hinge loss expressed as a scaled mixture of normals.

Paper
Add Code

On the relations of LFPs & Neural Spike Trains

no code implementations • NeurIPS 2014 • David E. Carlson, Jana Schaich Borg, Kafui Dzirasa, Lawrence Carin

One of the goals of neuroscience is to identify neural networks that correlate with important behaviors, environments, or genotypes.

Clustering Dictionary Learning

Paper
Add Code

Analysis of Brain States from Multi-Region LFP Time-Series

no code implementations • NeurIPS 2014 • Kyle R. Ulrich, David E. Carlson, Wenzhao Lian, Jana S. Borg, Kafui Dzirasa, Lawrence Carin

The LFPs are modeled as a mixture of GPs, with state- and region-dependent mixture weights, and with the spectral content of the data encoded in GP spectral mixture covariance kernels.

Gaussian Processes Time Series +1

Paper
Add Code

Dynamic Rank Factor Model for Text Streams

no code implementations • NeurIPS 2014 • Shaobo Han, Lin Du, Esther Salazar, Lawrence Carin

We propose a semi-parametric and dynamic rank factor model for topic modeling, capable of (1) discovering topic prevalence over time, and (2) learning contemporary multi-scale dependence structures, providing topic and word correlations as a byproduct.

Paper
Add Code

Compressive Sensing of Signals from a GMM with Sparse Precision Matrices

no code implementations • NeurIPS 2014 • Jianbo Yang, Xuejun Liao, Minhua Chen, Lawrence Carin

This paper is concerned with compressive sensing of signals drawn from a Gaussian mixture model (GMM) with sparse precision matrices.

Compressive Sensing

Paper
Add Code

Designed Measurements for Vector Count Data

no code implementations • NeurIPS 2013 • Liming Wang, David E. Carlson, Miguel Rodrigues, David Wilcox, Robert Calderbank, Lawrence Carin

We consider design of linear projection measurements for a vector Poisson signal model.

Compressive Sensing

Paper
Add Code

Integrated Non-Factorized Variational Inference

no code implementations • NeurIPS 2013 • Shaobo Han, Xuejun Liao, Lawrence Carin

We present a non-factorized variational method for full posterior inference in Bayesian hierarchical models, with the goal of capturing the posterior variable dependencies via efficient and possibly parallel computation.

Variational Inference

Paper
Add Code

Real-Time Inference for a Gamma Process Model of Neural Spiking

no code implementations • NeurIPS 2013 • David E. Carlson, Vinayak Rao, Joshua T. Vogelstein, Lawrence Carin

With simultaneous measurements from ever increasing populations of neurons, there is a growing need for sophisticated tools to recover signals from individual neurons.

Paper
Add Code

Joint Modeling of a Matrix with Associated Text via Latent Binary Features

no code implementations • NeurIPS 2012 • Xianxing Zhang, Lawrence Carin

The model is applied to roll-call data, with the associated documents defined by the legislation.

Paper
Add Code

Augment-and-Conquer Negative Binomial Processes

no code implementations • NeurIPS 2012 • Mingyuan Zhou, Lawrence Carin

By developing data augmentation methods unique to the negative binomial (NB) distribution, we unite seemingly disjoint count and mixture models under the NB process framework.

Data Augmentation

Paper
Add Code

On the Analysis of Multi-Channel Neural Spike Data

no code implementations • NeurIPS 2011 • Bo Chen, David E. Carlson, Lawrence Carin

Nonparametric Bayesian methods are developed for analysis of multi-channel spike-train data, with the feature learning and spike sorting performed jointly.

Dictionary Learning Spike Sorting

Paper
Add Code

The Kernel Beta Process

no code implementations • NeurIPS 2011 • Lu Ren, Yingjian Wang, Lawrence Carin, David B. Dunson

A new Lévy process prior is proposed for an uncountable collection of covariate-dependent feature-learning measures; the model is called the kernel beta process (KBP).

Paper
Add Code

Hierarchical Topic Modeling for Analysis of Time-Evolving Personal Choices

no code implementations • NeurIPS 2011 • Xianxing Zhang, Lawrence Carin, David B. Dunson

The nested Chinese restaurant process is extended to design a nonparametric topic-model tree for representation of human choices.

Paper
Add Code

Joint Analysis of Time-Evolving Binary Matrices and Associated Documents

no code implementations • NeurIPS 2010 • Eric Wang, Dehong Liu, Jorge Silva, Lawrence Carin, David B. Dunson

An objective of such analysis is to infer structure and inter-relationships underlying the matrices, here defined by latent features associated with each axis of the matrix.

Paper
Add Code

Learning to Explore and Exploit in POMDPs

no code implementations • NeurIPS 2009 • Chenghui Cai, Xuejun Liao, Lawrence Carin

In this paper we propose a dual-policy method for jointly learning the agent behavior and the balance between exploration and exploitation, in partially observable environments.

Active Learning

Paper
Add Code

A Bayesian Model for Simultaneous Image Clustering, Annotation and Object Segmentation

no code implementations • NeurIPS 2009 • Lan Du, Lu Ren, Lawrence Carin, David B. Dunson

The model clusters the images into classes, and each image is segmented into a set of objects, also allowing the opportunity to assign a word to each object (localized labeling).

Clustering Image Clustering +2

Paper
Add Code

Non-Parametric Bayesian Dictionary Learning for Sparse Image Representations

no code implementations • NeurIPS 2009 • Mingyuan Zhou, Haojun Chen, Lu Ren, Guillermo Sapiro, Lawrence Carin, John W. Paisley

The beta process is employed as a prior for learning the dictionary, and this non-parametric method naturally infers an appropriate dictionary size.

Bayesian Inference Compressive Sensing +2

Paper
Add Code

Second-Order Adversarial Attack and Certifiable Robustness

no code implementations • ICLR 2019 • Bai Li, Changyou Chen, Wenlin Wang, Lawrence Carin

In this paper, we propose a powerful second-order attack method that reduces the accuracy of the defense model by Madry et al. (2017).

Adversarial Attack

Paper
Add Code

On the Use of Word Embeddings Alone to Represent Natural Language Sequences

no code implementations • ICLR 2018 • Dinghan Shen, Guoyin Wang, Wenlin Wang, Martin Renqiang Min, Qinliang Su, Yizhe Zhang, Ricardo Henao, Lawrence Carin

In this paper, we conduct an extensive comparative study between Simple Word Embeddings-based Models (SWEMs), with no compositional parameters, relative to employing word embeddings within RNN/CNN-based models.

Sentence Word Embeddings

Paper
Add Code

Improving Sequence-to-Sequence Learning via Optimal Transport

no code implementations • ICLR 2019 • Liqun Chen, Yizhe Zhang, Ruiyi Zhang, Chenyang Tao, Zhe Gan, Haichao Zhang, Bai Li, Dinghan Shen, Changyou Chen, Lawrence Carin

Sequence-to-sequence models are commonly trained via maximum likelihood estimation (MLE).

Abstractive Text Summarization Image Captioning +3

Paper
Add Code

Multi-Shot Imaging: Joint Alignment, Deblurring and Resolution-Enhancement

no code implementations • CVPR 2014 • Haichao Zhang, Lawrence Carin

Registering multiple blurry images is a challenging task due to the presence of blur while deblurring of multiple blurry images requires accurate alignment, leading to an intrinsically coupled problem.

Deblurring Image Restoration

Paper
Add Code

Learning Weight Uncertainty With Stochastic Gradient MCMC for Shape Classification

no code implementations • CVPR 2016 • Chunyuan Li, Andrew Stevens, Changyou Chen, Yunchen Pu, Zhe Gan, Lawrence Carin

Learning the representation of shape cues in 2D & 3D objects for recognition is a fundamental task in computer vision.

General Classification Stochastic Optimization

Paper
Add Code

Towards Generating Long and Coherent Text with Multi-Level Latent Variable Models

no code implementations • ACL 2019 • Dinghan Shen, Asli Celikyilmaz, Yizhe Zhang, Liqun Chen, Xin Wang, Jianfeng Gao, Lawrence Carin

Variational autoencoders (VAEs) have received much attention recently as an end-to-end architecture for text generation with latent variables.

Sentence Text Generation

Paper
Add Code

Scalable Thompson Sampling via Optimal Transport

no code implementations • 19 Feb 2019 • Ruiyi Zhang, Zheng Wen, Changyou Chen, Lawrence Carin

Thompson sampling (TS) is a class of algorithms for sequential decision-making, which requires maintaining a posterior distribution over a model.

Decision Making Thompson Sampling

Paper
Add Code
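
As background for the snippet above, Thompson sampling can be sketched in its textbook form: keep a posterior per action, draw one sample from each posterior, and act greedily on the draws. The example below uses an exact Beta-Bernoulli posterior on a simulated bandit. It is not the optimal-transport approximation of the listed paper, and the function name `thompson_bernoulli`, the arm probabilities, and the round count are assumptions made for illustration.

```python
import random


def thompson_bernoulli(true_probs, n_rounds, seed=0):
    """Beta-Bernoulli Thompson sampling on a simulated bandit; returns pull counts."""
    rng = random.Random(seed)
    n_arms = len(true_probs)
    successes = [1] * n_arms  # Beta(1, 1) uniform prior for every arm
    failures = [1] * n_arms
    pulls = [0] * n_arms
    for _ in range(n_rounds):
        # Draw one sample from each arm's posterior, then pick the largest draw.
        draws = [rng.betavariate(successes[a], failures[a]) for a in range(n_arms)]
        arm = max(range(n_arms), key=draws.__getitem__)
        # Simulate a Bernoulli reward from the (hidden) true probability.
        reward = 1 if rng.random() < true_probs[arm] else 0
        # Conjugate posterior update for the chosen arm.
        successes[arm] += reward
        failures[arm] += 1 - reward
        pulls[arm] += 1
    return pulls


pulls = thompson_bernoulli([0.2, 0.5, 0.8], n_rounds=2000)
```

Because the posterior here is conjugate, maintaining it costs two counters per arm; the snippet's point is that such exact maintenance is not available for general models, which is where approximate schemes come in.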

Topic-Guided Variational Autoencoders for Text Generation

no code implementations • 17 Mar 2019 • Wenlin Wang, Zhe Gan, Hongteng Xu, Ruiyi Zhang, Guoyin Wang, Dinghan Shen, Changyou Chen, Lawrence Carin

We propose a topic-guided variational autoencoder (TGVAE) model for text generation.

Conditional Text Generation

Paper
Add Code

Thyroid Cancer Malignancy Prediction From Whole Slide Cytopathology Images

no code implementations • 29 Mar 2019 • David Dov, Shahar Kovalsky, Jonathan Cohen, Danielle Range, Ricardo Henao, Lawrence Carin

We consider preoperative prediction of thyroid cancer based on ultra-high-resolution whole-slide cytopathology images.

Multiple Instance Learning

Paper
Add Code

Weakly Supervised Instance Learning for Thyroid Malignancy Prediction from Whole Slide Cytopathology Images

no code implementations • 26 Apr 2019 • David Dov, Shahar Ziv Kovalsky, Serge Assaad, Avani A. Pendse, Jonathan Cohen, Danielle Elliott Range, Ricardo Henao, Lawrence Carin

The lower bound further allows us to extend the proposed algorithm to simultaneously predict multiple bag and instance-level labels from a single output of a neural network.

Informativeness Multiple Instance Learning +2

Paper
Add Code

Stochastic Blockmodels meet Graph Neural Networks

no code implementations • Proceedings of the 36th International Conference on Machine Learning 2019 • Nikhil Mehta, Lawrence Carin, Piyush Rai

Although we develop this framework for a particular type of SBM, namely the overlapping stochastic blockmodel, the proposed framework can be adapted readily for other types of SBMs.

Link Prediction

Paper
Add Code

On Norm-Agnostic Robustness of Adversarial Training

no code implementations • 15 May 2019 • Bai Li, Changyou Chen, Wenlin Wang, Lawrence Carin

Adversarial examples are carefully perturbed inputs for fooling machine learning models.

BIG-bench Machine Learning

Paper
Add Code

Improving Textual Network Embedding with Global Attention via Optimal Transport

no code implementations • ACL 2019 • Liqun Chen, Guoyin Wang, Chenyang Tao, Dinghan Shen, Pengyu Cheng, Xinyuan Zhang, Wenlin Wang, Yizhe Zhang, Lawrence Carin

Constituting highly informative network embeddings is an important tool for network analysis.

Network Embedding

Paper
Add Code

Syntax-Infused Variational Autoencoder for Text Generation

no code implementations • ACL 2019 • Xinyuan Zhang, Yi Yang, Siyang Yuan, Dinghan Shen, Lawrence Carin

We present a syntax-infused variational autoencoder (SIVAE), that integrates sentences with their syntactic trees to improve the grammar of generated sentences.

Sentence Text Generation

Paper
Add Code

Adaptation Across Extreme Variations using Unlabeled Domain Bridges

no code implementations • 5 Jun 2019 • Shuyang Dai, Kihyuk Sohn, Yi-Hsuan Tsai, Lawrence Carin, Manmohan Chandraker

We tackle an unsupervised domain adaptation problem for which the domain discrepancy between labeled source and unlabeled target domains is large, due to many factors of inter and intra-domain variation.

Object Recognition Semantic Segmentation +1

Paper
Add Code

Topic-Guided Variational Auto-Encoder for Text Generation

no code implementations • NAACL 2019 • Wenlin Wang, Zhe Gan, Hongteng Xu, Ruiyi Zhang, Guoyin Wang, Dinghan Shen, Changyou Chen, Lawrence Carin

We propose a topic-guided variational auto-encoder (TGVAE) model for text generation.

Conditional Text Generation

Paper
Add Code

Interpretable ICD Code Embeddings with Self- and Mutual-Attention Mechanisms

no code implementations • 13 Jun 2019 • Dixin Luo, Hongteng Xu, Lawrence Carin

The proposed method achieves clinically-interpretable embeddings of ICD codes, and outperforms state-of-the-art embedding methods in procedure recommendation.

Paper
Add Code

Adversarial Self-Paced Learning for Mixture Models of Hawkes Processes

no code implementations • 20 Jun 2019 • Dixin Luo, Hongteng Xu, Lawrence Carin

Instead of learning a mixture model directly from a set of event sequences drawn from different Hawkes processes, the proposed method learns the target model iteratively, which generates "easy" sequences and uses them in an adversarial and self-paced manner.

Data Augmentation

Paper
Add Code

LMVP: Video Predictor with Leaked Motion Information

no code implementations • 24 Jun 2019 • Dong Wang, Yitong Li, Wei Cao, Liqun Chen, Qi Wei, Lawrence Carin

We propose a Leaked Motion Video Predictor (LMVP) to predict future frames by capturing the spatial and temporal dependencies from given inputs.

Contrastively Smoothed Class Alignment for Unsupervised Domain Adaptation

no code implementations • 11 Sep 2019 • Shuyang Dai, Yu Cheng, Yizhe Zhang, Zhe Gan, Jingjing Liu, Lawrence Carin

Recent unsupervised approaches to domain adaptation primarily focus on minimizing the gap between the source and the target domains through refining the feature generator, in order to learn a better alignment between the two domains.

domain classification Unsupervised Domain Adaptation

Straight-Through Estimator as Projected Wasserstein Gradient Flow

no code implementations • 5 Oct 2019 • Pengyu Cheng, Chang Liu, Chunyuan Li, Dinghan Shen, Ricardo Henao, Lawrence Carin

The Straight-Through (ST) estimator is a widely used technique for back-propagating gradients through discrete random variables.
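
As a rough illustration of the Straight-Through idea mentioned in this abstract, the sketch below estimates a gradient through a single Bernoulli variable. It is a generic textbook-style example, not the paper's method: the function names `sigmoid` and `st_gradient`, the squared-error loss, and the sigmoid parameterization are all assumptions chosen for illustration. The forward pass draws a hard 0/1 sample; the backward pass pretends the sample is the probability itself (derivative 1) and then chains through the sigmoid.

```python
import math
import random


def sigmoid(x):
    """Logistic function mapping a logit to a probability."""
    return 1.0 / (1.0 + math.exp(-x))


def st_gradient(theta, target, n_samples=10_000, seed=0):
    """Average Straight-Through gradient of (b - target)^2 w.r.t. the logit theta.

    b ~ Bernoulli(sigmoid(theta)) is sampled in the forward pass.
    In the backward pass the sampling step is treated as the identity (db/dp = 1).
    """
    rng = random.Random(seed)
    p = sigmoid(theta)
    total = 0.0
    for _ in range(n_samples):
        b = 1.0 if rng.random() < p else 0.0  # forward: hard, non-differentiable sample
        dloss_db = 2.0 * (b - target)         # exact gradient of the loss w.r.t. b
        dp_dtheta = p * (1.0 - p)             # exact gradient of the sigmoid
        total += dloss_db * 1.0 * dp_dtheta   # the factor 1.0 is the ST surrogate for db/dp
    return total / n_samples


if __name__ == "__main__":
    # With target = 1, raising theta makes b = 1 more likely, so the gradient should be negative.
    print(st_gradient(theta=0.0, target=1.0))
```

The resulting estimate is biased in general, since the identity surrogate ignores how the loss actually varies between the two discrete outcomes.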

Fused Gromov-Wasserstein Alignment for Hawkes Processes

no code implementations • 4 Oct 2019 • Dixin Luo, Hongteng Xu, Lawrence Carin

Accordingly, the learned optimal transport reflects the correspondence between the event types of these two Hawkes processes.

Zero-Shot Recognition via Optimal Transport

no code implementations • 20 Oct 2019 • Wenlin Wang, Hongteng Xu, Guoyin Wang, Wenqi Wang, Lawrence Carin

Specifically, we build a conditional generative model to generate features from seen-class attributes, and establish an optimal transport between the distribution of the generated features and that of the real features.

Attribute Generalized Zero-Shot Learning

Learning to Recommend from Sparse Data via Generative User Feedback

no code implementations • ICLR 2020 • Wenlin Wang, Hongteng Xu, Ruiyi Zhang, Wenqi Wang, Piyush Rai, Lawrence Carin

To address this, we propose a learning framework that improves collaborative filtering with a synthetic feedback loop (CF-SFL) to simulate the user feedback.

Collaborative Filtering Recommendation Systems

An End-to-End Generative Architecture for Paraphrase Generation

no code implementations • IJCNLP 2019 • Qian Yang, Zhouyuan Huo, Dinghan Shen, Yong Cheng, Wenlin Wang, Guoyin Wang, Lawrence Carin

Generating high-quality paraphrases is a fundamental yet challenging natural language processing task.

Paraphrase Generation

Syntax-Infused Transformer and BERT models for Machine Translation and Natural Language Understanding

no code implementations • 10 Nov 2019 • Dhanasekar Sundararaman, Vivek Subramanian, Guoyin Wang, Shijing Si, Dinghan Shen, Dong Wang, Lawrence Carin

Attention-based models have shown significant improvement over traditional algorithms in several NLP tasks.

Machine Translation Natural Language Understanding +2

Graph-Driven Generative Models for Heterogeneous Multi-Task Learning

no code implementations • 20 Nov 2019 • Wenlin Wang, Hongteng Xu, Zhe Gan, Bai Li, Guoyin Wang, Liqun Chen, Qian Yang, Wenqi Wang, Lawrence Carin

We propose a novel graph-driven generative model that unifies multiple heterogeneous learning tasks into the same framework.

Multi-Task Learning Type prediction

Toward Automatic Threat Recognition for Airport X-ray Baggage Screening with Deep Convolutional Object Detection

no code implementations • 13 Dec 2019 • Kevin J Liang, John B. Sigman, Gregory P. Spell, Dan Strellis, William Chang, Felix Liu, Tejas Mehta, Lawrence Carin

We show performance of our models on held-out evaluation sets, analyze several design parameters, and demonstrate the potential of such systems for automated detection of threats that can be found in airports.

object-detection Object Detection

Nested-Wasserstein Self-Imitation Learning for Sequence Generation

no code implementations • 20 Jan 2020 • Ruiyi Zhang, Changyou Chen, Zhe Gan, Zheng Wen, Wenlin Wang, Lawrence Carin

Reinforcement learning (RL) has been widely studied for improving sequence-generation models.

Imitation Learning reinforcement-learning +1

Object Detection as a Positive-Unlabeled Problem

no code implementations • 11 Feb 2020 • Yuewei Yang, Kevin J Liang, Lawrence Carin

These missing annotations can be problematic, as the standard cross-entropy loss employed to train object detection models treats classification as a positive-negative (PN) problem: unlabeled regions are implicitly assumed to be background.

General Classification Object +2

Continual Learning using a Bayesian Nonparametric Dictionary of Weight Factors

no code implementations • 21 Apr 2020 • Nikhil Mehta, Kevin J Liang, Vinay K Verma, Lawrence Carin

Naively trained neural networks tend to experience catastrophic forgetting in sequential task settings, where data from previous tasks are unavailable.

Continual Learning Transfer Learning

Transferable Perturbations of Deep Feature Distributions

no code implementations • ICLR 2020 • Nathan Inkawhich, Kevin J Liang, Lawrence Carin, Yiran Chen

Almost all current adversarial attacks of CNN classifiers rely on information derived from the output layer of the network.

Adversarial Attack

Perturbing Across the Feature Hierarchy to Improve Standard and Strict Blackbox Attack Transferability

no code implementations • NeurIPS 2020 • Nathan Inkawhich, Kevin J Liang, Binghui Wang, Matthew Inkawhich, Lawrence Carin, Yiran Chen

We consider the blackbox transfer-based targeted adversarial attack threat model in the realm of deep neural network (DNN) image classifiers.

Adversarial Attack

APo-VAE: Text Generation in Hyperbolic Space

no code implementations • NAACL 2021 • Shuyang Dai, Zhe Gan, Yu Cheng, Chenyang Tao, Lawrence Carin, Jingjing Liu

In this paper, we investigate text generation in a hyperbolic latent space to learn continuous hierarchical representations.

Language Modelling Response Generation +1

Reward Constrained Interactive Recommendation with Natural Language Feedback

no code implementations • 4 May 2020 • Ruiyi Zhang, Tong Yu, Yilin Shen, Hongxia Jin, Changyou Chen, Lawrence Carin

Text-based interactive recommendation provides richer user feedback and has demonstrated advantages over traditional interactive recommender systems.

Recommendation Systems reinforcement-learning +2

Improving Adversarial Text Generation by Modeling the Distant Future

no code implementations • ACL 2020 • Ruiyi Zhang, Changyou Chen, Zhe Gan, Wenlin Wang, Dinghan Shen, Guoyin Wang, Zheng Wen, Lawrence Carin

Auto-regressive text generation models usually focus on local fluency, and may cause inconsistent semantic meaning in long text generation.

Adversarial Text Imitation Learning +1

Y-Net for Chest X-Ray Preprocessing: Simultaneous Classification of Geometry and Segmentation of Annotations

no code implementations • 8 May 2020 • John McManigle, Raquel Bartz, Lawrence Carin

A modified Y-Net architecture based on the VGG11 encoder is used to simultaneously learn geometric orientation (similarity transform parameters) of the chest and segmentation of radiographic annotations.

Classification General Classification +3

Improving Disentangled Text Representation Learning with Information-Theoretic Guidance

no code implementations • ACL 2020 • Pengyu Cheng, Martin Renqiang Min, Dinghan Shen, Christopher Malon, Yizhe Zhang, Yitong Li, Lawrence Carin

Learning disentangled representations of natural language is essential for many NLP tasks, e.g., conditional text generation, style transfer, personalized dialogue systems, etc.

Conditional Text Generation Representation Learning +2

Hierarchical Optimal Transport for Robust Multi-View Learning

no code implementations • 4 Jun 2020 • Dixin Luo, Hongteng Xu, Lawrence Carin

Traditional multi-view learning methods often rely on two assumptions: ($i$) the samples in different views are well-aligned, and ($ii$) their representations in latent space obey the same distribution.

Clustering MULTI-VIEW LEARNING

Towards Understanding Fast Adversarial Training

no code implementations • 4 Jun 2020 • Bai Li, Shiqi Wang, Suman Jana, Lawrence Carin

Current neural-network-based classifiers are susceptible to adversarial examples.

Scalable Control Variates for Monte Carlo Methods via Stochastic Optimization

no code implementations • 12 Jun 2020 • Shijing Si, Chris J. Oates, Andrew B. Duncan, Lawrence Carin, François-Xavier Briol

Control variates are a well-established tool to reduce the variance of Monte Carlo estimators.

Bayesian Inference Stochastic Optimization
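
To make the variance-reduction claim in this abstract concrete, here is a minimal sketch of a classic linear control variate. It is not the scalable method the paper proposes. The integrand exp(U), the choice of U itself as the control variate, and the function name `control_variate_estimate` are assumptions picked for illustration. The idea is to subtract a correlated quantity whose mean is known exactly (here E[U] = 0.5), scaled by a coefficient estimated from the samples.

```python
import math
import random


def control_variate_estimate(n_samples=10_000, seed=0):
    """Estimate E[exp(U)] for U ~ Uniform(0, 1), with and without a control variate.

    Returns (plain_mean, adjusted_mean, plain_variance, adjusted_variance),
    where the variances are per-sample variances of the two estimators' summands.
    """
    rng = random.Random(seed)
    u = [rng.random() for _ in range(n_samples)]
    f = [math.exp(x) for x in u]  # integrand samples
    g = u                         # control variate with known mean 0.5

    mean_f = sum(f) / n_samples
    mean_g = sum(g) / n_samples

    # Coefficient beta = Cov(f, g) / Var(g), estimated from the same samples.
    cov_fg = sum((a - mean_f) * (b - mean_g) for a, b in zip(f, g)) / (n_samples - 1)
    var_g = sum((b - mean_g) ** 2 for b in g) / (n_samples - 1)
    beta = cov_fg / var_g

    # Subtract the centred control variate from every integrand sample.
    adjusted = [a - beta * (b - 0.5) for a, b in zip(f, g)]
    mean_adj = sum(adjusted) / n_samples

    var_f = sum((a - mean_f) ** 2 for a in f) / (n_samples - 1)
    var_adj = sum((a - mean_adj) ** 2 for a in adjusted) / (n_samples - 1)
    return mean_f, mean_adj, var_f, var_adj


if __name__ == "__main__":
    plain, adjusted, var_plain, var_adjusted = control_variate_estimate()
    print(plain, adjusted, var_plain, var_adjusted)  # exact value is e - 1
```

Estimating beta from the same samples introduces a small bias that shrinks as the sample size grows; a separate pilot run would avoid it.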

Bridging Maximum Likelihood and Adversarial Learning via $α$-Divergence

no code implementations • 13 Jul 2020 • Miaoyun Zhao, Yulai Cong, Shuyang Dai, Lawrence Carin

Maximum likelihood (ML) and adversarial learning are two popular approaches for training generative models, and from many perspectives these techniques are complementary.

WAFFLe: Weight Anonymized Factorization for Federated Learning

no code implementations • 13 Aug 2020 • Weituo Hao, Nikhil Mehta, Kevin J Liang, Pengyu Cheng, Mostafa El-Khamy, Lawrence Carin

Experiments on MNIST, FashionMNIST, and CIFAR-10 demonstrate WAFFLe's significant improvement to local test performance and fairness while simultaneously providing an extra layer of security.

Fairness Federated Learning

Weakly supervised cross-domain alignment with optimal transport

no code implementations • 14 Aug 2020 • Siyang Yuan, Ke Bai, Liqun Chen, Yizhe Zhang, Chenyang Tao, Chunyuan Li, Guoyin Wang, Ricardo Henao, Lawrence Carin

Cross-domain alignment between image objects and text sequences is key to many visual-language tasks, and it poses a fundamental challenge to both computer vision and natural language processing.

Towards Robust and Efficient Contrastive Textual Representation Learning

no code implementations • 1 Jan 2021 • Liqun Chen, Yizhe Zhang, Dianqi Li, Chenyang Tao, Dong Wang, Lawrence Carin

There has been growing interest in representation learning for text data, based on theoretical arguments and empirical evidence.

Contrastive Learning Representation Learning

Background Adaptive Faster R-CNN for Semi-Supervised Convolutional Object Detection of Threats in X-Ray Images

no code implementations • 2 Oct 2020 • John B. Sigman, Gregory P. Spell, Kevin J Liang, Lawrence Carin

The data sources described earlier make two "domains": a hand-collected data domain of images with threats, and a real-world domain of images assumed without threats.

Domain Adaptation Object +2

Improving Text Generation with Student-Forcing Optimal Transport

no code implementations • EMNLP 2020 • Guoyin Wang, Chunyuan Li, Jianqiao Li, Hao Fu, Yuh-Chen Lin, Liqun Chen, Yizhe Zhang, Chenyang Tao, Ruiyi Zhang, Wenlin Wang, Dinghan Shen, Qian Yang, Lawrence Carin

An extension is further proposed to improve the OT learning, based on the structural and contextual information of the text sequences.

Machine Translation Text Generation +2

RetiNerveNet: Using Recursive Deep Learning to Estimate Pointwise 24-2 Visual Field Data based on Retinal Structure

no code implementations • 15 Oct 2020 • Shounak Datta, Eduardo B. Mariottoni, David Dov, Alessandro A. Jammal, Lawrence Carin, Felipe A. Medeiros

Due to the SAP test's innate difficulty and its high test-retest variability, we propose the RetiNerveNet, a deep convolutional recursive neural network for obtaining estimates of the SAP visual field.

Counterfactual Representation Learning with Balancing Weights

no code implementations • 23 Oct 2020 • Serge Assaad, Shuxi Zeng, Chenyang Tao, Shounak Datta, Nikhil Mehta, Ricardo Henao, Fan Li, Lawrence Carin

A key to causal inference with observational data is achieving balance in predictive features associated with each treatment type.

Causal Inference counterfactual +1

MixKD: Towards Efficient Distillation of Large-scale Language Models

no code implementations • ICLR 2021 • Kevin J Liang, Weituo Hao, Dinghan Shen, Yufan Zhou, Weizhu Chen, Changyou Chen, Lawrence Carin

Large-scale language models have recently demonstrated impressive empirical performance.

Data Augmentation Knowledge Distillation

AutoSync: Learning to Synchronize for Data-Parallel Distributed Deep Learning

no code implementations • NeurIPS 2020 • Hao Zhang, Yuan Li, Zhijie Deng, Xiaodan Liang, Lawrence Carin, Eric Xing

Synchronization is a key step in data-parallel distributed machine learning (ML).

Transfer Learning

Calibrating CNNs for Lifelong Learning

no code implementations • NeurIPS 2020 • Pravendra Singh, Vinay Kumar Verma, Pratik Mazumder, Lawrence Carin, Piyush Rai

Further, our approach does not require storing data samples from the old tasks, which is done by many replay based methods.

Continual Learning

An Embedding Model for Estimating Legislative Preferences from the Frequency and Sentiment of Tweets

no code implementations • EMNLP 2020 • Gregory Spell, Brian Guay, Sunshine Hillygus, Lawrence Carin

Legislator preferences are typically represented as measures of general ideology estimated from roll call votes on legislation, potentially masking important nuances in legislators' political attitudes.

Methods for Numeracy-Preserving Word Embeddings

no code implementations • EMNLP 2020 • Dhanasekar Sundararaman, Shijing Si, Vivek Subramanian, Guoyin Wang, Devamanyu Hazarika, Lawrence Carin

We propose a new methodology to assign and learn embeddings for numbers.

Question Answering Word Embeddings

