Search Results for author: Rahul Singh

Found 62 papers, 4 papers with code

CiteQA@CLSciSumm 2020

no code implementations EMNLP (sdp) 2020 Anjana Umapathy, Karthik Radhakrishnan, Kinjal Jain, Rahul Singh

In academic publications, citations are used to build context for a concept by highlighting relevant aspects from reference papers.

Program Evaluation with Remotely Sensed Outcomes

no code implementations17 Nov 2024 Ashesh Rambachan, Rahul Singh, Davide Viviano

While traditional program evaluations typically rely on surveys to measure outcomes, certain economic outcomes such as living standards or environmental quality may be infeasible or costly to collect.

Causal Inference

Provably Adaptive Average Reward Reinforcement Learning for Metric Spaces

no code implementations25 Oct 2024 Avik Kar, Rahul Singh

We note that the existing notion of zooming dimension for average reward RL is defined in terms of policy coverings, and hence it can be huge when the policy class is rich even though the underlying MDP is simple, so that the regret upper bound is nearly $O(T)$.

reinforcement-learning Reinforcement Learning +1

Latent Representation Learning for Multimodal Brain Activity Translation

no code implementations27 Sep 2024 Arman Afrasiyabi, Dhananjay Bhaskar, Erica L. Busch, Laurent Caplette, Rahul Singh, Guillaume Lajoie, Nicholas B. Turk-Browne, Smita Krishnaswamy

Neuroscience employs diverse neuroimaging techniques, each offering distinct insights into brain activity, from electrophysiological recordings such as EEG, which have high temporal resolution, to hemodynamic modalities such as fMRI, which have increased spatial precision.

EEG Functional Connectivity +3

Downstream bias mitigation is all you need

no code implementations1 Aug 2024 Arkadeep Baksi, Rahul Singh, Tarun Joshi

The advent of transformer-based architectures and large language models (LLMs) have significantly advanced the performance of natural language processing (NLP) models.

Automatic Generation of Behavioral Test Cases For Natural Language Processing Using Clustering and Prompting

no code implementations31 Jul 2024 Ying Li, Rahul Singh, Tarun Joshi, Agus Sudjianto

Recent work in behavioral testing for natural language processing (NLP) models, such as Checklist, is inspired by related paradigms in software engineering testing.

Policy Zooming: Adaptive Discretization-based Infinite-Horizon Average-Reward Reinforcement Learning

no code implementations29 May 2024 Avik Kar, Rahul Singh

We study infinite-horizon average-reward reinforcement learning (RL) for Lipschitz MDPs and develop an algorithm PZRL that discretizes the state-action space adaptively and zooms in to promising regions of the "policy space" which seems to yield high average rewards.

reinforcement-learning Reinforcement Learning (RL)

On Learning for Ambiguous Chance Constrained Problems

no code implementations31 Dec 2023 A Ch Madhusudanarao, Rahul Singh

When the DM has access to a set of distributions $\mathcal{U}$ such that $P$ is contained in $\mathcal{U}$, then the problem is known as the ambiguous chance-constrained problem \cite{erdougan2006ambiguous}.

Related Rhythms: Recommendation System To Discover Music You May Like

no code implementations24 Sep 2023 Rahul Singh, Pranav Kanuparthi

Machine Learning models are being utilized extensively to drive recommender systems, which is a widely explored topic today.

Recommendation Systems

Kernel Ridge Regression Inference

no code implementations13 Feb 2023 Rahul Singh, Suhas Vijaykumar

We provide uniform inference and confidence bands for kernel ridge regression (KRR), a widely-used non-parametric regression estimator for general data types including rankings, images, and graphs.

regression valid

Signed Graph Neural Networks: A Frequency Perspective

no code implementations15 Aug 2022 Rahul Singh, Yongxin Chen

Graph convolutional networks (GCNs) and its variants are designed for unsigned graphs containing only positive links.

Link Sign Prediction Node Classification

Quantifying Inherent Randomness in Machine Learning Algorithms

no code implementations24 Jun 2022 Soham Raste, Rahul Singh, Joel Vaughan, Vijayan N. Nair

Among the different algorithms, randomness in model training causes larger variation for FFNNs compared to tree-based methods.

BIG-bench Machine Learning

Understanding Metrics for Paraphrasing

no code implementations26 May 2022 Omkar Patil, Rahul Singh, Tarun Joshi

This is not only because of the limitations in text generation capabilities but also due that to the lack of a proper definition of what qualifies as a paraphrase and corresponding metrics to measure how good it is.

Paraphrase Generation

Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation

no code implementations26 Feb 2022 Guojun Xiong, Shufan Wang, Jian Li, Rahul Singh

Using this structural result, we establish the indexability of our problem, and employ the Whittle index policy to minimize average latency.

Edge-computing Q-Learning +1

Augmented RBMLE-UCB Approach for Adaptive Control of Linear Quadratic Systems

no code implementations25 Jan 2022 Akshay Mete, Rahul Singh, P. R. Kumar

We consider the problem of controlling an unknown stochastic linear system with quadratic costs - called the adaptive LQ control problem.

Thompson Sampling

Nested Nonparametric Instrumental Variable Regression: Long Term, Mediated, and Time Varying Treatment Effects

no code implementations28 Dec 2021 Isaac Meza, Rahul Singh

We analyze adversarial estimators of nested NPIV, and provide sufficient conditions for efficient inference on the causal parameter.

BIG-bench Machine Learning Causal Inference +1

Generalized Kernel Ridge Regression for Causal Inference with Missing-at-Random Sample Selection

no code implementations9 Nov 2021 Rahul Singh

I propose kernel ridge regression estimators for nonparametric dose response curves and semiparametric treatment effects in the setting where an analyst has access to a selected sample rather than a random sample; only for select observations, the outcome is observed.

Causal Inference counterfactual +1

Sequential Kernel Embedding for Mediated and Time-Varying Dose Response Curves

no code implementations6 Nov 2021 Rahul Singh, Liyuan Xu, Arthur Gretton

We propose simple nonparametric estimators for mediated and time-varying dose response curves based on kernel ridge regression.

Causal Inference counterfactual

Pruning Attention Heads of Transformer Models Using A* Search: A Novel Approach to Compress Big NLP Architectures

no code implementations28 Oct 2021 Archit Parnami, Rahul Singh, Tarun Joshi

Our results indicate that the method could eliminate as much as 40% of the attention heads in the BERT transformer model with no loss in accuracy.

Inference of collective Gaussian hidden Markov models

no code implementations24 Jul 2021 Rahul Singh, Yongxin Chen

We consider inference problems for a class of continuous state collective hidden Markov models, where the data is recorded in aggregate (collective) form generated by a large population of individuals following the same dynamics.

A Simple and General Debiased Machine Learning Theorem with Finite Sample Guarantees

no code implementations31 May 2021 Victor Chernozhukov, Whitney K. Newey, Rahul Singh

Debiased machine learning is a meta algorithm based on bias correction and sample splitting to calculate confidence intervals for functionals, i. e. scalar summaries, of machine learning algorithms.

BIG-bench Machine Learning Learning Theory

Self-interpretable Convolutional Neural Networks for Text Classification

no code implementations18 May 2021 Wei Zhao, Rahul Singh, Tarun Joshi, Agus Sudjianto, Vijayan N. Nair

We also study the impact of the complexity of the convolutional layers and the classification layers on the model performance.

text-classification Text Classification

Robustness Tests of NLP Machine Learning Models: Search and Semantically Replace

no code implementations20 Apr 2021 Rahul Singh, Karan Jindal, Yufei Yu, Hanyu Yang, Tarun Joshi, Matthew A. Campbell, Wayne B. Shoumaker

This paper proposes a strategy to assess the robustness of different machine learning models that involve natural language processing (NLP).

BIG-bench Machine Learning

Some parametric tests based on sample spacings

no code implementations11 Feb 2021 Rahul Singh, Neeraj Misra

The proposed tests provide alternative to similar tests based on simple spacings (i. e., $m=1$), that were proposed earlier in the literature.

Statistics Theory Statistics Theory 62F03, 62F05

Straggler-Resilient Distributed Machine Learning with Dynamic Backup Workers

no code implementations11 Feb 2021 Guojun Xiong, Gang Yan, Rahul Singh, Jian Li

In this paradigm, each worker maintains a local estimate of the optimal parameter vector, and iteratively updates it by waiting and averaging all estimates obtained from its neighbors, and then corrects it on the basis of its local dataset.

BIG-bench Machine Learning Distributed Optimization

Low-Power Status Updates via Sleep-Wake Scheduling

no code implementations4 Feb 2021 Ahmed M. Bedewy, Yin Sun, Rahul Singh, Ness B. Shroff

We devise a low-complexity solution to solve this problem and prove that, for practical sensing times that are short, the solution is within a small gap from the optimum AoI performance.

Information Theory Information Theory

Learning Augmented Index Policy for Optimal Service Placement at the Network Edge

no code implementations10 Jan 2021 Guojun Xiong, Rahul Singh, Jian Li

We pose the problem as a Markov decision process (MDP) in which the system state is given by describing, for each service, the number of customers that are currently waiting at the edge to obtain the service.

Q-Learning

Adversarial Estimation of Riesz Representers

no code implementations30 Dec 2020 Victor Chernozhukov, Whitney Newey, Rahul Singh, Vasilis Syrgkanis

Many causal parameters are linear functionals of an underlying regression.

Kernel Methods for Unobserved Confounding: Negative Controls, Proxies, and Instruments

no code implementations18 Dec 2020 Rahul Singh

Negative control is a strategy for learning the causal relationship between treatment and outcome in the presence of unmeasured confounding.

Learning Hidden Markov Models from Aggregate Observations

no code implementations23 Nov 2020 Rahul Singh, Qinsheng Zhang, Yongxin Chen

This problem arises when only the population level counts of the number of individuals at each time step are available, from which one seeks to learn the individual hidden Markov model.

Reward Biased Maximum Likelihood Estimation for Reinforcement Learning

no code implementations16 Nov 2020 Akshay Mete, Rahul Singh, Xi Liu, P. R. Kumar

The Reward-Biased Maximum Likelihood Estimate (RBMLE) for adaptive control of Markov chains was proposed to overcome the central obstacle of what is variously called the fundamental "closed-identifiability problem" of adaptive control, the "dual control problem", or, contemporaneously, the "exploration vs. exploitation problem".

reinforcement-learning Reinforcement Learning +2

Unwrapping The Black Box of Deep ReLU Networks: Interpretability, Diagnostics, and Simplification

1 code implementation8 Nov 2020 Agus Sudjianto, William Knauth, Rahul Singh, Zebin Yang, Aijun Zhang

We propose the local linear profile plot and other visualization methods for interpretation and diagnostics, and an effective merging strategy for network simplification.

Filtering for Aggregate Hidden Markov Models with Continuous Observations

no code implementations4 Nov 2020 Qinsheng Zhang, Rahul Singh, Yongxin Chen

We consider a class of filtering problems for large populations where each individual is modeled by the same hidden Markov model (HMM).

Multi-Armed Bandits with Dependent Arms

no code implementations13 Oct 2020 Rahul Singh, Fang Liu, Yin Sun, Ness Shroff

We study a variant of the classical multi-armed bandit problem (MABP) which we call as Multi-Armed Bandits with dependent arms.

Multi-Armed Bandits

Kernel Methods for Causal Functions: Dose, Heterogeneous, and Incremental Response Curves

no code implementations10 Oct 2020 Rahul Singh, Liyuan Xu, Arthur Gretton

We propose estimators based on kernel ridge regression for nonparametric causal functions such as dose, heterogeneous, and incremental response curves.

counterfactual regression

A Partially Observable MDP Approach for Sequential Testing for Infectious Diseases such as COVID-19

no code implementations25 Jul 2020 Rahul Singh, Fang Liu, Ness B. Shroff

In this paper, we show that the testing problem can be cast as a sequential learning-based resource allocation problem with constraints, where the input to the problem is provided by a time-varying social contact graph obtained through various contact tracing tools.

Incremental inference of collective graphical models

no code implementations26 Jun 2020 Rahul Singh, Isabel Haasler, Qinsheng Zhang, Johan Karlsson, Yongxin Chen

We consider incremental inference problems from aggregate data for collective dynamics.

Multi-marginal optimal transport and probabilistic graphical models

3 code implementations25 Jun 2020 Isabel Haasler, Rahul Singh, Qinsheng Zhang, Johan Karlsson, Yongxin Chen

We study multi-marginal optimal transport problems from a probabilistic graphical model perspective.

Bayesian Inference

Mather classes and conormal spaces of Schubert varieties in cominuscule spaces

no code implementations8 Jun 2020 Leonardo C. Mihalcea, Rahul Singh

We prove a type independent formula for the torus equivariant Mather class of a Schubert variety in $G/P$, and for a Schubert variety pulled back via the natural projection $G/Q \to G/P$.

Algebraic Geometry Combinatorics Representation Theory (MSC 2020) Primary: 14C17, 14M15, Secondary 32S60

Contextual Bandits with Side-Observations

no code implementations6 Jun 2020 Rahul Singh, Fang Liu, Xin Liu, Ness Shroff

We show that this asymptotically optimal regret is upper-bounded as $O\left(|\chi(\mathcal{G})|\log T\right)$, where $|\chi(\mathcal{G})|$ is the domination number of $\mathcal{G}$.

Multi-Armed Bandits

Improving Robustness via Risk Averse Distributional Reinforcement Learning

no code implementations L4DC 2020 Rahul Singh, Qinsheng Zhang, Yongxin Chen

One major obstacle that precludes the success of reinforcement learning in real-world applications is the lack of robustness, either to model uncertainties or external disturbances, of the trained policies.

Distributional Reinforcement Learning reinforcement-learning +2

Inference with Aggregate Data: An Optimal Transport Approach

no code implementations31 Mar 2020 Rahul Singh, Isabel Haasler, Qinsheng Zhang, Johan Karlsson, Yongxin Chen

Consequently, the celebrated Sinkhorn/iterative scaling algorithm for multi-marginal optimal transport can be leveraged together with the standard belief propagation algorithm to establish an efficient inference scheme which we call Sinkhorn belief propagation (SBP).

Learning in Networked Control Systems

no code implementations21 Mar 2020 Rahul Singh, P. R. Kumar

We design adaptive controller (learning rule) for a networked control system (NCS) in which data packets containing control information are transmitted across a lossy wireless channel.

Learning in Markov Decision Processes under Constraints

no code implementations27 Feb 2020 Rahul Singh, Abhishek Gupta, Ness B. Shroff

In order to measure the performance of a reinforcement learning algorithm that satisfies the average cost constraints, we define an $M+1$ dimensional regret vector that is composed of its reward regret, and $M$ cost regrets.

reinforcement-learning Reinforcement Learning +1

Sample-based Distributional Policy Gradient

no code implementations8 Jan 2020 Rahul Singh, Keuntaek Lee, Yongxin Chen

It relies on the key idea of replacing the expected return with the return distribution, which captures the intrinsic randomness of the long term rewards.

Distributional Reinforcement Learning OpenAI Gym +3

Double Robustness for Complier Parameters and a Semiparametric Test for Complier Characteristics

1 code implementation10 Sep 2019 Rahul Singh, Liyang Sun

We propose a semiparametric test to evaluate (i) whether different instruments induce subpopulations of compliers with the same observable characteristics on average, and (ii) whether compliers have observable characteristics that are the same as the full population on average.

BIG-bench Machine Learning

Kernel Instrumental Variable Regression

1 code implementation NeurIPS 2019 Rahul Singh, Maneesh Sahani, Arthur Gretton

Instrumental variable (IV) regression is a strategy for learning causal relationships in observational data.

regression

Understand the dynamics of GANs via Primal-Dual Optimization

no code implementations ICLR 2019 Songtao Lu, Rahul Singh, Xiangyi Chen, Yongxin Chen, Mingyi Hong

By developing new primal-dual optimization tools, we show that, with a proper stepsize choice, the widely used first-order iterative algorithm in training GANs would in fact converge to a stationary solution with a sublinear rate.

Generative Adversarial Network Multi-Task Learning

3D Deep Learning with voxelized atomic configurations for modeling atomistic potentials in complex solid-solution alloys

no code implementations23 Nov 2018 Rahul Singh, Aayush Sharma, Onur Rauf Bingol, Aditya Balu, Ganesh Balasubramanian, Duane D. Johnson, Soumik Sarkar

In this paper, we explore a deep convolutional neural-network based approach to develop the atomistic potential for such complex alloys to investigate materials for insights into controlling properties.

Physics-aware Deep Generative Models for Creating Synthetic Microstructures

no code implementations21 Nov 2018 Rahul Singh, Viraj Shah, Balaji Pokuri, Soumik Sarkar, Baskar Ganapathysubramanian, Chinmay Hegde

The first model is a WGAN model that uses a finite number of training images to synthesize new microstructures that weakly satisfy the physical invariances respected by the original data.

Stochastic Optimization

De-Biased Machine Learning of Global and Local Parameters Using Regularized Riesz Representers

no code implementations23 Feb 2018 Victor Chernozhukov, Whitney Newey, Rahul Singh

To achieve this property, we include the Riesz representer for the functional as an additional nuisance parameter.

BIG-bench Machine Learning

Throughput Optimal Decentralized Scheduling of Multi-Hop Networks with End-to-End Deadline Constraints: II Wireless Networks with Interference

no code implementations6 Sep 2017 Rahul Singh, P. R. Kumar, Eytan Modiano

The key difference arises due to the fact that in our set-up the packets loose their utility once their "age" has crossed their deadline, thus making the task of optimizing timely throughput much more challenging than that of ensuring network stability.

Scheduling

Kiefer Wolfowitz Algorithm is Asymptotically Optimal for a Class of Non-Stationary Bandit Problems

no code implementations26 Feb 2017 Rahul Singh, Taposh Banerjee

We consider the problem of designing an allocation rule or an "online learning algorithm" for a class of bandit problems in which the set of control actions available at each time $s$ is a convex, compact subset of $\mathbb{R}^d$.

Cannot find the paper you are looking for? You can Submit a new open access paper.