no code implementations • EMNLP (sdp) 2020 • Anjana Umapathy, Karthik Radhakrishnan, Kinjal Jain, Rahul Singh
In academic publications, citations are used to build context for a concept by highlighting relevant aspects from reference papers.
no code implementations • WAT 2022 • Sahinur Rahman Laskar, Rahul Singh, Md Faizal Karim, Riyanka Manna, Partha Pakray, Sivaji Bandyopadhyay
Machine translation translates one natural language to another, a well-defined natural language processing task.
no code implementations • 17 Nov 2024 • Ashesh Rambachan, Rahul Singh, Davide Viviano
While traditional program evaluations typically rely on surveys to measure outcomes, certain economic outcomes such as living standards or environmental quality may be infeasible or costly to collect.
no code implementations • 25 Oct 2024 • Avik Kar, Rahul Singh
We note that the existing notion of zooming dimension for average reward RL is defined in terms of policy coverings, and hence it can be huge when the policy class is rich even though the underlying MDP is simple, so that the regret upper bound is nearly $O(T)$.
no code implementations • 27 Sep 2024 • Arman Afrasiyabi, Dhananjay Bhaskar, Erica L. Busch, Laurent Caplette, Rahul Singh, Guillaume Lajoie, Nicholas B. Turk-Browne, Smita Krishnaswamy
Neuroscience employs diverse neuroimaging techniques, each offering distinct insights into brain activity, from electrophysiological recordings such as EEG, which have high temporal resolution, to hemodynamic modalities such as fMRI, which have increased spatial precision.
no code implementations • 27 Sep 2024 • Arman Afrasiyabi, Erica Busch, Rahul Singh, Dhananjay Bhaskar, Laurent Caplette, Nicholas Turk-Browne, Smita Krishnaswamy
In this work, we explore the decoding of mental imagery from subjects using their fMRI measurements.
no code implementations • 1 Aug 2024 • Arkadeep Baksi, Rahul Singh, Tarun Joshi
The advent of transformer-based architectures and large language models (LLMs) have significantly advanced the performance of natural language processing (NLP) models.
no code implementations • 31 Jul 2024 • Ying Li, Rahul Singh, Tarun Joshi, Agus Sudjianto
Recent work in behavioral testing for natural language processing (NLP) models, such as Checklist, is inspired by related paradigms in software engineering testing.
no code implementations • 29 May 2024 • Avik Kar, Rahul Singh
We study infinite-horizon average-reward reinforcement learning (RL) for Lipschitz MDPs and develop an algorithm PZRL that discretizes the state-action space adaptively and zooms in to promising regions of the "policy space" which seems to yield high average rewards.
no code implementations • 31 Dec 2023 • A Ch Madhusudanarao, Rahul Singh
When the DM has access to a set of distributions $\mathcal{U}$ such that $P$ is contained in $\mathcal{U}$, then the problem is known as the ambiguous chance-constrained problem \cite{erdougan2006ambiguous}.
no code implementations • 24 Sep 2023 • Rahul Singh, Pranav Kanuparthi
Machine Learning models are being utilized extensively to drive recommender systems, which is a widely explored topic today.
no code implementations • 26 May 2023 • Rahul Singh, Akshay Mete, Avik Kar, P. R. Kumar
Minimum variance controllers have been employed in a wide-range of industrial applications.
no code implementations • 13 Feb 2023 • Rahul Singh, Suhas Vijaykumar
We provide uniform inference and confidence bands for kernel ridge regression (KRR), a widely-used non-parametric regression estimator for general data types including rankings, images, and graphs.
no code implementations • 15 Aug 2022 • Rahul Singh, Yongxin Chen
Graph convolutional networks (GCNs) and its variants are designed for unsigned graphs containing only positive links.
no code implementations • 24 Jun 2022 • Soham Raste, Rahul Singh, Joel Vaughan, Vijayan N. Nair
Among the different algorithms, randomness in model training causes larger variation for FFNNs compared to tree-based methods.
no code implementations • 26 May 2022 • Omkar Patil, Rahul Singh, Tarun Joshi
This is not only because of the limitations in text generation capabilities but also due that to the lack of a proper definition of what qualifies as a paraphrase and corresponding metrics to measure how good it is.
no code implementations • 25 Mar 2022 • Victor Chernozhukov, Whitney Newey, Rahul Singh, Vasilis Syrgkanis
We extend the idea of automated debiased machine learning to the dynamic treatment regime and more generally to nested functionals.
no code implementations • 26 Feb 2022 • Guojun Xiong, Shufan Wang, Jian Li, Rahul Singh
Using this structural result, we establish the indexability of our problem, and employ the Whittle index policy to minimize average latency.
no code implementations • 25 Jan 2022 • Akshay Mete, Rahul Singh, P. R. Kumar
We consider the problem of controlling an unknown stochastic linear system with quadratic costs - called the adaptive LQ control problem.
no code implementations • 13 Jan 2022 • Rahul Singh
For long term dose responses, I prove uniform consistency with finite sample rates.
no code implementations • 28 Dec 2021 • Isaac Meza, Rahul Singh
We analyze adversarial estimators of nested NPIV, and provide sufficient conditions for efficient inference on the causal parameter.
no code implementations • 9 Nov 2021 • Rahul Singh
I propose kernel ridge regression estimators for nonparametric dose response curves and semiparametric treatment effects in the setting where an analyst has access to a selected sample rather than a random sample; only for select observations, the outcome is observed.
no code implementations • 6 Nov 2021 • Rahul Singh, Liyuan Xu, Arthur Gretton
We propose simple nonparametric estimators for mediated and time-varying dose response curves based on kernel ridge regression.
no code implementations • 28 Oct 2021 • Archit Parnami, Rahul Singh, Tarun Joshi
Our results indicate that the method could eliminate as much as 40% of the attention heads in the BERT transformer model with no loss in accuracy.
no code implementations • 20 Sep 2021 • Guojun Xiong, Jian Li, Rahul Singh
We call it the R(MA)^2B-UCB algorithm.
no code implementations • 24 Jul 2021 • Rahul Singh, Yongxin Chen
We consider inference problems for a class of continuous state collective hidden Markov models, where the data is recorded in aggregate (collective) form generated by a large population of individuals following the same dynamics.
no code implementations • 6 Jul 2021 • Anish Agarwal, Rahul Singh
We propose a procedure for data cleaning, estimation, and inference with data cleaning-adjusted confidence intervals.
no code implementations • 31 May 2021 • Victor Chernozhukov, Whitney K. Newey, Rahul Singh
Debiased machine learning is a meta algorithm based on bias correction and sample splitting to calculate confidence intervals for functionals, i. e. scalar summaries, of machine learning algorithms.
no code implementations • 18 May 2021 • Wei Zhao, Rahul Singh, Tarun Joshi, Agus Sudjianto, Vijayan N. Nair
We also study the impact of the complexity of the convolutional layers and the classification layers on the model performance.
no code implementations • 20 Apr 2021 • Rahul Singh, Karan Jindal, Yufei Yu, Hanyu Yang, Tarun Joshi, Matthew A. Campbell, Wayne B. Shoumaker
This paper proposes a strategy to assess the robustness of different machine learning models that involve natural language processing (NLP).
no code implementations • 22 Feb 2021 • Rahul Singh
KRRR is an exact generalization of kernel ridge regression and kernel ridge balancing weights.
no code implementations • 11 Feb 2021 • Rahul Singh, Neeraj Misra
The proposed tests provide alternative to similar tests based on simple spacings (i. e., $m=1$), that were proposed earlier in the literature.
Statistics Theory Statistics Theory 62F03, 62F05
no code implementations • 11 Feb 2021 • Guojun Xiong, Gang Yan, Rahul Singh, Jian Li
In this paradigm, each worker maintains a local estimate of the optimal parameter vector, and iteratively updates it by waiting and averaging all estimates obtained from its neighbors, and then corrects it on the basis of its local dataset.
no code implementations • 4 Feb 2021 • Ahmed M. Bedewy, Yin Sun, Rahul Singh, Ness B. Shroff
We devise a low-complexity solution to solve this problem and prove that, for practical sensing times that are short, the solution is within a small gap from the optimum AoI performance.
Information Theory Information Theory
no code implementations • 10 Jan 2021 • Guojun Xiong, Rahul Singh, Jian Li
We pose the problem as a Markov decision process (MDP) in which the system state is given by describing, for each service, the number of customers that are currently waiting at the edge to obtain the service.
no code implementations • 30 Dec 2020 • Victor Chernozhukov, Whitney Newey, Rahul Singh, Vasilis Syrgkanis
Many causal parameters are linear functionals of an underlying regression.
no code implementations • 18 Dec 2020 • Rahul Singh
Negative control is a strategy for learning the causal relationship between treatment and outcome in the presence of unmeasured confounding.
no code implementations • 23 Nov 2020 • Rahul Singh, Qinsheng Zhang, Yongxin Chen
This problem arises when only the population level counts of the number of individuals at each time step are available, from which one seeks to learn the individual hidden Markov model.
no code implementations • 16 Nov 2020 • Akshay Mete, Rahul Singh, Xi Liu, P. R. Kumar
The Reward-Biased Maximum Likelihood Estimate (RBMLE) for adaptive control of Markov chains was proposed to overcome the central obstacle of what is variously called the fundamental "closed-identifiability problem" of adaptive control, the "dual control problem", or, contemporaneously, the "exploration vs. exploitation problem".
1 code implementation • 8 Nov 2020 • Agus Sudjianto, William Knauth, Rahul Singh, Zebin Yang, Aijun Zhang
We propose the local linear profile plot and other visualization methods for interpretation and diagnostics, and an effective merging strategy for network simplification.
no code implementations • 4 Nov 2020 • Qinsheng Zhang, Rahul Singh, Yongxin Chen
We consider a class of filtering problems for large populations where each individual is modeled by the same hidden Markov model (HMM).
no code implementations • 13 Oct 2020 • Rahul Singh, Fang Liu, Yin Sun, Ness Shroff
We study a variant of the classical multi-armed bandit problem (MABP) which we call as Multi-Armed Bandits with dependent arms.
no code implementations • 10 Oct 2020 • Rahul Singh, Liyuan Xu, Arthur Gretton
We propose estimators based on kernel ridge regression for nonparametric causal functions such as dose, heterogeneous, and incremental response curves.
no code implementations • 12 Aug 2020 • Rahul Singh, Tarun Joshi, Vijayan N. Nair, Agus Sudjianto
We propose algorithms to create adversarial attacks to assess model robustness in text classification problems.
no code implementations • 25 Jul 2020 • Rahul Singh, Fang Liu, Ness B. Shroff
In this paper, we show that the testing problem can be cast as a sequential learning-based resource allocation problem with constraints, where the input to the problem is provided by a time-varying social contact graph obtained through various contact tracing tools.
no code implementations • 26 Jun 2020 • Rahul Singh, Isabel Haasler, Qinsheng Zhang, Johan Karlsson, Yongxin Chen
We consider incremental inference problems from aggregate data for collective dynamics.
3 code implementations • 25 Jun 2020 • Isabel Haasler, Rahul Singh, Qinsheng Zhang, Johan Karlsson, Yongxin Chen
We study multi-marginal optimal transport problems from a probabilistic graphical model perspective.
no code implementations • 8 Jun 2020 • Leonardo C. Mihalcea, Rahul Singh
We prove a type independent formula for the torus equivariant Mather class of a Schubert variety in $G/P$, and for a Schubert variety pulled back via the natural projection $G/Q \to G/P$.
Algebraic Geometry Combinatorics Representation Theory (MSC 2020) Primary: 14C17, 14M15, Secondary 32S60
no code implementations • 6 Jun 2020 • Rahul Singh, Fang Liu, Xin Liu, Ness Shroff
We show that this asymptotically optimal regret is upper-bounded as $O\left(|\chi(\mathcal{G})|\log T\right)$, where $|\chi(\mathcal{G})|$ is the domination number of $\mathcal{G}$.
no code implementations • L4DC 2020 • Rahul Singh, Qinsheng Zhang, Yongxin Chen
One major obstacle that precludes the success of reinforcement learning in real-world applications is the lack of robustness, either to model uncertainties or external disturbances, of the trained policies.
Distributional Reinforcement Learning reinforcement-learning +2
no code implementations • 31 Mar 2020 • Rahul Singh, Isabel Haasler, Qinsheng Zhang, Johan Karlsson, Yongxin Chen
Consequently, the celebrated Sinkhorn/iterative scaling algorithm for multi-marginal optimal transport can be leveraged together with the standard belief propagation algorithm to establish an efficient inference scheme which we call Sinkhorn belief propagation (SBP).
no code implementations • 21 Mar 2020 • Rahul Singh, P. R. Kumar
We design adaptive controller (learning rule) for a networked control system (NCS) in which data packets containing control information are transmitted across a lossy wireless channel.
no code implementations • 27 Feb 2020 • Rahul Singh, Abhishek Gupta, Ness B. Shroff
In order to measure the performance of a reinforcement learning algorithm that satisfies the average cost constraints, we define an $M+1$ dimensional regret vector that is composed of its reward regret, and $M$ cost regrets.
no code implementations • 8 Jan 2020 • Rahul Singh, Keuntaek Lee, Yongxin Chen
It relies on the key idea of replacing the expected return with the return distribution, which captures the intrinsic randomness of the long term rewards.
1 code implementation • 10 Sep 2019 • Rahul Singh, Liyang Sun
We propose a semiparametric test to evaluate (i) whether different instruments induce subpopulations of compliers with the same observable characteristics on average, and (ii) whether compliers have observable characteristics that are the same as the full population on average.
1 code implementation • NeurIPS 2019 • Rahul Singh, Maneesh Sahani, Arthur Gretton
Instrumental variable (IV) regression is a strategy for learning causal relationships in observational data.
no code implementations • ICLR 2019 • Songtao Lu, Rahul Singh, Xiangyi Chen, Yongxin Chen, Mingyi Hong
By developing new primal-dual optimization tools, we show that, with a proper stepsize choice, the widely used first-order iterative algorithm in training GANs would in fact converge to a stationary solution with a sublinear rate.
no code implementations • 23 Nov 2018 • Rahul Singh, Aayush Sharma, Onur Rauf Bingol, Aditya Balu, Ganesh Balasubramanian, Duane D. Johnson, Soumik Sarkar
In this paper, we explore a deep convolutional neural-network based approach to develop the atomistic potential for such complex alloys to investigate materials for insights into controlling properties.
no code implementations • 21 Nov 2018 • Rahul Singh, Viraj Shah, Balaji Pokuri, Soumik Sarkar, Baskar Ganapathysubramanian, Chinmay Hegde
The first model is a WGAN model that uses a finite number of training images to synthesize new microstructures that weakly satisfy the physical invariances respected by the original data.
no code implementations • 23 Feb 2018 • Victor Chernozhukov, Whitney Newey, Rahul Singh
To achieve this property, we include the Riesz representer for the functional as an additional nuisance parameter.
no code implementations • 6 Sep 2017 • Rahul Singh, P. R. Kumar, Eytan Modiano
The key difference arises due to the fact that in our set-up the packets loose their utility once their "age" has crossed their deadline, thus making the task of optimizing timely throughput much more challenging than that of ensuring network stability.
no code implementations • 26 Feb 2017 • Rahul Singh, Taposh Banerjee
We consider the problem of designing an allocation rule or an "online learning algorithm" for a class of bandit problems in which the set of control actions available at each time $s$ is a convex, compact subset of $\mathbb{R}^d$.