no code implementations • 1 May 2023 • Ruida Zhou, Chao Tian, Tie Liu
We further show that although the conditional bounding and the reference distribution can make the bound exactly tight, removing them does not significantly degrade the bound, which leads to a mutual-information-based bound that is also asymptotically tight in this setting.
no code implementations • 13 Mar 2023 • Ali Menati, Yuting Cai, Rayan El Helou, Chao Tian, Le Xie
One of the most significant bottlenecks for the scalable deployment of such computation is its energy demand.
1 code implementation • 10 Jun 2022 • Ruida Zhou, Tao Liu, Dileep Kalathil, P. R. Kumar, Chao Tian
We study policy optimization for Markov decision processes (MDPs) with multiple reward value functions, which are to be jointly optimized according to given criteria such as proportional fairness (smooth concave scalarization), hard constraints (constrained MDP), and max-min trade-off.
no code implementations • 11 Apr 2022 • Wenjing Chen, Ruida Zhou, Chao Tian, Cong Shen
In the special case of $m=2$, i. e., pairwise comparison, the resultant bound is tighter than that given by Shah et al., leading to a reduced gap between the error probability upper and lower bounds.
no code implementations • 11 Apr 2022 • Ruida Zhou, Chao Tian
We study the effect of reward variance heterogeneity in the approximate top-$m$ arm identification setting.
no code implementations • 31 Oct 2021 • Tao Liu, Ruida Zhou, Dileep Kalathil, P. R. Kumar, Chao Tian
We propose a new algorithm called policy mirror descent-primal dual (PMD-PD) algorithm that can provably achieve a faster $\mathcal{O}(\log(T)/T)$ convergence rate for both the optimality gap and the constraint violation.
no code implementations • 10 Sep 2021 • Kai Zhang, Chao Tian, Kun Zhang, Todd Johnson, Xiaoqian Jiang
The PC algorithm is the state-of-the-art algorithm for causal structure discovery on observational data.
no code implementations • 30 Jul 2021 • Sennur Ulukus, Salman Avestimehr, Michael Gastpar, Syed Jafar, Ravi Tandon, Chao Tian
Most of our lives are conducted in the cyberspace.
no code implementations • NeurIPS 2021 • Tao Liu, Ruida Zhou, Dileep Kalathil, P. R. Kumar, Chao Tian
We show that when a strictly safe policy is known, then one can confine the system to zero constraint violation with arbitrarily high probability while keeping the reward regret of order $\tilde{\mathcal{O}}(\sqrt{K})$.
no code implementations • 17 Dec 2020 • Ruida Zhou, Chao Tian, Tie Liu
We propose a new information-theoretic bound on generalization error based on a combination of the error decomposition technique of Bu et al. and the conditional mutual information (CMI) construction of Steinke and Zakynthinou.
no code implementations • COLING 2020 • Chao Tian, Yifei Wang, Hao Cheng, Yijiang Lian, Zhihua Zhang
In this paper we propose a unified approach for supporting different generation manners of machine translation, including autoregressive, semi-autoregressive, and refinement-based non-autoregressive models.
no code implementations • 19 Nov 2019 • Chao Tian, Cong Li, Jianping Shi
Recently, FCNs based methods have made great progress in semantic segmentation.
no code implementations • 25 Sep 2019 • Tao Guo, Ruida Zhou, Chao Tian
We further characterize the optimal tradeoff between the minimum amount of common randomness and the total leakage.