no code implementations • 21 Jan 2025 • Wenyuan Zhao, Yu-Shin Huang, Chao Tian, Alex Sprintson
Unlike the previous L-PIR scheme proposed by Samy et al., which only adjusted the probability allocation to the clean (low-cost) retrieval pattern, we optimize the probabilities assigned to all the retrieval patterns jointly.
1 code implementation • 4 Dec 2024 • Mingzhou Fan, Ruida Zhou, Chao Tian, Xiaoning Qian
We propose a path-guided particle-based sampling~(PGPS) method based on a novel Log-weighted Shrinkage (LwS) density path linking an initial distribution to the target distribution.
no code implementations • 7 Oct 2024 • Ruida Zhou, Chao Tian, Suhas Diggavi
Large language models have demonstrated impressive in-context learning (ICL) capability.
no code implementations • 6 Oct 2024 • Yu-Shin Huang, Peter Just, Krishna Narayanan, Chao Tian
We consider coverless steganography where a Large Language Model (LLM) drives an arithmetic coding decoder to generate stego-texts.
1 code implementation • 7 Aug 2024 • Guoqing Zhu, Honghu Pan, Qiang Wang, Chao Tian, Chao Yang, Zhenyu He
This framework aims to produce meticulously aligned pseudo thermal images at the pixel level, leveraging edge information extracted from visible images.
no code implementations • 1 Jul 2024 • Mohamed Zeid, Subir Majumder, Hasan Ibrahim, Prasad Enjeti, Le Xie, Chao Tian
Foundational Large Language Models (LLMs) such as GPT-3. 5-turbo allow users to refine the model based on newer information, known as ``fine-tuning''.
no code implementations • 14 Mar 2024 • Subir Majumder, Lin Dong, Fatemeh Doudi, Yuting Cai, Chao Tian, Dileep Kalathi, Kevin Ding, Anupam A. Thatte, Na Li, Le Xie
Large Language Models (LLMs) as chatbots have drawn remarkable attention thanks to their versatile capability in natural language processing as well as in a wide range of tasks.
no code implementations • 9 Mar 2024 • Min Cheng, Ruida Zhou, P. R. Kumar, Chao Tian
We prove that both algorithms based on independent policy gradient and independent natural policy gradient converge globally to a Nash equilibrium for the average reward criterion.
no code implementations • 23 Aug 2023 • Chao Tian, Zikun Zhou, Yuqing Huang, Gaojun Li, Zhenyu He
RGB-Thermal (RGB-T) pedestrian detection aims to locate the pedestrians in RGB-T image pairs to exploit the complementation between the two modalities for improving detection robustness in extreme conditions.
1 code implementation • NeurIPS 2023 • Ruida Zhou, Tao Liu, Min Cheng, Dileep Kalathil, P. R. Kumar, Chao Tian
We study robust reinforcement learning (RL) with the goal of determining a well-performing policy that is robust against model mismatch between the training simulator and the testing environment.
no code implementations • 1 May 2023 • Ruida Zhou, Chao Tian, Tie Liu
We provide a new information-theoretic generalization error bound that is exactly tight (i. e., matching even the constant) for the canonical quadratic Gaussian (location) problem.
no code implementations • 13 Mar 2023 • Ali Menati, Yuting Cai, Rayan El Helou, Chao Tian, Le Xie
One of the most significant bottlenecks for the scalable deployment of such computation is its energy demand.
2 code implementations • 10 Jun 2022 • Ruida Zhou, Tao Liu, Dileep Kalathil, P. R. Kumar, Chao Tian
We study policy optimization for Markov decision processes (MDPs) with multiple reward value functions, which are to be jointly optimized according to given criteria such as proportional fairness (smooth concave scalarization), hard constraints (constrained MDP), and max-min trade-off.
no code implementations • 11 Apr 2022 • Ruida Zhou, Chao Tian
We study the effect of reward variance heterogeneity in the approximate top-$m$ arm identification setting.
no code implementations • 11 Apr 2022 • Wenjing Chen, Ruida Zhou, Chao Tian, Cong Shen
In the special case of $m=2$, i. e., pairwise comparison, the resultant bound is tighter than that given by Shah et al., leading to a reduced gap between the error probability upper and lower bounds.
no code implementations • 31 Oct 2021 • Tao Liu, Ruida Zhou, Dileep Kalathil, P. R. Kumar, Chao Tian
We propose a new algorithm called policy mirror descent-primal dual (PMD-PD) algorithm that can provably achieve a faster $\mathcal{O}(\log(T)/T)$ convergence rate for both the optimality gap and the constraint violation.
no code implementations • 10 Sep 2021 • Kai Zhang, Chao Tian, Kun Zhang, Todd Johnson, Xiaoqian Jiang
The PC algorithm is the state-of-the-art algorithm for causal structure discovery on observational data.
no code implementations • 30 Jul 2021 • Sennur Ulukus, Salman Avestimehr, Michael Gastpar, Syed Jafar, Ravi Tandon, Chao Tian
Most of our lives are conducted in the cyberspace.
no code implementations • NeurIPS 2021 • Tao Liu, Ruida Zhou, Dileep Kalathil, P. R. Kumar, Chao Tian
We show that when a strictly safe policy is known, then one can confine the system to zero constraint violation with arbitrarily high probability while keeping the reward regret of order $\tilde{\mathcal{O}}(\sqrt{K})$.
no code implementations • 17 Dec 2020 • Ruida Zhou, Chao Tian, Tie Liu
We propose a new information-theoretic bound on generalization error based on a combination of the error decomposition technique of Bu et al. and the conditional mutual information (CMI) construction of Steinke and Zakynthinou.
no code implementations • COLING 2020 • Chao Tian, Yifei Wang, Hao Cheng, Yijiang Lian, Zhihua Zhang
In this paper we propose a unified approach for supporting different generation manners of machine translation, including autoregressive, semi-autoregressive, and refinement-based non-autoregressive models.
no code implementations • 19 Nov 2019 • Chao Tian, Cong Li, Jianping Shi
Recently, FCNs based methods have made great progress in semantic segmentation.
no code implementations • 25 Sep 2019 • Tao Guo, Ruida Zhou, Chao Tian
We further characterize the optimal tradeoff between the minimum amount of common randomness and the total leakage.