no code implementations • 21 Nov 2024 • Shenglai Zeng, Jiankun Zhang, Bingheng Li, Yuping Lin, Tianqi Zheng, Dante Everaert, Hanqing Lu, Hui Liu, Yue Xing, Monica Xiao Cheng, Jiliang Tang
We conduct a comprehensive analysis of LLM representation behaviors and demonstrate the significance of using representations in knowledge checking.
no code implementations • 22 Oct 2024 • Dante Everaert, Rohit Patki, Tianqi Zheng, Christopher Potts
Query Autocomplete (QAC) is a critical feature in modern search engines, facilitating user interaction by predicting search queries based on input prefixes.
no code implementations • 14 Mar 2024 • Tianqi Zheng, Nicolas Loizou, Pengcheng You, Enrique Mallada
Gradient Descent Ascent (GDA) methods for min-max optimization problems typically produce oscillatory behavior that can lead to instability, e. g., in bilinear settings.
no code implementations • 19 Oct 2023 • Tianqi Zheng, John W. Simpson-Porco, Enrique Mallada
The standard approach to combine these methodologies comprises an offline/open-loop stage, planning, that designs a feasible and safe trajectory to follow, and an online/closed-loop stage, tracking, that corrects for unmodeled dynamics and disturbances.
no code implementations • 3 Dec 2022 • Tianqi Zheng, Pengcheng You, Enrique Mallada
In constrained reinforcement learning (C-RL), an agent seeks to learn from the environment a policy that maximizes the expected cumulative reward while satisfying minimum requirements in secondary cumulative reward constraints.