no code implementations • 27 Dec 2024 • Seong Jin Lee, Will Wei Sun, Yufeng Liu
Reinforcement learning from human feedback (RLHF) has become a cornerstone for aligning large language models with human preferences.
no code implementations • 2 May 2024 • Weibin Mo, Weijing Tang, Songkai Xue, Yufeng Liu, Ji Zhu
Given the observed groups of data, we develop a min-max-regret (MMR) learning framework for general supervised learning, which targets to minimize the worst-group regret.
no code implementations • 19 Apr 2024 • Seong Jin Lee, Will Wei Sun, Yufeng Liu
In this paper, we consider the dynamic assortment problem with dual contexts -- user and item features.
1 code implementation • 13 Dec 2023 • Yufeng Liu
This framework is constructed to learn a compact and fast nighttime tracker via knowledge transferring from the teacher and knowledge sharing among various students.
no code implementations • 1 Sep 2023 • Dhruv Patel, Hui Shen, Shankar Bhamidi, Yufeng Liu, Vladas Pipiras
In canonical settings with ground truth clusters, we derive bounds for algorithms such as $k$-means$++$ to find good initializations and thus leading to the correctness of clustering via the main result.
no code implementations • 4 Jul 2023 • Jiao Zhang, Hongtao Zhang, Xuelei Chen, Fengquan Wu, Yufeng Liu, Wenmei Zhang
Peak sidelobe level reduction (PSLR) is crucial in the application of large-scale array antenna, which directly determines the radiation performance of array antenna.
1 code implementation • 29 Jan 2023 • Daiqi Gao, Yufeng Liu, Donglin Zeng
Dynamic treatment regimes or policies are a sequence of decision functions over multiple stages that are tailored to individual features.
1 code implementation • CVPR 2022 • Xingyu Chen, Yufeng Liu, Yajiao Dong, Xiong Zhang, Chongyang Ma, Yanmin Xiong, Yuan Zhang, Xiaoyan Guo
In this work, we propose a framework for single-view hand mesh reconstruction, which can simultaneously achieve high reconstruction accuracy, fast inference speed, and temporal coherence.
Ranked #3 on 3D Hand Pose Estimation on FreiHAND
no code implementations • 17 Oct 2021 • Weibin Mo, Zhengling Qi, Yufeng Liu
However, when the growth of testing sample size available for training is in a slower order, efficient value function estimates may not perform well anymore.
1 code implementation • 6 Sep 2021 • Weibin Mo, Yufeng Liu
Other than potential misspecified nuisance models, most existing methods do not account for the potential problem when the variance of outcome is heterogeneous among covariates and treatment.
no code implementations • 8 May 2021 • Yumeng Zhang, Li Chen, Yufeng Liu, Xiaoyan Guo, Wen Zheng, Junhai Yong
Deep learning methods have achieved excellent performance in pose estimation, but the lack of robustness causes the keypoints to change drastically between similar images.
2 code implementations • CVPR 2021 • Junguang Jiang, Yifei Ji, Ximei Wang, Yufeng Liu, Jianmin Wang, Mingsheng Long
First, based on our observation that the probability density of the output space is sparse, we introduce a spatial probability distribution to describe this sparsity and then use it to guide the learning of the adversarial regressor.
1 code implementation • CVPR 2021 • Xingyu Chen, Yufeng Liu, Chongyang Ma, Jianlong Chang, Huayan Wang, Tian Chen, Xiaoyan Guo, Pengfei Wan, Wen Zheng
In the root-relative mesh recovery task, we exploit semantic relations among joints to generate a 3D mesh from the extracted 2D cues.
Ranked #20 on 3D Hand Pose Estimation on FreiHAND
no code implementations • 26 Jun 2020 • Weibin Mo, Zhengling Qi, Yufeng Liu
We propose a novel distributionally robust ITR (DR-ITR) framework that maximizes the worst-case value function across the values under a set of underlying distributions that are "close" to the training distribution.
no code implementations • 13 Dec 2019 • Naim U. Rashid, Daniel J. Luckett, Jingxiang Chen, Michael T. Lawson, Longshaokan Wang, Yunshu Zhang, Eric B. Laber, Yufeng Liu, Jen Jen Yeh, Donglin Zeng, Michael R. Kosorok
PDX data are characterized by correlated outcomes, a high-dimensional feature space, and a large number of treatments.
no code implementations • 6 Oct 2019 • Zhengling Qi, Ying Cui, Yufeng Liu, Jong-Shi Pang
This paper has two main goals: (a) establish several statistical properties---consistency, asymptotic distributions, and convergence rates---of stationary solutions and values of a class of coupled nonconvex and nonsmoothempirical risk minimization problems, and (b) validate these properties by a noisy amplitude-based phase retrieval problem, the latter being of much topical interest. Derived from available data via sampling, these empirical risk minimization problems are the computational workhorse of a population risk model which involves the minimization of an expected value of a random functional.
no code implementations • 11 Sep 2019 • Yumeng Zhang, Li Chen, Yufeng Liu, Junhai Yong, Wen Zheng
During training, based on the relation between these common characteristics and 3D pose learned from fully-annotated synthetic datasets, it is beneficial for the network to restore the 3D pose of weakly labeled real-world datasets with the aid of 2D annotations and depth images.
no code implementations • 27 Aug 2019 • Zhengling Qi, Ying Cui, Yufeng Liu, Jong-Shi Pang
Recent exploration of optimal individualized decision rules (IDRs) for patients in precision medicine has attracted a lot of attention due to the heterogeneous responses of patients to different treatments.
no code implementations • 20 Jan 2017 • Will Wei Sun, Guang Cheng, Yufeng Liu
Stability is an important aspect of a classification procedure because unstable predictions can potentially reduce users' trust in a classification system and also harm the reproducibility of scientific conclusions.
no code implementations • 28 Nov 2016 • Botao Hao, Will Wei Sun, Yufeng Liu, Guang Cheng
We consider joint estimation of multiple graphical models arising from heterogeneous and high-dimensional observations.
no code implementations • 30 Aug 2016 • Yuying Xie, Yufeng Liu, William Valdar
In this paper, we propose a novel estimator for data arising from a group of Gaussian graphical models that are themselves dependent.
no code implementations • 19 Nov 2014 • Patrick K. Kimes, D. Neil Hayes, J. S. Marron, Yufeng Liu
Binary classification is a common statistical learning problem in which a model is estimated on a set of covariates for some outcome indicating the membership of one of two classes.