no code implementations • 12 Apr 2024 • Subhojyoti Mukherjee, Ge Liu, Aniket Deshmukh, Anusha Lalitha, Yifei Ma, Branislav Kveton
We design the LLM prompt by adaptively choosing few-shot examples for a given inference query.
no code implementations • 22 Dec 2023 • Behnam Rahdari, Hao Ding, Ziwei Fan, Yifei Ma, Zhuotong Chen, Anoop Deoras, Branislav Kveton
The unique capabilities of Large Language Models (LLMs), such as the natural language text generation ability, position them as strong candidates for providing explanation for recommendations.
no code implementations • 13 Jun 2023 • Anusha Lalitha, Kousha Kalantari, Yifei Ma, Anoop Deoras, Branislav Kveton
Our algorithms rely on non-uniform budget allocations among the arms where the arms with higher reward variances are pulled more often than those with lower variances.
no code implementations • 1 Feb 2022 • Hao Wang, Yifei Ma, Hao Ding, Yuyang Wang
Recurrent neural networks have proven effective in modeling sequential user feedbacks for recommender systems.
no code implementations • ICLR 2022 • Yifei Ma, Ge Liu, Anoop Deoras
RIM allows us to rethink recommendation in a Matching (Mtch) scenario, where the benefits of the users (e. g., ItemRec relevance) and item providers (e. g., item-exposure guarantees) are considered at the same time.
no code implementations • NeurIPS Workshop ICBINB 2021 • Yuhui Zhang, Hao Ding, Zeren Shui, Yifei Ma, James Zou, Anoop Deoras, Hao Wang
Pre-trained language models (PLMs) such as BERT and GPT learn general text representations and encode extensive world knowledge; thus, they can be efficiently and accurately adapted to various downstream tasks.
no code implementations • 18 May 2021 • Hao Ding, Yifei Ma, Anoop Deoras, Yuyang Wang, Hao Wang
This poses a chicken-and-egg problem for early-stage products, whose amount of data, in turn, relies on the performance of their RS.
no code implementations • 1 Jan 2021 • Hao Wang, Yifei Ma, Hao Ding, Bernie Wang
Recurrent neural networks have proven effective in modeling sequential user feedbacks for recommender systems.
no code implementations • NeurIPS 2019 • Tengyang Xie, Yifei Ma, Yu-Xiang Wang
To solve this problem, we consider a marginalized importance sampling (MIS) estimator that recursively estimates the state marginal distribution for the target policy at every step.
2 code implementations • 26 Apr 2019 • Haibin Lin, Hang Zhang, Yifei Ma, Tong He, Zhi Zhang, Sheng Zha, Mu Li
One difficulty we observe is that the noise in the stochastic momentum estimation is accumulated over time and will have delayed effects when the batch size changes.
no code implementations • 15 Jan 2019 • Yifei Ma, Yu-Xiang Wang, Balakrishnan, Narayanaswamy
To solve both problems, we show how one can use policy improvement (PIL) objectives, regularized by policy imitation (IML).
no code implementations • 2 Dec 2016 • Yifei Ma, Roman Garnett, Jeff Schneider
Autonomous systems can be used to search for sparse signals in a large space; e. g., aerial robots can be deployed to localize threats, detect gas leaks, or respond to distress calls.
no code implementations • NeurIPS 2013 • Yifei Ma, Roman Garnett, Jeff Schneider
For active learning on GRFs, the commonly used V-optimality criterion queries nodes that reduce the L2 (regression) loss.