Search Results for author: Yifei Ma

Found 13 papers, 1 papers with code

Experimental Design for Active Transductive Inference in Large Language Models

no code implementations • 12 Apr 2024 • Subhojyoti Mukherjee, Ge Liu, Aniket Deshmukh, Anusha Lalitha, Yifei Ma, Branislav Kveton

We design the LLM prompt by adaptively choosing few-shot examples for a given inference query.

Experimental Design

Paper
Add Code

Logic-Scaffolding: Personalized Aspect-Instructed Recommendation Explanation Generation using LLMs

no code implementations • 22 Dec 2023 • Behnam Rahdari, Hao Ding, Ziwei Fan, Yifei Ma, Zhuotong Chen, Anoop Deoras, Branislav Kveton

The unique capabilities of Large Language Models (LLMs), such as the natural language text generation ability, position them as strong candidates for providing explanation for recommendations.

Explanation Generation Position +1

Paper
Add Code

Fixed-Budget Best-Arm Identification with Heterogeneous Reward Variances

no code implementations • 13 Jun 2023 • Anusha Lalitha, Kousha Kalantari, Yifei Ma, Anoop Deoras, Branislav Kveton

Our algorithms rely on non-uniform budget allocations among the arms where the arms with higher reward variances are pulled more often than those with lower variances.

Paper
Add Code

Context Uncertainty in Contextual Bandits with Applications to Recommender Systems

no code implementations • 1 Feb 2022 • Hao Wang, Yifei Ma, Hao Ding, Yuyang Wang

Recurrent neural networks have proven effective in modeling sequential user feedbacks for recommender systems.

Multi-Armed Bandits Recommendation Systems +1

Paper
Add Code

Bridging Recommendation and Marketing via Recurrent Intensity Modeling

no code implementations • ICLR 2022 • Yifei Ma, Ge Liu, Anoop Deoras

RIM allows us to rethink recommendation in a Matching (Mtch) scenario, where the benefits of the users (e. g., ItemRec relevance) and item providers (e. g., item-exposure guarantees) are considered at the same time.

Marketing

Paper
Add Code

Language Models as Recommender Systems: Evaluations and Limitations

no code implementations • NeurIPS Workshop ICBINB 2021 • Yuhui Zhang, Hao Ding, Zeren Shui, Yifei Ma, James Zou, Anoop Deoras, Hao Wang

Pre-trained language models (PLMs) such as BERT and GPT learn general text representations and encode extensive world knowledge; thus, they can be efficiently and accurately adapted to various downstream tasks.

Movie Recommendation Session-Based Recommendations +1

Paper
Add Code

Zero-Shot Recommender Systems

no code implementations • 18 May 2021 • Hao Ding, Yifei Ma, Anoop Deoras, Yuyang Wang, Hao Wang

This poses a chicken-and-egg problem for early-stage products, whose amount of data, in turn, relies on the performance of their RS.

Recommendation Systems Zero-Shot Learning

Paper
Add Code

Recurrent Exploration Networks for Recommender Systems

no code implementations • 1 Jan 2021 • Hao Wang, Yifei Ma, Hao Ding, Bernie Wang

Recurrent neural networks have proven effective in modeling sequential user feedbacks for recommender systems.

Recommendation Systems Representation Learning

Paper
Add Code

Towards Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling

no code implementations • NeurIPS 2019 • Tengyang Xie, Yifei Ma, Yu-Xiang Wang

To solve this problem, we consider a marginalized importance sampling (MIS) estimator that recursively estimates the state marginal distribution for the target policy at every step.

Off-policy evaluation reinforcement-learning

Paper
Add Code

Dynamic Mini-batch SGD for Elastic Distributed Training: Learning in the Limbo of Resources

2 code implementations • 26 Apr 2019 • Haibin Lin, Hang Zhang, Yifei Ma, Tong He, Zhi Zhang, Sheng Zha, Mu Li

One difficulty we observe is that the noise in the stochastic momentum estimation is accumulated over time and will have delayed effects when the batch size changes.

Image Classification object-detection +3

Paper
Code

Imitation-Regularized Offline Learning

no code implementations • 15 Jan 2019 • Yifei Ma, Yu-Xiang Wang, Balakrishnan, Narayanaswamy

To solve both problems, we show how one can use policy improvement (PIL) objectives, regularized by policy imitation (IML).

counterfactual Multi-Armed Bandits

Paper
Add Code

Active Search for Sparse Signals with Region Sensing

no code implementations • 2 Dec 2016 • Yifei Ma, Roman Garnett, Jeff Schneider

Autonomous systems can be used to search for sparse signals in a large space; e. g., aerial robots can be deployed to localize threats, detect gas leaks, or respond to distress calls.

Bayesian Optimization Compressive Sensing +1

Paper
Add Code

Σ-Optimality for Active Learning on Gaussian Random Fields

no code implementations • NeurIPS 2013 • Yifei Ma, Roman Garnett, Jeff Schneider

For active learning on GRFs, the commonly used V-optimality criterion queries nodes that reduce the L2 (regression) loss.

Active Learning General Classification

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.