Search Results for author: Lichan Hong

Found 31 papers, 8 papers with code

Aligning Large Language Models with Recommendation Knowledge

no code implementations • 30 Mar 2024 • Yuwei Cao, Nikhil Mehta, Xinyang Yi, Raghunandan Keshavan, Lukasz Heldt, Lichan Hong, Ed H. Chi, Maheswaran Sathiamoorthy

Operations such as Masked Item Modeling (MIM) and Bayesian Personalized Ranking (BPR) have found success in conventional recommender systems.

Attribute Recommendation Systems +1

Paper
Add Code

Wisdom of Committee: Distilling from Foundation Model to Specialized Application Model

no code implementations • 21 Feb 2024 • Zichang Liu, Qingyun Liu, Yuening Li, Liang Liu, Anshumali Shrivastava, Shuchao Bi, Lichan Hong, Ed H. Chi, Zhe Zhao

Further, to accommodate the dissimilarity among the teachers in the committee, we introduce DiverseDistill, which allows the student to understand the expertise of each teacher and extract task knowledge.

Knowledge Distillation Transfer Learning

Paper
Add Code

How to Train Data-Efficient LLMs

no code implementations • 15 Feb 2024 • Noveen Sachdeva, Benjamin Coleman, Wang-Cheng Kang, Jianmo Ni, Lichan Hong, Ed H. Chi, James Caverlee, Julian McAuley, Derek Zhiyuan Cheng

The training of large language models (LLMs) is expensive.

Paper
Add Code

LEVI: Generalizable Fine-tuning via Layer-wise Ensemble of Different Views

no code implementations • 7 Feb 2024 • Yuji Roh, Qingyun Liu, Huan Gui, Zhe Yuan, Yujin Tang, Steven Euijong Whang, Liang Liu, Shuchao Bi, Lichan Hong, Ed H. Chi, Zhe Zhao

By combining two complementing models, LEVI effectively suppresses problematic features in both the fine-tuning data and pre-trained model and preserves useful features for new tasks.

Paper
Add Code

Hiformer: Heterogeneous Feature Interactions Learning with Transformers for Recommender Systems

no code implementations • 10 Nov 2023 • Huan Gui, Ruoxi Wang, Ke Yin, Long Jin, Maciej Kula, Taibai Xu, Lichan Hong, Ed H. Chi

We identify two key challenges for applying the vanilla Transformer architecture to web-scale recommender systems: (1) Transformer architecture fails to capture the heterogeneous feature interactions in the self-attention layer; (2) The serving latency of Transformer architecture might be too high to be deployed in web-scale recommender systems.

Recommendation Systems

Paper
Add Code

Talking Models: Distill Pre-trained Knowledge to Downstream Models via Interactive Communication

no code implementations • 4 Oct 2023 • Zhe Zhao, Qingyun Liu, Huan Gui, Bang An, Lichan Hong, Ed H. Chi

In this paper, we extend KD with an interactive communication process to help students of downstream tasks learn effectively from pre-trained foundation models.

Knowledge Distillation Transfer Learning

Paper
Add Code

Density Weighting for Multi-Interest Personalized Recommendation

no code implementations • 3 Aug 2023 • Nikhil Mehta, Anima Singh, Xinyang Yi, Sagar Jain, Lichan Hong, Ed H. Chi

When the data distribution is highly skewed, the gains observed by learning multiple representations diminish since the model dominates on head items/interests, leading to poor performance on tail items.

Recommendation Systems

Paper
Add Code

Online Matching: A Real-time Bandit System for Large-scale Recommendations

no code implementations • 29 Jul 2023 • Xinyang Yi, Shao-Chuan Wang, Ruining He, Hariharan Chandrasekaran, Charles Wu, Lukasz Heldt, Lichan Hong, Minmin Chen, Ed H. Chi

In this paper, we introduce Online Matching: a scalable closed-loop bandit system learning from users' direct feedback on items in real time.

Multi-Armed Bandits Recommendation Systems

Paper
Add Code

Better Generalization with Semantic IDs: A case study in Ranking for Recommendations

no code implementations • 13 Jun 2023 • Anima Singh, Trung Vu, Raghunandan Keshavan, Nikhil Mehta, Xinyang Yi, Lichan Hong, Lukasz Heldt, Li Wei, Ed Chi, Maheswaran Sathiamoorthy

We showcase how we use them as a replacement of item IDs in a resource-constrained ranking model used in an industrial-scale video sharing platform.

Recommendation Systems

Paper
Add Code

HyperFormer: Learning Expressive Sparse Feature Representations via Hypergraph Transformer

no code implementations • 27 May 2023 • Kaize Ding, Albert Jiongqian Liang, Bryan Perrozi, Ting Chen, Ruoxi Wang, Lichan Hong, Ed H. Chi, Huan Liu, Derek Zhiyuan Cheng

Learning expressive representations for high-dimensional yet sparse features has been a longstanding problem in information retrieval.

Graph Learning Information Retrieval +2

Paper
Add Code

Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems

no code implementations • NeurIPS 2023 • Benjamin Coleman, Wang-Cheng Kang, Matthew Fahrbach, Ruoxi Wang, Lichan Hong, Ed H. Chi, Derek Zhiyuan Cheng

Learning high-quality feature embeddings efficiently and effectively is critical for the performance of web-scale machine learning systems.

Recommendation Systems

Paper
Add Code

Do LLMs Understand User Preferences? Evaluating LLMs On User Rating Prediction

no code implementations • 10 May 2023 • Wang-Cheng Kang, Jianmo Ni, Nikhil Mehta, Maheswaran Sathiamoorthy, Lichan Hong, Ed Chi, Derek Zhiyuan Cheng

In this paper, we conduct a thorough examination of both CF and LLMs within the classic task of user rating prediction, which involves predicting a user's rating for a candidate item based on their past ratings.

Collaborative Filtering World Knowledge

Paper
Add Code

Improving Training Stability for Multitask Ranking Models in Recommender Systems

2 code implementations • 17 Feb 2023 • Jiaxi Tang, Yoel Drori, Daryl Chang, Maheswaran Sathiamoorthy, Justin Gilmer, Li Wei, Xinyang Yi, Lichan Hong, Ed H. Chi

Recommender systems play an important role in many content platforms.

Recommendation Systems

1,753

Paper
Code

Empowering Long-tail Item Recommendation through Cross Decoupling Network (CDN)

no code implementations • 25 Oct 2022 • Yin Zhang, Ruoxi Wang, Tiansheng Yao, Xinyang Yi, Lichan Hong, James Caverlee, Ed H. Chi, Derek Zhiyuan Cheng

In this work, we aim to improve tail item recommendations while maintaining the overall performance with less training and serving cost.

Memorization Recommendation Systems +1

Paper
Add Code

Improving Multi-Task Generalization via Regularizing Spurious Correlation

no code implementations • 19 May 2022 • Ziniu Hu, Zhe Zhao, Xinyang Yi, Tiansheng Yao, Lichan Hong, Yizhou Sun, Ed H. Chi

First, the risk of having non-causal knowledge is higher, as the shared MTL model needs to encode all knowledge from different tasks, and causal knowledge for one task could be potentially spurious to the other.

Multi-Task Learning Representation Learning

Paper
Add Code

DSelect-k: Differentiable Selection in the Mixture of Experts with Applications to Multi-Task Learning

3 code implementations • NeurIPS 2021 • Hussein Hazimeh, Zhe Zhao, Aakanksha Chowdhery, Maheswaran Sathiamoorthy, Yihua Chen, Rahul Mazumder, Lichan Hong, Ed H. Chi

State-of-the-art MoE models use a trainable sparse gate to select a subset of the experts for each input example.

Multi-Task Learning Recommendation Systems

32,867

Paper
Code

A Model of Two Tales: Dual Transfer Learning Framework for Improved Long-tail Item Recommendation

no code implementations • 29 Oct 2020 • Yin Zhang, Derek Zhiyuan Cheng, Tiansheng Yao, Xinyang Yi, Lichan Hong, Ed H. Chi

It is also very encouraging that our framework further improves head items and overall performance on top of the gains on tail items.

Data Augmentation Recommendation Systems +2

Paper
Add Code

Learning to Embed Categorical Features without Embedding Tables for Recommendation

no code implementations • 21 Oct 2020 • Wang-Cheng Kang, Derek Zhiyuan Cheng, Tiansheng Yao, Xinyang Yi, Ting Chen, Lichan Hong, Ed H. Chi

Embedding learning of categorical features (e. g. user/item IDs) is at the core of various recommendation models including matrix factorization and neural collaborative filtering.

Collaborative Filtering Natural Language Understanding +2

Paper
Add Code

DCN V2: Improved Deep & Cross Network and Practical Lessons for Web-scale Learning to Rank Systems

11 code implementations • 19 Aug 2020 • Ruoxi Wang, Rakesh Shivanna, Derek Z. Cheng, Sagar Jain, Dong Lin, Lichan Hong, Ed H. Chi

Learning effective feature crosses is the key behind building recommender systems.

Ranked #12 on Click-Through Rate Prediction on Criteo

Click-Through Rate Prediction Learning-To-Rank +1

76,611

Paper
Code

Beyond Point Estimate: Inferring Ensemble Prediction Variation from Neuron Activation Strength in Recommender Systems

no code implementations • 17 Aug 2020 • Zhe Chen, Yuyan Wang, Dong Lin, Derek Zhiyuan Cheng, Lichan Hong, Ed H. Chi, Claire Cui

Despite deep neural network (DNN)'s impressive prediction performance in various domains, it is well known now that a set of DNN models trained with the same model specification and the same data can produce very different prediction results.

Model-based Reinforcement Learning Recommendation Systems

Paper
Add Code

Small Towers Make Big Differences

no code implementations • 13 Aug 2020 • Yuyan Wang, Zhe Zhao, Bo Dai, Christopher Fifty, Dong Lin, Lichan Hong, Ed H. Chi

A delicate balance between multi-task generalization and multi-objective optimization is therefore needed for finding a better trade-off between efficiency and generalization.

Multi-Task Learning

Paper
Add Code

Self-supervised Learning for Large-scale Item Recommendations

1 code implementation • 25 Jul 2020 • Tiansheng Yao, Xinyang Yi, Derek Zhiyuan Cheng, Felix Yu, Ting Chen, Aditya Menon, Lichan Hong, Ed H. Chi, Steve Tjoa, Jieqi Kang, Evan Ettinger

Our online results also verify our hypothesis that our framework indeed improves model performance even more on slices that lack supervision.

Data Augmentation Natural Language Understanding +3

314

Paper
Code

Learning-to-Rank with Partitioned Preference: Fast Estimation for the Plackett-Luce Model

no code implementations • 9 Jun 2020 • Jiaqi Ma, Xinyang Yi, Weijing Tang, Zhe Zhao, Lichan Hong, Ed H. Chi, Qiaozhu Mei

We investigate the Plackett-Luce (PL) model based listwise learning-to-rank (LTR) on data with partitioned preference, where a set of items are sliced into ordered and disjoint partitions, but the ranking of items within a partition is unknown.

Extreme Multi-Label Classification Learning-To-Rank +1

Paper
Add Code

Learning Multi-granular Quantized Embeddings for Large-Vocab Categorical Features in Recommender Systems

no code implementations • 20 Feb 2020 • Wang-Cheng Kang, Derek Zhiyuan Cheng, Ting Chen, Xinyang Yi, Dong Lin, Lichan Hong, Ed H. Chi

In this paper, we seek to learn highly compact embeddings for large-vocab sparse features in recommender systems (recsys).

Quantization Recommendation Systems

Paper
Add Code

Sampling-Bias-Corrected Neural Modeling for Large Corpus Item Recommendations

2 code implementations • ACM Conference on Recommender Systems 2019 • Xinyang Yi, Ji Yang, Lichan Hong, Derek Zhiyuan Cheng, Lukasz Heldt, Aditee Ajit Kumthekar, Zhe Zhao, Li Wei, Ed Chi

However, batch loss is subject to sampling bias which could severely restrict model performance, particularly in the case of power-law distribution.

Recommendation Systems Retrieval

314

Paper
Code

Recommending what video to watch next: a multitask ranking system

no code implementations • RecSys 2019 • Zhe Zhao, Lichan Hong, Li Wei, Jilin Chen, Aniruddh Nath, Shawn Andrews, Aditee Kumthekar, Maheswaran Sathiamoorthy, Xinyang Yi, Ed Chi

In this paper, we introduce a large scale multi-objective ranking system for recommending what video to watch next on an industrial video sharing platform.

Paper
Add Code

Fairness in Recommendation Ranking through Pairwise Comparisons

no code implementations • 2 Mar 2019 • Alex Beutel, Jilin Chen, Tulsee Doshi, Hai Qian, Li Wei, Yi Wu, Lukasz Heldt, Zhe Zhao, Lichan Hong, Ed H. Chi, Cristos Goodrow

Recommender systems are one of the most pervasive applications of machine learning in industry, with many services using them to match users to products or information.

Fairness Recommendation Systems

Paper
Add Code

Modeling Task Relationships in Multi-task Learning with Multi-gate Mixture-of-Experts

10 code implementations • 19 Jul 2018 • Jiaqi Ma, Zhe Zhao, Xinyang Yi, Jilin Chen, Lichan Hong, Ed Chi

In this work, we propose a novel multi-task learning approach, Multi-gate Mixture-of-Experts (MMoE), which explicitly learns to model task relationships from data.

Binary Classification Click-Through Rate Prediction +2

7,353

Paper
Code

Efficient Training on Very Large Corpora via Gramian Estimation

no code implementations • ICLR 2019 • Walid Krichene, Nicolas Mayoraz, Steffen Rendle, Li Zhang, Xinyang Yi, Lichan Hong, Ed Chi, John Anderson

We study the problem of learning similarity functions over very large corpora using neural network embedding models.

Network Embedding

Paper
Add Code

TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks

1 code implementation • 8 Aug 2017 • Heng-Tze Cheng, Zakaria Haque, Lichan Hong, Mustafa Ispir, Clemens Mewald, Illia Polosukhin, Georgios Roumpos, D. Sculley, Jamie Smith, David Soergel, Yuan Tang, Philipp Tucker, Martin Wicke, Cassandra Xia, Jianwei Xie

Our focus is on simplifying cutting edge machine learning for practitioners in order to bring such technologies into production.

BIG-bench Machine Learning

182,545

Paper
Code

Wide & Deep Learning for Recommender Systems

36 code implementations • 24 Jun 2016 • Heng-Tze Cheng, Levent Koc, Jeremiah Harmsen, Tal Shaked, Tushar Chandra, Hrishi Aradhye, Glen Anderson, Greg Corrado, Wei Chai, Mustafa Ispir, Rohan Anil, Zakaria Haque, Lichan Hong, Vihan Jain, Xiaobing Liu, Hemal Shah

Memorization of feature interactions through a wide set of cross-product feature transformations are effective and interpretable, while generalization requires more feature engineering effort.

Ranked #2 on Click-Through Rate Prediction on Bing News

Click-Through Rate Prediction Feature Engineering +3

17,994

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.