Transformers with multi-modal features and post-fusion context for e-commerce session-based recommendation

11 Jul 2021 · Gabriel de Souza P. Moreira, Sara Rabhi, Ronay Ak, Md Yasin Kabir, Even Oldridge

Session-based recommendation is an important task for e-commerce services, where a large number of users browse anonymously or may have very distinct interests in different sessions. In this paper we present one of the winning solutions for the Recommendation task of the SIGIR 2021 Workshop on E-commerce Data Challenge. Our solution was inspired by NLP techniques and consists of an ensemble of two Transformer architectures - Transformer-XL and XLNet - trained with autoregressive and autoencoding approaches. To leverage most of the rich dataset made available for the competition, we describe how we prepared multi-modal features by combining tabular events with textual and image vectors. We also present a model prediction analysis to better understand the effectiveness of our architectures for session-based recommendation.

Task                    Dataset                       Model                 Metric Name  Metric Value  Global Rank
Product Recommendation  Coveo Data Challenge Dataset  Ensemble (60 models)  MRR          0.2784        #1
Product Recommendation  Coveo Data Challenge Dataset  Ensemble (60 models)  F1           0.0748        #1
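
For readers unfamiliar with the MRR metric reported above, the following is a minimal sketch of the standard mean reciprocal rank definition: for each session, take the reciprocal of the position at which the true next item appears in the ranked predictions (0 if it is absent), then average over sessions. This is not the challenge's official evaluation script; the function name, the item identifiers, and the absence of a fixed cutoff are assumptions made for illustration.

```python
from typing import Sequence


def mean_reciprocal_rank(
    predictions: Sequence[Sequence[str]], targets: Sequence[str]
) -> float:
    """Average of 1/rank of each target in its ranked list (0 if missing).

    Hypothetical helper for illustration; not the official challenge scorer.
    """
    if len(predictions) != len(targets):
        raise ValueError("predictions and targets must have the same length")
    if not targets:
        return 0.0

    total = 0.0
    for ranked_items, target in zip(predictions, targets):
        # Ranks are 1-based: the first recommended item has rank 1.
        for rank, item in enumerate(ranked_items, start=1):
            if item == target:
                total += 1.0 / rank
                break  # only the first occurrence counts
    return total / len(targets)


# Toy example with made-up item IDs: the targets sit at rank 2, rank 1,
# and are missing, respectively.
example_predictions = [["a", "b", "c"], ["x", "y", "z"], ["p", "q"]]
example_targets = ["b", "x", "r"]
example_mrr = mean_reciprocal_rank(example_predictions, example_targets)
```

In the toy example the per-session reciprocal ranks are 1/2, 1, and 0, so the average is 0.5.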

Methods