Attribute-aware non-linear co-embeddings of graph features
In very sparse recommender data sets, attributes of users such as age, gender and home location and attributes of items such as, in the case of movies, genre, release year, and director can improve the recommendation accuracy, especially for users and items that have few ratings. While most recommendation models can be extended to take attributes of users and items into account, their architectures usually become more complicated. While attributes for items are often easy to be provided, attributes for users are often scarce for reasons of privacy or simply because they are not relevant to the operational process at hand. In this paper, we address these two problems for attribute-aware recommender systems by proposing a simple model that co-embeds users and items into a joint latent space in a similar way as a vanilla matrix factorization, but with non-linear latent features construction that seamlessly can ingest user or item attributes or both (GraphRec). To address the second problem, scarce attributes, the proposed model treats the user-item relation as a bipartite graph and constructs generic user and item attributes via the Laplacian of the user-item co-occurrence graph that requires no further external side information but the mere rating matrix. In experiments on three recommender datasets, we show that GraphRec significantly outperforms existing state-of-the-art attribute-aware and content-aware recommender systems even without using any side information.
PDFCode
Datasets
Results from the Paper
Ranked #4 on Recommendation Systems on MovieLens 100K (using extra training data)
Task | Dataset | Model | Metric Name | Metric Value | Global Rank | Uses Extra Training Data |
Benchmark |
---|---|---|---|---|---|---|---|
Recommendation Systems | MovieLens 100K | GraphRec + Feat | RMSE (u1 Splits) | 0.897 | # 4 | ||
RMSE (Random 90/10 Splits) | 0.883 | # 1 | |||||
Recommendation Systems | MovieLens 100K | GraphRec | RMSE (u1 Splits) | 0.904 | # 5 | ||
RMSE (Random 90/10 Splits) | 0.887 | # 2 |