Abstract
Sequential dynamics are a key feature of many modern recommender systems, which seek to capture the 'context' of users' activities on the basis of actions they have performed recently. To capture such patterns, two approaches have proliferated: Markov Chains (MCs) and Recurrent Neural Networks (RNNs). Markov Chains assume that a user's next action can be predicted on the basis of just their last (or last few) actions, while RNNs in principle allow longer-term semantics to be uncovered. Generally speaking, MC-based methods perform best in extremely sparse datasets, where model parsimony is critical, while RNNs perform better in denser datasets where higher model complexity is affordable. Our work aims to balance these two strengths by proposing a self-attention based sequential model (SASRec) that captures long-term semantics (like an RNN) but, using an attention mechanism, makes its predictions based on relatively few actions (like an MC). At each time step, SASRec seeks to identify which items are 'relevant' in a user's action history and uses them to predict the next item. Extensive empirical studies show that our method outperforms various state-of-the-art sequential models (including MC/CNN/RNN-based approaches) on both sparse and dense datasets. Moreover, the model is an order of magnitude more efficient than comparable CNN/RNN-based models. Visualizations of attention weights also show how our model adaptively handles datasets of varying density and uncovers meaningful patterns in activity sequences.
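To make the idea concrete, the sketch below shows a self-attentive next-item predictor in PyTorch: item and position embeddings are summed, a causally masked self-attention layer aggregates the action history, and candidate items are scored against the representation at the final position. This is a minimal sketch, not the authors' implementation; the class name, dimensions, and the use of a single attention block are illustrative assumptions.

```python
# Minimal sketch of a self-attentive next-item predictor (not the authors' code).
# num_items, max_len, hidden_dim, and num_heads are illustrative assumptions.
import torch
import torch.nn as nn

class SelfAttentiveNextItem(nn.Module):
    def __init__(self, num_items, max_len=50, hidden_dim=64, num_heads=1):
        super().__init__()
        # Item id 0 is reserved for padding.
        self.item_emb = nn.Embedding(num_items + 1, hidden_dim, padding_idx=0)
        self.pos_emb = nn.Embedding(max_len, hidden_dim)
        self.attn = nn.MultiheadAttention(hidden_dim, num_heads, batch_first=True)
        self.ffn = nn.Sequential(
            nn.Linear(hidden_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, hidden_dim),
        )

    def forward(self, seq):
        # seq: (batch, max_len) item ids, left-padded with 0.
        batch, max_len = seq.shape
        positions = torch.arange(max_len, device=seq.device).unsqueeze(0)
        x = self.item_emb(seq) + self.pos_emb(positions)
        # Causal mask: position t may only attend to positions <= t,
        # so the prediction never peeks at future actions.
        causal = torch.triu(
            torch.ones(max_len, max_len, dtype=torch.bool, device=seq.device),
            diagonal=1,
        )
        h, _ = self.attn(x, x, x, attn_mask=causal)
        h = self.ffn(h)
        # Score every candidate item at each step by inner product with the
        # shared item embedding table.
        return h @ self.item_emb.weight.T  # (batch, max_len, num_items + 1)

# Usage: scores[:, -1] ranks candidate items to follow each sequence.
model = SelfAttentiveNextItem(num_items=1000)
seq = torch.randint(1, 1001, (2, 50))
scores = model(seq)
next_item = scores[:, -1].argmax(dim=-1)
```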
Related Publications
Scene Segmentation with DAG-Recurrent Neural Networks
Speech recognition with deep recurrent neural networks
DiSAN: Directional Self-Attention Network for RNN/CNN-Free Language Understanding
Session-Based Recommendation with Graph Neural Networks
Sequence Transduction with Recurrent Neural Networks
Publication Info
- Year: 2018
- Type: article
- Pages: 197-206
- Citations: 2397
- Access: Closed
Identifiers
- DOI: 10.1109/icdm.2018.00035
- arXiv: 1808.09781