Collaborative Deep Learning for Recommender Systems

[1409.2944] Collaborative Deep Learning for Recommender Systems
by H Wang - ‎2014 - ‎Cited by 113 - ‎Related articles
https://arxiv.org/abs/1409.2944

CF의 단점때문에 보조정보(auxiliary information)을 이용하는 method 많이들씀
1. loosely coupled
  - 보조정보 처리 후 CF에 넣음
2. tightly coupled
  - the rating information can guide the learning of features
  - Collaborative topic regression (CTR) ^[1]
    = Latent Dirichlet allocation (LDA) ^[2](as a topic model) + probabilistic matrix factorization (PMF) ^[3](as a model-based CF).
  - auxiliary info도 sparse하면 잘 안된다.

We first present a Bayesian formulation of a deep learning model called stacked denoising autoencoder (SDAE) ^[4]. With this, we then present our CDL model which tightly couples deep representation learning for the content information and collaborative filtering for the ratings (feedback) matrix, allowing two-way interaction between the two.
- SDAE말고, RBM, CNN, RNN같은거 써도 된다.

contributions
- Unlike previous deep learning models which use simple target like classification ^[5]and reconstruction ^[6], we propose to use CF as a more complex target in a probabilistic framework.
- Besides the algorithm for attaining maximum a poste- riori (MAP) estimates, we also derive a sampling-based algorithm for the Bayesian treatment of CDL, which, interestingly, turns out to be a Bayesian generalized version of back-propagation.
- etc.

notation

\(\mathbf{X}_c\)는 \(J\times S\) matrix. J items가 있고, 각 item이 S dimension. c는 clean을 뜻함. SDAE의 input이 된다(noise corrupted matrix는 \(\mathbf{X}_0\))
\(\mathbf{R}\)은 \(I\times J\) matrix. User수가 \(I\).
Note that an L/2-layer SDAE corresponds to an L-layer network.

CDL

Stacked Denoising Autoencoders

AE인데 그냥 stack한것.

Generalized Bayesian SDAE

Note that while generation of the clean input \(\mathbf{X}_c\) from \(\mathbf{X}_L\) is part of the generative process of the Bayesian SDAE, generation of the noise-corrupted input \(\mathbf{X}_0\) from \(\mathbf{X}_c\) is an artificial noise injection process to help the SDAE learn a more robust feature representation.

Collaborative Deep Learning

Maximum A Posteriori Estimates

Prediction

References

↑ C. Wang and D. M. Blei. Collaborative Topic Modeling for Recommending Scientific Articles. In KDD, pages 448–456, 2011.
↑ D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent Dirichlet allocation. JMLR, 3:993–1022, 2003.
↑ R. Salakhutdinov and A. Mnih. Probabilistic matrix factorization. In NIPS, pages 1257–1264, 2007.
↑ P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, and P.-A. Manzagol. Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. JMLR, 11:3371–3408, 2010.
↑ N. Kalchbrenner, E. Grefenstette, and P. Blunsom. A convolutional neural network for modelling sentences. ACL, pages 655–665, 2014.
↑ P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, and P.-A. Manzagol. Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. JMLR, 11:3371–3408, 2010.

blog comments powered by Disqus

[1] C. Wang and D. M. Blei. Collaborative Topic Modeling for Recommending Scientific Articles. In KDD, pages 448–456, 2011.

[2] D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent Dirichlet allocation. JMLR, 3:993–1022, 2003.

[3] R. Salakhutdinov and A. Mnih. Probabilistic matrix factorization. In NIPS, pages 1257–1264, 2007.

[4] P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, and P.-A. Manzagol. Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. JMLR, 11:3371–3408, 2010.

[5] N. Kalchbrenner, E. Grefenstette, and P. Blunsom. A convolutional neural network for modelling sentences. ACL, pages 655–665, 2014.

[6] P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, and P.-A. Manzagol. Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. JMLR, 11:3371–3408, 2010.

[1]

[2]

[3]

[4]

[5]

[6]

둘러보기 메뉴