Comparative analysis of neighborhood-based approache and matrix factorization in recommender systems
DOI:
https://doi.org/10.15587/1729-4061.2015.43074Keywords:
collaborative filtering, neighborhood-based recommendations, matrix factorization-based recommendations, feature interpretationAbstract
Unlike other works, this paper aims at searching a connection between two most popular approaches in recommender systems domain: Neighborhood-based (NB) and Matrix Factorization-based (MF). Provided analysis helps better understand advantages and disadvantages of each approach as well as their compatibility.
While NB relies on the ratings of similar users to estimate the rating of a user on an item, MF relies on the identification of latent features that represent the underlying relation between users and items. However, as it was shown in this paper, if latent features of Non-negative Matrix Factorization are interpreted as users, the processes of rating estimation by two methods become similar. In addition, it was shown through experiments that in this case elements of NB and MF are highly correlated. Still there is a major difference between Matrix Factorization-based and Neighborhood-based approaches: the first one exploits the same set of base elements to estimate unknown ratings (the set of latent features), while the second forms different sets of base elements (in this case neighbors) for each user-item pair.
References
- Turner, V. The Digital Universe of Opportunities: Rich Data and the Increasing Value of the Internet of Things. IDC iView. Available at: http://www.emc.com/leadership/digital-universe/2014iview/digital-universe-of-opportunities-vernon-turner.htm
- Adomavicius, G., Tuzhilin, A. (2005). Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering, 17 (6), 734–749. doi: 10.1109/tkde.2005.99
- Pazzani, M. J., Billsus, D. (2007). Content-based Recommendation Systems. The adaptive web, 4321, 325–341. doi: 10.1007/978-3-540-72079-9_10
- Schafer, J. B., Frankowski, D., Herlocker, J., Sen, S. (2007). Collaborative Filtering Recommender Systems. Lecture Notes in Computer Science, 4321, 291–324. doi: 10.1007/978-3-540-72079-9_9
- Burke, R. (2002). Hybrid Recommender Systems: Survey and Experiments. User modeling and user-adapted interaction, 12 (4), 331–370.
- Breese, J., Heckerman, D., Kadie, C. (1998). Empirical Analysis of Predictive Algorithms for Collaborative Filtering. Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI'98). 43–52.
- Koren, Y., Bell, R., Volinsky, C. (2009). Matrix Factorization Techniques for Recommender Systems. Computer, 42 (8), 30–37. doi: 10.1109/mc.2009.263
- Takacs, I., Pilaszy, I., Nemeth, B., Tikk, D. (2008). Matrix Factorization and Neighbor Based Algorithms for the Netflix Prize Problem. Proceedings of the 2008 ACM Conference on Recommender systems, 267–274. doi: 10.1145/1454008.1454049
- Shani, G., Gunawardana, A. (2011). Evaluating Recommendation Systems. Recommender Systems Handbook, 257–297. doi: 10.1007/978-0-387-85820-3_8
- Brun, A., Aleksandrova, M., Boyer, A. (2014). Can Latent Features be Interpreted as Users in Matrix Factorization-based Recommender Systems? 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2, 226–233. doi: 10.1109/wi-iat.2014.102
- Aleksandrova, M., Brun, A., Boyer, A., Chertov, O. (2014). Search for User-related Features in Matrix Factorization-based Recommender Systems. European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD 2014), PhD Session Proceedings, 1, 1–10.
- Pessiot, J. F., Truong, V., Usunier, N., Amini, M., Gallinari, P. (2006). Factorisation en Matrices Non-negatives pour le Filtrage Collaboratif. Proceedings of 3rd Conference en Recherche d'Information et Applications, 12.
- Zhang, S., Wang, W., Ford, J., Makedon, F. (2006). Learning from Incomplete Ratings Using Non-negative Matrix Factorization. Proceedings of the 6th SIAM Conference on Data Mining, 6, 548–552. doi: 10.1137/1.9781611972764.58
- Sarwar, B., Karypis, G., Konstan, J., Riedl, J. (2000). Application of Dimensionality Reduction in Recommender System a Case Study. (No. TR-00-043). Minnesota University Minneapolis Department of Computer Science, 15.
- Zhou, Y., Wilkinson, D., Schreiber, R., Pan, R. (2008). Large-scale Parallel Collaborative Filtering for the Netflix Prize. Algorithmic Aspects in Information and Management, 337–348. doi: 0.1007/978-3-540-68880-8_32
- Koren, Y. (2008). Factorization Meets the Neighborhood: a Multifaceted Collaborative Filtering Model. Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 426–434. doi: 10.1145/1401890.1401944
- Lee, D. D., Seung, H. S. (2001). Algorithms for Non-negative Matrix Factorization. Advances in Neural Information Processing Systems, 556‑562.
- Goldberg, D., Nichols, D., Oki, B., Terry, D. (1992). Using Collaborative Filtering to Weave an Information Tapestry. Communications of the ACM, 35 (12), 61–70. doi: 10.1145/138859.138867
- Boumaza, A., Brun, A. (2012). Stochastic Search for Global Neighbors Selection in Collaborative Filtering. Proceedings of the 27th Annual ACM Symposium on Applied Computing, 232–237. doi: 10.1145/2245276.2245322
- MovieLens Dataset GroupLens. Available at: http://grouplens.org/datasets/movielens/
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2015 Oleg Chertov, Armelle Brun, Anne Boyer, Marharyta Aleksandrova
This work is licensed under a Creative Commons Attribution 4.0 International License.
The consolidation and conditions for the transfer of copyright (identification of authorship) is carried out in the License Agreement. In particular, the authors reserve the right to the authorship of their manuscript and transfer the first publication of this work to the journal under the terms of the Creative Commons CC BY license. At the same time, they have the right to conclude on their own additional agreements concerning the non-exclusive distribution of the work in the form in which it was published by this journal, but provided that the link to the first publication of the article in this journal is preserved.
A license agreement is a document in which the author warrants that he/she owns all copyright for the work (manuscript, article, etc.).
The authors, signing the License Agreement with TECHNOLOGY CENTER PC, have all rights to the further use of their work, provided that they link to our edition in which the work was published.
According to the terms of the License Agreement, the Publisher TECHNOLOGY CENTER PC does not take away your copyrights and receives permission from the authors to use and dissemination of the publication through the world's scientific resources (own electronic resources, scientometric databases, repositories, libraries, etc.).
In the absence of a signed License Agreement or in the absence of this agreement of identifiers allowing to identify the identity of the author, the editors have no right to work with the manuscript.
It is important to remember that there is another type of agreement between authors and publishers – when copyright is transferred from the authors to the publisher. In this case, the authors lose ownership of their work and may not use it in any way.