TY - JOUR
T1 - A machine learning-based recommendation model for bipartite networks
AU - Kart, Ozge
AU - Ulucay, Oguzhan
AU - Bingol, Berkay
AU - Isik, Zerrin
PY - 2020/9/1
Y1 - 2020/9/1
N2 - Online user reviews on a product, service or content has been widely used for recommender systems with the spread of the internet and online applications. Link prediction is one of the popular recommender system approaches. It can benefit the structure of a social network by mapping item reviews of users to a bipartite user–item graph structure. This study aims to investigate how topological information, namely neighbor-based, path-based and random walk-based network similarity metrics, improve the prediction capability of a recommendation model. This study proposes a supervised machine learning-based link prediction model for weighted and bipartite social networks. The input features of the machine learning model are extended versions of similarity metrics for weighted and bipartite networks. Our proposed model provides 0.93 and 0.9 AUC values for the Goodreads and MovieLens datasets, respectively. Random forest and extreme gradient boosting as the ensemble models achieved the highest performances for ItemRank metric in both datasets.
AB - Online user reviews on a product, service or content has been widely used for recommender systems with the spread of the internet and online applications. Link prediction is one of the popular recommender system approaches. It can benefit the structure of a social network by mapping item reviews of users to a bipartite user–item graph structure. This study aims to investigate how topological information, namely neighbor-based, path-based and random walk-based network similarity metrics, improve the prediction capability of a recommendation model. This study proposes a supervised machine learning-based link prediction model for weighted and bipartite social networks. The input features of the machine learning model are extended versions of similarity metrics for weighted and bipartite networks. Our proposed model provides 0.93 and 0.9 AUC values for the Goodreads and MovieLens datasets, respectively. Random forest and extreme gradient boosting as the ensemble models achieved the highest performances for ItemRank metric in both datasets.
UR - http://www.scopus.com/inward/record.url?scp=85079533598&partnerID=8YFLogxK
U2 - 10.1016/j.physa.2020.124287
DO - 10.1016/j.physa.2020.124287
M3 - Article
SN - 0378-4371
VL - 553
JO - Physica A: Statistical Mechanics and its Applications
JF - Physica A: Statistical Mechanics and its Applications
M1 - 124287
ER -