Data Perspective: Basic recommendation engine using R

Basic recommendation engine using R

In our day to day life, we come across a large number of Recommendation engines like Facebook Recommendation Engine for Friends’ suggestions, and suggestions of similar Like Pages, Youtube recommendation engine suggesting videos similar to our previous searches/preferences. In today’s blog post I will explain how to build a basic recommender System.

Types of Collaborative Filtering:

User based Collaborative Filtering
Item based Collaborative filtering

In this post will explain about User based Collaborative Filtering. This algorithm usually works by searching a large group of people and finding a smaller set with tastes similar to yours. It looks at other things they like and combines them to create a ranked list of suggestions.

Implementing User Based Collaborative Filtering:
This involves two steps:

Calculating Similarity Function
Recommend items to users based on user Similarity Score

Consider the below data sample of Movie critics and their movie rankings, the objective is to recommend the unrated movies based on similar users:

Step1- Calculate Similarity Score for CHAN:

Creating Similarity score for people helps us to identify similar people. We use Cosine based Similarity function to calculate the similarity between the users. Know more about cosine similarity here. In R we have a cosine function readily available:

user_sim = cosine(as.matrix(t(x)))

Step2- recommending Movies for CHAN:

For recommending movies for Chan using the above similarity matrix, we need to first fill the N/A where he has not rated. As first step, separate the non-rated movies by Chan and a weighted matrix is created by multiplying user similarity score (user_sim[,7]) with ratings given by other users.

Next step is to sum up all the columns of the weight matrix, then divide by the sum of all the similarities for critics that reviewed that movie. The result calculation gives what the user might rate this movie, the results as below:

The above explanation is written in the below R function:
rec_itm_for_user = function(userNo)
{ #calcualte column wise sum
col_sums= list()
rat_user = critics[userNo,2:7]
x=1
tot = list()
z=1
for(i in 1:ncol(rat_user)){
if(is.na(rat_user[1,i]))
{
col_sums[x] = sum(weight_mat[,i],na.rm=TRUE)
x=x+1
temp = as.data.frame(weight_mat[,i])
sum_temp=0
for(j in 1:nrow(temp))
{ if(!is.na(temp[j,1]))
{
sum_temp = sum_temp+user_sim[j,7]
}
}
tot[z] = sum_temp z=z+1
}
}
z=NULL
z=1
for(i in 1:ncol(rat_user)){
if(is.na(rat_user[1,i]))
{
rat_user[1,i] = col_sums[[z]]/tot[[z]] z=z+1
}
}
return(rat_user)
}
Calling the above function gives the below results:

rec_itm_for_user(7)

Titanic Batman Inception Superman.Returns spiderMan Matrix

2.811 4.5 2.355783 4 1 3.481427
Recommending movies for Chan will be in the order: Matrix (3.48), Titanic(2.81), Inception(2.35).

complete sourceCode is available on github

17 comments:

Windows 7 Update Error 8007000eMar 28, 2020, 6:31:00 PM
Thanks for sharing this useful information! Hope that you will continue with the kind of stuff you are doing. If you want you can also read this blog which talk about how to Fix Windows 7 Update Error 8007000e in a very easy way.
adam scottOct 16, 2020, 10:15:00 AM

What a useful information provided by this blog! It’s remarkable. Thanks for helping me out… I will definitely get in touch with your blogs,
Epson printer in error state

vé máy bay từ nga về việt namFeb 26, 2021, 8:17:00 AM
Aivivu chuyên vé máy bay, tham khảo

giá vé máy bay đi Mỹ khứ hồi

vé máy bay từ mỹ về việt nam giá rẻ

chuyến bay từ frankfurt đến hà nội

đặt vé về việt nam từ nhật

dat ve may bay tu han quoc ve viet nam
SidneyDalessioJun 1, 2021, 10:45:00 PM
Hey! Someone in my Facebook group shared this site with us so I came to look it over. I'm definitely enjoying the information. I'm bookmarking and will be tweeting this to my followers! Wonderful blog and amazing style and design. internet security
StuartFaheyJun 21, 2021, 7:16:00 AM
I've been surfing online more than three hours today, yet I never found any interesting article like yours. It is pretty worth enough for me. In my view, if all website owners and bloggers made good content as you did, the internet will be much more useful than ever before. energy shot supplement
SimmyjessureJul 15, 2021, 9:33:00 AM
This is so amazing article thanks for sharing this type useful wonderful information please share in future again and again .if you are looking accounting service you can get by this at

QuickBooks customer service
RubyeHamillJul 28, 2021, 7:42:00 PM
Most packaging boxes can look like an overwhelming mess to the uninitiated. You may be wondering, how a 2d drawing with a bunch of dotted lines is going to turn into a 3d box. Well, there are free options to get you the right box for your unique needs.
vivikhapnoiJul 29, 2021, 1:53:00 PM
Fantastic!! you are doing good job! I impressed. Many bodies are follow to you and try to some new.. After read your comments I feel; Its very interesting and every guys sahre with you own works. Great!!
ve may bay tu Phap ve Viet Nam

vé máy bay từ singapore về việt nam

vé máy bay từ úc về việt nam

ve may bay tu Han Quoc ve Viet Nam

ve may bay tu Nhat Ban ve Viet Nam

ve may bay tu My ve Viet Nam

ve may bay tu Duc ve Viet Nam
alexa joneSep 20, 2021, 6:52:00 PM
very informative article . keep sharing it .if you have need software service you can call at quickbooks customer service
FellixLarsonJan 20, 2022, 7:45:00 PM
What are your company values? What kind of message do you want to convey to your customers? If you’re a fun-loving company, your bath bomb boxes and other packaging should reflect that. Perhaps you want to try out bright, peppy colors, or inject some humor.
PMP Training in MalaysiaApr 12, 2022, 3:11:00 PM
360DigiTMG, the top-rated organisation among the most prestigious industries around the world, is an educational destination for those looking to pursue their dreams around the globe. The company is changing careers of many people through constant improvement, 360DigiTMG provides an outstanding learning experience and distinguishes itself from the pack. 360DigiTMG is a prominent global presence by offering world-class training. Its main office is in India and subsidiaries across Malaysia, USA, East Asia, Australia, Uk, Netherlands, and the Middle East.
AnonymousJun 16, 2022, 4:50:00 PM
Someone sometimes with visits your blog regularly and recommended it in my experience to read as well.
Software Testing Tools Training in Hyderabad
sajinfoworldSep 7, 2022, 7:29:00 AM
You have provided finicky information for a new blogger so it has turned out to be really obliging. Keep up the good work!
CCTV installation services in Hooghly
CCTV camera installation in Kolkata
DiveSEOSep 22, 2022, 5:18:00 PM
Admiring the time and effort you put into your site and in depth information you provide. It’s great to come across your website every once in a while that isn’t the same old rehashed material. Excellent read! I’ve bookmarked your site and I’m adding your RSS feeds to my Google account.
SEO Company in London
feligrat solutionsMar 27, 2023, 10:51:00 AM
It is helpful and in-depth understanding is provided. It is useful for myData Science Training and I will share it with my friends.
Các mẫu nhà đẹpMar 28, 2023, 3:39:00 PM
cửa nhựa lõi thép sparlee profile

thanh nhựa lõi thép

bản lề cửa window

tiêu chuẩn chiều sâu khoan cấy thép

tiêu chuẩn khoan cấy thép vào bê tông

tiêu chuẩn về khoan cấy thép
FrederickFeb 7, 2025, 9:32:00 AM
Looking for reliable, authorized appliance repair? Our certified technicians provide expert repair services for all major home appliances, ensuring quality and reliability. Whether it’s a refrigerator, washing machine, oven, or dryer, we have the skills and experience to fix it right the first time. authorized appliance repair We use only genuine parts to ensure your appliances perform at their best. Fast, affordable, and guaranteed service! Trust the professionals for all your appliance repair needs—contact us today and get your home running smoothly again!

Blog posts on Data Science, Machine Learning, Data Mining, Artificial Intelligence, Spark Machine Learning

Sunday, May 25, 2014

Basic recommendation engine using R

Types of Collaborative Filtering:

Step1- Calculate Similarity Score for CHAN:

Step2- recommending Movies for CHAN:

17 comments: