Collaborative Filtering Recommendation

KAIST GSDS 대학원 박찬영 교수님의 수업인 추천시스템 및 그래프 기계학습 수업을 필기입니다.

Approach
- Use the "wisdom of the crowd" to recommend items
Basic assumption and idea
- Users give ratings to items
- Customers who had similar tastes in the past, will have similar tastes in the future(입맛 안변해)

Main idea
- Similar users display similar patterns of rating behavior -> User-based CF
- Similar items receive similar ratings -> Item-based CF
Input
- Only a matrix of given user-item rating
Output types
- A numerical prediction indicating to what degree(유저의 선호도 점수로)
- A top-N list of recommended items
Representative methodology
기본적으로 item-based CF가 더 연관성 있는 추천을 해준다. 왜냐하면 user의 rating을 이용하기 때문이다. 또한, item-based CF는 추천의 이유를 제공한다는 특징이 있다.
- User-Based Collaborative Filtering: 특정 User를 기준으로 비슷한 다른 User의 선호도를 기반으로 Item을 추천
- Item-Based Collaborative Filtering: 특정 Item을 기준으로 비슷한 Item을 추천
Pros
- Requires miknimal knowledge (only rating/implicit feedback)
- Produces good-enough results in most cases
Challenges
- Sparsity / Cold-start problem: 충분한 users 확보와 새로운 user, item에 대해 취약(New user, item problem)
- Scalability: item, users 수에 따라서 계산량이 증가하고 한 user가 item을 구매하면 다른 user나 item의 순위 영향
- Diversity: 다양한 결과나 나오기 힘듬

Euclidean distance
problem: euclidean distance favors small sets, even if the have few elements in common

Jaccard similarity
problem: Jaccard similarity can only take care of binary relevance scores

Pearson correlation
Different users may provide ratings on different scales
-> Subtract the average(평균에서 얼마나 차이나는지를 확인함, 사용자 v와 u의 공통된 item이 많이 필요)

A common prediction function

참조: https://kmhana.tistory.com/31?category=882777

Content-Based Recommendation (0)	2022.09.01
Introduction to Recommender System (0)	2022.08.31

티스토리툴바