
Amazon Books Rating Prediction Recommendation Model

Hsiu-Ping Lin, Suman Chauhan, Yougender Chauhan, Nagender Chauhan, Jongwook Woo

arXiv (Cornell University) (2023)

Abstract
This paper uses an Amazon dataset to predict the ratings of books listed on the Amazon website. As part of this project, we predicted book ratings and built a recommendation cluster, which recommends books based on column values in the dataset, for instance category, description, author, price, and reviews. The paper describes a workflow for handling big data files, data engineering, building models, and producing predictions. The models predict the book rating column using various PySpark machine learning APIs. We tuned both parameters and hyperparameters, and used cross-validation and TrainValidationSplit for generalization. Finally, we compared the accuracy of binary and multiclass classification by converting the label from multiclass to binary. We found that binary classification achieves higher accuracy than multiclass classification.
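The multiclass-to-binary label conversion mentioned in the abstract can be illustrated with a minimal sketch. It is written in plain Python rather than PySpark to stay dependency-free, and it is not the authors' implementation: the threshold of 4 stars, the helper names `to_binary` and `accuracy`, and the sample ratings are all assumptions made for illustration.

```python
def to_binary(rating, threshold=4):
    """Map a star rating to 1 if rating >= threshold, else 0.

    The threshold of 4 is an assumption, not a value from the paper.
    """
    return 1 if rating >= threshold else 0


def accuracy(y_true, y_pred):
    """Fraction of positions where the prediction equals the true label."""
    matches = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return matches / len(y_true)


# Hypothetical true and predicted star ratings (not data from the paper).
true_ratings = [5, 4, 3, 1, 2, 5, 4, 2]
pred_ratings = [4, 4, 2, 1, 3, 5, 5, 2]

# Multiclass: a prediction counts only if the exact star value matches.
multiclass_acc = accuracy(true_ratings, pred_ratings)

# Binary: both sides are collapsed to high/low before comparing.
binary_acc = accuracy(
    [to_binary(r) for r in true_ratings],
    [to_binary(r) for r in pred_ratings],
)

print(multiclass_acc, binary_acc)
```

In this toy data every wrong prediction is off by one star and stays on the same side of the threshold, so the error counts against multiclass accuracy but not against binary accuracy. This is one plausible reason a binary label can score higher; the abstract itself does not state the cause.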
Keywords
Online Shopping