Web Page Classification based on Unsupervised Learning using MIME type Analysis

2021 International Conference on COMmunication Systems & NETworkS (COMSNETS)(2021)

引用 4|浏览2
暂无评分
摘要
The properties of a web page have a strong impact on the experience of web users. In this work, a classification method based on unsupervised clustering is proposed to group web pages into classes based on download content that may affect the Quality of Experience (QoE) perceived by the user. Groups are defined based on Multipurpose Internet Mail Extensions (MIME) content breakdown and external subdomain connections, obtained with a desktop personal computer (PC) running WebPageTest tool. The dataset is generated with a PC as a terminal, emulating the first access to 500 popular websites. The collected data is divided into groups with a classical unsupervised learning algorithm, namely K-means clustering. Results show how web pages are classified in six groups and their cluster characteristics.
更多
查看译文
关键词
Web page,classification,clustering,WebPageTest,unsupervised learning,Quality of Experience
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要