
Large Scale Web-Content Classification.

International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (2015)

Abstract
Web classification is used in many security devices to prevent users from accessing web sites that the current security policy does not allow, as well as to improve web search and to implement contextual advertising. Many commercial web classification services are available on the market, along with a few publicly available web directory services. Unfortunately, they mostly focus on English-language web sites, which limits their classification reliability and coverage for other languages. This paper covers the design and implementation of a web-based classification tool for TLDs (Top Level Domains). Each domain is classified by analysing its main web site and assigning it to categories according to its content. The tool has been successfully validated by classifying all registered .it Internet domains; the results are presented in this paper.
Keywords
Internet Domain, Web-Content Classification, HTTP Crawling, Web Mining, SVM
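
The abstract says each domain is classified from the content of its main web site, and the keywords mention SVM, but this page does not describe the actual pipeline. The following is therefore only a minimal sketch under stated assumptions, not the authors' implementation: it extracts visible text from an already-fetched home page, turns it into a bag of words, and trains a binary (one category versus the rest) linear SVM with a Pegasos-style subgradient method. All names (`TextExtractor`, `html_to_tokens`, `train_linear_svm`, `predict`), the toy pages, and the hyperparameters are hypothetical; HTTP crawling is left out so the example runs offline.

```python
import random
import re
from collections import Counter
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text, skipping the bodies of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.parts.append(data)


def html_to_tokens(html):
    """Return lowercase alphabetic tokens from the visible text of a page."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    text = " ".join(parser.parts).lower()
    return re.findall(r"[^\W\d_]+", text)  # runs of Unicode letters only


def score(weights, tokens):
    """Dot product between the weight vector and the bag-of-words counts."""
    return sum(weights[t] * c for t, c in Counter(tokens).items())


def train_linear_svm(docs, labels, epochs=50, lam=0.01, seed=0):
    """Pegasos-style training on hinge loss; labels must be +1 or -1."""
    rng = random.Random(seed)
    weights = Counter()  # missing tokens read as weight 0
    order = list(range(len(docs)))
    step = 0
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            step += 1
            eta = 1.0 / (lam * step)          # decaying learning rate
            margin = labels[i] * score(weights, docs[i])
            for token in weights:              # L2 regularisation shrink
                weights[token] *= 1.0 - eta * lam
            if margin < 1:                     # hinge loss is active
                for token, count in Counter(docs[i]).items():
                    weights[token] += eta * labels[i] * count
    return weights


def predict(weights, tokens):
    """Return +1 for the target category, -1 for everything else."""
    return 1 if score(weights, tokens) > 0 else -1


# Toy, hand-written pages (hypothetical): +1 = "shopping", -1 = other.
TRAINING_PAGES = [
    ("<html><body><h1>Store</h1><p>cart price discount</p></body></html>", 1),
    ("<html><body><p>cart checkout shipping</p></body></html>", 1),
    ("<html><body><h1>Daily</h1><p>news editor article</p></body></html>", -1),
    ("<html><body><p>news politics weather</p></body></html>", -1),
]

docs = [html_to_tokens(page) for page, _ in TRAINING_PAGES]
labels = [label for _, label in TRAINING_PAGES]
model = train_linear_svm(docs, labels)
print(predict(model, html_to_tokens("<p>cart price</p>")))
```

A multi-category classifier could be assembled from one such binary model per category (one-versus-rest); a production system would more likely rely on an established SVM library and weighted features such as TF-IDF rather than raw counts.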