Paper title: “Web Page Clustering by Combining Dense Units”

Authors: Morteza Haghir Chehreghani (1), Hassan Abolhassani (1) and Mostafa Haghir Chehreghani (2)
Affiliation:
1. Department of CE, Sharif University of Technology, Tehran, Iran
2. Department of ECE, University of Tehran, Tehran, Iran

Abstract: Clustering web data is one of the most important approaches to extracting knowledge from the web. This paper presents a novel method for clustering web pages that first finds dense units using the K-Means method and then joins these units to construct the final clusters. The method is also extended to hierarchical clustering. Experimental results show that both the flat and the hierarchical clusters are of high quality.