on back w3c Fifth International World Wide Web Conference

Bookmarks Clustering Algorithm

  1. For each URL in the collection, Access the source of the corresponding HTML document
  2. Index each document and generate an LA-based profile
  3. Compute the pairwise similarity of all the documents
  4. Cluster by applying the HAC method in order to produce a binary dendogram
  5. Collapse the dendogram into a non-binary cluster hierarchy by applying the slicing technique
  6. Enrich the cluster hierarchy with conceptual information


Automatically Organizing Bookmarks per Contents (Y.S.Maarek,I.Z.Ben-Shaul)