Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
On building graphs of documents with artificial ants
Azzag H., Lavergne J., Guinot C., Venturini G.  World Wide Web (Proceedings of the 16th International Conference on the World Wide Web, Banff, Alberta, Canada, May 8-12, 2007)1299-1300.2007.Type:Proceedings
Date Reviewed: Aug 10 2007

The task of clustering similar or related documents is important to information retrieval systems, like search engines. This is done by building graphs, where the given set of documents form the nodes and the edges represent the similarity between the documents and nodes.

The authors present a graph-building algorithm that closely follows the self-assembly behavior observed when ants build living structures by connecting their bodies together. The tabulated results show that the proposed algorithm outperforms standard methods, such as relative neighborhood graphs (RNG) methods, for building graphs, while finding more similarity, that is, creating more links between documents.

The key principle of the proposed algorithm is that the graph is built incrementally. When a new document is added, it follows the path of maximum similarity: it is connected to all neighboring nodes and documents whose similarity to the new document is higher than a given similarity threshold.

As this is a short poster session paper, details of the algorithm and evaluation are omitted. It may be worthwhile to investigate other related publications by the authors, as the high performance of this algorithm makes it a promising substitute for current clustering algorithms.

Reviewer:  Suma Adabala Review #: CR134635 (0807-0704)
Bookmark and Share
  Reviewer Selected
 
 
Clustering (H.3.3 ... )
 
Would you recommend this review?
yes
no
Other reviews under "Clustering": Date
Concepts and effectiveness of the cover-coefficient-based clustering methodology for text databases
Can F. (ed), Ozkarahan E. ACM Transactions on Database Systems 15(3): 483-517, 1990. Type: Article
Dec 1 1992
A parallel algorithm for record clustering
Omiecinski E., Scheuermann P. ACM Transactions on Database Systems 15(3): 599-624, 1990. Type: Article
Nov 1 1992
Organization of clustered files for consecutive retrieval
Deogun J., Raghavan V., Tsou T. ACM Transactions on Database Systems 9(4): 646-671, 1984. Type: Article
Jun 1 1985
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy