Computing Reviews

Taxonomy induction based on a collaboratively built knowledge repository
Ponzetto S., Strube M. Artificial Intelligence 175(9-10): 1737-1756, 2011. Type: Article
Date Reviewed: 08/28/12

Ontologies have always interested me. As a natural language processing (NLP) researcher, I have worried about linguistic resources for NLP tasks. Therefore, it was a pleasure to read this paper on a system the authors developed as an enhancement to Wikipedia. They describe the language acquisition bottleneck and provide a solution to the problem: a taxonomy derived from Wikipedia that, besides making Wikipedia more powerful and complete, allows for more accurate information retrieval. They also offer a large and comprehensive bibliography, containing references concerned with taxonomy and NLP applications that need such lexical resources.

The authors present research on taxonomy induction based on two lexical resources, Cyc and WordNet. At times, the text can be a bit confusing, due to the large amount of data presented. In addition, the runtime required for taxonomy generation is too long, according to the text. Furthermore, the authors fail to explain how they calculate accuracy. It also would have been very interesting to know the cost and time savings this method affords, compared with other approaches.

The methods employed are clear and well defined, and the evaluation is well presented. Beyond demonstrating the system's precision, Ponzetto and Strube also show that it is innovative. They mention the Kylin ontology generator (KOG) [1], which is similar to their system, and highlight some of the similarities and differences between the two.

Overall, the paper is very good. I recommend it to anyone seeking a comprehensive presentation of many aspects of taxonomy induction based on collaboratively built sources.


1) Wu, F.; Weld, D. S. Automatically refining the Wikipedia infobox ontology. In Proc. of the 17th International World Wide Web Conference. ACM, 2008, 635-644.

Reviewer:  João Luís G. Rosa Review #: CR140440 (1301-0039)
