Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Health: related information structuring for the semantic Web
Eljinini M.  ISWSA 2011 (Proceedings of the 2011 International Conference on Intelligent Semantic Web Services and Applications, Amman, Jordan, Apr 18-20, 2011)1-7.2011.Type:Proceedings
Date Reviewed: Aug 9 2011

Eljinini sets out to provide a methodology and software implementation to harness the exponential growth of Web-based information, particularly in the medical field. Rather than suggesting a major revamp of existing Web sites to make the Web site optimally available to search agents, he proposes first analyzing a Web site based on its purpose, and then restructuring the information from the Web site to make it a part of a future semantic Web. His contention that “determining the purpose of a website makes extraction more efficient” is instrumental in processing the dynamic information (what is new and frequently changing) presented on a Web site (as opposed to meta-level information, such as the Internet protocol (IP) address and last update, and static information).

His methodology starts with defining a list of purposes of a given Web site (for example, does the Web site offer goods or services? How are these codified?). The list of purposes is then divided by static and dynamic content. The author takes 100 diabetes-related Web sites, extracts the dynamic content, classifies the dynamic content, and produces a processor that looks for the salient purpose(s) a Web site provides, leaving the static information unprocessed.

Having explicit meta-level information for a Web site has three advantages: it speeds up processing of relevant information; it allows existing Web sites to eventually become part of a more formalized semantic Web; and it takes the Web site’s content and restructures it into a machine-readable format.

While this is a seemingly intriguing approach, it is reminiscent of the bag-of-words idea that many natural language processing (NLP) systems favor, which works up to a point and only in well-defined scenarios requiring only minimal keyword retrieval. The basic criticism of such an approach harks back to the linguist Zellig Harris: “Language is not merely a bag of words, but a tool with particular properties” [1]. Interpreting these properties appropriately requires a lot more than surface-level lists.

Reviewer:  Klaus K. Obermeier Review #: CR139331 (1203-0301)
1) Harris, Z. Distributional structure. Word 10, (1954), 146–162.
Bookmark and Share
  Reviewer Selected
Featured Reviewer
 
 
General (H.0 )
 
 
Concept Learning (I.2.6 ... )
 
 
Text Analysis (I.2.7 ... )
 
 
Knowledge Representation Formalisms And Methods (I.2.4 )
 
 
Learning (I.2.6 )
 
 
Natural Language Processing (I.2.7 )
 
Would you recommend this review?
yes
no
Other reviews under "General": Date
Introduction to database and knowledge-base systems
Krishna S., World Scientific Publishing Co., Inc., River Edge, NJ, 1992. Type: Book (9789810206192)
Nov 1 1993
An introduction to information science
Flynn R., Marcel Dekker, Inc., New York, NY, 1987. Type: Book (9789780824775087)
Apr 1 1988
Acta Informatica 19, 4 (Sept. 1983)
  Acta Informatica 44:1983. Type: Journal
Mar 1 1986
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy