Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
An adaptive path index for XML data using the query workload
Min J., Chung C., Shim K. Information Systems30 (6):467-487,2005.Type:Article
Date Reviewed: Mar 28 2006

Extensible Markup Language (XML) path queries are fundamental to many XML query languages, like XQuery and XPath, which are often used for complex filtering of XML data. Most XML path query processing techniques use either navigation-based algorithms, which compute results by analyzing an input document one tag at a time, or index-based algorithms, which take advantage of pre-computed numbering schemes over the input XML document. Since XML data is highly irregular and complex, the use of indexes is often preferred.

There are many variations of indexing techniques and algorithms available today, but due to the nature of XML data, which is very rich in structure, the authors of this paper propose an advanced indexing technique that could further improve query processing. The proposed method is called APEX, short for “adaptive path index for XML.” APEX is different from other traditional methods because it only stores paths of length two, plus frequent query paths as measured by the query workload. Traditional methods store all paths from root to leaves, which is very inefficient for complex queries. APEX breaks down XML data into the subunits of element, attribute, name, value, and structure; this introduces automatic detection and updates of frequent query paths. Additionally, APEX supports dynamic data insertion, deletion, and structural changes.

In this paper, the authors compare APEX to several other techniques, such as DataGuides and Index Fabric. They discuss the scenarios under which the APEX technique is superior to the others. In particular, they show that APEX can be used to evaluate queries faster. They also present experimental results over real and synthetic XML documents that validate their claims.

Reviewer:  Norita Ahmad Review #: CR132597 (0701-0073)
Bookmark and Share
 
Trees (E.1 ... )
 
 
Domain-Specific Architectures (D.2.11 ... )
 
Would you recommend this review?
yes
no
Other reviews under "Trees": Date
The hB-tree: a multiattribute indexing method with good guaranteed performance
Lomet D., Salzberg B. ACM Transactions on Database Systems 15(3): 625-658, 1990. Type: Article
Jun 1 1991
Multidimensional trees
Baldwin W., Strawn G. Theoretical Computer Science 84(2): 293-311, 1991. Type: Article
Oct 1 1992
Hash trees versus b-trees
Bell D., Deen S. The Computer Journal 27(3): 218-224, 1984. Type: Article
Feb 1 1985
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy