Computing Reviews

An adaptive path index for XML data using the query workload
Min J., Chung C., Shim K. Information Systems30(6):467-487,2005.Type:Article
Date Reviewed: 03/28/06

Extensible Markup Language (XML) path queries are fundamental to many XML query languages, like XQuery and XPath, which are often used for complex filtering of XML data. Most XML path query processing techniques use either navigation-based algorithms, which compute results by analyzing an input document one tag at a time, or index-based algorithms, which take advantage of pre-computed numbering schemes over the input XML document. Since XML data is highly irregular and complex, the use of indexes is often preferred.

There are many variations of indexing techniques and algorithms available today, but due to the nature of XML data, which is very rich in structure, the authors of this paper propose an advanced indexing technique that could further improve query processing. The proposed method is called APEX, short for “adaptive path index for XML.” APEX is different from other traditional methods because it only stores paths of length two, plus frequent query paths as measured by the query workload. Traditional methods store all paths from root to leaves, which is very inefficient for complex queries. APEX breaks down XML data into the subunits of element, attribute, name, value, and structure; this introduces automatic detection and updates of frequent query paths. Additionally, APEX supports dynamic data insertion, deletion, and structural changes.

In this paper, the authors compare APEX to several other techniques, such as DataGuides and Index Fabric. They discuss the scenarios under which the APEX technique is superior to the others. In particular, they show that APEX can be used to evaluate queries faster. They also present experimental results over real and synthetic XML documents that validate their claims.

Reviewer:  Norita Ahmad Review #: CR132597 (0701-0073)

Reproduction in whole or in part without permission is prohibited.   Copyright 2024 ComputingReviews.com™
Terms of Use
| Privacy Policy