Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Querying graphs with data
Libkin L., Martens W., Vrgoč D. Journal of the ACM63 (2):1-53,2016.Type:Article
Date Reviewed: Oct 25 2016

Various approaches to querying so-called graphs with data are described very comprehensively in this paper. In principle, this category of graphs is called (labeled) property graphs in the graph database community. Concerning queries in such databases, two categories are considered: one views graphs as sets of relational data, and the other concentrates on querying the topology of the graph. The authors focus on querying that combines both of these approaches.

In section 2, containing preliminaries, an important notion of path and various types of complexity of query evaluation are defined. In the forthcoming sections, the authors deal with two categories of languages: path languages and graph languages. Also, notions like regular path queries (RPQs), conjunctive RPQs (CRPQs), and nested regular expressions (NREs) are introduced. Section 3 discusses existing languages for paths, particularly register automata and related classes of expressions focused on data path queries. First, regular data path queries (RDPQs) are studied, followed by regular query expressions with memory (RQM), which use regular expressions with memory (REM). Another class of regular expressions for data paths permits testing for equality or inequality of data; this enables consideration of queries based on regular expressions with equality. The authors talk about regular queries with datasets (RQDs).

The core section 4 shows how the XPath language can be adapted to work over graphs. XPath has many desirable properties, such as the ability to define patterns that cannot be captured by paths, close connections to first-order logic, and efficient evaluation algorithms. The authors show how to extend the language to operate over graph databases while still retaining these properties. Such a language is called GPath here, and it is discussed with different data tests permitted in its formulas. Based on two categories of path formulas, the regular graph XPath (GXPathreg) and core graph XPath (GXPathcore) languages are introduced. Some extensions of both languages with counters are also discussed. The next two subsections of section 4 focus on query evaluation and expressive power. The authors investigate the complexity of querying graph databases using variants of GXPath and its many dialects when compared to first-order logic. They provide a detailed analysis of expressiveness for navigational features of these dialects and prove that GXPathcore is a proper subset of GXPathreg. Another result says that GXPathreg is equivalent to a three-variable fragment of the transitive-closer first-order logic. Expressiveness of languages using datasets is also studied. The rest of section 4 contains a comparison of GXPath to path languages introduced in section 3, as well as to traditional navigational languages such as RPQs, CRPQs, and NREs. A hierarchy of the GXPath fragments with regard to their relative expressive power illustrates their mutual relationships.

In section 5, the authors define conjunctive queries using languages from previous sections as their basic building blocks. Section 6 includes a summary of complexity results for classes of queries studied in the paper. Finally, section 6 offers a number of possible directions for future research. The consideration of incomplete information is very realistic, since it occurs very often in practice, particularly in querying graphs with data.

The notions defined in the paper are specified in a usual denotational way, which offers the needed clarity and preciseness. This increases the readability of the paper. Without a doubt, the paper offers interesting, valuable, and useful material for those interested in graph query languages and the associated theory.

Reviewer:  J. Pokorny Review #: CR144866 (1701-0064)
Bookmark and Share
 
Data Models (H.2.1 ... )
 
 
Query Languages (H.2.3 ... )
 
 
Mathematical Logic (F.4.1 )
 
Would you recommend this review?
yes
no
Other reviews under "Data Models": Date
A transient hypergraph-based model for data access
Watters C., Shepherd M. ACM Transactions on Information Systems 8(2): 77-102, 2001. Type: Article
Jun 1 1991
Toward a unified framework for version modeling in engineering databases
Katz R. ACM Computing Surveys 22(4): 375-409, 2001. Type: Article
Feb 1 1993
Graph data model and its data language
Kunii H., Springer-Verlag New York, Inc., New York, NY, 1990. Type: Book (9780387700588)
Dec 1 1991
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy