Computing Reviews
Locality analysis: a nonillion time window problem
Brock J., Luo H., Ding C. ACM SIGMETRICS Performance Evaluation Review 41(4): 102-105, 2014. Type: Article
Date Reviewed: Oct 1 2014

Big data analyses uncover associations within large datasets, supporting tasks such as probing business trends, averting diseases, connecting legal citations, fighting crime, and monitoring highway traffic flows in real time. Unfortunately, most statistical and visualization packages have difficulty processing big datasets, which require massively parallel software running on numerous servers. How should large and complex collections of datasets be captured, stored, searched, shared, analyzed, and visualized?

In this paper, the authors concisely present current and future locality problems for big data, and recommend concepts for strengthening the theory of locality for applications in a variety of big data domains. A footprint is the amount of distinct data accessed by a central processing unit in a window of execution. Computing footprints is itself a big data problem, because it is time consuming to count the distinct data in every window (substring) of a long execution trace.
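The footprint definition above can be made concrete with a brute-force sketch. This is only an illustration of the definition, not the authors' method; the function names `average_footprint` and `count_windows` and the toy traces are hypothetical. It also shows why the cost grows quickly: a trace of n accesses has n(n+1)/2 windows in total.

```python
from typing import Hashable, Sequence


def average_footprint(trace: Sequence[Hashable], window: int) -> float:
    """Mean number of distinct items over all windows of one fixed length.

    Brute force: every window is scanned separately, so the cost is
    roughly len(trace) * window operations for a single window length.
    """
    if not 1 <= window <= len(trace):
        raise ValueError("window must be between 1 and len(trace)")
    num_windows = len(trace) - window + 1
    total = sum(len(set(trace[i:i + window])) for i in range(num_windows))
    return total / num_windows


def count_windows(n: int) -> int:
    """Number of contiguous windows (of every length) in a trace of n accesses."""
    return n * (n + 1) // 2


if __name__ == "__main__":
    toy_trace = list("abab")  # hypothetical access trace
    print(average_footprint(toy_trace, 2))
    print(count_windows(len(toy_trace)))
```

Because the window count is quadratic in the trace length, exhaustive counting of this kind is impractical for long traces, which is the motivation for the approximate and sampling methods the review mentions.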

The authors acknowledge that the precision of the approximation and sampling methods proposed in the literature for the footprint problem remains an open question. They present three locality metrics: footprint, reuse distance, and miss rate. The reuse distance of a memory access is the number of distinct data accessed since the previous access to the same datum. The miss rate for a given cache size is the fraction of accesses whose reuse distance exceeds that size.
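A minimal sketch of reuse distance and miss rate follows, under stated assumptions. The distance here counts the reused datum itself; the cache is assumed to be fully associative with least-recently-used replacement, so an access misses exactly when its distance exceeds the cache size; and first-time accesses are treated as misses. The function names and the toy trace are hypothetical, and the quadratic scan is chosen for clarity rather than speed.

```python
from typing import Hashable, List, Optional, Sequence


def reuse_distances(trace: Sequence[Hashable]) -> List[Optional[int]]:
    """Reuse distance of each access; None marks a first-time access.

    The distance is the number of distinct items touched from the previous
    access to the same item (inclusive) up to the current access (exclusive).
    """
    last_seen = {}
    distances: List[Optional[int]] = []
    for i, item in enumerate(trace):
        if item in last_seen:
            distances.append(len(set(trace[last_seen[item]:i])))
        else:
            distances.append(None)
        last_seen[item] = i
    return distances


def miss_rate(trace: Sequence[Hashable], cache_size: int) -> float:
    """Fraction of accesses that miss, assuming a fully associative LRU cache."""
    distances = reuse_distances(trace)
    misses = sum(1 for d in distances if d is None or d > cache_size)
    return misses / len(distances)


if __name__ == "__main__":
    toy_trace = list("abcab")  # hypothetical access trace
    print(reuse_distances(toy_trace))
    for size in (2, 3):
        print(size, miss_rate(toy_trace, size))
```

Evaluating `miss_rate` over a range of cache sizes gives one point per size on a miss rate curve, and a histogram of the non-`None` distances gives a reuse distance profile.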

The authors illustrate the average footprint function, the miss rate curve, and the reuse distance profile graphically. Hardware designers and programmers could use these graphs to measure and improve the cache use of specific programs. The concepts of actively shared data and footprint sharing ratio discussed in this paper would be valuable for identifying problems in shared cache use, designing shared caches, and multithreading.

Reviewer: Amos Olagunju
Review #: CR142784 (1501-0070)
Metrics (D.2.8)
 
