Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Parallel-and-stream accelerator for computationally fast supervised learning
Hector E., Luo L., Song P. Computational Statistics & Data Analysis177 (1):1-12,2022.Type:Article
Date Reviewed: Feb 20 2023

Large enterprises with massive amounts of data require reliable and efficient algorithms for exploring and forecasting trends in available supplies, user purchases, and advertisements. The scalability of the MapReduce approach is advantageous for the parallel processing of such data. The statistical efficiency of an online streaming approach is also valuable for processing big data. But how should both computational strategies be combined to attain fast computing and the necessary statistics for huge data processing? Hector et al. present their parallel-and-stream accelerator (PASA) approach that integrates the algorithmic powers of parallel and sequential computing to cope with the computational chokepoint of supervised learning and reasoning with big data.

Initially in the hybrid PASA methodology, a huge dataset is indiscriminately split into parallel data blocks. Each data block is then processed in sequence using an online valuation and inference technique, to help retain low computing costs and staunch statistical efficacy. PASA contains algorithms for streaming and combining parameter estimates from data blocks, to provide fast learning and inferences in the efficient generalized linear model (GLM) used for regression analysis of big datasets.

The authors clearly present appropriate equations for the method of moments and the maximum likelihood functions for parameter estimations in PASA. They investigate the PASA methodology for its effectiveness in forecasting gains related to remodeling buildings. The experimental and simulation results are reliable enough to recommend PASA for computationally fast supervised learning.

Are there valuable application examples of PASA? Students and practitioners in data science and artificial intelligence (AI) will find the efficient linear and logistic regression examples in this paper valuable. Students in theoretical computer science courses will find this paper useful for defining algorithmic and computational efficiency and complexity of algorithms.

Reviewer:  Amos Olagunju Review #: CR147552
Bookmark and Share
  Featured Reviewer  
Statistical Computing (G.3 ... )
Algorithm Design And Analysis (G.4 ... )
Would you recommend this review?
Other reviews under "Statistical Computing": Date
Computer analysis of sequential medical trials
Duan-Zheng X., Ellis Horwood, Upper Saddle River, NJ, 1990. Type: Book (9780131618527)
Jul 1 1992
APL2 as a specification language for statistics
Thomson N. IBM Systems Journal 30(4): 539-542, 1991. Type: Article
Dec 1 1993
Multivariate statistical simulation
Johnson M., John Wiley & Sons, Inc., New York, NY, 1987. Type: Book (9789780471822905)
Dec 1 1987

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2023 ThinkLoud®
Terms of Use
| Privacy Policy