Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
A study of scalar compilation techniques for pipelined supercomputers
Weiss S., Smith J. ACM Transactions on Mathematical Software16 (3):223-245,1990.Type:Article
Date Reviewed: Jun 1 1991

This research paper reports on performance improvements for pipelined scientific computers provided by two scalar compilation techniques: loop unrolling and software pipelining. Architectural features to support the techniques are also discussed. The authors chose a CRAY-1S architecture with an expanded scalar register set as a baseline and used the first 14 Livermore Loops as a benchmark. TheY studied performance improvements by modifying the code generated by the CRAY FORTRAN compiler (CFT) with the vectorizer turned off. Results were obtained using a simulator described elsewhere.

From the figures, the paper concludes that loop unrolling can produce significant performance improvements. Although software pipelining achieves lower speedups, the authors point out that it demands less hardware than loop unrolling. The CRAY-1S architecture with additional scalar registers, a larger instruction buffer, and loop unrolling reaches a stage of performance comparable to that of the CRAY-1S with the vector unit and the CFT vectorizing compiler. The combination of loop unrolling and dynamic software pipelining (on a machine partitioned into address and execute processors) shows a speedup of 2.64 compared with the baseline.

The paper is meant for compiler people and architects. It is easy to read, and the experimental results provide some insight into the usefulness of the two compiler techniques. Some prior knowledge of these techniques would be helpful to the reader. A few related papers have appeared since this paper was written.

Reviewer:  U. Banerjee Review #: CR123893
Bookmark and Share
 
Pipeline Processors (C.1.1 ... )
 
 
Compilers (D.3.4 ... )
 
 
Performance Attributes (C.4 ... )
 
 
Large And Medium (“Mainframe”) Computers (C.5.1 )
 
Would you recommend this review?
yes
no
Other reviews under "Pipeline Processors": Date
Measurement and evaluation of the MIPS architecture and processor
Gross T., Hennessy J., Przybylski S., Rowen C. ACM Transactions on Computer Systems 6(3): 229-257, 1988. Type: Article
Apr 1 1989
Dynamic Instruction Scheduling and the Astronautics ZS-1
Smith J. Computer 22(7): 21-35, 1989. Type: Article
Oct 1 1990

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy