Computing Reviews, the leading online review service for computing literature.

Search

A study of scalar compilation techniques for pipelined supercomputers
Weiss S., Smith J. ACM Transactions on Mathematical Software16 (3):223-245,1990.Type:Article

Date Reviewed: Jun 1 1991

This research paper reports on performance improvements for pipelined scientific computers provided by two scalar compilation techniques: loop unrolling and software pipelining. Architectural features to support the techniques are also discussed. The authors chose a CRAY-1S architecture with an expanded scalar register set as a baseline and used the first 14 Livermore Loops as a benchmark. TheY studied performance improvements by modifying the code generated by the CRAY FORTRAN compiler (CFT) with the vectorizer turned off. Results were obtained using a simulator described elsewhere. From the figures, the paper concludes that loop unrolling can produce significant performance improvements. Although software pipelining achieves lower speedups, the authors point out that it demands less hardware than loop unrolling. The CRAY-1S architecture with additional scalar registers, a larger instruction buffer, and loop unrolling reaches a stage of performance comparable to that of the CRAY-1S with the vector unit and the CFT vectorizing compiler. The combination of loop unrolling and dynamic software pipelining (on a machine partitioned into address and execute processors) shows a speedup of 2.64 compared with the baseline. The paper is meant for compiler people and architects. It is easy to read, and the experimental results provide some insight into the usefulness of the two compiler techniques. Some prior knowledge of these techniques would be helpful to the reader. A few related papers have appeared since this paper was written.

Reviewer: U. Banerjee	Review #: CR123893

Pipeline Processors (C.1.1 ... )

Compilers (D.3.4 ... )

Performance Attributes (C.4 ... )

Large And Medium (“Mainframe”) Computers (C.5.1 )

Would you recommend this review?

yes

Other reviews under "Pipeline Processors":	Date

Measurement and evaluation of the MIPS architecture and processor Gross T., Hennessy J., Przybylski S., Rowen C. ACM Transactions on Computer Systems 6(3): 229-257, 1988. Type: Article	Apr 1 1989

Dynamic Instruction Scheduling and the Astronautics ZS-1 Smith J. Computer 22(7): 21-35, 1989. Type: Article	Oct 1 1990

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud^®
Terms of Use | Privacy Policy