Next | Prev | Top | Contents | Index
Chapter 6: Performance Tuning for the R8000
This chapter outlines techniques for tuning the performance of your R8000 applications. It contains four sections:
- The first section presents the compiler optimization technique of software pipelining, which is crucial to getting optimal performance on the R8000. It shows you how to read your software pipelined code and how to understand what it does.
- The second section uses matrix multiplies as a case study on loop unrolling.
- The third section describes the phenomenon of bellows stalls on the R8000 architecture and gives tips on how to avoid them.
- The final section describes how the IVDEP directive can be used in Fortran to gain performance.
- Software Pipelining
-
- Matrix Multiply - A Tuning Case Study
-
- The Effect of Bellows Stalls
-
- Use of the IVDEP Directive
-
- Vector Intrinsic Functions
-
Next | Prev | Top | Contents | Index