<< Chapter < Page | Chapter >> Page > |
REAL X,Y
Y = X**2J = K*2
For the exponentiation operation on the first line, the compiler generally makes an embedded mathematical subroutine library call. In the library routine,
X
is converted to a logarithm, multiplied, then converted back. Overall, raising
X
to a power is expensive — taking perhaps hundreds of machine cycles. The key is to notice that
X
is being raised to a small integer power. A much cheaper alternative would be to express it as
X*X
, and pay only the cost of multiplication. The second statement shows integer multiplication of a variable
K
by 2. Adding
K+K
yields the same answer, but takes less time.
There are many opportunities for compiler-generated strength reductions; these are just a couple of them. We will see an important special case when we look at induction variable simplification. Another example of a strength reduction is replacing multiplications by integer powers of two by logical shifts.
In [link] , we talked about register renaming. Some processors can make runtime decisions to replace all references to register 1 with register 2, for instance, to eliminate bottlenecks. Register renaming keeps instructions that are recycling the same registers for different purposes from having to wait until previous instructions have finished with them.
The same situation can occur in programs — the same variable (i.e., memory location) can be recycled for two unrelated purposes. For example, see the variable
x
in the following fragment:
x = y * z;
q = r + x + x;x = a + b;
When the compiler recognizes that a variable is being recycled, and that its current and former uses are independent, it can substitute a new variable to keep the calculations separate:
x0 = y * z;
q = r + x0 + x0;x = a + b;
Variable renaming is an important technique because it clarifies that calculations are independent of each other, which increases the number of things that can be done in parallel.
Subexpressions are pieces of expressions. For instance,
A+B
is a subexpression of
C*(A+B)
. If
A+B
appears in several places, like it does below, we call it a
common subexpression :
D = C * (A + B)
E = (A + B)/2.
Rather than calculate
A + B
twice, the compiler can generate a temporary variable and use it wherever
A + B
is required:
temp = A + B
D = C * tempE = temp/2.
Different compilers go to different lengths to find common subexpressions. Most pairs, such as
A+B
, are recognized. Some can recognize reuse of intrinsics, such as
SIN(X)
. Don’t expect the compiler to go too far though. Subexpressions like
A+B+C
are not computationally equivalent to reassociated forms like
B+C+A
, even though they are algebraically the same. In order to provide predictable results on computations, FORTRAN must either perform operations in the order specified by the user or reorder them in a way to guarantee exactly the same result. Sometimes the user doesn’t care which way
A+B+C
associates, but the compiler cannot assume the user does not care.
Address calculations provide a particularly rich opportunity for common subexpression elimination. You don’t see the calculations in the source code; they’re generated by the compiler. For instance, a reference to an array element
A(I,J)
may translate into an intermediate language expression such as:
Notification Switch
Would you like to follow the 'High performance computing' conversation and receive update notifications?