Introduction

High performance computing Page 1 / 1

In nearly all high performance applications, loops are where the majority of the execution time is spent. In [link] we examined ways in which application developers introduced clutter into loops, possibly slowing those loops down. In this chapter we focus on techniques used to improve the performance of these “clutter-free” loops. Sometimes the compiler is clever enough to generate the faster versions of the loops, and other times we have to do some rewriting of the loops ourselves to help the compiler.

It’s important to remember that one compiler’s performance enhancing modifications are another compiler’s clutter. When you make modifications in the name of performance you must make sure you’re helping by testing the performance with and without the modifications. Also, when you move to another architecture you need to make sure that any modifications aren’t hindering performance. For this reason, you should choose your performance-related modifications wisely. You should also keep the original (simple) version of the code for testing on new architectures. Also if the benefit of the modification is small, you should probably keep the code in its most simple and clear form.

We look at a number of different loop optimization techniques, including:

Loop unrolling
Nested loop optimization
Loop interchange
Memory reference optimization
Blocking
Out-of-core solutions

Someday, it may be possible for a compiler to perform all these loop optimizations automatically. Typically loop unrolling is performed as part of the normal compiler optimizations. Other optimizations may have to be triggered using explicit compile-time options. As you contemplate making manual changes, look carefully at which of these optimizations can be done by the compiler. Also run some tests to determine if the compiler optimizations are as good as hand optimizations.

<< Chapter < Page Page > Chapter >>

Read also:

Get Jobilize Job Search Mobile App in your pocket Now!

100% Free Mobile Applications
Receive real-time job alerts and never miss the right job again

Source: OpenStax, High performance computing. OpenStax CNX. Aug 25, 2010 Download for free at http://cnx.org/content/col11136/1.5

Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'High performance computing' conversation and receive update notifications?

Ask

	NCE Ch 11 Counseling Families, Diagnosis... By Anh Dao Start Quiz
	Biology Final By Anonymous User Start Quiz
©flickr:	Vocabulary Practice Quiz! By Katie Montrose Start Quiz
	27 AP 27 Reproductive System MCQ By OpenStax Start Quiz
	15 AP 15 Autonomic Nervous System MCQ By OpenStax Start Quiz
	Principles of microeconomics for ap® courses By OpenStax Read Online Course
	Kira Kira Test By Briana Knowlton Start Quiz
	1 SCJP/OCJP Java Certification By JavaChamp Team Start Exam
	9 AP 09 Joints MCQ Quiz By OpenStax Start Quiz
	Measurement Experimentation Lab MCQ By Steve Gibbs Start Quiz