I recently discovered that some unexpected routines considered the major part of the computing time in a program I am developing. On closer examination this turned out to be due to the SPREAD intrinsic which uses a considerable amount of time. Replacing the intrinsic with a twoline conventional loop reduced the computing timeby a factor of 30.
My compiler is v. 6.1.I can provide a simple, short example if desired.