forums.silverfrost.com

KennyT · Joined: 02 Aug 2005 Posts: 317

The following code segment crashes (floating stack overflow) at the RMSN= line:

JohnCampbell · Joined: 16 Feb 2006 Posts: 2554 Location: Sydney

I think there is a problem with a possible temporary array for "SUM((ra)*(ra) )", but there are a lot of questions about this.

Are you using /64 or not ?
Did it work before ?
Would "SUM((ra)*(ra) )" require a temporary array, or should you accumulate the sum in a DO loop, without requiring a temporary array ?

I note that, as of v8.20, 64-bit FTN95 now stores large static arrays in the same way as automatic arrays. What would happen to a temporary array ?

Is SUM((ra)*(ra) ) different from SUM(ra*ra) ? Why not use the following ? I would try to avoid the temporary array implied by (ra)*(ra).
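
A minimal sketch of the DO-loop accumulation suggested above (this is not John's original listing). The array size, the random test data and the root-mean-square formula are all assumptions; the thread does not show how RMSN is actually computed.

```fortran
program sum_of_squares_loop
  implicit none
  integer, parameter :: n = 1000   ! hypothetical array size
  real :: ra(n)                    ! stands in for the RA array in the thread
  real :: total, rmsn
  integer :: i

  call random_number(ra)           ! placeholder data

  ! Accumulate the sum of squares one element at a time,
  ! so the compiler has no need to build a temporary for ra*ra.
  total = 0.0
  do i = 1, n
    total = total + ra(i)*ra(i)
  end do

  ! Assumed formula: the thread does not show how RMSN is computed.
  rmsn = sqrt(total/real(n))

  print *, 'DO loop    :', rmsn
  print *, 'SUM(ra*ra) :', sqrt(sum(ra*ra)/real(n))
end program sum_of_squares_loop
```

The two printed values should agree to within single-precision rounding, since the loop and the intrinsic may add the terms in a different order.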

KennyT · Joined: 02 Aug 2005 Posts: 317

Thanks John,

No, this is 32-bit code.

I think the original version was coded for speed at runtime. The routine gets called a lot!

I'm pretty sure it used to work in v8.01.

K

JohnCampbell · Joined: 16 Feb 2006 Posts: 2554 Location: Sydney

The use of " if ( abs(x) > 1.E15 ) cycle" may restrict optimisation, but avoiding all the temporary arrays would improve cache usage.
In general, do you find the use of SUM ( array*array ) an efficient approach ? It would not be my choice, but worth considering.
DOT_PRODUCT is an alternative, but is not typically optimised.
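
To make the three options concrete, here is a hedged sketch comparing a guarded DO loop, SUM with a MASK argument, and DOT_PRODUCT. The 1.E15 threshold comes from the post above; the array contents, the 1.0D20 "missing value" marker and all variable names are hypothetical. Double precision is used only so that squaring the marker does not overflow.

```fortran
program guarded_sum_variants
  implicit none
  integer, parameter :: dp = kind(1.0d0)
  integer, parameter :: n = 8               ! hypothetical array size
  real(dp), parameter :: big = 1.0e15_dp    ! threshold quoted in the thread
  real(dp) :: ra(n), s_loop, s_mask, s_dot
  integer :: i

  ra = (/ 1.0_dp, 2.0_dp, 3.0_dp, 4.0_dp, 5.0_dp, 6.0_dp, 7.0_dp, 8.0_dp /)
  ra(4) = 1.0e20_dp                         ! hypothetical "missing value" marker

  ! Variant 1: explicit loop with the guard; no array temporaries required.
  s_loop = 0.0_dp
  do i = 1, n
    if (abs(ra(i)) > big) cycle
    s_loop = s_loop + ra(i)*ra(i)
  end do

  ! Variant 2: SUM with a MASK; a compiler may create temporaries for
  ! ra*ra and for the mask expression.
  s_mask = sum(ra*ra, mask = abs(ra) <= big)

  ! Variant 3: DOT_PRODUCT takes no mask, so the marker value is included.
  s_dot = dot_product(ra, ra)

  print *, 'guarded loop :', s_loop
  print *, 'masked SUM   :', s_mask
  print *, 'DOT_PRODUCT  :', s_dot
end program guarded_sum_variants
```

The first two variants skip the marker element and should give the same total, while DOT_PRODUCT is swamped by it, so DOT_PRODUCT is only a drop-in replacement when no filtering is needed.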

John

KennyT · Joined: 02 Aug 2005 Posts: 317

"Is SUM((ra)*(ra) ) different from SUM(ra*ra) ?"

Yes. It seems that removing the extra brackets around the RA arrays fixes it!

Now I just have to go and look for other places where we've used that syntax...

K

KennyT · Joined: 02 Aug 2005 Posts: 317

Paul, if you need a test program...

JohnCampbell · Joined: 16 Feb 2006 Posts: 2554 Location: Sydney

Ken,

Try this example on either 32-bit or 64-bit.
(I have configured it to use /fpp; see noteson64bitftn95.txt.)

KennyT · Joined: 02 Aug 2005 Posts: 317

Thanks again John.

Interestingly, I further modified the program to do a variable number of "kk" loops and found that, for 5 loops, "dot product" was slightly faster, but for 100 loops, "sum" was faster (which probably means they are actually identical for all practical purposes!).
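
A rough sketch of the kind of timing comparison described here, assuming a loop counter kk as in the post. This is not the test program from the thread; the array length, the use of SYSTEM_CLOCK and the variable names are assumptions.

```fortran
program sum_vs_dot_timing
  implicit none
  integer, parameter :: n = 100000      ! hypothetical array length
  integer, parameter :: nloops = 100    ! try 5 and 100, as in the post
  real :: ra(n), s_sum, s_dot
  integer :: kk, t0, t1, t2, rate

  call random_number(ra)                ! placeholder data
  s_sum = 0.0
  s_dot = 0.0

  call system_clock(t0, rate)
  do kk = 1, nloops
    s_sum = s_sum + sum(ra*ra)
  end do
  call system_clock(t1)
  do kk = 1, nloops
    s_dot = s_dot + dot_product(ra, ra)
  end do
  call system_clock(t2)

  ! Print the accumulated results as well, so the loops are not optimised away.
  print *, 'SUM(ra*ra)  :', real(t1 - t0)/real(rate), ' s, result', s_sum
  print *, 'DOT_PRODUCT :', real(t2 - t1)/real(rate), ' s, result', s_dot
end program sum_vs_dot_timing
```

With so few repetitions the elapsed times are close to the clock resolution, which may be why the ranking flips between 5 and 100 loops.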

K

JohnCampbell · Joined: 16 Feb 2006 Posts: 2554 Location: Sydney

Could that imply that for /32, a temporary array is not being generated ?
I always try to remove possible stack overflow constructs.
/64 is different at this time.

mecej4 · Joined: 31 Oct 2006 Posts: 1886

JohnCampbell · Joined: 16 Feb 2006 Posts: 2554 Location: Sydney

mecej4,

That is correct, which means we should highlight the post above which exhibits this problem, for Paul to address:

PaulLaidler · Posted: Thu Nov 30, 2017 7:46 am

Thanks. I have made a note of this.

PaulLaidler · Posted: Thu Nov 30, 2017 9:27 am

This regression in v8.20 has now been fixed for the next release.