forums.silverfrost.com

wahorger · Joined: 13 Oct 2014 Posts: 1257 Location: Morrison, CO, USA

I'm getting a FP Stack Fault in my main code. I've isolated the routine and the fact that it occurs only when compiled with /CHECKMATE. The error is:

mecej4 · Joined: 31 Oct 2006 Posts: 1899

I built and ran your code with the 8.1 version 32-bit compiler, with /checkmate. Instead of an FPU stack overflow, the program ended after detection of the use of the undefined variable Q on line 149 (P is also undefined at the same line).

I certainly believe that X87 stack overflow occurs with FTN95-compiled 32-bit programs more often than it should occur. In fact, given the number of long expressions that I see in your program, I expect X87 stack overflow to happen. Fixing the problem can be expedited if you provide a test program that exhibits it without other errors (such as undefined variables) clouding up the issue.

JohnCampbell · Joined: 16 Feb 2006 Posts: 2615 Location: Sydney

The example code has many examples of old style calculations where a complex calculation spreads over many lines.
I may be wrong, but isn't the "floating point stack overflow" simply that the parser runs out of registers, rather than breaking up the expression into bits.
Did F77 do it this way ?

I got this earlier error on line 123.
my patch was:

wahorger · Joined: 13 Oct 2014 Posts: 1257 Location: Morrison, CO, USA

mecej4: This was run under V8.20.0. YMMV re: 8.1

wahorger · Joined: 13 Oct 2014 Posts: 1257 Location: Morrison, CO, USA

JohnCampbell, after looking at the code in some detail, I can see the problem you detected (UNDEFINED), and where it likely originates.

This code was obtained from the National Park Service under its Public Domain policy. So this issue dates back to the 1980's.

Thanks for pointing it out.

The problem is that P is used for the Z1 forward calculation, while R is used for the Z1 inverse calculation. Since the computation of both P and R is the same, this is the likely cause. I've searched my archives back to 1992 and this issue is there.

The ALASKA conversion is one that is rarely (if ever) used. Indeed the data set that caused the error was created in 1996 and was used by me while testing another section of code when this problem popped up.

I will be looking through the other sections of code to see if this was a "standard" coding problem in the other coordinate routines.

Again, thanks for the heads up.

The Stack Overflow issue is separate; I'll let Paul address this.

Bill

mecej4 · Joined: 31 Oct 2006 Posts: 1899

JohnCampbell · Joined: 16 Feb 2006 Posts: 2615 Location: Sydney

mecej4,

Thanks for your detailed explanation. I think I understand.
My (remove the problem) approach to that line was to first calculate zz, which I hoped would not overflow the registers.
zz = ( S+FF*R + G*DSIN(U/D) ) &
& / ( S-FF*R - G*DSIN(U/D) )
( should have used zz = FF*R + G*DSIN(U/D) first )
However, the error might have been with line 123, so I simplified that also with z1 and z2.
.
Bill,

I had the need to convert Lat-Long values from charts for Maputo in Mozambique into X Y for the projection (I don't recall).
I thought I could use a few sin cos conversions, until I googled the transformation equations, similar to what you have posted.
I can't believe the complexity, and was worried about register overload back then. I gave up, found a GIS expert and sent him the 30 points I needed.
I would always try to "tidy" the calculation and use a few temporary z's to make the equation readable / auditable.
EP**FACT is going to be a problem, if EP is out of range. It is also something I would not expect in a coordinate transformation.

I'll stick to FE calculations !!

John

PaulLaidler · Posted: Mon Jan 29, 2018 8:42 am Post subject:

Bill

Can you post a short and simple form of the issue that you want investigated.

wahorger · Joined: 13 Oct 2014 Posts: 1257 Location: Morrison, CO, USA

Paul, other than the example I've put in DropBox (link supplied in the original message), I don't know what else to do. It works when compiled as /RELEASE, and not as /CHECKMATE.

If I restructure the calculation that appears to be the culprit, the stack overflow does not occur. So my assumption it is a compiler code generation problem, but only when /CHECKMATE is used.

DanRRight · Posted: Mon Jan 29, 2018 6:56 pm Post subject:

Paul,
I also have several cases when the compiler in 64bit mode does not even compile the code if both /check /undef are used simultaneously but I can not provide small demo. In small code things very often go fine. So please look at any chance, like this one for example, to corner some last possible bugs and make compiler rock stable.

mecej4 · Joined: 31 Oct 2006 Posts: 1899

After looking at the error traceback (in the initial post) in more detail, I am quite puzzled. The FPU stack overflow occurred in Salflibc.dll, in the Fortran I/O routines, starting with a call to D8__WSF in the user's code. Since the X86 register convention requires that "The x87 floating point registers ST0 to ST7 must be empty (popped or freed) when calling a new function, and ST1 to ST7 must be empty on exiting a function" (see https://en.wikipedia.org/wiki/X86_calling_conventions), FPU stack overflow in the I/O routine can have nothing to do with the complexity of the arithmetic expression on Line-122 or elsewhere.

There is some inconsistency here, but I cannot check more precisely because I have the 8.1 compiler and I cannot reproduce the X87 stack overflow with Wahorger's source code.

On the other hand, once P and Q are initialised in the test code one should be able to compile and run the program without errors.

JohnCampbell · Joined: 16 Feb 2006 Posts: 2615 Location: Sydney

Could the error occurring with V8.20, actually be occurring on line 123 as reported:
print *,s
This would explain the I/O path.
Could when using /check, line 122 register usage is not finalised properly before doing I/O at line 123 ?

I have my own devilry with V8.20, which does not occur in V8.10 and I'm not getting close to the problem.

wahorger · Joined: 13 Oct 2014 Posts: 1257 Location: Morrison, CO, USA

John, Actually, no. This is the way the error is "trapped" even without the PRINT. It is always showing the next executable line.

It was because of this that it took me a while to find the offender......

PaulLaidler · Posted: Wed Jan 31, 2018 9:57 am Post subject:

This is a bug in 32 bit FTN95 that we are working on.

narayanamoorthy_k · Joined: 19 Jun 2014 Posts: 142 Location: Chennai, IN

Is the FTN95.exe release planned with this fix soon?
_________________
Thanks and Regards
Moorthy