forums.silverfrost.com

JohnCampbell · Joined: 16 Feb 2006 Posts: 2560 Location: Sydney

Paul,

Following on from the discussion of KIND, is it an option to provide REAL*6 or INTEGER*6.
There was a time when all reals were calculated in the co-processor, and I thought that real*4 ( and real*8 ) was just a truncated 80-bit real*10. Is this the case ? If so would REAL*6 be a simple extension of managing REAL*4. There is certainly a big gap between R*4 and R*8 in precision and R*6 would provide about 11 significant digits (precision).

I'm not sure of the basis of INTEGER*8 from INTEGER*4, but INTEGER*6 could be a useful alternative ?

Just a thought !

John

LitusSaxonicum · Posted: Mon Oct 26, 2009 3:11 pm Post subject:

John,

I'm a real believer (no pun intended) in REAL*6 and INTEGER*6. The problem is that they aren't native to (x87) coprocessors, and all the operations would need to be coded from scratch (i.e. done in software).

When I used MS Fortran, they had 2 libraries one could link with - one where the math was done largely in software, and one where it was done largely in hardware. They didn't always give the same result! In part, this was because REAL*8 match was done in 64 bits, whereas the coprocessor operations loaded things into 80-bit registers, so that the round-off was potentially different.

Eddie

PaulLaidler · Posted: Mon Oct 26, 2009 8:37 pm Post subject:

selected_integer_kind and selected_real_kind allow you to select the precision etc (within certain hardware limits) but these are mapped to those provided by the processor and co-processor. In other words if you asked for the equivalent of *6 then you would get *8 anyway. Providing *6 via software would be slower than the *8 provided by the hardware.

JohnCampbell · Joined: 16 Feb 2006 Posts: 2560 Location: Sydney

Paul,

I was under the impression that real*4 and real*8 were done in the 80-bit math co-processor. Results were stored in the word address, with truncation of the accuracy.
So my assumption for real*6 would be that the calcs would be in the coprocessor, but the truncation would be different.
This is not consistent with the statement "providing real*6 via software"
I have also seen past reference to a 64-bit rather than 80-bit arithmetic (SSE?) instructions, which would change this assumption.
Is the co-processor no longer used and are real*4 and real*8 calculations now done differently ?

John

Sebastian · Joined: 20 Feb 2008 Posts: 177

JohnCampbell · Joined: 16 Feb 2006 Posts: 2560 Location: Sydney

Is 80-bit extended precision the same as real*10 or is real*10 software implemented ?

PaulLaidler · Posted: Tue Oct 27, 2009 9:10 am Post subject:

Yes extended precision is the same as real*10.

Sebastian · Joined: 20 Feb 2008 Posts: 177

The *x usually specifies the amount of bytes required for the data type (this may be awfully wrong for non-x86/non-PC fortran implementations) so real*10 is the 10byte=80bit floating point type as Paul said.

JohnCampbell · Joined: 16 Feb 2006 Posts: 2560 Location: Sydney

Paul,

I am trying to understand how real*6 could be done and real*10 is done.
My question re real*10 is : Is it hardware implemented, with all calculations done in the 80-bit math co-processor, or is that an obsolete technology?

To test out this I wrote a program that repeated vector dot product on 1000 element arrays as real*8 or real*10, using dot_product intrinsic or simple function which has a loop:-

Sebastian · Joined: 20 Feb 2008 Posts: 177

JohnCampbell · Joined: 16 Feb 2006 Posts: 2560 Location: Sydney

Sebastian wrote "How do you come to that conclusion? " I also said that "Either this or the instructions to move 4, 8 or 10 bytes take a lot of time." I just find that the ratios of 130% and 68% are big spreads for just moving bytes, as compared to floating point calculation times. Is an 80-bit fpu always used for real calcualtions ?

Sebastion also wrote :

PaulLaidler · Posted: Wed Oct 28, 2009 9:20 am Post subject:

The answer to these questions can be researched by using /explist on the command line. This will show the assembly instructions generated by FTN95. You will then need to look up these instructions in an Intel manual.

There will be little or no software intervention except perhaps in the case of INTEGER*8. The native 32, 64 and 80 bit instructions will not be truncated unless your source code stipulates this. You will also be able to look up the timing of the native instructions.

Basically FTN95 will aim to give you the maximum precision that is available in any given situation, even to the point of sometimes using 80 bits internally when a 64 bit result is being generated.

With the speed of modern processors, the speed of a native 32 bit multiply (say) as against a 64 bit native multiply is rarely an issue.

LitusSaxonicum · Posted: Wed Oct 28, 2009 11:37 am Post subject:

Speed may not be an issue, but storage is, and if (say) REAL*6 was good enough for (again, say) FE calculations, then one would have 25% longer arrays to do the matrix operations in - while sticking with a 32-bit OS and the limitations of that. That puts off the evil moment when the solution has to use the hard disk .... which slows the process down hugely.

It's a very ong time since I knew my way round the 8087 fpu book (8087 applications and programming) and my understanding is that first MMX and later SSE provided alternate ways to do certain math operations. I got lost at that point. None of the standard methods countenance REAL*6.

Eddie

JohnCampbell · Joined: 16 Feb 2006 Posts: 2560 Location: Sydney

Thanks Eddie for providing the names of the more recent MMX and later SSE instructions.
I apologise, but I am not sufficiently familiar with assembler to understand what is happening in /explist.
Can't I get a clear answer to my question of is the real*x maths done in the co-processor or is it the more recent instructions ?
I am surprised by the difference in gross computation time between real*4, *8 and *10. Is the only explaination the different in moving the necessary bytes.
Any clear advice would be appreciated.
John

Sebastian · Joined: 20 Feb 2008 Posts: 177

As far as I know MMX/SSE/SSE2 do not support 80bit registers.