forums.silverfrost.com

JohnCampbell · Joined: 16 Feb 2006 Posts: 2629 Location: Sydney

Paul,

Is my previous statement basically correct?

PaulLaidler · Posted: Tue Mar 03, 2015 8:40 am

John

I see no reason why this should not be added and I will put it on the wish list.

This function is expanded inline by the compiler (see the /explist that it produces). I don't have any information about the reason for the warning.

mecej4 · Joined: 31 Oct 2006 Posts: 1943 Location: USA

PaulLaidler · Posted: Tue Mar 03, 2015 4:53 pm

John

I have had a look at your request for CPU_CLOCK_TICK@. It turns out that there are two problems with this.

1) The new function would still need to create a temporary, so it would still require 5 machine instructions. There may be little or no improvement.

2) 32-bit FTN95 uses the floating point stack for this 64-bit integer so that it can hold the result in a single register. To do this in a different way would be a non-trivial task.

JohnCampbell · Joined: 16 Feb 2006 Posts: 2629 Location: Sydney

Paul,

Thanks for reviewing this. It looks like the existing cpu_clock@ provides a good interface to rdtsc. Your description of "the use of the floating point stack for this 64 bit integer" may be the basis of the FTN95 warning.

I was not able to compile the alternative assembler version of rdtsc.asm that mecej4 provided, using "ml /coff /c rdtsc.asm".
Could a .obj version of that, compatible with FTN95 and SLINK, be made available, to see whether it improves on the real*10 interface?
I am always worried about mixing .obj files from other compilers, in case FTN95 does not have a compatible call interface.

John

PaulLaidler · Posted: Wed Mar 04, 2015 7:29 am

John

I will have another look at this when I get a moment. Maybe you only need the lower DWORD.
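The "lower DWORD" idea can be illustrated with a minimal sketch in standard Fortran. This is only the arithmetic, not how FTN95 or CPU_CLOCK@ is implemented; the module and function names (tick_low_dword, low_dword, low_diff) are hypothetical. It assumes the timed interval is shorter than 2**32 ticks, in which case the difference of the low 32 bits, taken modulo 2**32, equals the full 64-bit difference even when the low DWORD rolls over between the two readings.

```fortran
module tick_low_dword
  implicit none
  ! Kind chosen by range, because FTN95's default kind numbers differ
  ! from those of other compilers.
  integer, parameter :: i8 = selected_int_kind(18)
  integer(i8), parameter :: two32 = 4294967296_i8   ! 2**32
contains

  ! Low 32 bits of a 64-bit tick count (hypothetical helper).
  pure function low_dword(t) result(lo)
    integer(i8), intent(in) :: t
    integer(i8) :: lo
    lo = iand(t, two32 - 1_i8)
  end function low_dword

  ! Elapsed ticks from a to b using only the low DWORDs.
  ! Valid only if the true interval is below 2**32 ticks.
  pure function low_diff(a, b) result(d)
    integer(i8), intent(in) :: a, b
    integer(i8) :: d
    d = modulo(low_dword(b) - low_dword(a), two32)
  end function low_diff

end module tick_low_dword
```

For example, with made-up tick values a = 3*2**32 - 100 and b = 3*2**32 + 250, low_diff(a, b) returns 350, the same as b - a. At a clock rate of a few GHz the low DWORD wraps roughly every one to two seconds, so this only suits short intervals.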

mecej4 · Joined: 31 Oct 2006 Posts: 1943 Location: USA

John: FTN95 uses X87 instructions to process ALL variables of type INTEGER*8. This design complicates the use of RDTSC in the middle of X87 code, which is itself likely to cause X87 exceptions.

If you do not have, or do not want to use, the MASM assemblers ML and ML64, which come with Visual Studio/VC and the SDKs from Microsoft, you can download and unpack the NASM assembler (just 500K without documentation) from www.nasm.us.

Here is code for the INTEGER*8 RDTSC function:

JohnCampbell · Joined: 16 Feb 2006 Posts: 2629 Location: Sydney

Thanks Paul and Mecej4,

The following code appears to test the alternatives I am considering. It looks like the code/edoc approach gives good performance as an integer*8 function. I tested the following program with each of these FTN95 compile options:
/check
/debug
(none)
/opt

I shall see where this goes for reliability.

I would expect that having code/edoc isolated in a function call should not inhibit optimisation of inner loops?

John

mecej4 · Joined: 31 Oct 2006 Posts: 1943 Location: USA

John said:

jalih · Joined: 30 Jul 2012 Posts: 196

mecej4 · Joined: 31 Oct 2006 Posts: 1943 Location: USA

Jalih's guidelines will become important when precise timing is desired, but as of now we are still at the phase of looking for ways in which we can do some timing in a program compiled with /opt, from source code containing INTEGER*8 variables, whose runs cause underflows. No reliable way of avoiding run-time and compile-time crashes (other than not using /opt) has yet been found.

PaulLaidler · Posted: Wed Mar 04, 2015 3:50 pm

I have added code to the compiler so that

mecej4 · Joined: 31 Oct 2006 Posts: 1943 Location: USA

Thanks, Paul. Here is a test program that you could use to ascertain whether using the new CPU_CLOCK@() function inhibits optimization.

PaulLaidler · Posted: Wed Mar 04, 2015 8:12 pm

Thanks, but I am very pressed for time.
I am happy to provide the functionality so that users can decide to use it or not as they see fit.

mecej4 · Joined: 31 Oct 2006 Posts: 1943 Location: USA

Good, I shall run the tests when the new release becomes available. Thanks.