forums.silverfrost.com

mecej4 · Joined: 31 Oct 2006 Posts: 1897

One of the Fortran benchmark codes that DanRRight wrote about recently contains two code segments with the following pattern:

DanRRight · Posted: Thu Oct 20, 2022 9:16 am

Devilry. This may be because the original author liked "GOTO 666"; I counted it more than 50 times in his code. :)

ROTFL! Self-fulfilled prophecy.
Want to look at the original code? I can send it, or you can get it from his website. The ZIP file contains EXE files, though, so be careful and do not run them.

So FTN95 executed the DO loop over and over while all the other compilers just skipped it, as they should? Or the opposite: FTN95 was doing everything right and all the other compilers cheated? :)

Nice catch if this is indeed the case.

mecej4 · Joined: 31 Oct 2006 Posts: 1897

The mp_prop_design program is unfit to be used as a benchmark.

If you search through the history of the PB11 benchmarks, you may notice that the suite did not include mp_prop_design before 2011. Its original author (Anthony Falzone, an engineer) admits in his Bugzilla thread ( https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53957 ) that "I am not much of a programmer". He has also stated his opinion regarding the program being used as a benchmark at https://propdesign.jimdofree.com/notes/ ; scroll down to the "Benchmarking" section on that page.

The Fortran source code is quite inefficient, with many expressions being repeatedly evaluated inside long DO loops when portions or even whole expressions could be calculated and stored into temporary scalar variables before entering the loop. Here is an example:

JohnCampbell · Joined: 16 Feb 2006 Posts: 2593 Location: Sydney

Most of the PB11 programs are VERY poorly written, with little attention paid to "efficient" coding approaches. Even engineers who write Fortran can understand efficiency.

In these PB11 codes, there are many examples of replicated computation in DO loops and calculations that could be moved outside the DO for efficiency.

What do you do for someone who writes "((radius/rocr)**2.0)"? In older implementations of FORTRAN, the performance hit would have been worse!
So why are these programs used as optimisation targets if the authors did not care? Unfortunately, ifort and other compilers have included optimisations that target these examples, probably at the expense of what other codes require!
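A minimal sketch of the hand optimisation being discussed, not taken from any PB11 source. Only the names radius and rocr come from the expression quoted above; the array chord, the values and the loop are hypothetical. The first loop re-evaluates a loop-independent expression with a real exponent on every pass; the second computes it once, with an integer exponent, before entering the loop.

```fortran
program hoist_demo
  implicit none
  integer, parameter :: dp = kind(1.0d0)
  integer, parameter :: n = 1000
  real(dp) :: radius, rocr, ratio2
  real(dp) :: chord(n), slow(n), fast(n)   ! hypothetical work arrays
  integer :: i

  ! Hypothetical input values, chosen only for illustration
  radius = 0.75_dp
  rocr   = 1.50_dp
  do i = 1, n
     chord(i) = real(i, dp) / real(n, dp)
  end do

  ! Style criticised in the thread: the loop-independent term is
  ! re-evaluated on every iteration, and with a real exponent
  do i = 1, n
     slow(i) = chord(i) * ((radius/rocr)**2.0_dp)
  end do

  ! Hand-optimised: evaluate the term once, with an integer exponent,
  ! and reuse the scalar inside the loop
  ratio2 = (radius/rocr)**2
  do i = 1, n
     fast(i) = chord(i) * ratio2
  end do

  ! Both versions should agree to within rounding
  print *, 'max abs difference =', maxval(abs(slow - fast))
end program hoist_demo
```

An optimising compiler may well perform this transformation itself, which is the point of contention here: the benchmark then measures how well a compiler repairs such code rather than how well it handles carefully written code.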

The three main areas where FTN95 is/was less efficient are:
1) Optimisation in DO loops to remove repeated or loop-independent calculations.
2) Use of temporary arrays for array sections.
3) Use of AVX vector instructions, especially for inner loops.
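Item 2 can be illustrated with a small sketch. It is an assumption-laden example: the module, routine and variable names are hypothetical, and the thread does not say exactly when FTN95 creates a temporary. The comments describe what compilers commonly do when a non-contiguous array section is passed to an explicit-shape dummy argument.

```fortran
module section_demo
  implicit none
  integer, parameter :: dp = kind(1.0d0)
contains
  ! Explicit-shape dummy: the routine expects contiguous storage
  subroutine scale_explicit(n, x, f)
    integer, intent(in) :: n
    real(dp), intent(inout) :: x(n)
    real(dp), intent(in) :: f
    x = x * f
  end subroutine scale_explicit

  ! Assumed-shape dummy: a strided section can be described in place
  subroutine scale_assumed(x, f)
    real(dp), intent(inout) :: x(:)
    real(dp), intent(in) :: f
    x = x * f
  end subroutine scale_assumed
end module section_demo

program temp_array_demo
  use section_demo
  implicit none
  real(dp) :: a(3,4)

  a = 1.0_dp

  ! A row of a column-major array is strided, so a compiler typically
  ! copies it into a temporary and copies the result back afterwards
  call scale_explicit(4, a(2,:), 2.0_dp)

  ! The same section passed to an assumed-shape dummy usually needs no copy
  call scale_assumed(a(2,:), 2.0_dp)

  ! A column is contiguous, so no temporary should be required
  call scale_explicit(3, a(:,1), 3.0_dp)

  print *, 'row 2    :', a(2,:)
  print *, 'column 1 :', a(:,1)
end program temp_array_demo
```

Inside a hot inner loop, the copy-in/copy-out in the first call can cost more than the arithmetic it wraps, so arranging the data so that the section is contiguous, or using an assumed-shape interface, is the usual remedy.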

When efficiency is being targeted, it is the inner loops that matter most. For my FE calculations, AVX instructions are easily included, but for more complex field theory calculations they can be harder to apply.

Fortunately, the area where FTN95 is more effective is in diagnostics, so it remains a very useful Fortran tool.

DanRRight · Posted: Fri Oct 21, 2022 10:25 pm

Great, mecej4 and John. Interesting findings. I was genuinely shocked, and will tell you about that later. So the placement of GOTO 70 and GOTO 80 is not the reason? Also, I doubt that printing to the screen slows things down by more than a percent. Please keep investigating.

A motto came to mind: "FTN95: the compiler for real pros".
And another: "Real programmers do not use compiler optimization, they optimize codes by hand" :)

Now I understand why FTN95 is the only compiler where you can mix Fortran and assembler source text in the same subroutine and the compiler will swallow it as its own.

PaulLaidler · Posted: Sat Oct 22, 2022 9:52 am

mecej4

I have seen your comment

mecej4 · Joined: 31 Oct 2006 Posts: 1897

Here is what I think should suffice:

[Note: I am presuming that we rarely see instances of the old Fortran "extended DO range", where one could GOTO from inside the DO loop to the outside, execute some statements, followed by another GOTO back to a statement number within the DO loop.]

PaulLaidler · Posted: Thu Nov 17, 2022 3:46 pm

mecej4

I have had a look at this and I don't think that it can be done within a reasonable time frame.