Based on recent posts, it may be worth setting up a review of the Polyhedron benchmark test suite and provide some indication of why FTN95 is so slow for each of these tests. I have (years ago) reviewed some of the tests by using /timing or /profile options to identify the area in the code where most of the time is being used. What I found for a few of the tests was that only a few lines were to blame for FTN95 poor performance, typically where FTN95 failed to optimise the inner loop. Often these could be corrected by 'better' code structures. Other tests exhibited poor indexing of large arrays or used large local arrays that exceeded the stack. Both these problems can be addressed by better array usage with subscript re-ordering or ALLOCATE. ( I have always found that if you need a larger stack, the solution is never to use a larger stack, but fix the problem) There were other examples where vector instructions or /Qparallel helped, which is beyond FTN95's capability.
If we can assemble this list of reasons, it may help FTN95 users to see what other compilers do automatically and what we, as FTN95 users, can do to improve our code to cope with some of these problems. It was my general impression from my past reviews that most of the inefficiency attributed to FTN95 can be avoided or at least mitigated.
As a first step I shall generate a list of the latest tests in the suite and proceed to provide a summary of their status. Hopefully as this becomes fairly complete it will document the problems to be avoided.
If anyone has done similar reviews, please let me know.
I'll keep you updated.
John
ps: We need links to the test suites to review