NONMEM on a SGI R8000

3 messages 3 people Latest: Dec 08, 1994

NONMEM on a SGI R8000

From: Ferdie Rombout Date: December 08, 1994 technical
We wanted to buy the R8000 challenge from SGI. Before buying we wanted to have a benchmark to see how the performance compared to our R4400 150 Mhz. It turned out that the R8000 did not deliver the performance we (and SGI) expected. So SGI had someone look at the sources and discovered that it was no floating point dependent source. It was more coupled to the integer performance. Then we had a look at the R4400 200 Mhz, and found our run speeds improved two-fold. This was explained by the bigger secondary cache (R4400 150 Mhz: 1 Mb, R4400 200 Mhz: 4 Mb) and by the higher specsint of this machine compared to the R8000 and R4400 150 Mhz Beside this, on SGI machines it can be important to adjust the size of the buffe When looking with a profiler NONMEM spends most time doing freads. This is done for a big part to NONMEM FILE10. This has increased our run times almost with a factor of two, just by adjusting NONMEM buffers. After adjusting the buffers freads decreased from more than 35 % to less than 1%. This is probably dependent on secondary cache since the performance got worse on a R3000 (no 1 Mb sec. cache). We have ordered the R4400 200 Mhz with two processors, our next steps will be optimizing the NONMEM source and parallel processing.

NONMEM on a SGI R8000

From: Ian Dillon Date: December 08, 1994 technical
I've CC'd everyone in your email, to expedite this request. > We would like to run NONMEM on a SGI Power Challenge machine with MIPS ... > this machine? Here are the most aggressive optimization options I've used to benchmark NONMEM on a single R8000 Power Challenge (from my makefile): BIN = nonmem FFLAG1 = -c -O3 -GCM:aggressive_speculation=ON:array_speculation=ON FFLAG2 = -Wk,-o=0,-so=0,-r=3 -OPT:roundoff=3:IEEE_arithmetic=3:fast_sqrt=on:fast_exp=on LFLAGS = -lfastm F77 = f77, You can play around with the KAP (-Wk,-o=0,-so=0,-r=3) options if need be. Also, if you want to turn on parallelization, tack on the -PFA switches.

NONMEM on a SGI R8000

From: Lutz Harnisch Date: Unknown date technical
(harnisch@pollux.zedat.fu-berlin.de) - 7 Dec 94 We would like to run NONMEM on a SGI Power Challenge machine with MIPS R8000 Processors and need some hints for compiling. What are the machine constants in BLKDAT? Is it advisable to run NONMEM in double or single precision mode, since this machine has a true 64 bit architecture? Are there any compiler flags to generate optimized code for this machine?