Baseline optimization flags:
C programs: -openmp -O3 -IPF_fp_relaxed -ipo -ansi_alias (ONESTEP)
Fortran programs: -openmp -O3 -IPF_fp_relaxed -ipo (ONESTEP)
OpenMP runtime library libguide.a statically linked
Extra Flags:
331.art_l: -DINTS_PER_CACHELINE=32 -DDBLS_PER_CACHELINE=16
User environment:
OMP_NUM_THREADS 64
limit stacksize 128000
KMP_STACKSIZE 124M
KMP_LIBRARY TURNAROUND
OMP_DYNAMIC FALSE
KMP_SCHEDULE static,balanced
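As a minimal sketch, the environment variables listed above could be collected for a launcher script as follows. The names RUN_ENV and build_run_env are hypothetical and are not part of the submission. The "limit stacksize 128000" setting is a shell resource limit rather than an environment variable, so it is left out of the dictionary.

```python
import os

# Values copied from the "User environment" list above.
# RUN_ENV and build_run_env are illustrative names, not part of the result.
RUN_ENV = {
    "OMP_NUM_THREADS": "64",
    "KMP_STACKSIZE": "124M",
    "KMP_LIBRARY": "TURNAROUND",
    "OMP_DYNAMIC": "FALSE",
    "KMP_SCHEDULE": "static,balanced",
}


def build_run_env(base=None):
    """Return a copy of the base environment with the run settings applied."""
    env = dict(os.environ if base is None else base)
    env.update(RUN_ENV)
    return env
```

The resulting dictionary could be passed as the env argument of subprocess.run when starting a benchmark binary.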
For all benchmarks, threads were bound to cores using the following submit command:
dplace -x2 -e -cNTM1,0 $command,
where NTM1 is the number of threads minus 1.
This binds threads in order of creation, beginning with the master
thread on core NTM1, the first slave thread on core NTM1-1, and so on.
The -x2 flag instructs dplace to skip placement of the lightweight
OpenMP monitor thread, which is created prior to the slave threads.
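The placement order described above can be illustrated with a short sketch. It assumes that threads are numbered by creation order among the placed threads only (master = 0, first slave = 1), with the monitor thread skipped by -x2 left out of the count. The function names are hypothetical and do not correspond to any dplace interface.

```python
def core_for_thread(creation_index: int, num_threads: int) -> int:
    """Core assigned to a placed thread, counting down from NTM1."""
    ntm1 = num_threads - 1  # NTM1: number of threads minus 1
    if not 0 <= creation_index <= ntm1:
        raise ValueError("creation_index must be in 0..num_threads-1")
    return ntm1 - creation_index


def placement(num_threads: int) -> list[int]:
    """Cores for all placed threads, in order of creation."""
    return [core_for_thread(i, num_threads) for i in range(num_threads)]
```

With OMP_NUM_THREADS set to 64, NTM1 is 63, so placement(64) starts with core 63 for the master thread, continues with core 62 for the first slave thread, and ends with core 0 for the last thread created.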
For a description of SGI's compiler flags, portability flags, and
system parameters used to generate this result, please refer to the
SGI-20060801-Linux-Intel9.1-IPF.txt file in the flags directory.