MPI2007 Result Flag Description

Base Optimization Flags

C benchmarks

- -O3
- mpicc,mpiCC,mpif90
- COPTIMIZE
- Enables O2 optimizations plus more aggressive optimizations, such as prefetching, scalar replacement, and loop and memory access transformations. Enables optimizations for maximum speed, such as:
  - Loop unrolling, including instruction scheduling
  - Code replication to eliminate branches
  - Padding the size of certain power-of-two arrays to allow more efficient cache use.
  On Intel Itanium processors, the O3 option enables optimizations for technical computing applications (loop-intensive code):
  loop optimizations and data prefetch. The O3 optimizations may not cause higher performance unless loop and memory access transformations take place. The optimizations may slow down code in some cases compared to O2 optimizations.
  The O3 option is recommended for applications that have loops that heavily use floating-point calculations and process large data sets.
- Includes:
  - -O2
    - -O1
      
      -unrolln
      
      -builtin
      
      -mno-ieee-fp
      
      -fomit-frame-pointer
      
      -ffunction-sections
- -no-prec-div
- COPTIMIZE
- Enables optimizations that give slightly less precise results than full IEEE division. With some optimizations, such as -xN and -xB, the compiler may change floating-point division compu- tations into multiplication by the reciprocal of the denomina- tor. For example, A/B is computed as A * (1/B) to improve the speed of the computation. The default is -prec-div, which provides fully precise IEEE division. It improves precision of floating-point divides by disabling floating-point division-to-multiplication optimiza- tions, resulting in greater accuracy with some loss of perfor- mance.
- -ftz
- COPTIMIZE
- Flushes denormal floating point results to zero when the application is in gradual underflow mode.
- -fno-alias
- COPTIMIZE
- Tells the compiler not to assume aliasing in the program (DEFAULT = -falias).
- -xT
- COPTIMIZE
- Generates specialized code to run exclusively on processors with the extensions T. This option can generate SSSE3, SSE3, SSE2, and SSE instructions for Intel processors, and it can optimize for the Intel (R) Core (TM) 2 Duo processor family.

C++ benchmarks

126.lammps

- -O3
- mpicc,mpiCC,mpif90
- CXXOPTIMIZE
- Enables O2 optimizations plus more aggressive optimizations, such as prefetching, scalar replacement, and loop and memory access transformations. Enables optimizations for maximum speed, such as:
  - Loop unrolling, including instruction scheduling
  - Code replication to eliminate branches
  - Padding the size of certain power-of-two arrays to allow more efficient cache use.
  On Intel Itanium processors, the O3 option enables optimizations for technical computing applications (loop-intensive code):
  loop optimizations and data prefetch. The O3 optimizations may not cause higher performance unless loop and memory access transformations take place. The optimizations may slow down code in some cases compared to O2 optimizations.
  The O3 option is recommended for applications that have loops that heavily use floating-point calculations and process large data sets.
- Includes:
  - -O2
    - -O1
      
      -unrolln
      
      -builtin
      
      -mno-ieee-fp
      
      -fomit-frame-pointer
      
      -ffunction-sections
- -no-prec-div
- CXXOPTIMIZE
- Enables optimizations that give slightly less precise results than full IEEE division. With some optimizations, such as -xN and -xB, the compiler may change floating-point division compu- tations into multiplication by the reciprocal of the denomina- tor. For example, A/B is computed as A * (1/B) to improve the speed of the computation. The default is -prec-div, which provides fully precise IEEE division. It improves precision of floating-point divides by disabling floating-point division-to-multiplication optimiza- tions, resulting in greater accuracy with some loss of perfor- mance.
- -ftz
- CXXOPTIMIZE
- Flushes denormal floating point results to zero when the application is in gradual underflow mode.
- -fno-alias
- CXXOPTIMIZE
- Tells the compiler not to assume aliasing in the program (DEFAULT = -falias).
- -xT
- CXXOPTIMIZE
- Generates specialized code to run exclusively on processors with the extensions T. This option can generate SSSE3, SSE3, SSE2, and SSE instructions for Intel processors, and it can optimize for the Intel (R) Core (TM) 2 Duo processor family.

Fortran benchmarks

107.leslie3d

- -O3
- mpicc,mpiCC,mpif90
- FOPTIMIZE
- Enables O2 optimizations plus more aggressive optimizations, such as prefetching, scalar replacement, and loop and memory access transformations. Enables optimizations for maximum speed, such as:
  - Loop unrolling, including instruction scheduling
  - Code replication to eliminate branches
  - Padding the size of certain power-of-two arrays to allow more efficient cache use.
  On Intel Itanium processors, the O3 option enables optimizations for technical computing applications (loop-intensive code):
  loop optimizations and data prefetch. The O3 optimizations may not cause higher performance unless loop and memory access transformations take place. The optimizations may slow down code in some cases compared to O2 optimizations.
  The O3 option is recommended for applications that have loops that heavily use floating-point calculations and process large data sets.
- Includes:
  - -O2
    - -O1
      
      -unrolln
      
      -builtin
      
      -mno-ieee-fp
      
      -fomit-frame-pointer
      
      -ffunction-sections
- -no-prec-div
- FOPTIMIZE
- Enables optimizations that give slightly less precise results than full IEEE division. With some optimizations, such as -xN and -xB, the compiler may change floating-point division compu- tations into multiplication by the reciprocal of the denomina- tor. For example, A/B is computed as A * (1/B) to improve the speed of the computation. The default is -prec-div, which provides fully precise IEEE division. It improves precision of floating-point divides by disabling floating-point division-to-multiplication optimiza- tions, resulting in greater accuracy with some loss of perfor- mance.
- -ftz
- FOPTIMIZE
- Flushes denormal floating point results to zero when the application is in gradual underflow mode.
- -fno-alias
- FOPTIMIZE
- Tells the compiler not to assume aliasing in the program (DEFAULT = -falias).
- -xT
- FOPTIMIZE
- Generates specialized code to run exclusively on processors with the extensions T. This option can generate SSSE3, SSE3, SSE2, and SSE instructions for Intel processors, and it can optimize for the Intel (R) Core (TM) 2 Duo processor family.

113.GemsFDTD

- -O3
- mpicc,mpiCC,mpif90
- FOPTIMIZE
- Enables O2 optimizations plus more aggressive optimizations, such as prefetching, scalar replacement, and loop and memory access transformations. Enables optimizations for maximum speed, such as:
  - Loop unrolling, including instruction scheduling
  - Code replication to eliminate branches
  - Padding the size of certain power-of-two arrays to allow more efficient cache use.
  On Intel Itanium processors, the O3 option enables optimizations for technical computing applications (loop-intensive code):
  loop optimizations and data prefetch. The O3 optimizations may not cause higher performance unless loop and memory access transformations take place. The optimizations may slow down code in some cases compared to O2 optimizations.
  The O3 option is recommended for applications that have loops that heavily use floating-point calculations and process large data sets.
- Includes:
  - -O2
    - -O1
      
      -unrolln
      
      -builtin
      
      -mno-ieee-fp
      
      -fomit-frame-pointer
      
      -ffunction-sections
- -no-prec-div
- FOPTIMIZE
- Enables optimizations that give slightly less precise results than full IEEE division. With some optimizations, such as -xN and -xB, the compiler may change floating-point division compu- tations into multiplication by the reciprocal of the denomina- tor. For example, A/B is computed as A * (1/B) to improve the speed of the computation. The default is -prec-div, which provides fully precise IEEE division. It improves precision of floating-point divides by disabling floating-point division-to-multiplication optimiza- tions, resulting in greater accuracy with some loss of perfor- mance.
- -ftz
- FOPTIMIZE
- Flushes denormal floating point results to zero when the application is in gradual underflow mode.
- -fno-alias
- FOPTIMIZE
- Tells the compiler not to assume aliasing in the program (DEFAULT = -falias).
- -xT
- FOPTIMIZE
- Generates specialized code to run exclusively on processors with the extensions T. This option can generate SSSE3, SSE3, SSE2, and SSE instructions for Intel processors, and it can optimize for the Intel (R) Core (TM) 2 Duo processor family.

115.fds4

- -O3
- mpicc,mpiCC,mpif90
- FOPTIMIZE
- Enables O2 optimizations plus more aggressive optimizations, such as prefetching, scalar replacement, and loop and memory access transformations. Enables optimizations for maximum speed, such as:
  - Loop unrolling, including instruction scheduling
  - Code replication to eliminate branches
  - Padding the size of certain power-of-two arrays to allow more efficient cache use.
  On Intel Itanium processors, the O3 option enables optimizations for technical computing applications (loop-intensive code):
  loop optimizations and data prefetch. The O3 optimizations may not cause higher performance unless loop and memory access transformations take place. The optimizations may slow down code in some cases compared to O2 optimizations.
  The O3 option is recommended for applications that have loops that heavily use floating-point calculations and process large data sets.
- Includes:
  - -O2
    - -O1
      
      -unrolln
      
      -builtin
      
      -mno-ieee-fp
      
      -fomit-frame-pointer
      
      -ffunction-sections
- -no-prec-div
- FOPTIMIZE
- Enables optimizations that give slightly less precise results than full IEEE division. With some optimizations, such as -xN and -xB, the compiler may change floating-point division compu- tations into multiplication by the reciprocal of the denomina- tor. For example, A/B is computed as A * (1/B) to improve the speed of the computation. The default is -prec-div, which provides fully precise IEEE division. It improves precision of floating-point divides by disabling floating-point division-to-multiplication optimiza- tions, resulting in greater accuracy with some loss of perfor- mance.
- -ftz
- FOPTIMIZE
- Flushes denormal floating point results to zero when the application is in gradual underflow mode.
- -fno-alias
- FOPTIMIZE
- Tells the compiler not to assume aliasing in the program (DEFAULT = -falias).
- -xT
- FOPTIMIZE
- Generates specialized code to run exclusively on processors with the extensions T. This option can generate SSSE3, SSE3, SSE2, and SSE instructions for Intel processors, and it can optimize for the Intel (R) Core (TM) 2 Duo processor family.

129.tera_tf

- -O3
- mpicc,mpiCC,mpif90
- FOPTIMIZE
- Enables O2 optimizations plus more aggressive optimizations, such as prefetching, scalar replacement, and loop and memory access transformations. Enables optimizations for maximum speed, such as:
  - Loop unrolling, including instruction scheduling
  - Code replication to eliminate branches
  - Padding the size of certain power-of-two arrays to allow more efficient cache use.
  On Intel Itanium processors, the O3 option enables optimizations for technical computing applications (loop-intensive code):
  loop optimizations and data prefetch. The O3 optimizations may not cause higher performance unless loop and memory access transformations take place. The optimizations may slow down code in some cases compared to O2 optimizations.
  The O3 option is recommended for applications that have loops that heavily use floating-point calculations and process large data sets.
- Includes:
  - -O2
    - -O1
      
      -unrolln
      
      -builtin
      
      -mno-ieee-fp
      
      -fomit-frame-pointer
      
      -ffunction-sections
- -no-prec-div
- FOPTIMIZE
- Enables optimizations that give slightly less precise results than full IEEE division. With some optimizations, such as -xN and -xB, the compiler may change floating-point division compu- tations into multiplication by the reciprocal of the denomina- tor. For example, A/B is computed as A * (1/B) to improve the speed of the computation. The default is -prec-div, which provides fully precise IEEE division. It improves precision of floating-point divides by disabling floating-point division-to-multiplication optimiza- tions, resulting in greater accuracy with some loss of perfor- mance.
- -ftz
- FOPTIMIZE
- Flushes denormal floating point results to zero when the application is in gradual underflow mode.
- -fno-alias
- FOPTIMIZE
- Tells the compiler not to assume aliasing in the program (DEFAULT = -falias).
- -xT
- FOPTIMIZE
- Generates specialized code to run exclusively on processors with the extensions T. This option can generate SSSE3, SSE3, SSE2, and SSE instructions for Intel processors, and it can optimize for the Intel (R) Core (TM) 2 Duo processor family.

132.zeusmp2

- -O3
- mpicc,mpiCC,mpif90
- FOPTIMIZE
- Enables O2 optimizations plus more aggressive optimizations, such as prefetching, scalar replacement, and loop and memory access transformations. Enables optimizations for maximum speed, such as:
  - Loop unrolling, including instruction scheduling
  - Code replication to eliminate branches
  - Padding the size of certain power-of-two arrays to allow more efficient cache use.
  On Intel Itanium processors, the O3 option enables optimizations for technical computing applications (loop-intensive code):
  loop optimizations and data prefetch. The O3 optimizations may not cause higher performance unless loop and memory access transformations take place. The optimizations may slow down code in some cases compared to O2 optimizations.
  The O3 option is recommended for applications that have loops that heavily use floating-point calculations and process large data sets.
- Includes:
  - -O2
    - -O1
      
      -unrolln
      
      -builtin
      
      -mno-ieee-fp
      
      -fomit-frame-pointer
      
      -ffunction-sections
- -no-prec-div
- FOPTIMIZE
- Enables optimizations that give slightly less precise results than full IEEE division. With some optimizations, such as -xN and -xB, the compiler may change floating-point division compu- tations into multiplication by the reciprocal of the denomina- tor. For example, A/B is computed as A * (1/B) to improve the speed of the computation. The default is -prec-div, which provides fully precise IEEE division. It improves precision of floating-point divides by disabling floating-point division-to-multiplication optimiza- tions, resulting in greater accuracy with some loss of perfor- mance.
- -ftz
- FOPTIMIZE
- Flushes denormal floating point results to zero when the application is in gradual underflow mode.
- -fno-alias
- FOPTIMIZE
- Tells the compiler not to assume aliasing in the program (DEFAULT = -falias).
- -xT
- FOPTIMIZE
- Generates specialized code to run exclusively on processors with the extensions T. This option can generate SSSE3, SSE3, SSE2, and SSE instructions for Intel processors, and it can optimize for the Intel (R) Core (TM) 2 Duo processor family.

137.lu

- -O3
- mpicc,mpiCC,mpif90
- FOPTIMIZE
- Enables O2 optimizations plus more aggressive optimizations, such as prefetching, scalar replacement, and loop and memory access transformations. Enables optimizations for maximum speed, such as:
  - Loop unrolling, including instruction scheduling
  - Code replication to eliminate branches
  - Padding the size of certain power-of-two arrays to allow more efficient cache use.
  On Intel Itanium processors, the O3 option enables optimizations for technical computing applications (loop-intensive code):
  loop optimizations and data prefetch. The O3 optimizations may not cause higher performance unless loop and memory access transformations take place. The optimizations may slow down code in some cases compared to O2 optimizations.
  The O3 option is recommended for applications that have loops that heavily use floating-point calculations and process large data sets.
- Includes:
  - -O2
    - -O1
      
      -unrolln
      
      -builtin
      
      -mno-ieee-fp
      
      -fomit-frame-pointer
      
      -ffunction-sections
- -no-prec-div
- FOPTIMIZE
- Enables optimizations that give slightly less precise results than full IEEE division. With some optimizations, such as -xN and -xB, the compiler may change floating-point division compu- tations into multiplication by the reciprocal of the denomina- tor. For example, A/B is computed as A * (1/B) to improve the speed of the computation. The default is -prec-div, which provides fully precise IEEE division. It improves precision of floating-point divides by disabling floating-point division-to-multiplication optimiza- tions, resulting in greater accuracy with some loss of perfor- mance.
- -ftz
- FOPTIMIZE
- Flushes denormal floating point results to zero when the application is in gradual underflow mode.
- -fno-alias
- FOPTIMIZE
- Tells the compiler not to assume aliasing in the program (DEFAULT = -falias).
- -xT
- FOPTIMIZE
- Generates specialized code to run exclusively on processors with the extensions T. This option can generate SSSE3, SSE3, SSE2, and SSE instructions for Intel processors, and it can optimize for the Intel (R) Core (TM) 2 Duo processor family.

Benchmarks using both Fortran and C

121.pop2

- -O3
- mpicc,mpiCC,mpif90
- COPTIMIZE, FOPTIMIZE
- Enables O2 optimizations plus more aggressive optimizations, such as prefetching, scalar replacement, and loop and memory access transformations. Enables optimizations for maximum speed, such as:
  - Loop unrolling, including instruction scheduling
  - Code replication to eliminate branches
  - Padding the size of certain power-of-two arrays to allow more efficient cache use.
  On Intel Itanium processors, the O3 option enables optimizations for technical computing applications (loop-intensive code):
  loop optimizations and data prefetch. The O3 optimizations may not cause higher performance unless loop and memory access transformations take place. The optimizations may slow down code in some cases compared to O2 optimizations.
  The O3 option is recommended for applications that have loops that heavily use floating-point calculations and process large data sets.
- Includes:
  - -O2
    - -O1
      
      -unrolln
      
      -builtin
      
      -mno-ieee-fp
      
      -fomit-frame-pointer
      
      -ffunction-sections
- -no-prec-div
- COPTIMIZE, FOPTIMIZE
- Enables optimizations that give slightly less precise results than full IEEE division. With some optimizations, such as -xN and -xB, the compiler may change floating-point division compu- tations into multiplication by the reciprocal of the denomina- tor. For example, A/B is computed as A * (1/B) to improve the speed of the computation. The default is -prec-div, which provides fully precise IEEE division. It improves precision of floating-point divides by disabling floating-point division-to-multiplication optimiza- tions, resulting in greater accuracy with some loss of perfor- mance.
- -ftz
- COPTIMIZE, FOPTIMIZE
- Flushes denormal floating point results to zero when the application is in gradual underflow mode.
- -fno-alias
- COPTIMIZE, FOPTIMIZE
- Tells the compiler not to assume aliasing in the program (DEFAULT = -falias).
- -xT
- COPTIMIZE, FOPTIMIZE
- Generates specialized code to run exclusively on processors with the extensions T. This option can generate SSSE3, SSE3, SSE2, and SSE instructions for Intel processors, and it can optimize for the Intel (R) Core (TM) 2 Duo processor family.

130.socorro

- Same as 121.pop2

Implicitly Included Flags

This section contains descriptions of flags that were included implicitly by other flags, but which do not have a permanent home at SPEC.

For questions about the meanings of these flags, please contact the tester.
For other inquiries, please contact webmaster@spec.org
Copyright 2006-2010 Standard Performance Evaluation Corporation
Tested with SPEC MPI2007 v59.
Report generated on Tue Apr 13 15:43:49 2010 by SPEC MPI2007 flags formatter v1412.

MPI2007 Flag Description
Hewlett Packard Company HP Proliant BL460c blade Cluster Platform 3000BL

Test sponsored by Hewlett-Packard Company

Compilers:

Base Compiler Invocation

C benchmarks

C++ benchmarks

126.lammps

Fortran benchmarks

107.leslie3d

113.GemsFDTD

115.fds4

129.tera_tf

132.zeusmp2

137.lu

Benchmarks using both Fortran and C (except as noted below)

Base Portability Flags

121.pop2

127.wrf2

Base Optimization Flags

C benchmarks

C++ benchmarks

126.lammps

Fortran benchmarks

107.leslie3d

113.GemsFDTD

115.fds4

129.tera_tf

132.zeusmp2

137.lu

Benchmarks using both Fortran and C

121.pop2

127.wrf2

128.GAPgeofem

130.socorro

Implicitly Included Flags

	Indicates that the flag description came from the user flags file.
	Indicates that the flag description came from the suite-wide flags file.
	Indicates that the flag description came from a per-benchmark flags file.

MPI2007 Flag DescriptionHewlett Packard Company HP Proliant BL460c blade Cluster Platform 3000BL

Test sponsored by Hewlett-Packard Company

Compilers:

Base Compiler Invocation

Base Portability Flags

Base Optimization Flags

Implicitly Included Flags

MPI2007 Flag Description
Hewlett Packard Company HP Proliant BL460c blade Cluster Platform 3000BL