SPEC(R) MPIM2007 Summary
                              Intel Corporation
                  Endeavor (Intel Xeon E7-8890 v3, 2.50 GHz,
                      DDR4-2133 MHz, SMT on, Turbo off)
                           Thu May 21 05:06:14 2015

MPI2007 License: 13                                      Test date: May-2015
Test sponsor: Intel Corporation              Hardware availability: May-2015
Tested by:    Pavel Shelepugin               Software availability: Apr-2015

                Base     Base       Base        Peak     Peak       Peak
Benchmarks      Ranks  Run Time     Ratio       Ranks  Run Time     Ratio
-------------- ------  ---------  ---------    ------  ---------  ---------
104.milc          576       24.5       63.8 S                                  
104.milc          576       24.3       64.4 *                                  
104.milc          576       24.3       64.5 S                                  
107.leslie3d      576       66.8       78.1 *                                  
107.leslie3d      576       66.5       78.5 S                                  
107.leslie3d      576       67.4       77.4 S                                  
113.GemsFDTD      576      308         20.5 S                                  
113.GemsFDTD      576      308         20.4 *                                  
113.GemsFDTD      576      309         20.4 S                                  
115.fds4          576       20.5       95.0 *                                  
115.fds4          576       20.9       93.3 S                                  
115.fds4          576       20.3       95.9 S                                  
121.pop2          576      172         24.0 S                                  
121.pop2          576      178         23.2 S                                  
121.pop2          576      175         23.6 *                                  
122.tachyon       576       30.5       91.6 *                                  
122.tachyon       576       30.8       90.9 S                                  
122.tachyon       576       30.5       91.6 S                                  
126.lammps        576      137         21.3 *                                  
126.lammps        576      137         21.3 S                                  
126.lammps        576      137         21.3 S                                  
127.wrf2          576       51.2      152   *                                  
127.wrf2          576       50.8      153   S                                  
127.wrf2          576       51.5      151   S                                  
128.GAPgeofem     576       22.1       93.4 *                                  
128.GAPgeofem     576       22.2       93.2 S                                  
128.GAPgeofem     576       22.0       93.7 S                                  
129.tera_tf       576       41.0       67.5 S                                  
129.tera_tf       576       41.1       67.3 *                                  
129.tera_tf       576       41.2       67.2 S                                  
130.socorro       576      127         30.1 S                                  
130.socorro       576      127         30.1 *                                  
130.socorro       576      127         30.1 S                                  
132.zeusmp2       576       32.6       95.3 S                                  
132.zeusmp2       576       33.0       94.0 S                                  
132.zeusmp2       576       32.8       94.7 *                                  
137.lu            576       31.6      116   *                                  
137.lu            576       31.5      117   S                                  
137.lu            576       31.8      116   S                                  
==============================================================================
104.milc          576       24.3       64.4 *                                  
107.leslie3d      576       66.8       78.1 *                                  
113.GemsFDTD      576      308         20.4 *                                  
115.fds4          576       20.5       95.0 *                                  
121.pop2          576      175         23.6 *                                  
122.tachyon       576       30.5       91.6 *                                  
126.lammps        576      137         21.3 *                                  
127.wrf2          576       51.2      152   *                                  
128.GAPgeofem     576       22.1       93.4 *                                  
129.tera_tf       576       41.1       67.3 *                                  
130.socorro       576      127         30.1 *                                  
132.zeusmp2       576       32.8       94.7 *                                  
137.lu            576       31.6      116   *                                  
 SPECmpiM_base2007                     60.4
 SPECmpiM_peak2007                                                  Not Run
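
SPEC composite metrics are geometric means of the per-benchmark ratios. As a
rough cross-check, the sketch below recomputes the base metric from the 13
median ("*") ratios listed above. The variable and function names are
illustrative and are not part of the SPEC tools. Because the ratios in the
table are already rounded, the result can differ slightly from the reported
value before rounding.

```python
import math

# Median ("*") base ratios copied from the summary table above.
median_ratios = {
    "104.milc": 64.4,
    "107.leslie3d": 78.1,
    "113.GemsFDTD": 20.4,
    "115.fds4": 95.0,
    "121.pop2": 23.6,
    "122.tachyon": 91.6,
    "126.lammps": 21.3,
    "127.wrf2": 152.0,
    "128.GAPgeofem": 93.4,
    "129.tera_tf": 67.3,
    "130.socorro": 30.1,
    "132.zeusmp2": 94.7,
    "137.lu": 116.0,
}


def geometric_mean(values):
    """Geometric mean, computed through logarithms for numerical stability."""
    values = list(values)
    return math.exp(sum(math.log(v) for v in values) / len(values))


overall = geometric_mean(median_ratios.values())
print(f"geometric mean of {len(median_ratios)} median ratios: {overall:.2f}")
```

The printed value should land close to the SPECmpiM_base2007 figure of 60.4
reported above.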


                              BENCHMARK DETAILS
                              -----------------
      Type of System: Homogeneous
 Total Compute Nodes: 8
         Total Chips: 32
         Total Cores: 576
       Total Threads: 1152
        Total Memory: 4 TB
      Base Ranks Run: 576
  Minimum Peak Ranks: --
  Maximum Peak Ranks: --
          C Compiler: Intel C++ Composer XE 2015 for Linux,
                      Version 15.0.3.187 Build 20150407
        C++ Compiler: Intel C++ Composer XE 2015 for Linux,
                      Version 15.0.3.187 Build 20150407
    Fortran Compiler: Intel Fortran Composer XE 2015 for Linux,
                      Version 15.0.3.187 Build 20150407
       Base Pointers: 64-bit
       Peak Pointers: 64-bit
         MPI Library: Intel MPI Library 5.0.3.048 for Linux
      Other MPI Info: None
      Pre-processors: No
      Other Software: None

                       Node Description: Endeavor Node
                       ===============================


                                   HARDWARE
                                   --------
     Number of nodes: 8
    Uses of the node: compute
              Vendor: Intel
               Model: S4TR2KU1Q
            CPU Name: Intel Xeon E7-8890 v3
    CPU(s) orderable: 1-4 chips
       Chips enabled: 4
       Cores enabled: 72
      Cores per chip: 18
    Threads per core: 2
 CPU Characteristics: Intel Turbo Boost Technology disabled,
                      9.6 GT/s QPI, Hyper-Threading enabled
             CPU MHz: 2500
       Primary Cache: 32 KB I + 32 KB D on chip per core
     Secondary Cache: 256 KB I+D on chip per core
            L3 Cache: 45 MB I+D on chip per chip, 45 MB
                      shared / 18 cores
         Other Cache: None
              Memory: 512 GB (32 x 16 GB 2Rx4 PC4-17000R-15, ECC)
      Disk Subsystem: ATA INTEL SSDSA2BZ20, SSDSC2BB80
      Other Hardware: None
             Adapter: Intel (ESB2) 82575EB Dual-Port Gigabit
                      Ethernet Controller
  Number of Adapters: 1
           Slot Type: PCI-Express x8
           Data Rate: 1Gbps Ethernet
          Ports Used: 2
   Interconnect Type: Ethernet
             Adapter: Mellanox MCX353A-FCAT ConnectX-3
  Number of Adapters: 1
           Slot Type: PCIe x8 Gen3
           Data Rate: InfiniBand 4x FDR
          Ports Used: 1
   Interconnect Type: InfiniBand
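
The system totals under BENCHMARK DETAILS follow from the per-node figures in
this hardware block. A minimal sketch of that arithmetic, using illustrative
variable names and assuming 1 TB = 1024 GB:

```python
# Per-node figures taken from the Endeavor Node hardware block.
nodes = 8
chips_per_node = 4
cores_per_chip = 18
threads_per_core = 2
dimms_per_node = 32
dimm_size_gb = 16

# Derived per-node and system-wide totals.
memory_per_node_gb = dimms_per_node * dimm_size_gb       # 512 GB per node
total_chips = nodes * chips_per_node                     # Total Chips
total_cores = total_chips * cores_per_chip               # Total Cores
total_threads = total_cores * threads_per_core           # Total Threads
total_memory_tb = nodes * memory_per_node_gb / 1024      # Total Memory in TB

print(total_chips, total_cores, total_threads, total_memory_tb)
```

The derived core count also equals the 576 base ranks used in the run.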


                                   SOFTWARE
                                   --------
             Adapter: Intel (ESB2) 82575EB Dual-Port Gigabit
                      Ethernet Controller
      Adapter Driver: e1000
    Adapter Firmware: None
             Adapter: Mellanox MCX353A-FCAT ConnectX-3
      Adapter Driver: OFED 3.5-2-MIC-rc1
    Adapter Firmware: 2.31.5050
    Operating System: Red Hat EL 6.5, kernel 2.6.32-358
   Local File System: Linux/xfs
  Shared File System: NFS
        System State: Multi-User
      Other Software: IBM Platform LSF Standard 9.1.1.1


                            Node Description: NFS
                            =====================


                                   HARDWARE
                                   --------
     Number of nodes: 1
    Uses of the node: fileserver
              Vendor: Intel
               Model: S7000FC4UR
            CPU Name: Intel Xeon CPU
    CPU(s) orderable: 1-4 chips
       Chips enabled: 4
       Cores enabled: 16
      Cores per chip: 4
    Threads per core: 2
 CPU Characteristics: --
             CPU MHz: 2926
       Primary Cache: 32 KB I + 32 KB D on chip per core
     Secondary Cache: 8 MB I+D on chip per chip, 4 MB shared / 2 cores
            L3 Cache: None
         Other Cache: None
              Memory: 64 GB
      Disk Subsystem: 8 disks, 500GB/disk, 2.7TB total
      Other Hardware: None
             Adapter: Intel 82563GB Dual-Port Gigabit
                      Ethernet Controller
  Number of Adapters: 1
           Slot Type: PCI-Express x8
           Data Rate: 1Gbps Ethernet
          Ports Used: 1
   Interconnect Type: Ethernet


                                   SOFTWARE
                                   --------
             Adapter: Intel 82563GB Dual-Port Gigabit
                      Ethernet Controller
      Adapter Driver: e1000e
    Adapter Firmware: N/A
    Operating System: Red Hat EL 5 Update 4
   Local File System: None
  Shared File System: NFS
        System State: Multi-User
      Other Software: None


                     Interconnect Description: IB Switch
                     ===================================


                                   HARDWARE
                                   --------
              Vendor: Mellanox
               Model: Mellanox MSX6025F-1BFR
        Switch Model: Mellanox MSX6025F-1BFR
  Number of Switches: 46
     Number of Ports: 36
           Data Rate: InfiniBand 4x FDR
            Firmware: 9.2.8000
            Topology: Fat tree
         Primary Use: MPI traffic


                  Interconnect Description: Gigabit Ethernet
                  ==========================================


                                   HARDWARE
                                   --------
              Vendor: Force10 Networks, Cisco Systems
               Model: Force10 S50N, Force10 C300, Cisco WS-C4948E-F
        Switch Model: Force10 S50N, Force10 C300, Cisco WS-C4948E-F
  Number of Switches: 13
     Number of Ports: 48
           Data Rate: 1Gbps Ethernet, 10Gbps Ethernet
            Firmware: 8.3.2.0, 12.2(54)WO
            Topology: Star
         Primary Use: Cluster File System



                                 Submit Notes
                                 ------------
    The config file option 'submit' was used.

                                General Notes
                                -------------
    130.socorro (base): "nullify_ptrs" src.alt was used.
    
    129.tera_tf (base): "add_rank_support" src.alt was used.
    
     MPI startup command:
       The mpiexec.hydra command was used to start MPI jobs.
    
     BIOS settings:
       Intel Hyper-Threading Technology (SMT): Enabled (default is Enabled)
       Intel Turbo Boost Technology (Turbo)  : Disabled (default is Enabled)
    
     RAM configuration:
       Compute nodes have 4x16-GB RDIMM on each memory channel.
    
     Network:
       Forty-six 36-port switches: 18 core switches and 28 leaf switches.
       Each leaf has one link to each core. The remaining 18 ports on 25 of
       the 28 leaves are used for compute nodes. On the other 3 leaves the
       ports are used for FS nodes and other peripherals.
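
The port accounting in the network note can be checked with a few lines of
arithmetic. This is only a sketch: the variable names are illustrative, and
the count of compute-facing ports is derived here as an upper bound, not
stated in the report.

```python
PORTS_PER_SWITCH = 36
core_switches = 18
leaf_switches = 28
compute_leaves = 25  # leaves whose remaining ports serve compute nodes

total_switches = core_switches + leaf_switches

# Each leaf has one link to each core switch, so uplinks equal the core count.
uplinks_per_leaf = core_switches
downlinks_per_leaf = PORTS_PER_SWITCH - uplinks_per_leaf

# Each core switch has one link from each leaf.
ports_used_per_core = leaf_switches

# Derived upper bound on compute-node ports (not a figure from the report).
compute_ports = compute_leaves * downlinks_per_leaf

print(total_switches, downlinks_per_leaf, ports_used_per_core, compute_ports)
```

With equal uplink and downlink counts on each leaf, the tree is not
oversubscribed at the leaf level.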
    
     Job placement:
       Each MPI job was assigned to a topologically compact set of nodes, i.e.,
       each job used the minimum number of leaf switches needed: 1 switch for
       72/144/288/576 ranks.
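
The leaf-switch count for each job size can be reproduced as follows. This is
a sketch under two assumptions that the report does not state outright: each
node hosts 72 ranks (one per enabled core), and each compute node occupies one
of the 18 compute-facing ports on a leaf. The function name is hypothetical.

```python
import math

RANKS_PER_NODE = 72  # assumption: one rank per enabled core on a node
NODES_PER_LEAF = 18  # compute-facing ports on each leaf switch


def leaves_needed(ranks):
    """Minimum number of leaf switches for a compactly placed job."""
    nodes = math.ceil(ranks / RANKS_PER_NODE)
    return math.ceil(nodes / NODES_PER_LEAF)


placement = {ranks: leaves_needed(ranks) for ranks in (72, 144, 288, 576)}
for ranks, leaves in placement.items():
    print(f"{ranks} ranks -> {leaves} leaf switch(es)")
```

Under these assumptions even the 576-rank job spans only 8 nodes, which fits
on a single leaf switch.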
    
     IBM Platform LSF was used for job submission. It has no impact on performance.
       Information can be found at: http://www.ibm.com

                           Base Compiler Invocation
                           ------------------------
C benchmarks: 
     mpiicc

C++ benchmarks:

    126.lammps: mpiicpc

Fortran benchmarks: 
     mpiifort

Benchmarks using both Fortran and C: 
     mpiicc mpiifort


                            Base Portability Flags
                            ----------------------
      121.pop2: -DSPEC_MPI_CASE_FLAG
    126.lammps: -DMPICH_IGNORE_CXX_SEEK
      127.wrf2: -DSPEC_MPI_CASE_FLAG -DSPEC_MPI_LINUX
   130.socorro: -assume nostd_intent_in


                           Base Optimization Flags
                           -----------------------
C benchmarks: 
     -O3 -xCORE-AVX2 -no-prec-div

C++ benchmarks:

    126.lammps: -O3 -xCORE-AVX2 -no-prec-div

Fortran benchmarks: 
     -O3 -xCORE-AVX2 -no-prec-div

Benchmarks using both Fortran and C: 
     -O3 -xCORE-AVX2 -no-prec-div


The flags file that was used to format this result can be browsed at
http://www.spec.org/mpi2007/flags/EM64T_Intel140_flags.20140908.html

You can also download the XML flags source by saving the following link:
http://www.spec.org/mpi2007/flags/EM64T_Intel140_flags.20140908.xml

    SPEC and SPEC MPI are registered trademarks of the Standard
    Performance Evaluation Corporation.  All other brand and product names
    appearing in this result are trademarks or registered trademarks of
    their respective holders.
-----------------------------------------------------------------------------
For questions about this result, please contact the tester.
For other inquiries, please contact webmaster@spec.org.
Copyright 2006-2010 Standard Performance Evaluation Corporation
Tested with SPEC MPI2007 v2.0.1.
Report generated on Wed Jun 10 11:39:54 2015 by MPI2007 ASCII formatter v1463.
Originally published on 10 June 2015.