SPEChpc(TM) 2021 Tiny Result
                                                   Intel
                                Hatch: Intel Server D50DNP1SB (Xeon Platinum
                                                   8480+)

                hpc2021 License: 13                                      Test date: Apr-2025
                Test sponsor: Intel                          Hardware availability: Jan-2023
                Tested by:    Intel                          Software availability: Mar-2025

               Base   Base    Thrds   Base       Base         Peak   Peak   Thrds    Peak       Peak
Benchmarks     Model  Ranks  pr Rnk   Run Time   Ratio        Model  Ranks  pr Rnk   Run Time   Ratio
-------------- ------ ------  ------  ---------  ---------    ------ ------  ------  ---------  ---------   
505.lbm_t         TGT      6       1       17.1      131    S    TGT      6       1       16.5      137    S 
505.lbm_t         TGT      6       1       16.7      135    *    TGT      6       1       16.0      141    * 
505.lbm_t         TGT      6       1       16.7      135    S    TGT      6       1       16.0      141    S 
513.soma_t        TGT      6       1       27.5      135    S    TGT      6       1       27.6      134    S 
513.soma_t        TGT      6       1       27.5      134    S    TGT      6       1       27.4      135    S 
513.soma_t        TGT      6       1       27.5      135    *    TGT      6       1       27.5      135    * 
518.tealeaf_t     TGT      6       1       51.7       31.9  S    TGT      6       1       51.7       31.9  S 
518.tealeaf_t     TGT      6       1       51.6       32.0  *    TGT      6       1       51.6       32.0  * 
518.tealeaf_t     TGT      6       1       51.3       32.1  S    TGT      6       1       51.3       32.1  S 
519.clvleaf_t     TGT      6       1       34.2       48.2  S    TGT      6       1       34.2       48.2  S 
519.clvleaf_t     TGT      6       1       34.2       48.3  *    TGT      6       1       34.2       48.3  * 
519.clvleaf_t     TGT      6       1       34.1       48.4  S    TGT      6       1       34.1       48.4  S 
521.miniswp_t     TGT      6       1       61.2       26.1  S    TGT      6       1       57.3       27.9  * 
521.miniswp_t     TGT      6       1       61.2       26.1  *    TGT      6       1       57.5       27.8  S 
521.miniswp_t     TGT      6       1       61.4       26.1  S    TGT      6       1       57.2       28.0  S 
528.pot3d_t       TGT      6       1       44.3       48.0  S    TGT      6       1       42.7       49.7  S 
528.pot3d_t       TGT      6       1       44.4       47.9  *    TGT      6       1       43.0       49.4  S 
528.pot3d_t       TGT      6       1       44.4       47.8  S    TGT      6       1       42.9       49.5  * 
532.sph_exa_t     TGT      6       1       77.4       25.2  S    TGT      6       1       74.8       26.1  S 
532.sph_exa_t     TGT      6       1       77.3       25.2  *    TGT      6       1       75.1       26.0  * 
532.sph_exa_t     TGT      6       1       76.9       25.3  S    TGT      6       1       75.1       26.0  S 
534.hpgmgfv_t     TGT      6       1       81.0       14.5  S    TGT      6       1       42.5       27.7  S 
534.hpgmgfv_t     TGT      6       1       79.7       14.7  S    TGT      6       1       42.6       27.6  * 
534.hpgmgfv_t     TGT      6       1       80.3       14.6  *    TGT      6       1       43.2       27.2  S 
535.weather_t     TGT      6       1       26.1      123    S    TGT      6       1       26.1      123    S 
535.weather_t     TGT      6       1       26.1      123    *    TGT      6       1       26.1      123    * 
535.weather_t     TGT      6       1       26.1      124    S    TGT      6       1       26.1      124    S 
============================================================================================================
505.lbm_t         TGT      6       1       16.7      135    *    TGT      6       1       16.0      141    * 
513.soma_t        TGT      6       1       27.5      135    *    TGT      6       1       27.5      135    * 
518.tealeaf_t     TGT      6       1       51.6       32.0  *    TGT      6       1       51.6       32.0  * 
519.clvleaf_t     TGT      6       1       34.2       48.3  *    TGT      6       1       34.2       48.3  * 
521.miniswp_t     TGT      6       1       61.2       26.1  *    TGT      6       1       57.3       27.9  * 
528.pot3d_t       TGT      6       1       44.4       47.9  *    TGT      6       1       42.9       49.5  * 
532.sph_exa_t     TGT      6       1       77.3       25.2  *    TGT      6       1       75.1       26.0  * 
534.hpgmgfv_t     TGT      6       1       80.3       14.6  *    TGT      6       1       42.6       27.6  * 
535.weather_t     TGT      6       1       26.1      123    *    TGT      6       1       26.1      123    * 
 SPEChpc 2021_tny_base                                48.9
 SPEChpc 2021_tny_peak                                                                               53.5


                                             BENCHMARK DETAILS
                                             -----------------
      Type of System: Homogenous Cluster
  Compute Nodes Used: 1
         Total Chips: 2
         Total Cores: 112
       Total Threads: 224
        Total Memory: 1 TB
  Total Accelerators: 3
   Max. Peak Threads: 1
            Compiler: Intel oneAPI Compiler 2025.1.0
         MPI Library: Intel MPI Library 2021.15 for Linux OS
      Other MPI Info: None
      Other Software: None
 Base Parallel Model: TGT
      Base Ranks Run: 6
    Base Threads Run: 1
Peak Parallel Models: TGT 
  Minimum Peak Ranks: 6
  Maximum Peak Ranks: 6
   Max. Peak Threads: 1
   Min. Peak Threads: 1

                       Node Description: Intel Server D50DNP1SB (Xeon Platinum 8480+)
                       ==============================================================


                                                  HARDWARE
                                                  --------
     Number of nodes: 1
    Uses of the node: Compute
              Vendor: Intel
               Model: Intel Server D50DNP1SB (2 x Intel Xeon
                      Platinum 8480+, 2.0GHz)
            CPU Name: Intel Xeon Platinum 8480+
    CPU(s) orderable: 1, 2 chips
       Chips enabled: 2
       Cores enabled: 112
      Cores per chip: 56
    Threads per core: 2
 CPU Characteristics: Turbo Boost Technology up to 3.8 GHz
             CPU MHz: 2000
       Primary Cache: 32 KB I + 48 KB D on chip per core
     Secondary Cache: 2 MB I+D on chip per core
            L3 Cache: 105 MB I+D on chip per chip
         Other Cache: None
              Memory: 1 TB (16x64 GB DDR5 2Rx4 PC5-4800B-R)
      Disk Subsystem: 1 x 1 1TB NVMe M.2 INTEL SSDPELKX010T8
      Other Hardware: None
         Accel Count: 3
         Accel Model: Intel Data Center GPU Max 1550
        Accel Vendor: Intel
          Accel Type: GPU
    Accel Connection: PCIe Gen5 x16
   Accel ECC enabled: yes
   Accel Description: Intel Data Center GPU Max 1550
             Adapter: Mellanox ConnectX-6 HDR
  Number of Adapters: 1
           Slot Type: PCI-Express 4.0 x16
           Data Rate: 200Gbit/s
          Ports Used: 1
   Interconnect Type: Mellanox HDR


                                                  SOFTWARE
                                                  --------
  Accelerator Driver: 25.05.32567
             Adapter: Mellanox ConnectX-6 HDR
    Adapter Firmware: 20.38.1900
    Operating System: SUSE Linux Enterprise Server 15 SP6
                      6.4.0-150600.23.42-default
   Local File System: lustre
  Shared File System: LUSTRE FS
        System State: Run level 5
      Other Software: None


                                   Interconnect Description: Mellanox HDR
                                   ======================================


                                                  HARDWARE
                                                  --------
              Vendor: Mellanox
               Model: Mellanox HDR
        Switch Model: Mellanox Technologies MT28908 Family
                      InfiniBand Switch
  Number of Switches: 12
     Number of Ports: 40
           Data Rate: 200 Gbit/s
            Firmware: 20.38.1900
            Topology: Fat-tree
         Primary Use: MPI Traffic, LustreFS traffic


                                                  SOFTWARE
                                                  --------



                                                Submit Notes
                                                ------------
    The config file option 'submit' was used.

                                               General Notes
                                               -------------
    Environment variables set by runhpc before the start of the run:
    LIBOMPTARGET_LEVEL_ZERO_USE_IMMEDIATE_COMMAND_LIST = "all"
    I_MPI_FABRICS=shm:ofi
    I_MPI_OFFLOAD=1
    I_MPI_OFFLOAD_CELL=tile
    I_MPI_OFFLOAD_TOPOLIB=level_zero
    I_MPI_OFFLOAD_CELL_LIST=0,1,2,3,4,5,6,7
    For the following tests src.alt was used in PEAK:
    513 521 528 532 534

                                               Platform Notes
                                               --------------
     Device Vendor                                   Intel
     Device Version                                  OpenCL 3.0 NEO
     Driver Version                                  25.05.32567
     Base clock                                      900MHz
     Max clock frequency                             1600MHz
     Tiles                                           2
     Slices per Tile                                 1
     Max compute units per Tile                      512
     Sub-slices per slice                            64
     EUs per sub-slice                               8
     Threads per EU                                  8
     Max work item dimensions                        3
     Max work item sizes                             1024x1024x1024
     Max work group size                             1024
     Preferred work group size multiple              32
     Max sub-groups per work group                   64
     Sub-group sizes                                 16, 32
     L1 Cache per EU                                 65536
     L2 cache size                                   427819008
     Global memory size                              137438953472
     Address bits                                    64, Little-Endian

                                          Base Compiler Invocation
                                          ------------------------
C benchmarks: 
     mpiicc -cc=icx

C++ benchmarks: 
     mpiicpc -cxx=icpx

Fortran benchmarks: 
     mpiifort -fc=ifx


                                           Base Portability Flags
                                           ----------------------
     505.lbm_t: -DUSE_MPI
    513.soma_t: -DUSE_MPI -DSPEC_NO_VAR_ARRAY_REDUCE
 518.tealeaf_t: -DUSE_MPI
 519.clvleaf_t: -DUSE_MPI
   528.pot3d_t: -DUSE_MPI
 535.weather_t: -DUSE_MPI


                                          Base Optimization Flags
                                          -----------------------
C benchmarks: 
     -O3 -xCORE-AVX512 -flto -mprefer-vector-width=512 -ffast-math -fiopenmp
     -fopenmp-targets=spir64_gen -ftarget-register-alloc-mode=pvc:auto
     -Xopenmp-target-backend '-device pvc -revision_id 0x2f' -DSPEC_COLLAPSE
     -fopenmp-optimistic-collapse

C++ benchmarks: 
     -O3 -xCORE-AVX512 -flto -mprefer-vector-width=512 -ffast-math -fiopenmp
     -fopenmp-targets=spir64_gen -ftarget-register-alloc-mode=pvc:auto
     -Xopenmp-target-backend '-device pvc -revision_id 0x2f' -DSPEC_COLLAPSE

Fortran benchmarks: 
     -DSPEC_COLLAPSE -O3 -xCORE-AVX512 -flto -mprefer-vector-width=512
     -ffast-math -fiopenmp -fopenmp-targets=spir64_gen
     -ftarget-register-alloc-mode=pvc:auto
     -Xopenmp-target-backend '-device pvc -revision_id 0x2f'
     -DSPEC_ACCEL_AWARE_MPI -fopenmp-target-loopopt


                                              Base Other Flags
                                              ----------------
Fortran benchmarks:

   528.pot3d_t: -Wno-incompatible-function-pointer-types


                                          Peak Compiler Invocation
                                          ------------------------
C benchmarks: 
     mpiicc -cc=icx

C++ benchmarks: 
     mpiicpc -cxx=icpx

Fortran benchmarks: 
     mpiifort -fc=ifx


                                           Peak Portability Flags
                                           ----------------------
     505.lbm_t: -DUSE_MPI
    513.soma_t: -DUSE_MPI -DSPEC_NO_VAR_ARRAY_REDUCE
 518.tealeaf_t: -DUSE_MPI
 519.clvleaf_t: -DUSE_MPI
   528.pot3d_t: -DUSE_MPI
 535.weather_t: -DUSE_MPI


                                          Peak Optimization Flags
                                          -----------------------
C benchmarks:

     505.lbm_t: -O3 -xCORE-AVX512 -flto -mprefer-vector-width=512 -ffast-math
                -fiopenmp -fopenmp-targets=spir64_gen
                -ftarget-register-alloc-mode=pvc:large
                -Xopenmp-target-backend '-device pvc -revision_id 0x2f'
                -DSPEC_COLLAPSE -fopenmp-optimistic-collapse

    513.soma_t: -O3 -xCORE-AVX512 -flto -mprefer-vector-width=512 -ffast-math
                -fiopenmp -fopenmp-targets=spir64_gen
                -ftarget-register-alloc-mode=pvc:auto
                -Xopenmp-target-backend '-device pvc -revision_id 0x2f'
                -DSPEC_COLLAPSE -fopenmp-optimistic-collapse

 518.tealeaf_t: basepeak = yes

 521.miniswp_t: -O3 -xCORE-AVX512 -flto -mprefer-vector-width=512 -ffast-math
                -fiopenmp -fopenmp-targets=spir64_gen
                -ftarget-register-alloc-mode=pvc:small
                -Xopenmp-target-backend '-device pvc -revision_id 0x2f'
                -DSPEC_COLLAPSE -fopenmp-optimistic-collapse

 534.hpgmgfv_t: -O3 -xCORE-AVX512 -flto -mprefer-vector-width=512 -ffast-math
                -fiopenmp -fopenmp-targets=spir64_gen
                -ftarget-register-alloc-mode=pvc:auto
                -Xopenmp-target-backend '-device pvc -revision_id 0x2f'
                -DSPEC_COLLAPSE -DSPEC_ACCEL_AWARE_MPI
                -fopenmp-optimistic-collapse

C++ benchmarks: 
     -O3 -xCORE-AVX512 -flto -mprefer-vector-width=512 -ffast-math -fiopenmp
     -fopenmp-targets=spir64_gen -ftarget-register-alloc-mode=pvc:auto
     -Xopenmp-target-backend '-device pvc -revision_id 0x2f' -DSPEC_COLLAPSE

Fortran benchmarks:

 519.clvleaf_t: basepeak = yes

   528.pot3d_t: -DSPEC_COLLAPSE -O3 -xCORE-AVX512 -flto
                -mprefer-vector-width=512 -ffast-math -fiopenmp
                -fopenmp-targets=spir64_gen
                -ftarget-register-alloc-mode=pvc:auto
                -Xopenmp-target-backend '-device pvc -revision_id 0x2f'
                -DSPEC_ACCEL_AWARE_MPI -fopenmp-target-loopopt

 535.weather_t: basepeak = yes


                                              Peak Other Flags
                                              ----------------
Fortran benchmarks:

   528.pot3d_t: -Wno-incompatible-function-pointer-types


The flags file that was used to format this result can be browsed at
http://www.spec.org/hpc2021/flags/Intel_compiler_flags.2025-05-27.html

You can also download the XML flags source by saving the following link:
http://www.spec.org/hpc2021/flags/Intel_compiler_flags.2025-05-27.xml

  SPEChpc is a trademark of the Standard Performance Evaluation
    Corporation.  All other brand and product names appearing in this
    result are trademarks or registered trademarks of their respective
    holders.
-------------------------------------------------------------------------------------------------------------
For questions about this result, please contact the tester.
For other inquiries, please contact info@spec.org.
Copyright 2021-2025 Standard Performance Evaluation Corporation
Tested with SPEChpc2021 v1.1.9 on 2025-04-14 07:46:48-0400.
Report generated on 2025-05-27 16:43:13 by hpc2021 ASCII formatter v112.
Originally published on 2025-05-21.