SPEChpc(TM) 2021 Small Result
                                                   Intel
                                Hatch: Intel Server D50DNP1SB (Xeon Platinum
                                                   8480+)

                hpc2021 License: 13                                      Test date: Apr-2025
                Test sponsor: Intel                          Hardware availability: Jan-2023
                Tested by:    Intel                          Software availability: Mar-2025

               Base   Base    Thrds   Base       Base         Peak   Peak   Thrds    Peak       Peak
Benchmarks     Model  Ranks  pr Rnk   Run Time   Ratio        Model  Ranks  pr Rnk   Run Time   Ratio
-------------- ------ ------  ------  ---------  ---------    ------ ------  ------  ---------  ---------   
605.lbm_s         TGT      8       1       82.8      18.7   S    TGT      8       1       78.2      19.8   S 
605.lbm_s         TGT      8       1       82.5      18.8   S    TGT      8       1       78.5      19.8   * 
605.lbm_s         TGT      8       1       82.5      18.8   *    TGT      8       1       79.0      19.6   S 
613.soma_s        TGT      8       1       76.4      20.9   S    TGT      8       1       75.4      21.2   S 
613.soma_s        TGT      8       1       76.9      20.8   *    TGT      8       1       75.1      21.3   S 
613.soma_s        TGT      8       1       77.1      20.8   S    TGT      8       1       75.2      21.3   * 
618.tealeaf_s     TGT      8       1      466         4.40  S    TGT      8       1      386         5.31  * 
618.tealeaf_s     TGT      8       1      471         4.36  S    TGT      8       1      384         5.34  S 
618.tealeaf_s     TGT      8       1      470         4.36  *    TGT      8       1      388         5.29  S 
619.clvleaf_s     TGT      8       1      223         7.42  S    TGT      8       1      222         7.42  S 
619.clvleaf_s     TGT      8       1      223         7.39  *    TGT      8       1      222         7.42  * 
619.clvleaf_s     TGT      8       1      223         7.39  S    TGT      8       1      222         7.45  S 
621.miniswp_s     TGT      8       1      123         8.94  S    TGT      8       1      117         9.39  S 
621.miniswp_s     TGT      8       1      124         8.86  *    TGT      8       1      118         9.35  * 
621.miniswp_s     TGT      8       1      124         8.85  S    TGT      8       1      119         9.28  S 
628.pot3d_s       TGT      8       1      236         7.09  S    TGT      8       1      235         7.12  * 
628.pot3d_s       TGT      8       1      238         7.05  S    TGT      8       1      241         6.96  S 
628.pot3d_s       TGT      8       1      236         7.09  *    TGT      8       1      235         7.14  S 
632.sph_exa_s     TGT      8       1      391         5.89  S    TGT      8       1      361         6.37  S 
632.sph_exa_s     TGT      8       1      390         5.89  *    TGT      8       1      361         6.37  * 
632.sph_exa_s     TGT      8       1      390         5.90  S    TGT      8       1      359         6.40  S 
634.hpgmgfv_s     TGT      8       1      229         4.26  *    TGT      8       1      151         6.47  S 
634.hpgmgfv_s     TGT      8       1      233         4.19  S    TGT      8       1      151         6.47  * 
634.hpgmgfv_s     TGT      8       1      225         4.33  S    TGT      8       1      153         6.37  S 
635.weather_s     TGT      8       1      153        17.0   S    TGT      8       1      153        17.0   S 
635.weather_s     TGT      8       1      151        17.2   S    TGT      8       1      151        17.2   S 
635.weather_s     TGT      8       1      152        17.2   *    TGT      8       1      152        17.2   * 
============================================================================================================
605.lbm_s         TGT      8       1       82.5      18.8   *    TGT      8       1       78.5      19.8   * 
613.soma_s        TGT      8       1       76.9      20.8   *    TGT      8       1       75.2      21.3   * 
618.tealeaf_s     TGT      8       1      470         4.36  *    TGT      8       1      386         5.31  * 
619.clvleaf_s     TGT      8       1      223         7.39  *    TGT      8       1      222         7.42  * 
621.miniswp_s     TGT      8       1      124         8.86  *    TGT      8       1      118         9.35  * 
628.pot3d_s       TGT      8       1      236         7.09  *    TGT      8       1      235         7.12  * 
632.sph_exa_s     TGT      8       1      390         5.89  *    TGT      8       1      361         6.37  * 
634.hpgmgfv_s     TGT      8       1      229         4.26  *    TGT      8       1      151         6.47  * 
635.weather_s     TGT      8       1      152        17.2   *    TGT      8       1      152        17.2   * 
 SPEChpc 2021_sml_base                                8.87
 SPEChpc 2021_sml_peak                                                                               9.73
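
    The summary rows and the two overall metrics above can be reproduced from the per-run rows.
    The sketch below assumes the usual SPEC convention: the selected run (marked *) is the median
    of the three runs, and the overall metric is the geometric mean of the selected ratios. The
    function names are illustrative and are not part of the SPEC tools. The inputs are the
    rounded ratios printed in this report, so the last digit may differ from the published value.

```python
import math
import statistics

# Selected (median) ratios, copied from the summary rows of this report.
BASE = {
    "605.lbm_s": 18.8, "613.soma_s": 20.8, "618.tealeaf_s": 4.36,
    "619.clvleaf_s": 7.39, "621.miniswp_s": 8.86, "628.pot3d_s": 7.09,
    "632.sph_exa_s": 5.89, "634.hpgmgfv_s": 4.26, "635.weather_s": 17.2,
}
PEAK = {
    "605.lbm_s": 19.8, "613.soma_s": 21.3, "618.tealeaf_s": 5.31,
    "619.clvleaf_s": 7.42, "621.miniswp_s": 9.35, "628.pot3d_s": 7.12,
    "632.sph_exa_s": 6.37, "634.hpgmgfv_s": 6.47, "635.weather_s": 17.2,
}


def select_median(ratios):
    """Return the median of an odd number of run ratios (the row marked '*')."""
    return statistics.median(ratios)


def overall_metric(selected):
    """Return the geometric mean of the selected per-benchmark ratios."""
    logs = [math.log(r) for r in selected.values()]
    return math.exp(sum(logs) / len(logs))


# 618.tealeaf_s base runs from the per-run rows: 4.40, 4.36, 4.36.
print("618.tealeaf_s base median:", select_median([4.40, 4.36, 4.36]))
print("base overall (approx.):", round(overall_metric(BASE), 2))
print("peak overall (approx.):", round(overall_metric(PEAK), 2))
```

    Working in log space keeps the geometric mean numerically stable and makes it clear that
    each benchmark contributes equally, regardless of the size of its ratio.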


                                             BENCHMARK DETAILS
                                             -----------------
      Type of System: Homogeneous Cluster
  Compute Nodes Used: 1
         Total Chips: 2
         Total Cores: 112
       Total Threads: 224
        Total Memory: 1 TB
   Max. Peak Threads: 1
            Compiler: Intel oneAPI Compiler 2025.1.0
         MPI Library: Intel MPI Library 2021.15 for Linux OS
      Other MPI Info: None
      Other Software: None
 Base Parallel Model: TGT
      Base Ranks Run: 8
    Base Threads Run: 1
Peak Parallel Models: TGT 
  Minimum Peak Ranks: 8
  Maximum Peak Ranks: 8
   Max. Peak Threads: 1
   Min. Peak Threads: 1

                       Node Description: Intel Server D50DNP1SB (Xeon Platinum 8480+)
                       ==============================================================


                                                  HARDWARE
                                                  --------
     Number of nodes: 1
    Uses of the node: Compute
              Vendor: Intel
               Model: Intel Server D50DNP1SB (2 x Intel Xeon
                      Platinum 8480+, 2.0GHz)
            CPU Name: Intel Xeon Platinum 8480+
    CPU(s) orderable: 1, 2 chips
       Chips enabled: 2
       Cores enabled: 112
      Cores per chip: 56
    Threads per core: 2
 CPU Characteristics: Turbo Boost Technology up to 3.8 GHz
             CPU MHz: 2000
       Primary Cache: 32 KB I + 48 KB D on chip per core
     Secondary Cache: 2 MB I+D on chip per core
            L3 Cache: 105 MB I+D on chip per chip
         Other Cache: None
              Memory: 1 TB (16x64 GB DDR5 2Rx4 PC5-4800B-R)
      Disk Subsystem: 1 x 1 TB NVMe M.2 INTEL SSDPELKX010T8
      Other Hardware: None
         Accel Count: 4
         Accel Model: Intel Data Center GPU Max 1550
        Accel Vendor: Intel
          Accel Type: GPU
    Accel Connection: PCIe Gen5 x16
   Accel ECC enabled: yes
   Accel Description: Intel Data Center GPU Max 1550
             Adapter: Mellanox ConnectX-6 HDR
  Number of Adapters: 1
           Slot Type: PCI-Express 4.0 x16
           Data Rate: 200 Gbit/s
          Ports Used: 1
   Interconnect Type: Mellanox HDR
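
    The core, thread, and memory totals reported under BENCHMARK DETAILS follow from the
    per-chip and per-DIMM figures in this section. A minimal sketch of that arithmetic, with
    illustrative variable and function names:

```python
# Values copied from the node HARDWARE section of this report.
CHIPS_ENABLED = 2
CORES_PER_CHIP = 56
THREADS_PER_CORE = 2
DIMM_COUNT = 16
DIMM_SIZE_GB = 64


def node_totals(chips, cores_per_chip, threads_per_core):
    """Return (total cores, total hardware threads) for one node."""
    cores = chips * cores_per_chip
    return cores, cores * threads_per_core


cores, threads = node_totals(CHIPS_ENABLED, CORES_PER_CHIP, THREADS_PER_CORE)
memory_gb = DIMM_COUNT * DIMM_SIZE_GB

print(cores, threads, memory_gb)  # 112 224 1024
```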


                                                  SOFTWARE
                                                  --------
  Accelerator Driver: 25.05.32567
             Adapter: Mellanox ConnectX-6 HDR
    Adapter Firmware: 20.38.1900
    Operating System: SUSE Linux Enterprise Server 15 SP6
                      6.4.0-150600.23.42-default
   Local File System: Lustre
  Shared File System: Lustre FS
        System State: Run level 5
      Other Software: None


                                   Interconnect Description: Mellanox HDR
                                   ======================================


                                                  HARDWARE
                                                  --------
              Vendor: Mellanox
               Model: Mellanox HDR
        Switch Model: Mellanox Technologies MT28908 Family
                      InfiniBand Switch
  Number of Switches: 12
     Number of Ports: 40
           Data Rate: 200 Gbit/s
            Firmware: 20.38.1900
            Topology: Fat-tree
         Primary Use: MPI Traffic, LustreFS traffic


                                                  SOFTWARE
                                                  --------



                                                Submit Notes
                                                ------------
    The config file option 'submit' was used.

                                               General Notes
                                               -------------
    Environment variables set by runhpc before the start of the run:
    LIBOMPTARGET_LEVEL_ZERO_USE_IMMEDIATE_COMMAND_LIST = "all"
    I_MPI_FABRICS=shm:ofi
    I_MPI_OFFLOAD=1
    I_MPI_OFFLOAD_CELL=tile
    I_MPI_OFFLOAD_TOPOLIB=level_zero
    I_MPI_OFFLOAD_CELL_LIST=0,1,2,3,4,5,6,7
    For the following tests src.alt was used in PEAK:
    613 618 619 621 628 632 634
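
    The offload cell list above has one entry per MPI rank. The sketch below checks that
    accounting against the accelerator inventory in this report (4 GPUs, 2 tiles each). The
    helper name cells_from_env is hypothetical, and reading I_MPI_OFFLOAD_CELL=tile as "one
    rank per GPU tile" is an assumption made for illustration, not a statement taken from the
    Intel MPI documentation.

```python
# Environment settings copied from the General Notes of this report.
ENV = {
    "I_MPI_FABRICS": "shm:ofi",
    "I_MPI_OFFLOAD": "1",
    "I_MPI_OFFLOAD_CELL": "tile",
    "I_MPI_OFFLOAD_TOPOLIB": "level_zero",
    "I_MPI_OFFLOAD_CELL_LIST": "0,1,2,3,4,5,6,7",
}

ACCEL_COUNT = 4        # "Accel Count" in the HARDWARE section
TILES_PER_ACCEL = 2    # "Tiles" in the Platform Notes
BASE_RANKS = 8         # "Base Ranks Run" in BENCHMARK DETAILS


def cells_from_env(env):
    """Parse the comma-separated cell list into a list of integers."""
    return [int(cell) for cell in env["I_MPI_OFFLOAD_CELL_LIST"].split(",")]


cells = cells_from_env(ENV)
total_tiles = ACCEL_COUNT * TILES_PER_ACCEL

# Assumption: each rank is bound to exactly one tile-level cell.
print("cells listed:", len(cells))
print("tiles available:", total_tiles)
print("one cell per rank:", len(cells) == total_tiles == BASE_RANKS)
```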

                                               Platform Notes
                                               --------------
     Device Vendor                                   Intel
     Device Version                                  OpenCL 3.0 NEO
     Driver Version                                  25.05.32567
     Base clock                                      900MHz
     Max clock frequency                             1600MHz
     Tiles                                           2
     Slices per Tile                                 1
     Max compute units per Tile                      512
     Sub-slices per slice                            64
     EUs per sub-slice                               8
     Threads per EU                                  8
     Max work item dimensions                        3
     Max work item sizes                             1024x1024x1024
     Max work group size                             1024
     Preferred work group size multiple              32
     Max sub-groups per work group                   64
     Sub-group sizes                                 16, 32
     L1 Cache per EU                                 65536
     L2 cache size                                   427819008
     Global memory size                              137438953472
     Address bits                                    64, Little-Endian
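
    The raw device figures above are easier to read after conversion. The sketch below assumes
    the cache and memory sizes are reported in bytes (the report does not state a unit) and
    derives the execution unit (EU) counts from the tile, slice, and sub-slice figures. The
    helper name to_binary_units is illustrative.

```python
# Values copied from the Platform Notes of this report.
TILES = 2
SLICES_PER_TILE = 1
SUBSLICES_PER_SLICE = 64
EUS_PER_SUBSLICE = 8
THREADS_PER_EU = 8

L1_PER_EU = 65536
L2_SIZE = 427819008
GLOBAL_MEM = 137438953472


def to_binary_units(n_bytes):
    """Format a byte count using binary prefixes (KiB, MiB, GiB, TiB)."""
    value = float(n_bytes)
    for unit in ("B", "KiB", "MiB", "GiB", "TiB"):
        if value < 1024 or unit == "TiB":
            return f"{value:g} {unit}"
        value /= 1024


eus_per_tile = SLICES_PER_TILE * SUBSLICES_PER_SLICE * EUS_PER_SUBSLICE
eus_per_device = TILES * eus_per_tile
hw_threads_per_device = eus_per_device * THREADS_PER_EU

print(eus_per_tile, eus_per_device, hw_threads_per_device)  # 512 1024 8192
print(to_binary_units(L1_PER_EU))    # 64 KiB
print(to_binary_units(L2_SIZE))      # 408 MiB
print(to_binary_units(GLOBAL_MEM))   # 128 GiB
```

    The derived 512 EUs per tile matches the "Max compute units per Tile" line above.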

                                           Compiler Version Notes
                                           ----------------------
    ==============================================================================
     CXXC 632.sph_exa_s(base, peak)

    ------------------------------------------------------------------------------
    Intel(R) oneAPI DPC++/C++ Compiler 2025.1.0 (2025.1.0.20250317)
    Target: x86_64-unknown-linux-gnu
    Thread model: posix
    InstalledDir:
      /lfs/lfs17/mknyazev/intel/2025.1/oneapi/compiler/2025.1/bin/compiler
    Configuration file:
      /lfs/lfs17/mknyazev/intel/2025.1/oneapi/compiler/2025.1/bin/compiler/../icpx.cfg
    ------------------------------------------------------------------------------
    
    ==============================================================================
     CC  605.lbm_s(base, peak) 613.soma_s(base, peak) 618.tealeaf_s(base, peak)
          621.miniswp_s(base, peak) 634.hpgmgfv_s(base, peak)
    ------------------------------------------------------------------------------
    Intel(R) oneAPI DPC++/C++ Compiler 2025.1.0 (2025.1.0.20250317)
    Target: x86_64-unknown-linux-gnu
    Thread model: posix
    InstalledDir:
      /lfs/lfs17/mknyazev/intel/2025.1/oneapi/compiler/2025.1/bin/compiler
    Configuration file:
      /lfs/lfs17/mknyazev/intel/2025.1/oneapi/compiler/2025.1/bin/compiler/../icx.cfg
    ------------------------------------------------------------------------------
    
    ==============================================================================
     FC  619.clvleaf_s(base, peak) 635.weather_s(base, peak)

    ------------------------------------------------------------------------------
    ifx (IFX) 2025.1.0 20250317
    Copyright (C) 1985-2025 Intel Corporation. All rights reserved.
    ------------------------------------------------------------------------------
    
    ==============================================================================
     FC  628.pot3d_s(base, peak)

    ------------------------------------------------------------------------------
    ifx: command line warning #10157: ignoring option '-W'; argument is of wrong
      type
    ifx (IFX) 2025.1.0 20250317
    Copyright (C) 1985-2025 Intel Corporation. All rights reserved.
    ------------------------------------------------------------------------------

                                          Base Compiler Invocation
                                          ------------------------
C benchmarks: 
     mpiicc -cc=icx

C++ benchmarks: 
     mpiicpc -cxx=icpx

Fortran benchmarks: 
     mpiifort -fc=ifx


                                           Base Portability Flags
                                           ----------------------
     605.lbm_s: -DUSE_MPI
    613.soma_s: -DUSE_MPI -DSPEC_NO_VAR_ARRAY_REDUCE
 618.tealeaf_s: -DUSE_MPI
 619.clvleaf_s: -DUSE_MPI
   628.pot3d_s: -DUSE_MPI
 635.weather_s: -DUSE_MPI


                                          Base Optimization Flags
                                          -----------------------
C benchmarks: 
     -O3 -xCORE-AVX512 -flto -mprefer-vector-width=512 -ffast-math -fiopenmp
     -fopenmp-targets=spir64_gen -ftarget-register-alloc-mode=pvc:auto
     -Xopenmp-target-backend '-device pvc -revision_id 0x2f' -DSPEC_COLLAPSE
     -fopenmp-optimistic-collapse

C++ benchmarks: 
     -O3 -xCORE-AVX512 -flto -mprefer-vector-width=512 -ffast-math -fiopenmp
     -fopenmp-targets=spir64_gen -ftarget-register-alloc-mode=pvc:auto
     -Xopenmp-target-backend '-device pvc -revision_id 0x2f' -DSPEC_COLLAPSE

Fortran benchmarks: 
     -DSPEC_COLLAPSE -O3 -xCORE-AVX512 -flto -mprefer-vector-width=512
     -ffast-math -fiopenmp -fopenmp-targets=spir64_gen
     -ftarget-register-alloc-mode=pvc:auto
     -Xopenmp-target-backend '-device pvc -revision_id 0x2f'
     -DSPEC_ACCEL_AWARE_MPI -fopenmp-target-loopopt


                                              Base Other Flags
                                              ----------------
Fortran benchmarks:

   628.pot3d_s: -Wno-incompatible-function-pointer-types


                                          Peak Compiler Invocation
                                          ------------------------
C benchmarks: 
     mpiicc -cc=icx

C++ benchmarks: 
     mpiicpc -cxx=icpx

Fortran benchmarks: 
     mpiifort -fc=ifx


                                           Peak Portability Flags
                                           ----------------------
     605.lbm_s: -DUSE_MPI
    613.soma_s: -DUSE_MPI -DSPEC_NO_VAR_ARRAY_REDUCE
 618.tealeaf_s: -DUSE_MPI
 619.clvleaf_s: -DUSE_MPI
   628.pot3d_s: -DUSE_MPI
 635.weather_s: -DUSE_MPI


                                          Peak Optimization Flags
                                          -----------------------
C benchmarks:

     605.lbm_s: -O3 -xCORE-AVX512 -flto -mprefer-vector-width=512 -ffast-math
                -fiopenmp -fopenmp-targets=spir64_gen
                -ftarget-register-alloc-mode=pvc:large
                -Xopenmp-target-backend '-device pvc -revision_id 0x2f'
                -fopenmp-optimistic-collapse -DSPEC_COLLAPSE

    613.soma_s: -O3 -xCORE-AVX512 -flto -mprefer-vector-width=512 -ffast-math
                -fiopenmp -fopenmp-targets=spir64_gen
                -ftarget-register-alloc-mode=pvc:auto
                -Xopenmp-target-backend '-device pvc -revision_id 0x2f'
                -DSPEC_COLLAPSE -fopenmp-optimistic-collapse

 618.tealeaf_s: Same as 605.lbm_s

 621.miniswp_s: Same as 613.soma_s

 634.hpgmgfv_s: -O3 -xCORE-AVX512 -flto -mprefer-vector-width=512 -ffast-math
                -fiopenmp -fopenmp-targets=spir64_gen
                -ftarget-register-alloc-mode=pvc:auto
                -Xopenmp-target-backend '-device pvc -revision_id 0x2f'
                -DSPEC_COLLAPSE -DSPEC_ACCEL_AWARE_MPI
                -fopenmp-optimistic-collapse

C++ benchmarks: 
     -O3 -xCORE-AVX512 -flto -mprefer-vector-width=512 -ffast-math -fiopenmp
     -fopenmp-targets=spir64_gen -ftarget-register-alloc-mode=pvc:auto
     -Xopenmp-target-backend '-device pvc -revision_id 0x2f' -DSPEC_COLLAPSE

Fortran benchmarks:

 619.clvleaf_s: -DSPEC_COLLAPSE -O3 -xCORE-AVX512 -flto
                -mprefer-vector-width=512 -ffast-math -fiopenmp
                -fopenmp-targets=spir64_gen
                -ftarget-register-alloc-mode=pvc:auto
                -Xopenmp-target-backend '-device pvc -revision_id 0x2f'
                -DSPEC_ACCEL_AWARE_MPI -fopenmp-target-loopopt

   628.pot3d_s: Same as 619.clvleaf_s

 635.weather_s: basepeak = yes
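
    For the C benchmarks, the peak flag sets listed above differ from the base set in only a
    few options. The sketch below makes that difference explicit by comparing the flags as
    sets; the function name flag_diff is illustrative. The peak strings are derived from the
    base string here for brevity. The report lists the same flags in a slightly different
    order, which does not affect a set comparison.

```python
import shlex

# Base optimization flags for C benchmarks, copied from this report.
BASE_C = (
    "-O3 -xCORE-AVX512 -flto -mprefer-vector-width=512 -ffast-math -fiopenmp "
    "-fopenmp-targets=spir64_gen -ftarget-register-alloc-mode=pvc:auto "
    "-Xopenmp-target-backend '-device pvc -revision_id 0x2f' -DSPEC_COLLAPSE "
    "-fopenmp-optimistic-collapse"
)

# Peak flag sets as listed in this report (618 matches 605, 621 matches 613).
PEAK_C = {
    "605.lbm_s": BASE_C.replace("pvc:auto", "pvc:large"),
    "613.soma_s": BASE_C,
    "634.hpgmgfv_s": BASE_C + " -DSPEC_ACCEL_AWARE_MPI",
}


def flag_diff(base, peak):
    """Return (flags only in peak, flags only in base), each sorted."""
    # shlex keeps the quoted backend argument together as a single token.
    base_set = set(shlex.split(base))
    peak_set = set(shlex.split(peak))
    return sorted(peak_set - base_set), sorted(base_set - peak_set)


for name, flags in PEAK_C.items():
    added, removed = flag_diff(BASE_C, flags)
    print(f"{name}: added={added} removed={removed}")
```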


                                              Peak Other Flags
                                              ----------------
Fortran benchmarks:

   628.pot3d_s: -Wno-incompatible-function-pointer-types


The flags file that was used to format this result can be browsed at
http://www.spec.org/hpc2021/flags/Intel_compiler_flags.2025-05-22.00.html

You can also download the XML flags source by saving the following link:
http://www.spec.org/hpc2021/flags/Intel_compiler_flags.2025-05-22.00.xml

  SPEChpc is a trademark of the Standard Performance Evaluation
    Corporation.  All other brand and product names appearing in this
    result are trademarks or registered trademarks of their respective
    holders.
-------------------------------------------------------------------------------------------------------------
For questions about this result, please contact the tester.
For other inquiries, please contact info@spec.org.
Copyright 2021-2025 Standard Performance Evaluation Corporation
Tested with SPEChpc2021 v1.1.9 on 2025-04-14 08:37:19-0400.
Report generated on 2025-05-22 11:01:20 by hpc2021 ASCII formatter v1.0.3.
Originally published on 2025-05-21.