# Invocation command line:
# /home/cpu2017/bin/harness/runcpu --configfile amd_speed_aocc400_genoa_B1.cfg --tune all --reportable --iterations 3 --nopower --runmode speed --tune base:peak --size test:train:refspeed intspeed
# output_root was not used for this run
############################################################################
################################################################################
# AMD AOCC 400 SPEC CPU 2017 V1.1.8 Speed Configuration File for 64-bit Linux
#
# File name                : amd_speed_aocc400_genoa_B1.cfg
# Creation Date            : October 6, 2022
# CPU 2017 Version         : 1.1.8
# Supported benchmarks     : All Speed benchmarks (intspeed, fpspeed)
# Compiler name/version    : AOCC 4.0.0
# Operating system version : RHEL 8.6
# Supported OS's           : Ubuntu 22.04, RHEL 8.6/9, SLE 15 SP4
# Hardware                 : AMD Genoa (AMD64)
# FP Base Pointer Size     : 64-bit
# FP Peak Pointer Size     : 64-bit
# INT Base Pointer Size    : 64-bit
# INT Peak Pointer Size    : 64-bit
# Auto Parallelization     : No
#
# Note: DO NOT EDIT THIS FILE, the only edits required to properly run these
# binaries are made in the ini Python file. Please consult Readme.amd_speed_aocc400_genoa_B1.txt
# for a few uncommon exceptions which require edits to this file.
#
# Description:
#
# This binary package automates away many of the complexities necessary to set
# up and run SPEC CPU 2017 under optimized conditions on AMD Genoa-based
# server platforms within Linux (AMD64).
#
# The binary package was built specifically for AMD Genoa microprocessors and
# is not intended to run on other products.
#
# Please install the binary package by following the instructions in
# "Readme.amd_speed_aocc400_genoa_B1.txt" under the "How To Use the Binaries" section.
#
# The binary package is designed to work without alteration on one socket AMD
# Genoa-based servers with 96 cores, SMT enabled and 768 (64x12) GB of DDR5
# memory distributed evenly among all 12 channels using 64 GiB DIMMs.
#
# To run the binary package on other Genoa configurations, please review
# "Readme.amd_speed_aocc400_genoa_B1.txt". In general, Genoa CPUs
# should be autodetected with no action required by the user.
#
# In most cases, it should be unnecessary to edit "amd_speed_aocc400_genoa_B1.cfg" or any
# other file besides "ini_amd_speed_aocc400_genoa_B1.py" where reporting fields
# and run conditions are set.
#
# The run script automatically sets the optimal number of speed copies and binds
# them appropriately.
#
# The run script and accompanying binary package are designed to work on Ubuntu
# 22.04, RHEL 8.6/9, and SLE 15 SP4.
#
# Important! If you write your own run script, please set the stack size to
# "unlimited" when executing this binary package. Failure to do so may cause
# some benchmarks to overflow the stack. For example, to set stack size within
# the bash shell, include the following line somewhere at the top of your run
# script before the runcpu invocation:
#
#     ulimit -s unlimited
#
# Modification of this config file should only be necessary if you intend to
# rebuild the binaries. General instructions for rebuilding the binaries are
# found in-line below.
#
################################################################################
# Modifiable macros:
################################################################################
# "allow_build" switch:
# Change the following line to true if you intend to REBUILD the binaries (AMD
# does not support this). Valid values are "true" or "false" (no quotes).
%define allow_build false

# Only change these macros if you are rebuilding the binary package:
%define compiler_name aocc400
%define binary_package_name amd_speed_%{compiler_name}_genoa_B
%define binary_package_ext %{binary_package_name}
%define binary_package_revision 1
%define build_path ${SPEC}
%define flags_file_name %{compiler_name}-flags.xml
# Do NOT change build_lib_dir after the build or it will trigger a
# rebuild of the xalanc.
# It should also remain literal:
%define build_lib_dir amd_speed_aocc400_genoa_B_lib

# To enable the platform file, be sure to uncomment the flagsurl02 header line
# below in the Header settings.
%define platform_file_name INVALID_platform_%{binary_package_name}.xml
################################################################################
# You should never have to change binary_package_full_name:
%define binary_package_full_name %{binary_package_name}%{binary_package_revision}
################################################################################
# Include file name
################################################################################
# The include file contains fields that are commonly changed. This file is
# auto-generated based upon INI file settings and should not need user
# modification for runs.
%define inc_file_name %{binary_package_full_name}.inc
%define flags_inc_file_name %{binary_package_full_name}_flags.inc

# Binary label extension:
# Only modify the binary label extension if you plan to rebuild the binaries.
# If you plan to recompile these CPU 2017 binaries, please choose a new extension
# name below to avoid confusion with the current binary set on your system
# under test, and to avoid confusion for SPEC submission reviewers. You will
# also need to set "allow_build" to true above. Finally, you must modify the
# Paths section below to point to your library locations if the paths are not
# already set up in your build environment.
# Note that AMD calls an external script to set up the compiler and library
# paths before initiating the build.
%define ext %{binary_package_ext}
################################################################################
# Paths and Environment Variables
# ** MODIFY AS NEEDED (modification should not be necessary for runs) **
################################################################################

# Allow environment variables to be set before runs:
preenv = 1

# retain:true is necessary to avoid gcc out-of-memory exceptions on certain SUTs:
# oversize_threshold is required to support jemalloc 5.2.x+
preENV_MALLOC_CONF = oversize_threshold:0,retain:true
preENV_LIBOMP_NUM_HIDDEN_HELPER_THREADS = 0

# OpenMP environment variables:
preENV_OMP_SCHEDULE = static
preENV_OMP_DYNAMIC = false
preENV_OMP_STACKSIZE = 128M

# Define the name of the directory that holds AMD library files:
%define lib_dir %{binary_package_name}_lib

# Set the shared object library path for runs and builds:
preENV_LD_LIBRARY_PATH = $[top]/%{lib_dir}/lib:%{ENV_LD_LIBRARY_PATH}

%if '%{allow_build}' eq 'false'
# The include file is only needed for runs, but not for builds.
# include: %{inc_file_name}
# ----- Begin inclusion of 'amd_speed_aocc400_genoa_B1.inc'
############################################################################
################################################################################
################################################################################
# File name: amd_speed_aocc400_genoa_B1.inc
# File generation code date: October 11, 2022
# File generation date/time: November 21, 2023 / 10:43:58
#
# This file is automatically generated during a SPEC CPU2017 run.
#
# To modify inc file generation, please consult the readme file or the run
# script.
################################################################################
################################################################################
################################################################################
################################################################################
# The following macros are generated for use in the cfg file.
################################################################################
################################################################################
%define logical_core_count 384
%define physical_core_count 192
%define physical_core_max 191
%define logical_core_max 383
################################################################################
################################################################################
# The following inc blocks set the speed thread counts and affinity settings.
#
# intspeed benchmarks: 600.perlbench_s,602.gcc_s,605.mcf_s,620.omnetpp_s,
# 623.xalancbmk_s,625.x264_s,631.deepsjeng_s,641.leela_s,648.exchange2_s,
# 657.xz_s
# fpspeed benchmarks: 603.bwaves_s,607.cactuBSSN_s,619.lbm_s,621.wrf_s,
# 627.cam4_s,628.pop2_s,638.imagick_s,644.nab_s,649.fotonik3d_s,
# 654.roms_s
#
# Selected thread counts from '9654' section of CPU info
################################################################################
# default preENV thread settings:
default:
preENV_OMP_THREAD_LIMIT = 384
preENV_GOMP_CPU_AFFINITY = 0-383
################################################################################
################################################################################
# intspeed base thread counts:
intspeed=base:
    threads = 192
    ENV_GOMP_CPU_AFFINITY = 0-191
    bind0 = numactl --physcpubind=0-191
    submit = echo "$command" > run.sh ; $BIND bash run.sh
################################################################################
################################################################################
# fpspeed base thread
# counts:
fpspeed=base:
    threads = 192
    ENV_GOMP_CPU_AFFINITY = 0-191
    bind0 = numactl --physcpubind=0-191
    submit = echo "$command" > run.sh ; $BIND bash run.sh
################################################################################
################################################################################
# peak thread counts: 1
600.perlbench_s,602.gcc_s,605.mcf_s,620.omnetpp_s,623.xalancbmk_s,625.x264_s,631.deepsjeng_s,641.leela_s,648.exchange2_s=peak:
    threads = 1
    ENV_GOMP_CPU_AFFINITY = 15
    bind0 = numactl --physcpubind=15
    submit = echo "$command" > run.sh ; $BIND bash run.sh
################################################################################
################################################################################
# peak thread counts: 192
603.bwaves_s,619.lbm_s,621.wrf_s,628.pop2_s,649.fotonik3d_s=peak:
    threads = 192
    ENV_GOMP_CPU_AFFINITY = 0-191
    bind0 = numactl --physcpubind=0-191
    submit = echo "$command" > run.sh ; $BIND bash run.sh
################################################################################
################################################################################
# peak thread counts: 384
607.cactuBSSN_s,627.cam4_s,638.imagick_s,644.nab_s,657.xz_s=peak:
    threads = 384
    ENV_GOMP_CPU_AFFINITY = 0-383
    bind0 = numactl --physcpubind=0-383
    submit = echo "$command" > run.sh ; $BIND bash run.sh
################################################################################
################################################################################
# peak thread counts: 384
654.roms_s=peak:
    threads = 384
    ENV_GOMP_CPU_AFFINITY = 0 192 1 193 2 194 3 195 4 196 5 197 6 198 7 199 8 200 9 201 10 202 11 203 12 204 13 205 14 206 15 207 16 208 17 209 18 210 19 211 20 212 21 213 22 214 23 215 24 216 25 217 26 218 27 219 28 220 29 221 30 222 31 223 32 224 33 225 34 226 35 227 36 228 37 229 38 230 39 231 40 232 41 233 42 234 43 235 44 236 45 237 46 238 47 239 48 240 49 241 50 242 51 243 52 244 53 245 54 246 55 247 56 248 57 249 58 250 59 251 60 252 61 253 62 254 63 255 64 256 65 257 66 258 67 259 68 260 69 261 70 262 71 263 72 264 73 265 74 266 75 267 76 268 77 269 78 270 79 271 80 272 81 273 82 274 83 275 84 276 85 277 86 278 87 279 88 280 89 281 90 282 91 283 92 284 93 285 94 286 95 287 96 288 97 289 98 290 99 291 100 292 101 293 102 294 103 295 104 296 105 297 106 298 107 299 108 300 109 301 110 302 111 303 112 304 113 305 114 306 115 307 116 308 117 309 118 310 119 311 120 312 121 313 122 314 123 315 124 316 125 317 126 318 127 319 128 320 129 321 130 322 131 323 132 324 133 325 134 326 135 327 136 328 137 329 138 330 139 331 140 332 141 333 142 334 143 335 144 336 145 337 146 338 147 339 148 340 149 341 150 342 151 343 152 344 153 345 154 346 155 347 156 348 157 349 158 350 159 351 160 352 161 353 162 354 163 355 164 356 165 357 166 358 167 359 168 360 169 361 170 362 171 363 172 364 173 365 174 366 175 367 176 368 177 369 178 370 179 371 180 372 181 373 182 374 183 375 184 376 185 377 186 378 187 379 188 380 189 381 190 382 191 383
    bind0 = numactl --physcpubind=0-383
    submit = echo "$command" > run.sh ; $BIND bash run.sh
################################################################################
################################################################################
################################################################################
# Switch back to default:
default:
################################################################################
################################################################################
################################################################################
# The remainder of this file defines CPU2017 report parameters.
################################################################################
################################################################################
################################################################################
# SPEC CPU 2017 report header
################################################################################
license_num =3358
tester =IEIT Systems Co., Ltd.
test_sponsor =IEIT Systems Co., Ltd.
hw_vendor =IEIT Systems Co., Ltd.
hw_model =NF5180A7 (AMD EPYC 9654)
#--------- If you install new compilers, edit this section --------------------
sw_compiler =C/C++/Fortran: Version 4.0.0 of AOCC
################################################################################
################################################################################
# Hardware, firmware and software information
################################################################################
hw_avail =Sep-2023
sw_avail =Nov-2022
hw_cpu_name =AMD EPYC 9654
hw_cpu_nominal_mhz =2400
hw_cpu_max_mhz =3700
hw_ncores =192
hw_nthreadspercore =2
hw_ncpuorder =1,2 chips
hw_other =None # Other perf-relevant hw, or "None"
fw_bios =Version 04.02.19 released Mar-2023
sw_base_ptrsize =64-bit
hw_pcache =32 KB I + 32 KB D on chip per core
hw_scache =1 MB I+D on chip per core
hw_tcache000 =384 MB I+D on chip per chip, 32 MB shared / 8
hw_tcache001 = cores
hw_ocache =None
sw_other =None
################################################################################
# Notes
################################################################################
# Enter notes_000 through notes_100 here.
notes_000 =Binaries were compiled on a system with 2x AMD EPYC 9174F CPU + 1.5TiB Memory using RHEL 8.6
notes_005 =
notes_010 =NA: The test sponsor attests, as of date of publication, that CVE-2017-5754 (Meltdown)
notes_015 =is mitigated in the system as tested and documented.
notes_020 =Yes: The test sponsor attests, as of date of publication, that CVE-2017-5753 (Spectre variant 1)
notes_025 =is mitigated in the system as tested and documented.
notes_030 =Yes: The test sponsor attests, as of date of publication, that CVE-2017-5715 (Spectre variant 2)
notes_035 =is mitigated in the system as tested and documented.
notes_040 =
notes_submit_000 ='numactl' was used to bind copies to the cores.
notes_submit_005 =See the configuration file for details.
notes_submit_010 =
notes_os_000 ='ulimit -s unlimited' was used to set environment stack size limit
notes_os_005 ='ulimit -l 2097152' was used to set environment locked pages in memory limit
notes_os_010 =
notes_os_015 =runcpu command invoked through numactl i.e.:
notes_os_020 =numactl --interleave=all runcpu
notes_os_025 =
notes_os_030 =To limit dirty cache to 8% of memory, 'sysctl -w vm.dirty_ratio=8' run as root.
notes_os_035 =To limit swap usage to minimum necessary, 'sysctl -w vm.swappiness=1' run as root.
notes_os_040 =To free node-local memory and avoid remote memory usage,
notes_os_045 ='sysctl -w vm.zone_reclaim_mode=1' run as root.
notes_os_050 =To clear filesystem caches, 'sync; sysctl -w vm.drop_caches=3' run as root.
notes_os_055 =To disable address space layout randomization (ASLR) to reduce run-to-run
notes_os_060 =variability, 'sysctl -w kernel.randomize_va_space=0' run as root.
notes_os_065 =
notes_os_thp_000 =To enable Transparent Hugepages (THP) for all allocations,
notes_os_thp_005 ='echo always > /sys/kernel/mm/transparent_hugepage/enabled' and
notes_os_thp_010 ='echo always > /sys/kernel/mm/transparent_hugepage/defrag' run as root.
notes_comp_000 =The AMD64 AOCC Compiler Suite is available at
notes_comp_005 =http://developer.amd.com/amd-aocc/
notes_comp_010 =
# notes_jemalloc_000 =jemalloc: configured and built with GCC v4.8.2 in RHEL 7.4 (No options specified)
# notes_jemalloc_005 =jemalloc 5.1.0 is available here:
# notes_jemalloc_010 =https://github.com/jemalloc/jemalloc/releases/download/5.1.0/jemalloc-5.1.0.tar.bz2
# notes_jemalloc_015 =
# sw_other000 =jemalloc: jemalloc memory allocator library v5.1.0
################################################################################
# The following note fields describe platform settings.
################################################################################
# example: (edit and uncomment as necessary)
# notes_plat_000 =BIOS settings:
# notes_plat_002 = TDP: 400
# notes_plat_004 = Determinism Slider set to Power
# notes_plat_006 = PPT: 400
# notes_plat_010 = NPS: 4
# notes_plat_011 = Workload Profile = CPU Intensive
# notes_plat_012 = TSME = Disabled
# notes_plat_014 = SEV Control = Disabled
# notes_plat_015 = Fan Speed: Maximum
################################################################################
# The following are custom fields:
################################################################################
# Use custom_fields to enter lines that are not listed here. For example:
# notes_plat_100 = Energy Bias set to Max Performance
# new_field = Ambient temperature set to 10C
################################################################################
# The following fields must be set here for only Int benchmarks.
################################################################################
intspeed:
sw_peak_ptrsize =64-bit
notes_os_thp_015 =
################################################################################
# The following fields must be set here for FP benchmarks.
################################################################################
fpspeed:
sw_peak_ptrsize =64-bit
notes_os_thp_003 =To always enable THP for peak runs of:
notes_os_thp_004 =603.bwaves_s, 607.cactuBSSN_s, 619.lbm_s, 627.cam4_s, 628.pop2_s, 638.imagick_s, 644.nab_s, 649.fotonik3d_s:
notes_os_thp_005 ='echo madvise > /sys/kernel/mm/transparent_hugepage/enabled; echo always > /sys/kernel/mm/transparent_hugepage/defrag'
notes_os_thp_006 =run as root.
notes_os_thp_007 =To disable THP for peak runs of 621.wrf_s:
notes_os_thp_008 ='echo never > /sys/kernel/mm/transparent_hugepage/enabled; echo always > /sys/kernel/mm/transparent_hugepage/defrag'
notes_os_thp_009 =run as root.
notes_os_thp_010 =To enable THP only on request for peak runs of 654.roms_s:
notes_os_thp_011 ='echo madvise > /sys/kernel/mm/transparent_hugepage/enabled; echo madvise > /sys/kernel/mm/transparent_hugepage/defrag'
notes_os_thp_012 =run as root.
################################################################################
# The following fields must be set here or they will be overwritten by sysinfo.
################################################################################
intspeed,fpspeed:
hw_disk =1 x 1.92 TB NVME SSD
hw_memory =1536 GB (24 x 64 GB 2Rx4 PC5-4800B-R)
hw_nchips =2
prepared_by =IEIT Systems Co., Ltd.
sw_file =xfs
sw_os000 =Red Hat Enterprise Linux 9.0 (Plow)
sw_os001 =5.14.0-70.22.1.el9_0.x86_64
sw_state =Run level 3 (multi-user)
################################################################################
# End of inc file
################################################################################
# Switch back to the default block after the include file:
default:
# ---- End inclusion of '/home/cpu2017/config/amd_speed_aocc400_genoa_B1.inc'

# Switch back to default block after the include file:
default:
fail_build = 1
%elif '%{allow_build}' eq 'true'
# If you intend to rebuild, be sure to set the library paths either in the
# build script or here:
preENV_LIBRARY_PATH = $[top]/%{build_lib_dir}/lib:%{ENV_LIBRARY_PATH}
% define build_ncpus 16 # controls number of simultaneous compiles
fail_build = 0
makeflags = --jobs=%{build_ncpus} --load-average=%{build_ncpus}
%else
% error The value of "allow_build" is %{allow_build}, but it can only be "true" or "false". This error was generated
%endif
################################################################################
# Enable automated data collection per benchmark
################################################################################
# Data collection is not enabled for reportable runs.

# teeout is necessary to get data collection stdout into the logs. Best
# practices for the individual data collection items would be to have
# them store important output in separate files. Filenames could be
# constructed from $SPEC (environment), $lognum (result number from runcpu),
# and benchmark name/number.
teeout = yes

# Run runcpu with '-v 35' (or greater) to log lists of variables which can
# be used in substitutions as below.
# For CPU2006, change $label to $ext
%define data-collection-parameters benchname='$name' benchnum='$num' benchmark='$benchmark' iteration=$iter size='$size' tune='$tune' label='$label' log='$log' lognum='$lognum' from_runcpu='$from_runcpu'
%define data-collection-start $[top]/data-collection/data-collection start %{data-collection-parameters}
%define data-collection-stop $[top]/data-collection/data-collection stop %{data-collection-parameters}

monitor_specrun_wrapper = %{data-collection-start} ; $command ; %{data-collection-stop}
################################################################################
# Header settings
################################################################################
backup_config = 0 # set to 0 if you do not want backup files
bench_post_setup = sync
# command_add_redirect: If set, the generated ${command} will include
# redirection operators (stdout, stderr), which are passed along to the shell
# that executes the command. If this variable is not set, specinvoke does the
# redirection.
command_add_redirect = yes
env_vars = yes
flagsurl000 = http://www.spec.org/cpu2017/flags/aocc400-flags.xml
flagsurl001 = http://www.spec.org/cpu2017/flags/Inspur-Platform-Settings-amd-V3.1.xml
#flagsurl02 = $[top]/%{platform_file_name}
# label: User defined extension string that tags your binaries & directories:
label = %{ext}
line_width = 1020
log_line_width = 1020
mean_anyway = yes
output_format = all
reportable = yes
size = test,train,ref
teeout = yes
teerunout = yes
tune = base,peak
use_submit_for_speed = yes
################################################################################
# Include the flags file:
################################################################################
#include: %{flags_inc_file_name}
# ----- Begin inclusion of 'amd_speed_aocc400_genoa_B1_flags.inc'
############################################################################
################################################################################
# AMD AOCC 4.0.0 SPEC CPU2017 V1.1.8 Speed Configuration Flags for AMD64 Linux
################################################################################
# Compilers
################################################################################
default:
CC = clang -m64
CXX = clang++ -m64
FC = flang -m64
CLD = clang -m64
CXXLD = clang++ -m64
FLD = flang -m64
CC_VERSION_OPTION = --version
CXX_VERSION_OPTION = --version
FC_VERSION_OPTION = --version
################################################################################
# Portability Flags
################################################################################
default:
# data model applies to all benchmarks
EXTRA_PORTABILITY = -DSPEC_LP64

# *** Benchmark-specific portability ***
# Anything other than the data model is only allowed where a need is proven.
# (ordered by last 2 digits of benchmark number)
600.perlbench_s: #lang='C'
    PORTABILITY = -DSPEC_LINUX_X64

621.wrf_s: #lang='F,C'
    CPORTABILITY = -DSPEC_CASE_FLAG
    FPORTABILITY = -Mbyteswapio

623.xalancbmk_s: #lang='CXX'
    PORTABILITY = -DSPEC_LINUX

627.cam4_s: #lang='F,C'
    PORTABILITY = -DSPEC_CASE_FLAG

628.pop2_s: #lang='F,C'
    CPORTABILITY = -DSPEC_CASE_FLAG
    FPORTABILITY = -Mbyteswapio
################################################################################
# Default libraries and variables
################################################################################
default:
# Libraries:
EXTRA_LIBS = -fopenmp=libomp \
    -lomp \
    -lamdalloc \
    -lamdlibm \
    -lm
MATHLIBOPT = #clearing this variable or else SPEC will set it to -lm
VECMATHLIB = -fveclib=AMDLIBM

# Variables:
OPT_ROOT = -march=znver4 \
    $(VECMATHLIB) \
    -ffast-math \
    -fopenmp
OPT_ROOT_BASE = -O3 \
    $(OPT_ROOT)
OPT_ROOT_PEAK = -Ofast \
    $(OPT_ROOT) \
    -flto
THP_ALWAYS = echo always > /sys/kernel/mm/transparent_hugepage/enabled; echo always > /sys/kernel/mm/transparent_hugepage/defrag
THP_NEVER = echo never > /sys/kernel/mm/transparent_hugepage/enabled; echo never > /sys/kernel/mm/transparent_hugepage/defrag
THP_MADVISE = echo madvise > /sys/kernel/mm/transparent_hugepage/enabled; echo madvise > /sys/kernel/mm/transparent_hugepage/defrag
DEFAULT_SUBMIT = echo "$command" > run.sh ; $BIND bash run.sh
###############################################################################
# AOCC 4.0.0 workarounds that do not count as PORTABILITY
################################################################################
# The workarounds in this section would not qualify under the SPEC CPU
# PORTABILITY rule.
# - In peak, they can be set as needed for individual benchmarks.
# - In base, individual settings are not allowed; set for whole suite.
# Use EXTRA_CFLAGS, EXTRA_CXXFLAGS, and EXTRA_FFLAGS for them.
#
# See:
# https://www.spec.org/cpu2017/Docs/runrules.html#portability
# https://www.spec.org/cpu2017/Docs/runrules.html#BaseFlags

#######################
# Default workarounds #
#######################
default:
# Allow unused compile/link arguments without triggering warnings during build:
EXTRA_CFLAGS = -Wno-unused-command-line-argument
EXTRA_CXXFLAGS = -Wno-unused-command-line-argument
EXTRA_FFLAGS = -Wno-unused-command-line-argument
LDOPTIONS = -Wno-unused-command-line-argument

####################
# Base workarounds #
####################
#
# *** NONE ***
#
##############################
# Integer workarounds - base #
##############################
intrate=base:
# The following is necessary for 602 gcc:
EXTRA_LDFLAGS = -z muldefs

#########################
# FP workarounds - base #
#########################
#
# *** NONE ***
#
####################
# Peak workarounds #
####################
#
# *** NONE ***
#
##############################
# Integer workarounds - peak #
##############################
602.gcc_s=peak: #lang='C'
    EXTRA_LDFLAGS = -z muldefs

#####################################
# Floating Point workarounds - peak #
#####################################
#
# *** NONE ***
#
################################################################################
# Tuning Flags
################################################################################
#####################
# Base tuning flags #
#####################
default=base:
COPTIMIZE = $(OPT_ROOT_BASE) \
    -flto \
    -fstruct-layout=7 \
    -mllvm -unroll-threshold=50 \
    -mllvm -inline-threshold=1000 \
    -fremap-arrays \
    -fstrip-mining \
    -mllvm -reduce-array-computations=3 \
    -DSPEC_OPENMP \
    -Wno-return-type \
    -zopt
CXXOPTIMIZE = $(OPT_ROOT_BASE) \
    -flto \
    -mllvm -unroll-threshold=100 \
    -finline-aggressive \
    -mllvm -loop-unswitch-threshold=200000 \
    -mllvm -reduce-array-computations=3 \
    -DSPEC_OPENMP \
    -zopt
FOPTIMIZE = $(OPT_ROOT_BASE) \
    -flto \
    -Mrecursive \
    -funroll-loops \
    -mllvm -lsr-in-nested-loop \
    -mllvm -reduce-array-computations=3 \
    -DSPEC_OPENMP \
    -zopt
LDCXXFLAGS = -Wl,-mllvm -Wl,-x86-use-vzeroupper=false
LDFLAGS = -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 \
    -Wl,-mllvm -Wl,-reduce-array-computations=3
LDFFLAGS = -Wl,-mllvm -Wl,-enable-X86-prefetching

#other libraries
# Put OpenMP and math libraries here:
# -lm needed at the end for some transcendental functions:
EXTRA_LIBS = -fopenmp=libomp \
    -lomp \
    -lamdlibm \
    -lamdalloc \
    -lflang \
    -lm
EXTRA_FLIBS =
# Don't put the AMD and mvec math libraries in MATHLIBOPT because it will trigger a reporting issue
# because GCC won't use them. Forcefeed all benchmarks the math libraries in EXTRA_LIBS and clear
# out MATHLIBOPT.
MATHLIBOPT =

#########################
# intspeed tuning flags #
#########################
intspeed:
FOPTIMIZE = $(OPT_ROOT_BASE) \
    -flto \
    -mllvm -optimize-strided-mem-cost
EXTRA_FFLAGS = -mllvm -unroll-aggressive \
    -mllvm -unroll-threshold=150
EXTRA_CXXFLAGS = -fvirtual-function-elimination \
    -fvisibility=hidden
LDFLAGS = -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 \
    -Wl,-mllvm -Wl,-reduce-array-computations=3
LDCFLAGS = -Wl,-allow-multiple-definition
LDCXXFLAGS =
LDFFLAGS = -Wl,-mllvm -Wl,-inline-recursion=4 \
    -Wl,-mllvm -Wl,-lsr-in-nested-loop \
    -Wl,-mllvm -Wl,-enable-iv-split

##############################
# intspeed base tuning flags #
##############################
intspeed=base:
EXTRA_LIBS = -fopenmp=libomp \
    -lomp \
    -lamdlibm \
    -lflang \
    -lm
EXTRA_CLIBS = -lamdalloc
EXTRA_CXXLIBS = -lamdalloc-ext
EXTRA_FLIBS = -lamdalloc
submit = ${THP_ALWAYS}; ${DEFAULT_SUBMIT}

##############################
# intspeed peak tuning flags #
##############################
intspeed=peak:
submit = ${THP_ALWAYS}; ${DEFAULT_SUBMIT}

#############################
# fpspeed base tuning flags #
#############################
fpspeed=base:
submit = ${THP_ALWAYS}; ${DEFAULT_SUBMIT}

#############################
# fpspeed peak tuning flags #
#############################
fpspeed=peak:
submit = ${THP_ALWAYS}; ${DEFAULT_SUBMIT}

#####################
# Peak tuning flags #
#####################
default=peak:
COPTIMIZE = $(OPT_ROOT_PEAK) -fstruct-layout=9 \
    -mllvm -unroll-threshold=50 \
    -fremap-arrays \
    -fstrip-mining \
    -mllvm -inline-threshold=1000 \
    -mllvm -reduce-array-computations=3 \
    -DSPEC_OPENMP -Wno-return-type \
    -zopt
CXXOPTIMIZE = $(OPT_ROOT_PEAK) -finline-aggressive \
    -mllvm -unroll-threshold=100 \
    -mllvm -reduce-array-computations=3 \
    -DSPEC_OPENMP \
    -zopt
FOPTIMIZE = $(OPT_ROOT_PEAK) -Mrecursive \
    -mllvm -reduce-array-computations=3 \
    -DSPEC_OPENMP \
    -zopt
LDFLAGS = -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 \
    -Wl,-mllvm -Wl,-reduce-array-computations=3
LDFFLAGS = -Wl,-mllvm -Wl,-enable-X86-prefetching
LDCXXFLAGS = -Wl,-mllvm -Wl,-x86-use-vzeroupper=false
EXTRA_LIBS = -fopenmp=libomp \
    -lomp \
    -lamdlibm \
    -lamdalloc \
    -lflang \
    -lm
feedback = 0
PASS1_CFLAGS = -fprofile-instr-generate
PASS2_CFLAGS = -fprofile-instr-use
PASS1_FFLAGS = -fprofile-generate
PASS2_FFLAGS = -fprofile-use
PASS1_CXXFLAGS = -fprofile-instr-generate
PASS2_CXXFLAGS = -fprofile-instr-use
PASS1_LDFLAGS = -fprofile-instr-generate
PASS2_LDFLAGS = -fprofile-instr-use
fdo_run1 = $command ; llvm-profdata merge --output=default.profdata *.profraw

# Benchmark specific peak tuning flags:
603.bwaves_s=peak: #lang='F'
    FOPTIMIZE = -Ofast \
        $(OPT_ROOT) \
        -Mrecursive \
        -mllvm -reduce-array-computations=3 \
        -DSPEC_OPENMP \
        -fvector-transform \
        -fscalar-transform
    submit = ${THP_ALWAYS}; ${DEFAULT_SUBMIT}

607.cactuBSSN_s=peak: #lang='CXX,C,F'
    submit = ${THP_ALWAYS}; ${DEFAULT_SUBMIT}

619.lbm_s=peak:
    submit = ${THP_ALWAYS}; ${DEFAULT_SUBMIT}

620.omnetpp_s=peak: #lang='CXX'
    EXTRA_LIBS = -fopenmp=libomp \
        -lomp \
        -lamdlibm \
        -lamdalloc-ext \
        -lflang -lm

621.wrf_s=peak: #lang='F,C'
    FOPTIMIZE = $(OPT_ROOT_BASE) \
        -Mrecursive \
        -funroll-loops \
        -mllvm -lsr-in-nested-loop \
        -mllvm -reduce-array-computations=3 \
        -DSPEC_OPENMP \
        -zopt
    submit = ${THP_NEVER}; ${DEFAULT_SUBMIT}

623.xalancbmk_s=peak: #lang='CXX'
EXTRA_CXXFLAGS   = -mllvm -do-block-reorder=aggressive \
                   -fvirtual-function-elimination -fvisibility=hidden
LDFLAGS          = -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 \
                   -Wl,-mllvm -Wl,-reduce-array-computations=3 \
                   -Wl,-mllvm -Wl,-do-block-reorder=aggressive
EXTRA_LIBS       = -fopenmp=libomp \
                   -lomp \
                   -lamdlibm \
                   -lamdalloc-ext \
                   -lflang \
                   -lm

627.cam4_s=peak: #lang='F,C'
LDFLAGS          = -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 \
                   -Wl,-mllvm -Wl,-reduce-array-computations=3
submit           = ${THP_ALWAYS}; ${DEFAULT_SUBMIT}

628.pop2_s=peak: #lang='F,C'
FOPTIMIZE        = $(OPT_ROOT) \
                   -Ofast \
                   -Mrecursive \
                   -mllvm -reduce-array-computations=3 \
                   -DSPEC_OPENMP \
                   -fvector-transform \
                   -fscalar-transform
submit           = ${THP_ALWAYS}; ${DEFAULT_SUBMIT}

638.imagick_s=peak: #lang='C'
LDFLAGS          = -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 \
                   -Wl,-mllvm -Wl,-reduce-array-computations=3
submit           = ${THP_ALWAYS}; ${DEFAULT_SUBMIT}

644.nab_s=peak: #lang='C'
LDFLAGS          = -Wl,-mllvm -Wl,-region-vectorize
submit           = ${THP_ALWAYS}; ${DEFAULT_SUBMIT}

649.fotonik3d_s=peak: #lang='F'
ENV_PGHPF_ZMEM   = yes
submit           = ${THP_ALWAYS}; ${DEFAULT_SUBMIT}

654.roms_s=peak: #lang='F'
FOPTIMIZE        = -Ofast \
                   $(OPT_ROOT) \
                   -Mrecursive \
                   -mllvm -reduce-array-computations=3 \
                   -DSPEC_OPENMP \
                   -fvector-transform \
                   -fscalar-transform
submit           = ${THP_MADVISE}; ${DEFAULT_SUBMIT}

657.xz_s=peak: #lang='C'
ENV_LIBOMP_NUM_HIDDEN_HELPER_THREADS = 8

# ---- End inclusion of '/home/cpu2017/config/amd_speed_aocc400_genoa_B1_flags.inc'

# The following settings were obtained by running the sysinfo_program
# 'specperl $[top]/bin/sysinfo' (sysinfo:SHA:2eb381fc1a58eb8122e4a1b875c1e38b3489dac84088192aa0ec6d157b084d06)
default:
notes_plat_sysinfo_000 =
notes_plat_sysinfo_005 = Sysinfo program /home/cpu2017/bin/sysinfo
notes_plat_sysinfo_010 = Rev: r6732 of 2022-11-07 fe91c89b7ed5c36ae2c92cc097bec197
notes_plat_sysinfo_015 = running on localhost Tue Nov 21 10:44:08 2023
notes_plat_sysinfo_020 =
notes_plat_sysinfo_025 = SUT (System Under Test) info as seen by some common utilities.
notes_plat_sysinfo_030 =
notes_plat_sysinfo_035 = ------------------------------------------------------------
notes_plat_sysinfo_040 = Table of contents
notes_plat_sysinfo_045 = ------------------------------------------------------------
notes_plat_sysinfo_050 = 1. uname -a
notes_plat_sysinfo_055 = 2. w
notes_plat_sysinfo_060 = 3. Username
notes_plat_sysinfo_065 = 4. ulimit -a
notes_plat_sysinfo_070 = 5. sysinfo process ancestry
notes_plat_sysinfo_075 = 6. /proc/cpuinfo
notes_plat_sysinfo_080 = 7. lscpu
notes_plat_sysinfo_085 = 8. numactl --hardware
notes_plat_sysinfo_090 = 9. /proc/meminfo
notes_plat_sysinfo_095 = 10. who -r
notes_plat_sysinfo_100 = 11. Systemd service manager version: systemd 250 (250-6.el9_0)
notes_plat_sysinfo_105 = 12. Failed units, from systemctl list-units --state=failed
notes_plat_sysinfo_110 = 13. Services, from systemctl list-unit-files
notes_plat_sysinfo_115 = 14. Linux kernel boot-time arguments, from /proc/cmdline
notes_plat_sysinfo_120 = 15. cpupower frequency-info
notes_plat_sysinfo_125 = 16. tuned-adm active
notes_plat_sysinfo_130 = 17. sysctl
notes_plat_sysinfo_135 = 18. /sys/kernel/mm/transparent_hugepage
notes_plat_sysinfo_140 = 19. /sys/kernel/mm/transparent_hugepage/khugepaged
notes_plat_sysinfo_145 = 20. OS release
notes_plat_sysinfo_150 = 21. Disk information
notes_plat_sysinfo_155 = 22. /sys/devices/virtual/dmi/id
notes_plat_sysinfo_160 = 23. dmidecode
notes_plat_sysinfo_165 = 24. BIOS
notes_plat_sysinfo_170 = ------------------------------------------------------------
notes_plat_sysinfo_175 =
notes_plat_sysinfo_180 = ------------------------------------------------------------
notes_plat_sysinfo_185 = 1. uname -a
notes_plat_sysinfo_190 = Linux localhost 5.14.0-70.22.1.el9_0.x86_64 #1 SMP PREEMPT Tue Aug 2 10:02:12 EDT 2022 x86_64 x86_64 x86_64
notes_plat_sysinfo_195 = GNU/Linux
notes_plat_sysinfo_200 =
notes_plat_sysinfo_205 = ------------------------------------------------------------
notes_plat_sysinfo_210 = 2. w
notes_plat_sysinfo_215 = 10:44:08 up 1:28, 1 user, load average: 0.26, 0.09, 0.03
notes_plat_sysinfo_220 = USER TTY LOGIN@ IDLE JCPU PCPU WHAT
notes_plat_sysinfo_225 = root tty1 10:41 16.00s 1.50s 0.27s /bin/bash ./amd_speed_aocc400_genoa_B1.sh
notes_plat_sysinfo_230 =
notes_plat_sysinfo_235 = ------------------------------------------------------------
notes_plat_sysinfo_240 = 3. Username
notes_plat_sysinfo_245 = From environment variable $USER: root
notes_plat_sysinfo_250 =
notes_plat_sysinfo_255 = ------------------------------------------------------------
notes_plat_sysinfo_260 = 4. ulimit -a
notes_plat_sysinfo_265 = real-time non-blocking time (microseconds, -R) unlimited
notes_plat_sysinfo_270 = core file size (blocks, -c) 0
notes_plat_sysinfo_275 = data seg size (kbytes, -d) unlimited
notes_plat_sysinfo_280 = scheduling priority (-e) 0
notes_plat_sysinfo_285 = file size (blocks, -f) unlimited
notes_plat_sysinfo_290 = pending signals (-i) 6191046
notes_plat_sysinfo_295 = max locked memory (kbytes, -l) 2097152
notes_plat_sysinfo_300 = max memory size (kbytes, -m) unlimited
notes_plat_sysinfo_305 = open files (-n) 1024
notes_plat_sysinfo_310 = pipe size (512 bytes, -p) 8
notes_plat_sysinfo_315 = POSIX message queues (bytes, -q) 819200
notes_plat_sysinfo_320 = real-time priority (-r) 0
notes_plat_sysinfo_325 = stack size (kbytes, -s) unlimited
notes_plat_sysinfo_330 = cpu time (seconds, -t) unlimited
notes_plat_sysinfo_335 = max user processes (-u) 6191046
notes_plat_sysinfo_340 = virtual memory (kbytes, -v) unlimited
notes_plat_sysinfo_345 = file locks (-x) unlimited
notes_plat_sysinfo_350 =
notes_plat_sysinfo_355 = ------------------------------------------------------------
notes_plat_sysinfo_360 = 5. sysinfo process ancestry
notes_plat_sysinfo_365 = /usr/lib/systemd/systemd --switched-root --system --deserialize 30
notes_plat_sysinfo_370 = login -- root
notes_plat_sysinfo_375 = -bash
notes_plat_sysinfo_380 = python3 ./run_amd_speed_aocc400_genoa_B1.py
notes_plat_sysinfo_385 = /bin/bash ./amd_speed_aocc400_genoa_B1.sh
notes_plat_sysinfo_390 = runcpu --config amd_speed_aocc400_genoa_B1.cfg --tune all --reportable --iterations 3 intspeed
notes_plat_sysinfo_395 = runcpu --configfile amd_speed_aocc400_genoa_B1.cfg --tune all --reportable --iterations 3 --nopower
notes_plat_sysinfo_400 = --runmode speed --tune base:peak --size test:train:refspeed intspeed --nopreenv --note-preenv --logfile
notes_plat_sysinfo_405 = $SPEC/tmp/CPU2017.015/templogs/preenv.intspeed.015.0.log --lognum 015.0 --from_runcpu 2
notes_plat_sysinfo_410 = specperl $SPEC/bin/sysinfo
notes_plat_sysinfo_415 = $SPEC = /home/cpu2017
notes_plat_sysinfo_420 =
notes_plat_sysinfo_425 = ------------------------------------------------------------
notes_plat_sysinfo_430 = 6. /proc/cpuinfo
notes_plat_sysinfo_435 = model name : AMD EPYC 9654 96-Core Processor
notes_plat_sysinfo_440 = vendor_id : AuthenticAMD
notes_plat_sysinfo_445 = cpu family : 25
notes_plat_sysinfo_450 = model : 17
notes_plat_sysinfo_455 = stepping : 1
notes_plat_sysinfo_460 = microcode : 0xa101116
notes_plat_sysinfo_465 = bugs : sysret_ss_attrs spectre_v1 spectre_v2 spec_store_bypass
notes_plat_sysinfo_470 = TLB size : 3584 4K pages
notes_plat_sysinfo_475 = cpu cores : 96
notes_plat_sysinfo_480 = siblings : 192
notes_plat_sysinfo_485 = 2 physical ids (chips)
notes_plat_sysinfo_490 = 384 processors (hardware threads)
notes_plat_sysinfo_495 = physical id 0: core ids 0-95
notes_plat_sysinfo_500 = physical id 1: core ids 0-95
notes_plat_sysinfo_505 = physical id 0: apicids 0-191
notes_plat_sysinfo_510 = physical id 1: apicids 256-447
notes_plat_sysinfo_515 = Caution: /proc/cpuinfo data regarding chips, cores, and threads is not necessarily reliable, especially for
notes_plat_sysinfo_520 = virtualized systems. Use the above data carefully.
notes_plat_sysinfo_525 =
notes_plat_sysinfo_530 = ------------------------------------------------------------
notes_plat_sysinfo_535 = 7. lscpu
notes_plat_sysinfo_540 =
notes_plat_sysinfo_545 = From lscpu from util-linux 2.37.4:
notes_plat_sysinfo_550 = Architecture: x86_64
notes_plat_sysinfo_555 = CPU op-mode(s): 32-bit, 64-bit
notes_plat_sysinfo_560 = Address sizes: 52 bits physical, 57 bits virtual
notes_plat_sysinfo_565 = Byte Order: Little Endian
notes_plat_sysinfo_570 = CPU(s): 384
notes_plat_sysinfo_575 = On-line CPU(s) list: 0-383
notes_plat_sysinfo_580 = Vendor ID: AuthenticAMD
notes_plat_sysinfo_585 = BIOS Vendor ID: Advanced Micro Devices, Inc.
notes_plat_sysinfo_590 = Model name: AMD EPYC 9654 96-Core Processor
notes_plat_sysinfo_595 = BIOS Model name: AMD EPYC 9654 96-Core Processor
notes_plat_sysinfo_600 = CPU family: 25
notes_plat_sysinfo_605 = Model: 17
notes_plat_sysinfo_610 = Thread(s) per core: 2
notes_plat_sysinfo_615 = Core(s) per socket: 96
notes_plat_sysinfo_620 = Socket(s): 2
notes_plat_sysinfo_625 = Stepping: 1
notes_plat_sysinfo_630 = Frequency boost: enabled
notes_plat_sysinfo_635 = CPU max MHz: 3707.8120
notes_plat_sysinfo_640 = CPU min MHz: 1500.0000
notes_plat_sysinfo_645 = BogoMIPS: 4800.11
notes_plat_sysinfo_650 = Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36
notes_plat_sysinfo_655 = clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm
notes_plat_sysinfo_660 = constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf rapl
notes_plat_sysinfo_665 = pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe
notes_plat_sysinfo_670 = popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy
notes_plat_sysinfo_675 = abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt tce topoext
notes_plat_sysinfo_680 = perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb cat_l3 cdp_l3
notes_plat_sysinfo_685 = invpcid_single hw_pstate ssbd mba ibrs ibpb stibp vmmcall fsgsbase bmi1
notes_plat_sysinfo_690 = avx2 smep bmi2 erms invpcid cqm rdt_a avx512f avx512dq rdseed adx smap
notes_plat_sysinfo_695 = avx512ifma clflushopt clwb avx512cd sha_ni avx512bw avx512vl xsaveopt
notes_plat_sysinfo_700 = xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local
notes_plat_sysinfo_705 = avx512_bf16 clzero irperf xsaveerptr rdpru wbnoinvd amd_ppin arat npt lbrv
notes_plat_sysinfo_710 = svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists
notes_plat_sysinfo_715 = pausefilter pfthreshold avic v_vmsave_vmload vgif v_spec_ctrl avx512vbmi
notes_plat_sysinfo_720 = umip pku ospke avx512_vbmi2 gfni vaes vpclmulqdq avx512_vnni avx512_bitalg
notes_plat_sysinfo_725 = avx512_vpopcntdq la57 rdpid overflow_recov succor smca fsrm flush_l1d
notes_plat_sysinfo_730 = Virtualization: AMD-V
notes_plat_sysinfo_735 = L1d cache: 6 MiB (192 instances)
notes_plat_sysinfo_740 = L1i cache: 6 MiB (192 instances)
notes_plat_sysinfo_745 = L2 cache: 192 MiB (192 instances)
notes_plat_sysinfo_750 = L3 cache: 768 MiB (24 instances)
notes_plat_sysinfo_755 = NUMA node(s): 24
notes_plat_sysinfo_760 = NUMA node0 CPU(s): 0-7,192-199
notes_plat_sysinfo_765 = NUMA node1 CPU(s): 8-15,200-207
notes_plat_sysinfo_770 = NUMA node2 CPU(s): 16-23,208-215
notes_plat_sysinfo_775 = NUMA node3 CPU(s): 24-31,216-223
notes_plat_sysinfo_780 = NUMA node4 CPU(s): 32-39,224-231
notes_plat_sysinfo_785 = NUMA node5 CPU(s): 40-47,232-239
notes_plat_sysinfo_790 = NUMA node6 CPU(s): 48-55,240-247
notes_plat_sysinfo_795 = NUMA node7 CPU(s): 56-63,248-255
notes_plat_sysinfo_800 = NUMA node8 CPU(s): 64-71,256-263
notes_plat_sysinfo_805 = NUMA node9 CPU(s): 72-79,264-271
notes_plat_sysinfo_810 = NUMA node10 CPU(s): 80-87,272-279
notes_plat_sysinfo_815 = NUMA node11 CPU(s): 88-95,280-287
notes_plat_sysinfo_820 = NUMA node12 CPU(s): 96-103,288-295
notes_plat_sysinfo_825 = NUMA node13 CPU(s): 104-111,296-303
notes_plat_sysinfo_830 = NUMA node14 CPU(s): 112-119,304-311
notes_plat_sysinfo_835 = NUMA node15 CPU(s): 120-127,312-319
notes_plat_sysinfo_840 = NUMA node16 CPU(s): 128-135,320-327
notes_plat_sysinfo_845 = NUMA node17 CPU(s): 136-143,328-335
notes_plat_sysinfo_850 = NUMA node18 CPU(s): 144-151,336-343
notes_plat_sysinfo_855 = NUMA node19 CPU(s): 152-159,344-351
notes_plat_sysinfo_860 = NUMA node20 CPU(s): 160-167,352-359
notes_plat_sysinfo_865 = NUMA node21 CPU(s): 168-175,360-367
notes_plat_sysinfo_870 = NUMA node22 CPU(s): 176-183,368-375
notes_plat_sysinfo_875 = NUMA node23 CPU(s): 184-191,376-383
notes_plat_sysinfo_880 = Vulnerability Itlb multihit: Not affected
notes_plat_sysinfo_885 = Vulnerability L1tf: Not affected
notes_plat_sysinfo_890 = Vulnerability Mds: Not affected
notes_plat_sysinfo_895 = Vulnerability Meltdown: Not affected
notes_plat_sysinfo_900 = Vulnerability Spec store bypass: Mitigation; Speculative Store Bypass disabled via prctl
notes_plat_sysinfo_905 = Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization
notes_plat_sysinfo_910 = Vulnerability Spectre v2: Mitigation; Retpolines, IBPB conditional, IBRS_FW, STIBP always-on, RSB
notes_plat_sysinfo_915 = filling
notes_plat_sysinfo_920 = Vulnerability Srbds: Not affected
notes_plat_sysinfo_925 = Vulnerability Tsx async abort: Not affected
notes_plat_sysinfo_930 =
notes_plat_sysinfo_935 = From lscpu --cache:
notes_plat_sysinfo_940 = NAME ONE-SIZE ALL-SIZE WAYS TYPE LEVEL SETS PHY-LINE COHERENCY-SIZE
notes_plat_sysinfo_945 = L1d 32K 6M 8 Data 1 64 1 64
notes_plat_sysinfo_950 = L1i 32K 6M 8 Instruction 1 64 1 64
notes_plat_sysinfo_955 = L2 1M 192M 8 Unified 2 2048 1 64
notes_plat_sysinfo_960 = L3 32M 768M 16 Unified 3 32768 1 64
notes_plat_sysinfo_965 =
notes_plat_sysinfo_970 = ------------------------------------------------------------
notes_plat_sysinfo_975 = 8. numactl --hardware
notes_plat_sysinfo_980 = NOTE: a numactl 'node' might or might not correspond to a physical chip.
notes_plat_sysinfo_985 = available: 24 nodes (0-23)
notes_plat_sysinfo_990 = node 0 cpus: 0-7,192-199
notes_plat_sysinfo_995 = node 0 size: 64308 MB
notes_plat_sysinfo_1000= node 0 free: 63472 MB
notes_plat_sysinfo_1005= node 1 cpus: 8-15,200-207
notes_plat_sysinfo_1010= node 1 size: 64507 MB
notes_plat_sysinfo_1015= node 1 free: 64088 MB
notes_plat_sysinfo_1020= node 2 cpus: 16-23,208-215
notes_plat_sysinfo_1025= node 2 size: 64507 MB
notes_plat_sysinfo_1030= node 2 free: 64082 MB
notes_plat_sysinfo_1035= node 3 cpus: 24-31,216-223
notes_plat_sysinfo_1040= node 3 size: 64507 MB
notes_plat_sysinfo_1045= node 3 free: 64039 MB
notes_plat_sysinfo_1050= node 4 cpus: 32-39,224-231
notes_plat_sysinfo_1055= node 4 size: 64507 MB
notes_plat_sysinfo_1060= node 4 free: 64122 MB
notes_plat_sysinfo_1065= node 5 cpus: 40-47,232-239
notes_plat_sysinfo_1070= node 5 size: 64507 MB
notes_plat_sysinfo_1075= node 5 free: 64282 MB
notes_plat_sysinfo_1080= node 6 cpus: 48-55,240-247
notes_plat_sysinfo_1085= node 6 size: 64507 MB
notes_plat_sysinfo_1090= node 6 free: 64304 MB
notes_plat_sysinfo_1095= node 7 cpus: 56-63,248-255
notes_plat_sysinfo_1100= node 7 size: 64470 MB
notes_plat_sysinfo_1105= node 7 free: 64254 MB
notes_plat_sysinfo_1110= node 8 cpus: 64-71,256-263
notes_plat_sysinfo_1115= node 8 size: 64507 MB
notes_plat_sysinfo_1120= node 8 free: 64286 MB
notes_plat_sysinfo_1125= node 9 cpus: 72-79,264-271
notes_plat_sysinfo_1130= node 9 size: 64507 MB
notes_plat_sysinfo_1135= node 9 free: 64047 MB
notes_plat_sysinfo_1140= node 10 cpus: 80-87,272-279
notes_plat_sysinfo_1145= node 10 size: 64507 MB
notes_plat_sysinfo_1150= node 10 free: 64293 MB
notes_plat_sysinfo_1155= node 11 cpus: 88-95,280-287
notes_plat_sysinfo_1160= node 11 size: 64507 MB
notes_plat_sysinfo_1165= node 11 free: 64302 MB
notes_plat_sysinfo_1170= node 12 cpus: 96-103,288-295
notes_plat_sysinfo_1175= node 12 size: 64507 MB
notes_plat_sysinfo_1180= node 12 free: 64267 MB
notes_plat_sysinfo_1185= node 13 cpus: 104-111,296-303
notes_plat_sysinfo_1190= node 13 size: 64507 MB
notes_plat_sysinfo_1195= node 13 free: 64303 MB
notes_plat_sysinfo_1200= node 14 cpus: 112-119,304-311
notes_plat_sysinfo_1205= node 14 size: 64507 MB
notes_plat_sysinfo_1210= node 14 free: 64255 MB
notes_plat_sysinfo_1215= node 15 cpus: 120-127,312-319
notes_plat_sysinfo_1220= node 15 size: 64507 MB
notes_plat_sysinfo_1225= node 15 free: 64205 MB
notes_plat_sysinfo_1230= node 16 cpus: 128-135,320-327
notes_plat_sysinfo_1235= node 16 size: 64507 MB
notes_plat_sysinfo_1240= node 16 free: 64242 MB
notes_plat_sysinfo_1245= node 17 cpus: 136-143,328-335
notes_plat_sysinfo_1250= node 17 size: 64507 MB
notes_plat_sysinfo_1255= node 17 free: 64229 MB
notes_plat_sysinfo_1260= node 18 cpus: 144-151,336-343
notes_plat_sysinfo_1265= node 18 size: 64507 MB
notes_plat_sysinfo_1270= node 18 free: 64220 MB
notes_plat_sysinfo_1275= node 19 cpus: 152-159,344-351
notes_plat_sysinfo_1280= node 19 size: 64507 MB
notes_plat_sysinfo_1285= node 19 free: 64223 MB
notes_plat_sysinfo_1290= node 20 cpus: 160-167,352-359
notes_plat_sysinfo_1295= node 20 size: 64507 MB
notes_plat_sysinfo_1300= node 20 free: 64205 MB
notes_plat_sysinfo_1305= node 21 cpus: 168-175,360-367
notes_plat_sysinfo_1310= node 21 size: 64507 MB
notes_plat_sysinfo_1315= node 21 free: 64170 MB
notes_plat_sysinfo_1320= node 22 cpus: 176-183,368-375
notes_plat_sysinfo_1325= node 22 size: 64507 MB
notes_plat_sysinfo_1330= node 22 free: 64164 MB
notes_plat_sysinfo_1335= node 23 cpus: 184-191,376-383
notes_plat_sysinfo_1340= node 23 size: 64431 MB
notes_plat_sysinfo_1345= node 23 free: 64094 MB
notes_plat_sysinfo_1350= node distances:
notes_plat_sysinfo_1355= node 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23
notes_plat_sysinfo_1360= 0: 10 11 11 12 12 12 12 12 12 12 12 12 32 32 32 32 32 32 32 32 32 32 32 32
notes_plat_sysinfo_1365= 1: 11 10 11 12 12 12 12 12 12 12 12 12 32 32 32 32 32 32 32 32 32 32 32 32
notes_plat_sysinfo_1370= 2: 11 11 10 12 12 12 12 12 12 12 12 12 32 32 32 32 32 32 32 32 32 32 32 32
notes_plat_sysinfo_1375= 3: 12 12 12 10 11 11 12 12 12 12 12 12 32 32 32 32 32 32 32 32 32 32 32 32
notes_plat_sysinfo_1380= 4: 12 12 12 11 10 11 12 12 12 12 12 12 32 32 32 32 32 32 32 32 32 32 32 32
notes_plat_sysinfo_1385= 5: 12 12 12 11 11 10 12 12 12 12 12 12 32 32 32 32 32 32 32 32 32 32 32 32
notes_plat_sysinfo_1390= 6: 12 12 12 12 12 12 10 11 11 12 12 12 32 32 32 32 32 32 32 32 32 32 32 32
notes_plat_sysinfo_1395= 7: 12 12 12 12 12 12 11 10 11 12 12 12 32 32 32 32 32 32 32 32 32 32 32 32
notes_plat_sysinfo_1400= 8: 12 12 12 12 12 12 11 11 10 12 12 12 32 32 32 32 32 32 32 32 32 32 32 32
notes_plat_sysinfo_1405= 9: 12 12 12 12 12 12 12 12 12 10 11 11 32 32 32 32 32 32 32 32 32 32 32 32
notes_plat_sysinfo_1410= 10: 12 12 12 12 12 12 12 12 12 11 10 11 32 32 32 32 32 32 32 32 32 32 32 32
notes_plat_sysinfo_1415= 11: 12 12 12 12 12 12 12 12 12 11 11 10 32 32 32 32 32 32 32 32 32 32 32 32
notes_plat_sysinfo_1420= 12: 32 32 32 32 32 32 32 32 32 32 32 32 10 11 11 12 12 12 12 12 12 12 12 12
notes_plat_sysinfo_1425= 13: 32 32 32 32 32 32 32 32 32 32 32 32 11 10 11 12 12 12 12 12 12 12 12 12
notes_plat_sysinfo_1430= 14: 32 32 32 32 32 32 32 32 32 32 32 32 11 11 10 12 12 12 12 12 12 12 12 12
notes_plat_sysinfo_1435= 15: 32 32 32 32 32 32 32 32 32 32 32 32 12 12 12 10 11 11 12 12 12 12 12 12
notes_plat_sysinfo_1440= 16: 32 32 32 32 32 32 32 32 32 32 32 32 12 12 12 11 10 11 12 12 12 12 12 12
notes_plat_sysinfo_1445= 17: 32 32 32 32 32 32 32 32 32 32 32 32 12 12 12 11 11 10 12 12 12 12 12 12
notes_plat_sysinfo_1450= 18: 32 32 32 32 32 32 32 32 32 32 32 32 12 12 12 12 12 12 10 11 11 12 12 12
notes_plat_sysinfo_1455= 19: 32 32 32 32 32 32 32 32 32 32 32 32 12 12 12 12 12 12 11 10 11 12 12 12
notes_plat_sysinfo_1460= 20: 32 32 32 32 32 32 32 32 32 32 32 32 12 12 12 12 12 12 11 11 10 12 12 12
notes_plat_sysinfo_1465= 21: 32 32 32 32 32 32 32 32 32 32 32 32 12 12 12 12 12 12 12 12 12 10 11 11
notes_plat_sysinfo_1470= 22: 32 32 32 32 32 32 32 32 32 32 32 32 12 12 12 12 12 12 12 12 12 11 10 11
notes_plat_sysinfo_1475= 23: 32 32 32 32 32 32 32 32 32 32 32 32 12 12 12 12 12 12 12 12 12 11 11 10
notes_plat_sysinfo_1480=
notes_plat_sysinfo_1485= ------------------------------------------------------------
notes_plat_sysinfo_1490= 9. /proc/meminfo
notes_plat_sysinfo_1495= MemTotal: 1585014924 kB
notes_plat_sysinfo_1500=
notes_plat_sysinfo_1505= ------------------------------------------------------------
notes_plat_sysinfo_1510= 10. who -r
notes_plat_sysinfo_1515= run-level 3 Nov 21 09:15
notes_plat_sysinfo_1520=
notes_plat_sysinfo_1525= ------------------------------------------------------------
notes_plat_sysinfo_1530= 11. Systemd service manager version: systemd 250 (250-6.el9_0)
notes_plat_sysinfo_1535= Default Target Status
notes_plat_sysinfo_1540= multi-user degraded
notes_plat_sysinfo_1545=
notes_plat_sysinfo_1550= ------------------------------------------------------------
notes_plat_sysinfo_1555= 12. Failed units, from systemctl list-units --state=failed
notes_plat_sysinfo_1560= UNIT LOAD ACTIVE SUB DESCRIPTION
notes_plat_sysinfo_1565= * dnf-makecache.service loaded failed failed dnf makecache
notes_plat_sysinfo_1570=
notes_plat_sysinfo_1575= ------------------------------------------------------------
notes_plat_sysinfo_1580= 13. Services, from systemctl list-unit-files
notes_plat_sysinfo_1585= STATE UNIT FILES
notes_plat_sysinfo_1590= enabled dbus-broker getty@ lvm2-monitor mdmonitor microcode nis-domainname rhsmcertd tuned udisks2
notes_plat_sysinfo_1595= enabled-runtime systemd-remount-fs
notes_plat_sysinfo_1600= disabled NetworkManager NetworkManager-dispatcher NetworkManager-wait-online auditd
notes_plat_sysinfo_1605= blk-availability chrony-wait chronyd console-getty cpupower crond debug-shell firewalld
notes_plat_sysinfo_1610= irqbalance kdump kvm_stat man-db-restart-cache-update nftables rdisc rhsm rhsm-facts
notes_plat_sysinfo_1615= rpmdb-rebuild rsyslog selinux-autorelabel-mark serial-getty@ sshd sshd-keygen@ sssd
notes_plat_sysinfo_1620= systemd-boot-check-no-failures systemd-network-generator systemd-pstore systemd-sysext
notes_plat_sysinfo_1625= indirect sssd-autofs sssd-kcm sssd-nss sssd-pac sssd-pam sssd-ssh sssd-sudo
notes_plat_sysinfo_1630=
notes_plat_sysinfo_1635= ------------------------------------------------------------
notes_plat_sysinfo_1640= 14. Linux kernel boot-time arguments, from /proc/cmdline
notes_plat_sysinfo_1645= BOOT_IMAGE=(hd0,gpt2)/vmlinuz-5.14.0-70.22.1.el9_0.x86_64
notes_plat_sysinfo_1650= root=/dev/mapper/rhel-root
notes_plat_sysinfo_1655= ro
notes_plat_sysinfo_1660= resume=/dev/mapper/rhel-swap
notes_plat_sysinfo_1665= rd.lvm.lv=rhel/root
notes_plat_sysinfo_1670= rd.lvm.lv=rhel/swap
notes_plat_sysinfo_1675=
notes_plat_sysinfo_1680= ------------------------------------------------------------
notes_plat_sysinfo_1685= 15. cpupower frequency-info
notes_plat_sysinfo_1690= analyzing CPU 0:
notes_plat_sysinfo_1695= current policy: frequency should be within 1.50 GHz and 2.40 GHz.
notes_plat_sysinfo_1700= The governor "performance" may decide which speed to use
notes_plat_sysinfo_1705= within this range.
notes_plat_sysinfo_1710= boost state support:
notes_plat_sysinfo_1715= Supported: yes
notes_plat_sysinfo_1720= Active: yes
notes_plat_sysinfo_1725= Boost States: 0
notes_plat_sysinfo_1730= Total States: 3
notes_plat_sysinfo_1735= Pstate-P0: 2400MHz
notes_plat_sysinfo_1740=
notes_plat_sysinfo_1745= ------------------------------------------------------------
notes_plat_sysinfo_1750= 16. tuned-adm active
notes_plat_sysinfo_1755= Current active profile: throughput-performance
notes_plat_sysinfo_1760=
notes_plat_sysinfo_1765= ------------------------------------------------------------
notes_plat_sysinfo_1770= 17. sysctl
notes_plat_sysinfo_1775= kernel.numa_balancing 1
notes_plat_sysinfo_1780= kernel.randomize_va_space 0
notes_plat_sysinfo_1785= vm.compaction_proactiveness 20
notes_plat_sysinfo_1790= vm.dirty_background_bytes 0
notes_plat_sysinfo_1795= vm.dirty_background_ratio 10
notes_plat_sysinfo_1800= vm.dirty_bytes 0
notes_plat_sysinfo_1805= vm.dirty_expire_centisecs 3000
notes_plat_sysinfo_1810= vm.dirty_ratio 8
notes_plat_sysinfo_1815= vm.dirty_writeback_centisecs 500
notes_plat_sysinfo_1820= vm.dirtytime_expire_seconds 43200
notes_plat_sysinfo_1825= vm.extfrag_threshold 500
notes_plat_sysinfo_1830= vm.min_unmapped_ratio 1
notes_plat_sysinfo_1835= vm.nr_hugepages 0
notes_plat_sysinfo_1840= vm.nr_hugepages_mempolicy 0
notes_plat_sysinfo_1845= vm.nr_overcommit_hugepages 0
notes_plat_sysinfo_1850= vm.swappiness 1
notes_plat_sysinfo_1855= vm.watermark_boost_factor 15000
notes_plat_sysinfo_1860= vm.watermark_scale_factor 10
notes_plat_sysinfo_1865= vm.zone_reclaim_mode 1
notes_plat_sysinfo_1870=
notes_plat_sysinfo_1875= ------------------------------------------------------------
notes_plat_sysinfo_1880= 18. /sys/kernel/mm/transparent_hugepage
notes_plat_sysinfo_1885= defrag [always] defer defer+madvise madvise never
notes_plat_sysinfo_1890= enabled [always] madvise never
notes_plat_sysinfo_1895= hpage_pmd_size 2097152
notes_plat_sysinfo_1900= shmem_enabled always within_size advise [never] deny force
notes_plat_sysinfo_1905=
notes_plat_sysinfo_1910= ------------------------------------------------------------
notes_plat_sysinfo_1915= 19. /sys/kernel/mm/transparent_hugepage/khugepaged
notes_plat_sysinfo_1920= alloc_sleep_millisecs 60000
notes_plat_sysinfo_1925= defrag 1
notes_plat_sysinfo_1930= max_ptes_none 511
notes_plat_sysinfo_1935= max_ptes_shared 256
notes_plat_sysinfo_1940= max_ptes_swap 64
notes_plat_sysinfo_1945= pages_to_scan 4096
notes_plat_sysinfo_1950= scan_sleep_millisecs 10000
notes_plat_sysinfo_1955=
notes_plat_sysinfo_1960= ------------------------------------------------------------
notes_plat_sysinfo_1965= 20. OS release
notes_plat_sysinfo_1970= From /etc/*-release /etc/*-version
notes_plat_sysinfo_1975= os-release Red Hat Enterprise Linux 9.0 (Plow)
notes_plat_sysinfo_1980= redhat-release Red Hat Enterprise Linux release 9.0 (Plow)
notes_plat_sysinfo_1985= system-release Red Hat Enterprise Linux release 9.0 (Plow)
notes_plat_sysinfo_1990=
notes_plat_sysinfo_1995= ------------------------------------------------------------
notes_plat_sysinfo_2000= 21. Disk information
notes_plat_sysinfo_2005= SPEC is set to: /home/cpu2017
notes_plat_sysinfo_2010= Filesystem Type Size Used Avail Use% Mounted on
notes_plat_sysinfo_2015= /dev/mapper/rhel-home xfs 1.7T 22G 1.7T 2% /home
notes_plat_sysinfo_2020=
notes_plat_sysinfo_2025= ------------------------------------------------------------
notes_plat_sysinfo_2030= 22. /sys/devices/virtual/dmi/id
notes_plat_sysinfo_2035= Vendor: IEI
notes_plat_sysinfo_2040= Product: NF5180-A7-A0-R0-00
notes_plat_sysinfo_2045= Product Family: Not specified
notes_plat_sysinfo_2050= Serial: 000000000
notes_plat_sysinfo_2055=
notes_plat_sysinfo_2060= ------------------------------------------------------------
notes_plat_sysinfo_2065= 23. dmidecode
notes_plat_sysinfo_2070= Additional information from dmidecode 3.3 follows. WARNING: Use caution when you interpret this section.
notes_plat_sysinfo_2075= The 'dmidecode' program reads system data which is "intended to allow hardware to be accurately
notes_plat_sysinfo_2080= determined", but the intent may not be met, as there are frequent changes to hardware, firmware, and the
notes_plat_sysinfo_2085= "DMTF SMBIOS" standard.
notes_plat_sysinfo_2090= Memory:
notes_plat_sysinfo_2095= 24x Samsung M321R8GA0BB0-CQKZJ 64 GB 2 rank 4800
notes_plat_sysinfo_2100=
notes_plat_sysinfo_2105=
notes_plat_sysinfo_2110= ------------------------------------------------------------
notes_plat_sysinfo_2115= 24. BIOS
notes_plat_sysinfo_2120= (This section combines info from /sys/devices and dmidecode.)
notes_plat_sysinfo_2125= BIOS Vendor: American Megatrends International, LLC.
notes_plat_sysinfo_2130= BIOS Version: 04.02.19
notes_plat_sysinfo_2135= BIOS Date: 03/27/2023
hw_cpu_name        = AMD EPYC 9654
hw_disk            = 1.7 TB add more disk info here
hw_memory001       = 1511.588 GB fixme: If using DDR4, the format is:
hw_memory002       = 'N GB (N x N GB nRxn PC4-nnnnX-X)'
hw_nchips          = 2
hw_ncores          = 192
hw_nthreadspercore = 2
prepared_by        = root (is never output, only tags rawfile)
sw_file            = xfs
sw_os001           = Red Hat Enterprise Linux 9.0 (Plow)
sw_state           = Run level 3 (add definition here)
# End of settings added by sysinfo_program

648.exchange2_s:
# The following setting was inserted automatically as a result of
# post-run basepeak application.
basepeak = 1

641.leela_s:
# The following setting was inserted automatically as a result of
# post-run basepeak application.
basepeak = 1

631.deepsjeng_s:
# The following setting was inserted automatically as a result of
# post-run basepeak application.
basepeak = 1

602.gcc_s:
# The following setting was inserted automatically as a result of
# post-run basepeak application.
basepeak = 1

600.perlbench_s:
# The following setting was inserted automatically as a result of
# post-run basepeak application.
basepeak = 1

# The following section was added automatically, and contains settings that
# did not appear in the original configuration file, but were added to the
# raw file after the run.
default:
power_management000 = BIOS and OS set to prefer performance at the cost
power_management001 = of additional power usage.
notes_plat_form_000 = BIOS configuration:
notes_plat_form_005 = SVM Mode = disable
notes_plat_form_010 = DRAM Scrub time = disable
notes_plat_form_015 = NUMA nodes per socket = NPS1
notes_plat_form_020 = Determinism Slider = Power
notes_plat_form_025 = cTDP = 400
notes_plat_form_030 = Package Power Limit = 400