geopm_report(7) – GEOPM summary report file

Description

The GEOPM report is a YAML file containing summarized information about runtimes, network usage, energy consumption, and other data about an application’s behavior on every compute node where it ran. The report is broken into sections with totals for each compute node, and further divided into sections detailing each application region and epoch. By default, the report will be saved into a file called "geopm.report", but this name can be customized with the GEOPM_REPORT environment variable; refer to the description for --geopm-report in geopmlaunch(1).

The application regions and epoch are defined by use of the geopm_prof(3) interface to mark up the user application, or through automatic inference of regions based on interposing on the MPI or OpenMP interfaces (interposing on OpenMP requires that OMPT is enabled at GEOPM compile time, and the --geopm-ompt-enable option is provided to geopmlaunch). Alternatively, epochs may be inserted automatically when --geopm-record-filter is used, as described in geopmlaunch(1).

Notes On Sampling

Most data in the report is derived from PlatformIO signals (described in geopm_pio(7)) sampled at a rate determined by the Agent::wait() implementation for the active agent (see geopm::Agent(3)). If a sample is longer than the rate specified by the agent, the subsequent sample may be shortened to correct the rate.

GEOPM reports summarize sampled signals across time and across sampling domains. A signal is summarized across time by displaying a single value in a report even though the many values may have been observed at different sample times. A signal is summarized across domains when the signal is sampled at a lower-level domain (e.g., CPU-level) than the displayed domain (e.g., board-level).

Aggregation Over Time

Signals are summarized over time following a method that depends on the signal’s behavior, as declared by signal_behavior() in geopm::IOGroup(3).

If a signal is declared as monotonically increasing, then the reported value is the amount of change measured in that signal.

If a signal is not monotonically increasing, then the report shows time-weighted mean of the sampled values.

Aggregation Across Domains

GEOPM supports reporting signals in the signal’s native domain or in a more coarse domains. If a more coarse domain is requested, the native signal is aggregated across domains as defined by the signal’s agg_function, documented in geopm::IOGroup(3). Common aggregation functions are documented in geopm::Agg(3).

Attributing Signals to Regions

Sample calculations (regardless of time-aggregation method) are attributed to the current region hash for the corresponding domain. By default, synchronous values in the report are aggregated to the board domain, so deltas will be attributed to a region only when all active CPUs on the compute node are in that region.

For example, synchronous time is aggregated into the sync-runtime (s) field as the sum of all delta(time) when all active cores on the node were in the same region. User extensions through GEOPM_REPORT_SIGNALS may be aggregated to smaller domains. If any CPUs within the domain are in different regions at the time of the sample, the delta value will be attributed to the unmarked region. Only active CPUs assigned to the application are considered when determining the region to attribute values to.

Values that are the result of sampling (most fields except runtime and count) may include small contributions from adjacent regions, especially if regions are very short. The amount of slop depends on the controller sample rate and the number of region entries. Sampled values should not be compared with the per-process average runtime runtime (s); use sync-runtime (s) instead.

Refer to the EXAMPLES section below.

Report Format

Header Fields

GEOPM Version

The version of the GEOPM library and tools used for the run. Note that if building from source, ./autogen.sh must be rerun to update the version to the latest git SHA.

Start Time

The start time of the job recorded by the GEOPM controller.

Profile

The profile string, obtained from the GEOPM_PROFILE environment variable.

Agent

The name of the agent, obtained from the GEOPM_AGENT environment variable.

Policy

A JSON string containing the policy values as set in GEOPM_POLICY, or "DYNAMIC" if the policy was received through the endpoint.

Per-host Region, Epoch, and Application Totals

Under the Hosts section of the report, each host in the job creates a section with its hostname as the key. Each host’s section contains:

  • Application Totals, the totals for each metric from the beginning to the end of the job

  • Epoch Totals, the totals from the first detected epoch to the end of the job

  • Regions, the summarized data for each region.

The fields in each of these sections are described below:

name

Name of the region. For user-defined regions, this is the string passed to geopm_region(); for automatically detected OpenMP and MPI regions this is the function name. See geopm_prof(3) for more information.

hash

The hash of the region name. This value is used by agents to distinguish regions using the REGION_HASH signal and also appears in the trace.

runtime (s)

The average across all processes of the total runtime spent in the region. For epoch totals, this is the time from the first detected epoch to the end of the application. For application totals, this is the time from the start to the end of the application, corresponding to the MPI_Init() and MPI_Finalize() calls. This runtime is more accurate than sync-runtime because it uses exact entry and exit timestamps rather than the sampled REGION_HASH. It should not be used for comparisons with sampled values below; use sync-runtime for comparisons instead.

count

The total number of times this region was entered and exited, averaged across all processes. Fractional counts are possible if some processes entered a region a different number of times. For epoch totals, this is the total number of detected epochs, averaged across all processes. The count has no meaning for the unmarked region and application totals.

sync-runtime (s)

Total time for which the sampled region hash matched this region on all CPUs on the compute node. For epoch and application totals, this value is the same as runtime (s). The sync-runtime for the unmarked region represents the total time for which the ranks on a compute node were not in the same region (i.e. unaligned). All subsequent default fields in the report are sampled in the same way as the sync-runtime and can be compared with it.

package-energy (J)

Total energy in joules consumed by all processor packages (sockets).

dram-energy (J)

Total energy in joules consumed by all DRAM on the board.

power (W)

Average power for the processor package, calculated as package-energy divided by sync-runtime.

frequency (%)

Achieved core frequency as a percentage of the sticker (base) frequency for the processor. This frequency is calculated using the ratio of CPU_CYCLES_THREAD to CPU_CYCLES_REFERENCE.

frequency (Hz)

Achieved core frequency for the processor in hertz. This frequency is calculated using the ratio of CPU_CYCLES_THREAD to CPU_CYCLES_REFERENCE times the sticker (base) frequency.

time-hint-network (s)

The portion of sync-runtime where the region hint was GEOPM_REGION_HINT_NETWORK. The region hint is determined by the hint passed to geopm_region() for the most nested region.

time-hint-ignore (s)

The portion of sync-runtime where the region hint was GEOPM_REGION_HINT_IGNORE. The region hint is determined by the hint passed to geopm_region() for the most nested region.

time-hint-compute (s)

The portion of sync-runtime where the region hint was GEOPM_REGION_HINT_COMPUTE. The region hint is determined by the hint passed to geopm_region() for the most nested region.

time-hint-memory (s)

The portion of sync-runtime where the region hint was GEOPM_REGION_HINT_MEMORY. The region hint is determined by the hint passed to geopm_region() for the most nested region.

time-hint-io (s)

The portion of sync-runtime where the region hint was GEOPM_REGION_HINT_IO. The region hint is determined by the hint passed to geopm_region() for the most nested region.

time-hint-serial (s)

The portion of sync-runtime where the region hint was GEOPM_REGION_HINT_SERIAL. The region hint is determined by the hint passed to geopm_region() for the most nested region.

time-hint-parallel (s)

The portion of sync-runtime where the region hint was GEOPM_REGION_HINT_PARALLEL. The region hint is determined by the hint passed to geopm_region() for the most nested region.

time-hint-unknown (s)

The portion of sync-runtime where the region hint was GEOPM_REGION_HINT_UNKNOWN. The region hint is determined by the hint passed to geopm_region() for the most nested region.

time-hint-unset (s)

The portion of sync-runtime where the region hint was GEOPM_REGION_HINT_UNSET. The region hint also becomes unset when exiting an unnested region (i.e. when a CPU is in the unmarked region).

time-hint-spin (s)

The portion of sync-runtime where the region hint was GEOPM_REGION_HINT_SPIN. The region hint is determined by the hint passed to geopm_region() for the most nested region.

gpu-energy (J)

Total energy in joules consumed by all GPUs.

gpu-power (W)

Average power for the GPUs in watts.

gpu-frequency (Hz)

Achieved frequency for the GPUs in hertz.

Report Extensions

The report can be extended by agents, or by through the --geopm-report-signals option to geopmlaunch which corresponds to the GEOPM_REPORT_SIGNALS environment variable. See the description for --geopm-report-signals in geopmlaunch(1) for more details. Signals in the GEOPM_REPORT_SIGNALS list will be added to individual regions, the epoch totals, and the application totals for each host.

Agents can add keys to the report header, host section, or individual region sections using the report_header(), report_host() or report_region() methods respectively. See geopm::Agent(3) for more information about the report extensions available to agents.

Examples

Suppose we have a system with 2 sockets per node and 2 CPUs per package (socket); assume 1 CPU per core for simplicity (i.e. hyperthreading is turned off if applicable). The application places one process (MPI rank) on each CPU, and each rank executes the same application code, containing regions called A and B. The processes are assigned to cores as follows:

      socket 0     |      socket 1
  CPU 0  |  CPU 1  |  CPU 2  |  CPU 3
---------+---------+---------+----------
    1    |    2    |    3    |    4

The following is an example of the region enter/exit and epoch events seen by the Controller. This stream of events will be used to calculate the average runtime and count over ranks for regions and epoch, and the current region hash for sampling.

The following is an example of the regions being entered and exited on each CPU. The sample rate is 0.002 s; the region hash for the REGION_HASH sample at larger domains is shown on the right side of the table; - indicates the unmarked region. Note that in a real report, the entry and exit times used to calculate the runtime may not line up exactly with the sample boundaries (e.g. 0.003 or 0.005), and samples may not be exactly the same length throughout the run.

 time    CPU0   CPU1   CPU2   CPU3   ||  board   pkg0   pkg1
-------------------------------------++----------------------
0.001     -      -      -      -     ||
0.002     A      -      -      A     ||   -       -      -
0.003     A      A      -      A     ||
0.004     A      A      A      A     ||   A       A      A
0.005     -      A      A      A     ||
0.006     B      A      A      A     ||   -       -      A
0.007     B      B      A      B     ||
0.008     B      B      B      B     ||   B       B      B
0.009     B      B      B      B     ||
0.010     B      B      B      B     ||   B       B      B
0.011     -      -      -      -     ||
0.012     finalize: report generated ||   -       -      -

A subset of the report is shown below. The runtime and count fields are averaged across the 4 CPUs. The user extensions for TIME@package correspond to the sync-runtime for each package.

Regions:

 "region": "A",
 "runtime": 0.00375,   // average of [0.002, 0.004, 0.004, 0.005]
 "count": 1,
 "sync-runtime": 0.002,    // 1 sample in A for board
 "TIME@package-0": 0.002,  // 1 sample in A for package 0
 "TIME@package-1": 0.004   // 2 samples in A for package 1
 -
 "region": "B",
 "runtime": 0.002875,  // average of [0.005, 0.004, 0.003, 0.004]
 "count": 1,
 "sync-runtime": 0.004,    // 2 samples in B for board
 "TIME@package-0": 0.004,  // 2 samples in B for package 0
 "TIME@package-1": 0.004,  // 2 samples in B for package 1

Unmarked Totals:
  "runtime": 0.003,     // average of [0.003, 0.003, 0.004, 0.002]
  "count": 0,
  "sync-runtime": 0.006,    // 3 samples in unmarked for board
  "TIME@package-0": 0.006,  // 3 samples in unmarked for package 0
  "TIME@package-1": 0.004,  // 2 samples in unmarked for package 1

See Also

geopm(7), geopm_prof(3), geopm::Agent(3), geopmlaunch(1)