geopm_report(7) -- GEOPM summary report file ============================================ Description ----------- The GEOPM report is a **YAML** file containing summarized information about runtimes, network usage, energy consumption, and other data about an application's behavior on every compute node where it ran. The report is broken into sections with totals for each compute node, and further divided into sections detailing each application region and epoch. By default, the report will be saved into a file called ``"geopm.report"``, but this name can be customized with the ``GEOPM_REPORT`` environment variable; refer to the description for ``--geopm-report`` in :doc:`geopmlaunch(1) <geopmlaunch.1>`. The application regions and epoch are defined by use of the :doc:`geopm_prof(3) <geopm_prof.3>` interface to mark up the user application, or through automatic inference of regions based on interposing on the MPI or OpenMP interfaces (interposing on OpenMP requires that OMPT is enabled at GEOPM compile time, and the ``--geopm-ompt-enable`` option is provided to ``geopmlaunch``). Alternatively, epochs may be inserted automatically when ``--geopm-record-filter`` is used, as described in :doc:`geopmlaunch(1) <geopmlaunch.1>`. Notes On Sampling ----------------- Most data in the report is derived from :doc:`PlatformIO <geopm::PlatformIO.3>` signals (described in :doc:`geopm_pio(7) <geopm_pio.7>`) sampled at a rate determined by the ``Agent::wait()`` implementation for the active agent (see :doc:`geopm::Agent(3) <geopm::Agent.3>`). If a sample is longer than the rate specified by the agent, the subsequent sample may be shortened to correct the rate. GEOPM reports summarize sampled signals across time and across sampling domains. A signal is summarized across time by displaying a single value in a report even though the many values may have been observed at different sample times. A signal is summarized across domains when the signal is sampled at a lower-level domain (e.g., CPU-level) than the displayed domain (e.g., board-level). Aggregation Over Time ^^^^^^^^^^^^^^^^^^^^^ Signals are summarized over time following a method that depends on the signal's *behavior*, as declared by ``signal_behavior()`` in :doc:`geopm::IOGroup(3) <geopm::IOGroup.3>`. If a signal is declared as monotonically increasing, then the reported value is the amount of change measured in that signal. If a signal is not monotonically increasing, then the report shows time-weighted mean of the sampled values. Aggregation Across Domains ^^^^^^^^^^^^^^^^^^^^^^^^^^ GEOPM supports reporting signals in the signal's native domain or in a more coarse domains. If a more coarse domain is requested, the native signal is aggregated across domains as defined by the signal's ``agg_function``, documented in :doc:`geopm::IOGroup(3) <geopm::IOGroup.3>`. Common aggregation functions are documented in :doc:`geopm::Agg(3) <geopm::Agg.3>`. Attributing Signals to Regions ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ Sample calculations (regardless of time-aggregation method) are attributed to the current region hash for the corresponding domain. By default, synchronous values in the report are aggregated to the board domain, so deltas will be attributed to a region only when all active CPUs on the compute node are in that region. For example, synchronous time is aggregated into the ``sync-runtime (s)`` field as the sum of all ``delta(time)`` when all active cores on the node were in the same region. User extensions through ``GEOPM_REPORT_SIGNALS`` may be aggregated to smaller domains. If any CPUs within the domain are in different regions at the time of the sample, the delta value will be attributed to the unmarked region. Only active CPUs assigned to the application are considered when determining the region to attribute values to. Values that are the result of sampling (most fields except ``runtime`` and ``count``) may include small contributions from adjacent regions, especially if regions are very short. The amount of slop depends on the controller sample rate and the number of region entries. Sampled values should not be compared with the per-process average runtime ``runtime (s)``; use ``sync-runtime (s)`` instead. Refer to the :ref:`EXAMPLES <geopm_report.7:Examples>` section below. Report Format ------------- Header Fields ^^^^^^^^^^^^^ GEOPM Version The version of the GEOPM library and tools used for the run. Note that if building from source, ``./autogen.sh`` must be rerun to update the version to the latest git SHA. Start Time The start time of the job recorded by the GEOPM controller. Profile The profile string, obtained from the ``GEOPM_PROFILE`` environment variable. Agent The name of the agent, obtained from the ``GEOPM_AGENT`` environment variable. Policy A JSON string containing the policy values as set in ``GEOPM_POLICY``, or ``"DYNAMIC"`` if the policy was received through the endpoint. Per-host Region, Epoch, and Application Totals ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ Under the ``Hosts`` section of the report, each host in the job creates a section with its hostname as the key. Each host's section contains: * ``Application Totals``, the totals for each metric from the beginning to the end of the job * ``Epoch Totals``, the totals from the first detected epoch to the end of the job * ``Regions``, the summarized data for each region. The fields in each of these sections are described below: ``name`` Name of the region. For user-defined regions, this is the string passed to ``geopm_region()``; for automatically detected OpenMP and MPI regions this is the function name. See :doc:`geopm_prof(3) <geopm_prof.3>` for more information. ``hash`` The hash of the region name. This value is used by agents to distinguish regions using the ``REGION_HASH`` signal and also appears in the trace. ``runtime (s)`` The average across all processes of the total runtime spent in the region. For epoch totals, this is the time from the first detected epoch to the end of the application. For application totals, this is the time from the start to the end of the application, corresponding to the ``MPI_Init()`` and ``MPI_Finalize()`` calls. This ``runtime`` is more accurate than ``sync-runtime`` because it uses exact entry and exit timestamps rather than the sampled ``REGION_HASH``. It should not be used for comparisons with sampled values below; use ``sync-runtime`` for comparisons instead. ``count`` The total number of times this region was entered and exited, averaged across all processes. Fractional counts are possible if some processes entered a region a different number of times. For epoch totals, this is the total number of detected epochs, averaged across all processes. The count has no meaning for the unmarked region and application totals. ``sync-runtime (s)`` Total time for which the sampled region hash matched this region on all CPUs on the compute node. For epoch and application totals, this value is the same as ``runtime (s)``. The ``sync-runtime`` for the unmarked region represents the total time for which the ranks on a compute node were not in the same region (i.e. unaligned). All subsequent default fields in the report are sampled in the same way as the ``sync-runtime`` and can be compared with it. ``package-energy (J)`` Total energy in *joules* consumed by all processor packages (sockets). ``dram-energy (J)`` Total energy in *joules* consumed by all DRAM on the board. ``power (W)`` Average power for the processor package, calculated as ``package-energy`` divided by ``sync-runtime``. ``frequency (%)`` Achieved core frequency as a *percentage* of the sticker (base) frequency for the processor. This frequency is calculated using the ratio of ``CPU_CYCLES_THREAD`` to ``CPU_CYCLES_REFERENCE``. ``frequency (Hz)`` Achieved core frequency for the processor in *hertz*. This frequency is calculated using the ratio of ``CPU_CYCLES_THREAD`` to ``CPU_CYCLES_REFERENCE`` times the sticker (base) frequency. ``time-hint-network (s)`` The portion of ``sync-runtime`` where the region hint was ``GEOPM_REGION_HINT_NETWORK``. The region hint is determined by the hint passed to ``geopm_region()`` for the most nested region. ``time-hint-ignore (s)`` The portion of ``sync-runtime`` where the region hint was ``GEOPM_REGION_HINT_IGNORE``. The region hint is determined by the hint passed to ``geopm_region()`` for the most nested region. ``time-hint-compute (s)`` The portion of ``sync-runtime`` where the region hint was ``GEOPM_REGION_HINT_COMPUTE``. The region hint is determined by the hint passed to ``geopm_region()`` for the most nested region. ``time-hint-memory (s)`` The portion of ``sync-runtime`` where the region hint was ``GEOPM_REGION_HINT_MEMORY``. The region hint is determined by the hint passed to ``geopm_region()`` for the most nested region. ``time-hint-io (s)`` The portion of ``sync-runtime`` where the region hint was ``GEOPM_REGION_HINT_IO``. The region hint is determined by the hint passed to ``geopm_region()`` for the most nested region. ``time-hint-serial (s)`` The portion of ``sync-runtime`` where the region hint was ``GEOPM_REGION_HINT_SERIAL``. The region hint is determined by the hint passed to ``geopm_region()`` for the most nested region. ``time-hint-parallel (s)`` The portion of ``sync-runtime`` where the region hint was ``GEOPM_REGION_HINT_PARALLEL``. The region hint is determined by the hint passed to ``geopm_region()`` for the most nested region. ``time-hint-unknown (s)`` The portion of ``sync-runtime`` where the region hint was ``GEOPM_REGION_HINT_UNKNOWN``. The region hint is determined by the hint passed to ``geopm_region()`` for the most nested region. ``time-hint-unset (s)`` The portion of ``sync-runtime`` where the region hint was ``GEOPM_REGION_HINT_UNSET``. The region hint also becomes unset when exiting an unnested region (i.e. when a CPU is in the unmarked region). ``time-hint-spin (s)`` The portion of ``sync-runtime`` where the region hint was ``GEOPM_REGION_HINT_SPIN``. The region hint is determined by the hint passed to ``geopm_region()`` for the most nested region. ``gpu-energy (J)`` Total energy in *joules* consumed by all GPUs. ``gpu-power (W)`` Average power for the GPUs in *watts*. ``gpu-frequency (Hz)`` Achieved frequency for the GPUs in *hertz*. **Report Extensions** The report can be extended by agents, or by through the ``--geopm-report-signals`` option to ``geopmlaunch`` which corresponds to the ``GEOPM_REPORT_SIGNALS`` environment variable. See the description for ``--geopm-report-signals`` in :doc:`geopmlaunch(1) <geopmlaunch.1>` for more details. Signals in the ``GEOPM_REPORT_SIGNALS`` list will be added to individual regions, the epoch totals, and the application totals for each host. Agents can add keys to the report header, host section, or individual region sections using the ``report_header()``, ``report_host()`` or ``report_region()`` methods respectively. See :doc:`geopm::Agent(3) <geopm::Agent.3>` for more information about the report extensions available to agents. Examples -------- Suppose we have a system with 2 sockets per node and 2 CPUs per package (socket); assume 1 CPU per core for simplicity (i.e. hyperthreading is turned off if applicable). The application places one process (MPI rank) on each CPU, and each rank executes the same application code, containing regions called ``A`` and ``B``. The processes are assigned to cores as follows: .. code-block:: socket 0 | socket 1 CPU 0 | CPU 1 | CPU 2 | CPU 3 ---------+---------+---------+---------- 1 | 2 | 3 | 4 The following is an example of the region enter/exit and epoch events seen by the ``Controller``. This stream of events will be used to calculate the average runtime and count over ranks for regions and epoch, and the current region hash for sampling. The following is an example of the regions being entered and exited on each CPU. The sample rate is ``0.002 s``; the region hash for the ``REGION_HASH`` sample at larger domains is shown on the right side of the table; ``-`` indicates the unmarked region. Note that in a real report, the entry and exit times used to calculate the ``runtime`` may not line up exactly with the sample boundaries (e.g. ``0.003`` or ``0.005``), and samples may not be exactly the same length throughout the run. .. code-block:: time CPU0 CPU1 CPU2 CPU3 || board pkg0 pkg1 -------------------------------------++---------------------- 0.001 - - - - || 0.002 A - - A || - - - 0.003 A A - A || 0.004 A A A A || A A A 0.005 - A A A || 0.006 B A A A || - - A 0.007 B B A B || 0.008 B B B B || B B B 0.009 B B B B || 0.010 B B B B || B B B 0.011 - - - - || 0.012 finalize: report generated || - - - A subset of the report is shown below. The ``runtime`` and ``count`` fields are averaged across the 4 CPUs. The user extensions for ``TIME@package`` correspond to the ``sync-runtime`` for each package. Regions: -------- .. code-block:: "region": "A", "runtime": 0.00375, // average of [0.002, 0.004, 0.004, 0.005] "count": 1, "sync-runtime": 0.002, // 1 sample in A for board "TIME@package-0": 0.002, // 1 sample in A for package 0 "TIME@package-1": 0.004 // 2 samples in A for package 1 - "region": "B", "runtime": 0.002875, // average of [0.005, 0.004, 0.003, 0.004] "count": 1, "sync-runtime": 0.004, // 2 samples in B for board "TIME@package-0": 0.004, // 2 samples in B for package 0 "TIME@package-1": 0.004, // 2 samples in B for package 1 Unmarked Totals: "runtime": 0.003, // average of [0.003, 0.003, 0.004, 0.002] "count": 0, "sync-runtime": 0.006, // 3 samples in unmarked for board "TIME@package-0": 0.006, // 3 samples in unmarked for package 0 "TIME@package-1": 0.004, // 2 samples in unmarked for package 1 See Also -------- :doc:`geopm(7) <geopm.7>`, :doc:`geopm_prof(3) <geopm_prof.3>`, :doc:`geopm::Agent(3) <geopm::Agent.3>`, :doc:`geopmlaunch(1) <geopmlaunch.1>`