geopm_agent_frequency_balancer(7) – Agent reduces imbalance across CPU cores
Description
The frequency_balancer
agent reduces imbalance across CPU cores.
The executing application’s main compute loop must be annotated with
geopm_prof_epoch()
from geopm_prof(3). The agent
assumes that core application time can be measured as time spent outside of
networking regions of code. CPU cores that spend less networking time per
epoch are allocated higher CPU frequency limits, while cores that spend more
networking time per epoch are allocated lower CPU frequency limits.
Allocating CPU Core Frequency Limits
The agent estimates a running application’s critical path in each epoch, estimates the best-case critical path time the agent can make the application achieve (sometimes with energy savings, sometimes with performance improvement), and applies frequency control settings that the agent estimates will let each core achieve the target best-case time.
The following subsections outline the agent’s responsibilities at a high level. See Guiding Hardware-Driven Turbo with Application Performance Awareness for more details.
Critical Path Time Estimates
The agent measures the non-networking time as time that each core spends with
the GEOPM_REGION_HINT_NETWORK
hint inactive in each epoch. By default,
GEOPM annotates all MPI function calls with that hint. The agent aims to set
frequency controls that reduce the variation of non-networking time across
cores.
Frequency Control Decisions
Frequency control decisions applied to one core may impact the achievable frequency on other cores. In general, if the platform throttles frequency lower than a user’s requested frequency due to resource constraints, then throttling other cores may lessen the need for platform-induced throttling. Furthermore some platforms support heterogeneous turbo frequency limits, where the number of cores configured with high turbo limits impacts the maximum achievable frequency. The agent performs estimates of the application performance tradeoffs in these cases.
Heterogeneous Turbo Frequency Limits
Turbo frequency boosting allows a CPU to temporarily achieve higher than its base frequency while still respecting hardware design constraints. Typically, higher turbo frequency is achievable when only one CPU core is active (i.e., single-core turbo) than when all CPU cores are active (i.e., all-core turbo). Some platforms support heterogeneous user-configurable turbo frequency limits through the Intel(R) Speed Select Technology - Turbo Frequency (SST-TF) feature.
This agent queries the platform to learn the frequency trade-offs of different possible SST-TF configurations. The agent estimates the potential critical path time for each possible configuration, and applies the configuration that results in the least estimated time.
Agent Configuration
Execute an application with this agent by specifying
--geopm-agent=frequency_balancer
in the geopmlaunch(1)
command line arguments.
The agent is capable of influencing CPU core performance through P-State settings and through heterogeneous turbo frequency limits via SST-TF. Each type of control can be used in isolation, or they can be used together. If SST-TF is requested on a platform that does not support that feature, then the request to use SST-TF is ignored.
Specify the desired types of performance influencers through a policy json file
and indicate the location of the file with --geopm-policy=<path-to-file>
.
The geopmagent(1) tool can help construct a policy file.
Agent Policy Definitions
This agent’s behavior is influenced by the following GEOPM policy fields. At
least one balancing option must be set to 1
.
USE_FREQUENCY_LIMITS
Set to
1
if you want the agent to use per-core P-States to help balance the application. Default:1
USE_SST_TF
Set to
1
if you want the agent to use SST-TF to help balance the application. Default:1
If both P-States and SST-TF are used, then the agent uses both per-core P-States and SST-TF controls to help balance the application.
Trace Column Extensions
This agent adds the following columns to GEOPM trace files:
NON_NET_TIME_PER_EPOCH-core-<core>
This collection of columns indicates the amount of non-networking time the agent measured for each core in the epoch prior to the in-progress epoch. This is the time metric that the agent attempts to balance across cores within each CPU package.
DESIRED_NON_NETWORK_TIME-package-<package>
This collection of columns indicates the agent’s estimate for the most time any core will spend in non-networking code for the current epoch in each package.
Report Extensions
This agent adds the following fields to the root level of GEOPM reports:
Agent uses frequency control
Indicates whether the agent used P-State frequency limits to manipulate application performance across CPU cores.
Agent uses SST-TF
Indicates whether the agent used SST-TF heterogeneous turbo limits to manipulate application performance across CPU cores. This may indicate
0
even ifUSE_SST_TF
was set to1
in the policy if the platform does not support SST-TF, or if the agent is otherwise not able to use that feature on the platform.
Control Loop Gate
The agent limits each step of the the GEOPM Controller’s main loop to execute no faster than once every 5 ms.
See Also
geopm(7), geopmagent(1), geopm_agent(3), geopmlaunch(1), geopm_prof(3)