Stay within a power cap but redistribute power to optimize performance. An average per compute node power maximum is maintained, but individual nodes will be allowed above or below this average. More...

#include <PowerBalancer.hpp>

Inheritance diagram for geopm::PowerBalancer:

[legend]

Public Member Functions
	PowerBalancer ()=default
	Construct a IPowerBalancer object.

virtual	~PowerBalancer ()=default
	Destroy a IPowerBalancer object.

virtual void	power_cap (double cap)=0
	Should be called at the start of application execution with the average power cap across compute nodes. Should be called at the end of the second phase of execution to note that the power savings made across all compute nodes has been evenly redistributed.

virtual double	power_cap (void) const =0
	The current power cap which cannot be exceeded without breaking contract that the average power budget across all compute nodes is maintained.

virtual double	power_limit (void) const =0
	Returns the current power limit prescribed for this node.

virtual void	power_limit_adjusted (double limit)=0
	Notify power balancer that a new limit has been set with the governor.

virtual bool	is_runtime_stable (double measured_runtime)=0
	Update the object with a runtime measured under the current power limit and test if the current runtime sample is reliable such that a call runtime_sample() can be made.

virtual double	runtime_sample (void) const =0
	Return the expected execution time of an application epoch under the current power limit.

virtual void	calculate_runtime_sample (void)=0
	Sample the measured runtimes under the current power cap in the first phase of execution. This measurement will be aggregated across all compute nodes to find the largest runtime measured.

virtual void	target_runtime (double largest_runtime)=0

virtual bool	is_target_met (double measured_runtime)=0
	During the second phase of execution the power limit is decreased until the epoch runtime on the compute node under management has increased to the runtime of the slowest compute node. This method is used to update the object with a new measurement and also test if the current power limit meets the requirements. If the method returns false, then the value returned by power_limit() may have been updated. The new power limit should be enforced for the next epoch execution.

virtual double	power_slack (void)=0
	Query the difference between the last power cap setting and the current power limit. If this method is called and it returns zero then the trial delta used to lower the power limit is reduced by a factor of two.

Static Public Member Functions
static std::unique_ptr< PowerBalancer >	make_unique (double ctl_latency)
	Returns a unique_ptr to a concrete object constructed using the underlying implementation.

static std::shared_ptr< PowerBalancer >	make_shared (double ctl_latency)
	Returns a shared_ptr to a concrete object constructed using the underlying implementation.

Detailed Description

Stay within a power cap but redistribute power to optimize performance. An average per compute node power maximum is maintained, but individual nodes will be allowed above or below this average.

Constructor & Destructor Documentation

◆ PowerBalancer()

geopm::PowerBalancer::PowerBalancer ( )

default

Construct a IPowerBalancer object.

◆ ~PowerBalancer()

virtual geopm::PowerBalancer::~PowerBalancer ( )

virtualdefault

Destroy a IPowerBalancer object.

Member Function Documentation

◆ calculate_runtime_sample()

virtual void geopm::PowerBalancer::calculate_runtime_sample ( void )

pure virtual

Sample the measured runtimes under the current power cap in the first phase of execution. This measurement will be aggregated across all compute nodes to find the largest runtime measured.

Implemented in geopm::PowerBalancerImp.

◆ is_runtime_stable()

virtual bool geopm::PowerBalancer::is_runtime_stable ( double measured_runtime )

pure virtual

Update the object with a runtime measured under the current power limit and test if the current runtime sample is reliable such that a call runtime_sample() can be made.

Parameters

measured_runtime Most recent measurement of the execution time for an epoch on the node being managed under the current power limit.

Returns: True if a stable measurement of expected runtime for an epoch can be made with the runtime_sample() method, and false otherwise.

Implemented in geopm::PowerBalancerImp.

◆ is_target_met()

virtual bool geopm::PowerBalancer::is_target_met ( double measured_runtime )

pure virtual

During the second phase of execution the power limit is decreased until the epoch runtime on the compute node under management has increased to the runtime of the slowest compute node. This method is used to update the object with a new measurement and also test if the current power limit meets the requirements. If the method returns false, then the value returned by power_limit() may have been updated. The new power limit should be enforced for the next epoch execution.

Parameters

measured_runtime Most recent measurement of the execution time for an epoch on the node being managed under the current power limit.

Returns: True if the current power limit is reliably close to the target runtime and excess power should be sent up to the root to be redistributed, and false if more trials are required.

Implemented in geopm::PowerBalancerImp.

◆ make_shared()

std::shared_ptr< PowerBalancer > geopm::PowerBalancer::make_shared ( double ctl_latency )

static

Returns a shared_ptr to a concrete object constructed using the underlying implementation.

◆ make_unique()

std::unique_ptr< PowerBalancer > geopm::PowerBalancer::make_unique ( double ctl_latency )

static

Returns a unique_ptr to a concrete object constructed using the underlying implementation.

◆ power_cap() [1/2]

virtual void geopm::PowerBalancer::power_cap ( double cap )

pure virtual

Should be called at the start of application execution with the average power cap across compute nodes. Should be called at the end of the second phase of execution to note that the power savings made across all compute nodes has been evenly redistributed.

Parameters

power_cap The new maximum power limit equal to the current power limit plus the amount of power saved that is being redistributed.

Implemented in geopm::PowerBalancerImp.

◆ power_cap() [2/2]

virtual double geopm::PowerBalancer::power_cap ( void ) const

pure virtual

The current power cap which cannot be exceeded without breaking contract that the average power budget across all compute nodes is maintained.

Returns: The current value of the power cap.

Implemented in geopm::PowerBalancerImp.

◆ power_limit()

virtual double geopm::PowerBalancer::power_limit ( void ) const

pure virtual

Returns the current power limit prescribed for this node.

Returns: The current power limit in units of Watts.

Implemented in geopm::PowerBalancerImp.

◆ power_limit_adjusted()

virtual void geopm::PowerBalancer::power_limit_adjusted ( double limit )

pure virtual

Notify power balancer that a new limit has been set with the governor.

Parameters

limit limit that was set.

Implemented in geopm::PowerBalancerImp.

◆ power_slack()

virtual double geopm::PowerBalancer::power_slack ( void )

pure virtual

Query the difference between the last power cap setting and the current power limit. If this method is called and it returns zero then the trial delta used to lower the power limit is reduced by a factor of two.

Returns: The difference between the last power cap and the current power limit in units of Watts.

Implemented in geopm::PowerBalancerImp.

◆ runtime_sample()

virtual double geopm::PowerBalancer::runtime_sample ( void ) const

pure virtual

Return the expected execution time of an application epoch under the current power limit.

Implemented in geopm::PowerBalancerImp.

◆ target_runtime()

virtual void geopm::PowerBalancer::target_runtime ( double largest_runtime )

pure virtual

Parameters

Set	the target runtime which is the largest epoch execution time measured by any compute node since the application began or the last global increase to the power budget.
largest_runtime	The largest expected runtime for one epoch across all compute nodes under the current power budget.

Implemented in geopm::PowerBalancerImp.

The documentation for this class was generated from the following files:

Public Member Functions

Static Public Member Functions

Detailed Description

Constructor & Destructor Documentation

◆ PowerBalancer()

◆ ~PowerBalancer()

Member Function Documentation

◆ calculate_runtime_sample()

◆ is_runtime_stable()

◆ is_target_met()

◆ make_shared()

◆ make_unique()

◆ power_cap() [1/2]

◆ power_cap() [2/2]

◆ power_limit()

◆ power_limit_adjusted()

◆ power_slack()

◆ runtime_sample()

◆ target_runtime()