c

akka.remote

PhiAccrualFailureDetector

class PhiAccrualFailureDetector extends FailureDetector

Implementation of 'The Phi Accrual Failure Detector' by Hayashibara et al. as defined in their paper: [https://oneofus.la/have-emacs-will-hack/files/HDY04.pdf]

The suspicion level of failure is given by a value called φ (phi). The basic idea of the φ failure detector is to express the value of φ on a scale that is dynamically adjusted to reflect current network conditions. A configurable threshold is used to decide if φ is considered to be a failure.

The value of φ is calculated as:

φ = -log10(1 - F(timeSinceLastHeartbeat)

where F is the cumulative distribution function of a normal distribution with mean and standard deviation estimated from historical heartbeat inter-arrival times.

Source
PhiAccrualFailureDetector.scala
Linear Supertypes
Type Hierarchy
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. PhiAccrualFailureDetector
  2. FailureDetector
  3. AnyRef
  4. Any
Implicitly
  1. by any2stringadd
  2. by StringFormat
  3. by Ensuring
  4. by ArrowAssoc
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. Protected

Instance Constructors

  1. new PhiAccrualFailureDetector(config: Config, ev: EventStream)

    Constructor that reads parameters from config.

    Constructor that reads parameters from config. Expecting config properties named threshold, max-sample-size, min-std-deviation, acceptable-heartbeat-pause and heartbeat-interval.

  2. new PhiAccrualFailureDetector(threshold: Double, maxSampleSize: Int, minStdDeviation: FiniteDuration, acceptableHeartbeatPause: FiniteDuration, firstHeartbeatEstimate: FiniteDuration)(implicit clock: Clock)

    Constructor without eventStream to support backwards compatibility

  3. new PhiAccrualFailureDetector(threshold: Double, maxSampleSize: Int, minStdDeviation: FiniteDuration, acceptableHeartbeatPause: FiniteDuration, firstHeartbeatEstimate: FiniteDuration, eventStream: Option[EventStream])(implicit clock: Clock)

    threshold

    A low threshold is prone to generate many wrong suspicions but ensures a quick detection in the event of a real crash. Conversely, a high threshold generates fewer mistakes but needs more time to detect actual crashes

    maxSampleSize

    Number of samples to use for calculation of mean and standard deviation of inter-arrival times.

    minStdDeviation

    Minimum standard deviation to use for the normal distribution used when calculating phi. Too low standard deviation might result in too much sensitivity for sudden, but normal, deviations in heartbeat inter arrival times.

    acceptableHeartbeatPause

    Duration corresponding to number of potentially lost/delayed heartbeats that will be accepted before considering it to be an anomaly. This margin is important to be able to survive sudden, occasional, pauses in heartbeat arrivals, due to for example garbage collect or network drop.

    firstHeartbeatEstimate

    Bootstrap the stats with heartbeats that corresponds to to this duration, with a with rather high standard deviation (since environment is unknown in the beginning)

    clock

    The clock, returning current time in milliseconds, but can be faked for testing purposes. It is only used for measuring intervals (duration).

Value Members

  1. val acceptableHeartbeatPause: FiniteDuration
  2. val firstHeartbeatEstimate: FiniteDuration
  3. final def heartbeat(): Unit

    Notifies the FailureDetector that a heartbeat arrived from the monitored resource.

    Notifies the FailureDetector that a heartbeat arrived from the monitored resource. This causes the FailureDetector to update its state.

    Definition Classes
    PhiAccrualFailureDetectorFailureDetector
    Annotations
    @tailrec()
  4. def isAvailable: Boolean

    Returns true if the resource is considered to be up and healthy and returns false otherwise.

    Returns true if the resource is considered to be up and healthy and returns false otherwise.

    Definition Classes
    PhiAccrualFailureDetectorFailureDetector
  5. def isMonitoring: Boolean

    Returns true if the failure detector has received any heartbeats and started monitoring of the resource.

    Returns true if the failure detector has received any heartbeats and started monitoring of the resource.

    Definition Classes
    PhiAccrualFailureDetectorFailureDetector
  6. val maxSampleSize: Int
  7. val minStdDeviation: FiniteDuration
  8. def phi: Double

    The suspicion level of the accrual failure detector.

    The suspicion level of the accrual failure detector.

    If a connection does not have any records in failure detector then it is considered healthy.

  9. val threshold: Double