Persistence
Loading

Persistence

Akka persistence enables stateful actors to persist their internal state so that it can be recovered when an actor is started, restarted after a JVM crash or by a supervisor, or migrated in a cluster. The key concept behind Akka persistence is that only changes to an actor's internal state are persisted but never its current state directly (except for optional snapshots). These changes are only ever appended to storage, nothing is ever mutated, which allows for very high transaction rates and efficient replication. Stateful actors are recovered by replaying stored changes to these actors from which they can rebuild internal state. This can be either the full history of changes or starting from a snapshot which can dramatically reduce recovery times. Akka persistence also provides point-to-point communication with at-least-once message delivery semantics.

Warning

This module is marked as “experimental” as of its introduction in Akka 2.3.0. We will continue to improve this API based on our users’ feedback, which implies that while we try to keep incompatible changes to a minimum the binary compatibility guarantee for maintenance releases does not apply to the contents of the akka.persistence package.

Akka persistence is inspired by and the official replacement of the eventsourced library. It follows the same concepts and architecture of eventsourced but significantly differs on API and implementation level. See also Migration Guide Eventsourced to Akka Persistence 2.3.x

§Changes in Akka 2.3.4

In Akka 2.3.4 several of the concepts of the earlier versions were collapsed and simplified. In essence; Processor and EventsourcedProcessor are replaced by PersistentActor. Channel and PersistentChannel are replaced by AtLeastOnceDelivery. View is replaced by PersistentView.

See full details of the changes in the Migration Guide Akka Persistence (experimental) 2.3.3 to 2.3.4 (and 2.4.x). The old classes are still included, and deprecated, for a while to make the transition smooth. In case you need the old documentation it is located here.

§Dependencies

Akka persistence is a separate jar file. Make sure that you have the following dependency in your project:

  1. "com.typesafe.akka" %% "akka-persistence-experimental" % "2.3.16"

§Architecture

  • PersistentActor: Is a persistent, stateful actor. It is able to persist events to a journal and can react to them in a thread-safe manner. It can be used to implement both command as well as event sourced actors. When a persistent actor is started or restarted, journaled messages are replayed to that actor, so that it can recover internal state from these messages.
  • PersistentView: A view is a persistent, stateful actor that receives journaled messages that have been written by another persistent actor. A view itself does not journal new messages, instead, it updates internal state only from a persistent actor's replicated message stream.
  • AtLeastOnceDelivery: To send messages with at-least-once delivery semantics to destinations, also in case of sender and receiver JVM crashes.
  • Journal: A journal stores the sequence of messages sent to a persistent actor. An application can control which messages are journaled and which are received by the persistent actor without being journaled. The storage backend of a journal is pluggable. The default journal storage plugin writes to the local filesystem, replicated journals are available as Community plugins.
  • Snapshot store: A snapshot store persists snapshots of a persistent actor's or a view's internal state. Snapshots are used for optimizing recovery times. The storage backend of a snapshot store is pluggable. The default snapshot storage plugin writes to the local filesystem.

§Event sourcing

The basic idea behind Event Sourcing is quite simple. A persistent actor receives a (non-persistent) command which is first validated if it can be applied to the current state. Here, validation can mean anything, from simple inspection of a command message's fields up to a conversation with several external services, for example. If validation succeeds, events are generated from the command, representing the effect of the command. These events are then persisted and, after successful persistence, used to change the actor's state. When the persistent actor needs to be recovered, only the persisted events are replayed of which we know that they can be successfully applied. In other words, events cannot fail when being replayed to a persistent actor, in contrast to commands. Event sourced actors may of course also process commands that do not change application state, such as query commands, for example.

Akka persistence supports event sourcing with the PersistentActor trait. An actor that extends this trait uses the persist method to persist and handle events. The behavior of a PersistentActor is defined by implementing receiveRecover and receiveCommand. This is demonstrated in the following example.

  1. import akka.actor._
  2. import akka.persistence._
  3.  
  4. case class Cmd(data: String)
  5. case class Evt(data: String)
  6.  
  7. case class ExampleState(events: List[String] = Nil) {
  8. def updated(evt: Evt): ExampleState = copy(evt.data :: events)
  9. def size: Int = events.length
  10. override def toString: String = events.reverse.toString
  11. }
  12.  
  13. class ExamplePersistentActor extends PersistentActor {
  14. override def persistenceId = "sample-id-1"
  15.  
  16. var state = ExampleState()
  17.  
  18. def updateState(event: Evt): Unit =
  19. state = state.updated(event)
  20.  
  21. def numEvents =
  22. state.size
  23.  
  24. val receiveRecover: Receive = {
  25. case evt: Evt => updateState(evt)
  26. case SnapshotOffer(_, snapshot: ExampleState) => state = snapshot
  27. }
  28.  
  29. val receiveCommand: Receive = {
  30. case Cmd(data) =>
  31. persist(Evt(s"${data}-${numEvents}"))(updateState)
  32. persist(Evt(s"${data}-${numEvents + 1}")) { event =>
  33. updateState(event)
  34. context.system.eventStream.publish(event)
  35. }
  36. case "snap" => saveSnapshot(state)
  37. case "print" => println(state)
  38. }
  39.  
  40. }

The example defines two data types, Cmd and Evt to represent commands and events, respectively. The state of the ExamplePersistentActor is a list of persisted event data contained in ExampleState.

The persistent actor's receiveRecover method defines how state is updated during recovery by handling Evt and SnapshotOffer messages. The persistent actor's receiveCommand method is a command handler. In this example, a command is handled by generating two events which are then persisted and handled. Events are persisted by calling persist with an event (or a sequence of events) as first argument and an event handler as second argument.

The persist method persists events asynchronously and the event handler is executed for successfully persisted events. Successfully persisted events are internally sent back to the persistent actor as individual messages that trigger event handler executions. An event handler may close over persistent actor state and mutate it. The sender of a persisted event is the sender of the corresponding command. This allows event handlers to reply to the sender of a command (not shown).

The main responsibility of an event handler is changing persistent actor state using event data and notifying others about successful state changes by publishing events.

When persisting events with persist it is guaranteed that the persistent actor will not receive further commands between the persist call and the execution(s) of the associated event handler. This also holds for multiple persist calls in context of a single command.

The easiest way to run this example yourself is to download Typesafe Activator and open the tutorial named Akka Persistence Samples with Scala. It contains instructions on how to run the PersistentActorExample.

Note

It's also possible to switch between different command handlers during normal processing and recovery with context.become() and context.unbecome(). To get the actor into the same state after recovery you need to take special care to perform the same state transitions with become and unbecome in the receiveRecover method as you would have done in the command handler.

§Identifiers

A persistent actor must have an identifier that doesn't change across different actor incarnations. The identifier must be defined with the persistenceId method.

  1. override def persistenceId = "my-stable-persistence-id"

§Recovery

By default, a persistent actor is automatically recovered on start and on restart by replaying journaled messages. New messages sent to a persistent actor during recovery do not interfere with replayed messages. New messages will only be received by a persistent actor after recovery completes.

§Recovery customization

Automated recovery on start can be disabled by overriding preStart with an empty implementation.

  1. override def preStart() = ()

In this case, a persistent actor must be recovered explicitly by sending it a Recover() message.

  1. processor ! Recover()

If not overridden, preStart sends a Recover() message to self. Applications may also override preStart to define further Recover() parameters such as an upper sequence number bound, for example.

  1. override def preStart() {
  2. self ! Recover(toSequenceNr = 457L)
  3. }

Upper sequence number bounds can be used to recover a persistent actor to past state instead of current state. Automated recovery on restart can be disabled by overriding preRestart with an empty implementation.

  1. override def preRestart(reason: Throwable, message: Option[Any]) = ()

§Recovery status

A persistent actor can query its own recovery status via the methods

  1. def recoveryRunning: Boolean
  2. def recoveryFinished: Boolean

Sometimes there is a need for performing additional initialization when the recovery has completed, before processing any other message sent to the persistent actor. The persistent actor will receive a special RecoveryCompleted message right after recovery and before any other received messages.

If there is a problem with recovering the state of the actor from the journal, the actor will be sent a RecoveryFailure message that it can choose to handle in receiveRecover. If the actor doesn't handle the RecoveryFailure message it will be stopped.

  1. def receiveRecover: Receive = {
  2. case RecoveryCompleted => recoveryCompleted()
  3. case evt => //...
  4. }
  5.  
  6. def receiveCommand: Receive = {
  7. case msg => //...
  8. }
  9.  
  10. def recoveryCompleted(): Unit = {
  11. // perform init after recovery, before any other messages
  12. // ...
  13. }

§Relaxed local consistency requirements and high throughput use-cases

If faced with relaxed local consistency requirements and high throughput demands sometimes PersistentActor and it's persist may not be enough in terms of consuming incoming Commands at a high rate, because it has to wait until all Events related to a given Command are processed in order to start processing the next Command. While this abstraction is very useful for most cases, sometimes you may be faced with relaxed requirements about consistency – for example you may want to process commands as fast as you can, assuming that Event will eventually be persisted and handled properly in the background and retroactively reacting to persistence failures if needed.

The persistAsync method provides a tool for implementing high-throughput persistent actors. It will not stash incoming Commands while the Journal is still working on persisting and/or user code is executing event callbacks.

In the below example, the event callbacks may be called "at any time", even after the next Command has been processed. The ordering between events is still guaranteed ("evt-b-1" will be sent after "evt-a-2", which will be sent after "evt-a-1" etc.).

  1. class MyPersistentActor extends PersistentActor {
  2.  
  3. override def persistenceId = "my-stable-persistence-id"
  4.  
  5. def receiveRecover: Receive = {
  6. case _ => // handle recovery here
  7. }
  8.  
  9. def receiveCommand: Receive = {
  10. case c: String => {
  11. sender() ! c
  12. persistAsync(s"evt-$c-1") { e => sender() ! e }
  13. persistAsync(s"evt-$c-2") { e => sender() ! e }
  14. }
  15. }
  16. }
  17.  
  18. // usage
  19. processor ! "a"
  20. processor ! "b"
  21.  
  22. // possible order of received messages:
  23. // a
  24. // b
  25. // evt-a-1
  26. // evt-a-2
  27. // evt-b-1
  28. // evt-b-2

Note

In order to implement the pattern known as "command sourcing" simply call persistAsync(cmd)(...) right away on all incomming messages right away, and handle them in the callback.

Warning

The callback will not be invoked if the actor is restarted (or stopped) in between the call to persistAsync and the journal has confirmed the write.

§Deferring actions until preceding persist handlers have executed

Sometimes when working with persistAsync you may find that it would be nice to define some actions in terms of ''happens-after the previous persistAsync handlers have been invoked''. PersistentActor provides an utility method called defer, which works similarily to persistAsync yet does not persist the passed in event. It is recommended to use it for read operations, and actions which do not have corresponding events in your domain model.

Using this method is very similar to the persist family of methods, yet it does not persist the passed in event. It will be kept in memory and used when invoking the handler.

  1. class MyPersistentActor extends PersistentActor {
  2.  
  3. override def persistenceId = "my-stable-persistence-id"
  4.  
  5. def receiveRecover: Receive = {
  6. case _ => // handle recovery here
  7. }
  8.  
  9. def receiveCommand: Receive = {
  10. case c: String => {
  11. sender() ! c
  12. persistAsync(s"evt-$c-1") { e => sender() ! e }
  13. persistAsync(s"evt-$c-2") { e => sender() ! e }
  14. defer(s"evt-$c-3") { e => sender() ! e }
  15. }
  16. }
  17. }

Notice that the sender() is safe to access in the handler callback, and will be pointing to the original sender of the command for which this defer handler was called.

The calling side will get the responses in this (guaranteed) order:

  1. processor ! "a"
  2. processor ! "b"
  3.  
  4. // order of received messages:
  5. // a
  6. // b
  7. // evt-a-1
  8. // evt-a-2
  9. // evt-a-3
  10. // evt-b-1
  11. // evt-b-2
  12. // evt-b-3

Warning

The callback will not be invoked if the actor is restarted (or stopped) in between the call to defer and the journal has processed and confirmed all preceding writes.

§Batch writes

To optimize throughput, a persistent actor internally batches events to be stored under high load before writing them to the journal (as a single batch). The batch size dynamically grows from 1 under low and moderate loads to a configurable maximum size (default is 200) under high load. When using persistAsync this increases the maximum throughput dramatically.

  1. akka.persistence.journal.max-message-batch-size = 200

A new batch write is triggered by a persistent actor as soon as a batch reaches the maximum size or if the journal completed writing the previous batch. Batch writes are never timer-based which keeps latencies at a minimum.

The batches are also used internally to ensure atomic writes of events. All events that are persisted in context of a single command are written as a single batch to the journal (even if persist is called multiple times per command). The recovery of a PersistentActor will therefore never be done partially (with only a subset of events persisted by a single command).

§Message deletion

To delete all messages (journaled by a single persistent actor) up to a specified sequence number, persistent actors may call the deleteMessages method.

An optional permanent parameter specifies whether the message shall be permanently deleted from the journal or only marked as deleted. In both cases, the message won't be replayed. Later extensions to Akka persistence will allow to replay messages that have been marked as deleted which can be useful for debugging purposes, for example.

§Persistent Views

Persistent views can be implemented by extending the PersistentView trait and implementing the receive and the persistenceId methods.

  1. class MyView extends PersistentView {
  2. override def persistenceId: String = "some-persistence-id"
  3. override def viewId: String = "some-persistence-id-view"
  4.  
  5. def receive: Actor.Receive = {
  6. case payload if isPersistent =>
  7. // handle message from journal...
  8. case payload =>
  9. // handle message from user-land...
  10. }
  11. }

The persistenceId identifies the persistent actor from which the view receives journaled messages. It is not necessary the referenced persistent actor is actually running. Views read messages from a persistent actor's journal directly. When a persistent actor is started later and begins to write new messages, the corresponding view is updated automatically, by default.

It is possible to determine if a message was sent from the Journal or from another actor in user-land by calling the isPersistent method. Having that said, very often you don't need this information at all and can simply apply the same logic to both cases (skip the if isPersistent check).

§Updates

The default update interval of all views of an actor system is configurable:

  1. akka.persistence.view.auto-update-interval = 5s

PersistentView implementation classes may also override the autoUpdateInterval method to return a custom update interval for a specific view class or view instance. Applications may also trigger additional updates at any time by sending a view an Update message.

  1. val view = system.actorOf(Props[MyView])
  2. view ! Update(await = true)

If the await parameter is set to true, messages that follow the Update request are processed when the incremental message replay, triggered by that update request, completed. If set to false (default), messages following the update request may interleave with the replayed message stream. Automated updates always run with await = false.

Automated updates of all persistent views of an actor system can be turned off by configuration:

  1. akka.persistence.view.auto-update = off

Implementation classes may override the configured default value by overriding the autoUpdate method. To limit the number of replayed messages per update request, applications can configure a custom akka.persistence.view.auto-update-replay-max value or override the autoUpdateReplayMax method. The number of replayed messages for manual updates can be limited with the replayMax parameter of the Update message.

§Recovery

Initial recovery of persistent views works in the very same way as for a persistent actor (i.e. by sending a Recover message to self). The maximum number of replayed messages during initial recovery is determined by autoUpdateReplayMax. Further possibilities to customize initial recovery are explained in section Recovery.

§Identifiers

A persistent view must have an identifier that doesn't change across different actor incarnations. The identifier must be defined with the viewId method.

The viewId must differ from the referenced persistenceId, unless Snapshots of a view and its persistent actor shall be shared (which is what applications usually do not want).

§Snapshots

Snapshots can dramatically reduce recovery times of persistent actors and views. The following discusses snapshots in context of persistent actors but this is also applicable to persistent views.

Persistent actors can save snapshots of internal state by calling the saveSnapshot method. If saving of a snapshot succeeds, the persistent actor receives a SaveSnapshotSuccess message, otherwise a SaveSnapshotFailure message

  1. class MyProcessor extends Processor {
  2. var state: Any = _
  3.  
  4. def receive = {
  5. case "snap" => saveSnapshot(state)
  6. case SaveSnapshotSuccess(metadata) => // ...
  7. case SaveSnapshotFailure(metadata, reason) => // ...
  8. }
  9. }

where metadata is of type SnapshotMetadata:

  1. case class SnapshotMetadata(@deprecatedName('processorId) persistenceId: String, sequenceNr: Long, timestamp: Long = 0L) {
  2. @deprecated("Use persistenceId instead.", since = "2.3.4")
  3. def processorId: String = persistenceId
  4. }

During recovery, the persistent actor is offered a previously saved snapshot via a SnapshotOffer message from which it can initialize internal state.

  1. class MyProcessor extends Processor {
  2. var state: Any = _
  3.  
  4. def receive = {
  5. case SnapshotOffer(metadata, offeredSnapshot) => state = offeredSnapshot
  6. case Persistent(payload, sequenceNr) => // ...
  7. }
  8. }

The replayed messages that follow the SnapshotOffer message, if any, are younger than the offered snapshot. They finally recover the persistent actor to its current (i.e. latest) state.

In general, a persistent actor is only offered a snapshot if that persistent actor has previously saved one or more snapshots and at least one of these snapshots matches the SnapshotSelectionCriteria that can be specified for recovery.

  1. processor ! Recover(fromSnapshot = SnapshotSelectionCriteria(
  2. maxSequenceNr = 457L,
  3. maxTimestamp = System.currentTimeMillis))

If not specified, they default to SnapshotSelectionCriteria.Latest which selects the latest (= youngest) snapshot. To disable snapshot-based recovery, applications should use SnapshotSelectionCriteria.None. A recovery where no saved snapshot matches the specified SnapshotSelectionCriteria will replay all journaled messages.

§Snapshot deletion

A persistent actor can delete individual snapshots by calling the deleteSnapshot method with the sequence number and the timestamp of a snapshot as argument. To bulk-delete snapshots matching SnapshotSelectionCriteria, persistent actors should use the deleteSnapshots method.

§At-Least-Once Delivery

To send messages with at-least-once delivery semantics to destinations you can mix-in AtLeastOnceDelivery trait to your PersistentActor on the sending side. It takes care of re-sending messages when they have not been confirmed within a configurable timeout.

Note

At-least-once delivery implies that original message send order is not always preserved and the destination may receive duplicate messages. That means that the semantics do not match those of a normal ActorRef send operation:

  • it is not at-most-once delivery
  • message order for the same sender–receiver pair is not preserved due to possible resends
  • after a crash and restart of the destination messages are still delivered—to the new actor incarnation

These semantics is similar to what an ActorPath represents (see Actor Lifecycle), therefore you need to supply a path and not a reference when delivering messages. The messages are sent to the path with an actor selection.

Use the deliver method to send a message to a destination. Call the confirmDelivery method when the destination has replied with a confirmation message.

  1. import akka.actor.{ Actor, ActorPath }
  2. import akka.persistence.AtLeastOnceDelivery
  3.  
  4. case class Msg(deliveryId: Long, s: String)
  5. case class Confirm(deliveryId: Long)
  6.  
  7. sealed trait Evt
  8. case class MsgSent(s: String) extends Evt
  9. case class MsgConfirmed(deliveryId: Long) extends Evt
  10.  
  11. class MyPersistentActor(destination: ActorPath)
  12. extends PersistentActor with AtLeastOnceDelivery {
  13.  
  14. def receiveCommand: Receive = {
  15. case s: String => persist(MsgSent(s))(updateState)
  16. case Confirm(deliveryId) => persist(MsgConfirmed(deliveryId))(updateState)
  17. }
  18.  
  19. def receiveRecover: Receive = {
  20. case evt: Evt => updateState(evt)
  21. }
  22.  
  23. def updateState(evt: Evt): Unit = evt match {
  24. case MsgSent(s) =>
  25. deliver(destination, deliveryId => Msg(deliveryId, s))
  26.  
  27. case MsgConfirmed(deliveryId) => confirmDelivery(deliveryId)
  28. }
  29. }
  30.  
  31. class MyDestination extends Actor {
  32. def receive = {
  33. case Msg(deliveryId, s) =>
  34. // ...
  35. sender() ! Confirm(deliveryId)
  36. }
  37. }

Correlation between deliver and confirmDelivery is performed with the deliveryId that is provided as parameter to the deliveryIdToMessage function. The deliveryId is typically passed in the message to the destination, which replies with a message containing the same deliveryId.

The deliveryId is a strictly monotonically increasing sequence number without gaps. The same sequence is used for all destinations of the actor, i.e. when sending to multiple destinations the destinations will see gaps in the sequence if no translation is performed.

The AtLeastOnceDelivery trait has a state consisting of unconfirmed messages and a sequence number. It does not store this state itself. You must persist events corresponding to the deliver and confirmDelivery invocations from your PersistentActor so that the state can be restored by calling the same methods during the recovery phase of the PersistentActor. Sometimes these events can be derived from other business level events, and sometimes you must create separate events. During recovery calls to deliver will not send out the message, but it will be sent later if no matching confirmDelivery was performed.

Support for snapshots is provided by getDeliverySnapshot and setDeliverySnapshot. The AtLeastOnceDeliverySnapshot contains the full delivery state, including unconfirmed messages. If you need a custom snapshot for other parts of the actor state you must also include the AtLeastOnceDeliverySnapshot. It is serialized using protobuf with the ordinary Akka serialization mechanism. It is easiest to include the bytes of the AtLeastOnceDeliverySnapshot as a blob in your custom snapshot.

The interval between redelivery attempts is defined by the redeliverInterval method. The default value can be configured with the akka.persistence.at-least-once-delivery.redeliver-interval configuration key. The method can be overridden by implementation classes to return non-default values.

After a number of delivery attempts a AtLeastOnceDelivery.UnconfirmedWarning message will be sent to self. The re-sending will still continue, but you can choose to call confirmDelivery to cancel the re-sending. The number of delivery attempts before emitting the warning is defined by the warnAfterNumberOfUnconfirmedAttempts method. The default value can be configured with the akka.persistence.at-least-once-delivery.warn-after-number-of-unconfirmed-attempts configuration key. The method can be overridden by implementation classes to return non-default values.

The AtLeastOnceDelivery trait holds messages in memory until their successful delivery has been confirmed. The limit of maximum number of unconfirmed messages that the actor is allowed to hold in memory is defined by the maxUnconfirmedMessages method. If this limit is exceed the deliver method will not accept more messages and it will throw AtLeastOnceDelivery.MaxUnconfirmedMessagesExceededException. The default value can be configured with the akka.persistence.at-least-once-delivery.max-unconfirmed-messages configuration key. The method can be overridden by implementation classes to return non-default values.

§Storage plugins

Storage backends for journals and snapshot stores are pluggable in Akka persistence. The default journal plugin writes messages to LevelDB (see Local LevelDB journal). The default snapshot store plugin writes snapshots as individual files to the local filesystem (see Local snapshot store). Applications can provide their own plugins by implementing a plugin API and activate them by configuration. Plugin development requires the following imports:

  1. import akka.actor.ActorSystem
  2. import akka.persistence._
  3. import akka.persistence.journal._
  4. import akka.persistence.snapshot._
  5. import akka.testkit.TestKit
  6. import com.typesafe.config._
  7. import org.scalatest.WordSpec
  8.  
  9. import scala.collection.immutable.Seq
  10. import scala.concurrent.Future
  11. import scala.concurrent.duration._

§Journal plugin API

A journal plugin either extends SyncWriteJournal or AsyncWriteJournal. SyncWriteJournal is an actor that should be extended when the storage backend API only supports synchronous, blocking writes. In this case, the methods to be implemented are:

  1. /**
  2. * Plugin API: synchronously writes a batch of persistent messages to the journal.
  3. * The batch write must be atomic i.e. either all persistent messages in the batch
  4. * are written or none.
  5. */
  6. def writeMessages(messages: immutable.Seq[PersistentRepr]): Unit
  7.  
  8. /**
  9. * Plugin API: synchronously writes a batch of delivery confirmations to the journal.
  10. */
  11. @deprecated("writeConfirmations will be removed, since Channels will be removed.", since = "2.3.4")
  12. def writeConfirmations(confirmations: immutable.Seq[PersistentConfirmation]): Unit
  13.  
  14. /**
  15. * Plugin API: synchronously deletes messages identified by `messageIds` from the
  16. * journal. If `permanent` is set to `false`, the persistent messages are marked as
  17. * deleted, otherwise they are permanently deleted.
  18. */
  19. @deprecated("deleteMessages will be removed.", since = "2.3.4")
  20. def deleteMessages(messageIds: immutable.Seq[PersistentId], permanent: Boolean): Unit
  21.  
  22. /**
  23. * Plugin API: synchronously deletes all persistent messages up to `toSequenceNr`
  24. * (inclusive). If `permanent` is set to `false`, the persistent messages are marked
  25. * as deleted, otherwise they are permanently deleted.
  26. */
  27. def deleteMessagesTo(persistenceId: String, toSequenceNr: Long, permanent: Boolean): Unit

AsyncWriteJournal is an actor that should be extended if the storage backend API supports asynchronous, non-blocking writes. In this case, the methods to be implemented are:

  1. /**
  2. * Plugin API: asynchronously writes a batch of persistent messages to the journal.
  3. * The batch write must be atomic i.e. either all persistent messages in the batch
  4. * are written or none.
  5. */
  6. def asyncWriteMessages(messages: immutable.Seq[PersistentRepr]): Future[Unit]
  7.  
  8. /**
  9. * Plugin API: asynchronously writes a batch of delivery confirmations to the journal.
  10. */
  11. @deprecated("writeConfirmations will be removed, since Channels will be removed.", since = "2.3.4")
  12. def asyncWriteConfirmations(confirmations: immutable.Seq[PersistentConfirmation]): Future[Unit]
  13.  
  14. /**
  15. * Plugin API: asynchronously deletes messages identified by `messageIds` from the
  16. * journal. If `permanent` is set to `false`, the persistent messages are marked as
  17. * deleted, otherwise they are permanently deleted.
  18. */
  19. @deprecated("asyncDeleteMessages will be removed.", since = "2.3.4")
  20. def asyncDeleteMessages(messageIds: immutable.Seq[PersistentId], permanent: Boolean): Future[Unit]
  21.  
  22. /**
  23. * Plugin API: asynchronously deletes all persistent messages up to `toSequenceNr`
  24. * (inclusive). If `permanent` is set to `false`, the persistent messages are marked
  25. * as deleted, otherwise they are permanently deleted.
  26. */
  27. def asyncDeleteMessagesTo(persistenceId: String, toSequenceNr: Long, permanent: Boolean): Future[Unit]

Message replays and sequence number recovery are always asynchronous, therefore, any journal plugin must implement:

  1. /**
  2. * Plugin API: asynchronously replays persistent messages. Implementations replay
  3. * a message by calling `replayCallback`. The returned future must be completed
  4. * when all messages (matching the sequence number bounds) have been replayed.
  5. * The future must be completed with a failure if any of the persistent messages
  6. * could not be replayed.
  7. *
  8. * The `replayCallback` must also be called with messages that have been marked
  9. * as deleted. In this case a replayed message's `deleted` method must return
  10. * `true`.
  11. *
  12. * The channel ids of delivery confirmations that are available for a replayed
  13. * message must be contained in that message's `confirms` sequence.
  14. *
  15. * @param persistenceId persistent actor id.
  16. * @param fromSequenceNr sequence number where replay should start (inclusive).
  17. * @param toSequenceNr sequence number where replay should end (inclusive).
  18. * @param max maximum number of messages to be replayed.
  19. * @param replayCallback called to replay a single message. Can be called from any
  20. * thread.
  21. *
  22. * @see [[AsyncWriteJournal]]
  23. * @see [[SyncWriteJournal]]
  24. */
  25. def asyncReplayMessages(persistenceId: String, fromSequenceNr: Long, toSequenceNr: Long, max: Long)(replayCallback: PersistentRepr Unit): Future[Unit]
  26.  
  27. /**
  28. * Plugin API: asynchronously reads the highest stored sequence number for the
  29. * given `persistenceId`.
  30. *
  31. * @param persistenceId persistent actor id.
  32. * @param fromSequenceNr hint where to start searching for the highest sequence
  33. * number.
  34. */
  35. def asyncReadHighestSequenceNr(persistenceId: String, fromSequenceNr: Long): Future[Long]

A journal plugin can be activated with the following minimal configuration:

  1. # Path to the journal plugin to be used
  2. akka.persistence.journal.plugin = "my-journal"
  3.  
  4. # My custom journal plugin
  5. my-journal {
  6. # Class name of the plugin.
  7. class = "docs.persistence.MyJournal"
  8. # Dispatcher for the plugin actor.
  9. plugin-dispatcher = "akka.actor.default-dispatcher"
  10. }

The specified plugin class must have a no-arg constructor. The plugin-dispatcher is the dispatcher used for the plugin actor. If not specified, it defaults to akka.persistence.dispatchers.default-plugin-dispatcher for SyncWriteJournal plugins and akka.actor.default-dispatcher for AsyncWriteJournal plugins.

§Snapshot store plugin API

A snapshot store plugin must extend the SnapshotStore actor and implement the following methods:

  1. /**
  2. * Plugin API: asynchronously loads a snapshot.
  3. *
  4. * @param persistenceId processor id.
  5. * @param criteria selection criteria for loading.
  6. */
  7. def loadAsync(persistenceId: String, criteria: SnapshotSelectionCriteria): Future[Option[SelectedSnapshot]]
  8.  
  9. /**
  10. * Plugin API: asynchronously saves a snapshot.
  11. *
  12. * @param metadata snapshot metadata.
  13. * @param snapshot snapshot.
  14. */
  15. def saveAsync(metadata: SnapshotMetadata, snapshot: Any): Future[Unit]
  16.  
  17. /**
  18. * Plugin API: called after successful saving of a snapshot.
  19. *
  20. * @param metadata snapshot metadata.
  21. */
  22. def saved(metadata: SnapshotMetadata)
  23.  
  24. /**
  25. * Plugin API: deletes the snapshot identified by `metadata`.
  26. *
  27. * @param metadata snapshot metadata.
  28. */
  29.  
  30. def delete(metadata: SnapshotMetadata)
  31.  
  32. /**
  33. * Plugin API: deletes all snapshots matching `criteria`.
  34. *
  35. * @param persistenceId processor id.
  36. * @param criteria selection criteria for deleting.
  37. */
  38. def delete(persistenceId: String, criteria: SnapshotSelectionCriteria)

A snapshot store plugin can be activated with the following minimal configuration:

  1. # Path to the snapshot store plugin to be used
  2. akka.persistence.snapshot-store.plugin = "my-snapshot-store"
  3.  
  4. # My custom snapshot store plugin
  5. my-snapshot-store {
  6. # Class name of the plugin.
  7. class = "docs.persistence.MySnapshotStore"
  8. # Dispatcher for the plugin actor.
  9. plugin-dispatcher = "akka.persistence.dispatchers.default-plugin-dispatcher"
  10. }

The specified plugin class must have a no-arg constructor. The plugin-dispatcher is the dispatcher used for the plugin actor. If not specified, it defaults to akka.persistence.dispatchers.default-plugin-dispatcher.

§Plugin TCK

In order to help developers build correct and high quality storage plugins, we provide an Technology Compatibility Kit (TCK for short).

The TCK is usable from Java as well as Scala projects, for Scala you need to include the akka-persistence-tck-experimental dependency:

  1. "com.typesafe.akka" %% "akka-persistence-tck-experimental" % "2.3.16" % "test"

To include the Journal TCK tests in your test suite simply extend the provided JournalSpec:

  1. class MyJournalSpec extends JournalSpec {
  2. override val config = ConfigFactory.parseString(
  3. """
  4. |akka.persistence.journal.plugin = "my.journal.plugin"
  5. """.stripMargin)
  6. }

We also provide a simple benchmarking class JournalPerfSpec which includes all the tests that JournalSpec has, and also performs some longer operations on the Journal while printing it's performance stats. While it is NOT aimed to provide a proper benchmarking environment it can be used to get a rough feel about your journals performance in the most typical scenarios.

In order to include the SnapshotStore TCK tests in your test suite simply extend the SnapshotStoreSpec:

  1. class MySnapshotStoreSpec extends SnapshotStoreSpec {
  2. override val config = ConfigFactory.parseString(
  3. """
  4. |akka.persistence.snapshot-store.plugin = "my.snapshot-store.plugin"
  5. """.stripMargin)
  6. }

In case your plugin requires some setting up (starting a mock database, removing temporary files etc.) you can override the beforeAll and afterAll methods to hook into the tests lifecycle:

  1. class MyJournalSpec extends JournalSpec {
  2. override val config = ConfigFactory.parseString(
  3. """
  4. |akka.persistence.journal.plugin = "my.journal.plugin"
  5. """.stripMargin)
  6.  
  7. val storageLocations = List(
  8. new File(system.settings.config.getString("akka.persistence.journal.leveldb.dir")),
  9. new File(config.getString("akka.persistence.snapshot-store.local.dir")))
  10.  
  11. override def beforeAll() {
  12. super.beforeAll()
  13. storageLocations foreach FileUtils.deleteRecursively
  14. }
  15.  
  16. override def afterAll() {
  17. storageLocations foreach FileUtils.deleteRecursively
  18. super.afterAll()
  19. }
  20.  
  21. }

We highly recommend including these specifications in your test suite, as they cover a broad range of cases you might have otherwise forgotten to test for when writing a plugin from scratch.

§Pre-packaged plugins

§Local LevelDB journal

The default journal plugin is akka.persistence.journal.leveldb which writes messages to a local LevelDB instance. The default location of the LevelDB files is a directory named journal in the current working directory. This location can be changed by configuration where the specified path can be relative or absolute:

  1. akka.persistence.journal.leveldb.dir = "target/journal"

With this plugin, each actor system runs its own private LevelDB instance.

§Shared LevelDB journal

A LevelDB instance can also be shared by multiple actor systems (on the same or on different nodes). This, for example, allows persistent actors to failover to a backup node and continue using the shared journal instance from the backup node.

Warning

A shared LevelDB instance is a single point of failure and should therefore only be used for testing purposes. Highly-available, replicated journal are available as Community plugins.

A shared LevelDB instance is started by instantiating the SharedLeveldbStore actor.

  1. }
  2. }
  3.  
  4. class MyJournal extends AsyncWriteJournal {
  5. def asyncWriteMessages(messages: Seq[PersistentRepr]): Future[Unit] = ???
  6. def asyncWriteConfirmations(confirmations: Seq[PersistentConfirmation]): Future[Unit] = ???
  7. def asyncDeleteMessages(messageIds: Seq[PersistentId], permanent: Boolean): Future[Unit] = ???
  8. def asyncDeleteMessagesTo(persistenceId: String, toSequenceNr: Long, permanent: Boolean): Future[Unit] = ???
  9. def asyncReplayMessages(persistenceId: String, fromSequenceNr: Long, toSequenceNr: Long, max: Long)(replayCallback: (PersistentRepr) => Unit): Future[Unit] = ???
  10. def asyncReadHighestSequenceNr(persistenceId: String, fromSequenceNr: Long): Future[Long] = ???
  11. }
  12.  
  13. class MySnapshotStore extends SnapshotStore {
  14. def loadAsync(persistenceId: String, criteria: SnapshotSelectionCriteria): Future[Option[SelectedSnapshot]] = ???
  15. def saveAsync(metadata: SnapshotMetadata, snapshot: Any): Future[Unit] = ???
  16. def saved(metadata: SnapshotMetadata): Unit = ???
  17. def delete(metadata: SnapshotMetadata): Unit = ???
  18. def delete(persistenceId: String, criteria: SnapshotSelectionCriteria): Unit = ???
  19. }
  20.  
  21. object PersistenceTCKDoc {
  22. new AnyRef {
  23. import akka.persistence.journal.JournalSpec
  24.  
  25. class MyJournalSpec extends JournalSpec {
  26. override val config = ConfigFactory.parseString(
  27. """
  28. |akka.persistence.journal.plugin = "my.journal.plugin"
  29. """.stripMargin)
  30. }
  31. }
  32. new AnyRef {
  33. import akka.persistence.snapshot.SnapshotStoreSpec
  34.  
  35. class MySnapshotStoreSpec extends SnapshotStoreSpec {
  36. override val config = ConfigFactory.parseString(
  37. """
  38. |akka.persistence.snapshot-store.plugin = "my.snapshot-store.plugin"
  39. """.stripMargin)
  40. }
  41. }
  42. new AnyRef {
  43. import java.io.File
  44.  
  45. import akka.persistence.journal.JournalSpec
  46. import org.iq80.leveldb.util.FileUtils
  47.  
  48. class MyJournalSpec extends JournalSpec {
  49. override val config = ConfigFactory.parseString(
  50. """
  51. |akka.persistence.journal.plugin = "my.journal.plugin"
  52. """.stripMargin)
  53.  
  54. val storageLocations = List(
  55. new File(system.settings.config.getString("akka.persistence.journal.leveldb.dir")),
  56. new File(config.getString("akka.persistence.snapshot-store.local.dir")))
  57.  
  58. override def beforeAll() {
  59. super.beforeAll()
  60. storageLocations foreach FileUtils.deleteRecursively
  61. }
  62.  
  63. override def afterAll() {
  64. storageLocations foreach FileUtils.deleteRecursively
  65. super.afterAll()
  66. }
  67.  
  68. }
  69. }
  70. }

By default, the shared instance writes journaled messages to a local directory named journal in the current working directory. The storage location can be changed by configuration:

  1. akka.persistence.journal.leveldb-shared.store.dir = "target/shared"

Actor systems that use a shared LevelDB store must activate the akka.persistence.journal.leveldb-shared plugin.

  1. akka.persistence.journal.plugin = "akka.persistence.journal.leveldb-shared"

This plugin must be initialized by injecting the (remote) SharedLeveldbStore actor reference. Injection is done by calling the SharedLeveldbJournal.setStore method with the actor reference as argument.

  1. trait SharedStoreUsage extends Actor {
  2. override def preStart(): Unit = {
  3. context.actorSelection("akka.tcp://example@127.0.0.1:2552/user/store") ! Identify(1)
  4. }
  5.  
  6. def receive = {
  7. case ActorIdentity(1, Some(store)) =>
  8. SharedLeveldbJournal.setStore(store, context.system)
  9. }
  10. }

Internal journal commands (sent by persistent actors) are buffered until injection completes. Injection is idempotent i.e. only the first injection is used.

§Local snapshot store

The default snapshot store plugin is akka.persistence.snapshot-store.local. It writes snapshot files to the local filesystem. The default storage location is a directory named snapshots in the current working directory. This can be changed by configuration where the specified path can be relative or absolute:

  1. akka.persistence.snapshot-store.local.dir = "target/snapshots"

§Custom serialization

Serialization of snapshots and payloads of Persistent messages is configurable with Akka's Serialization infrastructure. For example, if an application wants to serialize

  • payloads of type MyPayload with a custom MyPayloadSerializer and
  • snapshots of type MySnapshot with a custom MySnapshotSerializer

it must add

  1. akka.actor {
  2. serializers {
  3. my-payload = "docs.persistence.MyPayloadSerializer"
  4. my-snapshot = "docs.persistence.MySnapshotSerializer"
  5. }
  6. serialization-bindings {
  7. "docs.persistence.MyPayload" = my-payload
  8. "docs.persistence.MySnapshot" = my-snapshot
  9. }
  10. }

to the application configuration. If not specified, a default serializer is used.

§Testing

When running tests with LevelDB default settings in sbt, make sure to set fork := true in your sbt project otherwise, you'll see an UnsatisfiedLinkError. Alternatively, you can switch to a LevelDB Java port by setting

  1. akka.persistence.journal.leveldb.native = off

or

  1. akka.persistence.journal.leveldb-shared.store.native = off

in your Akka configuration. The LevelDB Java port is for testing purposes only.

§Miscellaneous

§State machines

State machines can be persisted by mixing in the FSM trait into persistent actors.

  1. import akka.actor.FSM
  2. import akka.persistence.{ Persistent, Processor }
  3.  
  4. class PersistentDoor extends Processor with FSM[String, Int] {
  5. startWith("closed", 0)
  6.  
  7. when("closed") {
  8. case Event(Persistent("open", _), counter) =>
  9. goto("open") using (counter + 1) replying (counter)
  10. }
  11.  
  12. when("open") {
  13. case Event(Persistent("close", _), counter) =>
  14. goto("closed") using (counter + 1) replying (counter)
  15. }
  16. }

§Configuration

There are several configuration properties for the persistence module, please refer to the reference configuration.