Lightweight deployments

Deployments at the edge of the cloud may need to minimize resource usage and be capable of running in resource-constrained environments. Akka Edge applications can be configured and built to run with low resource usage and adapt to changing resource needs.

Some approaches to running lightweight deployments for Akka Edge applications include:

  • Lightweight Kubernetes distributions
  • Cloud-optimized JVMs
  • GraalVM Native Image
  • Multidimensional autoscaling

These approaches are useful when running applications in edge environments that are on-premises or in 5G edge computing infrastructure, including cloud provider products such as AWS Wavelength, AWS Outposts, Google Distributed Cloud Edge, or Azure Stack Edge.

Lightweight Kubernetes

Kubernetes has become the standard orchestration tool for deploying containers, and there are lightweight Kubernetes distributions designed specifically for edge computing environments. K3s and MicroK8s are two such distributions that are well suited to deploying containerized Akka Edge applications.
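
As a minimal sketch, a single-node k3s cluster can be installed with the upstream install script and then used to deploy a containerized service; the manifest file name here is only an example:

    # Install a single-node k3s cluster (see https://k3s.io for options)
    curl -sfL https://get.k3s.io | sh -

    # Deploy an Akka Edge service using the kubectl bundled with k3s
    sudo k3s kubectl apply -f local-drone-control.yaml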

Cloud-optimized JVMs

The Java Virtual Machine (JVM) can be configured to run with lower resource usage. OpenJDK’s Project Leyden has not been released yet, but aims to improve the startup time, time to peak performance, and footprint of Java programs.

OpenJ9

OpenJ9 is a JVM that is optimized for running in cloud environments and can be configured for lower resource usage. Options include tuning for virtualized environments, class data sharing, and ahead-of-time (AOT) compilation for faster starts and warmup.
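
As a rough sketch, the following OpenJ9 options illustrate that kind of tuning; the cache name, cache directory, cache size, and jar path are example values:

    # -Xtune:virtualized  tunes the JIT for virtualized / cloud environments
    # -Xshareclasses      enables class data sharing, which also caches AOT-compiled code
    # -Xscmx              sets the maximum size of the shared class cache
    java -Xtune:virtualized \
      -Xshareclasses:name=akka-edge,cacheDir=/opt/shareclasses \
      -Xscmx80m \
      -jar /app/local-drone-control.jar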

GraalVM Native Image

GraalVM Native Image compiles Java or Scala code ahead-of-time to a native executable. A native image executable provides lower resource usage compared with the JVM, smaller deployments, faster starts, and immediate peak performance — making it ideal for Akka Edge deployments in resource-constrained environments and for responsiveness under autoscaling.

Native Image builds can be integrated into your application using build tool plugins, such as the GraalVM Native Build Tools for Maven and Gradle, or the sbt-native-image plugin for sbt builds.

Native Image builds need to be configured in various ways. See the build tool plugins and the Native Image build configuration reference documentation for more information on how to do this.
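
For an sbt build, the configuration might look roughly like the following sketch, assuming the sbt-native-image plugin has been added to the project; the main class and the options passed to the image builder are illustrative:

    // build.sbt (sketch)
    enablePlugins(NativeImagePlugin)

    // hypothetical main class for the service
    Compile / mainClass := Some("local.drones.Main")

    nativeImageOptions ++= Seq(
      "--no-fallback",                  // fail the build instead of falling back to a JVM launcher
      "-H:+ReportExceptionStackTraces"  // more detail if image building fails
    )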

An important part of the configuration is the reachability metadata, which covers dynamic features that are used at runtime and can’t be discovered statically at build time. Akka provides Native Image metadata for many of its modules, and can detect and register user messages when using Akka Serialization Jackson, so an application may not need to define any additional reachability metadata.

For more details on what support Akka provides see the Akka documentation on Native Image.

GraalVM tracing agent

If an application uses special features or libraries not supported out of the box by Akka, it is possible to gather metadata and create configuration files automatically using the GraalVM tracing agent. The tracing agent tracks the usage of dynamic features while the application runs normally on a JVM, and outputs Native Image configuration based on the code paths that were exercised. The build tool plugins for Native Image provide ways to run locally with the tracing agent enabled. It can also be useful to deploy your application to a testing environment with the tracing agent enabled, to capture usage in an actual deployment environment while exercising all the Akka Edge features that are being used.
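
When running on a GraalVM JDK, the agent can be enabled with a JVM option along these lines; the jar name and output directory are example values:

    # Run the application with the tracing agent, writing Native Image
    # configuration files to the given directory as code paths are exercised.
    java -agentlib:native-image-agent=config-output-dir=src/main/resources/META-INF/native-image \
      -jar target/local-drone-control.jar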

Note

The GraalVM Native Image tracing agent can only generate configuration for code paths that were observed during the running of an application. When using this approach for generating configuration, make sure that tests exercise all possible code paths. In particular, check dynamic serialization of classes used for persistence or cross-node communication.

GraalVM Native Image build examples

The Local Drone Control service from the Akka Edge guide has been configured for GraalVM Native Image as an example.

Multidimensional autoscaling

An application using Akka Edge features, such as event sourcing and projections over gRPC, cannot scale to zero when idle. It is possible, however, for the application to be scaled to and from “near zero”: scaling down to minimal resource usage when idle, and scaling up and out when load increases. Multidimensional autoscaling scales both vertically (lower or higher resource allocation) and horizontally (fewer or more instances), and can be used to align resource usage with actual demand for dynamic workloads.

In Kubernetes, the horizontal pod autoscaler (HPA) and the vertical pod autoscaler (VPA) can be combined, so that when the service is idle it is both scaled down with minimal resource requests, and scaled in to a minimal number of pods.

A multidimensional autoscaling configuration for an Akka Edge application in Kubernetes can be set up with:

  • Custom VPA recommender for vertical autoscaling, configured to respond quickly to “activate” the application. The default vertical pod autoscaler bases its recommendations for resource requests on long time frames (days), so a custom VPA recommender is needed to move from minimal resource allocation to higher requests more quickly.

  • HPA configured to horizontally autoscale based on custom metrics, such as the number of active event sourced entities in an Akka Cluster. Custom metrics need to be exposed by the application and configured for the Kubernetes custom metrics API with an “adapter”, such as the Prometheus adapter (example manifests are sketched after this list).

  • Application availability ensured by having a minimum of 2 replicas, and configuring a pod disruption budget (PDB) so that no more than one pod is unavailable at a time. When the vertical autoscaler makes changes, pods are evicted and restarted with updated resource requests. In-place changes are not currently supported by Kubernetes.
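
As a sketch of the horizontal autoscaling and availability parts of such a setup, the manifests below show an HPA driven by a custom per-pod metric together with a PDB. The resource names, labels, metric name, and target values are example values and depend on how the application and the metrics adapter are set up; the VPA object with its custom recommender is configured separately.

    apiVersion: autoscaling/v2
    kind: HorizontalPodAutoscaler
    metadata:
      name: local-drone-control
    spec:
      scaleTargetRef:
        apiVersion: apps/v1
        kind: Deployment
        name: local-drone-control
      minReplicas: 2                       # keep at least two replicas for availability
      maxReplicas: 5
      metrics:
        - type: Pods
          pods:
            metric:
              name: akka_active_entities   # hypothetical custom metric exposed via the metrics adapter
            target:
              type: AverageValue
              averageValue: "100"
    ---
    apiVersion: policy/v1
    kind: PodDisruptionBudget
    metadata:
      name: local-drone-control
    spec:
      maxUnavailable: 1                    # at most one pod unavailable during autoscaler-driven evictions
      selector:
        matchLabels:
          app: local-drone-control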

Note

The Kubernetes horizontal and vertical pod autoscalers should not be triggered using the same metrics. As the default vertical autoscaler is currently designed for resource metrics (CPU and memory), the horizontal autoscaler should be configured to use custom metrics.

Multidimensional autoscaling examples

The Local Drone Control service from the Akka Edge guide has been configured for multidimensional autoscaling. The example uses GraalVM Native Image builds for low resource usage, combines the vertical and horizontal pod autoscalers, and runs in k3s (lightweight Kubernetes).
