Stable Functions, Stable Objects, and Stable Classes #4789

luc-blaeser · 2024-11-30T12:59:21Z

Stable Functions, Stable Objects, and Stable Classes

This is a proposal to further strenghen orthogonal persistence in Motoko, by also supporting functions, objects, and classes to be automatically persisted, i.e. been used in Motoko's stable variables.

The feature is only supported with enhanced orthogonal persistence.

With this, the only non-persistent (non-stable) values would be lambdas, async handles (aka continuations, futures), and local async function references.

Example

With this, the following program can now be used for orthogonal persistence. For this example, you need a specific base library branch, which contains slight modifications for supporting stable functions.

import HashMap "base/HashMap";
import Hash "base/Hash";
import Nat "base/Nat";
import Iter "base/Iter";

actor {
    stable let hashMap = HashMap.HashMap<Nat, Text>(10, Nat.equal, Hash.hash); // finally persistent!

    public func populate(): async() {
        hashMap.put(1, "A");
        hashMap.put(2, "B");
        hashMap.put(3, "C");
    };

    public func list(): async [(Nat, Text)] {
        Iter.toArray<(Nat, Text)>(hashMap.entries());
    };
}

Stable Functions and Stable Scopes

A stable function is a named non-async local function in a stable scope, only closing over variables of a stable type.

A stable scope is:

the main actor,
an actor class,
a module imported with a unique identifier from a stable scope,
a named non-async function in a stable scope,
a class in a stable scope, or,
a named object in a stable scope.

Generic type parameters of stable functions and stable classes are bounded to stable types.

A stable function is also a stable type.

Syntactically, function types are prefixed by stable to denote a stable function, e.g.
stable X -> Y. Stable functions implicitly have a corresponding stable reference type.

A stable function type is a sub-type of a flexible function type with type-compatible signature, i.e. stable X' -> Y <: X -> Y' for X' <: X and Y' :< Y.

Upgrades of Stable Functions

Stable functions are upgraded as follows:

All stable functions that are reachable from stable variables are considered alive.
Each alive stable function must have a matching declaration in the new program version.
Stable functions match between program versions if they have an equal fully qualified name.
For matching functions, the function type of the new version must be compatible to the previous version (super-type).
For matching functions, the closure type in the new version must be compatible with the previous version, see below.

All other functions, such as lambdas, named functions in a lambda, async functions, or functions imported from a module without a unique import identifier, are flexible functions.

Stable Closures

On a stable function upgrade, the closure type of the stable function must remain compatible. The runtime system checks on upgrade:

The new version of the stable function does not capture more variables than the previous version.
The captured variable in the new version is a valid super-type of the previous version.

Specific aspects are to be considered for generic types used in the stable closure:

Generic types used for captured variables must match to previous declaration order (e.g. one cannot swap generic types).
The generic type bounds must remain compatible.
However, generic types do not need to be reified, see the reasoning in generics in stabe closures.

Runtime System Design

Function references are encoded by a function id in the following representation:

Stable function id, encoded as non-negative number:
A stable function reference that stays invariant across upgrades.
Flexible functiion id, encoded as negative number:
A flexible function reference that is invalidated on upgrade.

Each program version defines a set of named local functions that can be used as stable function references. Each such function obtains a stable function id on program initialization and upgrade. If the stable function was already declared in the previous version, its function id is reused on upgrade. Thereby, the compatibility of the function type and closure type are checked. Otherwise, if it is a new stable function, it obtains a new stable function id, or a recycled id.

The runtime system supports stable functions by two mechanisms:

Persistent virtual table for stable function calls:

The persistent virtual table maps stable function ids to Wasm table indices, for supporting dynamic calls of stable functions. Each entry also stores the hashed name of the stable function to match and rebind the stable function ids to the corresponding functions of the new Wasm binary on a program upgrade. Moreover, each entry also records the type of the closure, referring to the persistent type table. The table survives upgrades and is built and updated by the runtime system. To build and update the persistent virtual table, the compiler provides a stable function map, mapping the hashed name of a potentially stable function to the corresponding Wasm table index, plus its closure type pointing to the new type table. For performance, the stable function map is sorted by the hashed names.
Function literal table for materializing stable function literals:

As the compiler does not yet know the function ids of stable function literals/constants, this table maps a Wasm table index of the current program version to a stable function id. The function literal table is re-built on program initialization and upgrade. When a stable function literal is loaded, it serves for resolving the corresponding function id and thus the stable function reference. The table is discarded on upgrades and (re-)constructed by the runtime system, based on the information of the stable function map.

The runtime system distinguishes between flexible and stable function references by using a different encoding. This is to avoid complicated conversion logic been inserted by the compiler when a stable function reference is assigned to flexible reference, in particular in the presence of sharing (a function reference can be reached by both a stable and flexible function type) and composed types (function references can be deeply nested in a composed value that is assigned).

Compatibility Check

A stable function compatibility check is performed by the runtime system on upgrade.

It checks for a matching function in the new version.
The function type compatibility is implicitly covered by the upgrade memory compatibility check, since the stable function in use needs to be reachable by the stable actor type.
The closure compatibility is additionally checked for each mapped stable function. This covers all captured variables of the stable function. This check is supported by the information of the persistent virtual table and the stable function map.

Flexible function references are represented as negative function ids determining the Wasm table index, specifically -wasm_table_index - 1.

Garbage Collection

The runtime systems relies on a dedicated garbage collector of stable functions:

On pre-upgrade, the runtime systems determines which stable functions are still alive, i.e. transitively reachable from stable variables.
Only those alive stable functions need to exist in the new program version.
All other stable functions of the previous version are considered garbage and their slots in the virtual table can be recycled.
For efficiency, the GC is type-directed such that it only selectively traverses fields that may lead to stable function type. Note: Same objects may be revisited if appearing by a different static type.

Garbage collection is necessary to allow programs to use classes and stable functions in only flexible contexts or not even using imported classes or stable functions. Moreover, it allows programs to drop stable functions and classes, if they are no longer used for persistence.

Open Aspects

Make stable closure more portable, use a record representation for the captured variables.
The runtime system should report the name of missing stable functions in new version. Currently, only the name hash is displayed.
Supporing type-directed GC for stable functions reachable by generic type arguments. Currently, generic type arguments are fully traversed, similar to usual GC marking.
Recycle slots in the peristent virtual table.
Updating documentation, examples etc.
Lift the base library to support stable functions.

github-actions · 2024-12-02T10:05:23Z

Comparing from d87550d to 4fd9821:
In terms of gas, no changes are observed in 5 tests.
In terms of size, 5 tests improved and the mean change is -0.0%.

crusso · 2024-12-02T11:55:05Z

src/mo_types/type.ml

@@ -68,7 +69,7 @@ and field = {lab : lab; typ : typ; src : src}
 and con = kind Cons.t
 and kind =
  | Def of bind list * typ
-  | Abs of bind list * typ
+  | Abs of bind list * typ * int option


What's the int option for?

I need to document this. This is the index of the type parameter according to declaration order in its scope, i.e. starting with generic type parameter of outer scope, then continuing with inner scope. This is used for checking compatibility of stable closures if they refer to values of generic type parameters.

crusso · 2024-12-02T11:58:11Z

src/mo_types/type.ml

@@ -489,10 +494,9 @@ let close cs t =
  let sigma = List.fold_right2 ConEnv.add cs ts ConEnv.empty in
  subst sigma t

-let close_binds cs tbs =
+let close_binds cs tbs is_stable =


is_stable appears unused?

Thanks. A relict...

luc-blaeser added 30 commits October 18, 2024 18:21

Runtime system implementation of stable functions

7e6aed0

Continue implementation

d793a91

Continue RTS implementation

0056bf7

Continue compiler implementation

d509abf

Merge branch 'master' into luc/stable-functions

de353ce

Refine compiler support

5cdb086

Continue

6d6b701

Provisional support for flexible function references

0824a78

Add test case

884af72

Remove debugging code

043bb44

Extend type system for stable functions

71d25ab

Code refactoring

375d668

Adjust test

2e01e16

Adjust interpreter

49968e1

Adjust RTS tests

8ede27e

Add RTS unit test

a01ea76

Adjust test

2c3d872

Refine system functions type

427d34f

Adjust tests

8cb1763

Obtain qualified name for functions

c7a2235

Use qualified identifier for stable functions

05d6e8e

Distinguish modules

e911810

Adjust function type

fe42575

Refine type check for flexible functions

61b4da0

Remove debug functionality

1e5a400

Add test case

a9d9a6e

Support generic stable functions

9ea3dad

Refine generic stable functions

74e9b25

Add test case

06ddfae

Prepare stable closure compatibility check

204101a

luc-blaeser added 12 commits November 29, 2024 21:32

Refactor generic stable closure bounds

4472f86

Renumber error codes

18771bb

Temporarily disable ocamlformat

913acb8

Refine capture analysis

09f80e9

Temporarily disable base lib build

5e70b91

Merge branch 'master' into luc/stable-functions

9f5df3b

Manual merge conflict resolution

f792c1b

Adjusting comments

0a8fa9a

Adjust tests

ecfd254

Adjust tests

192d291

Adjust tests

fe50a61

Temporarily disable base lib building

8c244af

luc-blaeser self-assigned this Nov 30, 2024

luc-blaeser added enhancement feature New feature or request labels Nov 30, 2024

luc-blaeser added 2 commits November 30, 2024 14:38

Adjust test

1f3717a

Adjust tests

7106906

crusso reviewed Dec 2, 2024

View reviewed changes

luc-blaeser marked this pull request as ready for review December 2, 2024 16:55

luc-blaeser mentioned this pull request Dec 3, 2024

A sketch for stable functions #707

Open

luc-blaeser added 2 commits January 8, 2025 14:08

Merge branch 'master' into luc/stable-functions

0a4fbb3

Manual merge conflict resolution

282d4b0

luc-blaeser requested a review from a team as a code owner January 8, 2025 15:04

luc-blaeser added 5 commits January 8, 2025 19:04

Adjust tests for renumbered error codes

dadde1f

Generic closure: Add test case, design rationale

e0b61ca

Some documentation refactoring

db79cd9

Small adjustment for test

e37e17b

Adjust test

4fd9821

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stable Functions, Stable Objects, and Stable Classes #4789

Stable Functions, Stable Objects, and Stable Classes #4789

luc-blaeser commented Nov 30, 2024 •

edited

Loading

github-actions bot commented Dec 2, 2024 •

edited

Loading

crusso Dec 2, 2024

luc-blaeser Dec 2, 2024

crusso Dec 2, 2024

luc-blaeser Dec 2, 2024

Stable Functions, Stable Objects, and Stable Classes #4789

Are you sure you want to change the base?

Stable Functions, Stable Objects, and Stable Classes #4789

Conversation

luc-blaeser commented Nov 30, 2024 • edited Loading

Stable Functions, Stable Objects, and Stable Classes

Example

Stable Functions and Stable Scopes

Upgrades of Stable Functions

Stable Closures

Runtime System Design

Compatibility Check

Garbage Collection

Open Aspects

github-actions bot commented Dec 2, 2024 • edited Loading

crusso Dec 2, 2024

Choose a reason for hiding this comment

luc-blaeser Dec 2, 2024

Choose a reason for hiding this comment

crusso Dec 2, 2024

Choose a reason for hiding this comment

luc-blaeser Dec 2, 2024

Choose a reason for hiding this comment

luc-blaeser commented Nov 30, 2024 •

edited

Loading

github-actions bot commented Dec 2, 2024 •

edited

Loading