Substructural type system
Type systems |
---|
General concepts |
Major categories |
|
Minor categories |
Substructural type systems are a family of type systems analogous to substructural logics where one or more of the structural rules are absent or only allowed under controlled circumstances. Such systems can constrain access to system resources such as files, locks, and memory by keeping track of changes of state and prohibiting invalid states.[1]: 4
Different substructural type systems
[edit]Several type systems have emerged by discarding some of the structural rules of exchange, weakening, and contraction:
Exchange | Weakening | Contraction | Use | |
---|---|---|---|---|
Ordered | — | — | — | Exactly once in order |
Linear | Allowed | — | — | Exactly once |
Affine | Allowed | Allowed | — | At most once |
Relevant | Allowed | — | Allowed | At least once |
Normal | Allowed | Allowed | Allowed | Arbitrarily |
- Ordered type systems (discard exchange, weakening and contraction): Every variable is used exactly once in the order it was introduced.
- Linear type systems (allow exchange, but neither weakening nor contraction): Every variable is used exactly once.
- Affine type systems (allow exchange and weakening, but not contraction): Every variable is used at most once.
- Relevant type systems (allow exchange and contraction, but not weakening): Every variable is used at least once.
- Normal type systems (allow exchange, weakening and contraction): Every variable may be used arbitrarily.
The explanation for affine type systems is best understood if rephrased as “every occurrence of a variable is used at most once”.
Ordered type system
[edit]Ordered types correspond to noncommutative logic where exchange, contraction and weakening are discarded. This can be used to model stack-based memory allocation (contrast with linear types which can be used to model heap-based memory allocation).[1]: 30–31 Without the exchange property, an object may only be used when at the top of the modelled stack, after which it is popped off, resulting in every variable being used exactly once in the order it was introduced.
Linear type systems
[edit]Linear types correspond to linear logic and ensure that objects are used exactly once. This allows the system to safely deallocate an object after its use,[1]: 6 or to design software interfaces that guarantee a resource cannot be used once it has been closed or transitioned to a different state.[2]
The Clean programming language makes use of uniqueness types (a variant of linear types) to help support concurrency, input/output, and in-place update of arrays.[1]: 43
Linear type systems allow references but not aliases. To enforce this, a reference goes out of scope after appearing on the right-hand side of an assignment, thus ensuring that only one reference to any object exists at once. Note that passing a reference as an argument to a function is a form of assignment, as the function parameter will be assigned the value inside the function, and therefore such use of a reference also causes it to go out of scope.
The single-reference property makes linear type systems suitable as programming languages for quantum computing, as it reflects the no-cloning theorem of quantum states. From the category theory point of view, no-cloning is a statement that there is no diagonal functor which could duplicate states; similarly, from the combinatory logic point of view, there is no K-combinator which can destroy states. From the lambda calculus point of view, a variable x
can appear exactly once in a term.[3]
Linear type systems are the internal language of closed symmetric monoidal categories, much in the same way that simply typed lambda calculus is the language of Cartesian closed categories. More precisely, one may construct functors between the category of linear type systems and the category of closed symmetric monoidal categories.[4]
Affine type systems
[edit]Affine types are a version of linear types allowing to discard (i.e. not use) a resource, corresponding to affine logic. An affine resource can be used at most once, while a linear one must be used exactly once.
Relevant type system
[edit]Relevant types correspond to relevant logic which allows exchange and contraction, but not weakening, which translates to every variable being used at least once.
The resource interpretation
[edit]The nomenclature offered by substructural type systems is useful to characterize resource management aspects of a language. Resource management is the aspect of language safety concerned with ensuring that each allocated resource is deallocated exactly once. Thus, the resource interpretation is only concerned with uses that transfer ownership – moving, where ownership is the responsibility to free the resource.
Uses that don't transfer ownership – borrowing – are not in scope of this interpretation, but lifetime semantics further restrict these uses to be between allocation and deallocation.
Disowning move | Obligatory move | Move quantification | Enforcible function call state machine | |
---|---|---|---|---|
Normal type | No | No | Any number of times | Topological ordering |
Affine type | Yes | No | At most once | Ordering |
Linear type | Yes | Yes | Exactly once | Ordering and completion |
Resource-affine types
[edit]Under the resource interpretation, an affine type can not be spent more than once.
As an example, the same variant of Hoare's vending machine can be expressed in English, logic and in Rust:
English | Logic | Rust |
---|---|---|
A coin can buy you a piece of candy, a drink, or go out of scope. |
Coin ⊸ Candy Coin ⊸ Drink Coin ⊸ ⊤ |
fn buy_candy(_: Coin) -> Candy { Candy{} }
fn buy_drink(_: Coin) -> Drink { Drink{} }
|
What it means for Coin to be an affine type in this example (which it is unless it implements the Copy trait) is that trying to spend the same coin twice is an invalid program that the compiler is entitled to reject:
let coin = Coin{};
let candy = buy_candy(coin); // The lifetime of the coin variable ends here.
let drink = buy_drink(coin); // Compilation error: Use of moved variable that does not possess the Copy trait.
In other words, an affine type system can express the typestate pattern: Functions can consume and return an object wrapped in different types, acting like state transitions in a state machine that stores its state as a type in the caller's context – a typestate. An API can exploit this to statically enforce that its functions are called in a correct order.
What it doesn't mean, however, is that a variable can't be used without using it up:
// This function just borrows a coin: The ampersand means borrow.
fn validate(_: &Coin) -> Result<(), ()> { Ok(()) }
// The same coin variable can be used infinitely many times
// as long as it is not moved.
let coin = Coin{};
loop {
validate(&coin)?;
}
What Rust is not able to express is a coin type that cannot go out of scope – that would take a linear type.
Resource-linear types
[edit]Under the resource interpretation, a linear type not only can be moved, like an affine type, but must be moved – going out of scope is an invalid program.
{
// Must be passed on, not dropped.
let token = HotPotato{};
// Suppose not every branch does away with it:
if (!queue.is_full()) {
queue.push(token);
}
// Compilation error: Holding an undroppable object as the scope ends.
}
An attraction with linear types is that destructors become regular functions that can take arguments, can fail and so on.[5] This may for example avoid the need to keep state that is only used for destruction. A general advantage of passing function dependencies explicitly is that the order of function calls – destruction order – becomes statically verifiable in terms of the arguments' lifetimes. Compared to internal references, this does not require lifetime annotations as in Rust.
As with manual resource management, a practical problem is that any early return, as is typical of error handling, must achieve the same cleanup. This becomes pedantic in languages that have stack unwinding, where every function call is a potential early return. However, as a close analogy, the semantic of implicitly inserted destructor calls can be restored with deferred function calls.[6]
Resource-normal types
[edit]Under the resource interpretation, a normal type does not restrict how many times a variable can be moved from. C++ (specifically nondestructive move semantics) falls in this category.
auto coin = std::unique_ptr<Coin>();
auto candy = buy_candy(std::move(coin));
auto drink = buy_drink(std::move(coin)); // This is valid C++.
Programming languages
[edit]The following programming languages support linear or affine types[citation needed]:
See also
[edit]References
[edit]- ^ a b c d Walker, David (2002). "Substructural Type Systems". In Pierce, Benjamin C. (ed.). Advanced Topics in Types and Programming Languages (PDF). MIT Press. pp. 3–43. ISBN 0-262-16228-8.
- ^ Bernardy, Jean-Philippe; Boespflug, Mathieu; Newton, Ryan R; Peyton Jones, Simon; Spiwack, Arnaud (2017). "Linear Haskell: practical linearity in a higher-order polymorphic language". Proceedings of the ACM on Programming Languages. 2: 1–29. arXiv:1710.09756. doi:10.1145/3158093. S2CID 9019395.
- ^ Baez, John C.; Stay, Mike (2010). "Physics, Topology, Logic and Computation: A Rosetta Stone". In Springer (ed.). New Structures for Physics (PDF). pp. 95–174.
- ^ Ambler, S. (1991). First order logic in symmetric monoidal closed categories (PhD). U. of Edinburgh.
- ^ "Vale's Vision". Retrieved 6 December 2023.
Higher RAII, a form of linear typing that enables destructors with parameters and returns.
- ^ "Go by Example: Defer". Retrieved 5 December 2023.
Defer is used to ensure that a function call is performed later in a program's execution, usually for purposes of cleanup. defer is often used where e.g. ensure and finally would be used in other languages.
- ^ "6.4.19. Linear types — Glasgow Haskell Compiler 9.7.20230513 User's Guide". ghc.gitlab.haskell.org. Retrieved 2023-05-14.
- ^ [1]