Writing Simplifications
A comprehensive guide to writing simplifications for Kontrol
What are simplifications?
Symbolic execution in K in general and in Kontrol in particular is performed on a symbolic state consisting of:
a configuration, which is a collection of cells; and
a path condition, which is a set of Boolean constraints on the symbolic variables present in the configuration.
The purpose of simplifications, as their name suggests, is to simplify the job of the symbolic execution engine. They describe ways of getting the configuration and/or the path condition into a preferred format or state mathematical theorems that are either too complicated for the SMT solver to reason about both automatically and tractably or too simple for us to even want to call the SMT solver. Simplifications are K rewrite rules that have the [simplification]
attribute, and their general form is:
meaning that all occurrences of the term LHS
in the current symbolic state should be replaced with the term RHS
when the Boolean constraints C1
, ..., CN
are entailed by the current path condition PC
, that is, when
Importantly, a simplification is sound if and only if
meaning that the LHS and the RHS of the simplification have to be equivalent under assumptions C1
, ..., CN
, that is, that a simplification must not lose information. This is essential to keep in mind, especially since the K rewrite symbol =>
can easily be misunderstood as implication in this context, and implications both can lose information and are commonly used in program verification (for instance, as forward consequence in various program logics). This means that, for example:
would not be a sound simplification, whereas
on the other hand, would.
Simplifications by example
Let us now take a look at a number of illustrative examples of simplifications that Kontrol uses, starting from:
which is simple, yet fairly instructive. Let us examine its attributes, which tell us:
That the label/identifier of the simplification is
plus-group-conc
.That the simplification is of priority 40, meaning that it is of higher priority than the default (which is 50). Assigning higher priority to simplifications allows us, for example, to put terms into a canonical form before applying other simplifications (see (S2) below), whereas assigning lower priority allows us to have last-resort simplification, only to be used if all others fail.
That the simplification is to be applied only if
A
is symbolic andB
andC
are concrete. This also reveals the purpose of the simplification, which is to simplify the addition by grouping the concrete operands together in order for them to be added up, producing a two-term addition instead of the three-term addition we started from.
At this point, we need to take a closer look at how the above simplification will be processed by the backend, revealing important principles that a Writer of Simplifications must remember.
The LHS of a simplification is matched syntactically. For example, for simplification
plus-group-conc
to be applied, we have to syntactically have a term of the form(A +Int B) +Int C
, whereA
is symbolic andB
andC
are concrete.Functions with all concrete parameters are always evaluated before any other simplification is applied. This means, for example, that the term
1 +Int 2
will always be evaluated to3
without any other simplification applied to it beforehand. Interestingly, if we look at the simplificationplus-group-conc
above, this means that the attributesymbolic(A)
is redundant. In particular, given thatB
already has to be concrete, ifA
were also concrete the expressionA +Int B
would get fully evaluated first and the term would no longer syntactically match the LHS of the simplification. In the development, we choose to keep the attributesymbolic(A)
explicit for clarity. All of this tells us that when writing simplifications with a specific goal in mind, we have to ask ourselves: "What happens if some of the symbolic variables in my simplification were concrete?", and it turns out that sometimes further simplifications are required in such cases, as we will see later on.The direction of simplification is always bottom-up. This means that the backend will simplify a given term starting from the atoms, moving left-to-right and bottom-to-top: for example, given the term
A +Int ((B +Int C) +Int D)
, the order of simplification would be:A
,B
,C
,B +Int C
,D
,((B +Int C) +Int D)
,A +Int ((B +Int C) +Int D)
. The simplification passes are repeated for as long as one successful simplification or concrete evaluation can be performed. This means that we have to be careful not to introduce circular simplifications, an example of which would be:
Now, given all of the above, let us try to use plus-group-conc
to simplify the term
First, going bottom-up, we spot that the (LHS of the) simplification can be matched against the sub-term (Y +Int 5) +Int 3
, with the corresponding substitution equalling {A: Y, B: 5, C: 3}
, and since the binding for A
is symbolic and the bindings for B
and C
are concrete, we can apply the simplification to obtain
In the next pass (always bottom-up), we first concretely evaluate 5 +Int 3
, obtaining
and then, as part of the same pass, we match the sub-term (Y +Int 8) +Int Z
against plus-group-conc
with the substitution {A: Y, B: 8, C: Z}
, but cannot apply the simplification as the obtained binding for C
is not concrete.
As our next example, we consider the simplification
which normalizes multiplication by moving concrete operands to the left. First, note that this is not a circular simplification on its own. This is because, as discussed above, if B
is concrete then for the simplification to be applied A
must be symbolic. Once the simplification is applied, A
will end up on the RHS of the multiplication, which must be concrete if the simplification is to be applied again, producing a contradiction. Second, this normalization choice allows us to minimize duplication of simplifications, as illustrated by the following simplification:
which, similarly to plus-group-conc
, groups concrete operands for evaluation, but this time in the context of integer division, /Int
. Unsurprisingly, the simplification can be applied only if C
is different from zero, that is, if the division on the LHS is defined, and also if C
divides A
.
First, note that duplicating this simplification so that B
is concrete instead of A
:
is not necessary because of mul-conc-left
. In fact, such a simplification would never be applied. To understand why this is so, consider, for example, the term
to which we would expect div-group-conc-b
to apply. However, given that simplifications are performed bottom-up, we will first get to apply mul-conc-left
to X *Int 6
, obtaining
to which div-group-conc
would then apply, but not div-group-conc-b
.
Second, note that this simplification has the preserves-definedness
attribute. This attribute should be used when either the LHS or the RHS contains a partial function, but we are certain that both the LHS and RHS are defined for all use cases. In this case, the partial function is /Int
and both the LHS and the RHS are guaranteed to be defined given the notBool C ==Int 0
requirement.
More advanced simplifications: Handling the tension between byte arrays and integers
In EVM, data can be stored in two ways---as 32-byte (or 256-bit) integers (for example, in the word stack or in storage) or as byte arrays (for example, in the memory)---and the language semantics often needs to switch between these views and manipulate the associated data. The KEVM functions relevant for this part of the symbolic reasoning of Kontrol are:
#asWord(B:Bytes) : Bytes -> Int
, which denotes the unsigned integer represented by the lowest 32 bytes (or 256 bits) of byte arrayB
. This function is total.#buf(WIDTH:Int, VALUE:Int) : Int -> Int -> Bytes
, which denotes a byte array (or buffer)WIDTH
bytes long, the contents of which, when interpreted as an unsigned integer, equalVALUE
. For example,#buf(1, 9)
represents a single byte with contents00001001
. For#buf
to be well-defined, the following conditions need to be met:0 <=Int WIDTH
, unsurprisingly stating that buffers cannot have negative length;0 <=Int VALUE <Int 2 ^Int (8 *Int WIDTH)
, meaning that the value held in the buffer must fit into the buffer.
lengthBytes(B:Bytes): Bytes -> Int
denotes the length of the byte arrayB
. This function is total and it is guaranteed to be non-negative.+Bytes : Bytes -> Bytes -> Bytes
denotes byte array concatenation. This function is total and is the key function in the representation of symbolic byte arrays. It is right-associative, is always normalized to this form, and also all consecutive concrete byte arrays in a concatenation are joined into a larger concrete byte array using the simplification
#range(B:Bytes, START:Int, WIDTH:Int)
denotes a slice of the byte array starting from (0-indexed) byteSTART
andWIDTH
bytes long, with zero-padding if the slice goes beyond the length of the byte array. This function is total, returning the the empty byte array whenSTART <Int 0
orWIDTH <Int 0
.B:Bytes [ START:Int := B':Bytes ]
denotes an update of the byte arrayB
from (0-indexed) byteSTART
with byte arrayB'
. This function is total, returning the the empty byte array whenSTART <Int 0
.
Let us now take a look at a number of simplifications that capture how these functions operate and interact with each other in the context of symbolic reasoning, starting from #asWord
and #buf
.
Leading zeros in a byte array do not affect its integer representation:
Only the last 32 bytes of a byte array count towards its integer representation:
Slicing a byte array from the right does not affect its integer representation if all of the higher bytes are zeros, version 1:
#asWord
is the left-inverse of#buf
for 32-byte unsigned integers:
#buf
is the left-inverse of#asWord
up to mismatches in byte array length:
When the slice is limited to the first
+Bytes
-junct:
When the slice is limited to the second
+Bytes
-junct:
When the slice includes the entire first
+Bytes
-junct:
When the slice starts within the first
+Bytes
-junct:
Expressing a slice of a slice into a single slice:
Slicing a byte array from the right does not affect its integer representation if all of the higher bytes are zeros, version 2:
When a byte array update is limited to the first
+Bytes
-junct:
One byte array update subsumes another if it, intuitively speaking, starts earlier and ends later:
Expert simplifications: Automating storage slot updates
In EVM, contract fields are stored in storage slots, each of which holds 32 bytes, and each of which may contain multiple fields, depending on their size. A general slot update is of the form
where:
SLOT
is a byte array representing the slot to be updated;VALUE
is an integer representing the value to which a contract field in that slot should be updated;MASK
is a bit-mask that is used to zero the part of the slot corresponding to the contract field to be updated; andSHIFT
is a power of two that is used to shift the value so that it aligns with the part of the slot corresponding to the contract field to be updated.
For example, if we were updating a given 8-byte uint64
field stored in a given slot starting from byte 8
(note that byte indices are computed from right to left), this would be achieved as follows, taking:
MASK ==Int 115792089237316195423570985008687907852929702298719625576012656144555070980095
, that is,ffffffffffffffffffffffffffffffff0000000000000000ffffffffffffffff
in hex; andSHIFT ==Int 2 ^Int 32
Kontrol is able to automate this reasoning without resorting to SMT bit-level reasoning through its simplifications. Let us now understand how this is done, recalling the general form:
and noting that MASK
and SHIFT
are always concrete, as they are generated by the Solidity compiler and are part of the contract bytecode. Therefore, our simplifications have to account for the cases in which at least one of VALUE
and SLOT
is symbolic.
Identifying slot masks
We start by understanding how to capture slot masks correctly. A 32-byte (that is, 256-bit) integer MASK
is a slot mask if and only if it is of the binary form:
where:
all bits of
SHIFT
andREST
, if any, are ones;all bits of
WIDTH
are zeros, andWIDTH
is not of length zero; andSHIFT
andWIDTH
are aligned to bytes, that is, their length is divisible by8
.
Now, to identify the length of SHIFT
, we could use an auxiliary function that returns the index of the first 0
bit of a given 256-bit integer, and to identify the length of WIDTH
, we could similarly use a function that returns the index of the first 1
bit. We proceed to define these as follows:
leaving the correctness check of the success cases as an exercise to the reader, and noting that we return -1
to indicate failure when the argument is outside of the range [0, pow256)
and also treat trying to find the first one-bit of 0
and the first zero-bit of maxUInt256
as failures.
With these functions in place, we can now identify the length of SHIFT
of a mask, in bits and in bytes, straightforwardly as follows:
noting that if #getMaskShiftBits
fails, so will #getMaskShiftBytes
since -1 modInt 8 =/=Int 0
.
Computing the length of WIDTH
, on the other hand, is slightly more involved:
because we need to separately treat the case in which the REST
part of the mask is empty, that is, when #getFirstOneBit(X >>Int #getMaskShiftBits(X)) ==Int 0
. If we did not, then #getFirstOneBit(X >>Int #getMaskShiftBits(X))
would return -1
erroneously.
Now, we are able to state formally what it means for a 256-bit integer to be a valid slot mask:
Let us inspect the is-mask-true
rule in more detail, in the light of the three requirements given above. Given that 0 <=Int #getMaskShiftBytes(X)
and 0 <=Int #getMaskWidthBytes(X)
hold by the requires clause, then:
all bits of
SHIFT
, if any, are ones;all bits of
WIDTH
are zeros, andWIDTH
is not of length zero; andSHIFT
andWIDTH
are aligned to bytes meaning that what is left is to show that all bits ofREST
, if any, are ones. This is achieved by the check
where, since having ( 2 ^Int ( #getMaskShiftBits(X) +Int #getMaskWidthBits(X) ) -Int 1 )
will set all of the bits corresponding to SHIFT
and WIDTH
to one, the only way for the bitwise-or to equal maxUInt256
is if all the remaining bits, which correspond to REST
, are also ones.
Identifying value shifts
In addition to identifying masks, we also need to identify when SHIFT
(as per the general slot update) is a valid shift, and this is when it is a byte-aligned power of two:
Executing slot updates
Now we are in the position to write slot update simplifications. Before doing so, we recall their general form once more:
First, we consider the case in which SLOT
is symbolic, where we have to write a simplification that captures the effect of the mask, zeroing the appropriate part of the slot:
and we unpack it in full detail. As slot updates are morally performed on byte arrays, we move from integers to byte arrays by constructing #buf(32, SLOT)
, a 32-byte byte array that when interpreted as an unsigned integer yields SLOT
. Note that #buf(32, SLOT)
is well-defined due to the #rangeUInt(256, SLOT)
hypothesis, which tells us that SLOT
fits into 32 bytes. Then, we use the byte array update, B [ I := B' ]
, which pastes byte array B'
into byte array B
starting from index I
, to set WIDTH
bytes to 0
starting from 32 -Int (SHIFT +Int WIDTH)
, which achieves the desired effect. Finally, we convert the byte array back to an integer using #asWord
.
This, in general, results in a term of the form
Such a term, which we denote by MASKED_SLOT
, can be divided into three parts:
[P1]: the part of the slot before the part to be updated (possibly empty):
B1 +Bytes ... +Bytes BI
;[P2]: the part of the slot to be updated, now zeroed:
#buf(#getMaskWidthBytes(MASK), 0)
; and[P3]: the rest of the slot (possibly empty):
BI+1 +Bytes ... +Bytes BN
.
This form, however, could end up being structured in less appealing ways if, for example, BI
or BI+1
are concrete. In that case, the bytes-concat-left-assoc-conc
simplification from the previous part will fire and bring them together into a single concrete byte array, obfuscating the zeroed part. In the worst case, when both [P1] and [P3] are concrete, the masking will result in just a concrete number.
Continuing, we are now dealing with a term of the form
and write simplifications that pinpoint into which part of a byte array concatenation the bitwise-or should propagate, should MASKED_SLOT
have that form:
In particular, the bitwise-or should propagate to the left if A
(which here corresponds to SHIFT *Int VALUE
) has no bits set in its last LEN
bytes (where LEN
denotes the length of B2
), and to the right if A
fits into LEN
bytes.
Ideally, these simplifications would be sufficient for the general form of slot updates. However, recall that we also have in action the built-in simplifications for #asWord
and #buf
reasoning from the previous part. So, for example, if we had a slot with three pieces of information, X
, Y
, and Z
as follows:
and we wanted to update Y
to some value W
, after the masking, and noting that SHIFT ==Int 2 ^ 96
, we would have
and after applying bor-update-to-right
, we would have
but then asWord-trim-leading-zeros
would fire, creating
after which asWord-buf-inversion
would fire, resulting in
To account for these cases, we now need the following simplification:
which structures the (part of the) slot by putting X
into the bytes above the SHIFT
if Y
is unaffected by the bitwise-or. When this rule fires on our example, we get:
which will require some further massaging later. Right now, it is important to note that we also have to have another rule for this case, in which the SHIFT
is not explicit because the updating value is concrete:
and in which we require that the first one-bit of X
(which we can compute, as X
is concrete), is in a byte that is higher than the bytes occupied by Y
.
Returning to [E5], we note that we have this term:
where a buffer of size 22 contains the value described by a buffer of size 32. This is a result of the padding-to-32-bytes introduced by bor-update-with-shift
and bor-update-without-shift
, which is needed since we do not know a priori how wide VALUE
is. To reconcile these buffer sizes, we have the following simplification:
which isolates the part of the longer buffer that fits into the shorter one, assuming the value in it indeed fits the shorter one. When this simplification fires on [E5], we obtain
which then, through the range-inside-cHead-concat
simplification, becomes
then, using range-buf-value
, simplifes to
and which then, through a number of further built-in #range
-related simplifications, simplifies to the desired
We conclude with the final piece of the slot update puzzle, which is an additional simplification that splits the shift from the value in the case that the part of the slot being processed during the update is zero-valued (for example, if Z ==Int 0
in the running example above):
Last updated