Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

sort PR #6

Open
wants to merge 21 commits into
base: master
Choose a base branch
from
Open
Changes from 9 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
278 changes: 278 additions & 0 deletions src/sort/sort_partial.gobra
Original file line number Diff line number Diff line change
@@ -0,0 +1,278 @@
// Copyright 2009 The Go Authors. All rights reserved.
// Use of this source code is governed by a BSD-style
// license that can be found in the LICENSE file.

//go:generate go run gen_sort_variants.go

// Package sort provides primitives for sorting slices and user-defined collections.
// +gobra
package sort

//import "math/bits" //# Had to comment this out, otherwise verification progress does not go above 3%
witemap marked this conversation as resolved.
Show resolved Hide resolved

ghost const ReadPerm perm = 1/2

// An implementation of Interface can be sorted by the routines in this package.
// The methods refer to elements of the underlying collection by integer index.
type Interface interface {
pred Mem(ghost s seq[any])

// Len is the number of elements in the collection.
requires acc(Mem(elems), ReadPerm)
ensures acc(Mem(elems), ReadPerm)
ensures res == len(elems)
ensures res >= 0
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

functions should also be annotated as terminating. Note that termination measures in interface methods' contracts have a special meaning - they define an upper bound on the termination measures of the implementations. If you are curious, I can explain why in the next meeting

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

you are absolutely right of course. the lack of these annotations might also be the reason why i can't use isPartiallySorted in the spec of other methods. an appropriate termination measure for isPartiallySorted would be end - start.

however, simply adding decreases to less_spec causes the error

Pure function might not terminate. Required tuple condition might not hold.

since i need less_spec to be terminating for isPartiallySorted to be terminating i also cannot add the decreases end - start clause to isPartiallySorted as of now.

i'd be glad to have a look at this in the meeting as you suggested

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

for which line is this error message reported?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

1: If i put decreases for both the interface version of less_spec and on the implementation of (x IntSlice) less_spec i get the above error pointing at the first line of the precondition of less_spec in the interface.

Additionally, I get Pure function might not terminate. Termination condition might not decrease 3 times.
2: Once pointing at the line right above the new decreases clause in (x IntSlice) less_spec.
3,4: And TWICE pointing at the line right above the new decreases clause.
5: Additionally i get

Error at: <.:0:0> Fold might fail. 
Assertion forall j int :: 0 <= j && j < len(x) ==> acc(&x[j]) &&
s == toSeq(x) &&
forall j int :: {x[j]} {s[j]} 0 <= j && j < len(s) ==> x[j] == s[j] might not hold.

which is just the last 3 lines of the x.Mem predicate.

It reports 5 errors, one of them twice but mentions Gobra has found 4 errors.

Putting decreases only on the implementation (x IntSlice) less_spec reports errors 2 and 5.
Putting decreases only on the interface version of less_spec reports errors 1, 3, 4 and 5 but says it found only 3 errors.

GobraIDE is a different story again. It verifies fine if we have decreases only on the implementation (x IntSlice) less_spec which is not very useful alone even if it were true. In any other scenario it reports error 1 and additionally Postcondition might not hold. Assertion "entire spec and declaration of less_spec" might not hold at the same line as error 1.

Len(ghost elems seq[any]) (res int)

// Less reports whether the element with index i
// must sort before the element with index j.
//
// If both Less(i, j) and Less(j, i) are false,
// then the elements at index i and j are considered equal. //# What does "considered equal mean"? We cannot use == in the spec since equality is relative to the defined order
witemap marked this conversation as resolved.
Show resolved Hide resolved
// Sort may place equal elements in any order in the final result,
// while Stable preserves the original input order of equal elements.
//
// Less must describe a transitive ordering:
// - if both Less(i, j) and Less(j, k) are true, then Less(i, k) must be true as well.
// - if both Less(i, j) and Less(j, k) are false, then Less(i, k) must be false as well.
requires Mem(elems)
requires 0 <= i && i < len(elems)
requires 0 <= j && j < len(elems)
ensures forall a, b, c int :: (0 <= a && a < len(elems) && 0 <= b && b < len(elems) && 0 <= c && c < len(elems) && less_spec(a,b,elems) && less_spec(b,c,elems)) ==> less_spec(a,c,elems)
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We talked in the meeting how this transitivity requirement should be replaced via the introduction of a method like lessTransitiveProof to improve performance.
While I understood the simple examples you presented in the meeting. I don't know how to go about this in this context. How would lessTransitiveProof have to look like on the implementation level, i.e. what would be its body if it even has a body?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There is no general answer to what the body would look like. Nonetheless, the body must always exist, otherwise this is not verified, it is just an assumption. In order to write the body of lessTransitiveProof, you will need to know the body of less_spec. Most often, if these implementations are trivial, the smt solver will be able to prove the postcondition of lessTransitiveProof. In some other cases you may need to introduce some additional lemmas or to do a proof by induction, which can be encoded as a recursive (ghost) method in Gobra. We can take a look at examples in our next meeting

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

no triggers have been specified here. I guess that { less_spec(a, b, elems), less_spec(b,c,elems) } would be a good fit

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

you are right. your proposed triggers also work fine, thank you!

i still struggle with some of the others as a result of the inconsistency between GobraIDE and the CLI version

pure less_spec(i, j int, ghost elems seq[any]) (res bool)
//
// Note that floating-point comparison (the < operator on float32 or float64 values)
// is not a transitive ordering when not-a-number (NaN) values are involved.
// See Float64Slice.Less for a correct implementation for floating-point values.
requires acc(Mem(elems), ReadPerm)
requires 0 <= i && i < len(elems)
requires 0 <= j && j < len(elems)
ensures acc(Mem(elems), ReadPerm)
ensures res == less_spec(i, j, elems)
Less(i, j int, ghost elems seq[any]) (res bool)

// Swap swaps the elements with indexes i and j.
requires Mem(elems)
requires 0 <= i && i < len(elems) //# possibly make Len() pure and use that
requires 0 <= j && j < len(elems) //# possibly make Len() pure and use that
ensures Mem(swapped_elems)
ensures len(elems) == len(swapped_elems) //# somehow this does not suffice to infer i and j are in range [0, len(swapped_elems)-1]
ensures 0 <= i && i < len(elems) && i < len(swapped_elems) //# possibly make Len() pure and use that
ensures 0 <= j && j < len(elems) && j < len(swapped_elems) //# possibly make Len() pure and use that
ensures elems[i] === swapped_elems[j]
ensures elems[j] === swapped_elems[i]
ensures forall x int :: (0 <= x && x < len(elems) && x != i && x != j) ==> (elems[x] === swapped_elems[x])
ensures haveSameElements(elems, swapped_elems)
ensures old(less_spec(i, j, elems)) ==> !less_spec(i, j, swapped_elems) //# We make the effect of the Swap operation on the Less relation explicit.
Swap(i, j int, ghost elems seq[any]) (ghost swapped_elems seq[any])
}


//# PLACEHOLDER: Here would be more code from the original sort.go file.

ghost
requires data != nil
requires data.Mem(elems)
requires 0 <= start && start < len(elems)
requires start <= end && end < len(elems)
ensures end - start == 0 ==> res == true
ensures (end - start > 0 && !data.less_spec(end, end - 1, elems)) ==> res == isPartiallySorted(data, start, end - 1, elems)
ensures (end - start > 0 && data.less_spec(end, end - 1, elems)) ==> !res
//ensures res ==> (forall idx int :: ((start < idx && idx <= end) ==> !data.less_spec(idx, idx - 1, elems)))
//# The below line verifies and should just be an unrolled version of the above line in cases were start+2 < end. However, the above line does not verify.
//ensures (res && start+2 < end) ==> !data.less_spec(end, end - 1, elems) && !data.less_spec(end-1, end - 2, elems) && !data.less_spec(end-2, end-3, elems)
pure
func isPartiallySorted(data Interface, start, end int, ghost elems seq[any]) (res bool) {
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The functionIsSorted below naturally yields the postcondition [1]:

forall idx int :: ((0 < idx && idx < len(elems)) ==> (!data.less_spec(idx, idx - 1, elems)))

if the elements are sorted.

This definition of "sortedness" makes sense but does not easily lend itself to checking for sortedness of data of any length (i.e. it cannot be directly applied when Len()<=1). I thought it would be nice to have a ghost pure helper function isPartiallySorted that can then also be used to easily check for sortedness of "subslices" of data as is required in insertionSort and even more useful in the invariants of insertionSort.

This implementation of isPartiallySorted follows the typical pattern of a recursive sortedness check. However, it does not manage to establish the same postcondition mentioned above in [1].
Conversely, I tried to use my implementation of isPartiallySorted in the loop invariant of IsSorted but it could not be preserved.

To be clear, I don't even need the postcondition [1] in line 81 here. It was just included as a sanity check to see why I could not use isPartiallySorted in IsSorted.
Nonetheless, I think it should still hold and cannot spot the issue.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This definition of "sortedness" makes sense but does not easily lend itself to checking for sortedness of data of any length (i.e. it cannot be directly applied when Len()<=1)

Why not? here, if len(elems) <= 1, then the property trivially holds - every sequence with 0 or 1 elements is sorted

return (end - start == 0) ? true : (!data.less_spec(end, end - 1, elems) ? isPartiallySorted(data, start, end - 1, elems) : false)
}

// IsSorted reports whether data is sorted.
preserves data.Mem(elems)
requires data != nil
ensures res ==> forall idx int :: ((0 < idx && idx < len(elems)) ==> (!data.less_spec(idx, idx - 1, elems)))
ensures !res ==> !(forall idx int :: ((0 < idx && idx < len(elems)) ==> (!data.less_spec(idx, idx - 1, elems))))
func IsSorted(data Interface, ghost elems seq[any]) (res bool) {
n := data.Len(elems)

res = true //# moved this up for use in the invariant

invariant res
invariant i < n
invariant len(elems) > 0 ==> 0 <= i //# if n==0 then i==-1 after initialization
invariant data.Mem(elems)
invariant len(elems) != 0 ==> (forall idx int :: ((i < idx && idx < len(elems)) ==> !data.less_spec(idx, idx - 1, elems)))
//invariant len(elems) != 0 ==> isPartiallySorted(data, i, n - 1, elems) //# Would like for this to hold here
decreases i
for i := n - 1; i > 0; i-- {
if data.Less(i, i - 1, elems) {
return false
}
//assert isPartiallySorted(data, i, n - 1, elems)
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

as commented above, this assertion does not hold

}
assert res
return
}

// Convenience types for common cases

// IntSlice attaches the methods of Interface to []int, sorting in increasing order.
type IntSlice []int

//# Adapted from Gobra tutorial
ghost
requires forall j int :: 0 <= j && j < len(s) ==> acc(&s[j], _)
ensures len(res) == len(s)
ensures forall j int :: {s[j]} {res[j]} 0 <= j && j < len(s) ==> s[j] == res[j]
pure func toSeq(s []int) (res seq[any]) {
return (len(s) == 0 ? seq[any]{} :
toSeq(s[:len(s) - 1]) ++ seq[any]{s[len(s) - 1]})
}

pred (x IntSlice) Mem(ghost s seq[any]){
len(x) == len(s) && //# If this is not the first line in the predicate, then we cannot verify e.g. the last postcondition in Len()
forall j int :: 0 <= j && j < len(x) ==> acc(&x[j]) &&
s == toSeq(x) &&
forall j int :: {x[j]} {s[j]} 0 <= j && j < len(s) ==> x[j] == s[j]
}

requires acc(x.Mem(elems), ReadPerm)
ensures acc(x.Mem(elems), ReadPerm)
ensures unfolding acc(x.Mem(elems), ReadPerm) in res == len(elems)
ensures res >= 0
func (x IntSlice) Len(ghost elems seq[any]) (res int) {
return len(x)
}

requires x.Mem(elems)
requires 0 <= i && i < len(elems)
requires 0 <= j && j < len(elems)
ensures forall a, b, c int :: (0 <= a && a < len(elems) && 0 <= b && b < len(elems) && 0 <= c && c < len(elems) && x.less_spec(a,b,elems) && x.less_spec(b,c,elems)) ==> x.less_spec(a,c,elems)
pure func (x IntSlice) less_spec(i, j int, ghost elems seq[any]) (res bool){
return unfolding x.Mem(elems) in x[i] < x[j]
}


requires acc(x.Mem(elems), ReadPerm)
requires 0 <= i && i < len(elems)
requires 0 <= j && j < len(elems)
ensures acc(x.Mem(elems), ReadPerm)
ensures res == x.less_spec(i, j, elems)
func (x IntSlice) Less(i, j int, ghost elems seq[any]) (res bool) {
return unfolding acc(x.Mem(elems), ReadPerm) in x[i] < x[j]
}

requires x.Mem(elems)
requires 0 <= i && i < len(elems)
requires 0 <= j && j < len(elems)
ensures x.Mem(swapped_elems)
ensures len(elems) == len(swapped_elems) //# Somehow this does not suffice to infer i and j are in range [0, len(swapped_elems)-1] and it needs to be stated explicitly below.
ensures 0 <= i && i < len(elems) && i < len(swapped_elems)
ensures 0 <= j && j < len(elems) && j < len(swapped_elems)
//# This line is required..
ensures forall idx int :: (0 <= idx && idx < len(elems) && idx != i && idx != j) ==> (old(unfolding x.Mem(elems) in x[idx]) == (unfolding x.Mem(swapped_elems) in x[idx]))
//# ..for these 3 corresponding lines to work.
ensures elems[i] === swapped_elems[j]
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This postcondition (and presumably the next one too) cannot get verified by the GobraCLI.
I run the most current versions of the GobraIDE and GobraCLI.
This postcondition only verifies for the GobraIDE stable build version but fails for the GobraIDE nightly build version and the GobraCLI.
Postcondition might not hold: Assertion elems[i] === swapped_elems[j] might not hold.

I can verify that old(unfolding x.Mem(elems) in x[i]) === (unfolding x.Mem(swapped_elems) in x[j]). Further, it should know from the predicate x.Mem(elems) from the precondition that in the old context we had a 1:1 mapping between the elements of x and elems. The same is given in the new context by x.Mem(swapped_elems) for a mapping between x and swapped_elems.

ensures elems[j] === swapped_elems[i]
ensures forall idx int :: (0 <= idx && idx < len(elems) && idx != i && idx != j) ==> (elems[idx] === swapped_elems[idx])
ensures haveSameElements(elems, swapped_elems)
ensures old(x.less_spec(i, j, elems)) ==> !x.less_spec(i, j, swapped_elems)
func (x IntSlice) Swap(i, j int, ghost elems seq[any]) (ghost swapped_elems seq[any]) {
unfold x.Mem(elems)
x[i], x[j] = x[j], x[i]
swapped_elems = toSeq(x)
fold x.Mem(swapped_elems)
assume haveSameElements(elems, swapped_elems) //# We believe this to be true. I was not able to establish this as a fact and I was not able to map directly from IntSlice to mset[any].
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I was not able to make haveSameElements work on the implementation level.
It cannot automatically be established after the elements in x have been swapped. I tried to implement a mapping of x to a multiset in an effort to try and prove this on the implementation level, however, this approach failed due to the error described in https://github.com/jcp19/verified_go_stdlib/pull/6/files#r1067661530.

Nonetheless, this assumption should still be safe. Swap is the only method described in the interface that actively changes the order of the elements and it is safe to assume here that really haveSameElements(elems, swapped_elems) holds.

}

//# implementation proof
(IntSlice) implements Interface{ }



// ######### BEGIN FROM zsortinterface.go ##############
// insertionSort sorts data[a:b] using insertion sort.
requires data.Mem(elems)
requires data != nil
requires 0 <= a && a < len(elems)
requires a <= b && b <= len(elems) //# the element at position b does not get sorted
ensures data.Mem(sorted_elems)
ensures haveSameElements(elems, sorted_elems)
ensures a == b ==> isPartiallySorted(data, a, a, sorted_elems)
//ensures isPartiallySorted(data, a, b - 1, sorted_elems)
func insertionSort(data Interface, a, b int, ghost elems seq[any]) (ghost sorted_elems seq[any]) {
sorted_elems = elems
assert data.Mem(sorted_elems)

invariant haveSameElements(elems, sorted_elems)
invariant a < i
invariant a < b ==> i <= b
invariant data.Mem(sorted_elems)
//# invariant prefix [a:i] is always sorted
//invariant forall idx int :: (a < idx && idx < i && i < b) ==> !data.less_spec(idx, idx - 1, sorted_elems)
//invariant isPartiallySorted(data, a, i - 1, sorted_elems)
decreases b-i
for i := a + 1; i < b; i++ { //# for any kind of sorting to happen b has to be at least a+2, does this mean if called with b=a+1 that this will not be sorted? Answer: YES, should we enforce that in the precondition?
invariant haveSameElements(elems, sorted_elems)
invariant i < b
invariant a <= j && j <= i
invariant a < j ==> 0 <= j - 1
invariant data.Mem(sorted_elems)
//# invariant prefix [a:j-2] is always sorted and [j:i] is always sorted
//invariant forall idx int :: (j < idx && idx < i) ==> !data.less_spec(idx, idx - 1, sorted_elems)
//invariant isPartiallySorted(data, a, XXX, sorted_elems)
//invariant isPartiallySorted(data, YYY, i, sorted_elems)
decreases j-a
for j := i; j > a /*&& data.Less(j, j - 1, elems)*/; j-- { //# The second conjunct in the loop condition was commented out. See comment below.
assert 0 <= j - 1
assert i < b
assert i < len(sorted_elems)
if data.Less(j, j - 1, sorted_elems) { //# The second conjunct in the loop condition was commented out and introduced here as an IF block. This is due to a bug in Gobra where short circuiting is not detected. See https://github.com/viperproject/gobra/issues/511
sorted_elems = data.Swap(j, j - 1, sorted_elems)
assert !data.less_spec(j, j - 1, sorted_elems)
} else {
break
}
}
}
}
// ######### END FROM zsortinterface.go ##############


//# helper:

ghost
ensures forall e any :: e in elemsSeq ==> e in elemsMultiSet
ensures len(elemsSeq) == len(elemsMultiSet)
pure func toMultiSet(ghost elemsSeq seq[any]) (ghost elemsMultiSet mset[any])



ghost
ensures toMultiSet(seq1) === toMultiSet(seq2) ==> res
ensures toMultiSet(seq1) !== toMultiSet(seq2) ==> !res
ensures res ==> len(seq1) == len(seq2)
pure func haveSameElements(ghost seq1 seq[any], ghost seq2 seq[any]) (res bool){
return toMultiSet(seq1) === toMultiSet(seq2)
}



/*
ensures forall e int :: e in s1 ==> e in s2 //# this causes gobra to crash, it works with mset[int] as the return type
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This postcondition causes Gobra to crash with the message:

An exception occurred during execution of Gobra: java.util.concurrent.ExecutionException: java.lang.AssertionError: assertion failed: Expected second operand e_V1@119@00 to be of sort Tuple2[Ref, Types], but found Int.

This does not happen if s2 is also of type mset[int].
It is very possible that this is not even a legal operation I'm trying to do here but I still think Gobra should not crash outright.
Moreover, looking at the postcondition in toSeq it is possible to compare elements in the collections of type []int and seq[any]. It is just the contains operator in that leads to the crash.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yes, ideally Gobra should insert a type conversion automatically here or should return a type error. The reason here is that ints and values of interface types are encoded differently (with different types at the viper level), but we are treating them at the viper level as if they were encoded as the same type.

As for now, as soon as PR viperproject/gobra#600 is merged, we can side-step this issue by replacing e in s2 with (interface{})(e) in s2. Can you please open an issue on the issue tracker about this?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

viperproject/gobra#600 was merged. The approach with a cast should work now

pure func test(ghost s1 mset[int]) (ghost s2 mset[any])
*/


/*
ghost
requires forall j int :: 0 <= j && j < len(s) ==> acc(&s[j], _)
ensures len(res) == len(s)
ensures forall j int :: 0 <= j && j < len(s) ==> s[j] in res
pure func toMultiSet(s []int) (res mset[any]) {
return (len(s) == 0 ? mset[any]{} :
toSeqX(s[:len(s) - 1]) union mset[any]{s[len(s) - 1]})
}
*/