Proofs of programs

Previously: induction for recursive functions (math or Scheme notation).
Now: some induction/recursion, but also contrasting that with invariants/iteration.

Context for examples: Numbers vs. Numerals

Number is an abstract idea; Numeral is an encoding of a number as a string. (Roman numerals, base-10 numerals, …). People often confuse a number with a representation of that number. When necessary add brackets and base around a numeral. I.e., what based is intended for "471"?

[471]₁₀ = 4⋅10² + 7⋅10¹ + 1⋅10⁰ = 471

[471]₈ = 4⋅8² + 7⋅8¹ + 1⋅8⁰ = 313

(Another) Structural induction example

We'll assume the input list is reversed, just to simplify the code a bit.

; digs->int: (listof int) int → int
; Purpose: Returns the number represented in base b by the digit-list d,
; when reversed.
;
;   (digs->int (list 1 7 4) 10) = 471
;   (digs->int (list 1 7 4) 8)  = 313
;   
(define (digs->int d b)
  (cond [(empty? digs) 0]
        [(cons?  digs) (+ (first digs)
                          (* b (digs->int (rest d) b)))]))

Is this correct? What does "correct" mean? Can we make the "Purpose" statement more mathematically precise?

Define: d2i(d,b) = ∑_i=0…n-1 (d_i ⋅ bⁱ), where n=(length d) and d=(list d₀ … d_n).
Want (digs->int d b) = d2i(d,b), for all d∈{0,…,9}^*, b∈ℤ⁺.

How to prove?

By induction on d.
Show proof.

But what about iterative/imperative programs?

/*
 * digs->int: (listof int) int → int
 * Purpose: Returns the number represented in base b by the digit-list d,
 * when reversed.
 */
int digs->int( (listof int) d, int b ) {
  int sumSoFar ← 0;
  int i ← 0;
  while (i < (length d)) {
    sumSoFar ← sumSoFar + (list-ref d i) * pow(b,i);
    i ← i+1;
  }
  return sumSoFar;
  }

/* 
 *   List A ← new Cons( 1, new Cons( 7, (new Cons( 4, theEmptyList))));
 *
 *   unless (digs->int(A, 10) == 471) { println("Test case 1 failed."); };
 *   unless (digs->int(A, 8) == 313)  { println("Test case 2 failed."); };
 */

Again, we'reassuming the input list is reversed, for consistency with the first example.

Hoare Logic (informally)

Pre- and post-condition of each statement. Examples:

Sequence
Assignment
Conditional
Loop

Hoare Logic in practice

Detailed use of Hoare Logic is too tedious for human use.
Semi-automated use possible, but with limitations:
- Difficult for people or programs to come up with invariants.
- Difficult for program to simplify state knowledge to only what's necessary for pre-/post-conditions. So, pre-/post-conditions are large, and system doesn't scale well.
Hoare logic for sequences, assignment, and conditionals correspond to common sense and typical pre-COMP280 reasoning, anyway.
Most useful ideas:
- Pre-/post-conditions of functions/methods -- Part of your specifications. Somewhat stressed in programming courses.
- Loop invariants -- We'll concentrate on these.

Invariants — Hoare Logic for loops

Introduce by example, using the loop in digs->int:

The relevant definitions.

Goal G: sumSoFar = ∑_{k=0…(length d)-1} d_k⋅b^k
- Same as for recursive version.
- Should be clear from problem specification.
Loop condition C: i < (length d).
- Read it straight from code.
Loop invariant I: sumSoFar = ∑_k=0…i-1 d_k⋅b^k
- What is true (and useful) at the beginning of each iteration.
- Can be difficult to determine. Usually the hardest part of this process.
- Ideally, determining this is part of the code writing process.

To prove that G holds at the end, need to show four things:

Precondition: I holds initially
Loop invariant maintained: I∧C → I′
- Primed names are a notation to distinguish between the state before the loop (unprimed) and after the loop (primed). E.g., I′ = "the statement like I, but with all variables primed". Loop invariant I never mentions primed variables — only in the proof framework do we consider both I and I′.
Correctness: I∧¬C → G
Termination: ¬C will eventually hold

Prove it…

Lemma: d and b don't change. Thus,

d = d′
(length d) = (length d′);
d.i = d′.i for all i
b = b′

Show:

Precondition: Check.
Loop invariant maintained:
Suppose I ∧ C: That is, we know sumSoFar = ∑_k=0^i-1 d_k⋅b^k i ≤ (length d); Then
sumSoFar′
= sumSoFar + d_i⋅bⁱ
= ∑_k=0^i-1 d_k⋅b^k + d_i⋅bⁱ
= ∑_k=0ⁱ d_k⋅b^k;
= ∑_k=0^i′-1 d_k⋅b^k;
which is exactly I′, like we want. [Hmm, we didn't actually ever use C at all, oh well.]
Correctness: I∧¬C → G If ¬C, then ¬(i < (length d)), so i = (length d), and so I = G, voila.

Hmmm, wait a sec — how did we have i = (length d)? ¬C Only gives us i≥(length d) instead of equality, and we don't have any actual previous fact to cite to conclude i = (length d)!

We want to exclaim "but of course i=(length d); it starts at 0 and we stop as soon as we hit (length d)!" Well, if it's so clearly true, maybe we should deign to mention the fact.

Let's make our loop invariant stronger. Rename our original invariant to be I₁. Our new invariant is I = I₁∧I₂, where I₂="i≤(length d)".

A new invariant needs a new proof, so let's start over.

Precondition. We need to know that length() always returns a non-negative value, which presumably you'd verify separately.
Loop invariant maintained. Basically the same as before, but now we need to use the fact that C holds.
Correctness: I₂∧¬C gives us i=(length d); that and I give us G.
Termination: Initially i=0, and it increases by a constant at each iteration, thus it will eventually become greater than (length d). [This is a standard math theoremm; provable by induction.]

When writing proofs, it's not unusual to discover snags, and need to go back and patch them up; your first write-up is rarely your last. You're encouraged to write up your proofs in a word processor, to avoid having to re-copy your homework.

The Structure of Loop Proofs

Reflecting a moment, on the style of proof: Programs that respect the intrinsic structural definition of their data (the "natural recursion" of Comp210) are amenable to proofs by induction. Programs which need to change state are harder to reason about (partly because our mathematics doesn't naturally reaon about time and state (change-over-time). Especially when the state is spread out over different functions (and if a function doesn't document what state it's modifying, things are bleak indeed). Such code is not only harder to reason about, it tends to be far more bug-prone.

A good programmer is aware of when a functional (state-free) approach is good, and when keeping state is appropriate; this is an important distinction you probably didn't realize you were learning in Comp210/How to Design Programs.

Just as the purpose of loop constructs is to provide a high-level way of separating "do a task once" from "repeat the task as needed", our proof framework is separating "show that one iteration is correct" from "show that it's repeated as needed". Our proof structure is reflecting the structure of the underlying program.

More examples: The other direction

List int->digs( int n, int b ) {
  List digsSoFar = theEmptyList();

  while (n > 0) {
    digsSoFar ← snoc( remainder(n,b), digsSoFar );
    n ← quotient(n,b)
    }
  return digsSoFar;
  }

What is the statement of correctness?

Goal G: d2i((int->digs n b),b) = n

What is the loop invariant?

Trying to phrase this is difficult, since n changes. We'll tweak our program so that these things can be stated better.
```
List int->digs( int n, int b ) {
  List digsSoFar = theEmptyList();
  int m = n;

  while (m > 0) {
    digsSoFar ← snoc( remainder(m,b), digsSoFar );
    m ← quotient(m,b)
    }
  return digsSoFar;
  }
```
Proving correctness for something other than the original code is generally undesirable. However, this version is preferable since it's generally considered bad style to assign to an argument variable, plus maybe were proving correctness while writing the code.
Loop invariant: At each iteration, digsSoFar is more and more of the answer, but exactly how much "more and more"? Generalizing goal: d2i(digsSoFar,b) = …

Hmmmm.

Useful exercise: trace all variables for a sample input

It seems like we'd want to talk about the iteration#. But, we don't have an iteration counter. What can we use instead? Length of digsSoFar!

Two solutions:
- I1: After one time through the loop we have the least-significant digit; after two times through we have two of them, etc. d2i(digsSoFar,b) = remainder(n,b^|digsSoFar|).
- I2: At each point, the input n has been partitioned into two parts, digsSoFar and m. d2i(digsSoFar,b) = n - m×b^|digsSoFar|
Similar, but I2 is a somewhat easier to use.

Prove correctness:

First, we need the following lemmas:

d2i( snoc(x,D), b) = xb^|D|+d2i(D,b).
for any x≥0, d>1, x = quotient(x,d) ⋅ d + remainder(x,d)

Proof that invariant is maintained (other parts are fairly straightforward):

I2∧C → I2′

We'll abbreviate variable names.

Assume d2i(D,b) = n - mb^|D| and m>0.

Prove d2i(D′,b) = n - m′b^|D′|.

d2i(D′,b)
= d2i(snoc(rem(m,b),D), b) [From code]
= rem(m,b)b^|D| + d2i(D,b) [By lemma]
= n - mb^|D| + rem(m,b)b^|D| [By assumption of invariant]
= n - (m - rem(m,b))b^|D| [Algebra]
= n - (quo(m,b)⋅b)b^|D| [By lemma, since m>0]
= n - quo(m,b)b^|D+1| [Algebra]
= n - m′b^|D′| [From code]

On your own: Try proving correctness with I1.