Disclaimer 1: This crash introduction to Coq has stolen a lot of snippets of code from the excellent Software Foundation series of books: https://softwarefoundations.cis.upenn.edu/ If you are interested in learning more about Coq, you should definitely check it out!

Disclaimer 2: this file is not meant to constitute a self-contained quality tutorial for a reader that would stumble upon it, it was written as support material for a presentation touring informally the Coq proof assistant.

A gentle overview to the Coq proof assistant

Coq: reasoning about programming languages

Coq can be used to formalize most mathematics. But for us in particular, an interesting domain of application is the formalization of the semantics of programming languages.

Pure expressions

Let's start by defining the abstract syntax of a little language of arithmetic expressions.

Inductive aexp : Type :=
  | ANum (n : nat)
  | APlus (a1 a2 : aexp)
  | AMinus (a1 a2 : aexp)
  | AMult (a1 a2 : aexp).

This mini-language is particularly simple: it is pure, in particular it can neither fail nor diverge. We can therefore easily represent its semantics directly as Gallina computations by implementing an interpreter.

Fixpoint aeval (a : aexp) : nat :=
  match a with
  | ANum n => n
  | APlus  a1 a2 => (aeval a1) + (aeval a2)
  | AMinus a1 a2 => (aeval a1) - (aeval a2)
  | AMult  a1 a2 => (aeval a1) * (aeval a2)
  end.

Let's get ambitious and prove a quite complex optimization: we can substitute "0 + e" by "e" in arithmetic expressions!

First, we need to program our optimization as a Gallina function.

Fixpoint optimize_0plus (a:aexp) : aexp :=
  match a with
  | ANum n => ANum n
  | APlus (ANum 0) e2 => optimize_0plus e2
  | APlus  e1 e2 => APlus  (optimize_0plus e1) (optimize_0plus e2)
  | AMinus e1 e2 => AMinus (optimize_0plus e1) (optimize_0plus e2)
  | AMult  e1 e2 => AMult  (optimize_0plus e1) (optimize_0plus e2)
  end.

Now we can state our theorem: any arithmetic expression has the same semantics before and after optimization.

Things are looking good for us: both the semantics and the optimization are defined by recursion on the structure of our expressions, we have a good chance of success by induction on this structure!

Theorem optimize_0plus_sound: forall a,
  aeval (optimize_0plus a) = aeval a.forall a : aexp, aeval (optimize_0plus a) = aeval a
Proof.forall a : aexp, aeval (optimize_0plus a) = aeval a
  intros a.a:aexp
aeval (optimize_0plus a) = aeval a induction a.n:nat
aeval (optimize_0plus (ANum n)) = aeval (ANum n)
a1, a2:aexp
IHa1:aeval (optimize_0plus a1) = aeval a1
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus (APlus a1 a2)) =
aeval (APlus a1 a2)
a1, a2:aexp
IHa1:aeval (optimize_0plus a1) = aeval a1
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus (AMinus a1 a2)) =
aeval (AMinus a1 a2)
a1, a2:aexp
IHa1:aeval (optimize_0plus a1) = aeval a1
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus (AMult a1 a2)) =
aeval (AMult a1 a2)
  - (* ANum *)n:nat
aeval (optimize_0plus (ANum n)) = aeval (ANum n) reflexivity.
  - (* APlus *)a1, a2:aexp
IHa1:aeval (optimize_0plus a1) = aeval a1
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus (APlus a1 a2)) =
aeval (APlus a1 a2) destruct a1 eqn:Ea1.a1, a2:aexp
n:nat
Ea1:a1 = ANum n
IHa1:aeval (optimize_0plus (ANum n)) =
aeval (ANum n)
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus (APlus (ANum n) a2)) =
aeval (APlus (ANum n) a2)
a1, a2, a3, a4:aexp
Ea1:a1 = APlus a3 a4
IHa1:aeval (optimize_0plus (APlus a3 a4)) =
aeval (APlus a3 a4)
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus (APlus (APlus a3 a4) a2)) =
aeval (APlus (APlus a3 a4) a2)
a1, a2, a3, a4:aexp
Ea1:a1 = AMinus a3 a4
IHa1:aeval (optimize_0plus (AMinus a3 a4)) =
aeval (AMinus a3 a4)
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus (APlus (AMinus a3 a4) a2)) =
aeval (APlus (AMinus a3 a4) a2)
a1, a2, a3, a4:aexp
Ea1:a1 = AMult a3 a4
IHa1:aeval (optimize_0plus (AMult a3 a4)) =
aeval (AMult a3 a4)
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus (APlus (AMult a3 a4) a2)) =
aeval (APlus (AMult a3 a4) a2)
    + (* a1 = ANum n *)a1, a2:aexp
n:nat
Ea1:a1 = ANum n
IHa1:aeval (optimize_0plus (ANum n)) =
aeval (ANum n)
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus (APlus (ANum n) a2)) =
aeval (APlus (ANum n) a2) destruct n eqn:En.a1, a2:aexp
n:nat
En:n = 0
Ea1:a1 = ANum 0
IHa1:aeval (optimize_0plus (ANum 0)) =
aeval (ANum 0)
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus (APlus (ANum 0) a2)) =
aeval (APlus (ANum 0) a2)
a1, a2:aexp
n, n0:nat
En:n = S n0
Ea1:a1 = ANum (S n0)
IHa1:aeval (optimize_0plus (ANum (S n0))) =
aeval (ANum (S n0))
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus (APlus (ANum (S n0)) a2)) =
aeval (APlus (ANum (S n0)) a2)
      * (* n = 0 *)a1, a2:aexp
n:nat
En:n = 0
Ea1:a1 = ANum 0
IHa1:aeval (optimize_0plus (ANum 0)) =
aeval (ANum 0)
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus (APlus (ANum 0) a2)) =
aeval (APlus (ANum 0) a2)  simpl.a1, a2:aexp
n:nat
En:n = 0
Ea1:a1 = ANum 0
IHa1:aeval (optimize_0plus (ANum 0)) =
aeval (ANum 0)
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus a2) = aeval a2 apply IHa2.
      * (* n <> 0 *)a1, a2:aexp
n, n0:nat
En:n = S n0
Ea1:a1 = ANum (S n0)
IHa1:aeval (optimize_0plus (ANum (S n0))) =
aeval (ANum (S n0))
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus (APlus (ANum (S n0)) a2)) =
aeval (APlus (ANum (S n0)) a2) simpl.a1, a2:aexp
n, n0:nat
En:n = S n0
Ea1:a1 = ANum (S n0)
IHa1:aeval (optimize_0plus (ANum (S n0))) =
aeval (ANum (S n0))
IHa2:aeval (optimize_0plus a2) = aeval a2
S (n0 + aeval (optimize_0plus a2)) = S (n0 + aeval a2) rewrite IHa2.a1, a2:aexp
n, n0:nat
En:n = S n0
Ea1:a1 = ANum (S n0)
IHa1:aeval (optimize_0plus (ANum (S n0))) =
aeval (ANum (S n0))
IHa2:aeval (optimize_0plus a2) = aeval a2
S (n0 + aeval a2) = S (n0 + aeval a2) reflexivity.
    + (* a1 = APlus a1_1 a1_2 *)a1, a2, a3, a4:aexp
Ea1:a1 = APlus a3 a4
IHa1:aeval (optimize_0plus (APlus a3 a4)) =
aeval (APlus a3 a4)
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus (APlus (APlus a3 a4) a2)) =
aeval (APlus (APlus a3 a4) a2)
      simpl.a1, a2, a3, a4:aexp
Ea1:a1 = APlus a3 a4
IHa1:aeval (optimize_0plus (APlus a3 a4)) =
aeval (APlus a3 a4)
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval
  match a3 with
  | ANum 0 => optimize_0plus a4
  | _ => APlus (optimize_0plus a3) (optimize_0plus a4)
  end + aeval (optimize_0plus a2) =
aeval a3 + aeval a4 + aeval a2 simpl in IHa1.a1, a2, a3, a4:aexp
Ea1:a1 = APlus a3 a4
IHa1:aeval
  match a3 with
  | ANum 0 => optimize_0plus a4
  | _ =>
      APlus (optimize_0plus a3)
        (optimize_0plus a4)
  end = aeval a3 + aeval a4
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval
  match a3 with
  | ANum 0 => optimize_0plus a4
  | _ => APlus (optimize_0plus a3) (optimize_0plus a4)
  end + aeval (optimize_0plus a2) =
aeval a3 + aeval a4 + aeval a2 rewrite IHa1.a1, a2, a3, a4:aexp
Ea1:a1 = APlus a3 a4
IHa1:aeval
  match a3 with
  | ANum 0 => optimize_0plus a4
  | _ =>
      APlus (optimize_0plus a3)
        (optimize_0plus a4)
  end = aeval a3 + aeval a4
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval a3 + aeval a4 + aeval (optimize_0plus a2) =
aeval a3 + aeval a4 + aeval a2
      rewrite IHa2.a1, a2, a3, a4:aexp
Ea1:a1 = APlus a3 a4
IHa1:aeval
  match a3 with
  | ANum 0 => optimize_0plus a4
  | _ =>
      APlus (optimize_0plus a3)
        (optimize_0plus a4)
  end = aeval a3 + aeval a4
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval a3 + aeval a4 + aeval a2 =
aeval a3 + aeval a4 + aeval a2 reflexivity.
    + (* a1 = AMinus a1_1 a1_2 *)a1, a2, a3, a4:aexp
Ea1:a1 = AMinus a3 a4
IHa1:aeval (optimize_0plus (AMinus a3 a4)) =
aeval (AMinus a3 a4)
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus (APlus (AMinus a3 a4) a2)) =
aeval (APlus (AMinus a3 a4) a2)
      simpl.a1, a2, a3, a4:aexp
Ea1:a1 = AMinus a3 a4
IHa1:aeval (optimize_0plus (AMinus a3 a4)) =
aeval (AMinus a3 a4)
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus a3) - aeval (optimize_0plus a4) +
aeval (optimize_0plus a2) =
aeval a3 - aeval a4 + aeval a2 simpl in IHa1.a1, a2, a3, a4:aexp
Ea1:a1 = AMinus a3 a4
IHa1:aeval (optimize_0plus a3) -
aeval (optimize_0plus a4) = 
aeval a3 - aeval a4
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus a3) - aeval (optimize_0plus a4) +
aeval (optimize_0plus a2) =
aeval a3 - aeval a4 + aeval a2 rewrite IHa1.a1, a2, a3, a4:aexp
Ea1:a1 = AMinus a3 a4
IHa1:aeval (optimize_0plus a3) -
aeval (optimize_0plus a4) = 
aeval a3 - aeval a4
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval a3 - aeval a4 + aeval (optimize_0plus a2) =
aeval a3 - aeval a4 + aeval a2
      rewrite IHa2.a1, a2, a3, a4:aexp
Ea1:a1 = AMinus a3 a4
IHa1:aeval (optimize_0plus a3) -
aeval (optimize_0plus a4) = 
aeval a3 - aeval a4
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval a3 - aeval a4 + aeval a2 =
aeval a3 - aeval a4 + aeval a2 reflexivity.
    + (* a1 = AMult a1_1 a1_2 *)a1, a2, a3, a4:aexp
Ea1:a1 = AMult a3 a4
IHa1:aeval (optimize_0plus (AMult a3 a4)) =
aeval (AMult a3 a4)
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus (APlus (AMult a3 a4) a2)) =
aeval (APlus (AMult a3 a4) a2)
      simpl.a1, a2, a3, a4:aexp
Ea1:a1 = AMult a3 a4
IHa1:aeval (optimize_0plus (AMult a3 a4)) =
aeval (AMult a3 a4)
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus a3) * aeval (optimize_0plus a4) +
aeval (optimize_0plus a2) =
aeval a3 * aeval a4 + aeval a2 simpl in IHa1.a1, a2, a3, a4:aexp
Ea1:a1 = AMult a3 a4
IHa1:aeval (optimize_0plus a3) *
aeval (optimize_0plus a4) = 
aeval a3 * aeval a4
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus a3) * aeval (optimize_0plus a4) +
aeval (optimize_0plus a2) =
aeval a3 * aeval a4 + aeval a2 rewrite IHa1.a1, a2, a3, a4:aexp
Ea1:a1 = AMult a3 a4
IHa1:aeval (optimize_0plus a3) *
aeval (optimize_0plus a4) = 
aeval a3 * aeval a4
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval a3 * aeval a4 + aeval (optimize_0plus a2) =
aeval a3 * aeval a4 + aeval a2
      rewrite IHa2.a1, a2, a3, a4:aexp
Ea1:a1 = AMult a3 a4
IHa1:aeval (optimize_0plus a3) *
aeval (optimize_0plus a4) = 
aeval a3 * aeval a4
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval a3 * aeval a4 + aeval a2 =
aeval a3 * aeval a4 + aeval a2 reflexivity.
  - (* AMinus *)a1, a2:aexp
IHa1:aeval (optimize_0plus a1) = aeval a1
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus (AMinus a1 a2)) =
aeval (AMinus a1 a2)
    simpl.a1, a2:aexp
IHa1:aeval (optimize_0plus a1) = aeval a1
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus a1) - aeval (optimize_0plus a2) =
aeval a1 - aeval a2 rewrite IHa1.a1, a2:aexp
IHa1:aeval (optimize_0plus a1) = aeval a1
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval a1 - aeval (optimize_0plus a2) =
aeval a1 - aeval a2 rewrite IHa2.a1, a2:aexp
IHa1:aeval (optimize_0plus a1) = aeval a1
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval a1 - aeval a2 = aeval a1 - aeval a2 reflexivity.
  - (* AMult *)a1, a2:aexp
IHa1:aeval (optimize_0plus a1) = aeval a1
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus (AMult a1 a2)) =
aeval (AMult a1 a2)
    simpl.a1, a2:aexp
IHa1:aeval (optimize_0plus a1) = aeval a1
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval (optimize_0plus a1) * aeval (optimize_0plus a2) =
aeval a1 * aeval a2 rewrite IHa1.a1, a2:aexp
IHa1:aeval (optimize_0plus a1) = aeval a1
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval a1 * aeval (optimize_0plus a2) =
aeval a1 * aeval a2 rewrite IHa2.a1, a2:aexp
IHa1:aeval (optimize_0plus a1) = aeval a1
IHa2:aeval (optimize_0plus a2) = aeval a2
aeval a1 * aeval a2 = aeval a1 * aeval a2 reflexivity.
Qed.

Naturally, we don't have quite enough for a paper, this result is not too impressive. However it is already a good support to pause and think about what it means to prove such a result in Coq.

We have defined: - a syntax - a semantics - an optimization - a proof of soundness of the optimization.

In a way, what Coq provides us is the guarantee that all of these components are what they claim (in particular, that the proof _is_ a proof), but also that they _actually_ talk about one another. There is no ambiguity about what language a result talks about, things are properly specified. This raises majors questions about maintainability and evolution of formal language: if add a construction, or amend a semantic rule, then my proof will likely break. But it ensures that no bad interaction between two features that would have been checked sound separately on their own right could sneak in.

So what should you keep an eye for when wondering whether a claim of the flavor of "I've formalized my work in Coq"? - If the object they have formalized is the same as the object they claim to talk about. This is very much a process of modelization, it can be as crude and non-sensical as in any other context! - If the theorem they prove states something interesting. In particular if they don't constrain the scope of the result by adding assumptions. - Marginally if they don't use axioms or admit results, but this is a simpler problem.

Impure expressions

Let's add some states!

Inductive aexp : Type :=
  | ANum (n : nat)
  | AId (x : string)              (* <--- NEW *)
  | APlus (a1 a2 : aexp)
  | AMinus (a1 a2 : aexp)
  | AMult (a1 a2 : aexp).

Of course our semantics now need a notion of state to find the meaning of an identifier. Let's not be fancy Today, we'll use good old total maps.

Definition state : Type := string -> nat.

Fixpoint aeval (a : aexp) (s : state) : nat :=
  match a with
  | ANum n => n
  | AId x => s x
  | APlus  a1 a2 => (aeval a1 s) + (aeval a2 s)
  | AMinus a1 a2 => (aeval a1 s) - (aeval a2 s)
  | AMult  a1 a2 => (aeval a1 s) * (aeval a2 s)
  end.

Alright, let's now get serious: let's consider a Turing-complete language!

Inductive com : Type :=
  | CSkip
  | CAss (x : string) (a : aexp)
  | CSeq (c1 c2 : com)
  | CIf (b : aexp) (c1 c2 : com)
  | CWhile (b : aexp) (c : com).

And let's define a few fancy notations for convenience.

Bind Scope imp_scope with com.Declaring a scope implicitly is deprecated; use in
advance an explicit "Declare Scope imp_scope.".
[undeclared-scope,deprecated]
Notation "'SKIP'" :=
   CSkip : imp_scope.
Notation "x '::=' a" :=
  (CAss x a) (at level 60) : imp_scope.
Notation "c1 ;; c2" :=
  (CSeq c1 c2) (at level 80, right associativity) : imp_scope.
Notation "'WHILE' b 'DO' c 'END'" :=
  (CWhile b c) (at level 80, right associativity) : imp_scope.
Notation "'TEST' c1 'THEN' c2 'ELSE' c3 'FI'" :=
  (CIf c1 c2 c3) (at level 80, right associativity) : imp_scope.
Open Scope imp_scope.
Open Scope string_scope.
Open Scope nat_scope.

We are of course tempted to jump in and define a new Fixpoint for the semantics of our commands.

We just need an update function on our states, and we shoudl be good

Definition update (s : state) x v : state :=
  fun y => if eqb x y then v else s y.

Notation "x '!->' v ';' s" := (update s x v)
                              (at level 100, v at next level, right associativity).

Fixpoint ceval_fun_no_while (c : com) (st : state) 
                          : state :=
  match c with
    | SKIP =>
        st
    | x ::= a1 =>
        (x !-> (aeval a1 st) ; st)
    | c1 ;; c2 =>
        let st' := ceval_fun_no_while c1 st in
        ceval_fun_no_while c2 st'
    | TEST b THEN c1 ELSE c2 FI =>
        if (aeval b st) =? 1
          then ceval_fun_no_while c1 st
          else ceval_fun_no_while c2 st
    | WHILE b DO c END =>
        st  (* bogus *)
  end.

As long as we don't worry about the loop, things of course go smoothly...

What if we just give it a shot?

Fail Fixpoint ceval_fun_hopeful_while (c : com) (st : state) 
  : state :=
  match c with
  | SKIP =>
    st
  | x ::= a1 =>
          (x !-> (aeval a1 st) ; st)
        | c1 ;; c2 =>
          let st' := ceval_fun_hopeful_while c1 st in
          ceval_fun_hopeful_while c2 st'
        | TEST b THEN c1 ELSE c2 FI =>
          if (aeval b st) =? 1
          then ceval_fun_hopeful_while c1 st
          else ceval_fun_hopeful_while c2 st
        | WHILE b DO c END =>
          if (aeval b st) =? 1
          then
            ceval_fun_hopeful_while (WHILE b DO c END) st
          else st
  end.The command has indeed failed with message:
Cannot guess decreasing argument of fix.

This definition fails with the following message: Coq "Cannot guess decreasing argument of fix", and it seems to a concern.

Indeed while we are happy to live in this world where programming and proving conflates, it can only happen at the cost of programming in a constraint environment, where the logic corresponding to the type system is sound. As it turns out, diverging programs are sources of inconsistencies. Imagine the following imaginary recursive function:

[Fixpoint bad (u : unit) : P := bad u.]

OCaml will happily typecheck it parametrically in P. However if we read it under Curry Howard, we just found a proof for any proposition!!

Recursive functions in Coq are therefore severely restricted in Coq: they must all terminate. Coq has some structural criteria to prove it automatically by structural arguments, and provide facilities to manually provide a measure and prove the termination of our functions.

However here there is no hope: our interpreter indeed diverges for some inputs! We will hence switch gears and define a propositional semantics to Imp.

We will follow closely a style we can find in many semantic papers: a set of big-step reduction rules. It's not the most pleasant to parse, btu it really is not more complex.

For instance, the rule of sequence is exactly saying:

if [c1,st ->* st'] and [c2,st' ->* st'']

then [c1;c2,st ->* st'']

Inductive ceval : com -> state -> state -> Prop :=
  | E_Skip : forall st,
      st =[ SKIP ]=> st
  | E_Ass  : forall st a1 n x,
      aeval a1 st = n ->
      st =[ x ::= a1 ]=> (x !-> n ; st)
  | E_Seq : forall c1 c2 st st' st'',
      st  =[ c1 ]=> st'  ->
      st' =[ c2 ]=> st'' ->
      st  =[ c1 ;; c2 ]=> st''
  | E_IfTrue : forall st st' b c1 c2,
      aeval b st = 1 ->
      st =[ c1 ]=> st' ->
      st =[ TEST b THEN c1 ELSE c2 FI ]=> st'
  | E_IfFalse : forall st st' b c1 c2,
      aeval b st <> 1 ->
      st =[ c2 ]=> st' ->
      st =[ TEST b THEN c1 ELSE c2 FI ]=> st'
  | E_WhileFalse : forall b st c,
      aeval b st <> 1 ->
      st =[ WHILE b DO c END ]=> st
  | E_WhileTrue : forall st st' st'' b c,
      aeval b st = 1 ->
      st  =[ c ]=> st' ->
      st' =[ WHILE b DO c END ]=> st'' ->
      st  =[ WHILE b DO c END ]=> st''

  where "st =[ c ]=> st'" := (ceval c st st').

By moving from an interpreter to a propositional caracterisation of the semantics, we have gained a lot of freedom, at the cost of static guarantees.

We have not had to worry about any question of termination. But note that we have actually simply modeled _exclusively_ the finite executions: this is captured by the inductive interpretation of our rules. Only finite trees can be built, hence only finite computations are modeled.

But we actually got new perks. Is our semantics only partially defined? Easy, no one asked our rules to be total! (Actually divergence is precisely a source of partiality) Is our semantics non-deterministic? Easy, no one asked our rules to be deterministic!

But is our semantics deterministic btw? Exercise!

Theorem ceval_deterministic: forall c st st1 st2,
     st =[ c ]=> st1  ->
     st =[ c ]=> st2 ->
     st1 = st2.forall (c : com) (st st1 st2 : state),
st =[ c ]=> st1 -> st =[ c ]=> st2 -> st1 = st2
Proof.forall (c : com) (st st1 st2 : state),
st =[ c ]=> st1 -> st =[ c ]=> st2 -> st1 = st2
Admitted.

Of course we can prove properties of specific programs.

In particular, we can prove their functional correctness. Behold a colossal example.

Note: we could define lots of fancier notations and coercions to ease the pain of writing these programs.

Definition plus2 : com :=
  "X" ::= APlus (AId "X") (ANum 2).

Theorem plus2_spec : forall (st : state) (n : nat) (st' : state),
  st "X" = n ->
  st =[ plus2 ]=> st' ->
  st' "X" = n + 2.forall (st : state) (n : nat) (st' : state),
st "X" = n -> st =[ plus2 ]=> st' -> st' "X" = n + 2
Proof.forall (st : state) (n : nat) (st' : state),
st "X" = n -> st =[ plus2 ]=> st' -> st' "X" = n + 2
  intros st n st' HX Heval.st:state
n:nat
st':state
HX:st "X" = n
Heval:st =[ plus2 ]=> st'
st' "X" = n + 2
  inversion Heval.st:state
n:nat
st':state
HX:st "X" = n
Heval:st =[ plus2 ]=> st'
st0:state
a1:aexp
n0:nat
x:string
H3:aeval (APlus (AId "X") (ANum 2)) st = n0
H:x = "X"
H1:a1 = APlus (AId "X") (ANum 2)
H0:st0 = st
H2:("X" !-> n0; st) = st'
("X" !-> n0; st) "X" = n + 2 subst.st:state
Heval:st =[ plus2
]=> ("X"
     !-> aeval (APlus (AId "X") (ANum 2)) st;
     st)
("X" !-> aeval (APlus (AId "X") (ANum 2)) st; st) "X" =
st "X" + 2 clear Heval.st:state
("X" !-> aeval (APlus (AId "X") (ANum 2)) st; st) "X" =
st "X" + 2
  simpl.st:state
("X" !-> st "X" + 2; st) "X" = st "X" + 2
  (* We need a lemma to conclude!
     We need to reason about a lookup to a variable we have just written to. 
     Let's do something forbidden and nest it in our proof.
   *)

  Set Nested Proofs Allowed.st:state
("X" !-> st "X" + 2; st) "X" = st "X" + 2
  Lemma update_eq :
    forall st x v,
      (x !-> v; st) x = v.forall (st : state) (x : string) (v : nat),
(x !-> v; st) x = v
  Proof.forall (st : state) (x : string) (v : nat),
(x !-> v; st) x = v
    intros.st:state
x:string
v:nat
(x !-> v; st) x = v
    unfold update.st:state
x:string
v:nat
(if (x =? x)%string then v else st x) = v
    rewrite eqb_refl.st:state
x:string
v:nat
v = v
    reflexivity.
  Qed.st:state
("X" !-> st "X" + 2; st) "X" = st "X" + 2

  (* We can now invoke our freshly proved lemma *)
  apply update_eq.

Qed.

Or in a different style, we can prove that the following infinite loop indeed diverges (more specifically, we will state that it admits no finite derivation. It is strictly weaker, as the rules could simply be partial).

Definition loop : com :=
  WHILE ANum 1 DO
    SKIP
  END.

Theorem loop_never_stops : forall st st',
  ~(st =[ loop ]=> st').forall st st' : state, ~ st =[ loop ]=> st'
Proof.forall st st' : state, ~ st =[ loop ]=> st'
  intros st st' contra.st, st':state
contra:st =[ loop ]=> st'
False unfold loop in contra.st, st':state
contra:st =[ WHILE ANum 1 DO SKIP END ]=> st'
False
  remember (WHILE ANum 1 DO SKIP END) as loopdef
           eqn:Heqloopdef.st, st':state
loopdef:com
Heqloopdef:loopdef = (WHILE ANum 1 DO SKIP END)
contra:st =[ loopdef ]=> st'
False
  (* Minimal setup to proceed by induction on the hypothetical derivation *)
Admitted.

From there, if you feel so enclined, you could define an alternate semantics to Imp, this time as the transitive closure of a small step semantics, and prove the equivalence of your new semantics with the previous one.

But if you are sufficiently on board to be motivated to do so, you might want to rather get a proper introduction by going through Software Foundation!

https://softwarefoundations.cis.upenn.edu/

If I have gauged right, we will not have reached this point in under an hour.

If I happen to be wildly incorrect, here is a non exhaustive list of aspects we have not discussed at all, let's discuss! - Extraction to OCaml/Haskel/ML - Type classes - Modules - Advance uses of LTac - Working with non-structural recursive functions - Coinduction - Using libraries, searching for exhisting theormes

A gentle overview to the Coq proof assistant

Coq: a functional programming language

Gallina

Inductive types

Sum types

Polymorphism

Product types

Recursive types

Coq: expressing mathematical statements

Prop

Coq: reasoning about programming languages

Pure expressions

Impure expressions