negation, contradiction, absurdity
Left |
Adjust your browser window |
Right |
The idea behind this approach is as follows: let us define ~B (i.e. "B is false") as "B implies absurdity". So, let us add to our first order language a predicate constant f (meaning "false", or "absurdity"), and let us replace all negation expressions ~F by F->f. Then, the three negation axioms will take the following forms:
L9: (B->C)->((B->~C)->~B),
L9': (B->C)->((B->(C->f))->(B->f)),
L10: ~B->(B->C),
L10': (B->f)->(B->C),
L11: Bv~B,
L11': Bv(B->f).
After this, surprisingly, the axiom L9' becomes derivable from L1-L2! Indeed,
(1) | B->C | Hypothesis. |
(2) | B->(C->f) | Hypothesis. |
(3) | B | Hypothesis. |
(4) | C->f | By MP, from (2) and (3) |
(5) | C | By MP, from (1) and (3) |
(6) | f | By MP, from (4) and (5) |
Hence, by Deduction Theorem 1, [L1, L2, MP] |- (B->C)->((B->(C->f))->(B->f)).
Second observation: L10': (B->f)->(B->C) can be replaced simply by f->C. Indeed, if we assume f->C, then L10' becomes derivable:
(1) | B->f | Hypothesis. |
(2) | B | Hypothesis. |
(3) | f | By MP, from (1) and (2) |
(4) | |- f->C | f->C |
(5) | C | By MP, from (3) and (4) |
Hence, by Deduction Theorem 1, [L1, L2, f->C, MP] |- (B->f)->(B->C).
Third observation. As we know from Theorem 2.4.9: [L1, L2, L9, MP] |- ~B->(B->~C), in the minimal logic we can prove 50% of L10: "Contradiction implies that all is wrong". After our replacing negations by B->f the formula (B->f)->(B->(C->f) becomes derivable from L1-L2. Indeed,
(1) | B->f | Hypothesis. |
(2) | B | Hypothesis. |
(3) | f | By MP, from (1) and (2) |
(4) | |- f->(C->f) | Axiom L1 |
(5) | C->f | By MP, from (3) and (4) |
Hence, by Deduction Theorem 1, [L1, L2, MP] |- (B->f)->(B->(C->f)).
Thus, we see that L1 (and not L9!) is responsible for provability of the 50% "crazy" formula ~B->(B->~C). Is L1 50% as "crazy" as L10? Yes! Let us compare:
L10: ~B->(B->C) states that "Contradiction implies anything".
L1: B->(C->B) states that "If B is true, then B follows from anything".
Let us recall our "argument" for L10 from Section 1.3: "...we do not need to know, were C "true" or not, if ~B and B were "true" simultaneously. By assuming that "if ~B and B were true simultaneously, then anything were true" we greatly simplify our logical apparatus."
Now, similarly: if B is (unconditionally) true, then we do not need to know, follows B from C or not. By assuming that "if B is true, then B follows from anything" we greatly simplify our logical apparatus.
In a sense, the axiom L9 "defines" the negation of the minimal logic, the axioms L9 and L10 "define" the negation of the constructive logic, and L9-L11 "define" the negation of the classical logic. Is our definition of ~B as B->f equivalent to these "definitions"? Yes!
Theorem 7.1.1. For any formula F, let us denote by F' the formula obtained from F by replacing all sub-formulas ~G by G->f. Then, for any formulas B1, ..., Bn, C:
[L1-L9, MP]: B1, ..., Bn |- C, iff [L1-L8, MP]: B'1, ..., B'n |- C'.
Proof.
1) ->.
Let us consider a proof of [L1-L9, MP]: B1, ..., Bn |- C. In this proof:
- let us replace each formula G by its "translation" G',
- before each instance of L9, let us insert a proof of the corresponding instance of L'9 in [L1, L2, MP] (see above).
In this way we obtain a proof of [L1-L8, MP]: B'1, ..., B'n |- C'. Indeed,
a) If some formula B is an instance of L1-L8, then B' is an instance of the same axiom (verify!).
b) (B->D)' is B'->D', hence, if the initial proof contains a conclusion by MP from B and B->D to D, then, in the derived proof, it is converted into a conclusion by MP from B' and B'->D' to D'.
c) If the initial proof contains an instance of L9, then the derived proof contains the corresponding instance of L'9 preceded by its proof in [L1, L2, MP].
Q.E.D.
2) <-.
Let us recall the above translation operation: for any formula F, we denoted by F' the formula obtained from F by replacing all sub-formulas ~G by G->f. Now, let us introduce a kind of a converse operation - the re-translation operation: for any formula F, let us denote by F" the formula obtained from F: a) by replacing all sub-formulas G->f by ~G, and after this, b) by replacing all the remaining f's (f means "false"!) by ~(a->a), where a is some closed formula of the language considered.
Of course, for any formula F, (F')" is F (verify).
Note. Replacing f by a formula preceded by negation, is crucial - it will allow using Theorem 2.4.9: [L1-L9, MP]: ~B->(B->~C) instead of the Axiom L10: ~B->(B->C).
Now, let us consider a proof of [L1-L8, MP]: B'1, ..., B'n |- C'. In this proof, let us replace each formula G by its re-translation G". Then C' becomes C, and B'1, ..., B'n become B1, ..., Bn, but what about the remaining formulas contained in the proof?
a) Instances of the axioms L1-L8.
L1: B->(C->B)
If B is not f, then (B->(C->B))" is B"->(C"->B"), i.e. re-translation yields again an instance of L1.
If B is f, then (f->(C->f))" is ~(a->a)->~C". This formula is provable in [L1-L9, MP]. Indeed,
(1) | ~(a->a) | Hypothesis. |
(2) | |- ~(a->a)->((a->a)->~C") | Theorem 2.4.9, [L1-L9, MP]. |
(3) | |- a->a | Theorem 1.4.1 [L1-L2, MP]. |
(4) | ~C" | By MP, from (1), (2) and (3). |
Thus, re-translation of any instance of L1 is provable in [L1-L9, MP].
L2: (B->(C->D))->((B->C)->(B->D))
If C and D are not f, then re-translation yields again an instance of L2.
If C is f, and D is not, then re-translation yields (B"->(~(a->a)->D"))->(~B"->(B"->D")). This formula is provable in [L1-L9, MP]. Indeed,
(1) | B"->(~(a->a)->D") | Hypothesis. |
(2) | ~B" | Hypothesis. |
(3) | B" | Hypothesis. |
(4) | ~(a->a)->D" | By MP, from (1) and (3). |
(5) | |- ~B"->(B"->~(a->a)) | Theorem 2.4.9 [L1-L9, MP]. |
(6) | ~(a->a) | By MP, from (2), (3) and (5). |
(7) | D" | By MP, from (4) and (6). |
Hence, by Deduction Theorem 1, [L1-L9, MP] |- (B"->(~(a->a)->D"))->(~B"->(B"->D")).
If D is f, and C is not, then re-translation yields (B"->~C")->((B"->C")->~B"). This formula is provable in [L1-L9, MP]. Indeed,
(1) | B"->~C" | Hypothesis. |
(2) | B"->C" | Hypothesis. |
(3) | ~B" | By MP, from Axiom L9. |
Hence, by Deduction Theorem 1, [L1-L9, MP] |-(B"->~C")->((B"->C")->~B").
If C and D both are f, then re-translation yields (B"->~~(a->a))->(~B"->~B"). This formula is provable in [L1-L9, MP]. Indeed,
(1) | |- ~B"->~B" | Theorem 1.4.1 [L1-L2, MP]. |
(2) | |- (~B"->~B")->(X->(~B"->~B")) | Axiom L1, X is B"->~~(a->a). |
(3) | |- X->(~B"->~B") | By MP, X is B"->~~(a->a). |
Thus, re-translation of any instance of L2 is provable in [L1-L9, MP].
L3: B&C->B
If B is not f, then re-translation yields again an instance of L3.
If B is f, then re-translation yields via ~(f&C) the formula ~(~(a->a)&C). This formula is provable in [L1-L9, MP]. Indeed,
(1) | |- ~(a->a)&C -> ~(a->a) | Axiom L3. |
(2) | |- ~~(a->a) -> ~(~(a->a)&C) | From (1), by the Contraposition Law. |
(3) | |- (a->a)->~~(a->a) | Theorem 2.4.4: [L1, L2, L9, MP] |- A->~~A |
(4) | |- a->a | Theorem 1.4.1 [L1-L2, MP]. |
(5) | |- ~(~(a->a)&C) | By MP, from (3), (4) and (2). |
Thus, re-translation of any instance of L3 is provable in [L1-L9, MP].
L4: B&C->C
Similarly to L3 - re-translation of any instance of L4 is provable in [L1-L9, MP].
L5: B->(C->B&C)
Re-translation yields again an instance of L5.
L6: B->BvC
Re-translation yields again an instance of L6.
L7: C->BvC
Re-translation yields again an instance of L7.
L8: (B->D)->((C->D)->(BvC->D)
If D is not f, then re-translation yields again an instance of L8.
If D is f, then re-translation yields ~B->(~C->~(BvC)). By Theorem 2.4.10(b), this formula is provable in [L1-L9, MP] .
Thus, re-translation of any instance of L8 is provable in [L1-L9, MP].
Hence, re-translations of all (i.e. L1-L8) axiom instances are provable in [L1-L9, MP]. What about applications of MP in the initial proof? If the initial proof contains a conclusion by MP from B and B->D to D, then the following situations are possible:
a) If B and D are not f, then, in the derived proof, this conclusion is converted into a conclusion by MP from B" and B"->D" to D".
b) If B is f, and D is not, then, in the derived proof, this conclusion is converted into a conclusion by MP from ~(a->a) and ~(a->a)->D" to D".
c) If D is f, and B is not, then, in the derived proof, this conclusion is converted into three formulas: B", ~B", ~(a->a). To derive ~(a->a) from B" and ~B", we can use MP and Theorem 2.4.9: [L1-L9, MP] |- ~B"->(B"->~(a->a)).
d) If B and D are both f, then, in the derived proof, this conclusion is converted into three formulas: ~(a->a), ~~(a->a), ~(a->a). Simply drop the third formula from the proof.
Thus, the re-translation operation, when applied to all formulas of a proof of [L1-L8, MP]: B'1, ..., B'n |- C', yields a sequence of formulas that are provable in [L1-L9, MP] from hypotheses B1, ..., Bn. Hence, so is C.
Q.E.D.
This completes the proof of Theorem 7.1.1.
Corollary 7.1.2. a) A formula C is provable in the minimal propositional logic [L1-L9, MP], iff [L1-L8, MP] |- C'.
b) A formula C is provable in the constructive propositional logic [L1-L10, MP], iff [L1-L8, f->B, MP] |- C'.
c) A formula C is provable in the classical propositional logic [L1-L11, MP], iff [L1-L8, f->B, L'11, MP] |- C'.
Proof. a) Consider an empty set of hypotheses in Theorem 7.1.1.
b) If [L1-L10, MP] |- C, then [L1-L9, MP]: B1, ..., Bn |- C, where hypotheses are instances of the axiom L10. By Theorem 7.1.1, [L1-L8, MP]: B'1, ..., B'n |- C'. As established above, B'1, ..., B'n can be proved by using the axiom schema f->B, i.e. [L1-L8, f->B, MP] |- C'. Q.E.D.
Now, if [L1-L8, f->B, MP] |- C', then,
c) If [L1-L11, MP] |- C, then [L1-L9, MP]: B1, ..., Bn |- C, where hypotheses are instances of the axioms L10 and L11. Return to case (b). Q.E.D.
Corollary 7.1.3. a) A formula C is provable in the minimal predicate logic [L1-L9, L12-L15, MP, Gen], iff [L1-L8, L12-L15, MP, Gen] |- C'.
b) A formula C is provable in the constructive predicate logic [L1-L10, L12-L15, MP, Gen], iff [L1-L8, f->B, L12-L15, MP, Gen] |- C'.
c) A formula C is provable in the classical predicate logic [L1-L11, L12-L15, MP, Gen], iff [L1-L8, f->B, L11', L12-L15, MP, Gen] |- C'.
Exercise 7.1.1. Prove the Corollary 7.1.3.
Warning! Draft text follows.
S.Simpson
http://www.math.psu.edu/simpson/courses/math457/misc/trakh.pdf (with proofs)
B.A. Trakhtenbrot. The impossibility of an algorithm for the decision problem for finite models. Dokl. Akad. Nauk SSSR, 70:596--572, 1950. English translation in: AMS Transl. Ser. 2, vol.23 (1063), 1--6.
http://www.mcs.le.ac.uk/~istewart/moreIAS/BriefDCT.html by Iain Stewart:
Trakhtenbrot's theorem (1950): "The set of first-order sentences, over some signature including a relation symbol that is not unary, which are valid over finite structures is not r.e. but is co-r.e.". These early results appeared sporadically and tended to be "finite considerations" of analogous results in model theory. This is true of Trakhtenbrot's result where the analogous result in model theory is due to Goedel (1930): "The set of valid first-order sentences is r.e. but not co-r.e.".
Sergei Vorobyov. The "Hardest" Natural Decidable Theory. LICS: IEEE Symposium on Logic in Computer Science, 1997 (PDF):
In 1936 L. Kalmar proved that the first order theory of a binary relation is undecidable, which greatly simplified undecidability proofs, as compared to those based on straightforward encodings of Turing machines, see, e.g. M. Rabin [13] B. Trakhtenbrot [19] and later R. Vaught [20] proved even stronger Theorem 10 . Let L be the first order language with the unique binary relation symbol. The set of valid sentences of L and the set of sentences of L refutable by some finite model are recursively inseparable.
See at http://www.cs.nyu.edu/pipermail/fom/2000-July/004215.html
Allen Hazen a.hazen@philosophy.unimelb.edu.au
Thu, 06 Jul 2000 15:58:16 +0800
Recent posts from Simpson and Urquhart have mentioned the theorem that "there is no weakest axiom of infinity." I have been puzzled by this for a long time. This probably says more about me than about the intrinsic difficulty of the issue. I think I've de-puzzled myself; since I don't know of a self-contained textbook account, here is mine. (Thanks to Urquhart for suggestions.)
--
The Theorem about Axioms of Infinity
A: Trakhtenbrot on finite satisfiability
B: there is no weakest axiom of infinity
C: a surprise about the Axiom of Choice
D: Compactness and a request
--
-----A-----
The key here is the fact that "There is no weakest Axiom of
Infinity" is a corollary of Trakhtenbrot's theorem that the
set of first-order formulas valid in FINITE models (i.e. whose
negations are not FINITELY satisfiable) is undecidable. This
follows from Goedel 1931. There is no complete axiomatization of
the Pi-1-1 sentences of arithmetic (i.e., the set of Pi-1-1
truths is not r.e.). A proper initial segment of the natural
numbers, however, can be thought of as a finite model. So, (*),
if finite satisfiability was decidable, we'd have have a way of
proving arbitrary Pi-1-1 truths of arithmetic: just carry out the
decision procedure and note that no finite model knocks out the
candidate sentence! (The fiddly bit, (*), has to do with a bit of
change to the sentence: finite segments of omega are not closed
under addition and multiplication, so what we would have to test
in finite models was a variant: put a bound on the initial
universal quantifiers of the Pi-1-1 sentence (using a new
constant), and check that the bounded sentence holds in models
that go enough higher than the bound to include denotations for
all the terms appearing.)
-----B-----
An axiom of infinity is a sentence of n-th order logic, for some
n, which is true in all and only models with an infinite domain
of individuals. We can limit our attention to sentences
containing no non-logical vocabulary, since a sentence with
predicate constants can have them treated as variables bound by
initial existential quantifiers. One axiom of infinity is said to
be WEAKER than another if it is derivable from it in an
axiomatizable fragment of higher-order logic but not vice versa.
(Reference: section 57 of Church 1956, "Introduction to
Mathematical Logic.") So. Suppose there were a weakest Axiom
of Infinity, Q. It would have to be derivable from EVERY other
axiom of infinity. Now choose an arbitrary 1st order formula, P.
On the supposition that Q exists, we can test P for finite
validity as follows. First, form P* by treating the non-logical
constants of P as variables and existentially quantifying them.
Note that, if P is finitely valid, then P*, if satisfiable at
all, is an axiom of infinity. Now we start two mindless-search
algorithms side by side: (a) search for a finite model falsifying
P, (b) search for a formal proof of (P* -> Q). One of these is
bound to terminate; if (a) succeeds P is not finitely valid, and
if (b) does it is. Comments:
(i) Since P* is formed by existentially quantifying a 1st-order
sentence, it doesn't matter whether P* is unsatisfiable or an
axiom of infinity: either way, (P* -> Q) will be provable.
(ii) Church, op. cit., discusses only 2nd order axioms of
infinity, but the result obviously holds for higher orders as
well.
(iii) If a 1st order sentence (e.g. the negation of P, above) is
finitely satisfiable, an exhaustive search will IN PRINCIPLE find
a model. Note, however, that this is a thoroughly unbounded
algorithm. From Trakhtenbrot's theorem it follows that, for any
recursive function f, there is a natural number i such that for
some 1-st order sentence S of length i, S is finitely satisfiable
but its smallest model has size > f(i). We're talking
EFFECTIVE computability, not FEASIBLE!
-----C-----
The surprising thing about the result of part B is that it is
DIFFERENT from the other well-known fact about axioms of
infinity. Elementary set-theory textbooks give two definitions of
INFINITE SET: Dedekind-infinite and non-inductive. Famously,
these (or the Axioms of Infinity formed from them) can only be
shown equivalent by use of the Axiom of Choice. Leaving the naive
reader (me) with the impression "Yes there are
non-equivalent Axioms of Infinity, but the Axiom of Choice saves
the day and lets you prove them equivalent."
WRONG! This is a different phenomenon; even if we include the
Axiom of Choice in the axiomatized fragment of higher order logic
considered above, there is STILL an infinite sequence of ever
weaker Axioms of Infinity.
-----D-----
In a way this shouldn't surprise us. Consider the infinite
sequence of 1st order (with identity) sentences
"There is at least one thing"
"There are at least two things"
"There are at least three things"
and so on. They are jointly satisfiable only in infinite domains,
so they can be thought of as constituting, not an Axiom of
Infinity, but an "Axiomatization of Infinity." All of
them ought to be derivable from any sentence calling itself an
Axiom of Infinity: any Axiom of Infinity is at least as strong as
this "Axiomatization." By compactness, however, no
Axiom of Infinity is derivable from it! Thus, any Axiom of
Infinity is properly stronger than the 1st order
"Axiomatization." So if there were a weakest Axiom of
Infinity, there would be a **gap**. Which would be a surprising
and disturbing fact.
HOWEVER... I don't actually know of any Axiom of Infinity
weaker than
(*) There is a nonempty family of sets containing a proper
superset of each of its members.
I am fond of (*) for a number of reasons. It is close to
Whitehead and Russell's Inf Ax. It amounts to saying that the
universe is infinite in the non-Dedekind sense from introductory
set theory texts. It is in a language (Monadic Third Order logic:
closely related to the "Framework" of David Lewis's
"Parts of Classes") that I have spent time with. By
(B), above, there are weaker Axioms of Infinity. Can anyone give
me an example of a properly weaker one? (Reasonably short,
natural, non-pathological examples not encoding complex
statements about Turing machines preferred.)
Allen Hazen
Philosophy Department
University of Melbourne
See also
www.math.psu.edu/simpson/courses/math557/misc/trakhtenbrot.pdf
www.cs.indiana.edu/classes/foc-leiv/trakh.ps
negation, contradiction, absurdity