In propositional logic, we considered formulas made about atomic objects, which could only be either true or false. First-order logic, the topic of this chapter, builds upon propositional logic and allows you to look inside the objects discussed in formulas. We can provide this more refined level of granularity by discussing objects as elements of sets that can be larger than just the set
, and also include arbitrarily complex relationships with each other.
We begin first by defining the syntax of first-order (FO) logic, and then give these structures meaning.
The domain of discourse for first order logic is first-order structures or models. A first-order structure contains
- Relations,
- Functions, and
- Constants (functions of arity 0).
The vocabulary of first-order logic is
- a set of relation symbols with associated arities, and
- a set of function symbols with associated arities.
Here are some example first-order logic vocabularies:
- A graph
- Relation Set = {
: unary,
: binary }
- Function Set = {
: unary }
- Arithmetic
- Relation Set = {
: ternary,
: ternary,
: ternary,
: binary,
: binary}
- Function Set = {
: unary,
: const }
Here, a graph is a set of vertices
and a set of edges
. For the arithmetic set note that the use of
and
is purely syntactic and we could have used the symbols "plus" and "times" instead. We haven't provided the symbols with any meaning yet.
A term denotes an element in the first-order structure. A term is used to refer to the elements in our domain of discourse. Here are the rules that describe what a term is:
- every variable is a term, where a variable is simply another set of symbols
- every constant is a term
- If
are terms, and
is a
-ary function then
is a term.
For example, if
is a binary function and
is a ternary function then the following are all terms:
.
An atomic formula is
,
where
is a
-ary relation and
are terms. When viewing
as a set, then this is just another way of saying that the tuple
.
A special relation
means "equal" and cannot be interpreted otherwise. For example,
stands for
.
A first-order formula is an expression built using a given first-order vocabulary and variables and the symbols
. The set of first-order formulas is the minimum set satisfying the following:
- atomic formulas are first order formulas
- If
and
are formulas and
is a variable, then the following are also formulas:
data:image/s3,"s3://crabby-images/4eeed/4eeeda8b40c50fd9deed6632b87464f5a37d7c6a" alt="{\displaystyle (\phi \vee \psi )}"
data:image/s3,"s3://crabby-images/9e6c7/9e6c75d393f94a9eb8e736284e93aad8481d1f0c" alt="{\displaystyle (\phi \wedge \psi )}"
data:image/s3,"s3://crabby-images/696fb/696fb4c419457379398dd713d6ac1633cda558a3" alt="{\displaystyle (\neg \phi )}"
data:image/s3,"s3://crabby-images/7a7a0/7a7a052e74adb3fd0aeb19c13967a2e9560c8129" alt="{\displaystyle (\phi \rightarrow \psi )}"
data:image/s3,"s3://crabby-images/adf49/adf49c019e3a9003f0edc70e00704b247d9c353f" alt="{\displaystyle (\exists x\phi )}"
data:image/s3,"s3://crabby-images/3c2d5/3c2d5ab2137e6a15453cb8580e4fe879f7c9b1a0" alt="{\displaystyle (\forall x\phi )}"
A first-order structure over a given vocabulary consists of
- A domain, which is a set of elements (also known as a universe)
- A mapping associating to every
-ary relation symbol in the vocabulary a
-ary relation over the domain, and to every
-ary function symbol a
-ary function over the domain.
These components give meaning to the symbols.
Example:
- Relation Set = {
: ternary,
: ternary,
: ternary,
: binary,
: binary }
- Function Set = {
: unary,
: const }
A first-order structure over this vocabulary is:
- Domain: the set of integers
- Mapping :
addition,
multiplication,
exponentiation,
ordering, data:image/s3,"s3://crabby-images/5c490/5c49004b4ac233b93987373dd78261f5aa0f5cfa" alt="{\displaystyle {\mathit {next}}\rightarrow i+1}"
In this structure, the formula
![{\displaystyle \exists x\forall y\forall z[\times (y,z,x)\rightarrow ((y=1)\vee (z=1))]}](https://wikimedia.org/api/rest_v1/media/math/render/svg/377ac8513552e4a7fbf47f6b19d20d8e80132996)
expresses the statement "there exists a prime number" (the number 1 also satisfies this statement).
Note here that
is equivalent to
.
In the formula
or
,
is referred to as the scope of quantification of the variable
. An occurrence of a variable in a formula is bound if it is in the scope of quantification of that variable, otherwise the occurrence is said to be free. You can consider a quantifier use as a variable declaration.
A sentence is a formula with no free variables (i.e., all variables are bound). A sentence is either true or false.
A formula with free variable(s) can be considered as describing the properties of the free variable(s). For example,
denotes a formula with
occurring free and describes the properties of
.
If a sentence
evaluates to true over a structure
, we say
satisfies
and denote this by
.
iff
, where
is the interpretation of
by
.
iff
and
. (Similar for
.)
iff
.
if
for some
in the domain.
if
for every
in the domain.
Note:
denotes the result of substituting
for every free occurrence of
in
. Substitution is covered further later in this chapter.
The axioms are a set of sentences meant to distinguish "desired" models from others. But typically, also have "undesired" models, which are called non-standard models.
Examples: Consider the vocabulary
, where the symbols have the usual meaning (defined by FO sentences over this vocabulary). The standard interpretation for this set of axioms (see the end of this chapter) is the integers, but these are also satisfied by the set of integers modulo
. This possibility is ruled out by adding the following sentence to the axioms:
Question: Can all non-standard models be axiomatized away? The answer is no. Consider the model shown in [TODO: import figure] for the vocabulary containing only
(all elements in the upper line are larger than those on the lower line).
There is no set of first-order sentences that can distinguish this model from the natural numbers. Intuitively, the reason is that we cannot "backtrack" arbitrarily (towards
) in a first-order sentence. A similar non-standard model can be obtained for the full vocabulary above.
Given a first-order structure
and a FO sentence
, can we tell if
?
This problem is decidable for finite structures. For example, suppose
is a binary relation representing the edges of a graph, and the given sentence is
.
We can evaluate the truth of this sentence by trying all possible values for
and
. (Naive evaluation: "nested loop".) This is polynomial time (in domain size) but exponential in the size of the formula.
On infinite domains, we may or may not be able to evaluate the truth of a sentence. With the vocabulary
, where
is the set of natural numbers, it is decidable if a sentence is true. (This is known as Presburger arithmetic.) However if we include multiplication, the truth of a sentence becomes undecidable.
A set
of FO sentences is
- Satisfiable if there is some structure
such that data:image/s3,"s3://crabby-images/8736e/8736e77e1fe7e0703f28cd150b08cf9e7f06a5c1" alt="{\displaystyle I\models \Delta }"
- Unsatisfiable if it is not satisfiable
- Valid if
for all structure data:image/s3,"s3://crabby-images/78f89/78f89995590a40c2e28a3f2107833b7a9a38c044" alt="{\displaystyle I}"
Given a set of FO sentences
and a sentence
entails
(denoted
), if every structure that satisfies
also satisfies
.
iff
is unsatisfiable.
- If
is finite then,
iff
is valid.
- If a formula
has free variables
,
is valid iff
is valid.
The axioms for validity are from three categories:
- Axioms for boolean validity. These are inherited from propositional logic.
- Axioms for equality.
- Axioms for quantification.
Given a FO formula
, a boolean form of
a propositional formula
such that
is obtained from
by replacing each propositional variable in
by a subformula of
.
The set of Boolean forms of
is denoted
.
Examples:
- For the FO formula
, the boolean form is
.
- If
, then
where
.
Claim: If
and
is valid, then
is valid.
Let
be terms. The following are valid formulas.
data:image/s3,"s3://crabby-images/b305d/b305d92be56a632714f45e0206e560a5fab0bdf3" alt="{\displaystyle t=t}"
, where
is a
-ary function.
, where
is a
-ary relation.
Given a formula
in which variable
occurs free (denoted by
) and a term
, we define the substitution of
for
in
, denoted
, as the formula that results from replacing every
free occurrence of
in
by
, subject to the constraint that
contains no variable
quantified in
such that
occurs free within the scope of quantification of
. If
does not occur free
, then
is defined as
.
Example: Let
be
,
then
is a valid substitution
is not a valid substitution.
, where
is a term and
is a valid substitution
data:image/s3,"s3://crabby-images/b68cf/b68cff75fca45050058c09f1afe712faf6e2907b" alt="{\displaystyle (\forall x(\varphi \rightarrow \psi ))\rightarrow ((\forall x\varphi )\rightarrow (\forall x\psi ))}"
data:image/s3,"s3://crabby-images/3ad7f/3ad7f2131ed5493c296fb1ef0b28d871449a6812" alt="{\displaystyle \varphi \rightarrow \forall x\varphi }"
data:image/s3,"s3://crabby-images/054b1/054b1aa15f2c13ee799eccfe83856eb2cb6e5b36" alt="{\displaystyle \exists x\varphi \leftrightarrow \neg \forall x\neg \varphi }"
In summary, the axioms for validity are
:
- Axioms for boolean validity. These are inherited from propositional logic.
- Axioms for equality.
- Axioms for quantification.
Definition: A proof is a sequence
of FO sentences such that for each
either
or
such that
and
.
Notation: If there is a proof of
using
, we denote this fact by
.
Fact (Soundness): If
, then
is valid.
Remark: The set of formulas which can be proven valid is recursively enumerable.
Example: Proof of the formula
data:image/s3,"s3://crabby-images/bb4a6/bb4a6ef2489f714ba6ac1dc02f7406a7e7853d80" alt="{\displaystyle \forall x(\phi \land \psi )}"
data:image/s3,"s3://crabby-images/46f15/46f156d2ffb416526da4a454071cfbd560911e5c" alt="{\displaystyle \forall x(\phi \land \psi )\rightarrow (\phi \land \psi )(x\leftarrow x)}"
data:image/s3,"s3://crabby-images/4f830/4f830316238fd6e9344c1981234a752d46b67462" alt="{\displaystyle \phi \land \psi }"
data:image/s3,"s3://crabby-images/4d5f6/4d5f661c8f8797b52101eb8b9a510f5e87bdc76f" alt="{\displaystyle \phi }"
data:image/s3,"s3://crabby-images/89899/89899361ef2286d9ec095531a8780f8da55d5f3d" alt="{\displaystyle \forall x(\phi \land \psi )\rightarrow \phi }"
data:image/s3,"s3://crabby-images/6f8fe/6f8fe2006d34f688ef56917be3c99cf671ddf552" alt="{\displaystyle \forall x(\forall x(\phi \land \psi )\rightarrow \phi )}"
data:image/s3,"s3://crabby-images/2b219/2b219b4b0b33436ad769d1546c5d92c1daee330d" alt="{\displaystyle \forall x\forall x(\phi \land \psi )\rightarrow \forall x\phi }"
data:image/s3,"s3://crabby-images/5e8bc/5e8bc934d78f00e5814af9954bad5cabe6f0c23a" alt="{\displaystyle \forall x(\phi \land \psi )\rightarrow \forall x\phi }"
data:image/s3,"s3://crabby-images/c6118/c6118cb7bbe66ca2854621f4504de875f31f3c8f" alt="{\displaystyle \forall x\phi }"
The proof of
is symmetrical.
data:image/s3,"s3://crabby-images/8ba69/8ba69311d219804bcc6c5805d3cb3373baaf6cd0" alt="{\displaystyle \forall x\phi \land \forall x\psi }"
data:image/s3,"s3://crabby-images/42f44/42f44ec9f501a57494e6d243a4a75c261c1359e0" alt="{\displaystyle \forall x(\phi \land \psi )\rightarrow (\forall x\phi \land \forall x\psi )}"
data:image/s3,"s3://crabby-images/328f3/328f3ea995890f1831ed80ac1b1d7c1461147133" alt="{\displaystyle \forall x(\phi \land \psi )\leftrightarrow (\forall x\phi )\land (\forall x\psi )}"
e.g.
"x is even" and
"x is odd"
data:image/s3,"s3://crabby-images/6c4a3/6c4a3337d22876012dbbce42acba25c0fa863b75" alt="{\displaystyle \forall x(\phi (x)\lor \neg \phi (x))\not \leftrightarrow (\forall x\phi (x)\lor \forall x\neg \phi (x))}"
data:image/s3,"s3://crabby-images/797e9/797e9266ace4509071a5bd2b82a2c6afc4821270" alt="{\displaystyle \exists x(\phi \lor \psi )\leftrightarrow (\exists x\phi )\lor (\exists x\psi )}"
data:image/s3,"s3://crabby-images/eb549/eb549063e40d38ab9668cdc2df2ef057c6bfd164" alt="{\displaystyle \neg \exists x\phi \leftrightarrow \forall x\neg \phi }"
data:image/s3,"s3://crabby-images/974df/974df98eaf391707d876d19ea47cf69d3014b94c" alt="{\displaystyle \neg \forall x\phi \leftrightarrow \exists x\neg \phi }"
Claim: Every FO sentence is equivalent to a FO sentence of the form
, where
and
is quantifier free.
This is proved using the distributive properties of quantifiers. It may be necessary to rename variables, to avoid ambiguities. Although you can always move all quantifiers to the left, the resulting formula can actually be exponentially larger than the original one.
Example:
data:image/s3,"s3://crabby-images/9e0c0/9e0c0b433df3e548f55943899d324d77432546b5" alt="{\displaystyle \equiv (\forall x(G(x,x)\land (\forall yG(x,y)\lor \exists y\neg G(y,y))))\land G(w,0)}"
data:image/s3,"s3://crabby-images/0a58e/0a58e2de2d00e9e263a2f3cd9f389a11fc285a43" alt="{\displaystyle \equiv \forall x(G(x,x)\land (\forall yG(x,y)\lor \exists y\neg G(y,y))\land G(w,0))}"
data:image/s3,"s3://crabby-images/fe378/fe378e303441ebfb8a966549ef6f33202782d591" alt="{\displaystyle \equiv \forall x(G(x,x)\land (\forall yG(x,y)\lor \exists z\neg G(z,z))\land G(w,0))}"
data:image/s3,"s3://crabby-images/98811/98811306638318173fc22d68cd0410b07ebe123c" alt="{\displaystyle \equiv \forall x(G(x,x)\land \forall y(G(x,y)\lor \exists z\neg G(z,z))\land G(w,0))}"
data:image/s3,"s3://crabby-images/5e1c3/5e1c3ff2809285fe08b2d7a12814494250281d3c" alt="{\displaystyle \equiv \forall x\forall y(G(x,x)\land (G(x,y)\lor \exists z\neg G(z,z))\land G(w,0))}"
data:image/s3,"s3://crabby-images/a2b2c/a2b2cdeb9087fb7b48a0c38df402c862acbe4124" alt="{\displaystyle \equiv \forall x\forall y(G(x,x)\land \exists z(G(x,y)\lor \neg G(z,z))\land G(w,0))}"
data:image/s3,"s3://crabby-images/95012/95012ac9b6946f2b63ca4eddf6985d75ec9adab9" alt="{\displaystyle \equiv \forall x\forall y\exists z(G(x,x)\land (G(x,y)\lor \neg G(z,z))\land G(w,0))}"
- Describe the desired model as closely as possible using a (possibly infinite but recursive) set
of axioms (FO sentences).
- Prove things using deduction.
Notation: We say
is a valid consequence of
(denoted
), if every model satisfying
also satisfies
.
Fact:
iff
is unsatisfiable.
The following fourteen first-order axioms describe the properties of arithmetic and numbers, i.e. addition (
), multiplication (
), exponentiation (
, equality (
), ordering (
), successor function (
) and remainder (mod). This example shows the expressive power of first-order statements, which were originally hoped to provide a basis for the "one true mathematics."
Question: Why are they called "nonlogical" axioms?
- NT1:
data:image/s3,"s3://crabby-images/0dd95/0dd95a589c12a69968fd2a7488a18ab59dbb06e4" alt="{\displaystyle \forall x(\sigma (x)\neq 0)}"
- NT2:
data:image/s3,"s3://crabby-images/cf681/cf681513033ac9a87415464159425816b792940e" alt="{\displaystyle \forall x\forall y(\sigma (x)=\sigma (y)\rightarrow x=y)}"
- NT3:
data:image/s3,"s3://crabby-images/c824b/c824b299e776a952e1b90f6ddfbd2fc791852b72" alt="{\displaystyle \forall x(x=0\vee \exists y\sigma (y)=x)}"
- NT4:
data:image/s3,"s3://crabby-images/40597/40597a4a5217639326e70605a1ded626c1f4a0fe" alt="{\displaystyle \forall x(x+0=x)}"
- NT5:
data:image/s3,"s3://crabby-images/78ec0/78ec04cc6785a4f7ded422c955a739c780f07a57" alt="{\displaystyle \forall x\forall y(x+\sigma (y)=\sigma (x+y))}"
- NT6:
data:image/s3,"s3://crabby-images/0bc9a/0bc9ae903c2e8e9d218aa961cd529345d0c168bc" alt="{\displaystyle \forall x(x\times 0=0)}"
- NT7:
data:image/s3,"s3://crabby-images/83a13/83a13042c513d157ec888e81aa236d9f56d40b03" alt="{\displaystyle \forall x\forall y(x\times \sigma (y)=(x\times y)+x)}"
- NT8:
data:image/s3,"s3://crabby-images/8e06d/8e06de0dfc86b1787d034784582e0ca5a772723f" alt="{\displaystyle \forall x(x\uparrow 0=\sigma (0))}"
- NT9:
data:image/s3,"s3://crabby-images/4f0af/4f0af495be369c195f97e3f987e41f963577cc69" alt="{\displaystyle \forall x\forall y(x\uparrow \sigma (y)=(x\uparrow y)\times x)}"
- NT10:
data:image/s3,"s3://crabby-images/86cb8/86cb865951cfae66a719851454e971c50cad5554" alt="{\displaystyle \forall x(x<\sigma (x))}"
- NT11:
data:image/s3,"s3://crabby-images/4f8d4/4f8d460f59e23215675d58eac765f881933751ac" alt="{\displaystyle \forall x\forall y(x<y\rightarrow (\sigma (x)\leq y))}"
- NT12:
data:image/s3,"s3://crabby-images/87adb/87adb258db5ce0b69e7421bd842475f490e5f908" alt="{\displaystyle \forall x\forall y(\neg (x<y)\leftrightarrow y\leq x)}"
- NT13:
data:image/s3,"s3://crabby-images/b2bef/b2bef88bd5ad07f01a590af39fba6a67be9286a2" alt="{\displaystyle \forall x\forall y\forall z(((x<y)\wedge (y<z))\rightarrow x<z)}"
- NT14:
data:image/s3,"s3://crabby-images/254d5/254d56d1571e35118b329ef88c2646d3f2332eb8" alt="{\displaystyle \forall x\forall y\forall z\forall z'(mod(x,y,z)\wedge mod(x,y,z')\rightarrow z=z')}"