The Idea of a Set

The theory of sets is deep and has many subtleties. Strictly speaking, what exactly are sets and what behaviors one should expect from them are still matters of debate in mathematical philosophy and logic. However, at the informal level — the level at which this book uses set theory — sets are conceptually simple things. They are just unordered collections of arbitrary objects. Note that when I say "unordered collections" and "objects", I do not pretend to mean anything mathematically precise by them. In fact, I cannot do so without either circular reasoning or infinite regress. Rather, I am merely appealing to one's intuitive ideas of what an "unordered collection" and an "arbitrary object" should be and hope that everyone more or less agrees on those ideas.

Moving on, a set is denoted either in explicit form where all the elements of the set are written down between curly braces or in set builder form where the condition/predicate satisfied by the elements of the set is written down between curly braces. An example of the former is {0,1}\{0, 1\}, whereas as example of the latter is {x: xN and x<2}\{x:\ x\in\mathbb{N}\ \text{and}\ x<2\} (pretend that N\mathbb{N} is defined in the last example). Sometimes, in set builder notation, one of the conditions is snuck onto the left side of the '::'. For instance, I could have equally written the last example as {xN: x<2}\{x\in\mathbb{N}:\ x<2\}.

If one thinks about the two notations for a while, one will realize that the set builder form is more expressive than the explicit form. One can, for example, easily denote and manipulate infinite sets with the set builder form, while one cannot humanly write down the explicit form of an infinite set. However, the set builder form is too powerful. In general, collections of the form {x: ϕ(x)}\{x:\ \phi(x)\} cannot be sets for arbitrary conditions ϕ\phi, lest one wants to fall victim to Russell's Paradox. Some of those arbitrary collections must either not exist or be "larger" than sets, called classes. Discussing classes is outside the scope of this book. However, I mention them to motivate a special class worth keeping in mind — the universal class U\mathcal{U} that contains all sets. In most set theories, this also means that U\mathcal{U} contains all the mathematical objects being talked about because, in those theories, most objects of mathematical interest are sets!

Membership and Set Equality

As discussed above, sets are collections containing objects. To show that an object xx is inside a set AA, one writes xAx \in A. One says that xx is a member of AA. Two sets are equal if and only if they contain exactly the same members. More precisely, one says that sets AA and BB are equal if and only if given xAx \in A, one can show that xBx \in B and given xBx \in B, one can show that xAx \in A.

The Empty Set

One of the most important, if trivial, examples of sets is the empty set that contains no objects. It is either denoted by \emptyset or denoted in its obvious explicit form {}\{\}. Interestingly, it can have many set builder forms. Inside a set theory where all objects of interest are sets, one of those forms is {xU: xx}\{x\in\mathcal{U}:\ x \neq x\}. This says that the empty set contains all and only those objects that are not equal to themselves. Of course, in any reasonable mathematical theory, every object is in fact equal to itself. So, this denotation is basically a roundabout way of saying that the empty set has no members.

Subsets

One calls a set AA a subset of a set BB when every object inside AA is also inside BB i.e. if xAx \in A, then xBx \in B. This fact is denoted by ABA \subseteq B. If, in addition, AA is not the whole of BB, then one calls AA a proper subset of BB, denoting this fact by ABA \subset B.

It should not be hard to see that two sets are equal if and only if each is a subset of the other. Furthermore, note the following useful observation: if ABA \subset B, then one necessarily has an object in BB that is not in AA.

Operations on Sets

In this book, I primarily use three operations to make new sets from old ones (assuming a set theory where all objects being talked about are also sets):

  • Union: Given a set of many other sets S\mathcal{S}, S={xU: xS for some SS} \bigcup\mathcal{S}=\{x\in\mathcal{U}:\ x \in S\ \text{for some}\ S \in \mathcal{S}\} i.e. S\bigcup\mathcal{S} is the set of all objects that are in at least one of the sets inside S\mathcal{S}.
  • Intersection: Given a set of many other sets S\mathcal{S}, S={xU: xS for all SS} \bigcap\mathcal{S}=\{x\in\mathcal{U}:\ x \in S\ \text{for all}\ S \in \mathcal{S}\} i.e. S\bigcap\mathcal{S} is the set of all objects that are common to all the sets inside S\mathcal{S}. Technically, S\mathcal{S} should be non-empty for S\bigcap\mathcal{S} to be a set; otherwise, the intersection would be the whole of U\mathcal{U} which is not a set.
  • Power: Given a set SS PS={XU: XS} \mathcal{P}S=\{X\in\mathcal{U}:\ X \subseteq S\} i.e. PS\mathcal{P}S is the set of all subsets of SS.

The Cartesian Product of Sets

Even though sets are unordered, one can manually construct order out of them. For instance, the ordered pair (x,y)(x,y) can be defined as {{x},{x,y}}\{\{x\},\{x,y\}\}. Given this definition, it is not hard to show that one can tell the left element xx apart from the right element yy in (x,y)(x,y) i.e. one can show that if (a,b)=(x,y)(a,b)=(x,y), then a=xa=x and b=yb=y. Given two sets XX and YY, the set of all ordered pairs, where the left and right elements are taken from XX and YY respectively, is denoted by X×YX \times Y. More explicitly, X×Y={pU: p=(x,y) for some xX and some yY} X \times Y = \{p \in \mathcal{U}:\ p=(x,y)\ \text{for some}\ x \in X\ \text{and some}\ y \in Y\} This set is called the Cartesian Product of XX and YY.

Functions as Sets

A function ff with the domain set XX and the codomain set YY is just a subset of X×YX \times Y satisfying thus:

  • For every xXx \in X, there is exactly one yYy\in Y such that (x,y)f(x,y)\in f. One says that xx is mapped to yy and denotes this fact as f(x)f(x). This denotation is unambiguous since, as said before, there is just a single yy that f(x)f(x) could ever refer to.

Such a function ff is denoted as f:XYf:X \to Y. Next, I present the salient concepts about functions that the reader needs to keep in his/her mind while going through this book.

Given two functions f:XYf:X \to Y and f:YZf':Y \to Z with matching codomain and domain YY, their composition (ff):XZ(f' \circ f):X \to Z, defined by (ff)(x)=f(f(x))(f' \circ f)(x)=f'(f(x)), is also a function. The reader should translate this definition into the language of sets and verify that the claim is indeed true.

The set of all elements of YY that gets mapped to by an element of a subset UXU \subseteq X is denoted by f(U)f(U). This is called the image set of UU. More explicitly, f(U)={yY: y=f(x) for some xU} f(U)=\{y \in Y:\ y=f(x)\ \text{for some}\ x \in U\} Obviously f(U)f(U) is always a subset of YY.

For a given element yy of YY, the set of all elements of XX that map to that element is denoted by f1(y)f^{-1}(y). More explicitly, f1(y)={xX: f(x)=y} f^{-1}(y)=\{x \in X:\ f(x)=y\} Obviously f1(y)f^{-1}(y) is always a subset of XX. Generalizing, given UYU \subseteq Y, the set of all elements of XX mapping into UU is denoted by f1(U)f^{-1}(U) i.e. f1(U)={xX: f(x)U}f^{-1}(U)=\{x \in X:\ f(x) \in U\}. Note that f(f1(U))Uf(f^{-1}(U)) \subseteq U.

One says that ff is injective iff f(x)=f(x~)f(x)=f(\tilde{x}) implies x=x~x=\tilde{x}. Alternatively, by logical contraposition, xx~x \neq \tilde{x} implies f(x)f(x~)f(x) \neq f(\tilde{x}). Yet another way to phrase injectivity is this: given a yYy \in Y, there is at most one xXx \in X such that f(x)=yf(x)=y. As an example, for any two sets XYX \subset Y, there is an injection ι:XY\iota:X \to Y such that ι(x)=x\iota(x) = x for each xXx \in X. The reader can easily show that this is a well-defined injection. This injection ι\iota is called the inclusion map from XX to YY.

On the other hand, one says that ff is surjective iff for every yYy \in Y there is at least one xXx \in X such that f(x)=yf(x)=y. Using notation defined earlier, the reader can verify that surjectivity is equivalent to any of the following assertions:

  • f(X)=Yf(X)=Y
  • f1(y)f^{-1}(y)\neq\emptyset for any yYy \in Y.
  • Uf(f1(U))U \subseteq f(f^{-1}(U)) for every UYU \subseteq Y

Given two sets such that YXY \subseteq X and YY \neq \emptyset, there is a surjection σ:XY\sigma:X \to Y. To see this, note that YY \neq \emptyset; so there is some yYy \in Y. Now, define σ:XY\sigma:X \to Y thus: σ(x)={xif xYyif xY \sigma(x)=\begin{cases} x &\text{if } x \in Y \\ y &\text{if } x \notin Y \end{cases} The reader can easily show that this is a well-defined surjection.

If a function is both injective and surjective, it is called bijective. Hence, if ff is bijective and yYy \in Y, then there is exactly one xXx \in X such that f(x)=yf(x)=y. This observation allows one to define a function flinv:YXf^\text{linv}:Y \to X where flinv(y)f^\text{linv}(y) is equal to that one and only xx satisfying f(x)=yf(x)=y. This flinvf^\text{linv} is called a left inverse of ff because it cancels ff from the left: flinv(f(x))=xf^\text{linv}(f(x))=x.

I now present some elementary results about functions in general.

Lemma RSIS

Restriction to Subset and its Image is Surjection: For any function f:XYf:X \to Y and UXU \subseteq X, restricting the function's domain and codomain to UU and f(U)f(U), respectively, produces a surjection; furthermore, if ff is an injection, so is this restriction.

Proof: As per the theorem, let fU:UY~=f(U)f_U:U \to \tilde{Y}=f(U) where for all xUx \in U, fU(x)=f(x)f_U(x)=f(x). Now, suppose yY~y \in \tilde{Y}. Then, by definition of f(U)f(U), f(x)=yf(x)=y for some xUx \in U. But then, fU(x)=yf_U(x)=y. Hence, every yy in the codomain of fUf_U is the image of some xx in its domain i.e. fUf_U is surjective. Next, let fU(x)=fU(x~)f_U(x)=f_U(\tilde{x}) for x,x~Ux,\tilde{x} \in U. Then, f(x)=f(x~)f(x)=f(\tilde{x}) and, since UXU \subseteq X, x,x~Xx,\tilde{x} \in X. Thus, if ff is injective, then certainly x=x~x=\tilde{x}, meaning fUf_U is injective too.

Lemma ISSI

Image of a Subset is a Subset of the Image: For any function f:XYf:X \to Y and UXU \subseteq X, f(U)Yf(U) \subseteq Y. Furthermore, if ff is injective and UU is a strict subset of XX, then f(U)f(U) is a strict subset of YY.

Proof: This is left to the reader.

Lemma LInvBijThenBij

Left Inverse of a Bijection is a Bijection: If f:XYf:X \to Y is a left inverse of a bijection f:YXf':Y \to X, then ff is bijection.

Proof: Let me first show that ff is injective i.e. assuming f(x)=f(x~)f(x)=f(\tilde{x}), I will proceed to prove that x=xx=x'. Now, since ff' is bijective, and therefore surjective, there are y,y~Yy,\tilde{y} \in Y such that f(y)=xf'(y)=x and f(y~)=x~f'(\tilde{y})=\tilde{x}. Thus, y=f(f(y))=f(x)=f(x~)=f(f(y~))=y~ y=f(f'(y))=f(x)=f(\tilde{x})=f(f'(\tilde{y}))=\tilde{y} where the beginning and ending equalities come from the given hypothesis that ff is a left inverse of ff'. So ff' being a function, it maps equal values to exactly one value; hence, x=f(y)=f(y~)=x~x=f'(y)=f'(\tilde{y})=\tilde{x}, which is the sought after result.

Next, showing the surjectivity of ff is very easy: for any yYy \in Y, there is obviously an element f(y)f'(y) of XX such that ff maps that element to yy as f(f(y))=yf(f'(y))=y.

Lemma CBB

Composition of Bijections is a Bijection: Given bijections f:XYf:X \to Y and f:YZf':Y \to Z, the composition (ff):XZ(f' \circ f):X \to Z is also a bijection.

Proof: This is left to the reader.

results matching ""

    No results matching ""