| 1 | \xcalways\section{Basics}\x |
| 2 | |
| 3 | \xcalways\subsection{Introduction and motivation}\x |
| 4 | |
| 5 | \begin{slide} |
| 6 | \topic{joke} |
| 7 | \head{Mandatory joke} |
| 8 | |
| 9 | An elderly Frenchman rises every morning at 5, goes out to the street in |
| 10 | front of his house, and sprinkles a white powder up and down the street. |
| 11 | |
| 12 | One day, a neighbour, who has watched his routine for many years, confronts |
| 13 | him. `What is this powder you sprinkle on the street every morning, |
| 14 | Pierre?' |
| 15 | |
| 16 | `It is elephant powder, \emph{mon ami},' the gentleman says. `It keeps the |
| 17 | elephants away.' |
| 18 | |
| 19 | `But Pierre,' says the neighbour. `Everybody knows that there are no |
| 20 | elephants in France.' |
| 21 | |
| 22 | Says the older man matter-of-factly, `I guess it must be working, then.' |
| 23 | \end{slide} |
| 24 | |
| 25 | The joke has its mandatory corresponding serious message. Cryptography is |
| 26 | like elephant powder. If it seems to work, and keeps the attackers -- our |
| 27 | elephants -- away, then we trust it. This isn't really very satisfactory. |
| 28 | When the elephant powder fails, a 2-ton elephant turns up, with three of its |
| 29 | chums in a Mini, and you notice it hiding upside-down in your bowl of |
| 30 | custard. Let's face it: elephants aren't good at being surreptitious. |
| 31 | |
| 32 | But if your cryptography is no good, you may never know. |
| 33 | |
| 34 | \begin{slide} |
| 35 | \topic{serious message} |
| 36 | \head{Cryptography is elephant powder} |
| 37 | |
| 38 | So, what can we do about the situation? |
| 39 | \begin{itemize} |
| 40 | \item Design simple cryptographic primitives, e.g., block ciphers, hash |
| 41 | functions. Develop techniques for analysing them, and attempt to stay |
| 42 | ahead of the bad guy. |
| 43 | \item Build useful constructions from trusted primitives in a modular way. |
| 44 | \emph{Prove that the constructions are secure.} |
| 45 | \end{itemize} |
| 46 | Here we look at this second part of the approach. |
| 47 | \end{slide} |
| 48 | |
| 49 | \xcalways\subsection{Approach}\x |
| 50 | |
| 51 | \begin{slide} |
| 52 | \head{The provable security approach} |
| 53 | |
| 54 | \begin{itemize} |
| 55 | \item Define notions of security. Now we know what to aim for, and what to |
| 56 | expect of our components. |
| 57 | \item Prove how the security of a construction relates to the security of |
| 58 | its primitives. If it breaks, we know who to blame. |
| 59 | \end{itemize} |
| 60 | \end{slide} |
| 61 | |
| 62 | \begin{slide} |
| 63 | \topic{adversaries} |
| 64 | \head{Adversaries} |
| 65 | |
| 66 | We model our adversary as a \emph{probabilistic algorithm}; i.e., the |
| 67 | algorithm is allowed to \emph{flip coins} to make decisions. The |
| 68 | adversary's output (for some input) follows a probability distribution. We |
| 69 | define what it means for an adversary to \emph{break} our construction, and |
| 70 | examine the probability with which this happens. |
| 71 | |
| 72 | We provide the adversary with \emph{oracles}: |
| 73 | \begin{itemize} |
| 74 | \item Oracles compute using secrets hidden from the adversary. |
| 75 | \item We count the number of \emph{queries} made to an oracle. |
| 76 | \item We can restrict the types of queries the adversary makes. |
| 77 | \end{itemize} |
| 78 | |
| 79 | Oracles are written as superscripts. For example, an adversary given a |
| 80 | chosen-plaintext oracle might be written as $A^{E_K(\cdot)}$. |
| 81 | \end{slide} |
| 82 | |
| 83 | \begin{slide} |
| 84 | \topic{the asymptotic approach} |
| 85 | \resetseq |
| 86 | \head{The asymptotic approach, \seq} |
| 87 | |
| 88 | A function $\nu\colon \N \to \R$ is \emph{negligible} if, for any integer |
| 89 | $c$, there exists an $n \in \N$ such that $0 \le \nu(k) < k^{-c}$ for all |
| 90 | $k \ge n$. That is, $\nu(k)$ is `eventually' less than any polynomial |
| 91 | function of $k$. |
| 92 | |
| 93 | We examine families of constructions, with a \emph{security parameter} $k$. |
  We say that the family is (asymptotically) secure in some sense if, for any
| 95 | polynomial $p(k)$ there is a negligible function $\nu(k)$ such that, for |
| 96 | any construction in the family, parameterized by $k$, no adversary which |
| 97 | runs for time $p(k)$ has success probability better than $\nu(k)$. |
| 98 | \end{slide} |
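For example, $\nu(k) = 2^{-k}$ is negligible: for any $c$, we have $2^{-k} <
k^{-c}$ for all sufficiently large $k$.  On the other hand, $\nu(k) =
k^{-1000}$ is \emph{not} negligible, even though it is tiny for practical
values of $k$: taking $c = 1001$, we have $\nu(k) > k^{-c}$ for all $k \ge 2$.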
| 99 | |
| 100 | \begin{slide} |
| 101 | \head{The asymptotic approach, \seq} |
| 102 | |
| 103 | Suppose we build an encryption scheme from a one-way function. We'd like |
| 104 | to prove that the encryption is good if the one-way function is secure. We |
| 105 | do this by contradiction: |
| 106 | \begin{enumerate} |
| 107 | \item Suppose an adversary $A$ breaks the encryption scheme with |
| 108 | better-than-negligible probability. |
| 109 | \item Show a polynomial-time \emph{reduction}: an algorithm which uses $A$ |
| 110 | to break the one-way function, in polynomial time, and with |
  better-than-negligible probability.
| 112 | \item Claim that this violates the assumption of a secure one-way function. |
| 113 | \end{enumerate} |
| 114 | |
| 115 | This doesn't work with real constructions. We don't know where the |
| 116 | asymptotics set in, and they can conceal large constants. It's still |
| 117 | better than nothing. |
| 118 | \end{slide} |
| 119 | |
| 120 | \begin{slide} |
| 121 | \topic{the concrete (quantitative) approach} |
| 122 | \head{The concrete (quantitative) approach} |
| 123 | |
| 124 | We constrain the resources we allow the adversary: |
| 125 | \begin{itemize} |
| 126 | \item Running time (including program size). |
| 127 | \item Number of oracle queries. |
| 128 | \item Maximum size of oracle queries. |
| 129 | \end{itemize} |
| 130 | Write that something is \emph{$(t, q, \epsilon)$-secure} if no adversary |
| 131 | which runs in time $t$ and makes $q$ oracle queries can break it with |
| 132 | probability better than $\epsilon$. |
| 133 | |
| 134 | We make statements like `foo is $(t, q, 2 q \epsilon)$-secure if bar is $(t |
| 135 | + O(q), 2 q, \epsilon)$-secure'. |
| 136 | |
| 137 | This is a much more satisfactory approach. However, we still have to |
| 138 | \emph{assume} the security of our primitive operations. |
| 139 | \end{slide} |
| 140 | |
| 141 | \xcalways\subsection{Notation}\x |
| 142 | |
| 143 | \begin{slide} |
| 144 | \topic{boolean operators} |
| 145 | \resetseq |
| 146 | \head{Notation, \seq: boolean operators} |
| 147 | |
| 148 | If $P$ and $Q$ are \emph{predicates} -- i.e., either true or false -- then: |
| 149 | \begin{itemize} |
  \item $P \land Q$ is true if both $P$ \emph{and} $Q$ are true;
| 151 | \item $P \lor Q$ is true if either $P$ \emph{or} $Q$ (or both) is true; |
| 152 | \item $\lnot P$ is true if $P$ is false; and |
| 153 | \item $P \implies Q$ is true if $Q$ is true or $P$ is false. |
| 154 | \end{itemize} |
| 155 | \end{slide} |
| 156 | |
| 157 | \begin{slide} |
| 158 | \topic{sets} |
| 159 | \head{Notation, \seq: sets} |
| 160 | |
| 161 | For our purposes, we can think of sets as being collections of objects. |
| 162 | |
| 163 | We use the usual notations for set membership ($x \in X$), intersection ($X |
| 164 | \cap Y$), union ($X \cup Y$) and subset containment ($X \subseteq Y$). The |
| 165 | \emph{empty set}, which contains no elements, is written $\emptyset$. |
| 166 | |
| 167 | The notation $\{\, f(x) \mid P(x) \,\}$ describes the set containing those |
| 168 | items $f(x)$ for those $x$ for which the predicate $P(x)$ is true. |
| 169 | |
| 170 | The \emph{cardinality} $|X|$ of a (finite) set $X$ is the number of |
| 171 | elements in the set. |
| 172 | |
| 173 | The power set $\powerset(X)$ of a set $X$ is the set of all subsets of $X$. |
| 174 | |
  The \emph{Cartesian product} $X \times Y$ of two sets $X$ and $Y$ is the set of all
| 176 | ordered pairs $\{\, (x, y) \mid x \in X \land y \in Y \,\}$. We use |
| 177 | exponents to indicate the product of a set with itself: hence, $X^2 = X |
| 178 | \times X$. |
| 179 | |
| 180 | A \emph{relation} $R$ is a subset of a Cartesian product. We write $R(x, |
| 181 | y)$ if $(x, y) \in R$. Relations between two sets are often written as |
| 182 | infix symbols: e.g., $x \sim y$. |
| 183 | \end{slide} |
| 184 | |
| 185 | \begin{slide} |
| 186 | \head{Notation, \seq: sets (cont.)} |
| 187 | |
| 188 | In addition to strings, defined later, we use the following standard sets: |
| 189 | \begin{itemize} |
| 190 | \item the set $\Z$ of integers; |
| 191 | \item the set $\N = \{\, x \in \Z \mid x \ge 0 \,\}$ of natural numbers; |
| 192 | \item the set $\R$ of real numbers; |
| 193 | \item closed intervals $[a, b] = \{\, x \in \R \mid a \le x \le b \,\}$; |
| 194 | \item the finite field $\F_q$ of $q$ elements, and its multiplicative |
| 195 | subgroup $\F_q^* = \F_q \setminus \{0\}$; and |
| 196 | \item the ring $\Z/n\Z$ of residue classes modulo $n$ (i.e., if $x \in |
| 197 | \Z/n\Z$ and $a, b \in x$ then $a \equiv b \pmod{n}$), and its |
    multiplicative subgroup $(\Z/n\Z)^* = \Z/n\Z \setminus \{\, x + n\Z \mid \gcd(x,
| 199 | n) > 1 \,\}$. |
| 200 | \end{itemize} |
| 201 | \end{slide} |
| 202 | |
| 203 | \begin{slide} |
| 204 | \topic{functions} |
| 205 | \head{Notation, \seq: functions} |
| 206 | |
| 207 | A \emph{function} $f\colon X \to Y$ is a mapping which assigns every |
| 208 | element $x$ in the \emph{domain} $X$ a corresponding element $f(x)$ in the |
| 209 | \emph{range} (or sometimes \emph{codomain}) $Y$. The notation $\dom f$ |
| 210 | describes the domain of an arbitrary function; $\ran f$ describes its |
| 211 | range. |
| 212 | |
| 213 | We sometimes apply the function notation to sets, indicating that the |
| 214 | function should be applied pointwise; i.e., $f(Z) = \{ f(z) \mid z \in Z |
| 215 | \}$. The \emph{image} of a function $f$ is the set $f(\dom f)$. |
| 216 | |
  If $f\colon X \to Y$ maps distinct elements to distinct elements, i.e.,
  $f(x) = f(x') \implies x = x'$ for all $x, x' \in X$, then we say $f$ is \emph{injective} (or
| 219 | \emph{1-to-1}). If $f(X) = Y$ then we say that it is \emph{surjective} (or |
| 220 | \emph{onto}). If $f$ is both injective and surjective then we say that it |
| 221 | is \emph{bijective}. In this case, there is a well-defined inverse |
| 222 | $f^{-1}\colon Y \to X$ defined by $f(f^{-1}(y)) = y$ for all $y \in Y$. |
| 223 | |
| 224 | If $f\colon X \to X$ (i.e., its domain and range are the same set) is |
| 225 | bijective, then we say that $f$ is a \emph{permutation on $X$}. |
| 226 | \end{slide} |
| 227 | |
| 228 | \begin{slide} |
| 229 | \head{Notation, \seq: functions (cont.)} |
| 230 | |
| 231 | We can consider a function $f\colon X \to Y$ to be a particular sort of |
| 232 | relation $f \subseteq X \times Y$, subject to the constraint that if $(x, |
| 233 | y) \in f$ and $(x, y') \in f$ then $y = y'$. |
| 234 | |
| 235 | We shall use this view in some of the algorithms we present. In addition, |
| 236 | we shall use the \emph{maplet} notation $x \mapsto y$ rather than the |
| 237 | ordered pair notation $(x, y)$ to reinforce the notion of a mapping. |
| 238 | |
| 239 | We might write, for example, |
| 240 | \begin{program} |
| 241 | $f \gets f \cup \{ x \mapsto y \}$; |
| 242 | \end{program} |
| 243 | to augment a function by the addition of a new mapping. This is clearly |
| 244 | only valid if $x \notin \dom f$ (or $f(x) = y$) initially. |
| 245 | \end{slide} |
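If it helps to have something concrete to experiment with, here is a minimal
Python sketch of this view of a function as a finite set of maplets, using a
dictionary; the names are illustrative only and aren't part of any
definition.
\begin{verbatim}
f = {}                          # the empty partial function

def extend(f, x, y):
    """Add the maplet x |-> y, as in `f <- f union {x |-> y}'."""
    if x in f and f[x] != y:
        raise ValueError("x already in dom f with a different image")
    f[x] = y

extend(f, 3, 9)
extend(f, 4, 16)
print(3 in f)                   # membership test: is 3 in dom f?
print(f[3])                     # application: f(3) = 9
\end{verbatim}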
| 246 | |
| 247 | \begin{slide} |
| 248 | \topic{strings} |
| 249 | \head{Notation, \seq: strings} |
| 250 | |
| 251 | An \emph{alphabet} is a finite set of \emph{symbols}. The one we'll use |
| 252 | most of the time is the set $\Sigma = \{0, 1\}$ of \emph{bits}. |
| 253 | |
| 254 | Suppose $A$ is an alphabet. The set of sequences of exactly $n$ symbols |
| 255 | from $A$ is written $A^n$. Hence, $\{0, 1\}^{64}$ is the set of all 64-bit |
| 256 | sequences. The set of (finite) \emph{strings} over an alphabet $A$ is $A^* |
| 257 | = \bigcup_{i \in \N} A^i$. The empty string is named $\emptystring$. |
| 258 | |
| 259 | The \emph{length} of a string $a \in A^*$, written $|a|$, is the natural |
| 260 | number $n$ where $a \in A^n$. |
| 261 | |
| 262 | If $x, y \in A^*$ are strings then their \emph{concatenation}, written $x |
| 263 | \cat y$, or sometimes just $x y$, is the result of writing the symbols in |
| 264 | $x$ followed by the symbols in $y$, in order. We have $|x y| = |x| + |y|$. |
| 265 | \end{slide} |
| 266 | |
| 267 | \begin{slide} |
| 268 | \head{Notation, \seq: strings (cont.)} |
| 269 | |
  There are natural mappings between bit strings and natural
| 271 | numbers. |
| 272 | |
| 273 | If $x = x_0 x_1 \ldots x_{n-1} \in \{0, 1\}^*$ then we can associate with |
| 274 | it the natural number |
| 275 | \[ \overrightarrow{x} = \sum_{0 \le i < n} 2^i x_i. \] |
| 276 | |
| 277 | The other natural mapping is |
| 278 | \[ \overleftarrow{x} = \sum_{0 \le i < n} 2^{n-i-1} x_i. \] |
| 279 | It doesn't matter which you choose, as long as you're consistent. |
| 280 | |
| 281 | For simplicity's sake, we shall tend to switch between strings and the |
| 282 | numbers (and occasionally more exotic objects) they represent implicitly. |
| 283 | \end{slide} |
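For concreteness, here is a small Python sketch of the two readings of a bit
string as a number; the names \texttt{forward} and \texttt{backward} are
just illustrative labels for $\overrightarrow{x}$ and $\overleftarrow{x}$.
\begin{verbatim}
def forward(x):
    """Read the bit list x (x[0] first) as sum_i 2^i x_i."""
    return sum(b << i for i, b in enumerate(x))

def backward(x):
    """Read the bit list x as sum_i 2^(n-i-1) x_i."""
    n = len(x)
    return sum(b << (n - i - 1) for i, b in enumerate(x))

bits = [1, 0, 1, 1]             # the string 1011
print(forward(bits))            # 1 + 4 + 8 = 13
print(backward(bits))           # 8 + 2 + 1 = 11
\end{verbatim}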
| 284 | |
| 285 | \begin{slide} |
| 286 | \topic{parsing} |
| 287 | \head{Notation, \seq: parsing} |
| 288 | |
| 289 | We'll find it useful to be able to break up strings in the algorithms we |
| 290 | present. We use the statement |
| 291 | \begin{program} |
| 292 | \PARSE $x$ \AS $n_0\colon x_0, n_1\colon x_1, \ldots, n_k\colon x_k$; |
| 293 | \end{program} |
| 294 | to mean that the string $x$ is broken up into individual strings $x_0$, |
| 295 | $x_1$, \ldots, $x_k$, such that |
| 296 | \begin{itemize} |
| 297 | \item $x = x_0 \cat x_1 \cat \cdots \cat x_k$; and |
| 298 | \item $|x_i| = n_i$ for all $0 \le i \le k$. |
| 299 | \end{itemize} |
| 300 | We may omit one of the $n_i$ length indicators, since it can be deduced |
| 301 | from the length of the string $x$ and the other $n_j$. |
| 302 | \end{slide} |
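A Python sketch of this parsing statement might look as follows; it is
purely illustrative, and the error handling is a guess at sensible behaviour
rather than part of the definition.
\begin{verbatim}
def parse(x, lengths):
    """Split x into pieces of the given lengths; at most one entry of
    `lengths' may be None, and is then deduced from len(x)."""
    if lengths.count(None) > 1:
        raise ValueError("at most one length may be omitted")
    if None in lengths:
        rest = len(x) - sum(n for n in lengths if n is not None)
        lengths = [rest if n is None else n for n in lengths]
    if any(n < 0 for n in lengths) or sum(lengths) != len(x):
        raise ValueError("lengths do not match len(x)")
    pieces, i = [], 0
    for n in lengths:
        pieces.append(x[i:i + n])
        i += n
    return pieces

print(parse('0123456789', [2, None, 3]))   # ['01', '23456', '789']
\end{verbatim}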
| 303 | |
| 304 | \begin{slide} |
| 305 | \topic{vectors} |
| 306 | \head{Notation, \seq: vectors} |
| 307 | |
| 308 | A \emph{vector} $\vect{x}$ is a finite ordered collection of elements from |
| 309 | some set $X$. If $\vect{x}$ contains $n$ elements then we write $\vect{x} |
| 310 | \in X^n$, and that $|\vect{x}| = n$. We write the individual elements as |
| 311 | $\vect{x}[0], \vect{x}[1], \ldots, \vect{x}[n - 1]$. |
| 312 | |
| 313 | We shall abuse set membership notation for vectors; i.e., we write $x \in |
| 314 | \vect{x}$ if there is an $i$ ($0 \le i < |\vect{x}|$) such that |
| 315 | $\vect{x}[i] = x$. |
| 316 | |
| 317 | When we apply functions to vectors, we mean that they are applied |
| 318 | pointwise, as for sets. Thus, if we write that $\vect{y} = |
| 319 | f(\vect{x})$ then $|\vect{y}| = |\vect{x}|$ and $\vect{y}[i] = |
| 320 | f(\vect{x}[i])$ for all $0 \le i < |\vect{x}|$. |
| 321 | \end{slide} |
| 322 | |
| 323 | \begin{slide} |
| 324 | \topic{distributions and randomness} |
| 325 | \head{Notation, \seq: distributions and randomness} |
| 326 | |
| 327 | A \emph{probability distribution} over a (countable) set $X$ is a |
| 328 | function $\mathcal{D}\colon X \to [0, 1]$ such that |
| 329 | \[ \sum_{x \in X} \mathcal{D}(x) = 1. \] |
| 330 | |
| 331 | The \emph{support} of $\mathcal{D}$, written $\supp \mathcal{D}$, is the |
| 332 | set $\{ x \in X \mid \mathcal{D}(x) \ne 0 \}$; i.e., those elements of $X$ |
| 333 | which occur with nonzero probability. |
| 334 | |
| 335 | We write $x \getsr \mathcal{D}$ in algorithms to indicate that $x$ is to be |
| 336 | chosen independently at random, according to the distribution |
| 337 | $\mathcal{D}$. The notation $x \inr \mathcal{D}$ indicates that $x$ has |
  been chosen independently at random according to $\mathcal{D}$.
| 339 | |
| 340 | The \emph{uniform distribution} over a (finite) set $X$ is |
| 341 | $\mathcal{U}_X\colon X \to [0, 1]$ defined by $\mathcal{U}_X(x) = 1/|X|$ |
| 342 | for all $x \in X$. We shall write $x \getsr X$ and $x \inr X$ as |
| 343 | convenient shorthands, meaning $x \getsr \mathcal{U}_X$ and $x \inr |
| 344 | \mathcal{U}_X$ respectively. |
| 345 | \end{slide} |
| 346 | |
| 347 | \xcalways\subsection{Background}\x |
| 348 | |
| 349 | \begin{slide} |
| 350 | \topic{distinguishability} |
| 351 | \resetseq |
| 352 | \head{Distinguishability, \seq} |
| 353 | |
| 354 | Suppose that $\mathcal{X}$ and $\mathcal{Y}$ are two probability |
| 355 | distributions. |
| 356 | |
| 357 | Let $X$ be a random variable distributed according to $\mathcal{X}$, and |
| 358 | let $Y$ be a random variable distributed according to $\mathcal{Y}$. We |
| 359 | say that $\mathcal{X}$ and $\mathcal{Y}$ are \emph{identically |
| 360 | distributed}, and write that $\mathcal{X} \equiv \mathcal{Y}$, if, for |
| 361 | all possible values $x$ of $X$, we have |
| 362 | \[ \Pr[X = x] = \Pr[Y = x]. \] |
| 363 | |
| 364 | Equivalently, we require that, for all $x \in \supp \mathcal{X}$ we have |
  \[ x \in \supp \mathcal{Y} \land \mathcal{X}(x) = \mathcal{Y}(x). \]
| 366 | \end{slide} |
| 367 | |
| 368 | \begin{slide} |
| 369 | \head{Distinguishability, \seq: statistical closeness} |
| 370 | |
| 371 | Now we generalize the setting slightly. Consider two \emph{families} of |
| 372 | distributions, $\{\mathcal{X}_k\}_{k\in\N}$ and |
| 373 | $\{\mathcal{Y}_k\}_{k\in\N}$, parameterized by a security parameter $k$, |
| 374 | where $\dom\mathcal{X}_k = \dom\mathcal{Y}_k$. To make the asymptotics |
| 375 | work, we require that $|x| \le p(k)$ for some polynomial $p(\cdot)$, for |
| 376 | all $x \in \dom\mathcal{X}_k$. |
| 377 | |
| 378 | Fix a value of $k$. Again, let $X$ be distributed according to |
| 379 | $\mathcal{X}_k$, and let $Y$ be distributed according to $\mathcal{Y}_k$. |
| 380 | We say that $\{\mathcal{X}_k\}_{k \in \N}$ and $\{\mathcal{Y}_k\}_{k\in\N}$ |
| 381 | are \emph{statistically close}, and write that $\{\mathcal{X}_k\}_{k\in\N} |
| 382 | \statclose \{\mathcal{Y}_k\}_{k\in\N}$, if there is a negligible function |
| 383 | $\nu(\cdot)$ such that, for any $k \in \N$, |
| 384 | \[ \sum_{x\in\dom{\mathcal{X}_k}} |
| 385 | |{\Pr[X = x]} - \Pr[Y = x]| \le \nu(k). \]% |
| 386 | (Reminder: Saying that $\nu\colon \N \to \R$ is \emph{negligible} means |
  that, for any $c \in \Z$ there is an $n \in \N$ such that $\nu(k) <
  k^{-c}$ for all $k \ge n$.)
| 389 | \end{slide} |
| 390 | |
| 391 | \begin{slide} |
| 392 | \head{Distinguishability, \seq: computational indistinguishability} |
| 393 | |
| 394 | We say that two families of distributions are computationally |
| 395 | indistinguishable if no `efficient' algorithm can tell them apart with |
| 396 | better than `negligible' probability. |
| 397 | |
| 398 | So, we say that $\{\mathcal{X}_k\}_{k\in\N}$ and |
| 399 | $\{\mathcal{Y}_k\}_{k\in\N}$ are \emph{computationally indistinguishable} |
| 400 | and write that $\{\mathcal{X}_k\}_{k\in\N} \compind |
| 401 | \{\mathcal{Y}_k\}_{k\in\N}$, if, for any probabilistic polynomial-time |
| 402 | algorithm $A$, there is a negligible function $\nu(\cdot)$ such that, for |
| 403 | any $k$: |
| 404 | \[ \Pr[x \getsr \mathcal{X}_k; b \gets A(x) : b = 1] - |
| 405 | \Pr[y \getsr \mathcal{Y}_k; b \gets A(y) : b = 1] \le \nu(k). \]% |
| 406 | Statistical closeness implies computational indistinguishability. |
| 407 | \end{slide} |
| 408 | |
| 409 | \begin{proof} |
| 410 | Let two statistically close distributions $\{\mathcal{X}_k\}_{k\in\N} |
  \statclose \{\mathcal{Y}_k\}_{k\in\N}$ be given.  Fix some $k$, and let $Z
  = \dom\mathcal{X}_k = \dom\mathcal{Y}_k$.  Now note that the adversary's
  advantage is bounded in absolute value by $\sum_{z\in Z} \Pr[b \gets A(z) : b = 1]
| 414 | |\mathcal{X}_k(z) - \mathcal{Y}_k(z)| \le \sum_{z\in Z} |\mathcal{X}_k(z) - |
| 415 | \mathcal{Y}_k(z)| \le \nu(k)$. Hence the two distributions are |
| 416 | computationally indistinguishable. |
| 417 | \end{proof} |
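For small, explicitly given distributions, the sum appearing in the
definition of statistical closeness is easy to compute directly; here is a
minimal Python sketch (note that this is the plain sum, which is twice the
quantity usually called the statistical distance).
\begin{verbatim}
def closeness_sum(X, Y):
    """Sum of |X(x) - Y(x)| over the common domain, with the
    distributions given as dictionaries mapping outcomes to
    probabilities."""
    support = set(X) | set(Y)
    return sum(abs(X.get(x, 0) - Y.get(x, 0)) for x in support)

X = {0: 0.5, 1: 0.5}
Y = {0: 0.6, 1: 0.4}
print(closeness_sum(X, Y))      # 0.2
\end{verbatim}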
| 418 | |
| 419 | \begin{slide} |
| 420 | \topic{collisions} |
| 421 | \head{Collisions -- the Birthday `paradox'} |
| 422 | |
| 423 | Suppose we throw $q$ balls into $n$ bins at random (with $q \le n$). Let |
| 424 | $C_{q, n}$ be the event that, at the end of this, we have a bin containing |
| 425 | more than one ball -- a \emph{collision}. |
| 426 | |
| 427 | Let $B_{q, n}$ be the event that the $q$-th ball collides with a |
| 428 | previous one. Obviously, the worst case for this is when none of the other |
| 429 | balls have collided, so |
| 430 | \[ \Pr[B_{q, n}] \le \frac{q - 1}{n}. \] |
| 431 | Then |
| 432 | \begin{eqnarray*}[rl] |
| 433 | \Pr[C_{q, n}] |
| 434 | &\le \Pr[B_{2, n}] + \Pr[B_{3, n}] + \cdots + \Pr[B_{q, n}] \\ |
| 435 | &\le \frac{1}{n} + \frac{2}{n} + \cdots + \frac{q - 1}{n} \\ |
| 436 | &= \frac{q(q - 1)}{2 n}. |
| 437 | \end{eqnarray*} |
| 438 | This is an extremely useful result, and we shall need it often. |
| 439 | \end{slide} |
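It can be reassuring to check the bound numerically.  The following Python
sketch estimates $\Pr[C_{q, n}]$ by simulation and compares it with
$q(q - 1)/2n$; the familiar `birthday' parameters $q = 23$, $n = 365$ are
chosen purely for illustration.
\begin{verbatim}
import random

def collision_probability(q, n, trials=10000):
    """Estimate Pr[C_{q,n}]: throw q balls into n bins at random and
    check whether some bin receives more than one ball."""
    hits = 0
    for _ in range(trials):
        bins = [random.randrange(n) for _ in range(q)]
        if len(set(bins)) < q:
            hits += 1
    return hits / trials

q, n = 23, 365
print(collision_probability(q, n))    # roughly 0.51
print(q * (q - 1) / (2 * n))          # the bound, roughly 0.69
\end{verbatim}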
| 440 | |
| 441 | \xcalways\subsection{Primitives}\x |
| 442 | |
| 443 | \begin{slide} |
| 444 | \topic{one-way functions} |
| 445 | \resetseq |
| 446 | \head{One-way functions, \seq: introduction} |
| 447 | |
| 448 | Intuition: a one-way function is easy to compute but hard to invert. |
| 449 | |
| 450 | Choose a function $f\colon X \to Y$. Let $I$ be a prospective inverter. |
| 451 | Now we play a game: |
| 452 | \begin{enumerate} |
| 453 | \item Choose $x \in X$ at random. Compute $y = f(x)$. |
| 454 | \item Let $x'$ be the output of $I$ when run on input $y$. |
| 455 | \item We say that $I$ `wins' if $f(x') = y$; otherwise it loses. |
| 456 | \end{enumerate} |
| 457 | Note that we don't care whether $x = x'$. |
| 458 | |
| 459 | Examples: SHA-1, or $x \mapsto g^x \bmod p$. |
| 460 | \end{slide} |
| 461 | |
| 462 | \begin{slide} |
| 463 | \head{One-way functions, \seq: formalism} |
| 464 | |
| 465 | The \emph{success probability} of an inverter $I$ against the function $f$ |
| 466 | is |
| 467 | \[ \Succ{owf}{f}(I) = |
| 468 | \Pr[x \getsr X; |
| 469 | y \gets f(x); |
| 470 | x' \gets I(y) : |
| 471 | f(x') = y] \]% |
| 472 | where the probability is taken over all choices of $x$ and all coin flips |
| 473 | made by $I$. |
| 474 | |
| 475 | We measure the \emph{insecurity} of a one-way function (OWF) by maximizing |
| 476 | over possible inverters: |
| 477 | \[ \InSec{owf}(f; t) = \max_I \Succ{owf}{f}(I) \] |
| 478 | where the maximum is taken over all $I$ running in time $t$. |
| 479 | |
| 480 | If $\InSec{owf}(f; t) \le \epsilon$ then we say that $f$ is a \emph{$(t, |
| 481 | \epsilon)$-secure one-way function}. |
| 482 | \end{slide} |
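As an illustration of the experiment (and emphatically not of a secure
one-way function), here is a Python sketch which estimates an inverter's
success probability against a toy function; the modulus, the squaring map
and the guessing inverter are all made up for the example.
\begin{verbatim}
import random

def owf_success(f, invert, domain, trials=1000):
    """Estimate Succ^owf_f(I): choose x at random, hand y = f(x) to
    the inverter, and count how often it returns *some* preimage."""
    wins = 0
    for _ in range(trials):
        x = random.choice(domain)
        y = f(x)
        if f(invert(y)) == y:
            wins += 1
    return wins / trials

p = 1009                                   # a toy instance
square = lambda x: x * x % p
guess = lambda y: random.randrange(p)      # a hopeless inverter
print(owf_success(square, guess, range(p)))   # around 2/p
\end{verbatim}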
| 483 | |
| 484 | \begin{slide} |
| 485 | \head{One-way functions, \seq: trapdoors} |
| 486 | |
| 487 | Intuition: a \emph{trapdoor} is secret information which makes inverting a |
| 488 | one-way function easy. This is most useful when the one-way function is a |
| 489 | permutation. A trapdoor one-way function generator $\mathcal{T} = (G, f, |
| 490 | T)$ is a triple of algorithms: |
| 491 | |
| 492 | \begin{itemize} |
| 493 | \item The probabilistic algorithm $G$ is given some parameter $k$ and |
| 494 | returns a pair $(P, K)$, containing the public parameters $P$ for |
| 495 | computing an instance of the one-way function, and the secret trapdoor |
| 496 | information $K$ for inverting it. We write $(P, K) \in G(k)$. |
| 497 | |
| 498 | \item The algorithm $f$ implements the one-way function. That is, if $(P, |
| 499 | K) \in G(k)$ then $f(P, \cdot)$ (usually written $f_P(\cdot)$) is a |
| 500 | one-way function. |
| 501 | |
| 502 | \item The algorithm $T$ inverts $f$ using the trapdoor information. That |
| 503 | is, if $(P, K) \in G(k)$ and $y = f_P(x)$ for some $x$, then $y = |
| 504 | f_P(T(K, y))$. We usually write $T_K(\cdot)$ instead of $T(K, \cdot)$. |
| 505 | \end{itemize} |
| 506 | \end{slide} |
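For concreteness, here is a Python sketch of a generator of this shape,
using textbook RSA with tiny hard-coded parameters so that it runs
instantly; a real $G$ would choose large random primes, and nothing here is
offered as a serious instantiation.
\begin{verbatim}
def G(k=None):
    p, q, e = 1009, 2003, 65537     # a real G picks random k-bit primes
    n = p * q
    d = pow(e, -1, (p - 1) * (q - 1))    # trapdoor (Python 3.8+ inverse)
    return (n, e), (n, d)                # public parameters P, trapdoor K

def f(P, x):                             # f_P(x) = x^e mod n
    n, e = P
    return pow(x, e, n)

def T(K, y):                             # T_K(y) = y^d mod n
    n, d = K
    return pow(y, d, n)

P, K = G()
y = f(P, 123456)
assert f(P, T(K, y)) == y                # T_K inverts f_P
\end{verbatim}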
| 507 | |
| 508 | \begin{slide} |
| 509 | \topic{pseudorandom generators (PRGs)} |
| 510 | \head{Pseudorandom generators (PRGs)} |
| 511 | |
| 512 | A pseudorandom generator (PRG) `stretches' an input seed into a longer |
| 513 | string which `looks' random. |
| 514 | |
| 515 | Let $G\colon \{0, 1\}^k \to \{0, 1\}^L$ be a function from $k$-bit strings |
| 516 | to $L$-bit strings. The \emph{advantage} of a distinguisher $D$ against |
| 517 | $G$ is: |
| 518 | \begin{eqnarray*}[rl] |
| 519 | \Adv{prg}{G}(D) = & |
| 520 | \Pr[x \getsr \{0, 1\}^k; y \gets G(x) : D(y) = 1] - {}\\ |
| 521 | & \Pr[y \getsr \{0, 1\}^L : D(y) = 1]. |
| 522 | \end{eqnarray*} |
| 523 | The \emph{insecurity} is simply the maximum advantage: |
| 524 | \[ \InSec{prg}(G; t) = \max_D \Adv{prg}{G}(D) \] |
| 525 | where the maximum is taken over all distinguishers $D$ running in time |
| 526 | $t$. If $\InSec{prg}(G; t) \le \epsilon$ then we also say that $G$ is a |
| 527 | $(t, \epsilon)$-secure PRG\@. |
| 528 | \end{slide} |
| 529 | |
| 530 | \begin{exercise} |
| 531 | We say that a PRG $g\colon \{0, 1\}^k \to \{0, 1\}^L$ is \emph{trivial} if |
| 532 | $k \ge L$. |
| 533 | \begin{enumerate} |
| 534 | \item Show that trivial PRGs exist. |
| 535 | \item Show that if $g$ is nontrivial, then $g$ is also a one-way function, |
| 536 | with |
| 537 | \[ \InSec{owf}(g; t) \le \InSec{prg}(g; t) + 2^{k-L}. \] |
| 538 | \end{enumerate} |
| 539 | \answer% |
| 540 | \begin{parenum} |
| 541 | \item The identity function $I(x) = x$ is a trivial PRG, with |
    $\InSec{prg}(I; t) = 0$, as is easily seen from the definition.
| 543 | \item Suppose $A$ inverts $g$. Then consider adversary $B(y)$: \{ $x \gets |
| 544 | A(y)$; \IF $g(x) = y$ \THEN \RETURN $1$; \ELSE \RETURN $0$;~\}. If $y$ |
    is the output of $g$, then $B$ returns $1$ with probability
    $\Succ{owf}{g}(A)$; if $y$ is random in $\{0, 1\}^L$ then there is a
    probability at least $1 - 2^{k-L}$ that $y$ has \emph{no} inverse, so $B$
    returns $1$ with probability at most $2^{k-L}$.  Hence $\Adv{prg}{g}(B)
    \ge \Succ{owf}{g}(A) - 2^{k-L}$, proving the result.  Note that
    \cite{Wagner:2000:PSU} provides a better security bound than this
    simplistic analysis.
| 550 | \end{parenum} |
| 551 | \end{exercise} |
| 552 | |
| 553 | \begin{exercise} |
| 554 | \label{ex:dbl-prg} |
| 555 | Suppose that we have a \emph{length-doubling} PRG $g\colon \{0, 1\}^k \to |
  \{0, 1\}^{2k}$.  Let $g_0(x)$ be the first $k$ bits of $g(x)$ and
  $g_1(x)$ be the second $k$ bits.  Define the sequence of generators
  $g^{(i)}$ (for $i \ge 1$) by
| 559 | \[ g^{(1)}(x) = g(x); \qquad |
| 560 | g^{(i+1)}(x) = g_0(x) \cat g^{(i)}(g_1(x)). \]% |
| 561 | Relate the security of $g^{(i)}$ to that of $g$. |
| 562 | \answer% |
| 563 | The description of the function $g^{(i)}$ is deliberately terse and |
| 564 | unhelpful. It probably helps understanding if you make a diagram. |
| 565 | |
| 566 | Let $A$ be an adversary running in time $t$ and attacking $g^{(i+1)}$. |
| 567 | Firstly, we attack $g$: consider adversary $B(y)$: \{ \PARSE $y$ \AS $y_0, |
  k\colon y_1$; $z \gets g^{(i)}(y_1)$; $b \gets A(y_0 \cat z)$; \RETURN $b$;~\}.
  Then $\Adv{prg}{g}(B) \ge \Adv{prg}{g^{(i+1)}}(A) - \delta$, so
| 570 | $\InSec{prg}(g^{(i+1)}; t) \le \InSec{prg}(g; t) + \delta$, where |
| 571 | \begin{eqnarray*}[rl] |
    \delta = &\Pr[x_0 \getsr \{0, 1\}^k; x_1 \getsr \{0, 1\}^k;
              y \gets g^{(i)}(x_1) : A(x_0 \cat y) = 1] - \\
| 574 | &\Pr[y \getsr \{0, 1\}^{(i+2)k} : A(y) = 1]. |
| 575 | \end{eqnarray*} |
| 576 | We attack $g^{(i)}$ to bound $\delta$: consider adversary $C(y)$: \{ $x_0 |
| 577 | \getsr \{0, 1\}^k$; $b \gets A(x_0 \cat y)$; \RETURN $b$;~\}. Now $\delta |
| 578 | \le \Adv{prg}{g^{(i)}}(C) \le \InSec{prg}(g^{(i)}; t)$. So by induction, |
| 579 | \[ \InSec{prg}(g^{(i)}; t) \le i \cdot \InSec{prg}(g; t). \] |
| 580 | \end{exercise} |
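It may help to see the construction from the exercise above running.  The
following Python sketch builds $g^{(i)}$ exactly as defined, on top of a
stand-in length-doubling $g$ made from SHA-256; the particular $g$ is purely
illustrative, since the exercise only assumes that \emph{some}
length-doubling PRG exists.
\begin{verbatim}
import hashlib

K = 16   # seed length (the role of k), counted in bytes here

def g(x):
    """Stand-in length-doubling `PRG' built from SHA-256, for
    illustration only."""
    assert len(x) == K
    return (hashlib.sha256(b'0' + x).digest()[:K] +
            hashlib.sha256(b'1' + x).digest()[:K])

def g_iter(i, x):
    """g^(1)(x) = g(x); g^(i+1)(x) = g_0(x) || g^(i)(g_1(x))."""
    if i == 1:
        return g(x)
    y = g(x)
    return y[:K] + g_iter(i - 1, y[K:])

out = g_iter(4, b'\x00' * K)
print(len(out))                  # (4 + 1) * K bytes from a K-byte seed
\end{verbatim}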
| 581 | |
| 582 | \begin{slide} |
| 583 | \topic{advantage} |
| 584 | \head{Notes about advantage} |
| 585 | |
| 586 | Advantage is a concept used in many definitions of security notions: |
| 587 | \begin{eqnarray*}[rl] |
| 588 | \Adv{}{}(A) = & |
| 589 | \Pr[\text{$A$ returns 1 in setting $a$}] - {} \\ |
| 590 | & \Pr[\text{$A$ returns 1 in setting $b$}]. |
| 591 | \end{eqnarray*} |
| 592 | \begin{enumerate} |
| 593 | \item We have $-1 \le \Adv{}{}(A) \le +1$. |
| 594 | \item Zero means that the adversary couldn't distinguish. |
| 595 | \item Negative advantage means the adversary got them the wrong way |
| 596 | around. There is another adversary which uses the same resources but has |
| 597 | positive advantage. |
| 598 | \item \label{item:adv-guess} If $A$ is attempting to guess some hidden bit |
| 599 | $b^* \inr \{0, 1\}$, we have |
| 600 | \[ \Pr[b \gets A : b = b^*] = \frac{\Adv{}{}(A)}{2} + \frac{1}{2}. \] |
| 601 | \end{enumerate} |
| 602 | \end{slide} |
| 603 | |
| 604 | \begin{proof} |
| 605 | Let $b$ be the bit that $A$ returns, and let $b^*$ be the experiment's |
| 606 | hidden bit. Then |
| 607 | \[ \Adv{}{}(A) = \Pr[b = 1 \mid b^* = 1] - \Pr[b = 1 \mid b^* = 0]. \] |
| 608 | Addressing the above claims in order: |
| 609 | \begin{enumerate} |
  \item By definition of probability, $0 \le \Pr[b = 1 \mid b^* = 1] \le 1$
    and $0 \le \Pr[b = 1 \mid b^* = 0] \le 1$, so their absolute difference
    can be at most 1.
| 613 | \item This is a corollary of \ref{item:adv-guess}. |
| 614 | \item Consider the adversary $\bar{A}$ which runs $A$ and returns the |
| 615 | complement bit $\bar{b} = b \xor 1$. Then |
| 616 | \begin{eqnarray*}[rl] |
| 617 | \Adv{}{}(\bar{A}) |
| 618 | &= \Pr[\bar{b} = 1 \mid b^* = 1] - \Pr[\bar{b} = 1 \mid b^* = 0] \\ |
| 619 | &= \Pr[b = 0 \mid b^* = 1] - \Pr[b = 0 \mid b^* = 0] \\ |
| 620 | &= (1 - \Pr[b = 1 \mid b^* = 1]) - (1 - \Pr[b = 1 \mid b^* = 0]) \\ |
| 621 | &= \Pr[b = 1 \mid b^* = 0] - \Pr[b = 1 \mid b^* = 1] \\ |
| 622 | &= -\Adv{}{}(A). |
| 623 | \end{eqnarray*} |
| 624 | \item Note that $\Pr[b^* = 1] = \Pr[b^* = 0] = \frac{1}{2}$. Then |
| 625 | \begin{eqnarray*}[rl] |
| 626 | \Pr[b = b^*] |
| 627 | &= \Pr[b = 1 \land b^* = 1] + \Pr[b = 0 \land b^* = 0] \\ |
      &= \frac{1}{2}(\Pr[b = 1 \mid b^* = 1] + \Pr[b = 0 \mid b^* = 0]) \\
      &= \frac{1}{2}(\Pr[b = 1 \mid b^* = 1] +
                     (1 - \Pr[b = 1 \mid b^* = 0])) \\
      &= \frac{1}{2}(1 + \Pr[b = 1 \mid b^* = 1] -
                     \Pr[b = 1 \mid b^* = 0]) \\
| 633 | &= \frac{\Adv{}{}(A)}{2} + \frac{1}{2}. |
| 634 | \end{eqnarray*} |
| 635 | \end{enumerate} |
| 636 | All present and correct. |
| 637 | \end{proof} |
| 638 | |
| 639 | \begin{slide} |
| 640 | \topic{pseudorandom functions (PRFs)} |
| 641 | \head{Pseudorandom functions (PRFs)} |
| 642 | |
| 643 | A \emph{pseudorandom function family} (PRF) is a collection of functions |
| 644 | $F_K\colon \{0, 1\}^l \to \{0, 1\}^L$, where $K$ is some index, typically |
| 645 | from a set of fixed-size strings $\{0, 1\}^k$. We shall often consider a |
| 646 | PRF to be a single function $F\colon \{0, 1\}^k \times \{0, 1\}^l \to \{0, |
| 647 | 1\}^L$. |
| 648 | |
| 649 | We want to say that $F$ is a strong PRF if adversaries find it hard to |
| 650 | distinguish an instance $F_K$ from a function chosen completely at random |
| 651 | with the same `shape'. |
| 652 | |
| 653 | We provide the adversary with an \emph{oracle}, either for a randomly |
  selected $F_K$, or for a completely random function, and ask it to try to
| 655 | say which it is given. |
| 656 | |
| 657 | We write $\Func{l}{L}$ as the set of \emph{all} functions from $\{0, 1\}^l$ |
| 658 | to $\{0, 1\}^L$. |
| 659 | \end{slide} |
| 660 | |
| 661 | \begin{slide} |
| 662 | \head{Pseudorandom functions (cont.)} |
| 663 | |
| 664 | We define the advantage of a distinguisher $D$ against the PRF $F$ as |
| 665 | follows: |
| 666 | \begin{eqnarray*}[rl] |
| 667 | \Adv{prf}{F}(D) = & |
| 668 | \Pr[K \getsr \{0, 1\}^k : D^{F_K(\cdot)} = 1] - {}\\ |
| 669 | & \Pr[R \getsr \Func{l}{L} : D^{R(\cdot)} = 1]. |
| 670 | \end{eqnarray*} |
| 671 | The insecurity of the PRF is then measured as |
| 672 | \[ \InSec{prf}(F; t, q) = \max_D \Adv{prf}{F}(D) \] |
| 673 | where the maximum is taken over all distinguishers $D$ which run for time |
| 674 | $t$ and make $q$ oracle queries. As is usual, if $\InSec{prf}(F; t, q) |
| 675 | \le \epsilon$ then we say that $F$ is a $(t, q, \epsilon)$-secure PRF. |
| 676 | \end{slide} |
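The following Python sketch runs the distinguishing experiment, with the
`random function' oracle implemented by lazy sampling; HMAC-SHA256 stands in
for the PRF instance, and the coin-flipping distinguisher is deliberately
useless, both purely for illustration.
\begin{verbatim}
import os, hmac, hashlib, random

l, L = 16, 32                    # input and output lengths, in bytes

def prf_game(D, real):
    """Give D either a real F_K oracle or a lazily sampled random
    function, and return D's output bit."""
    if real:
        key = os.urandom(l)
        oracle = lambda x: hmac.new(key, x, hashlib.sha256).digest()
    else:
        table = {}
        def oracle(x):           # random function, sampled on demand
            if x not in table:
                table[x] = os.urandom(L)
            return table[x]
    return D(oracle)

def D(oracle):
    """A useless distinguisher: query once, then flip a coin."""
    oracle(b'\x00' * l)
    return random.randrange(2)

trials = 2000
adv = (sum(prf_game(D, True) for _ in range(trials)) -
       sum(prf_game(D, False) for _ in range(trials))) / trials
print(adv)                       # close to zero
\end{verbatim}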
| 677 | |
| 678 | \begin{slide} |
| 679 | \topic{pseudorandom permutations (PRPs)} |
| 680 | \head{Pseudorandom permutations (PRPs)} |
| 681 | |
| 682 | We define a \emph{pseudorandom permutation family} (PRP) in a similar way |
  to the PRFs we've already seen.  A PRP is a family of permutations
  $F_K\colon \{0, 1\}^L \to \{0, 1\}^L$, indexed by elements of some finite
  set, e.g., $\{0, 1\}^k$.  We shall often consider a PRP to be a single
  function $F\colon \{0, 1\}^k \times \{0, 1\}^L \to \{0, 1\}^L$.
| 687 | |
| 688 | Let $\Perm{L}$ be the set of \emph{all} permutations over the set of |
| 689 | $L$-bit strings $\{0, 1\}^L$. |
| 690 | |
| 691 | The advantage of a distinguisher $D$ against the PRP $F$ is |
| 692 | \begin{eqnarray*}[rl] |
| 693 | \Adv{prp}{F}(D) = & |
| 694 | \Pr[K \getsr \{0, 1\}^k : D^{F_K(\cdot)} = 1] - {}\\ |
| 695 | & \Pr[R \getsr \Perm{L} : D^{R(\cdot)} = 1]. |
| 696 | \end{eqnarray*} |
| 697 | |
| 698 | We define $\InSec{prp}(F; t, q) = \max_D \Adv{prp}{F}(D)$ exactly as for |
| 699 | PRFs, and the notion of $(t, q, \epsilon)$-security is the same. |
| 700 | \end{slide} |
| 701 | |
| 702 | \begin{slide} |
| 703 | \head{Super pseudorandom permutations} |
| 704 | |
| 705 | PRPs are bijective. A \emph{super PRP} is a PRP which remains secure when |
| 706 | the distinguisher is allowed to make inverse queries: |
| 707 | \begin{eqnarray*}[rl] |
| 708 | \Adv{sprp}{F}(D) = & |
| 709 | \Pr[K \getsr \{0, 1\}^k : D^{F_K(\cdot), F_K^{-1}(\cdot)} = 1] - {} \\ |
| 710 | & \Pr[R \getsr \Perm{L} : D^{R(\cdot), R^{-1}(\cdot)} = 1]. |
| 711 | \end{eqnarray*} |
| 712 | Since there are two oracles, we count queries to both when evaluating the |
| 713 | insecurity: |
| 714 | \[ \InSec{sprp}(F; t, q, q') = \max_D \Adv{sprp}{F}(D) \] |
| 715 | where the maximum is taken over all distinguishers $D$ which run for time |
| 716 | $t$, make $q$ queries to the standard oracle, and $q'$ queries to the |
| 717 | inverse oracle. If $\InSec{sprp}(F; t, q, q') \le \epsilon$ then we say |
| 718 | $F$ is a $(t, q, q', \epsilon)$-secure super PRP\@. |
| 719 | \end{slide} |
| 720 | |
| 721 | \begin{exercise} |
| 722 | Note that the key length hasn't appeared anywhere in the definition of |
| 723 | insecurity for a PRP. Derive lower bounds for the insecurity of a PRP with |
| 724 | a $k$-bit key. |
| 725 | \answer% |
| 726 | Let $E\colon \{0, 1\}^k \times \{0, 1\}^L \to \{0, 1\}^L$ be a PRP. Fix |
| 727 | $n$ and $c$. Then consider adversary $S^{E(\cdot)}$: \{ \FOR $i = 0$ \TO |
| 728 | $c - 1$ \DO $y[i] \gets E(i)$; \FOR $K = 0$ \TO $n - 1$ \DO \{ $i \gets 0$; |
| 729 | $\id{good} \gets 1$; \WHILE $i < c \land \id{good} = 1$ \DO \{ \IF $E_K(i) |
  \ne y[i]$ \THEN $\id{good} \gets 0$; $i \gets i + 1$;~\} \IF $\id{good} = 1$
  \THEN \RETURN $1$;~\} \RETURN $0$;~\}.  Then $\Adv{prp}{E}(S) \ge n(2^{-k} -
  2^{-Lc})$.
| 732 | \end{exercise} |
| 733 | |
| 734 | \begin{slide} |
| 735 | \resetseq |
| 736 | \head{PRPs are PRFs, \seq} |
| 737 | |
| 738 | We model block ciphers as families of PRPs (not super PRPs). Most of the |
| 739 | analysis works best on PRFs, though. We show that a PRP makes a `pretty |
| 740 | good' PRF, as long as it's not over-used. |
| 741 | |
| 742 | Let $F$ be any PRP family. Then |
| 743 | \[ \InSec{prf}(F; t, q) \le |
| 744 | \InSec{prp}(F; t, q) + \frac{q(q - 1)}{2^{L+1}}. \]% |
  This is a useful result.  As long as $q^2$ is small compared to $2^L$,
  where $L$ is the block size, a PRP makes a good PRF.
| 747 | |
| 748 | The value $2^{L/2}$ is often called the \emph{Birthday bound}. We shall |
| 749 | meet it often when we examine modes of operation. We shall examine the |
| 750 | proof, because it illustrates some useful techniques. |
| 751 | \end{slide} |
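To get a feel for the numbers: for a block cipher with $L = 64$, the extra
term $q(q - 1)/2^{L+1}$ is about $2^{-33}$ after $q = 2^{16}$ queries, but
reaches $\frac{1}{2}$ once $q \approx 2^{32}$; for $L = 128$, the
corresponding figure is $q \approx 2^{64}$.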
| 752 | |
| 753 | \begin{slide} |
| 754 | \head{Shoup's lemma} |
| 755 | |
  This handy lemma states that the difference between the probabilities of
  some outcome in two games is bounded above by the probability of the event
  on which the two games differ.
| 759 | |
| 760 | \begin{lemma}[Shoup \cite{Shoup:2001:OAEPR}] |
| 761 | \label{lem:shoup} |
| 762 | If $X$, $Y$ and $F$ are events, and $\Pr[X \land \lnot F] = \Pr[Y \land |
| 763 | \lnot F]$ then $|{\Pr[X]} - \Pr[Y]| \le \Pr[F]$. |
| 764 | \end{lemma} |
| 765 | \begin{proof} |
| 766 | We have: |
| 767 | \begin{eqnarray*}[rll] |
| 768 | \Pr[X] &= \Pr[X \land F] &+ \Pr[X \land \lnot F] \\ |
| 769 | \Pr[Y] &= \Pr[Y \land F] &+ \Pr[Y \land \lnot F] |
| 770 | \end{eqnarray*} |
| 771 | Subtracting gives |
| 772 | \[ |{\Pr[X]} - \Pr[Y]| = |{\Pr[X \land F]} - |
| 773 | \Pr[Y \land F]| \le \Pr[F] \]% |
| 774 | as required. |
| 775 | \end{proof} |
| 776 | \end{slide} |
| 777 | |
| 778 | \begin{slide} |
| 779 | \head{PRPs are PRFs, \seq: proof} |
| 780 | |
| 781 | Let $F\colon \{0, 1\}^k \times \{0, 1\}^L \to \{0, 1\}^L$ be a pseudorandom |
| 782 | permutation. We aim to show that $F$ is also a pseudorandom function. |
| 783 | |
| 784 | Let $A$ be an adversary which distinguishes~$F$ from a pseudorandom |
| 785 | function in time~$t$, after making $q$ oracle queries. We consider a |
| 786 | sequence of games $\G{i}$ played with the adversary. In each game $\G{i}$, |
| 787 | let $S_i$ be the event that the adversary returns~$1$ at the end of the |
| 788 | game. |
| 789 | |
  Game~$\G0$ is the `random function' game.  We give $A$ an oracle
| 791 | containing a random function $R \inr \Func{L}{L}$. |
| 792 | |
| 793 | Game~$\G1$ is the `PRF' game. We give $A$ an oracle which computes |
| 794 | $F_K(\cdot)$ for some randomly chosen $K \inr \{0, 1\}^k$. |
| 795 | |
| 796 | By definition, then, |
| 797 | \[ \Adv{prf}{F}(A) = \Pr[S_1] - \Pr[S_0]. \] |
| 798 | \end{slide} |
| 799 | |
| 800 | \begin{slide} |
| 801 | \head{PRPs are PRFs, \seq: proof (cont.)} |
| 802 | |
| 803 | Let $x_0, x_1, \ldots, x_{q-1}$ be the oracle queries made by $A$, and let |
| 804 | $y_0, y_1, \ldots, y_{q-1}$ be the corresponding responses. |
| 805 | |
| 806 | Game~$\G2$ works in the same way as $\G0$, except that if there is a |
| 807 | \emph{collision} in the query replies (i.e., $y_i = y_j$ but $x_i \ne x_j$ |
  for some $0 \le i, j < q$) then we stop the game immediately.  Let $F_2$ be
| 809 | the event that this occurs. |
| 810 | |
| 811 | Because $\G2$ behaves exactly the same as $\G0$ unless $F_2$ occurs, we |
| 812 | must have |
| 813 | \[ \Pr[S_2 \land \lnot F_2] = \Pr[S_0 \land \lnot F_2] \] |
| 814 | so we invoke Lemma~\ref{lem:shoup} and discover that |
| 815 | \[ |{\Pr[S_2]} - \Pr[S_0]| \le \Pr[F_2]. \] |
| 816 | Using the earlier result on collisions, it's easy to see that |
| 817 | \[ \Pr[F_2] \le \frac{q(q - 1)}{2^{L+1}}. \] |
| 818 | \end{slide} |
| 819 | |
| 820 | \begin{slide} |
| 821 | \head{PRPs are PRFs, \seq: proof (cont.)} |
| 822 | |
| 823 | Game~$\G3$ works in the same way as $\G2$, except that we use a random |
| 824 | permutation $P \inr \Perm{L}$ instead of a random function $R \inr |
| 825 | \Func{L}{L}$. Firstly, note that $F_2$ can't occur in $\G3$. But, if |
| 826 | $F_2$ doesn't occur in $\G2$ (i.e., there is no collision), then the random |
| 827 | function is indistinguishable from a random permutation. So |
| 828 | \[ \Pr[S_3] = \Pr[S_2]. \] |
| 829 | |
| 830 | By definition, we have |
| 831 | \[ \Adv{prp}{F}(A) = \Pr[S_1] - \Pr[S_3]. \] |
| 832 | We can now tie all of this together. |
| 833 | \end{slide} |
| 834 | |
| 835 | \begin{slide} |
| 836 | \head{PRPs are PRFs, \seq: proof (cont.)} |
| 837 | |
| 838 | A simple calculation shows that |
| 839 | \begin{eqnarray*}[rl] |
| 840 | \Adv{prf}{F}(A) &= \Pr[S_1] - \Pr[S_0] \\ |
| 841 | &\le \Pr[S_1] - \Pr[S_2] + \Pr[F_2] \\ |
| 842 | &= \Pr[S_1] - \Pr[S_3] + \Pr[F_2] \\ |
| 843 | &= \Adv{prp}{F}(A) + \Pr[F_2] \\ |
| 844 | &\le \InSec{prp}(F; t, q) + \frac{q(q - 1)}{2^{L+1}}. |
| 845 | \end{eqnarray*} |
| 846 | In the second line, we used the bound we computed on the absolute |
| 847 | difference between $\Pr[S_2]$ and $\Pr[S_0]$; in the third, we noted that |
| 848 | $\Pr[S_2] = \Pr[S_3]$; in the fourth, we used the definition of advantage |
| 849 | against a PRP; and in the fifth we used the definition of insecurity for a |
| 850 | PRP. |
| 851 | \end{slide} |
| 852 | |
| 853 | \begin{slide} |
| 854 | \head{PRPs are PRFs, \seq: proof (cont.)} |
| 855 | |
| 856 | Finally, we imposed no restrictions on $A$, except that it run in time $t$ |
| 857 | and make $q$ oracle queries. So our bound |
| 858 | \[ \Adv{prf}{F}(A) \le \InSec{prp}(F; t, q) + \frac{q(q - 1)}{2^{L+1}} \]% |
| 859 | is true for \emph{any} such adversary $A$, and in particular, it's true for |
| 860 | the most successful adversary running with those resource bounds. |
| 861 | |
| 862 | Hence, we can maximize, showing that |
| 863 | \[ \InSec{prf}(F; t, q) \le |
| 864 | \InSec{prp}(F; t, q) + \frac{q(q - 1)}{2^{L+1}} \]% |
| 865 | as required. |
| 866 | \end{slide} |
| 867 | |
| 868 | \begin{slide} |
| 869 | \topic{hash functions} |
| 870 | \resetseq |
| 871 | \head{Hash functions, \seq: properties} |
| 872 | |
| 873 | Hash functions like MD5 and SHA-1 are extremely useful primitives. What |
  properties do we expect of them?  This turns out to be an extremely difficult
| 875 | question to answer. |
| 876 | \begin{itemize} |
| 877 | \item One-wayness. We've seen a definition for this already. But it's not |
| 878 | enough. |
| 879 | \item Collision-resistance. This is the property usually claimed as the |
| 880 | requirement for use in digital signature systems. We'll look at this |
| 881 | later. |
| 882 | \item Randomness. What does this mean, when the function is completely |
| 883 | public? A distinguishability criterion is clearly hopeless. |
| 884 | \end{itemize} |
| 885 | \end{slide} |
| 886 | |
| 887 | \begin{slide} |
| 888 | \head{Hash functions, \seq: Merkle-Damg\aa{}rd iterated hashing |
| 889 | \cite{Damgaard:1990:DPH, Merkle:1991:FSE}} |
| 890 | |
| 891 | Let $F\colon \{0, 1\}^{k+L} \to \{0, 1\}^k$ be a \emph{compression} |
| 892 | function. Now consider the function $H\colon \{0, 1\}^* \to \{0, 1\}^k$ |
| 893 | which transforms an input string $x$ as follows: |
| 894 | \begin{enumerate} |
| 895 | \item Pad $x$ to a multiple of $L$ bits in some injective way. Divide the |
| 896 | padded message into $L$-bit blocks $x_0$, $x_1$, \ldots, $x_{n-1}$. |
| 897 | \item Fix some $k$-bit constant $I$. Let $I_0 = I$. Define $I_{i+1} = |
| 898 | F(I_i \cat x_i)$ for $0 \le i < n$. |
| 899 | \item The result $H(x) = I_n$. |
| 900 | \end{enumerate} |
| 901 | |
| 902 | Suppose we have two strings $x \ne y$, such that $H(x) = H(y)$; i.e., a |
| 903 | \emph{collision}. Then \emph{either} we can find a collision for $F$ |
| 904 | \emph{or} a string $z$ for which $F(z) = I$. (This is why initialization |
| 905 | vectors for hash functions have such obviously regular forms.) |
| 906 | \end{slide} |
| 907 | |
| 908 | \begin{proof} |
  Let $x_0, x_1, \ldots, x_{n-1}$ and $x'_0, x'_1, \ldots, x'_{n'-1}$ be
  the $L$-bit blocks of two distinct (padded) messages, and without loss
  of generality suppose that $n \ge n'$.  Let $I_0 = I'_0 = I$, let
  $I_{i+1} = F(I_i \cat x_i)$, and $I'_{i+1} = F(I'_i \cat x'_i)$.  We
  have $I_n = I'_{n'}$.

  We prove the result by induction on $n$.  The case $n = 0$ is trivially
  true, since there is only one zero-block message.  Suppose, then, that
  the result is true whenever the longer message has fewer than $n$
  blocks.  There are three cases to consider.  Firstly, if $n' = 0$ then
  $F(I_{n-1} \cat x_{n-1}) = I_n = I'_0 = I$, and we have found a preimage
  of $I$.  Otherwise, $n' \ge 1$.  Secondly, then, if $I_{n-1} \ne
  I'_{n'-1}$ or $x_{n-1} \ne x'_{n'-1}$, then we have a collision, for
  $F(I_{n-1} \cat x_{n-1}) = I_n = I'_{n'} = F(I'_{n'-1} \cat x'_{n'-1})$.
  Finally, if $I_{n-1} = I'_{n'-1}$ and $x_{n-1} = x'_{n'-1}$, then we
  remove the final block from both messages; the remaining messages are
  still distinct and hash to the common value $I_{n-1} = I'_{n'-1}$, so we
  can apply the inductive hypothesis to these shorter messages to complete
  the proof.
| 925 | \end{proof} |
| 926 | |
| 927 | \begin{slide} |
| 928 | \head{Hash functions, \seq: Merkle-Damg\aa{}rd iterated hashing (cont.)} |
| 929 | |
| 930 | \vfil |
| 931 | \[ \begin{graph} |
| 932 | []!{0; <2cm, 0cm>: <0cm, 0.9cm>::} |
| 933 | *+=(1, 0)+[F]{\mathstrut I_0 = I} :[d] *+[F]{F}="f" |
| 934 | [urrr] *+=(3, 0)+[F]{\mathstrut x_0} :`d"f" "f" :[d] |
| 935 | *+=(1, 0)+[F]{\mathstrut I_1} :[d] *+[F]{F}="f" |
| 936 | [urrr] *+=(3, 0)+[F]{\mathstrut x_1} :`d"f" "f" :@{-->}[dd] |
| 937 | *+=(1, 0)+[F]{\mathstrut I_{n-1}} :[d] *+[F]{F}="f" |
| 938 | [urrr] *+=(3, 0)+[F]{\mathstrut x_{n-1}} :`d"f" "f" :[d] |
| 939 | *+=(1, 0)+[F:thicker]{\mathstrut H(x) = I_n} |
| 940 | \end{graph} \] |
| 941 | \vfil |
| 942 | \end{slide} |
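For concreteness, here is a Python sketch of the iteration, with SHA-256
standing in for the compression function $F$ and a simple injective padding
(append a marker byte, zero-pad, then append the message length); both
choices are for illustration only.
\begin{verbatim}
import hashlib

K, L = 32, 64        # chaining-value and block sizes, in bytes here

def compress(z):
    """Stand-in compression function F: K + L bytes -> K bytes."""
    assert len(z) == K + L
    return hashlib.sha256(z).digest()

def pad(x):
    """Injective padding: marker byte, zeros, then the length of x."""
    n = len(x).to_bytes(8, 'big')
    x = x + b'\x80'
    x += b'\x00' * ((-len(x) - 8) % L)
    return x + n

def md_hash(x, IV=b'\x00' * K):
    I = IV
    x = pad(x)
    for i in range(0, len(x), L):
        I = compress(I + x[i:i + L])
    return I

print(md_hash(b'an example message').hex())
\end{verbatim}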
| 943 | |
| 944 | \begin{slide} |
| 945 | \head{Hash functions, \seq: any-collision resistance} |
| 946 | |
  The statement usually made about a `good' hash function $H$ is that it
| 948 | should be `difficult' to find a collision: i.e., two preimages $x \ne y$ |
| 949 | where $H(x) = H(y)$. How do we formalize this? Here's one attempt: |
| 950 | \begin{eqlines*} |
| 951 | \Succ{acr}{H}(A) = \Pr[(x, y) \gets A : x \ne y \land H(x) = H(y)]; \\ |
| 952 | \InSec{acr}(H; t) = \max_A \Succ{acr}{H}(A). |
| 953 | \end{eqlines*} |
| 954 | But this doesn't work. There clearly \emph{exists} an adversary which |
  already `knows' a collision for $H$ and just outputs the right answer.
| 956 | It succeeds very quickly, and with probability 1. So this definition is |
| 957 | impossible to satisfy. |
| 958 | \end{slide} |
| 959 | |
| 960 | \begin{slide} |
  \head{Hash functions, \seq: targeted collision resistance}
| 962 | |
  The concept of targeted collision resistance is relatively new, but quite
| 964 | promising. It replaces a single hash function with a \emph{family} of hash |
| 965 | functions. They're not really keyed, because the indices aren't kept |
| 966 | secret. |
| 967 | |
| 968 | When making a signature, an index $i$ is chosen at random, and the |
  signature for message $m$ is formed over the pair $(i, H_i(m))$.
| 970 | |
| 971 | TCR-hash functions are the subject of ongoing research. No practical |
| 972 | designs exist at the moment. |
| 973 | \end{slide} |
| 974 | |
| 975 | \begin{slide} |
  \head{Hash functions, \seq: targeted collision resistance (cont.)}
| 977 | |
| 978 | Consider the following experiment: |
| 979 | \begin{program} |
| 980 | $\Expt{tcr}{H}(A)$: \+ \\ |
| 981 | $(x, s) \gets A(\cookie{find})$; \\ |
| 982 | $i \getsr \keys H$; \\ |
| 983 | $y \gets A(\cookie{collide}, i, s)$; \\ |
| 984 | \IF $x \ne y \land H_i(x) = H_i(y)$ |
| 985 | \THEN \RETURN $1$; \\ |
| 986 | \ELSE \RETURN $0$; |
| 987 | \end{program} |
| 988 | The security of a TCR-hash function is measured as: |
| 989 | \[ \InSec{tcr}(H; t) = \max_A \Pr[\Expt{tcr}{H}(A) = 1] \] |
| 990 | where the maximum is taken over all adversaries running in time $t$. We |
| 991 | define $(t, \epsilon)$-security as usual. |
| 992 | \end{slide} |
| 993 | |
| 994 | \begin{slide} |
| 995 | \head{Hash functions, \seq: random oracles \cite{Bellare:1993:ROP}} |
| 996 | |
| 997 | In practice, we expect much more than just collision resistance from hash |
| 998 | functions: we expect a certain amount of `random' behaviour. But this is |
| 999 | hard to quantify. |
| 1000 | |
| 1001 | One approach is to model a hash function as a `random oracle', i.e., an |
| 1002 | oracle containing a function chosen at random, used by the construction |
| 1003 | under consideration, and made available to the adversary. The idea is that |
| 1004 | reductions can `capture' the knowledge of an adversary by examining the |
| 1005 | queries it makes to its random oracle. |
| 1006 | |
| 1007 | Hash functions \emph{aren't} random oracles. But a random oracle proof is |
| 1008 | better than nothing. |
| 1009 | \end{slide} |
| 1010 | |
| 1011 | \xcalways\subsection{Standard assumptions}\x |
| 1012 | |
| 1013 | \begin{slide} |
| 1014 | \head{Standard assumptions} |
| 1015 | |
| 1016 | There are a number of `standard' assumptions that are made about the |
| 1017 | difficulty of various problems: |
| 1018 | \begin{itemize} |
| 1019 | \item IFP, the Integer Factorization Problem; |
| 1020 | \item QRP, the Quadratic Residuosity Problem; |
| 1021 | \item DLP, the Discrete Logarithm Problem; |
| 1022 | \item RSAP, the RSA Problem; and |
  \item CDH, the Computational Diffie-Hellman problem and its variants.
| 1024 | \end{itemize} |
| 1025 | \cite{Menezes:1997:HAC} has excellent material on the above. |
| 1026 | \end{slide} |
| 1027 | |
| 1028 | \begin{slide} |
| 1029 | \topic{integer factorization} |
| 1030 | \resetseq |
| 1031 | \head{The Integer Factorization Problem, \seq} |
| 1032 | |
| 1033 | We often assume that large integers of the form $n = p q$, where $p$ and |
| 1034 | $q$ are primes of roughly the same size, are `hard' to factor. |
| 1035 | Specifically, there is no algorithm which will factor such an $n$ in time |
| 1036 | bounded by a polynomial function of $\log n$. |
| 1037 | |
| 1038 | The difficulty of various other problems, e.g., Quadratic Residuosity, or |
  RSA, depends on the difficulty of factoring; however, it is not yet known
| 1040 | whether the ability to solve QRP or RSAP can be used to factor. |
| 1041 | \end{slide} |
| 1042 | |
| 1043 | \begin{slide} |
| 1044 | \head{The Integer Factorization Problem, \seq: square roots} |
| 1045 | |
| 1046 | The problem of extracting square roots modulo $n = p q$ is provably as hard |
| 1047 | as factoring. This is the basis of Rabin's public key encryption and |
| 1048 | digital signature schemes. We shall analyse these later. |
| 1049 | |
| 1050 | Suppose $Q(n, y)$ is an algorithm which returns an $x$ such that $x^2 |
| 1051 | \equiv y \pmod{n}$, provided such an $x$ exists. Then we can find a |
| 1052 | nontrivial factor of $n$ as follows: |
| 1053 | \begin{program} |
| 1054 | Algorithm $\id{find-factor}(n)$: \+ \\ |
| 1055 | \REPEAT \\ \quad\=\+\kill |
| 1056 | $x \getsr \{1, 2, \ldots, n - 1\}$; \\ |
| 1057 | $y \gets x^2 \bmod n$; \\ |
| 1058 | $x' \gets Q(n, y)$; \\ |
| 1059 | $p \gets \gcd(x + x', n)$; \\ |
| 1060 | \IF $p \notin \{1, n\}$ \THEN \RETURN $p$; \- \\ |
| 1061 | \FOREVER; |
| 1062 | \end{program} |
| 1063 | \end{slide} |
| 1064 | |
| 1065 | \begin{proof} |
  The program attempts to find two square roots of $y$ mod $n$.  It's easy to
  see that this might lead to factors of $n$.  If $x^2 \equiv x'^2 \pmod{n}$
  then $x^2 - x'^2 = k n$ for some integer $k$, so $(x + x')(x - x')$ is a
  factorization of $k n$.  It remains to show when $\gcd(x + x', n)$ is a
  nontrivial factor of $n$, and that this happens often enough.
| 1071 | |
| 1072 | Let $n$ be an odd composite. Then, if $x \not\equiv \pm y \pmod{n}$ but |
| 1073 | $x^2 \equiv y^2 \pmod{n}$, then $\gcd(x + y, n)$ is a nontrivial factor of |
| 1074 | $n$. |
| 1075 | |
| 1076 | Firstly, we claim that, if $p$ is an odd prime then the congruence $x^2 |
| 1077 | \equiv y \pmod{p}$ has precisely two solutions $x$, $x'$ such that $x |
| 1078 | \equiv -x' \pmod{p}$. Let $g$ be primitive mod $p$, with $x = g^\alpha$, |
| 1079 | $x' = g^\beta$. Then $g^{2 \alpha} \equiv g^{2 \beta} \pmod{p}$, so $2 |
| 1080 | \alpha \equiv 2 \beta \pmod{p - 1}$. But $p - 1$ is even, by hypothesis, |
| 1081 | so $\alpha \equiv \beta \pmod{(p - 1)/2}$. But $g^{(p-1)/2} \equiv -1 |
| 1082 | \pmod{p}$; hence $x \equiv \pm x' \pmod{p}$, proving the claim. |
| 1083 | |
| 1084 | There must exist odd primes $p$, $q$, such that $p|n$ and $q|n$, and $x |
| 1085 | \equiv -y \pmod{p}$ and $x \equiv y \pmod{q}$, for if not, then $x \equiv |
| 1086 | \pm y \pmod{n}$ contrary to hypothesis. But then $x + y \equiv 0 |
| 1087 | \pmod{p}$, so $p|(x + y)$; but $x + y \equiv 2 x \not\equiv 0 \pmod{q}$, |
  since $q$ is odd.  Hence, $p$ divides $x + y$, but $q$ does not, so $\gcd(x
  + y, n)$ is a nontrivial factor of $n$.  Finally, if $\gcd(x, n) = 1$ then
  $y = x^2 \bmod n$ has at least four square roots modulo $n$, and the root
  $x'$ returned by $Q$ cannot depend on which of them the program chose as
  $x$; so $x' \not\equiv \pm x \pmod{n}$ with probability at least
  $\frac{1}{2}$, and each iteration of the loop succeeds with probability
  about $\frac{1}{2}$, completing the proof.
| 1090 | \end{proof} |
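Here is a Python sketch of the factoring algorithm above.  Obviously we have
no efficient square-root oracle, so the sketch simply brute-forces $Q$ over
a tiny modulus, which is enough to watch the reduction succeed.
\begin{verbatim}
import math, random

def Q(n, y):
    """Toy square-root oracle by brute force; in the reduction, Q is
    the hypothetical efficient algorithm."""
    for x in range(n):
        if x * x % n == y:
            return x

def find_factor(n):
    while True:
        x = random.randrange(1, n)
        y = x * x % n
        x1 = Q(n, y)
        p = math.gcd(x + x1, n)
        if p not in (1, n):
            return p

print(find_factor(101 * 103))     # prints 101 or 103
\end{verbatim}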
| 1091 | |
| 1092 | \begin{slide} |
| 1093 | \topic{quadratic residuosity} |
| 1094 | \head{The Quadratic Residuosity Problem} |
| 1095 | |
| 1096 | If there is an $x$ such that $x^2 \equiv y \pmod{n}$ then $y$ is a |
| 1097 | \emph{quadratic residue modulo $n$}, and we write $y \in Q_n$; if there is |
| 1098 | no such $x$ then $y$ is a \emph{quadratic nonresidue modulo $n$}. |
| 1099 | |
| 1100 | If $p$ is prime, then we can use the \emph{Legendre symbol} to decide |
| 1101 | whether $x$ is a quadratic residue mod $p$: |
| 1102 | \[ \jacobi{x}{p} = x^{(p-1)/2} \bmod p = |
| 1103 | \begin{cases} |
      0 & \text{if $p$ divides $x$} \\
      -1 & \text{if $x$ is a quadratic nonresidue mod $p$} \\
      +1 & \text{if $x$ is a quadratic residue mod $p$}
| 1107 | \end{cases}. \]% |
| 1108 | The \emph{Jacobi symbol} (written the same way) is defined for odd~$n$: if |
| 1109 | $n = p_1^{a_1} p_2^{a_2} \cdots p_k^{a_k}$ where the $p_i$ are prime, then |
| 1110 | \[ \jacobi{x}{n} = |
| 1111 | \jacobi{x}{p_1}^{a_1} \jacobi{x}{p_2}^{a_2} \cdots |
| 1112 | \jacobi{x}{p_k}^{a_k}. \]% |
| 1113 | This can be efficiently computed without knowledge of the factors of $n$ |
| 1114 | \cite[Section~2.4.5]{Menezes:1997:HAC}. |
| 1115 | \end{slide} |
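Here is a Python sketch of the standard algorithm for computing the Jacobi
symbol without factoring $n$, by repeatedly removing factors of two and
applying quadratic reciprocity.
\begin{verbatim}
def jacobi(a, n):
    """Compute the Jacobi symbol (a/n) for odd n > 0."""
    assert n > 0 and n % 2 == 1
    a %= n
    t = 1
    while a != 0:
        while a % 2 == 0:                # (2/n) = -1 iff n = 3, 5 mod 8
            a //= 2
            if n % 8 in (3, 5):
                t = -t
        a, n = n, a                      # quadratic reciprocity
        if a % 4 == 3 and n % 4 == 3:
            t = -t
        a %= n
    return t if n == 1 else 0

print(jacobi(19, 45), jacobi(8, 21), jacobi(5, 21))   # 1 -1 1
\end{verbatim}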
| 1116 | |
| 1117 | \begin{slide} |
| 1118 | \head{The Quadratic Residuosity Problem (cont.)} |
| 1119 | |
| 1120 | If $\jacobi{x}{n} = -1$ then $x$ is certainly \emph{not} a quadratic |
| 1121 | residue mod $n$; however, if $\jacobi{x}{n} = 1$ then $x$ might be a |
| 1122 | quadratic residue or it might not; if not, then we say that $x$ is a |
| 1123 | \emph{pseudosquare}. |
| 1124 | |
| 1125 | If $n = p q$ is a product of two primes and $x \inr (\Z/n\Z)^*$ is chosen |
| 1126 | at random, then |
| 1127 | \[ \Pr\Bigl[x \in Q_n \Bigm| \jacobi{x}{n} = 1\Bigr] = \frac{1}{2}, \] |
  since, given $\jacobi{x}{n} = 1$, we have either
  \[ \jacobi{x}{p} = \jacobi{x}{q} = +1 \qquad \text{or} \qquad
     \jacobi{x}{p} = \jacobi{x}{q} = -1, \]%
  with each case occurring with equal probability, and only the former makes
  $x$ a quadratic residue.
| 1131 | |
| 1132 | The problem of distinguishing pseudosquares from quadratic residues is |
| 1133 | called the Quadratic Residuosity Problem (QRP). It is not known how to |
| 1134 | solve this problem without factoring $n$. |
| 1135 | \end{slide} |
| 1136 | |
| 1137 | \begin{slide} |
| 1138 | \topic{discrete logarithms} |
| 1139 | \head{The Discrete Logarithm Problem} |
| 1140 | |
| 1141 | The (Integer) Discrete Logarithm Problem asks for the solution $x$ given a |
| 1142 | congruence of the form $g^x \equiv y \pmod{n}$. This seems to be about as |
| 1143 | difficult as factoring. (The ability to solve discrete logs modulo $n$ is |
| 1144 | sufficient to factor $n$. The best known algorithms for IFP and DLP have |
| 1145 | the same running time.) |
| 1146 | |
| 1147 | The problem generalizes to other cyclic groups, e.g., elliptic curves over |
| 1148 | finite fields. |
| 1149 | \end{slide} |
| 1150 | |
| 1151 | \begin{slide} |
| 1152 | \topic{self-reducibility} |
| 1153 | \resetseq |
| 1154 | \head{Self-reducibility, \seq} |
| 1155 | |
| 1156 | The problems of square-root extraction, deciding quadratic residuosity, the |
| 1157 | RSA problem, and finding discrete logarithms share the property of being |
| 1158 | \emph{randomly self-reducible}; i.e., an instance of the problem can be |
| 1159 | transformed into many different derived instances \emph{without skewing the |
| 1160 | probability distribution of problem instances}, such that a solution to |
| 1161 | one of the derived instances yields a solution to the original one. |
| 1162 | |
| 1163 | This is a good property to have. It implies that `most' problem instances |
| 1164 | are as hard as the hardest instances. |
| 1165 | \end{slide} |
| 1166 | |
| 1167 | \begin{slide} |
| 1168 | \head{Self-reducibility, \seq: the RSA problem \cite{Rivest:1978:MOD}} |
| 1169 | |
  The RSA problem is to compute $e$-th roots modulo $n = p q$, where $e$ is
  relatively prime to $(p - 1)(q - 1)$.  Suppose that the algorithm $S(n, e, y)$ returns a
| 1172 | value $x$ such that $x^e \equiv y \pmod{n}$ for `many' choices of $y$, or |
| 1173 | the special symbol $\bot$ otherwise. The following probabilistic algorithm |
| 1174 | then solves the RSA problem for arbitrary $y$: |
| 1175 | \begin{program} |
| 1176 | Algorithm $\id{solve-rsa}(n, e, y)$: \+ \\ |
| 1177 | \REPEAT \\ \quad\=\+\kill |
| 1178 | $x' \getsr \{1, 2, \ldots, n - 1\}$; \\ |
| 1179 | \IF $\gcd(x', n) = 1$ \THEN \\ \quad\=\+\kill |
| 1180 | $y' \gets y x'^e \bmod n$; \\ |
| 1181 | $x \gets S(n, e, y')$; \\ |
| 1182 | \IF $x \ne \bot$ \THEN \RETURN $x x'^{-1} \bmod n$; \-\- \\ |
| 1183 | \FOREVER; |
| 1184 | \end{program} |
| 1185 | \end{slide} |
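The following Python sketch mirrors the reduction with toy RSA parameters;
the partial solver $S$ is simulated by a full decryptor which arbitrarily
refuses half of its inputs, purely so that the example runs.
\begin{verbatim}
import math, random

p, q, e = 1009, 2003, 65537               # toy parameters only
n = p * q
d = pow(e, -1, (p - 1) * (q - 1))         # Python 3.8+ modular inverse

def S(n, e, y):
    """Stand-in partial solver: `fails' on an arbitrary half of all
    inputs, succeeds on the rest."""
    return None if y % 2 == 0 else pow(y, d, n)

def solve_rsa(n, e, y):
    while True:
        x1 = random.randrange(1, n)
        if math.gcd(x1, n) != 1:
            continue
        y1 = y * pow(x1, e, n) % n        # re-randomize the instance
        x = S(n, e, y1)
        if x is not None:
            return x * pow(x1, -1, n) % n

y = pow(424242, e, n)
assert pow(solve_rsa(n, e, y), e, n) == y
\end{verbatim}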
| 1186 | |
| 1187 | \begin{slide} |
| 1188 | \topic{the Diffie-Hellman problem} |
| 1189 | \head{The Diffie-Hellman problem \cite{Diffie:1976:NDC}} |
| 1190 | |
  Let $G = \langle g \rangle$ be a cyclic group of order $q$.  Let $\alpha$
| 1192 | and $\beta$ be indices, $\alpha, \beta \in \Z/q\Z$. |
| 1193 | |
| 1194 | The (computational) \emph{Diffie-Hellman} problem is, given $g^\alpha$ and |
| 1195 | $g^\beta$, to find $g^{\alpha\beta}$. |
| 1196 | |
| 1197 | The (computational) Diffie-Hellman \emph{assumption} is that there is no |
| 1198 | probabilistic algorithm which solves the computational Diffie-Hellman |
| 1199 | problem in time polynomial in $\log q$. |
| 1200 | |
| 1201 | Obviously, being able to compute discrete logs in $G$ efficiently would |
| 1202 | yield a solution to the Diffie-Hellman problem. But it may be the case |
| 1203 | that the Diffie-Hellman problem is easier than the discrete log problem. |
| 1204 | |
| 1205 | The Diffie-Hellman problem is self-reducible. |
| 1206 | \end{slide} |
| 1207 | |
| 1208 | \begin{slide} |
| 1209 | \head{The Decisional Diffie-Hellman assumption \cite{Boneh:1998:DDP}} |
| 1210 | |
| 1211 | The computational Diffie-Hellman assumption makes a statement only about |
| 1212 | algorithms which compute the \emph{entire} answer $g^{\alpha\beta}$. Since |
| 1213 | Diffie-Hellman is frequently used for key-exchange, what can we say about |
| 1214 | the ability of an adversary to guess any of the bits? |
| 1215 | |
| 1216 | The Decisional Diffie-Hellman (DDH) assumption asserts that, if you don't |
| 1217 | know $\alpha$ or $\beta$, then it's hard to tell $g^{\alpha\beta}$ from a |
| 1218 | random element of $G$; that is, that the distributions of the following |
| 1219 | experiments are computationally indistinguishable: |
| 1220 | \begin{program} |
| 1221 | $\alpha \getsr \Z/q\Z;$ \\ |
| 1222 | $\beta \getsr \Z/q\Z;$ \\ |
| 1223 | \RETURN $(g^\alpha, g^\beta, g^{\alpha\beta})$; |
| 1224 | \next |
| 1225 | $\alpha \getsr \Z/q\Z;$ \\ |
| 1226 | $\beta \getsr \Z/q\Z;$ \\ |
| 1227 | $\gamma \getsr \Z/q\Z;$ \\ |
| 1228 | \RETURN $(g^\alpha, g^\beta, g^\gamma)$; |
| 1229 | \end{program} |
| 1230 | \end{slide} |
| 1231 | |
| 1232 | \begin{slide} |
| 1233 | \head{The Decisional Diffie-Hellman assumption (cont.)} |
| 1234 | |
| 1235 | If $A$ is an algorithm attempting to solve DDH in the group $G$, then we |
| 1236 | write its advantage as |
| 1237 | \begin{eqnarray*}[rl] |
| 1238 | \Adv{ddh}{G}(A) = |
| 1239 | & \Pr[\alpha \getsr \Z/q\Z; \beta \getsr \Z/q\Z : |
| 1240 | A(g^\alpha, g^\beta, g^{\alpha\beta}) = 1] - {} \\ |
| 1241 | & \Pr[\alpha \getsr \Z/q\Z; \beta \getsr \Z/q\Z; |
| 1242 | \gamma \getsr \Z/q\Z : |
| 1243 | A(g^\alpha, g^\beta, g^\gamma) = 1] |
| 1244 | \end{eqnarray*} |
| 1245 | and the insecurity of DDH in $G$ as |
| 1246 | \[ \InSec{ddh}(G; t) = \max_A \Adv{ddh}{G}(A) \] |
| 1247 | with the maximum over all algorithms $A$ running in time $t$. |
| 1248 | \end{slide} |
| 1249 | |
| 1250 | \begin{slide} |
| 1251 | \head{The Decisional Diffie-Hellman assumption (cont.)} |
| 1252 | |
| 1253 | If you can solve the computational Diffie-Hellman problem, you can solve |
| 1254 | the decisional version. If you can compute discrete logs, you can solve |
| 1255 | both. |
| 1256 | |
  There \emph{are} groups in which the computational problem is (believed to
| 1258 | be) hard, but the decisional problem is easy. In particular, if the group |
| 1259 | order $q$ has small factors then the decisional problem isn't hard. |
| 1260 | \end{slide} |
| 1261 | |
| 1262 | \endinput |
| 1263 | |
| 1264 | %%% Local Variables: |
| 1265 | %%% mode: latex |
| 1266 | %%% TeX-master: "ips" |
| 1267 | %%% End: |