Makefile.in: Drop dist target
[secnet] / NOTES
CommitLineData
974d0468 1* Design of new, multi-subnet secnet protocol
2fe58dfd 2
974d0468
SE
3Like the first (1995/6) version, we're tunnelling IP packets inside
4UDP packets. To defeat various restrictions which may be imposed on us
5by network providers (like the prohibition of incoming TCP
6connections) we're sticking with UDP for everything this time,
3454dce4 7including key setup. This means we have to handle retries, etc.
2fe58dfd
SE
8
9Other new features include being able to deal with subnets hidden
10behind changing 'real' IP addresses, and the ability to choose
11algorithms and keys per pair of communicating sites.
12
13** Configuration and structure
14
3454dce4
SE
15[The original plan]
16
2fe58dfd
SE
17The network is made up from a number of 'sites'. These are collections
18of machines with private IP addresses. The new secnet code runs on
19machines which have interfaces on the private site network and some
20way of accessing the 'real' internet.
21
22Each end of a tunnel is identified by a name. Often it will be
23convenient for every gateway machine to use the same name for each
24tunnel endpoint, but this is not vital. Individual tunnels are
25identified by their two endpoint names.
26
3454dce4
SE
27[The new plan]
28
29It appears that people want to be able to use secnet on mobile
30machines like laptops as well as to interconnect sites. In particular,
31they want to be able to use their laptop in three situations:
32
331) connected to their internal LAN by a cable; no tunnel involved
342) connected via wireless, using a tunnel to protect traffic
353) connected to some other network, using a tunnel to access the
36internal LAN.
37
38They want the laptop to keep the same IP address all the time.
39
40Case (1) is simple.
41
42Case (2) requires that the laptop run a copy of secnet, and have a
43tunnel configured between it and the main internal LAN default
44gateway. secnet must support the concept of a 'soft' tunnel where it
45adds a route and causes the gateway to do proxy-ARP when the tunnel is
46up, and removes the route again when the tunnel is down.
47
48The usual prohibition of packets coming in from one tunnel and going
49out another must be relaxed in this case (in particular, the
50destination address of packets from these 'mobile station' tunnels may
51be another tunnel as well as the host).
52
53(Quick sanity check: if chiark's secnet address was in
54192.168.73.0/24, would this work properly? Yes, because there will be
55an explicit route to it, and proxy ARP will be done for it. Do we want
56packets from the chiark tunnel to be able to go out along other
57routes? No. So, spotting a 'local' address in a remote site's list of
58networks isn't sufficient to switch on routing for a site. We need an
59explicit option. NB packets may be routed if the source OR the
60destination is marked as allowing routing [otherwise packets couldn't
61get back from eg. chiark to a laptop at greenend]).
62
4f5e39ec
SE
63[the even newer plan]
64
65secnet sites are configured to grant access to particular IP address
66ranges to the holder of a particular public key. The key can certify
67other keys, which will then be permitted to use a subrange of the IP
68address range of the certifying key.
69
70This means that secnet won't know in advance (i.e. at configuration
71time) how many tunnels it might be required to support, so we have to
72be able to create them (and routes, and so on) on the fly.
73
3454dce4
SE
74** VPN-level configuration
75
76At a high level we just want to be able to indicate which groups of
77users can claim ownership of which ranges of IP addresses. Assuming
78these users (or their representatives) all have accounts on a single
79machine, we can automate the submission of keys and other information
80to make up a 'sites' file for the entire VPN.
81
82The distributed 'sites' file should be in a more restricted format
83than the secnet configuration file, to prevent attackers who manage to
84distribute bogus sites files from taking over their victim's machines.
85
86The distributed 'sites' file is read one line at a time. Each line
87consists of a keyword followed by other information. It defines a
88number of VPNs; within each VPN it defines a number of locations;
89within each location it defines a number of sites. These VPNs,
90locations and sites are turned into a secnet.conf file fragment using
91a script.
92
93Some keywords are valid at any 'level' of the distributed 'sites'
94file, indicating defaults.
95
96The keywords are:
97
98vpn n: we are now declaring information to do with VPN 'n'. Must come first.
99
100location n: we are now declaring information for location 'n'.
101
102site n: we are now declaring information for site 'n'.
103endsite: we're finished declaring information for the current site
104
105restrict-nets a b c ...: restrict the allowable 'networks' for the current
106 level to those in this list.
107end-definitions: prevent definition of further vpns and locations, and
108 modification of defaults at VPN level
109
110dh x y: the current VPN uses the specified group; x=modulus, y=generator
111
112hash x: which hash function to use. Valid options are 'md5' and 'sha1'.
113
114admin n: administrator email address for current level
115
116key-lifetime n
117setup-retries n
118setup-timeout n
119wait-time n
120renegotiate-time n
121
122address a b: a=dnsname, b=port
123networks a b c ...
124pubkey x y z: x=keylen, y=encryption key, z=modulus
125mobile: declare this to be a 'mobile' site
126
b2a56f7c
SE
127** Logging etc.
128
129There are several possible ways of running secnet:
130
131'reporting' only: --version, --help, etc. command line options and the
132--just-check-config mode.
133
134'normal' run: perform setup in the foreground, and then background.
135
136'failed' run: setup in the foreground, and terminate with an error
137before going to background.
138
139'reporting' modes should never output anything except to stdout/stderr.
140'normal' and 'failed' runs output to stdout/stderr before
141backgrounding, then thereafter output only to log destinations.
142
2fe58dfd
SE
143** Protocols
144
145*** Protocol environment:
146
147Each gateway machine serves a particular, well-known set of private IP
148addresses (i.e. the agreement over which addresses it serves is
149outside the scope of this discussion). Each gateway machine has an IP
150address on the interconnecting network (usually the Internet), which
151may be dynamically allocated and may change at any point.
152
153Each gateway knows the RSA public keys of the other gateways with
154which it wishes to communicate. The mechanism by which this happens is
155outside the scope of this discussion. There exists a means by which
156each gateway can look up the probable IP address of any other.
157
158*** Protocol goals:
159
160The ultimate goal of the protocol is for the originating gateway
161machine to be able to forward packets from its section of the private
162network to the appropriate gateway machine for the destination
163machine, in such a way that it can be sure that the packets are being
164sent to the correct destination machine, the destination machine can
165be sure that the source of the packets is the originating gateway
166machine, and the contents of the packets cannot be understood other
167than by the two communicating gateways.
168
169XXX not sure about the address-change stuff; leave it out of the first
170version of the protocol. From experience, IP addresses seem to be
171quite stable so the feature doesn't gain us much.
172
173**** Protocol sub-goal 1: establish a shared key
174
175Definitions:
176
1737eeef
IJ
177A is the originating gateway machine name
178B is the destination gateway machine name
09a385fb 179A+ and B+ are the names with optional additional data, see below
2fe58dfd
SE
180PK_A is the public RSA key of A
181PK_B is the public RSA key of B
182PK_A^-1 is the private RSA key of A
183PK_B^-1 is the private RSA key of B
184x is the fresh private DH key of A
185y is the fresh private DH key of B
186k is g^xy mod m
187g and m are generator and modulus for Diffie-Hellman
188nA is a nonce generated by A
189nB is a nonce generated by B
190iA is an index generated by A, to be used in packets sent from B to A
191iB is an index generated by B, to be used in packets sent from A to B
192i? is appropriate index for receiver
193
194Note that 'i' may be re-used from one session to the next, whereas 'n'
195is always fresh.
196
09a385fb
IJ
197The optional additional data after the sender's name consists of some
198initial subset of the following list of items:
199 * A 32-bit integer with a set of capability flags, representing the
200 abilities of the sender.
3ed1846a
IJ
201 * In MSG3/MSG4: a 16-bit integer being the sender's MTU, or zero.
202 (In other messages: nothing.) See below.
09a385fb
IJ
203 * More data which is yet to be defined and which must be ignored
204 by receivers.
205The optional additional data after the receiver's name is not
206currently used. If any is seen, it must be ignored.
207
208Capability flag bits must be in one the following two categories:
209
2101. Early capability flags must be advertised in MSG1 or MSG2, as
211 applicable. If MSG3 or MSG4 advertise any "early" capability bits,
212 MSG1 or MSG3 (as applicable) must have advertised them too. Sadly,
213 advertising an early capability flag will produce MSG1s which are
214 not understood by versions of secnet which predate the capability
215 mechanism.
216
2172. Late capability flags are advertised in MSG2 or MSG3, as
218 applicable. They may also appear in MSG1, but this is not
219 guaranteed. MSG4 must advertise the same set as MSG2.
220
0b5df556
MW
221Currently, the low 16 bits are allocated for negotiating bulk-crypto
222transforms. Bits 8 to 15 are used by Secnet as default capability
223numbers for the various kinds of transform closures: bit 8 is for the
224original CBCMAC-based transform, and bit 9 for the new EAX transform;
225bits 10 to 15 are reserved for future expansion. The the low eight bits
226are reserved for local use, e.g., to allow migration from one set of
227parameters for a particular transform to a different, incompatible set
b2de6bd2
IJ
228of parameters for the same transform. Bit 31, if advertised by both
229ends, indicates that a mobile end gets priority in case of crossed MSG1.
230The remaining bits have not yet been assigned a purpose.
0b5df556
MW
231
232No early capability bits are currently defined.
09a385fb 233
ff05a229 234
3ed1846a
IJ
235MTU handling
236
237In older versions of secnet, secnet was not capable of fragmentation
238or sending ICMP Frag Needed. Administrators were expected to configure
239consistent MTUs across the network.
240
241It is still the case in the current version that the MTUs need to be
242configured reasonably coherently across the network: the allocated
243buffer sizes must be sufficient to cope with packets from all other
244peers.
245
246However, provided the buffers are sufficient, all packets will be
247processed properly: a secnet receiving a packet larger than the
248applicable MTU for its delivery will either fragment it, or reject it
249with ICMP Frag Needed.
250
251The MTU additional data field allows secnet to advertise an MTU to the
252peer. This allows the sending end to handle overlarge packets, before
253they are transmitted across the underlying public network. This can
254therefore be used to work around underlying network braindamage
255affecting large packets.
256
257If the MTU additional data field is zero or not present, then the peer
258should use locally-configured MTU information (normally, its local
259netlink MTU) instead.
260
261If it is nonzero, the peer may send packets up to the advertised size
262(and if that size is bigger than the peer's administratively
263configured size, the advertiser promises that its buffers can handle
264such a large packet).
265
266A secnet instance should not assume that just because it has
267advertised an mtu which is lower than usual for the vpn, the peer will
268honour it, unless the administrator knows that the peers are
269sufficiently modern to understand the mtu advertisement option. So
270secnet will still accept packets which exceed the link MTU (whether
271negotiated or assumed).
272
273
2fe58dfd
SE
274Messages:
275
291906b8 2761) A->B: i*,iA,msg1,A+,B+,nA
7e29719e
IJ
277
278i* must be encoded as 0. (However, it is permitted for a site to use
279zero as its "index" for another site.)
2fe58dfd 280
1737eeef 2812) B->A: iA,iB,msg2,B+,A+,nB,nA
2fe58dfd
SE
282
283(The order of B and A reverses in alternate messages so that the same
284code can be used to construct them...)
285
5b5f297f 2863) A->B: {iB,iA,msg3,A+,B+,[chosen-transform],nA,nB,g^x mod m}_PK_A^-1
2fe58dfd
SE
287
288If message 1 was a replay then A will not generate message 3, because
289it doesn't recognise nA.
290
291If message 2 was from an attacker then B will not generate message 4,
292because it doesn't recognise nB.
293
1737eeef 2944) B->A: {iA,iB,msg4,B+,A+,nB,nA,g^y mod m}_PK_B^-1
2fe58dfd
SE
295
296At this point, A and B share a key, k. B must keep retransmitting
297message 4 until it receives a packet encrypted using key k.
298
2995) A: iB,iA,msg5,(ping/msg5)_k
300
3016) B: iA,iB,msg6,(pong/msg6)_k
302
303(Note that these are encrypted using the same transform that's used
304for normal traffic, so they include sequence number, MAC, etc.)
305
306The ping and pong messages can be used by either end of the tunnel at
307any time, but using msg0 as the unencrypted message type indicator.
308
309**** Protocol sub-goal 2: end the use of a shared key
310
3117) i?,i?,msg0,(end-session/msg7,A,B)_k
312
313This message can be sent by either party. Once sent, k can be
314forgotten. Once received and checked, k can be forgotten. No need to
315retransmit or confirm reception. It is suggested that this message be
316sent when a key times out, or the tunnel is forcibly terminated for
317some reason.
318
1e80c220 319**** Protocol sub-goal 3: send a packet
2fe58dfd 320
1e80c220 3218) i?,i?,msg0,(send-packet/msg9,packet)_k
2fe58dfd 322
1e80c220 323**** Other messages
974d0468 324
1e80c220 3259) i?,i?,NAK (NAK is encoded as zero)
2fe58dfd 326
1e80c220
IJ
327If the link-layer can't work out what to do with a packet (session has
328gone away, etc.) it can transmit a NAK back to the sender.
2fe58dfd 329
1e80c220
IJ
330This can alert the sender to the situation where the sender has a key
331but the receiver doesn't (eg because it has been restarted). The
332sender, on receiving the NAK, will try to initiate a key exchange.
4f5e39ec 333
1e80c220
IJ
334Forged (or overly delayed) NAKs can cause wasted resources due to
335spurious key exchange initiation, but there is a limit on this because
336of the key exchange retry timeout.
4f5e39ec
SE
337
33810) i?,i?,msg8,A,B,nA,nB,msg?
1e80c220
IJ
339
340This is an obsolete form of NAK packet which is not sent by any even
341vaguely recent version of secnet. (In fact, there is no evidence in
342the git history of it ever being sent.)
343
344This message number is reserved.
dd9209d1
IJ
345
34611) *,*,PROD,A,B
347
348Sent in response to a NAK from B to A. Requests that B initiates a
349key exchange with A, if B is willing and lacks a transport key for A.
350(If B doesn't have A's address configured, implicitly supplies A's
351public address.)
352
353This is necessary because if one end of a link (B) is restarted while
354a key exchange is in progress, the following bad state can persist:
355the non-restarted end (A) thinks that the key is still valid and keeps
356sending packets, but B either doesn't realise that a key exchange with
357A is necessary or (if A is a mobile site) doesn't know A's public IP
358address.
359
360Normally in these circumstances B would send NAKs to A, causing A to
361initiate a key exchange. However if A and B were already in the
362middle of a key exchange then A will not want to try another one until
363the first one has timed out ("setup-time" x "setup-retries") and then
364the key exchange retry timeout ("wait-time") has elapsed.
365
366However if B's setup has timed out, B would be willing to participate
367in a key exchange initiated by A, if A could be induced to do so.
368This is the purpose of the PROD packet.
369
370We send no more PRODs than we would want to send data packets, to
371avoid a traffic amplification attack. We also send them only in state
372WAIT, as in other states we wouldn't respond favourably. And we only
373honour them if we don't already have a key.
374
375With PROD, the period of broken communication due to a key exchange
376interrupted by a restart is limited to the key exchange total
377retransmission timeout, rather than also including the key exchange
378retry timeout.