Low-Level Academy

Blog
Subscribe

< TCP/IP Fundamentals
BeginnerRust

Fragmentation

Hi there! This lesson is interactive, and it depends on WebAssembly and JavaScript being enabled in the browser. While you can still read the lesson text, we recommend enabling JS to add explorable & interactive elements. Don't worry, we don't use any tracking cookies and we respect your privacy.

In the first lesson, we found the IP address of your friend Alice and we had a nice chat. But now she asks you to send her a postcard as an image file, and you have been trying to do that—unsuccessfully. Turns out, you can't send a large file in a single network packet!

Let's see why exactly this happens and devise a solution to overcome this limitation, learning how to successfully deliver your picture. After completing this lesson, you’ll learn how to split large messages into smaller packets using Rust iterators and how to deal with out-of-order delivery.

Let's suppose that this image is of 260 kilobytes in size. As we learned in the previous lesson, a single IP packet fits 65 kilobytes of data. In real networks, though, this number is even smaller as it is limited by the constraints of the physical Ethernet or Wi-Fi networks. We call this limit the Maximum transmission unit size, or MTU for short. MTU depends on the physical network configuration, and in the majority of Ethernet or Wi-Fi-based networks it sits at 1500 bytes. In practice, this means that most IP packets are smaller than 1.5 kilobytes.

What happens if we have an MTU of 1500 but send an IP packet of 65 kilobytes in size?

To find out, let's get back to the table deconstruction of an IP packet:

0		15	16		31
Version	Header length	Type of Service	Total length
Identification			Flags	Fragment offset
Time to live (TTL)		Protocol	Header checksum
Source IP address
Destination IP address

There are three fields which are relevant to our question: identification, flags, and the fragment offset.

Because we know the MTU size in advance, we can cut IP packets into many smaller pieces called fragments. Each fragment becomes its own IP packet, carrying a slice of the original message payload with its own unique identification number. The offset says which part of the original payload this fragment carries. This information allows the receiving end to reconstruct the original packet from its fragmented form. Finally, flags indicate if the IP packet is fragmented, and if it is, the final fragment will have flags set to zero while the fragment offset will have a non-zero value.

Let's see how a single 16 KB IP packet is split up when we adjust the MTU size:

Packet (1536 bytes)

Identification: 0

Offset: 0

Flags: fragmented

Packet (1536 bytes)

Identification: 1

Offset: 1536

Flags: fragmented

Packet (1536 bytes)

Identification: 2

Offset: 3072

Flags: fragmented

Packet (1536 bytes)

Identification: 3

Offset: 4608

Flags: fragmented

Packet (1536 bytes)

Identification: 4

Offset: 6144

Flags: fragmented

Packet (1536 bytes)

Identification: 5

Offset: 7680

Flags: fragmented

Packet (1536 bytes)

Identification: 6

Offset: 9216

Flags: fragmented

Packet (1536 bytes)

Identification: 7

Offset: 10752

Flags: fragmented

Packet (1536 bytes)

Identification: 8

Offset: 12288

Flags: fragmented

Packet (1536 bytes)

Identification: 9

Offset: 13824

Flags: fragmented

Packet (1024 bytes)

Identification: 10

Offset: 15360

Flags: none

This kind of fragmentation happens without our knowledge. It's a part of the network stack implementation in operating systems, so we don't have to deal with it at the IP level.

Moreover, in the real-world Internet, we will rarely experience IP fragmentation. The reason is that IP fragmentation is fragile: for example, if one single fragment is not delivered, the original message has to be dropped, fragmented again, and retransmitted because there's no way to retransmit only a part of it. Instead, packets are generally made smaller than the MTU using path MTU discovery, where packets are made progressively smaller until they are accepted at the destination.

And remember: we're dealing with IP payloads, so if we send a UDP datagram, the IP payload will include the UDP header. Because of this, we can't see UDP source and destination port numbers before completely reconstructing the original message:

IP packet (1500 bytes)

Identification: 0

Offset: 0

Payload (UDP header):

0	15
Source port
Destination port
UDP length: 3500
UDP checksum
UDP payload

IP packet (1500 bytes)

Identification: 1

Offset: 1500

Payload:

0	...
UDP payload

These limitations have led to completely disabling fragmentation in the newer Internet Protocol version (IPv6). Naturally, you ask: why are we even talking about it then? It's because while the IP fragmentation is not that useful to us, it's still a nice idea we can reuse for our own purpose: we can do the fragmentation at the level of our application. That's what we will do on the next page.

Low-Level Academy

< TCP/IP FundamentalsBeginnerRust

Fragmentation

< TCP/IP Fundamentals
BeginnerRust