Server Lifecycle

A server socket listens for connections rather than initiating them. The typical lifecycle looks something like this:

create
bind
listen
accept
close

We covered #1 already; now we’ll continue on with the rest of the list.

Servers Bind

The second step in the lifecycle of a server socket is to bind to a port where it will listen for connections.

require 'socket'

# First, create a new TCP socket.
socket = Socket.new(:INET, :STREAM)

# Create a C struct to hold the address for listening.
addr = Socket.pack_sockaddr_in(4481, '0.0.0.0')

# Bind to it.
socket.bind(addr)

This is a low-level implementation that shows how to bind a TCP socket to a local port. In fact, it’s almost identical to the C code you would write to accomplish the same thing.

This particular socket is now bound to port 4481 on the local host. Other sockets will not be able to bind to this port; doing so would result in an Errno::EADDRINUSE exception being raised. Client sockets will be able to connect to this socket using this port number, once a few more steps have been completed.

If you run that code block you’ll notice that it exits immediately. The code works but doesn’t yet do enough to actually listen for a connection. Keep reading to see how to put the server in listen mode.

To recap, a server binds to a specific, agreed-upon port number which a client socket can then connect to.

Of course, Ruby provides syntactic sugar so that you never have to actually use Socket.pack_sockaddr_in or Socket#bind directly. But before learning the syntactic sugar it’s important that we see how to do things the hard way.

What port should I bind to?

This is an important consideration for anyone writing a server. Should you pick a random port number? How can you tell if some other program has already ‘claimed’ a port as their own?

In terms of what’s possible, any port from 1-65,535 can be used, but there are important conventions to consider before picking a port.

The first rule: don’t try to use a port in the 0-1024 range. These are considered ‘well-known’ ports and are reserved for system use. A few examples: HTTP traffic defaults to port 80, SMTP traffic defaults to port 25, rsync defaults to port 873. Binding to these ports typically requires root access.

The second rule: don’t use a port in the 49,000-65,535 range. These are the ephemeral ports. They’re typically used by services that don’t operate on a predefined port number but need ports for temporary purposes. They’re also an integral part of the connection negotiation process we’ll see in the next section. Picking a port in this range might cause issues for some of your users.

Besides that, any port from 1025-48,999 is fair game for your uses. If you’re planning on claiming one of those ports as the port for your server then you should have a look at the IANA list of registered ports and make sure that your choice doesn’t conflict with some other popular server out there.

What address should I bind to?

I bound to 0.0.0.0 in the above example, but what’s the difference when I bind to 127.0.0.1? Or 1.2.3.4? The answer has to do with interfaces.

Earlier I mentioned that your system has a loopback interface represented with the IP address 127.0.0.1. It also has a physical, hardware-backed interface represented by a different IP address (let’s pretend it’s 192.168.0.5). When you bind to a specific interface, represented by its IP address, your socket is only listening on that interface. It will ignore the others.

If you bind to 127.0.0.1 then your socket will only be listening on the loopback interface. In this case, only connections made to localhost or 127.0.0.1 will be routed to your server socket. Since this interface is only available locally, no external connections will be allowed.

If you bind to 192.168.0.5, in this example, then your socket will only be listening on that interface. Any clients that can address that interface will be listened for, but any connections made on localhost will not be routed to that server socket.

If you want to listen on all interfaces then you can use 0.0.0.0. This will bind to any available interface, loopback or otherwise. Most of the time, this is what you want.

require 'socket'

# This socket will bind to the loopback interface and will
# only be listening for clients from localhost.
local_socket = Socket.new(:INET, :STREAM)
local_addr = Socket.pack_sockaddr_in(4481, '127.0.0.1')
local_socket.bind(local_addr)

# This socket will bind to any of the known interfaces and
# will be listening for any client that can route messages
# to it.
any_socket = Socket.new(:INET, :STREAM)
any_addr = Socket.pack_sockaddr_in(4481, '0.0.0.0')
any_socket.bind(any_addr)

# This socket attempts to bind to an unkown interface
# and raises Errno::EADDRNOTAVAIL.
error_socket = Socket.new(:INET, :STREAM)
error_addr = Socket.pack_sockaddr_in(4481, '1.2.3.4')
error_socket.bind(error_addr)

Servers Listen

After creating a socket, and binding to a port, the socket needs to be told to listen for incoming connections.

require 'socket'

# Create a socket and bind it to port 4481.
socket = Socket.new(:INET, :STREAM)
addr = Socket.pack_sockaddr_in(4481, '0.0.0.0')
socket.bind(addr)

# Tell it to listen for incoming connections.
socket.listen(5)

The only addition to the code from the last chapter is a call to listen on the socket.

If you run that code snippet it still exits immediately. There’s one more step in the lifecycle of a server socket required before it can process connections. That’s covered in the next chapter. First, more about listen.

The Listen Queue

You may have noticed that we passed an integer argument to the listen method. This number represents the maximum number of pending connections your server socket is willing to tolerate. This list of pending connections is called the listen queue.

Let’s say that your server is busy processing a client connection, when any new client connections arrive they’ll be put into the listen queue. If a new client connection arrives and the listen queue is full then the client will raise Errno::ECONNREFUSED.

How big should the listen queue be?

OK, so the size of the listen queue looks a bit like a magic number. Why wouldn’t we want to set that number to 10,000? Why would we ever want to refuse a connection? All good questions.

First, we should talk about limits. You can get the current maximum allowed listen queue size by inspecting Socket::SOMAXCONN at runtime. On my Mac this number is 128. So I’m not able to use a number larger than that. The root user is able to increase this limit at the system level for servers that need it.

Let’s say you’re running a server and you’re getting reports of Errno::ECONNREFUSED. Increasing the size of the listen queue would be a good starting point. But ultimately you don’t want to have connections waiting in your listen queue. That means that users of your service are having to wait for their responses. This may be an indication that you need more server instances or that you need a different architecture.

Generally you don’t want to be refusing connections. You can set the listen queue to the maximum allowed queue size using server.listen(Socket::SOMAXCONN).

Servers Accept

Finally we get to the part of the lifecycle where the server is actually able to handle an incoming connection. It does this with the accept method. Here’s how to create a listening socket and receive the first connection:

require 'socket'

# Create the server socket.
server = Socket.new(:INET, :STREAM)
addr = Socket.pack_sockaddr_in(4481, '0.0.0.0')
server.bind(addr)
server.listen(128)

# Accept a connection.
connection, _ = server.accept

Now if you run that code you’ll notice that it doesn’t return immediately! That’s right, the accept method will block until a connection arrives. Let’s give it one using netcat:

$ echo ohai | nc localhost 4481

When you run these snippets you should see the nc(1) program and the Ruby program exit successfully. It may not be the most epic finale ever, but it’s proof that everything is connected and working properly. Congrats!

Accept is blocking

The accept call is a blocking call. It will block the current thread indefinitely until it receives a new connection.

Remember the listen queue we talked about in the last chapter? accept simply pops the next pending connection off of that queue. If none are available it waits for one to be pushed onto it.

Accept returns an Array

In the example above I assigned two values from one call to accept. The accept method actually returns an Array. The Array contains two elements: first, the connection, and second, an Addrinfo object. This represents the remote address of the client connection.

Addrinfo

Addrinfo is a Ruby class that represents a host and port number. It wraps up an endpoint representation nicely. You’ll see it as part of the standard Socket interface in a few places.

You can construct one of these using something like Addrinfo.tcp('localhost', 4481). Some useful methods are #ip_address and #ip_port. Have a look at $ ri Addrinfo for more.

Let’s begin by taking a closer look at the connection and address returned from #accept.

require 'socket'

# Create the server socket.
server = Socket.new(:INET, :STREAM)
addr = Socket.pack_sockaddr_in(4481, '0.0.0.0')
server.bind(addr)
server.listen(128)

# Accept a new connection.
connection, _ = server.accept

print 'Connection class: '
p connection.class

print 'Server fileno: '
p server.fileno

print 'Connection fileno: '
p connection.fileno

print 'Local address: '
p connection.local_address

print 'Remote address: '
p connection.remote_address

When the server gets a connection (using the netcat snippet from above) it outputs:

Connection class: Socket
Server fileno: 5
Connection fileno: 8
Local address: #<Addrinfo: 127.0.0.1:4481 TCP>
Remote address: #<Addrinfo: 127.0.0.1:58164 TCP>

The results from this little bit of code tell us a ton about how TCP connections are handled. Let’s dissect it a little bit at a time.

Connection Class

Although accept returns a ‘connection’, this code tells us that there’s no special connection class. A connection is actually an instance of Socket.

File Descriptors

We know that accept is returning an instance of Socket, but this connection has a different file descriptor number (or fileno) than the server socket. The file descriptor number is the kernel’s method for keeping track of open files in the current process.

Sockets are Files?

Yep. At least in the land of Unix, everything is treated as a file. This includes files found on the filesystem as well as things like pipes, sockets, printers, etc.

This indicates that accept has returned a brand new Socket different from the server socket. This Socket instance represents the connection. This is important. Each connection is represented by a new Socket object so that the server socket can remain untouched and continue to accept new connections.

Connection Addresses

Our connection object knows about two addresses: the local address and the remote address. The remote address is the second return value returned from accept but can also be accessed as remote_address on the connection.

The local_address of the connection refers to the endpoint on the local machine. The remote_address of the connection refers to the endpoint at the other end, which might be on another host but, in our case, it’s on the same machine.

Each TCP connection is defined by this unique grouping of local-host, local-port, remote-host, and remote-port. The combination of these four properties must be unique for each TCP connection.

Let’s put that in perspective for a moment. You can initiate two simultaneous connections from the local host to a remote host, so long as the remote ports are unique. Similarly you can accept two simultaneous connections from a remote host to the same local port, provided that the remote ports are unique. But you cannot have two simultaneous connections to a remote host if the local ports and remote ports are identical.

The Accept Loop

So accept returns one connection. In our code examples above the server accepts one connection and then exits. When writing a production server it’s almost certain that we’d want to continually listen for incoming connections so long as there are more available. This is easily accomplished with a loop:

require 'socket'

# Create the server socket.
server = Socket.new(:INET, :STREAM)
addr = Socket.pack_sockaddr_in(4481, '0.0.0.0')
server.bind(addr)
server.listen(128)

# Enter an endless loop of accepting and
# handling connections.
loop do
  connection, _ = server.accept
  # handle connection
  connection.close
end

This is a common way to write certain kinds of servers using Ruby. It’s so common in fact that Ruby provides some syntactic sugar on top of it. We’ll look at Ruby wrapper methods at the end of this chapter.

Servers Close

Once a server has accepted a connection and finished processing it, the last thing for it to do is to close that connection. This rounds out the create-process-close lifecycle of a connection.

Rather than paste another block of code, I’ll refer you to the one above. It calls close on the connection before accepting a new one.

Closing on Exit

Why is close needed? When your program exits, all open file descriptors (including sockets) will be closed for you. So why should you close them yourself? There are a few good reasons:

Resource usage. If you’re done with a socket, but you don’t close it, it’s possible to store references to sockets you’re no longer using. In Ruby’s case the garbage collector is your friend, cleaning up any unreferenced connections for you, but it’s a good idea to maintain full control over your resource usage and get rid of stuff you don’t need. Note that the garbage collector will close anything that it collects.
Open file limit. This is really an extension of the previous one. Every process is subject to a limit on the number of open files it can have. Remember that each connection is a file? Keeping around unneeded connections will continue to bring your process closer and closer to this open file limit, which may cause issues later.

To find out the allowed number of open files for the current process you can use Process.getrlimit(:NOFILE). The returned value is an Array of the soft limit (user-configurable) and hard limit (system), respectively.

If you want to bump up your limit to the maximum then you can Process.setrlimit(Process.getrlimit(:NOFILE)[1]).

Different Kinds of Closing

Given that sockets allow two-way communication (read/write) it’s actually possible to close just one of those channels.

require 'socket'

# Create the server socket.
server = Socket.new(:INET, :STREAM)
addr = Socket.pack_sockaddr_in(4481, '0.0.0.0')
server.bind(addr)
server.listen(128)
connection, _ = server.accept

# After this the connection may no longer write data, but may still read data.
connection.close_write

# After this the connection may no longer read or write any data.
connection.close_read

Closing the write stream will send an EOF to the other end of the socket (more on EOF soon).

The close_write and close_read methods make use of shutdown(2) under the hood. shutdown(2) is notably different than close(2) in that it causes a part of the connection to be fully shut down, even if there are copies of it lying around.

How are there copies of connections?

It’s possible to create copies of file descriptors using Socket#dup. This will actually duplicate the underlying file descriptor at the operating system level using dup(2). But this is pretty uncommon, and you probably won’t see it.

The more common way that you can get a copy of a file descriptor is through Process.fork. This method creates a brand new process (Unix only) that’s an exact copy of the current process. Besides providing a copy of everything in memory, any open file descriptors are dup(2)ed so that the new process gets a copy.

close will close the socket instance on which it’s called. If there are other copies of the socket in the system then those will not be closed and the underlying resources will not be reclaimed. Indeed, other copies of the connection may still exchange data even if one instance is closed.

So shutdown, unlike close, will fully shut down communication on the current socket and other copies of it, thereby disabling any communication happening on the current instance as well as any copies. But it does not reclaim resources used by the socket. Each individual socket instance must still be closed to complete the lifecycle.

require 'socket'

# Create the server socket.
server = Socket.new(:INET, :STREAM)
addr = Socket.pack_sockaddr_in(4481, '0.0.0.0')
server.bind(addr)
server.listen(128)
connection, _ = server.accept

# Create a copy of the connection.
copy = connection.dup

# This shuts down communication on all copies of the connection.
connection.shutdown

# This closes the original connection. The copy will be closed
# when the GC collects it.
connection.close

Ruby Wrappers

We all know and love the elegant syntax that Ruby offers, and its extensions for creating and working with server sockets are no exception. These convenience methods wrap up the boilerplate code in custom classes and leverage Ruby blocks where possible. Let’s have a look.

Server Construction

First up is the TCPServer class. It’s a clean way to abstract the ‘server construction’ part of the process.

require 'socket'

server = TCPServer.new(4481)

Ah, now that feels more like Ruby code. That code is effectively replacing this:

require 'socket'

server = Socket.new(:INET, :STREAM)
addr = Socket.pack_sockaddr_in(4481, '0.0.0.0')
server.bind(addr)
server.listen(5)

I know which one I prefer to use!

Creating a TCPServer instance actually returns an instance of TCPServer, not Socket. The interface exposed by each of them is nearly identical, but with some key differences. The most notable of which is that TCPServer#accept returns only the connection, not the remote_address.

Notice that we didn’t specify the size of the listen queue for these constructors? Rather than using Socket::SOMAXCONN, Ruby defaults to a listen queue of size 5. If you need a bigger listen queue you can call TCPServer#listen after the fact.

As IPv6 gains momentum, your servers may need to be able to handle both IPv4 and IPv6. Using this Ruby wrapper will return two TCP sockets, one that can be reached via IPv4 and one that can be reached via IPv6, both listening on the same port.

require 'socket'

servers = Socket.tcp_server_sockets(4481)

Connection Handling

Besides constructing servers, Ruby also provides nice abstractions for handling connections.

Remember using loop to handle multiple connections? Using loop is for chumps. Do it like this:

require 'socket'

# Create the listener socket.
server = TCPServer.new(4481)

# Enter an endless loop of accepting and
# handling connections.
Socket.accept_loop(server) do |connection|
  # handle connection
  connection.close
end

Note that connections are not automatically closed at the end of each block. The arguments that get passed into the block are the exact same ones that are returned from a call to accept.

Socket.accept_loop has the added benefit that you can actually pass multiple listening sockets to it and it will accept connections on any of the passed-in sockets. This goes really well with Socket.tcp_server_sockets:

require 'socket'

# Create the listener socket.
servers = Socket.tcp_server_sockets(4481)

# Enter an endless loop of accepting and
# handling connections.
Socket.accept_loop(servers) do |connection|
  # handle connection
  connection.close
end

Notice that we’re passing a collection of sockets to Socket.accept_loop and it handles them gracefully.

Wrapping it all into one

The granddaddy of the Ruby wrappers is Socket.tcp_server_loop, it wraps all of the previous steps into one:

require 'socket'

Socket.tcp_server_loop(4481) do |connection|
  # handle connection
  connection.close
end

This method is really just a wrapper around Socket.tcp_server_sockets and Socket.accept_loop, but you can’t write it any more succinctly than that!

System Calls From This Chapter

Socket#bind -> bind(2)
Socket#listen -> listen(2)
Socket#accept -> accept(2)
Socket#local_address -> getsockname(2)
Socket#remote_address -> getpeername(2)
Socket#close -> close(2)
Socket#close_write -> shutdown(2)
Socket#shutdown -> shutdown(2)

Server Lifecycle

Servers Bind#

What port should I bind to?#

What address should I bind to?#

Servers Listen#

The Listen Queue#

How big should the listen queue be?#

Servers Accept#

Accept is blocking#

Accept returns an Array#

Connection Class#

File Descriptors#

Connection Addresses#

The Accept Loop#

Servers Close#

Closing on Exit#

Different Kinds of Closing#

Ruby Wrappers#

Server Construction#

Connection Handling#

Wrapping it all into one#

System Calls From This Chapter#

Servers Bind

What port should I bind to?

What address should I bind to?

Servers Listen

The Listen Queue

How big should the listen queue be?

Servers Accept

Accept is blocking

Accept returns an Array

Connection Class

File Descriptors

Connection Addresses

The Accept Loop

Servers Close

Closing on Exit

Different Kinds of Closing

Ruby Wrappers

Server Construction

Connection Handling

Wrapping it all into one

System Calls From This Chapter