37 KiB
						
					
					
				
			
		
		
	
	SCTP user-land implementation (usrsctp)
SCTP is a message oriented, reliable transport protocol with direct support for multihoming that runs on top of IP or UDP, and supports both v4 and v6 versions.
Like TCP, SCTP provides reliable, connection oriented data delivery with congestion control. Unlike TCP, SCTP also provides message boundary preservation, ordered and unordered message delivery, multi-streaming and multi-homing. Detection of data corruption, loss of data and duplication of data is achieved by using checksums and sequence numbers. A selective retransmission mechanism is applied to correct loss or corruption of data.
In this manual the socket API for the SCTP User-land implementation will be described. It is based on RFC 6458. The main focus of this document is on pointing out the differences to the SCTP Sockets API. For all aspects of the sockets API that are not mentioned in this document, please refer to RFC 6458. Questions about SCTP itself can hopefully be answered by RFC 4960.
Getting Started
The user-land stack has been tested on FreeBSD 10.0, Ubuntu 11.10, Windows 7, Mac OS X 10.6, and Mac OS X 10.7. The current version of the user-land stack is provided on github. Download the tarball and untar it in a folder of your choice. The tarball contains all the sources to build the libusrsctp, which has to be linked to the object file of an example program. In addition there are two applications in the folder programs that can be built and run.
Building the Library and the Applications
Unix-like Operating Systems
In the folder usrsctp type
$ ./bootstrap
$ ./configure
$ make
Now, the library libusrsctp.la has been built in the subdirectory usrsctplib, and the example programs are ready to run from the subdirectory programs.
If you have root privileges or are in the sudoer group, you can install the library in /usr/local/lib and copy the header file to /usr/include with the command
$ sudo make install
Windows
On Windows you need a compiler like Microsoft Visual Studio. You can build the library and the example programs with the command line tool of the compiler by typing
$ nmake -f Makefile.nmake
in the directory usrsctp.
CMake
Create a directory outside the usrsctp directory, enter it and generate files by typing
$ cmake <path-to-usrsctp-sources>
$ cmake --build .
By default CMake generates a DEBUG build with verbose output.
Running the Test Programs
Several test programs are included, including a discard server and a client. You can run both to send data from the client to the server. The client reads data from stdin and sends them to the server, which prints the message in the terminal and discards it. The sources of the server are also provided here and those of the client here.
Using UDP Encapsulation
Both programs can either send data over SCTP directly or use UDP encapsulation, thus encapsulating the SCTP packet in a UDP datagram. The first mode works on loopback or in a protected setup without any NAT boxes involved. In all other cases it is better to use UDP encapsulation.
The usage of the discard_server is
$ discard_server [local_encaps_port remote_encaps_port]
For UDP encapsulation the ports have to be specified. The local and remote encapsulation ports can be arbitrarily set. For example, you can call
$ ./discard_server 11111 22222
on a Unix-like OS and
$ discard_server.exe 11111 22222
on Windows.
The client needs two additional parameters, the server's address and its port. Its usage is
$ client remote_addr remote_port [local_port local_encaps_port remote_encaps_port]
The remote address is the server's address. If client and server are started on the same machine, the loopback address 127.0.0.1 can be used for Unix-like OSs and the local address on Windows. The discard port is 9, thus 9 has to be taken as remote port. The encapsulation ports have to match those of the server, i.e. the server's local_encaps_port is the client's remote_encaps_port and vice versa. Thus, the client can be started with
$ ./client 127.0.0.1 9 0 22222 11111
on a Unix-like OS and
$ client.exe 192.168.0.1 9 0 22222 11111
on Windows provided your local IP address is 192.168.0.1.
Sending over SCTP
To send data over SCTP directly you might need root privileges because raw sockets are used. Thus instead of specifying the encapsulation ports you have to start the programs prepending sudo or in case of Windows start the program from an administrator console.
Using the Callback API
Instead of asking constantly for new data, a callback API can be used that is triggered by SCTP. A callback function has to be registered that will be called whenever data is ready to be delivered to the application.
The discard_server has a flag to switch between the two modi. If  use_cb is set to 1, the callback API will be used. To change the setting, just set the flag and compile the program again.
Basic Operations
All system calls start with the prefix usrsctp_ to distinguish them from the kernel variants. Some of them are changed to account for the different demands in the userland environment.
Differences to RFC 6458
usrsctp_init()
Every application has to start with usrsctp_init(). This function calls sctp_init() and reserves the memory necessary to administer the data transfer. The function prototype is
void usrsctp_init(uint16_t udp_port)
As it is not always possible to send data directly over SCTP because not all NAT boxes can process SCTP packets, the data can be sent over UDP. To encapsulate SCTP into UDP a UDP port has to be specified, to which the datagrams can be sent. This local UDP port  is set with the parameter udp_port. The default value is 9899, the standard UDP encapsulation port. If UDP encapsulation is not necessary, the UDP port has to be set to 0.
usrsctp_finish()
At the end of the program usrsctp_finish() should be called to free all the memory that has been allocated before. The function prototype is
int usrsctp_finish(void)
The return code is 0 on success and -1 in case of an error.
usrsctp_socket()
A representation of an SCTP endpoint is a socket. Is it created with usrsctp_socket(). The function prototype is:
struct socket *
usrsctp_socket(int domain,
               int type,
               int protocol,
               int (*receive_cb)(struct socket *sock,
                                 union sctp_sockstore addr,
                                 void *data,
                                 size_t datalen,
                                 struct sctp_rcvinfo,
                                 int flags,
                                 void *ulp_info),
               int (*send_cb)(struct socket *sock,
                              uint32_t sb_free),
               uint32_t sb_threshold,
               void *ulp_info)
The arguments taken from RFC 6458 are:
- domain: PF_INET or PF_INET6 can be used.
- type: In case of a one-to-many style socket it is SOCK_SEQPACKET, in case of a one-to-one style socket it is SOCK_STREAM. For an explanation of the differences between the socket types please refer to RFC 6458.
- protocol: Set IPPROTO_SCTP.
In usrsctp, a callback API can be used.
- The function pointers of the receive and send callbacks are new arguments to the socket call. If no callback API is used, these must be NULL.
- The sb_thresholdspecifies the amount of free space in the send socket buffer before the send function in the application is called. If a send callback function is specified andsb_thresholdis 0, the function is called whenever there is room in the send socket buffer.
- Additional data may be passed along within the ulp_infoparameter. This value will be passed to thereceive_cbwhen it is invoked.
On success usrsctp_socket() returns the pointer to the new socket in the struct socket data type. It will be needed in all other system calls. In case of a failure NULL is returned and errno is set to the appropriate error code.
usrsctp_close()
The function prototype of usrsctp_close() is
void usrsctp_close(struct socket *so)
Thus the only difference is the absence of a return code.
Same Functionality as RFC 6458
The following functions have the same functionality as their kernel pendants. There prototypes are described in the following subsections. For a detailed description please refer to RFC 6458.
usrsctp_bind()
int
usrsctp_bind(struct socket *so,
             struct sockaddr *addr,
             socklen_t addrlen)
- so: Pointer to the socket as returned by usrsctp_socket().
- addr: The address structure (struct sockaddr_infor an IPv4 address orstruct sockaddr_in6for an IPv6 address).
- addrlen: The size of the address structure.
usrsctp_bind() returns 0 on success and -1 in case of an error.
usrsctp_listen()
int
usrsctp_listen(struct socket *so,
               int backlog)
- so: Pointer to the socket as returned by usrsctp_socket().
- backlog: If backlog is non-zero, enable listening, else disable listening.
usrsctp_listen() returns 0 on success and -1 in case of an error.
usrsctp_accept()
struct socket *
usrsctp_accept(struct socket *so,
               struct sockaddr * addr,
               socklen_t * addrlen)
- so: Pointer to the socket as returned by usrsctp_socket().
- addr: On return,  the primary address of the peer (struct sockaddr_infor an IPv4 address orstruct sockaddr_in6for an IPv6 address).
- addrlen: Size of the returned address structure.
usrsctp_accept() returns the accepted socket on success and NULL in case of an error.
usrsctp_connect()
int
usrsctp_connect(struct socket *so,
                struct sockaddr *name,
                socklen_t addrlen)
- so: Pointer to the socket as returned by usrsctp_socket().
- name: Address of the peer to connect to (struct sockaddr_infor an IPv4 address orstruct sockaddr_in6for an IPv6 address).
- addrlen: Size of the peer's address.
usrsctp_connect() returns 0 on success and -1 in case of an error.
usrsctp_shutdown()
int
usrsctp_shutdown(struct socket *so, int how)
- so: Pointer to the socket of the association to be closed
- how: Specifies the type of shutdown.  The values are as follows:
- SHUT_RD: Disables further receive operations. No SCTP protocol action is taken.
- SHUT_WR: Disables further send operations, and initiates the SCTP shutdown sequence.
- SHUT_RDWR: Disables further send and receive operations, and initiates the SCTP shutdown sequence.
 
usrsctp_shutdown() returns 0 on success and -1 in case of an error.
Sending and Receiving Data
Since the publication of RFC 6458 there is only one function for sending and one for receiving that is not deprecated. Therefore, only these two are described here.
usrsctp_sendv()
ssize_t
usrsctp_sendv(struct socket *so,
              const void *data,
              size_t len,
              struct sockaddr *addrs,
              int addrcnt,
              void *info,
              socklen_t infolen,
              unsigned int infotype,
              int flags)
- so: The socket to send data on.
- data: As it is more convenient to send data in a buffer and not a struct iovecdata structure, we chose to pass the data as a void pointer.
- len: Length of the data.
- addrs: In this version of usrsctp at most one destination address is supported. In the case of a connected socket, the parameter addrscan be set to NULL.
- addrcnt: Number of addresses. As at most one address is supported, addrcnt is 0 if addrs is NULL and 1 otherwise.
- info: Additional information for a message is stored in void *info. The data typesstruct sctp_sndinfo,struct sctp_prinfo, andstruct sctp_sendv_spaare supported as defined in RFC 6458. Support forstruct sctp_authinfois not implemented yet, therefore, errno is set EINVAL and -1 will be returned, if it is used.
- infolen: Length of info in bytes.
- infotype: Identifies the type of the information provided in info. Possible values are
- SCTP_SENDV_NOINFO
- SCTP_SENDV_SNDINFO
- SCTP_SENDV_PRINFO
- SCTP_SENDV_SPA (For additional information please refer to RFC 6458.)
 
- flags: Flags as described in RFC 6458.
usrsctp_sendv() returns the number of bytes sent, or -1 if an error occurred.  The variable errno is then set appropriately.
usrsctp_recvv()
ssize_t
usrsctp_recvv(struct socket *so,
             void *dbuf,
             size_t len,
             struct sockaddr *from,
             socklen_t * fromlen,
             void *info,
             socklen_t *infolen,
             unsigned int *infotype,
             int *msg_flags)
- so: The socket to receive data on.
- dbuf: Analog to usrsctp_sendv()the data is returned in a buffer.
- len: Length of the buffer in bytes.
- from: A pointer to an address to be filled with the sender of the received message's address.
- fromlen: An in/out parameter describing the from length.
- info: A pointer to the buffer to hold the attributes of the received message.  The structure type of info is determined by the infotype parameter. The attributes returned in infohave to be handled in the same way as specified in RFC 6458.
- infolen: An in/out parameter describing the size of the info buffer.
- infotype:  On return, *infotypeis set to the type of the info buffer. The current defined values are- SCTP_RECVV_NOINFO
- SCTP_RECVV_RCVINFO
- SCTP_RECVV_NXTINFO
- SCTP_RECVV_RN (A detailed description is given in RFC 6458)
 
- flags: A pointer to an integer to be filled with any message flags (e.g., MSG_NOTIFICATION). Note that this field is an in/out parameter. Options for the receive may also be passed into the value (e.g.,MSG_EOR). Returning from the call, the flags' value will differ from its original value.
usrsctp_recvv() returns the number of bytes received, or -1 if an error occurred.  The variable errno is then set appropriately.
Socket Options
Socket options are used to change the default behavior of socket calls. Their behavior is specified in RFC 6458. The functions to get or set them are
int
usrsctp_getsockopt(struct socket *so,
                     int level,
                     int optname,
                     void *optval,
                     socklen_t *optlen)
and
int
usrsctp_setsockopt(struct socket *so,
                     int level,
                     int optname,
                     const void *optval,
                     socklen_t optlen)
and the arguments are
- so: The socket of type struct socket.
- level: Set to IPPROTO_SCTP for all SCTP options.
- optname: The option name as specified in The Socket Options table below.
- optval: The buffer to store the value of the option as specified in the second column of Socket Options below.
- optlen:  The size of the buffer (or the length of the option returned in case of usrsctp_getsockopt).
These functions return 0 on success and -1 in case of an error.
Socket Options supported by usrsctp
| Option | Datatype | r/w | 
|---|---|---|
| SCTP_RTOINFO | struct sctp_rtoinfo | r/w | 
| SCTP_ASSOCINFO | struct sctp_assocparams | r/w | 
| SCTP_INITMSG | struct sctp_initmsg | r/w | 
| SCTP_NODELAY | int | r/w | 
| SCTP_AUTOCLOSE | int | r/w | 
| SCTP_PRIMARY_ADDR | struct sctp_setprim | r/w | 
| SCTP_ADAPTATION_LAYER | struct sctp_setadaptation | r/w | 
| SCTP_DISABLE_FRAGMENTS | int | r/w | 
| SCTP_PEER_ADDR_PARAMS | struct sctp_paddrparams | r/w | 
| SCTP_I_WANT_MAPPED_V4_ADDR | int | r/w | 
| SCTP_MAXSEG | struct sctp_assoc_value | r/w | 
| SCTP_DELAYED_SACK | struct sctp_sack_info | r/w | 
| SCTP_FRAGMENT_INTERLEAVE | int | r/w | 
| SCTP_PARTIAL_DELIVERY_POINT | int | r/w | 
| SCTP_HMAC_IDENT | struct sctp_hmacalgo | r/w | 
| SCTP_AUTH_ACTIVE_KEY | struct sctp_authkeyid | r/w | 
| SCTP_AUTO_ASCONF | int | r/w | 
| SCTP_MAX_BURST | struct sctp_assoc_value | r/w | 
| SCTP_CONTEXT | struct sctp_assoc_value | r/w | 
| SCTP_EXPLICIT_EOR | int | r/w | 
| SCTP_REUSE_PORT | int | r/w | 
| SCTP_EVENT | struct sctp_event | r/w | 
| SCTP_RECVRCVINFO | int | r/w | 
| SCTP_RECVNXTINFO | int | r/w | 
| SCTP_DEFAULT_SNDINFO | struct sctp_sndinfo | r/w | 
| SCTP_DEFAULT_PRINFO | struct sctp_default_prinfo | r/w | 
| SCTP_REMOTE_UDP_ENCAPS_PORT | int | r/w | 
| SCTP_ENABLE_STREAM_RESET | struct sctp_assoc_value | r/w | 
| SCTP_STATUS | struct sctp_status | r | 
| SCTP_GET_PEER_ADDR_INFO | struct sctp_paddrinfo | r | 
| SCTP_PEER_AUTH_CHUNKS | struct sctp_authchunks | r | 
| SCTP_LOCAL_AUTH_CHUNKS | struct sctp_authchunks | r | 
| SCTP_GET_ASSOC_NUMBER | uint32_t | r | 
| SCTP_GET_ASSOC_ID_LIST | struct sctp_assoc_ids | r | 
| SCTP_RESET_STREAMS | struct sctp_reset_streams | w | 
| SCTP_RESET_ASSOC | struct sctp_assoc_t | w | 
| SCTP_ADD_STREAMS | struct sctp_add_streams | w | 
Further usage details are described in RFC 6458, RFC 6525, and draft-ietf-tsvwg-sctp-udp-encaps-03 (work in progress).
Sysctl variables
In kernel implementations like for instance FreeBSD, it is possible to change parameters in the operating system. These parameters are called sysctl variables.
In usrsctp applications can set or retrieve these variables with the functions
void usrsctp_sysctl_set_ ## (uint32_t value)
and
uint32_t usrsctp_sysctl_get_ ## (void)
respectively, where ## stands for the name of the variable.
In the following paragraphs a short description of the parameters will be given.
Manipulate Memory
usrsctp_sysctl_set_sctp_sendspace()
The space of the available send buffer can be changed from its default value of 262,144 bytes to a value between 0 and 2^32 - 1 bytes.
usrsctp_sysctl_set_sctp_recvspace()
The space of the available receive buffer can be changed from its default value of 262,144 bytes to a value between 0 and 2^32 - 1 bytes.
usrsctp_sysctl_set_sctp_hashtblsize()
The TCB (Thread Control Block) hash table sizes, i.e. the size of one TCB in the hash table, can be tuned between 1 and 2^32 - 1 bytes. The default value is 1,024 bytes. A TCB contains for instance pointers to the socket, the endpoint, information about the association and some statistic data.
usrsctp_sysctl_set_sctp_pcbtblsize()
The PCB (Protocol Control Block) hash table sizes, i.e. the size of one PCB in the hash table, can be tuned between 1 and 2^32 - 1 bytes. The default value is 256 bytes. The PCB contains all variables that characterize an endpoint.
usrsctp_sysctl_set_sctp_system_free_resc_limit()
This parameters tunes the maximum number of cached resources in the system. It can be set between 0 and 2^32 - 1. The default value is 1000.
usrsctp_sysctl_set_sctp_asoc_free_resc_limit()
This parameters tunes the maximum number of cached resources in an association. It can be set between 0 and 2^32 - 1. The default value is 10.
usrsctp_sysctl_set_sctp_mbuf_threshold_count()
Data is stored in mbufs. Several mbufs can be chained together. The maximum number of small mbufs in a chain can be set with this parameter, before an mbuf cluset is used. The default is 5.
usrsctp_sysctl_set_sctp_add_more_threshold()
TBD This parameter configures the threshold below which more space should be added to a socket send buffer. The default value is 1452 bytes.
Configure RTO
The retransmission timeout (RTO), i.e. the time that controls the retransmission of messages, has several parameters, that can be changed, for example to shorten the time, before a message is retransmitted. The range of these parameters is between 0 and 2^32 - 1ms.
usrsctp_sysctl_set_sctp_rto_max_default()
The default value for the maximum retransmission timeout in ms is 60,000 (60secs).
usrsctp_sysctl_set_sctp_rto_min_default()
The default value for the minimum retransmission timeout in ms is 1,000 (1sec).
usrsctp_sysctl_set_sctp_rto_initial_default()
The default value for the initial retransmission timeout in ms is 3,000 (3sec). This value is only needed before the first calculation of a round trip time took place.
usrsctp_sysctl_set_sctp_init_rto_max_default()
The default value for the maximum retransmission timeout for an INIT chunk in ms is 60,000 (60secs).
Set Timers
usrsctp_sysctl_set_sctp_valid_cookie_life_default()
A cookie has a specified life time. If it expires the cookie is not valid any more and an ABORT is sent. The default value in ms is 60,000 (60secs).
usrsctp_sysctl_set_sctp_heartbeat_interval_default()
Set the default time between two heartbeats. The default is 30,000ms.
usrsctp_sysctl_set_sctp_shutdown_guard_time_default()
If a SHUTDOWN is not answered with a SHUTDOWN-ACK while the shutdown guard timer is still running, the association will be aborted after the default of 180secs.
usrsctp_sysctl_set_sctp_pmtu_raise_time_default()
TBD To set the size of the packets to the highest value possible, the maximum transfer unit (MTU) of the complete path has to be known. The default time interval for the path mtu discovery is 600secs.
usrsctp_sysctl_set_sctp_secret_lifetime_default()
TBD The default secret lifetime of a server is 3600secs.
usrsctp_sysctl_set_sctp_vtag_time_wait()
TBD Vtag time wait time, 0 disables it. Default: 60secs
Set Failure Limits
Transmissions and retransmissions of messages might fail. To protect the system against too many retransmissions, limits have to be defined.
usrsctp_sysctl_set_sctp_init_rtx_max_default()
The default maximum number of retransmissions of an INIT chunks is 8, before an ABORT is sent.
usrsctp_sysctl_set_sctp_assoc_rtx_max_default()
This parameter sets the maximum number of failed retransmissions before the association is aborted. The default value is 10.
usrsctp_sysctl_set_sctp_path_rtx_max_default()
This parameter sets the maximum number of path failures before the association is aborted. The default value is 5. Notice that the number of paths multiplied by this value should be equal to sctp_assoc_rtx_max_default. That means that the default configuration is good for two paths.
usrsctp_sysctl_set_sctp_max_retran_chunk()
The parameter configures how many times an unlucky chunk can be retransmitted before the association aborts. The default is set to 30.
usrsctp_sysctl_set_sctp_path_pf_threshold()
TBD Default potentially failed threshold. Default: 65535
usrsctp_sysctl_set_sctp_abort_if_one_2_one_hits_limit()
TBD When one-2-one hits qlimit abort. Default: 0
Control the Sending of SACKs
usrsctp_sysctl_set_sctp_sack_freq_default()
The SACK frequency defines the number of packets that are awaited, before a SACK is sent. The default value is 2.
usrsctp_sysctl_set_sctp_delayed_sack_time_default()
As a SACK (Selective Acknowlegment) is sent after every other packet, a timer is set to send a SACK in case another packet does not arrive in due time. The default value for this timer is 200ms.
usrsctp_sysctl_set_sctp_strict_sacks()
TBD This is a flag to turn the controlling of the coherence of SACKs on or off. The default value is 1 (on).
usrsctp_sysctl_set_sctp_nr_sack_on_off()
If a slow hosts receives data on a lossy link it is possible that its receiver window is full and new data can only be accepted if one chunk with a higher TSN (Transmission Sequence Number) that has previously been acknowledged is dropped. As a consequence the sender has to store data, even if they have been acknowledged in case they have to be retransmitted. If this behavior is not necessary, non-renegable SACKs can be turned on. By default the use of non-renegable SACKs is turned off.
usrsctp_sysctl_set_sctp_enable_sack_immediately()
In some cases it is not desirable to wait for the SACK timer to expire before a SACK is sent. In these cases a bit called SACK-IMMEDIATELY (see draft-tuexen-tsvwg-sctp-sack-immediately-09) can be set to provoke the instant sending of a SACK. The default is to turn it off.
usrsctp_sysctl_set_sctp_L2_abc_variable()
TBD SCTP ABC max increase per SACK (L). Default: 1
Change Max Burst
Max burst defines the maximum number of packets that may be sent in one flight.
usrsctp_sysctl_set_sctp_max_burst_default()
The default value for max burst is 0, which means that the number of packets sent as a flight is not limited by this parameter, but may be by another one, see the next paragraph.
usrsctp_sysctl_set_sctp_use_cwnd_based_maxburst()
The use of max burst is based on the size of the congestion window (cwnd). This parameter is set by default.
usrsctp_sysctl_set_sctp_hb_maxburst()
Heartbeats are mostly used to verify a path. Their number can be limited. The default is 4.
usrsctp_sysctl_set_sctp_fr_max_burst_default()
In the state of fast retransmission the number of packet bursts can be limited. The default value is 4.
Handle Chunks
usrsctp_sysctl_set_sctp_peer_chunk_oh()
In order to keep track of the peer's advertised receiver window, the sender calculates the window by subtracting the amount of data sent. Yet, some OSs reduce the receiver window by the real space needed to store the data. This parameter sets the additional amount to debit the peer's receiver window per chunk sent. The default value is 256, which is the value needed by FreeBSD.
usrsctp_sysctl_set_sctp_max_chunks_on_queue()
This parameter sets the maximum number of chunks that can be queued per association. The default value is 512.
usrsctp_sysctl_set_sctp_min_split_point()
TBD The minimum size when splitting a chunk is 2904 bytes by default.
usrsctp_sysctl_set_sctp_chunkscale()
TBD This parameter can be tuned for scaling of number of chunks and messages. The default is10.
usrsctp_sysctl_set_sctp_min_residual()
TBD This parameter configures the minimum size of the residual data chunk in the second part of the split. The default is 1452.
Calculate RTT
The calculation of the round trip time (RTT) depends on several parameters.
usrsctp_sysctl_set_sctp_rttvar_bw()
TBD Shift amount for bw smoothing on rtt calc. Default: 4
usrsctp_sysctl_set_sctp_rttvar_rtt()
TBD Shift amount for rtt smoothing on rtt calc. Default: 5
usrsctp_sysctl_set_sctp_rttvar_eqret()
TBD What to return when rtt and bw are unchanged. Default: 0
Influence the Congestion Control
The congestion control should protect the network against fast senders.
usrsctp_sysctl_set_sctp_ecn_enable
Explicit congestion notifications are turned on by default.
usrsctp_sysctl_set_sctp_default_cc_module()
This parameter sets the default algorithm for the congestion control. Default is 0, i.e. the one specified in RFC 4960.
usrsctp_sysctl_set_sctp_initial_cwnd()
Set the initial congestion window in MTUs. The default is 3.
usrsctp_sysctl_set_sctp_use_dccc_ecn()
TBD Enable for RTCC CC datacenter ECN. Default: 1
usrsctp_sysctl_set_sctp_steady_step()
TBD How many the sames it takes to try step down of cwnd. Default: 20
Configure AUTH and ADD-IP
An important extension of SCTP is the dynamic address reconfiguration (see RFC 5061), also known as ADD-IP, which allows the changing of addresses during the lifetime of an association. For this feature the AUTH extension (see RFC 4895) is necessary.
usrsctp_sysctl_set_sctp_auto_asconf()
If SCTP Auto-ASCONF is enabled, the peer is informed automatically when a new address is added or removed. This feature is enabled by default.
usrsctp_sysctl_set_sctp_multiple_asconfs()
By default the sending of multiple ASCONFs is disabled.
usrsctp_sysctl_set_sctp_auth_enable()
The use of AUTH, which is normally turned on, can be disabled by setting this parameter to 0.
usrsctp_sysctl_set_sctp_asconf_auth_nochk()
It is also possible to disable the requirement to use AUTH in conjunction with ADD-IP by setting this parameter to 1.
Concurrent Multipath Transfer (CMT)
A prominent feature of SCTP is the possibility to use several addresses for the same association. One is the primary path, and the others are needed in case of a path failure. Using CMT the data is sent on several paths to enhance the throughput.
usrsctp_sysctl_set_sctp_cmt_on_off()
To turn CMT on, this parameter has to be set to 1.
usrsctp_sysctl_set_sctp_cmt_use_dac()
To use delayed acknowledgments with CMT this parameter has to be set to 1.
usrsctp_sysctl_set_sctp_buffer_splitting()
For CMT it makes sense to split the send and receive buffer to have shares for each path. By default buffer splitting is turned off.
Network Address Translation (NAT)
To be able to pass NAT boxes, the boxes have to handle SCTP packets in a specific way.
usrsctp_sysctl_set_sctp_nat_friendly()
SCTP NAT friendly operation. Default:1
usrsctp_sysctl_set_sctp_inits_include_nat_friendly()
Enable sending of the nat-friendly SCTP option on INITs. Default: 0
usrsctp_sysctl_set_sctp_udp_tunneling_port()
Set the SCTP/UDP tunneling port. Default: 9899
SCTP Mobility
usrsctp_sysctl_set_sctp_mobility_base()
TBD Enable SCTP base mobility. Default: 0
usrsctp_sysctl_set_sctp_mobility_fasthandoff()
TBD Enable SCTP fast handoff. default: 0
Miscellaneous
usrsctp_sysctl_set_sctp_no_csum_on_loopback()
Calculating the checksum for packets sent on loopback is turned off by default. To turn it on, set this parameter to 0.
usrsctp_sysctl_set_sctp_nr_outgoing_streams_default()
The peer is notified about the number of outgoing streams in the INIT or INIT-ACK chunk. The default is 10.
usrsctp_sysctl_set_sctp_do_drain()
Determines whether SCTP should respond to the drain calls. Default: 1
usrsctp_sysctl_set_sctp_strict_data_order()
TBD Enforce strict data ordering, abort if control inside data. Default: 0
usrsctp_sysctl_set_sctp_default_ss_module()
Set the default stream scheduling module. Implemented modules are:
- SCTP_SS_DEFAULT
- SCTP_SS_ROUND_ROBIN
- SCTP_SS_ROUND_ROBIN_PACKET
- SCTP_SS_PRIORITY
- SCTP_SS_FAIR_BANDWITH
- SCTP_SS_FIRST_COME
usrsctp_sysctl_set_sctp_default_frag_interleave()
TBD Default fragment interleave level. Default: 1
usrsctp_sysctl_set_sctp_blackhole()
TBD Enable SCTP blackholing. Default: 0
usrsctp_sysctl_set_sctp_logging_level()
Set the logging level. The default is 0.
usrsctp_sysctl_set_sctp_debug_on()
Turn debug output on or off. It is disabled by default. To obtain debug output, SCTP_DEBUG has to be set as a compile flag.
sysctl variables supported by usrsctp
| Parameter | Meaning | Default Value | 
|---|---|---|
| sctp_sendspace | Send buffer space | 1864135 | 
| sctp_recvspace | Receive buffer space | 1864135 | 
| sctp_hashtblsize | Tunable for TCB hash table sizes | 1024 | 
| sctp_pcbtblsize | Tunable for PCB hash table sizes | 256 | 
| sctp_system_free_resc_limit | Cached resources in the system | 1000 | 
| sctp_asoc_free_resc_limit | Cashed resources in an association | 10 | 
| sctp_rto_max_default | Default value for RTO_max | 60000ms | 
| sctp_rto_min_default | Default value for RTO_min | 1000ms | 
| sctp_rto_initial_default | Default value for RTO_initial | 3000ms | 
| sctp_init_rto_max_default | Default value for the maximum RTO for sending an INIT | 60000ms | 
| sctp_valid_cookie_life_default | Valid cookie life time | 60000ms | 
| sctp_init_rtx_max_default | Maximum number of INIT retransmissions | 8 | 
| sctp_assoc_rtx_max_default | Maximum number of failed retransmissions before the association is aborted | 10 | 
| sctp_path_rtx_max_default | Maximum number of failed retransmissions before a path fails | 5 | 
| sctp_ecn_enable | Enabling explicit congestion notifications | 1 | 
| sctp_strict_sacks | Control the coherence of SACKs | 1 | 
| sctp_delayed_sack_time_default | Default delayed SACK timer | 200ms | 
| sctp_sack_freq_default | Default SACK frequency | 2 | 
| sctp_nr_sack_on_off | Turn non-renegable SACKs on or off | 0 | 
| sctp_enable_sack_immediately | Enable sending of the SACK-IMMEDIATELY bit | 0 | 
| sctp_no_csum_on_loopback | Enable the compilation of the checksum on packets sent on loopback | 1 | 
| sctp_peer_chunk_oh | Amount to debit peers rwnd per chunk sent | 256 | 
| sctp_max_burst_default | Default max burst for SCTP endpoints | 0 | 
| sctp_use_cwnd_based_maxburst | Use max burst based on the size of the congestion window | 1 | 
| sctp_hb_maxburst | Confirmation Heartbeat max burst | 4 | 
| sctp_max_chunks_on_queue | Default max chunks on queue per asoc | 512 | 
| sctp_min_split_point | Minimum size when splitting a chunk | 2904 | 
| sctp_chunkscale | Tunable for Scaling of number of chunks and messages | 10 | 
| sctp_mbuf_threshold_count | Maximum number of small mbufs in a chain | 5 | 
| sctp_heartbeat_interval_default | Deafult time between two Heartbeats | 30000ms | 
| sctp_pmtu_raise_time_default | Default PMTU raise timer | 600secs | 
| sctp_shutdown_guard_time_default | Default shutdown guard timer | 180secs | 
| sctp_secret_lifetime_default | Default secret lifetime | 3600secs | 
| sctp_add_more_threshold | Threshold when more space should be added to a socket send buffer | 1452 | 
| sctp_nr_outgoing_streams_default | Default number of outgoing streams | 10 | 
| sctp_cmt_on_off | Turn CMT on or off. | 0 | 
| sctp_cmt_use_dac | Use delayed acknowledgment for CMT | 0 | 
| sctp_fr_max_burst_default | Default max burst for SCTP endpoints when fast retransmitting | 4 | 
| sctp_auto_asconf | Enable SCTP Auto-ASCONF | 1 | 
| sctp_multiple_asconfs | Enable SCTP Muliple-ASCONFs | 0 | 
| sctp_asconf_auth_nochk | Disable SCTP ASCONF AUTH requirement | 0 | 
| sctp_auth_disable | Disable SCTP AUTH function | 0 | 
| sctp_nat_friendly | SCTP NAT friendly operation | 1 | 
| sctp_inits_include_nat_friendly | Enable sending of the nat-friendly SCTP option on INITs. | 0 | 
| sctp_udp_tunneling_port | Set the SCTP/UDP tunneling port | 9899 | 
| sctp_do_drain | Determines whether SCTP should respond to the drain calls | 1 | 
| sctp_abort_if_one_2_one_hits_limit | When one-2-one hits qlimit abort | 0 | 
| sctp_strict_data_order | Enforce strict data ordering, abort if control inside data | 0 | 
| sctp_min_residual | Minimum residual data chunk in second part of split | 1452 | 
| sctp_max_retran_chunk | Maximum times an unlucky chunk can be retransmitted before the association aborts | 30 | 
| sctp_default_cc_module | Default congestion control module | 0 | 
| sctp_default_ss_module | Default stream scheduling module | 0 | 
| sctp_default_frag_interleave | Default fragment interleave level | 1 | 
| sctp_mobility_base | Enable SCTP base mobility | 0 | 
| sctp_mobility_fasthandoff | Enable SCTP fast handoff | 0 | 
| sctp_L2_abc_variable | SCTP ABC max increase per SACK (L) | 1 | 
| sctp_vtag_time_wait | Vtag time wait time, 0 disables it. | 60secs | 
| sctp_blackhole | Enable SCTP blackholing | 0 | 
| sctp_path_pf_threshold | Default potentially failed threshold | 65535 | 
| sctp_rttvar_bw | Shift amount for bw smoothing on rtt calc | 4 | 
| sctp_rttvar_rtt | Shift amount for rtt smoothing on rtt calc | 5 | 
| sctp_rttvar_eqret | What to return when rtt and bw are unchanged | 0 | 
| sctp_steady_step | How many the sames it takes to try step down of cwnd | 20 | 
| sctp_use_dccc_ecn | Enable for RTCC CC datacenter ECN | 1 | 
| sctp_buffer_splitting | Enable send/receive buffer splitting | 0 | 
| sctp_initial_cwnd | Initial congestion window in MTUs | 3 | 
| sctp_logging_level | Logging level | 0 | 
| sctp_debug_on | Turns debug output on or off. | 0 | 
Examples
See https://github.com/sctplab/usrsctp/tree/master/programs
References
SCTP
R. Stewart:
Stream Control Transmission Protocol.
RFC 4960, September 2007.
auth
M. Tüxen, R. Stewart, P. Lei, and E. Rescorla:
Authenticated Chunks for the Stream Control Transmission Protocol (SCTP).
RFC 4895, August 2007.
addip
R. Stewart, Q. Xie, M. Tüxen, S. Maruyama, and M. Kozuka:
Stream Control Transmission Protocol (SCTP) Dynamic Address Reconfiguration.
RFC 5061, September 2007.
socketAPI
R. Stewart, M. Tüxen, K. Poon, and V. Yasevich:
Sockets API Extensions for the Stream Control Transmission Protocol (SCTP).
RFC 6458, Dezember 2011.
streamReset
R. Stewart, M. Tüxen, and P. Lei:
Stream Control Transmission Protocol (SCTP) Stream Reconfiguration.
RFC 6525, February 2012.
udpencaps
M. Tüxen and R. Stewart
UDP Encapsulation of Stream Control Transmission Protocol (SCTP) Packets for End-Host to End-Host Communication
RFC 6951, May 2013.
sack-imm
M. Tüxen, I. Rüngeler, and R. Stewart:
SACK-IMMEDIATELY Extension for the Stream Control Transmission Protocol
RFC 7053, November 2013.