This is a postprint version of the following published document:


DOI: https://doi.org/10.1145/3360468.3368189

© 2019 Association for Computer Machinery.
ABSTRACT
Network programmability brings agility, flexibility and rapid introduction of new capabilities, towards supporting application requirements. Enabling programmable traffic management at the data plane can provide significant benefits pertaining to Quality of Service (QoS). To this end we adopt P4, domain specific language for programming network switches, to implement an Active Queue Management (AQM) scheme for reducing queuing delay. We verify the design with a proof of concept implementation of PI2, a modern AQM that requires only basic bit manipulations at the data plane.

CCS CONCEPTS
• Networks → Programmable networks; Network algorithms; Network performance evaluation.

1 INTRODUCTION
Centralization of the network’s intelligence in Software Defined Networking (SDN) is an advantage for applications that do not have strict real-time requirements and depend on global network state. However, when the service uses local state information, e.g., to support QoS, the same level of flexibility must be supported at the data plane. Making stateful data plane algorithms programmable, complementing existing programmable forwarding plane solutions, can be beneficial in terms of meeting QoS requirements, reduce the control load on the SDN controller(s) and network overhead, and enhance network flexibility by enabling customized traffic management.

Programmable data plane solutions such as P4 [1] and supported architectures, provide an excellent way to define the packet forwarding behavior of network devices. However, most programmable devices still typically have non-programmable traffic managers. The P4 open source community has recently started working towards defining a P4 programmable traffic manager, encompassing functionality such as packet scheduling, shaping, policing etc. However, traffic management is still not programmable for P4 targets.

Towards providing fully programmable and customized data planes, we present our initial effort to complement the suite of AQM schemes using the P4 domain specific language for programming network switches (such as in [5]). Our approach builds on the premise that an SDN data path is a very fast and efficient engine to perform low level primitive operations at wire speed. We implement a modern AQM algorithm, namely PI2 [2], that can be deployed using basic bit manipulations, while other P4 AQM implementations require more complex approximation operations, external hardware function support or P4 longest prefix match-mapping tables. We verify the design through a proof of concept implementation. The P4 program source code is provided in [4].

2 BACKGROUND
P4 is a declarative language for programming protocol-independent packet processors, with constructs optimized for packet forwarding functions. Using P4, developers can program data plane packet pipelines, on a variety of targets (e.g., ASICs, FPGAs etc.) based on a match/action architecture (e.g., Fig. 1). While the P4 language is target-independent, a P4 compiler translates P4 programs into the instruction set of the hardware of the packet processor.

3 SYSTEM DESIGN
The BMV2 switch buffers packets between the ingress and egress pipeline using an egress queue per port. Using the v1model architecture, a P4 program can access in the egress pipeline the queuing delay for a packet and related queuing information, as part of the standard packet metadata. Therefore the PI2 AQM in our case is integrated into the P4 egress pipeline as denoted in Fig. 2. The AQM can be deployed on any Linux based system without the need of specialized hardware or proprietary software. The implementation serves as a proof of concept for deploying a custom AQM using P4.

Figure 1: P4 Target Architecture (v1model)

Figure 2: PI2 AQM in V1model

Ingress Control Flow: Match-action tables are the mechanism for performing packet processing. The ingress pipeline uses one match action table to implement basic L3-forwarding.

Egress Control Flow: A second match-action table is introduced that applies the AQM, complemented with the PI2 configuration parameters. The switch upon matching the port number executes the update_PI2 action [4]. Configuration parameters include the
queuing delay target in $\mu s$, PI2 gain factors ($\alpha$ and $\beta$) and the PI update interval $T$ in exponent(2) $\mu s$. A sample static rule used by the control plane to populate the table is provided in Listing 1. The PI gain factors are set to $\alpha = 0.3125Hz$ and $\beta = 3.125Hz$ according to the stability analysis for RTTs up to 100 ms and an update time of around 33ms (typically 1/3th of the max RTT). Fixed calculations and conversions on parameters are performed by the controller to avoid repetitive per packet processing. So the actual input values for the P4 program are scaled to fit in a 32-bit integer value. The delay target is set to 20,000 $\mu$s.

Listing 1: PI_exact Rule (excerpt)

Discussion: The current P4 abstraction model poses several challenges for implementing an AQM scheme. The absence of floating-point types required careful mapping into integer types, while the lack of support for complex arithmetic functions can be tackled with the use of smart bit manipulations or external approximation functions supported by the hardware. Authors in [5] employed the P4 longest prefix match table feature to map approximations for the square root. Our implementation does not require such resources which otherwise would constrain the forwarding functionality.

4 PROOF OF CONCEPT

In the current section we illustrate the functionality of our P4-programmable AQM scheme using some simple test cases.

Evaluation Environment: Our testbed consists of 3 Ubuntu machines with at least two 1Gbps interfaces as shown in Fig. 3. The configuration parameters for the PI2 AQM algorithm are provided in Listing 1. We compare PI2 with a taildrop queue that requires a buffer of 200 packets, assuming a reference link of 120Mbps and a delay target of 20ms. For the PI2 AQM we set the buffer size high enough (10,000 packets) to make sure that the only congestion control is performed by the AQM. The target delay can be modified without impacting other control parameters (we use 5ms and 20ms). The P4 switch can limit the rate of packets emitted from the egress pipeline, emulating different link capacities. We use this configuration option to set the rate limit. A base RTT is established by setting a 5ms delay on the interfaces of the client and server machines. A single greedy TCP flow is running and the network queuing delay is shown during its steady state phase over an interval of 250 sec (average values over 1 second intervals).

Evaluation Results: We verified our P4-PI2 implementation using an extensive set of tests to make sure it lines up with the results in [2]. However to demonstrate the effectiveness of the implementation, we present in Fig. 4 the queuing delay for the P4-PI2 AQM, compared to the baseline taildrop (TD) for varying network conditions. The initial bottleneck link capacity is set to 120Mbps. Fig. 4(i) confirms as expected that a TD buffer size of 200 packets limits the delay to 20ms. It also shows that a PI2 AQM can easily be reconfigured by the controller with different delay targets (here 20ms and 5ms) via a simple update in the corresponding entry at the table.

When the link capacity is reduced to 12Mbps as in the plot of Fig. 4(ii), the queue will be served 10 times slower, resulting for the TD case in a 10 times higher delay (up to 200ms). The PI2 controller on the other hand, keeps the delay at the target levels. Moreover, there is no need to update the AQM settings when the throughput of the link changes.

5 FUTURE WORK

Due to its isolation features, network slicing is a great opportunity to further customize per-slice network behavior. Next to AQMs, we believe additional per-slice protocol customization and real-time network-service interactions are required enablers to deal with future network challenges. This will require an agile and flexible programmable data-plane beyond the current forwarding and traffic management scope.

ACKNOWLEDGMENTS

This work was supported by H2020 5GROWTH project (grant agreement no. 856709)

REFERENCES