Estimating Bandwidth from Passive Measurement Traces
John G. Cleary and H. Stele Martin.
Department of Computer Science, University of Waikato.
A methodology for analysing bandwidth from passive measurement traces is described. Previous analyses of delay times for HTTP requests showed that effects from bandwidth are significant.
Using accurate passive measurement traces we extract Round Trip Times (RTTs) from TCP streams. We start by identifying a TCP connection between two machines (typically this is an HTTP flow and contains traffic in both directions). The TCP connection is then analysed to identify the different message types (for example data, ACK, SYN and FIN) and the state of the TCP stacks at the time when the messages were sent. By pairing packets that are known to have a small amount of host processing (for example SYN and ACK packets), the minimum physical time from the measurement point to the host and back, is approximated.
For small SYN and ACK packets, using just the measured time as an approximation for the RTT works well. When this technique is applied to larger packets, then the effect of the bandwidth over the link is significant and must be taken into account.
The effective bandwidth seen by the user is dependent on the number and speed of the links between the two hosts being measured. Each separate link where the packet has to be stored and forwarded adds extra time proportional to the length of the packet. In the simple case the packets in both directions will travel over the same links and the observed bandwidth will be the same in both directions. We analyse such data by plotting the observed RTTs against the sum of the lengths of the packets in both directions. The slope of the lower edge of this plot gives an estimate of the effective bandwidth. A number of different regression algorithms for estimating this slope are examined for robustness and accuracy.
It is clear that for some of the flows that have been analysed, these plots do not give consistent results. We show that this is a result of the effective bandwidth being different in the forward and reverse directions. It is postulated that this is a result of asymmetric routing of the packets.
Given the size of the packets in each direction, RTT can be plotted on a 3 dimensional graph as a function of forward data size and reverse data size. The effective bandwidth in each direction can then be approximated from the plane which underlies the minima of these times.
The regression algorithms are extended to deal with the estimate of the bandwidths in both the forward and reverse directions. Care is taken to ensure that enough data points are used when estimating each minimum point. A number of example analyses of different flows are given. A resampling technique is used to evaluate the robustness and accuracy of the regressions and the resulting estimated bandwidths.