View Single Post
Old 02-15-2007, 12:51 PM  
borked
Totally Borked
 
borked's Avatar
 
Industry Role:
Join Date: Feb 2005
Posts: 6,284
Quote:
Originally Posted by KlenTelaris View Post
Well host looks good but i personaly more preffer host which have different method for bandwith calculation.
Of course, you are entitled to demand the option of your choice, and I'm pretty sure Split offer GB as a bandwidth option (Joel - can you confirm?)

But then again, maybe you don't fully understand what 95th percentile is? It really is a far better method (for the customer):

Quote:

Computing the 95th percentile from the accumulated 5-minute data transfer samples provides an effective means of evaluating the regular, sustained utilization of your connection. The 95th percentile calculation is a widely used mathematical algorithm--most large service providers (UUNet, Sprint, Cable & Wireless, etc.) use this method to calculate bills for larger circuits (e.g. Ethernet, T3).

Site Specific computes the 95th percentile based on the inbound and outbound data transfer measured at your connection point at 5 minute intervals over a one month period. The 95th percentile value is computed separately on your accumulated inbound and outbound data transfer sample sets, respectively. The higher of the two computed values becomes the 95th percentile value used on your monthly bill.

To compute the 95th percentile value on a data set, the following algorithm is used: sort the data set by value from higest to lowest, discard the highest 5% of the sorted samples, and the next highest sample becomes the 95th percentile value for the data set. For example, suppose that a month is 1 hour, 40 minutes in length. Over the course of this short month, we gathered the following data sets for the inbound and outbound traffic (all numbers in Mb/s):

inbound = [0.139 0.653 0.201 0.116 0.084 0.032 0.047 0.185 0.198 0.203 0.276 0.370 0.971 0.233 0.218 0.182 0.169 0.126 0.131 0.157]
outbound = [1.347 1.435 1.229 0.523 0.438 0.231 0.347 0.689 0.940 1.248 1.385 1.427 3.988 1.265 1.221 1.013 0.992 0.874 0.896 1.002]

After sorting, we obtain:

sorted_in = [0.971 0.653 0.370 0.276 0.233 0.218 0.203 0.201 0.198 0.185 0.182 0.169 0.157 0.139 0.131 0.126 0.116 0.084 0.047 0.032]
sorted_out = [3.988 1.435 1.427 1.385 1.347 1.265 1.248 1.229 1.221 1.013 1.002 0.992 0.940 0.896 0.874 0.689 0.523 0.438 0.347 0.231]

Each sample set contains 20 samples--5% of 20 is 1, so discarding the top 5% means we must discard the top sample from each data set. We are now left with:

remaining_in = [0.653 0.370 0.276 0.233 0.218 0.203 0.201 0.198 0.185 0.182 0.169 0.157 0.139 0.131 0.126 0.116 0.084 0.047 0.032]
remaining_out = [1.435 1.427 1.385 1.347 1.265 1.248 1.229 1.221 1.013 1.002 0.992 0.940 0.896 0.874 0.689 0.523 0.438 0.347 0.231]

The highest sample from each remaining data set is the 95th percentile value for the originating set. So, for each set, above, we obtain the following values:

95th_in = 0.653 Mb/s
95th_out = 1.435 Mb/s

The higher of the two computed 95th percentile values becomes the final 95th percentile value used for billing:

95th percentile = 1.435 Mb/s
ie - ALL inbound traffic is automatically discarded. GB bandwidth counts IN+OUT.

The fact that the top 5% of bandwidth usage is simply discarded, as is the inbound traffic, gives you a far better deal than simple GB IN+OUT monitoring.

Thus, if you have 1000GB IN+OUT, this DOES NOT equate to 3mbs on 95th percentile. Sure, it does mean 3mbs, but after having removed all inbound traffic and the top 5% of bandwidth usage.

I know it may be easier to follow GB bandwidth, but 95th percentile for a website with bursts etc, is by far the way to go.
__________________

For coding work - hit me up on andy // borkedcoder // com
(consider figuring out the email as test #1)



All models are wrong, but some are useful. George E.P. Box. p202
borked is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote