Five number summary calculator

For five number summary calculation, please enter numerical data separated with comma (or space, tab, semicolon, or newline). For example: 861.2 752.7 263.1 982.7 815.6 897.8 813.3 787.2 -490.3 373.0 104.2 313.9 229.5



Five number summary

A five number summary consists of these five statistics:

the minimum,
Q1 (the first quartile, or the 25% mark),
the median,
Q3 (the third quartile, or the 75% mark),
the maximum,

The five-number summary gives you a rough idea about what your data set looks like. For example, you’ll have your lowest value (the minimum) and the highest value (the maximum) or more concentrated data.
The main reason you’ll want to find a five-number summary is to find more useful statistics, like the interquartile range IQR, sometimes called the middle fifty.

How to enter data as a frequency table?

Simple. First-type data elements (separated by spaces or commas, etc.), then type f: and further write frequency of each data item. Each element must have a defined frequency that count of numbers before and after symbol f: must be equal. For example:

1.1 2.5 3.99
f: 5 10 15

How to enter a grouped data?

Grouped data are data formed by aggregating individual data into groups so that a frequency distribution of these groups serves as a convenient means of summarizing or analyzing the data.

groupfrequency
10-205
20-3010
30-4015
This grouped data you can enter:
10-20 20-30 30-40
f: 5 10 15

How to enter data as a cumulative frequency table?

Similar to a frequency table, but instead f: type cf: in the second line. For example:

10 20 30 40 50 60 70 80
cf: 5 13 20 32 60 80 90 100


The cumulative frequency is calculated by adding each frequency from a frequency distribution table to the sum of its predecessors. The last value will always be equal to the total for all observations since all frequencies will already have been added to the previous total.


Questions




  • Question1:


    There seems to be a problem with the 5-figure calculator. I placed the following data set and the Q3, giving a wrong value. 8, 12, 9, 8, 16, 10 ,14, 7, 5, 21, 13, 10, 8, 10, 11, 8, 11, 9, 11, 14. I used The program, against TI and Casio graphics calculator as well as a different program. This website gave me 12.75, while the calculators and the other website gave me 12.5.

    Answer1:


    All seems OK. But there are at least three methods to compute quartiles.


    For large numbers of data, all methods converge to the same results. For small datasets, the methods differ in how they interpolate gaps in the input data.

    Answer2:


    Q3 in case 20 values, so its indexes is 0... 19
    quartile3 index is 19*0.75 = 14.25
    and quartile3 value Q3 = 0.75*12+0.25*13 = 12.25
  • Question2


    Your Quartile calculator has an error. Both Q1 & Q3 are incorrect with the data set below. Your calculator says Q1 is 13 and Q3 is 44. Correct answers: Q1 is 13.5 and Q3 is 41. Verified with third party and Excel spreadsheet. There is definitely a problem with your code. When a data set has an even number of items all three methods will have the same result!! In my example, Q1 should be 13.5 no matter what!

    {0, 0, 0, 0, 0, 0, 3, 3, 3, 3, 3, 6, 6, 6, 6, 6, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 9, 10, 10, 10, 10, 10, 10, 10, 10, 10, 11, 13, 13, 13, 13, 13, 13, 13, 13, 13, 13, 13, 13, 13, 13, 13, 13, 13, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 16, 16, 16, 16, 17, 17, 17, 17, 17, 17, 17, 18, 18, 19, 20, 20, 20, 20, 20, 20, 20, 20, 21, 21, 21, 21, 21, 21, 21, 21, 21, 21, 21, 21, 21, 21, 22, 22, 22, 23, 23, 23, 23, 24, 24, 24, 24, 24, 24, 24, 24, 24, 24, 24, 25, 25, 25, 26, 26, 26, 26, 27, 28, 28, 28, 28, 28, 29, 30, 30, 30, 31, 31, 31, 31, 31, 31, 31, 33, 33, 33, 33, 34, 34, 34, 34, 35, 35, 36, 37, 37, 38, 38, 38, 38, 38, 38, 38, 40, 41, 41, 41, 42, 43, 44, 44, 44, 44, 45, 45, 45, 45, 45, 45, 45, 45, 46, 47, 47, 48, 48, 48, 49, 49, 50, 50, 51, 51, 52, 52, 52, 52, 52, 53, 54, 54, 54, 55, 55, 55, 56, 56, 56, 56, 57, 59, 61, 61, 63, 63, 69, 70, 70, 77, 80}

    Answer2


    Hi Peter, Thank you for adding the method buttons! So I see "Method 2" is the algorithm you had before. "Method 1" is more accurate and the preferred method. "Method 2" is acceptable but still has the same bug as before. It only happens when the data set is even. Again, Q1 & Q3 results should be the same for evens regardless of method per the Wikipedia article. I'll give you a really simple example which shows the error I've been talking about all along. Try data set { 3, 7, 7, 9 }. Each method gives different results but Method 1 is indeed correct. Q1 should be 5 and Q3 should be 8 for both methods. Hope this helps!

    Consider the set:

    {1,2,3,4,5,6,7,8,9,10,11,12,13}

    Clearly, Q2 is 7, Q1 is 3.5, and Q3 is 10.5. However, your program yields

    "Quartile Q1: 4
    Quartile Q2: 7
    Quartile Q3: 10"

    ^ This is incorrect.

    I think the error in the algorithm is in how it creates the lower and upper halves after finding Q2.

    Consider the set:

    {1,2,3,4,5,6}

    If the list is of even length, Q2 will be the average of the two middle values (e.g. Q2 = [3+4]/2 = 3.5) . Hence, the lower half should be a list ranging from the LOW (i.e. the lowest value in the original set of values) to the floor of Q2 (e.g. if Q2=3.5, the floor of Q2 is 3). Hence,

    lower half = {1,2,3}

    The upper half should range from the ceiling of Q2 (e.g. is Q2=3.5, the ceiling of Q2 is 4) to the HIGH (i.e. the highest value in the original set of values). Hence,

    upper half = {4,5,6)

    A visual:

    1 2 3 4 5 6
    1 * 3 * 4 * 6

    Q2 = 3.5
    Q1 = 2
    Q3 = 5