Percentile positions in Grouped data / in Ranges: Example

The following table presents some random generated data about ages that are grouped in some specified intervals. Both, the convenient as well True Upper and Lower limits are presented for each Age Range. Also, the Simple and Cumulative frequency is presented for each Age Range.

For example: The Age Range of 31 – 40 has True Upper and Lower limits 30.5 – 40.5. It has a Simple Frequency of 5, which means that 5 participants claimed that had an age between 31 to 40. Finally, its Cumulative Frequency is 9, that is, 9 (4+5) participants claimed that had an Age between 21 to 30 (4) and between 31 to 40 (5).

Convenient Range limits | True Range limits (Χu and Χl) | Freq. Fi | Cumulative Frequency cf |
---|---|---|---|

21 - 30 | 20.5 - 30.5 | 4 | 4 |

31- 40 | 30.5 - 40.5 | 5 | 9 |

41- 50 | 40.5 - 50.5 | 3 | 12 |

51- 60 | 50.5 - 60.5 | 8 | 20 |

Statistical formula

Note that in order to find what value corresponds to a specified -ile position such as the positions of Percentile, Quartile, or Decile in grouped data, the following formula must be used:

Symbol explanation

The is the value that is indicated from a specified -ile.

The is the specified -ile e.g. 25th Percentile, 5th Decile, 2nd Quartile.

The is the True Lower Range limit of this Range that includes the specified -ile position.

The is the size / Width of this Range that includes the specified -ile position.

The is the Simple Frequency of this Range that includes the specified -ile position.

The takes the value that is denoted to the position of a specified -ile e.g. it takes the value of 2 for , it takes the value of 50 for .

The is the number of how much Arithmetic parts / pieces or “moieties” are produced from the specified -ile e.g. Quartiles produce 4 Arithmetic pieces while Percentiles produce 100 “moeties”.

The is the size of the dataset, that is, the total Simple Frequency.

The is the Cumulative Frequency of the Range that is right before from the Range that includes the specified -ile position.

Note that this part of formula: “” can indicate in what observation lies a specified -ile position. Note that it is mainly based on the size of the total number of observations as well to which -ile will be used.

Statistical Example Ι

The steps that we must follow in order to find the value that is indicated by a specified / chosen -ile -let’s say by 50th Percentile - for the given example of Grouped data are the following ones:

i) A prerequisite in order to find the value that is indicated by a specified -ile is that the given Ranges must be arranged in an ascending order e.g. from the Lowest Age Range to the Highest Age Range, which has been done.

ii) Note that the total number of observations in this example is .

iii) then, we must find where lies the 50th position between these 20 observations. This 50th position can also be expressed as when using Percentile concept. Therefore, the 50th position (which is the middle position) lies in the 10th observation .

iv) Note that the Age Ranges of 21-30 and 31- 40 includes the first 9 observations while the Age Range that includes the 10th observation is the next one, which is: . It can also named as the -ile group or the Percentile Range / Group.

v) The True Lower limit of this Age Range is:

vi) The is the size of the “Percentile Range” which can be found by subtracting the *True Lower Range limit* from the *True Upper Range limit*. Therefore: , and thus .

vii) The is the Simple Frequency that has the “Percentile Group”, which is 3: .

viii) The is the Cumulative Frequency that cumulate to those Age Ranges before the “Percentile Group”. The Age Ranges 21- 30 and 31- 40 includes the first 9 observations which are before the “Percentile Group”, therefore: .

Statistical Example Ι: Calculations

Now, we must ready to enter these values into the Statistical Formula:

Therefore, the Age that is indicated by the position of is the age of 43.83. Note that the position of “always” coincide with the position of (both refer to the “Median position”. Therefore, this Age value is also the Median value. Therefore, 50% of the total Age observations will be before this value and 50% of the total Age observations will be after this value.

Statistical Example IΙ: Interquartile Range

What is the value that is indicated by the and positions, respectively ?

i) The first step is to find in which observation lies the 1st and 3rd Quartile positions. This can be found by using this part of formula which indicates the 5th and 15th age observation, respectively:

—– and

—–

ii) The next step is to find in which Age Range these observations lies. According to the Cumulative Frequencies:

—–The 5th observation is included in Age Range of: and

—–The 15th observation is included in Age Range of:

iii) The True Lower limit of these Age Ranges is:

—– and

—–

iv) The size of both these Age Ranges is 10:

—–

—–

v) The Simple Frequency for each Age Range is:

—–

—–

viii) Finally, the Cumulative Frequency that exists before these given Age Ranges, respectively is:

—–

—–

**Example IΙ: Results**

Therefore the Interquartile Range of Age groups is “32.5 – 54.25″. This Age “Range” includes the 50% of the total Age Observations, those that are placed in the “middle”.

Example IΙ: The value of the Interquartile Range

The value of Interquartile Range is the result that we will get when the value that exists in the First Quartile position will be subtracted from the value that exists in the Third Quartile position, which is 21.75:

Statistical Example ΙII: Decile positions: and

i) The first step is to find in which observation lies the 1st and 9th Decile positions. This can be found by using this part of formula which indicates the 2nd and 18th age observation, respectively:

—– and

—–

ii) The next step is to find in which Age Range these observations lies. According to the Cumulative Frequencies:

—–The 2nd observation is included in Age Range of: and

—–The 18th observation is included in Age Range of:

iii) The True Lower limit of these Age Ranges is:

—– and

—–

iv) The size of both these Age Ranges is 10:

—– and

—–

v) The Simple Frequency for each Age Range is:

—– and

—–

viii) Finally, the Cumulative Frequency that exists before these given Age Ranges, respectively is:

—– and

—–

**Example IIΙ: Results**

This Decile “Range” of “25.5 – 58″ includes the 80% of the total Age observations. The “middle” ones.