Understanding Statistics of Grouped Data
Statistics is a branch of mathematics dealing with the collection, analysis, interpretation, presentation, and organization of data. When data sets are large, it becomes more convenient to group them into classes, forming a grouped frequency distribution. This section summarizes key concepts for understanding and analyzing such data.
Grouped Frequency Table
A grouped frequency table organizes raw data into classes or intervals and shows the frequency (number of occurrences) of data values within each class. This makes large datasets manageable and easier to interpret.
- Class Interval: A range of values for which the data is grouped (e.g., 0-10, 10-20).
- Class Limits: The end values of a class interval. The lower limit is the smallest value, and the upper limit is the largest value in the interval.
- Class Mark (Mid-point): The average of the lower and upper limits of a class interval. It is used as ‘xi’ (representative value) for calculations like mean. Formula: (Lower Limit + Upper Limit) / 2.
- Frequency (fi): The number of observations falling within a particular class interval.
Measures of Central Tendency for Grouped Data
These measures describe the center or typical value of a distribution.
- Mean: The average of the data. For grouped data, it is calculated using the formula:
Mean (x-bar) = Sum(fi * xi) / Sum(fi)
Where fi is the frequency of the i-th class and xi is the class mark of the i-th class. - Median: The middle-most value when the data is arranged in ascending or descending order. For grouped data, it is found using the formula:
Median = L + [ (N/2 – CF) / f ] * h
Where L = lower limit of the median class, N = sum of frequencies, CF = cumulative frequency of the class preceding the median class, f = frequency of the median class, h = class size. - Mode: The value that appears most frequently in the data set. For grouped data, it is found using the formula:
Mode = L + [ (f1 – f0) / (2f1 – f0 – f2) ] * h
Where L = lower limit of the modal class, f1 = frequency of the modal class, f0 = frequency of the class preceding the modal class, f2 = frequency of the class succeeding the modal class, h = class size.
Cumulative Frequency Context
Cumulative frequency is the running total of frequencies. It helps in determining the number of observations that fall below a certain value. There are two types:
- Less Than Cumulative Frequency: Indicates the number of observations with values less than the upper limit of a class interval.
- More Than Cumulative Frequency: Indicates the number of observations with values greater than or equal to the lower limit of a class interval.
Cumulative frequency curves, also known as ogives, are graphical representations used to estimate the median, quartiles, and other percentiles of grouped data.
Example of Cumulative Frequency Calculation:
| Class Interval | Frequency (fi) | Cumulative Frequency (Less Than) |
|---|---|---|
| 0-10 | 5 | 5 |
| 10-20 | 8 | 5 + 8 = 13 |
| 20-30 | 12 | 13 + 12 = 25 |
| 30-40 | 7 | 25 + 7 = 32 |
Quick Revision Points
- Grouped data simplifies analysis of large datasets.
- Class mark is the midpoint used for ‘xi’.
- Cumulative frequency helps find the position of specific data points like median.
- Mean, Median, and Mode are central tendencies; each has a specific formula for grouped data.
- Ogive is a graphical tool for visualizing cumulative frequencies and estimating median.
Extra Practice Questions
- The mid-point of a class interval is 42 and the class size is 10. Find the lower and upper limits.
Answer: Lower limit = 37, Upper limit = 47. (Midpoint – h/2, Midpoint + h/2) - If the median class for a grouped data is 30-40, and N/2 is 25, CF of preceding class is 18, frequency of median class is 10, what is the median?
Answer: Median = 30 + [ (25 – 18) / 10 ] * 10 = 30 + 7 = 37. - For a frequency distribution, if Sum(fi) = 50 and the assumed mean (A) is 25, with Sum(fi * di) = 150, find the mean. (di = xi – A)
Answer: Mean = A + [ Sum(fi * di) / Sum(fi) ] = 25 + (150 / 50) = 25 + 3 = 28. - What is the relation between mean, median, and mode for a moderately skewed distribution?
Answer: Mode = 3 Median – 2 Mean (Empirical formula). - If the class intervals are 0-9, 10-19, 20-29, what is the class size?
Answer: The actual class intervals are 0-9.5, 9.5-19.5, 19.5-29.5. So class size is 19.5 – 9.5 = 10.

Content created and reviewed by the CBSE Quiz Editorial Team based on the latest NCERT textbooks and CBSE syllabus. Our goal is to help students practice concepts clearly, confidently, and exam-ready through well-structured MCQs and revision content.