In everyday life and in many competitive exams, we often need to summarize a large set of numbers to understand its "center" or typical value. One important way to do this is by finding the median of a dataset.
What is the median? Simply put, the median is the middle value of a list of numbers arranged in order. It divides the data into two equal halves: half of the data points lie below the median, and half lie above it. Because the median depends on the position of values in an ordered set, it is called a positional measure of central tendency.
Understanding the median is especially useful when data contains extreme values or outliers, which can distort averages. For students preparing for competitive exams, mastering the median is key to solving many problems in statistics and arithmetic.
When the number of values in a dataset is odd, finding the median is straightforward:
Example: For the dataset \{3, 5, 7, 9, 11\}, there are 5 numbers (odd). Sorted data is already given. The middle position is \((5 + 1)/2 = 3\). So, the 3rd value is 7, which is the median.
When the dataset has an even number of values, there is no single middle value. Instead:
Example: For the dataset \{3, 5, 7, 9, 11, 13\}, \(n=6\) (even).
Often data is presented as a summary in form of grouped frequency distribution, where data is grouped into class intervals with associated frequencies. Here, the median is estimated using a formula rather than direct observation.
To find the median in such grouped data, follow these steps:
| Class Interval (cm) | Frequency (f) | Cumulative Frequency (CF) |
|---|---|---|
| 0 - 10 | 5 | 5 |
| 10 - 20 | 8 | 13 |
| 20 - 30 | 12 | 25 |
| 30 - 40 | 10 | 35 |
In this example, if total \(N=35\), \(\frac{N}{2} = 17.5\), median class is where cumulative frequency exceeds 17.5 (i.e., 20-30 cm).
It's important to understand how median differs from other measures of central tendency:
For example, in the data \{2, 3, 5, 8, 100\}, mean is \(\frac{2+3+5+8+100}{5} = 23.6\), which is skewed by 100. Median is 5, which better represents the "middle" value.
In entrance exams, problems on median appear regularly in areas such as:
Mastering median helps you tackle questions more easily and avoid confusion with similar statistics like mean and mode.
Step 1: Sort the data in ascending order: 3, 5, 7, 9, 11.
Step 2: Count the number of values: \(n = 5\) (odd).
Step 3: Find the middle position using \(\frac{n+1}{2} = \frac{5+1}{2} = 3\).
Step 4: The 3rd number in the sorted list is 7.
Answer: The median is 7.
Step 1: Sort the data: 3, 7, 10, 12, 15, 20.
Step 2: Number of observations \(n = 6\) (even).
Step 3: Middle positions are at \(n/2 = 3\) and \(n/2 + 1 = 4\).
Step 4: The 3rd and 4th values are 10 and 12.
Step 5: Median = \(\frac{10 + 12}{2} = 11.\)
Answer: The median is 11.
| Length (cm) | Frequency (f) |
|---|---|
| 0 - 10 | 5 |
| 10 - 20 | 8 |
| 20 - 30 | 12 |
| 30 - 40 | 10 |
Step 1: Calculate total frequency: \(N = 5 + 8 + 12 + 10 = 35.\)
Step 2: Find \(\frac{N}{2} = \frac{35}{2} = 17.5.\)
Step 3: Calculate cumulative frequencies:
Step 4: The median position 17.5 lies in the class where cumulative frequency just exceeds 17.5, which is 20-30 cm.
Step 5: Identify parameters for formula:
Step 6: Apply formula:
\[ \text{Median} = 20 + \left(\frac{17.5 - 13}{12}\right) \times 10 = 20 + \left(\frac{4.5}{12}\right) \times 10 = 20 + 3.75 = 23.75 \text{ cm} \]
Answer: The median length of the pipes is 23.75 cm.
Step 1: Sort the data:
142, 147, 148, 149, 150, 153, 155, 160
Step 2: Number of observations \(n = 8\) (even).
Step 3: Find middle positions: 4th and 5th values.
Step 4: Values at these positions: 149 and 150.
Step 5: Median = \(\frac{149 + 150}{2} = 149.5\) cm.
Answer: The median height is 149.5 cm.
Step 1: Sort the sales figures:
7800, 7950, 8400, 8800, 9000, 9200, 9600
Step 2: Number of observations \(n = 7\) (odd).
Step 3: Find middle position: \(\frac{7+1}{2} = 4\).
Step 4: The 4th value in the sorted list is 8800 INR.
Answer: The median daily sales amount is Rs.8800.
When to use: At the start of any median calculation to avoid errors.
When to use: When dealing with frequency tables in entrance exam problems.
When to use: While analyzing skewed data or outliers.
When to use: During time-pressured exams to save calculation time.
When to use: When dealing with complex or unfamiliar datasets.
Progress tracking is paywalled — subscribe to mark subtopics as understood and save your streak.
Go to practice →