Learn on PengiThe Art of Problem Solving: Prealgebra (AMC 8)Chapter 13: Data and Statistics

Lesson 2: Limits of Basic Statistics

In this Grade 4 AMC Math lesson from The Art of Problem Solving: Prealgebra, students explore the limitations of average, median, and mode as statistical measures. Using real score comparisons across multiple data sets, students discover that knowing the average or median alone reveals nothing about the size of a data set, the total of its values, or the value of the other measure. The lesson builds critical thinking about when and why basic statistics can be misleading or insufficient for drawing conclusions.

Section 1

Understanding What Statistics Tell Us

Property

To summarize and interpret numerical data sets, a comprehensive analysis involves several key steps. First, report the number of observations to understand the size of the data. Next, describe the attribute being measured, including its units. Then, calculate quantitative measures of center (like the mean, median, and mode) to find a typical value. Finally, relate these measures to the shape of the data distribution and the context to describe overall patterns, identify any significant deviations, and draw meaningful conclusions.

Examples

Section 2

Data Set Size Cannot Be Determined from Mean or Median

Property

Given only the mean or median of a data set, it is impossible to determine how many values are in the data set. Multiple data sets of different sizes can have identical means or medians.

Examples

Section 3

Selecting Appropriate Measures Based on Shape

Property

When data are close to a symmetrical bell-shaped curve, the mean and median (and mode) are very close. In such cases, either measure accurately represents the typical values.

However, in cases where the data are skewed, the mean and median can be far apart. When data are skewed, the median is usually to the side of the mean in the direction of the skew. The median value is often closer to the typical values than the mean. The mode is best for categorical data or to find the most frequent value.

Examples

  • A team's points per game were 10, 12, 11, 9, and 45. The median (11) better represents a typical game than the mean (17.4), which is inflated by the outlier score of 45.
  • The heights of a group of students are 150, 152, 155, 158, and 160 cm. The data is symmetric. The mean is 155 cm and the median is 155 cm. Both are excellent measures of center.
  • To determine the most requested book genre in a library (e.g., Fantasy, Sci-Fi, Mystery), the librarian would find the mode. The mean and median cannot be calculated for these categories.

Lesson overview

Expand to review the lesson summary and core properties.

Expand

Section 1

Understanding What Statistics Tell Us

Property

To summarize and interpret numerical data sets, a comprehensive analysis involves several key steps. First, report the number of observations to understand the size of the data. Next, describe the attribute being measured, including its units. Then, calculate quantitative measures of center (like the mean, median, and mode) to find a typical value. Finally, relate these measures to the shape of the data distribution and the context to describe overall patterns, identify any significant deviations, and draw meaningful conclusions.

Examples

Section 2

Data Set Size Cannot Be Determined from Mean or Median

Property

Given only the mean or median of a data set, it is impossible to determine how many values are in the data set. Multiple data sets of different sizes can have identical means or medians.

Examples

Section 3

Selecting Appropriate Measures Based on Shape

Property

When data are close to a symmetrical bell-shaped curve, the mean and median (and mode) are very close. In such cases, either measure accurately represents the typical values.

However, in cases where the data are skewed, the mean and median can be far apart. When data are skewed, the median is usually to the side of the mean in the direction of the skew. The median value is often closer to the typical values than the mean. The mode is best for categorical data or to find the most frequent value.

Examples

  • A team's points per game were 10, 12, 11, 9, and 45. The median (11) better represents a typical game than the mean (17.4), which is inflated by the outlier score of 45.
  • The heights of a group of students are 150, 152, 155, 158, and 160 cm. The data is symmetric. The mean is 155 cm and the median is 155 cm. Both are excellent measures of center.
  • To determine the most requested book genre in a library (e.g., Fantasy, Sci-Fi, Mystery), the librarian would find the mode. The mean and median cannot be calculated for these categories.