1. Framework for Data Analysis
A course satisfying the QR overlay requirement should provide a general
overview for how empirical questions or hypotheses can be raised, how
relevant data can be collected and analyzed to address these questions,
and finally, what conclusions these data allow. Specifically, students
should practice formulating questions that arise in the context of the
course and that can be answered by analyzing data. They should then
decide what type of data to collect, how to collect and analyze these
data, and what conclusions their data support.
2. Collecting Data
A QR overlay course should address issues of data quality. Are the
data representative or biased? Are the data that are collected really
relevant to the question being investigated? Certain courses might stress
the importance of random samples, experiments versus observational studies,
blind versus double blind experiments, and so forth.
3. Representing Data
A QR overlay course should stress different methods of representing
data, including numerical representations (tables of data), visual representations
(pie charts, scatter-plots, line graphs, and histograms), verbal representations
(writing reasonable captions that describe a graph).
4. Summarizing Data
A course should stress different ways of summarizing data, including
verbal summaries of data sets. A QR overlay course must include discussions
of different measures of central tendency, including mean, median, or
such other measures. An overlay course must also include address different
measures of dispersion, including the range, standard deviation, and
percentile ranks. Depending on the course, other measures might include
index of diversity, index of qualitative variation, or absolute variation.
5. Probability
Because of the random component in sampling from a population, students
should have some understanding of basic probability. This must include
a working knowledge of how and when to use the addition and multiplication
rules, and the vocabulary of statistical independence, mutual exclusivity,
and the informal notions of the "Monte Carlo Fallacy" and the "Law of
Averages."
6. Distributions
Students in a QR overlay course should see different examples of distributions,
including both normally distributed data sets as well as non-normally
distributed data sets. They should know when to expect that a population
will be normally distributed, and what it means for a distribution to
be skewed. They should know what the mean, median, and standard deviation
tell about a distribution. Finally, they should know that a normal distribution
is fully specified by its mean and standard deviation and that the percentage
of the population on a given interval can be determined from a table
or a formula.
7. Sampling
A QR overlay course should discuss sampling and stress the distinction
between "sampling" and "collecting data." It should introduce the notion
of sample mean, and discuss why the sample mean might well vary from
the population mean. The distinction between sample statistics (e.g.,
the mean of a sample) and population parameters (e.g., the mean of the
population) should be emphasized.