Philip J. Guo - Personal Home Page
Main Menu

Summary of Statistical Tests

by Philip Guo (philip@pgbovine.net)

Summary:
As I attempt to learn statistics to assist my research, I've found it useful to document the various tests that I've learned. My goal in creating this page is to provide a quick summary of what each test can be used for and when it can be applied. I'm not a statistics expert by any means, so please email me if there are errors.

Testing whether the population mean is equal to some value

You collect a simple random sample from a population of ratio-level or interval-level values.

Any population distribution (more realistic scenario)

You must have large sample size (N >= 30) to proceed. You can use the z-test to get an approximate solution.

Normally-distributed population


Testing whether the means of two populations are identical, given two sets of independent samples

You collect two sets of independent simple random samples from populations of ratio-level or interval-level values (e.g., subjects are randomly-picked to receive either one of two experimental treatments).

Any population distribution (more realistic scenario)

Normally-distributed population


Testing whether one population has values that are consistently greater than or less than those of the other population, given two sets of paired samples

You collect a set of paired samples from two populations of ratio-level or interval-level values (e.g., each subject is given both experimental treatments).

Any population distribution (more realistic scenario)

The following non-parametric tests can even be used for ordinal-level values.

The population of differences between pairs are normally-distributed

That's right, you read that correctly! The following test assumes that the differences in the pairs amongst the population are normally-distributed, which might make it difficult to apply.


Testing whether the means of more than two populations are identical

You collect the same sets of ratio-level or interval-level measurements from several different groups (e.g., people's heights) and want to determine whether the means of all of the groups' respective populations are identical.

Any population distribution (more realistic scenario)

The following non-parametric test can even be used for ordinal-level values, but it assumes that the observations in each group come from distributions with the same shape (less stringent than homoscedasticity, though):

Normally-distributed and homoscedastic population

Homoscedastic means that the within-group variances for all groups are identical (e.g., the variance in heights within each group of people are identical).


Testing whether two variables are correlated

You collect pairs of ratio-level or interval-level measurements from a population, where each of the two elements in each pair measures a different property (e.g., height and weight).


Testing whether the observed frequencies of categorical (nominal) variables deviate significantly from their expected frequencies

The following are goodness-of-fit tests on counts (frequencies) of observations of categorical (a.k.a. nominal) variables.

These tests can be computationally expensive, so they are not recommended for N > 1000:

These tests require that the expected counts in each category not be too small (the smallest expected count greater than 5 will suffice):


Testing whether the proportions in two different groups are identical

You have two categorical variables, each of which have two or more possible values.


Sources


Created: 2008-03-28
Last modified: 2008-03-30