t-test

Educational level: this is a tertiary (university) resource.

Completion status: this resource is ~50% complete.

“A t-test is used to determine whether a set or sets of scores are from the same population.” - Coakes & Steed (1999), p.61^[1]

History

The t-test is 116 years old. The t statistic was introduced by William Sealy Gosset for cheaply monitoring the quality of beer brews. Gosset was a statistician for the Guinness brewery in Dublin, Ireland. He was hired due to Guiness' innovative policy of recruiting the best graduates from Oxford and Cambridge to apply biochemistry and statistics to Guinness' industrial processes. Gosset published the t test in Biometrika in 1908, but was forced to use a pen name by his employer who regarded the fact that they were using statistics as a trade secret. Hence, the name Student's t-test. Read more here: t-test history.

Types

There are three types of t-test:

One-sample t-test

Used to compare a sample mean with a known population mean or some other meaningful, fixed value

Independent samples t-test

Used to compare two means from independent groups

Paired samples t-test

Used to compare two means that are repeated measures for the same participants - scores might be repeated across different measures or across time.
Used also to compare paired samples, as in a two treatment randomized block design.

Assumptions

Dependent variables must be:

Measured at interval or ratio level level of measurement - i.e., needs to be continuous.
Normally distributed in all groups of the independent variable.
- Robust to violations of this assumption if sample sizes are large and approximately equal (> 15 cases per group)
Have approximately equal variance across all groups of the IV (homogeneity of variance e.g., tested by Levene's test).
- If not the p-values for significance tests are inaccurate.
- If the variances are different SPSS has post-hoc tests to adjust for this.
Cases represent random samples from the populations and the scores of the test variable are independent of each other.
- Inaccurate p-values if the independence assumption is violated.

Graphing

Bar chart (1 IV) and clustered bar charts (> 1 IV)
Box and whisker plot
Error-bar chart
Stem and leaf plot

Write-up

Checklist

Purpose? (e.g., hypothesis?)
Variables / design
Descriptive statistics (4 moments)
Assumptions
Figure / graph? (optional)
One-tailed or two-tailed?
Test statistics, including effect size and direction of effects

Example 1

An independent t-test was used to determine whether there was a difference in mean grip strength between males and females. This revealed a significant difference (t (88) = 2.04, p = .04), with males having significantly higher mean strength scores than females (males, M = 2.93, SD = 1.23; females, M = 2.71, SD = '1.41).

Note: Usually means, standard deviations and effect sizes are reported in a Table.

References

↑ Coakes, Sheridan J. & Steed, Lyndall G. 1999 SPSS : analysis without anguish : versions 7.0, 7.5, 8.0 for Windows / Sheridan J. Coakes, Lyndall G. Steed Jacaranda Wiley, Brisbane, Australia.