A synthetic dataset for teaching correlation.

vocab_data

Format

A data frame with 4000 rows and 4 variables:

ages

Age of participants

vocab

Estimated vocabulary size

count

Row numbers

reader_type

How often the participant reads (average, frequent)