2023 Boston Marathon - Understanding variability using ANOVA
Module
Please note that these material have not yet completed the required pedagogical and industry peer-reviews to become a published module on the SCORE Network. However, instructors are still welcome to use these materials if they are so inclined.
Introduction
For this activity, you will be exploring the result times from female and male runners that finished the 2023 Boston Marathon.
In particular, you will examine both visualizations and Analysis of Variance (ANOVA) modules of result times to help understand the variation in finish times of participants.
Investigating these trends is useful for several reasons. Mainly, exploring these trends can help to deepen our understanding of how different factors, such as gender and age, impact marathon performances. Analyzing the distribution of finish times can also provide insights into the competitive landscape of the marathon.
Data
The data set contains 26598 rows and 15 columns. Each row represents a runner who completed the Boston Marathon in 2023
Download data: boston_marathon_2023.csv
Variable Descriptions
Variable | Description |
---|---|
age_group | age group of the runner |
place_overall | finishing place of the runner out of all runners |
place_gender | finishing place of runner among the same gender |
place_division | finishing place of runner among runners of the same gender and age group |
name | name of runner |
gender | gender of runner |
team | team the runner is affiliated with |
bib_number | bib number of runner |
half_time | half marathon time of runner |
finish_net | finishing time timed from when they cross the starting gate |
finish_gun | finishing time of runner timed from when the starter gun is fired |
age_group | age group of the runner |
half_time_sec | half marathon time in seconds |
finish_net_sec | net finish in seconds |
finish_gun_sec | gun finish in seconds |
finish_net_minutes | net finish in minutes |
Data Source