Ironman Triathlon (Canadian Females) - Multiple Linear Regression
Welcome video
Introduction
An Ironman triathlon is one of the most grueling endurance events in the world, consisting of three sequential races: a 2.4-mile (3.86 km) swim, a 112-mile (180.25 km) bike ride, and a marathon 26.2-mile (42.20 km) run. The event tests the physical and mental limits of athletes, requiring months or even years of dedicated training. Originating in 1978 in Hawaii, the Ironman triathlon has become a global phenomenon, symbolizing the ultimate challenge in long-distance triathlon competitions.
The Triathlon Multiple Linear Regression module focuses on analyzing the relationships and predictive capabilities of multiple variables in the context of triathlon performance. Specifically, we will explore the prediction of run times (the last stage) using swim times and bike times as predictors. By employing simple regression models, checking conditions with residuals plots, and conducting hypothesis tests and confidence intervals, we aim to identify significant predictors and understand their contextual implications. The dataset used for this analysis comprises the 2022 Canadian finishers of the Lake Placid Ironman.
Data
The data set has 64 rows with 17 columns. Each row represents a Canadian female who has participated in the 2022 Lake Placid Ironman. Note that this data set includes more variables than what are needed to complete the activity. Students are welcome to further explore the data using these additional variables.
Download data: ironman_lake_placid_female_2022_canadian.csv
Variable Descriptions
Variable | Description |
---|---|
Bib | registration number of each runner used for identification |
Name | The participant’s name |
Country | What country the participant is from |
Gender | The participant’s gender |
Division | The age range or membership a runner is |
Division.Rank | Within the divisions, the place each runner has obtained over all races |
Overall.Time | The total time it took to complete the Ironman in minutes |
Overall.Rank | The runner’s finishing place for that particular triathlon |
Swim.Time | The time in minutes it took to complete the swimming portion |
Swim.Rank | The place the runner finished for the swim portion |
Bike.Time | The time in minutes it took to complete the biking portion |
Bike.Rank | The place the runner finished for the bike portion |
Run.Time | The time in minutes it took to complete the running portion |
Run.Rank | The place the runner finished for the running portion |
Finish.Status | States whether someone completed the Ironman successfully |
Location | Where the Ironman takes place |
Year | They year when the mentioned participant ran |
Materials
Class handout - requires technology