PGA - Drive for Show, Putt for Dough?

Correlation
Using tournament data for professional golfers to see if driving or putting are more strongly related to success.
Authors
Affiliation

Alyssa Bigness

St. Lawrence University

Robin Lock

St. Lawrence University

Published

February 5, 2024

Module

Please note that these material have not yet completed the required pedagogical and industry peer-reviews to become a published module on the SCORE Network. However, instructors are still welcome to use these materials if they are so inclined.

Introduction

A common expression among golfers is “Drive for show, putt for dough.” This implies that the long initial tee shots (drives) on each hole are impressive, but the real key to success is the final strokes rolling the ball along the green into the hole (putts). Do data support this adage?

The dataset for this activity was obtained from the PGA statistics website PGA Statistics Website. Cases include all golfers who made the cut in each of 19 PGA tournaments in 2022. The dataset includes variables for driving ability, putting ability, and measuring success in the tournament. The “driving” variables include average driving distance (avgDriveDist), driving accuracy percentage (drivePct), and strokes gained off the tee (driveSG). The “putting” variables are average putts per round (avgPuttsPerRound), one putt percentage (onePuttPct), and strokes gained putting (puttsSG). The variables to measure “success” are scoring average (avgScore), official money won (Money), and Fedex Cup points (Points).

The learning objectives associated with this module are:

  • Students will be able to use correlation to measure the strength association between quantitative variables.
  • Checking regression model assumptions
  • Students will be able to compare correlations to assess which variables may be more strongly related.

This module requires students compare correlations between pairs of variables.

Technology requirement: Two handout activities accommodate different levels of available technology.

  • The “No Tech version” provides a correlation matrix for students to use to find the required correlations to compare.

  • The “With Tech” version provides the dataset and asks students to use technology to compute the needed correlations.

Data

Each row of data gives the measures for one golfer in one tournament. The dataset covers 19 PGA tournaments from the 2022 season with 1387 cases in all. Each tournament consists of four rounds of golf. Some golfers are eliminated after the first two (or sometimes three) rounds. Only players who competed in all four rounds (i.e. those that made the “cut”) are included in this dataset.

Download data: PGA2022.csv

Variable Descriptions
Variable Description
playerName Name of the player
country The country where the player is from
avgDriveDist Average driving distance (in yards)
drivePct Percentage of times a tee shot comes to rest in the fairway
driveSG Strokes gained off the tee measures player performance off the tee on all par 4s and par 5s of how much better or worse a player’s drive is than the PGA Tour average
avgPuttsPerRound Average number of total putts per round
onePuttPct Percentage of times it took one putt to get the ball into the hole
puttsSG Strokes gained putting measures how many strokes a player gains or loses on the greens compared to the PGA Tour average
avgScore The scoring average is the total strokes divided by the total rounds
Money The official money is the prize money award to the Professional members
Points FedexCup Regular Season Points are awarded points by finished position for performance in each tournament
Tournament The tournament where the PGA Tour is taking place

Data Source

The data were obtained from the PGA Statistics website

Materials

The data and worksheet associated with this module are available for download through the following links.

PGA2022.csv - Dataset with driving, putting, and success measures for indivudal golfer in 19 PGA tournaments in 2022.

PGACorrelationsNoTech.docx- “No Tech” version of the activity worksheet provides a correlation matrix for students to use in answering the questions.

PGACorrelationsWithTech.docx - “With Tech” version of the activity worksheet assumes students have technology to compute any needed correlations from the provided dataset.

Sample solutions to the worksheets

PGACorrelationsNoTech-Ans.docx- Sample solutions to the “No Tech” version of the activity worksheet.

PGACorrelationsWithTech-Ans.docx - Sample solutions to the “With Tech” version of the activity worksheet.

Students should find evidence to support the claim that. in general, putting statistics tend to be better predictors of tournament success than driving statistics. So golfers really do “drive for show, but putt for dough”.

Authors

Fill in later