Skip to contents

Get Play-By-Play Data for NCAA Baseball Games

Usage

ncaa_pbp(
  game_info_url = NA_character_,
  game_pbp_url = NA_character_,
  raw_html_to_disk = FALSE,
  raw_html_path = "/",
  read_from_file = FALSE,
  file = NA_character_,
  ...
)

Arguments

game_info_url

The url for the game's boxscore data. This can be found using the ncaa_schedule_info function.

game_pbp_url

The url for the game's play-by-play data. This can be found using the ncaa_schedule_info function.

raw_html_to_disk

Write raw html to disk (saves as {game_pbp_id}.html in raw_html_path directory)

raw_html_path

Directory path to write raw html

read_from_file

Read from raw html on disk

file

File with full path to read raw html

...

Additional arguments passed to an underlying function like httr.

Value

A data frame with play-by-play data for an individual game.

col_nametypesdescription
game_datecharacterGame date (NA on the redesigned page; use ncaa_schedule_info()).
locationcharacterVenue / conditions line when present.
attendancelogicalReported attendance (NA on the redesigned page).
inningcharacterInning number.
inning_top_botcharacterHalf-inning ("top" or "bot").
scorecharacterRunning score (away-home) after the play.
battingcharacterBatting team name.
fieldingcharacterFielding team name.
descriptioncharacterPlay description text.
game_pbp_urlcharacterstats.ncaa.org play-by-play url for the game.
game_pbp_idintegerstats.ncaa.org play-by-play (contest) identifier.

Details

Live usage (reads stats.ncaa.org, which is behind Akamai bot protection and needs the optional chromote + Google Chrome browser fallback, so it is shown here rather than as a runnable example):

ncaa_pbp(game_info_url = "https://stats.ncaa.org/contests/2167178/box_score")