Leveraging US Census Data in BigQuery

I’ve been using Google’s cloud services more and more, specifically BigQuery. Besides the speed of queries and the simple API integration for different languages, like R, BigQuery makes available a large number of public data sets that come in quite handy. Here’s a quick guide to leveraging some of the... [Read More]
Tags: census, BigQuery, SQL, geocoding, ACS

Building Statlines Over Custom Date Ranges with baseballr and Statcast Data

baseballr has provided two functions to pull statlines for players over custom date ranges, daily_batter_bref and daily_pitcher_bref. Unfortunately, Baseball-Reference has made some underlying changes to the feature that produces the data for those functions and I am not sure they will be salvageable any time soon. [Read More]
Tags: R, statcast, baseballr

Creating a Retrosheet Event and Roster Database Using R and baseballr

Retrosheet remains one of the very best data resources for the game of baseball. While we are all used to play-by-play data being readily available through Baseball Savant, if you really want to do any kind of research relying on that kind of data before 2008, Retrosheet is the only... [Read More]
Tags: R, baseballr, retrosheet