These R packages import sports, weather, stock data and more (2024)

Close Ad

computerworld

UNITED STATES

  • United States
  • Australia
  • India
  • Netherlands
  • New Zealand
  • United Kingdom

×

Close

Feature

These R packages import sports, weather, stock data and more (2)

');IDG.GPT.IMUCounter = IDG.GPT.IMUCounter + 1; IDG.GPT.defineGoogleTagSlot(slotName, [300,250], false);var options = {screenSize:'971 1115', scriptTags: []};$('#' + slotName).responsiveAd(options);

By Sharon Machlis

Executive Editor, Data & Analytics, Computerworld |

Related Topics

  • Analytics
  • Business Intelligence
  • R Language

There are lots of good reasons you might want to analyze public data, from detecting salary trends in government data to uncovering insights about a potential investment (or your favorite sports team).

But before you can run analyses and visualize trends, you need to have the data. The packages listed below make it easy to find economic, sports, weather, political and other publicly available data and import it directly into R -- in a format that's ready for you to work your analytics magic.

Packages that are on CRAN can be installed on your system by using the R command install.packages("packageName") -- you only need to run this once. GitHub packages are best installed with the devtools package -- install that once with install.packages("devtools") and then use that to install packages from GitHub using the format devtools::install_github("repositoryName/packageName"). Once installed, you can load a package into your working session once each session using the format library("packageName").

Some of the sample code below comes from package documentation or blog posts by package authors. For more information about a package, you can run help(package="packageName") in R to get info on functions included in the package and, if available, links to package vignettes (R-speak for additional documentation). To see sample code for a particular function, try example(topic="functionName", package="packageName") or simply ?functionName for all available help about a function including any sample code (not all documentation includes samples).

For more on best R packages, see Great R Packages for data import, wrangling and visualization.

R packages to import public data

PackageCategoryDescriptionSample CodeMore info
blscrapeREconomics, GovernmentFor specific information about U.S. salaries and employment info, the Bureau of Labor Statistics offers a wealth of data available via this new package. blsAPI package is another option. CRAN.bls_api(c("LEU0254530800", "LEU0254530600"),
startyear = 2000, endyear = 2015)
Package vignettes
quantmodFinance, GovernmentThis package is designed for financial modelling but also has functions to easily pull data from Google Finance, Yahoo Finance and the St. Louis Federal Reserve (FRED). CRAN.getSymbols("DEXJPUS",src="FRED")Intro on getting data
Bureau of Economic AnalysisEconomics, GovernmentMaintained by Andera Batch at BEA, this taps into the bureau's API to download data sets. CRAN.beaSpecs <- list('UserID' = beaKey , 'Method' = 'GetData', 'datasetname' = 'NIPA', 'TableName' = 'T20305', 'Frequency' = 'Q', 'Year' = 'X', 'ResultFormat' = 'json');
beaPayload <- beaGet(beaSpecs);
See the GitHub repo, including info about recent project changes
edgarWebRFinance, GovernmentThis package is designed to let you search and download data from the U.S. Securities and Exchange Commission, including corporate and mutual-fund financial filings. CRAN.getSymbols("DEXJPUS",src="FRED")See the package vignette
FredRFinance, GovernmentIf you're interested just in Fed data, FredR can access data from the Federal Reserve Economic Data API, including 240,000 US and international data sets from 77 sources. Free API key needed. GitHub.fred <- FredR(api.key)
fred$series.search("GDP")
gdp <- fred$series.observations(series_id = 'GDPC1')
Project's GitHub page
tidycensusGovernment, GISThis package downloads data from the U.S. 10-year census and American Community Survey in R-ready format. In addition, you can import data and_ geospatial files for easy mapping. Free API key needed. (For Canadian Census data, there's a completely separate cancensus package). CRAN.get_acs(state = "CA", county = "Orange", geography = "tract", variables = "B19013_001", geometry = TRUE)Project documentation page
censusapiGovernmentThis aims to be complete and offer data from all the bureau's APIs, not just the decennial Census and ACS. API key required. GitHub.mydata <- getCensus(name="acs5", vintage=2014,
key=mycensuskey,
vars=c("NAME", "B01001_001E", "B19013_001E"),
region="congressional district:*", regionin="state:36")
See the Computerworld tutorial or package's GitHub repo.
ipumsrGovernmentFor those doing heavy-duty Census analysis with raw IPUMS microdata, this package makes it easy to import that data into R. CRAN.read_ipums_micro()See the package vignettes, including the intro vignette.
eu.us.opendataGovernmentThis joint project of the U.S. and European Union governments aims to make it easy to import and compare US and EU data sets. Free U.S. Bureau of Economic Analysis API key needed. GitHub.getRel('gross domestic product', lucky = T, beaKey = myKey)
getRel('gdp', lucky = T, beaKey = myKey)
See the project's README file for installation and usage instructions.
RSocrataGovernmentPull data from any municipality that uses the Socrata data platform. Created by the City of Chicago data team. CRAN.mydata <- read.socrata(
"https://data.cityofchicago.org/
Transportation/Towed-Vehicles/ygr5-vcbg")
RSocrata GitHub repo
forbesListRMiscA bit of a niche offering, this taps into lists maintained by Forbes including largest private companies, top business schools and top venture capitalists. GitHub.#top venture capitalists 2012-2016
mydata <-
get_years_forbes_list_data(years = 2012:2016,
list_name = "Top VCs")
See the project GitHub page. You may need to manually load the tidyr package for code to work.
pollstRPoliticsThis package pulls political polling data from the Huffington Post Pollster API. CRAN.slug <- "donald-trump-favorable-rating"
polls <- pollster_charts_polls(slug)[["content"]]
See the example vignette
data.worldMultiR package to use data sets from data.world. API key needed. CRAN.See examples in GitHub repo.See the Quick Start vignette
LahmanSportsR interface for the famed Lahman baseball database. CRAN.batavg <- battingStats()See this blog post on using R for baseball research
rnoaaWeatherTap into numerous National Oceanic and Atmospheric Administration APIs, including climate, tornadoes and the Climate Prediction Center. NOAA API key needed.options(noaakey = "yourAPIkey")
storms14 <- storm_data(year = 2014)
See this rOpenSci tutorial.
rtweetSocial MediaTap into Twitter's REST and stream APIs with R. API key needed. CRAN.search_tweets("#rstats", n = 18000, include_rts = FALSE)See the introductory vignette.

Related:

  • Analytics
  • Business Intelligence
  • R Language

Sharon Machlis is Director of Editorial Data & Analytics at Foundry, where she works on data analysis and in-house editor tools in addition to writing. Her book Practical R for Mass Communication and Journalism was published by CRC Press. She was named Digital Analytics Association's 2021 Top (Data) Practitioner, winner of the 2023 Jesse H. Neal journalism award for best instructional content, 2014 Azbee national gold award for investigative reporting, and 2017 Azbee gold for how-to article, among other awards. You can find her on Mastodon at masto.machlis.com/@smach.

Follow

Copyright © 2018 IDG Communications, Inc.

It’s time to break the ChatGPT habit

These R packages import sports, weather, stock data and more (2024)
Top Articles
Latest Posts
Article information

Author: Foster Heidenreich CPA

Last Updated:

Views: 6219

Rating: 4.6 / 5 (56 voted)

Reviews: 87% of readers found this page helpful

Author information

Name: Foster Heidenreich CPA

Birthday: 1995-01-14

Address: 55021 Usha Garden, North Larisa, DE 19209

Phone: +6812240846623

Job: Corporate Healthcare Strategist

Hobby: Singing, Listening to music, Rafting, LARPing, Gardening, Quilting, Rappelling

Introduction: My name is Foster Heidenreich CPA, I am a delightful, quaint, glorious, quaint, faithful, enchanting, fine person who loves writing and wants to share my knowledge and understanding with you.