Because of interest and the need for better examples (for teaching and for use in tools under-development), such as prcr and tidyLPA, I worked to create two data packages, data easily available through an R package.
A benefit of the data being in an R
package is that it is even easier to access than other formats (in R
): Just load the name of the package and type the name of the data frame, or, if the data is included as built-in data in another package (one that is loaded), just type the name of the data frame. This can be helpful because data packages can make it easy to use an interesting dataset right away (loading data can sometimes be surprisingly hard) Another benefit is the data are documented and easily joined (in the case of the student questionnaire and achievement data in the PISA data) with related information, such as students’ schools. The data are also saved in an efficient format.
pisaUSA15: Data package that provides student questionnaire data from the 2015 PISA for students in the United States of America. I’ve used these for examples in the
prcr
andtidyLPA
packages. More information is available at http://www.oecd.org/pisa/data/.railtrails: Data package with trail data from the Rails-to-Trails Conservancy, including the trail name, state, distance, and surface. I’ve used this data to illustrate ideas behind mixed effects (or multi-level) models here, here, and here. More information is available at https://www.traillink.com/. UPDATE: This package is now available on CRAN (see here).