• Introduction
    • Course Introduction
    • R vs RStudio
    • The R/RStudio environments
  • Data Manipulation
    • Week 02
    • Data types
    • Reading and writing data files
    • Working with date objects
    • Extracting and replacing values using base functions
    • Relational and boolean operations
    • Example: Cleaning a buoy data table
    • Week 03
    • Manipulating dataframes with dplyr
    • Working with string objects
    • Week 04
    • Grouping and summarizing tables
    • Tidying/reshaping tables using tidyr
    • Joining data tables
    • Example of data manipulation workflows
  • Plots
    • Week 05
    • The base plotting environment
    • The lattice plotting environment
    • The ggplot plotting environment
    • Manipulating colors in R
  • Univariate
    • Week 06
    • Visualizing univariate data
    • Week 07
    • Comparing univariate data distributions
    • Theoretical Q-Q plot
    • Week 08
    • Fits and residuals
    • Spread-location plot
    • Week 09
    • Re-expressing data
    • Letter value summaries
    • The Two R’s of EDA
  • Bivariate
    • Week 10
    • Bivariate analysis
    • Resistant lines
    • Week 11
    • The third R of EDA: Residuals
    • Detecting discontinuities in the data
  • Two-way tables
    • Week 12
    • Median polish/Mean polish
  • Misc
    • R markdown documents
    • Week 13
    • Creating maps in R
    • Connecting to relational databases
  • Datasets

Data used in ES218

  • CO2 (1959-2018): file, README
  • CO2 (1959-2020): file, README
  • Southern oscillation index (1960-2014): file, README
  • Pacific decadal oscillation index (1900-2001): file, README
  • ACS Income and Education (2008-2012): file (CSV), file (RDS), Code book
  • Income by educational attainment (2008 - 2012): file (CSV), file with FIPS (CSV), Code book
  • Population (1850-2013): file, README
  • Colby courses (Spring 2020): file, Department/Division key
  • Seeded clouds: file, README
  • GHCND temperature data for five US locations: file, README, Documentation
  • Historical temperature data for New York: file, README
  • Historical temperature data for Bombay: file, README
  • Historical temperature data for Shanghai: file, README
  • Gulf of Maine buoy data (2012): file, README
  • Gulf of Maine buoy data (1905 - 2012): file
  • Kennebec River daily discharge (2004 - 2014): file (csv),file (xlsx), README
  • NFL Combine: file, README
  • Uninsured by county: file, README
  • Greenhouse gas emissions for the northeast (2008 & 2011): file, README
  • North american grain production (1961-2012): file, README
  • Logan airport (BOS) flight data (2014): file (CSV),file (rds), README
  • Consumer expenditure (1995-2012): file (csv), file (RDS), README
  • William Cleveland’s fusion time data: file
  • William Cleveland’s food web data: file
  • William Cleveland’s ganglion data: file
  • William Cleveland’s figure 3.6 data: file
  • William Cleveland’s carbon dating data: file
  • In-class sample data 1: file

Creative Commons License Manny Gimond (2021)