Skip to content

Latest commit

 

History

History
 
 

week4

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 

This week covers replications and extensions of this article on correlations between socioeconomic factors and mobility with COVID-19 caseloads in NYC in early 2020. You'll work in groups of 2 for this.

  1. Start by reading the paper. Focus on figures 1 and 2 and table 1. These will be the main results to replicate in the paper. Time permitting we will try to replicate Table 2 as well.

  2. Follow the links to the datasets that were used, and try to track down the COVID case count and American Community Survey (ACS) data. The tidycensus package should be useful for getting the ACS data. You'll need a Census API key (signup here) and this book might be helpful for working with the tidycensus package.

  3. Sketch out a plan for how you can use these data sets to get the results in the paper and write reproducible code to create the results in the paper. Follow the template for the ngrams assignment in organizing your code. For instance, you might want a file called 01_download_covid_case_counts.sh to download the data for NYC case counts to a data/ subdirectory in your repository. Do all of your work in your group's project repository, which you have direct read/write access to (no need to fork and issue pull requests, etc.):

  4. From here think about extensions to the article. Each group will probably come up with different questions to ask of the data. Write down the questions your group is interested in along with a plan for how you can tackle them with the data you have.

  5. Write one Rmarkdown notebook file that contains all of your results, including the replications and your extension. Commit the source code and rendered notebook to your group's Github repository (links for that coming soon), with a README that explains what each file does.