Skip to content

Latest commit

 

History

History
42 lines (24 loc) · 1.23 KB

File metadata and controls

42 lines (24 loc) · 1.23 KB
layout title
page
Methods

Data

What data sets are you using?

This project uses the 2023 RDS and PSD data.

What did you do to prepare the data?

To prepare the data, we created functions to clean the data and impute additional columns as needed. As part of this process, we also ensured the common columns in the RDS and PSD were standardized so they could be harmonized (e.g., had the same factor levels).

Tools (aka “component specification”)

What software packages, modules, etc. did you use?

We used R Studio and GitHub.

What are the dependencies between these and how did you render them interoperable?

Processes

What does your workflow or pipeline look like?

The fellows divided work among ourselves, assigning each person to create certain functions that targeted certain columns. Once the functions were created, all of them were called in a single script, which produced a series of cleaned dataframes for analyses: A clean RDS, a clean PSD, and a clean cominbed dataframe.

What steps did you follow?

Analyses

What approaches did you try that didn’t work? What analyses did you end up sticking with?

RDS

Limitations

What are the shortcomings of your approach? How can your work be improved?