• Home
  • Blog
  • Primary Data Cleaning Lab Project

Primary Data Cleaning Lab Project

0 comments

Lab 3 is based on Primary Data Cleaning.

You are part of a research team that is studying whether or not participation in intramural sports influences college student’s perceptions of institutional support from their university. You have designed a survey that focuses on two key constructs: 1) Intramural Sport Program Attitudes and 2) Perceived Institutional Support. In addition, you have questions that ask whether they participate in intramural sport (PART) and their year in school (YEAR). The codebook for this survey is provided on the Canvas page (Codebook) and the raw data from the survey is also provided (Data). Please “clean” the data to prepare for analysis and make sure that your final file has:

 Separate tabs for “Raw Data” and “Cleaned Data”

 A column added for respondent IDs

 Simplify the labels (e.g. GA_1, GA_2,GA_3)

 Bolded column labels, centered alignment, and adjusted widths

 Cases with 3 or more missing data values have been deleted

 Cases with 2 or less missing data values have had median imputation (i.e. replace with median value)

Categorical variables re-coded into numerical form

 Composite mean scores created

 “Cleaned” data only have respondent ID, Intramural participation, year in school, and composite mean scores with two decimal places

 Calculate the Cronbach’s alpha for the two key constructs included in this survey (i.e., Intramural Sport Program Attitudes, Perceived Institutional Support).

About the Author

Follow me


{"email":"Email address invalid","url":"Website address invalid","required":"Required field missing"}