Questions: Data Preparation, Screening, and Quality Assurance

5 questions to test your understanding

Score: 0 / 5

Question 1 Multiple Choice

In a depression study, participants with the highest depression scores are significantly more likely to skip the follow-up questionnaire. What type of missingness is this, and what is its primary implication?

AMCAR — missingness is unrelated to anything, so listwise deletion produces unbiased estimates

BMAR — missingness depends on observed variables, so multiple imputation using other variables is valid

CMNAR — missingness is related to the unobserved values themselves, meaning analyses that ignore it will likely be biased

DMCAR — because we cannot directly observe why participants skipped the questionnaire

Question 2 Multiple Choice

You discover that three participants have their age recorded as '220'. What is the most appropriate first step?

ARemove all three cases immediately to protect data integrity

BReplace each value with the sample mean age

CVerify the values against original records; correct if possible, flag for exclusion if not verifiable

DIgnore them — three impossible values cannot materially affect a large sample

Question 3 True / False

If less than 5% of values are missing, listwise deletion generally produces unbiased estimates.

TTrue

FFalse

Question 4 True / False

Documenting every data preparation decision — what was found, what was done, and why — is essential for scientific reproducibility, not optional bookkeeping.

TTrue

FFalse

Question 5 Short Answer

Why is it necessary to determine the mechanism of missingness (MCAR, MAR, or MNAR) before deciding how to handle missing data?

Think about your answer, then reveal below.