Questions: Data Preparation, Screening, and Quality Assurance

5 questions to test your understanding

Score: 0 / 5
Question 1 Multiple Choice

In a depression study, participants with the highest depression scores are significantly more likely to skip the follow-up questionnaire. What type of missingness is this, and what is its primary implication?

AMCAR — missingness is unrelated to anything, so listwise deletion produces unbiased estimates
BMAR — missingness depends on observed variables, so multiple imputation using other variables is valid
CMNAR — missingness is related to the unobserved values themselves, meaning analyses that ignore it will likely be biased
DMCAR — because we cannot directly observe why participants skipped the questionnaire
Question 2 Multiple Choice

You discover that three participants have their age recorded as '220'. What is the most appropriate first step?

ARemove all three cases immediately to protect data integrity
BReplace each value with the sample mean age
CVerify the values against original records; correct if possible, flag for exclusion if not verifiable
DIgnore them — three impossible values cannot materially affect a large sample
Question 3 True / False

If less than 5% of values are missing, listwise deletion generally produces unbiased estimates.

TTrue
FFalse
Question 4 True / False

Documenting every data preparation decision — what was found, what was done, and why — is essential for scientific reproducibility, not optional bookkeeping.

TTrue
FFalse
Question 5 Short Answer

Why is it necessary to determine the mechanism of missingness (MCAR, MAR, or MNAR) before deciding how to handle missing data?

Think about your answer, then reveal below.