5 questions to test your understanding
A researcher scrapes 50 million tweets to study public opinion on immigration policy. She finds 62% of tweets express negative views and concludes that most people oppose more permissive immigration policy. What is the most significant flaw in this reasoning?
A study using social media data finds that people who post frequently about social activities report higher loneliness on follow-up surveys. A researcher concludes that active social media use causes loneliness. What is the primary methodological concern?
Collecting a larger dataset in big data research eliminates selection bias by including more observations from the target population.
Big data's scale can reveal patterns impossible to detect in smaller datasets, but it amplifies the consequences of poor research design rather than compensating for it.
Explain why selection bias in big data research is different from selection bias in traditional survey research, and why increasing the dataset size cannot fix it.