Questions: Corpus Linguistics - Methodology

5 questions to test your understanding

Question 1 Multiple Choice

A corpus is best described as:

A. Any collection of written texts assembled for research
B. A large, carefully designed, annotated collection of natural language texts with systematic sampling, quality control, and documented provenance
C. A database of dictionary definitions
D. A machine-readable version of a classic literary work
Question 2 Multiple Choice

Why is corpus annotation (tagging for parts of speech, parsing for syntax, marking for semantic roles) important in corpus linguistics?

A. To correct errors in the original texts
B. To enable systematic searching and quantitative analysis of linguistic structures that are not visible in raw text
C. Because raw text is uninterpretable
D. To replace linguistic theory with empirical counting
Question 3 True / False

Corpus evidence that word X appears 1000 times more frequently than word Y definitively proves that speakers find word X more useful or psychologically salient.

T. True
F. False
Question 4 True / False

The use of corpora to study language has mainly confirmed theoretical predictions from introspective linguistics without substantially changing linguistic theory.

T. True
F. False
Question 5 Short Answer

Explain why sampling strategy is crucial in corpus design and how poor sampling can lead to misleading conclusions about language.
