Questions — Causal Information Theory

Question 1 Multiple Choice

In a causal graph, two variables X and Z are d-separated given conditioning set S if all paths between them are blocked. What does d-separation imply about information flow?

Ad-separation means X and Z are always independent

Bd-separation means X and Z are informationally independent given S in any distribution consistent with the causal graph — formally, I(X;Z|S) = 0 in all Markov distributions over the graph

Cd-separation guarantees that X does not cause Z

Dd-separation is only relevant for continuous variables

Question 2 True / False

Transfer entropy T(X → Y) measures information flow from time series X to time series Y. It is defined as I(Y_t; X_past | Y_past). Why is conditioning on Y_past necessary to isolate the causal effect of X on Y?

TTrue

FFalse

Question 3 Short Answer

Explain why observational data (where variables are passively observed) can yield different conditional independence statements than interventional data (where variables are actively set to specific values). What does this imply for causal discovery?

Think about your answer, then reveal below.

Question 4 Multiple Choice

The causal Markov condition states that each variable is conditionally independent of its non-descendants given its parents in the causal DAG. This condition relates the graph structure to probability distributions. Why is this condition essential for causal inference?

AIt defines what we mean by a 'causal' graph as opposed to just a dependency graph

BIt connects causal assumptions (encoded in the graph) to measurable probabilities (conditional independences), allowing data to constrain and test causal hypotheses

CIt is only a mathematical convenience with no practical importance

DIt guarantees that causal inference is always identifiable from data

Questions: Causal Information Theory