A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Information Theory in Neuroscience

Research · Depth 99 in the knowledge graph

444 prerequisites beneath it

Core Idea

Information theory provides quantitative tools for understanding neural coding: how neurons encode sensory information, process it, and transmit it to downstream structures. The mutual information I(S; R) between stimulus S and neural response R quantifies how much information about the stimulus is available in spike patterns. The information rate (bits per spike or bits per second) measures the efficiency of the neural code. Fisher information quantifies the precision with which neurons can encode stimulus parameters; it is related to, but distinct from, mutual information. The channel capacity of a single neuron (the maximum information that can be reliably transmitted given its biophysical constraints) is finite, so transmitting more information requires higher firing rates or more complex temporal patterns. Population coding distributes information across many cells: redundancy makes the code robust to noise, and synergy lets joint activity patterns carry information that no single neuron carries alone. Information-theoretic analyses suggest that many neural systems operate close to these limits, often trading coding efficiency against metabolic cost. These concepts illuminate sensory transduction, neural computation, learning, and brain function.

Explainer

How does the brain encode information? A neuron receives inputs, fires action potentials, and transmits signals to downstream targets. How much information about sensory stimuli is encoded in spike patterns? How efficiently do neurons use their bandwidth? Information theory provides quantitative answers.

Neural Information and Mutual Information:

Consider a sensory neuron responding to a stimulus (e.g., light intensity). The stimulus S ranges over possible values; the response R is the spike count or spike timing. The mutual information I(S; R) = H(R) - H(R|S) measures how much knowing the response reduces uncertainty about the stimulus (the quantity is symmetric, so it also equals H(S) - H(S|R)). H(R) is the response entropy: the total variability of spike patterns across all stimuli. H(R|S) is the response entropy conditioned on the stimulus: the residual variability due to noise when the stimulus is held fixed. If the response distribution is the same regardless of stimulus, I(S; R) = 0. If the response is a deterministic function of the stimulus, H(R|S) = 0 and I(S; R) = H(R). Reported estimates for sensory neurons are often in the range of 1-10 bits per stimulus presentation, depending on the system and the estimation method, which is surprisingly high given the apparent noisiness of individual spikes.
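
As a concrete illustration of I(S; R) = H(R) - H(R|S), here is a minimal sketch that computes the mutual information from a joint probability table. The three toy tables (a noiseless neuron, a neuron that ignores the stimulus, and a noisy neuron) are hypothetical values chosen for illustration, not measurements.

```python
import math

def entropy(probs):
    """Shannon entropy in bits of a discrete distribution."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

def mutual_information(joint):
    """I(S; R) = H(R) - H(R|S) for a table joint[s][r] = P(S=s, R=r)."""
    p_s = [sum(row) for row in joint]          # marginal over stimuli
    p_r = [sum(col) for col in zip(*joint)]    # marginal over responses
    h_r = entropy(p_r)
    # H(R|S) = sum over s of P(s) * H(R | S=s)
    h_r_given_s = sum(
        ps * entropy([p / ps for p in row])
        for ps, row in zip(p_s, joint) if ps > 0
    )
    return h_r - h_r_given_s

# Hypothetical toy neuron: two equally likely stimuli, response is 0 or 1 spikes.
noiseless = [[0.5, 0.0], [0.0, 0.5]]        # response tracks the stimulus: 1 bit
independent = [[0.25, 0.25], [0.25, 0.25]]  # response ignores the stimulus: 0 bits
noisy = [[0.4, 0.1], [0.1, 0.4]]            # 20% "wrong" responses: about 0.28 bits

for name, table in [("noiseless", noiseless), ("independent", independent), ("noisy", noisy)]:
    print(name, round(mutual_information(table), 3))
```

Estimating these probabilities from real spike data is harder than this sketch suggests, because limited sampling biases entropy estimates; the tables here are assumed to be known exactly.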

Fisher Information and Decoding Precision:

Fisher information F(theta) is the expected curvature (the negative expected second derivative) of the log-likelihood of the response with respect to the parameter theta. The Cramer-Rao bound states that any unbiased estimator of theta has variance of at least 1/F(theta). For neurons encoding a stimulus intensity, high Fisher information means small intensity changes are reliably detected. The relationship between Fisher and mutual information is subtle: mutual information is the average information over the entire stimulus range; Fisher information is the local information around a particular value. Under Gaussian noise the two can be related explicitly, but in general they capture complementary aspects.
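
To make the "local" character of Fisher information concrete, the sketch below assumes a model that is not in the text above: a neuron with a Gaussian tuning curve f(theta) and Poisson spike counts in a window of T seconds. For that model F(theta) = T * f'(theta)^2 / f(theta). The peak rate, tuning width, and window length are hypothetical.

```python
import math

PEAK_RATE = 50.0   # spikes/s at the preferred stimulus (hypothetical)
PREFERRED = 0.0    # preferred stimulus value (hypothetical)
WIDTH = 1.0        # tuning width, same units as theta (hypothetical)
WINDOW = 0.5       # counting window T in seconds (hypothetical)

def tuning_rate(theta):
    """Gaussian tuning curve f(theta) in spikes/s."""
    return PEAK_RATE * math.exp(-0.5 * ((theta - PREFERRED) / WIDTH) ** 2)

def fisher_information(theta):
    """F(theta) = T * f'(theta)^2 / f(theta) for Poisson spike counts."""
    f = tuning_rate(theta)
    df = -f * (theta - PREFERRED) / WIDTH ** 2   # analytic derivative of f
    return WINDOW * df ** 2 / f

def cramer_rao_std(theta):
    """Smallest standard deviation of any unbiased estimator, 1/sqrt(F)."""
    return 1.0 / math.sqrt(fisher_information(theta))

for theta in (0.0, 0.5, 1.0, math.sqrt(2.0), 2.5):
    print(f"theta={theta:.2f}  F={fisher_information(theta):.3f}")
```

In this toy model the Fisher information is zero at the peak of the tuning curve, where the slope vanishes, and largest on the flanks, so a single neuron discriminates best between stimuli away from its preferred value.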

Information Rate and Bandwidth:

Neurons operate under bandwidth constraints. The refractory period (1-2 ms) caps the maximum firing rate, and spike-timing jitter limits the temporal resolution at which spike times carry information. Together, these create a finite "channel capacity": the maximum information the neuron can reliably transmit per unit time. For a neuron firing at mean rate r (Hz, at most f_max) whose spike times are resolved to delta_t (seconds), the entropy of the spike train, which upper-bounds the transmitted information, is roughly log_2(e / (r * delta_t)) bits per spike when r * delta_t is much less than 1, or about r * log_2(e / (r * delta_t)) bits per second. To transmit more information per unit time, the neuron must increase its firing rate or use finer or more complex temporal patterns (burst timing, phase relationships).
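
A rough way to put numbers on this limit is to discretize time into bins of width delta_t, assume at most one spike per bin, and treat bins as independent. The entropy of the resulting binary sequence is an upper bound on the information the spike train can carry. The sketch below makes exactly those simplifying assumptions; the 40 Hz rate and 1 ms resolution are hypothetical.

```python
import math

def binary_entropy(p):
    """Entropy in bits of a bin that contains a spike with probability p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log2(p) - (1.0 - p) * math.log2(1.0 - p)

def spike_train_entropy(rate_hz, dt_s):
    """Return (bits per second, bits per spike) for independent binary bins."""
    p = rate_hz * dt_s                      # spike probability per bin
    if not 0.0 < p <= 1.0:
        raise ValueError("rate_hz * dt_s must lie in (0, 1]")
    bits_per_second = binary_entropy(p) / dt_s
    bits_per_spike = bits_per_second / rate_hz
    return bits_per_second, bits_per_spike

rate, dt = 40.0, 0.001                      # hypothetical: 40 Hz, 1 ms resolution
per_second, per_spike = spike_train_entropy(rate, dt)
low_rate_approx = math.log2(math.e / (rate * dt))   # valid when rate*dt << 1
print(f"{per_second:.1f} bits/s, {per_spike:.2f} bits/spike "
      f"(low-rate approximation {low_rate_approx:.2f})")
```

Under these assumptions the entropy per second grows with firing rate until half the bins are occupied, while the entropy per spike falls, which is one way to see why sparse codes can be efficient per spike. Real information rates are lower than this bound because noise entropy must be subtracted.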

Population Coding and Synergy:

No single neuron carries all information about a stimulus. Populations of neurons distribute information across many cells. If N neurons each carried I bits and their responses were fully independent, the population would carry N*I bits. In reality, neurons are correlated: they share information (redundancy), which lowers the total below N*I, but they can also encode information in collective patterns (synergy), which raises it. The challenge is decoding: how does the brain extract information from population responses? Linear decoding (a weighted sum of spike counts) can leave information on the table; nonlinear decoding can extract synergistic information. Populations are often thought to be organized to reduce redundancy (e.g., neurons with different tuning curves) while preserving synergy for task-relevant variables.
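
Redundancy and synergy can be illustrated with two toy codes. The sketch below uses one simple index, I(S; R1, R2) - I(S; R1) - I(S; R2), which is positive for synergy and negative for redundancy; other decompositions exist, and this one is chosen here only for brevity. Both example codes are hypothetical.

```python
import math
from collections import defaultdict

def mutual_information(triples):
    """I(X; Y) in bits from (x, y, probability) triples that sum to 1."""
    p_xy, p_x, p_y = defaultdict(float), defaultdict(float), defaultdict(float)
    for x, y, p in triples:
        p_xy[(x, y)] += p
        p_x[x] += p
        p_y[y] += p
    return sum(
        p * math.log2(p / (p_x[x] * p_y[y]))
        for (x, y), p in p_xy.items() if p > 0
    )

def synergy_index(table):
    """I(S; R1, R2) - I(S; R1) - I(S; R2) for rows (s, r1, r2, probability)."""
    pair = mutual_information([(s, (r1, r2), p) for s, r1, r2, p in table])
    first = mutual_information([(s, r1, p) for s, r1, _, p in table])
    second = mutual_information([(s, r2, p) for s, _, r2, p in table])
    return pair - first - second

# Synergistic toy code: the stimulus is the XOR of the two responses, so
# neither neuron alone says anything about it, but the pair determines it.
xor_code = [(r1 ^ r2, r1, r2, 0.25) for r1 in (0, 1) for r2 in (0, 1)]

# Redundant toy code: both neurons simply copy the stimulus.
copy_code = [(s, s, s, 0.5) for s in (0, 1)]

print("XOR code: ", synergy_index(xor_code))    # positive: synergy
print("copy code:", synergy_index(copy_code))   # negative: redundancy
```

A linear readout of the XOR code cannot recover the stimulus, which is a small instance of why nonlinear decoding is needed to extract synergistic information.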

Efficient Coding Hypothesis:

A central principle in computational neuroscience is that neural circuits optimize the information transmitted per unit metabolic cost. Neurons are expensive: a single action potential is estimated to cost on the order of 10⁸ to 10⁹ ATP molecules, depending on cell type and on what is counted. The firing rate therefore reflects an energy-information tradeoff. Sensory systems in data-rich environments (e.g., vision) tend to fire at higher rates than those in low-information environments (e.g., slow chemical sensing). Learning itself may optimize neural codes: early in training, neurons fire irregularly; with practice, responses become more selective (reduced entropy) and more informative about task-relevant variables. This fits an information-theoretic view: the nervous system allocates resources (firing rates, connectivity) to maximize information about behaviorally important variables.
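
The energy-information tradeoff can be sketched with a deliberately simple toy model, which is an assumption of this example rather than something stated above: each time bin carries binary entropy H(p) for spike probability p, and costs a fixed resting amount plus an extra amount per spike. The function name and cost values are hypothetical.

```python
import math

def binary_entropy(p):
    """Entropy in bits of a bin that contains a spike with probability p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log2(p) - (1.0 - p) * math.log2(1.0 - p)

def most_efficient_firing_probability(spike_cost, rest_cost=1.0, steps=1000):
    """Grid-search the spike probability p that maximizes bits per unit energy.

    Energy per bin is modeled as rest_cost + spike_cost * p (arbitrary units).
    """
    best_p, best_efficiency = 0.0, 0.0
    for i in range(1, steps):
        p = i / steps
        efficiency = binary_entropy(p) / (rest_cost + spike_cost * p)
        if efficiency > best_efficiency:
            best_p, best_efficiency = p, efficiency
    return best_p, best_efficiency

for cost in (0.0, 10.0, 100.0):   # hypothetical spike costs relative to rest
    p_opt, _ = most_efficient_firing_probability(cost)
    print(f"spike cost {cost:>5}: most efficient spike probability {p_opt:.3f}")
```

When spikes are free, the most informative code fires in half the bins; as spikes become more expensive relative to rest, the most energy-efficient firing probability drops well below one half, which matches the qualitative picture of sparse, low-rate codes.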

Applications:

Neural Decoding: Given population spike patterns, estimate the stimulus or behavioral variable. Information theory guides optimal decoder design.
Sensory Adaptation: When stimulus statistics change, information-theoretic principles predict how neural responses adjust to re-optimize coding efficiency.
Brain-Computer Interfaces: Information theory quantifies the channel capacity of neural signals and limits achievable performance.
Evolution and Development: Animals in information-rich niches tend to have larger brains and higher neural firing rates. Information-theoretic principles may explain these patterns.
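
As an illustration of the Neural Decoding item, here is a minimal sketch of a maximum-likelihood decoder. It assumes a hypothetical population of five neurons with Gaussian tuning curves and independent Poisson spike counts; the tuning parameters, the observed counts, and the candidate grid are all made up for the example.

```python
import math

PREFERRED = [-2.0, -1.0, 0.0, 1.0, 2.0]   # preferred stimuli (hypothetical)

def expected_count(stimulus, preferred, peak=20.0, width=1.0, baseline=1.0):
    """Mean spike count of one neuron in the counting window."""
    return baseline + peak * math.exp(-0.5 * ((stimulus - preferred) / width) ** 2)

def log_likelihood(counts, stimulus):
    """Log-probability of the observed counts under independent Poisson noise."""
    total = 0.0
    for n, preferred in zip(counts, PREFERRED):
        mean = expected_count(stimulus, preferred)
        total += n * math.log(mean) - mean - math.lgamma(n + 1)
    return total

def decode(counts, candidates):
    """Return the candidate stimulus with the highest likelihood."""
    return max(candidates, key=lambda s: log_likelihood(counts, s))

# Hypothetical observation: counts close to what a stimulus near 1.0 would evoke.
observed = [1, 4, 13, 21, 13]
grid = [i / 10 for i in range(-30, 31)]    # candidate stimuli from -3.0 to 3.0
print("decoded stimulus:", decode(observed, grid))
```

A grid search keeps the sketch short; with correlated noise or many stimulus dimensions, the independence assumption and the exhaustive search would both need to be replaced.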

Information theory applied to neuroscience suggests that the brain, despite its apparent randomness and noise, often operates near information-theoretic limits, efficiently encoding, compressing, and transmitting information under severe biological constraints. This perspective has transformed our understanding of neural coding and continues to guide research into how the brain solves information processing problems.

Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Number Bonds to 10 → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts (0s, 1s, 2s, 5s, 10s) → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Comparing and Ordering Integers → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Dividing Integers → Unit Rates → Proportions → Percent Concept → Converting Between Fractions, Decimals, and Percents → Operations with Rational Numbers → Two-Step Equations → Solving Multi-Step Equations → Equations with Variables on Both Sides → Angle Pairs: Complementary, Supplementary, and Vertical → Parallel Lines and Transversals → Corresponding Angles → Alternate Interior Angles → Triangle Angle Sum Theorem → Exterior Angle Theorem → 
Triangle Inequality Theorem → Similar Triangles: AA Similarity → Similar Triangles: SSS and SAS Similarity → Proportions in Similar Triangles → Right Triangle Trigonometry Introduction → Sine, Cosine, and Tangent Ratios → Trigonometric Ratios Review → Radian Measure → Converting Between Degrees and Radians → The Unit Circle → Graphing Sine and Cosine → Graphing Tangent and Reciprocal Trigonometric Functions → Derivatives of Trigonometric Functions → Antiderivatives → Indefinite Integrals → Basic Integration Rules → Riemann Sums → Definite Integral Definition → Probability Density Functions and Continuous Distributions → Cumulative Distribution Functions → Continuous Random Variables → Probability Density Functions → Expected Value → Weak Law of Large Numbers → Probability Axioms and Rules → Conditional Probability → Law of Total Probability → Bayes' Theorem → Joint and Conditional Entropy → Mutual Information → KL Divergence → Fisher Information → Information Theory in Neuroscience

Longest path: 100 steps · 444 total prerequisite topics

Prerequisites (3)

Shannon Entropy (hard) · Mutual Information (hard) · Fisher Information (soft)

Leads To (0)

No topics depend on this one yet.