The Maxwell-Boltzmann distribution gives the probability that a classical particle has energy E as P(E) ∝ exp(-E/kT). The velocity distribution of gas particles follows from this and explains the speed distribution, average kinetic energy, and pressure of ideal gases. It emerges as the high-temperature limit of quantum statistics.
You already know from the canonical partition function that the probability of a system occupying a microstate with energy E is proportional to the Boltzmann factor exp(-E/kT), where k is Boltzmann's constant and T is temperature. The Maxwell-Boltzmann distribution applies this framework to the translational kinetic energy of individual molecules in a classical ideal gas. Each molecule moves independently, so its energy is just (1/2)mv², and the probability of having speed v follows directly from P ∝ exp(-mv²/2kT).
The resulting speed distribution f(v) has a characteristic shape: it starts at zero (no molecules with zero speed), rises to a peak at the most probable speed v_mp = sqrt(2kT/m), then falls off with a long tail toward high speeds. Three characteristic speeds are often distinguished: the most probable speed v_mp, the mean speed ⟨v⟩ = sqrt(8kT/πm), and the root-mean-square speed v_rms = sqrt(3kT/m). All three scale as sqrt(T/m) — speed increases with temperature and decreases with molecular mass. This explains why lighter gases like helium diffuse faster than heavier gases like nitrogen at the same temperature.
The tail of the distribution is physically crucial even though it contains few molecules. Evaporation, chemical reaction rates, and atmospheric escape all depend on molecules with energies well above average. The Arrhenius equation for reaction rates (which you may encounter in chemistry or physical chemistry) draws directly on this tail: only molecules with enough energy to surmount an activation barrier contribute to the reaction rate, and that fraction is set by the Boltzmann factor.
The Maxwell-Boltzmann distribution is described as the "classical limit" because at sufficiently high temperatures (or low densities), quantum statistics reduce to it. In the quantum case, identical particles obey either Fermi-Dirac statistics (fermions, half-integer spin) or Bose-Einstein statistics (bosons, integer spin). Both distributions reduce to the Boltzmann factor when the occupation probability per state is much less than 1 — the regime where particles rarely compete for the same quantum state. This condition is satisfied for most common gases at room temperature, which is why the classical Maxwell-Boltzmann picture works so well in everyday chemistry and kinetic theory.
Connecting back to thermodynamics: averaging (1/2)mv² over the Maxwell-Boltzmann distribution gives ⟨KE⟩ = (3/2)kT per molecule, which is the equipartition theorem result for three translational degrees of freedom. This is not a coincidence — both equipartition and Maxwell-Boltzmann follow from the same Boltzmann distribution over phase space. The partition function you computed earlier is the generating object from which the speed distribution, average energy, pressure, and heat capacity all emerge.