Small-angle X-ray scattering (SAXS) measures the scattering of X-rays by macromolecules in solution, providing low-resolution structural information about molecular size, shape, and oligomeric state without requiring crystals or isotope labeling. The scattering curve (intensity vs. scattering angle) encodes the radius of gyration (Rg, from the Guinier approximation), maximum dimension (Dmax, from the pair distribution function), and overall molecular envelope (from ab initio shape reconstruction). SAXS is particularly valuable for characterizing flexible and disordered proteins, multi-domain assemblies, conformational changes upon ligand binding, and quality control of purified samples. It complements high-resolution methods by providing solution-state shape information under nearly any buffer condition.
Not all structural questions require atomic resolution. Before determining a crystal structure or cryo-EM map, a researcher often needs to answer simpler but equally important questions: Is this protein globular or elongated? Is it monomeric or dimeric in solution? Does it undergo a conformational change upon ligand binding? Is it folded or disordered? SAXS answers these questions rapidly, in solution, under any buffer conditions, with minimal sample requirements.
In a SAXS experiment, a dilute protein solution (~0.5-5 mg/mL) is exposed to a focused X-ray beam, and the scattering at low angles (0.5-5 degrees) is measured on a detector. Unlike crystallography (where the crystal lattice produces discrete diffraction spots), solution SAXS produces a smooth, continuous scattering curve because the molecules are randomly oriented — the signal is the rotational average of the scattering from all orientations. The scattering curve I(q) as a function of momentum transfer q (proportional to the scattering angle) encodes the overall size and shape of the molecule.
The most fundamental analysis extracts the radius of gyration (Rg) from the Guinier region (very low angles, where the scattering curve is approximately Gaussian). Rg quantifies the overall compactness of the molecule — a larger Rg means a more extended molecule. The pair distribution function P(r) — obtained by indirect Fourier transform of the scattering curve — shows the distribution of all intramolecular distances, with the maximum distance being Dmax. A spherical protein has a bell-shaped P(r) that drops to zero at a Dmax approximately equal to the diameter. An elongated protein has a skewed P(r) extending to larger distances. A multi-domain protein with a flexible linker shows a P(r) with a shoulder reflecting the inter-domain separation.
For more detailed modeling, ab initio shape reconstruction algorithms (like DAMMIF/DAMMIN) build dummy-atom models that reproduce the experimental scattering curve, producing a molecular envelope at ~15-25 Angstrom resolution. When high-resolution structures of individual domains are available, rigid-body modeling fits the relative positions and orientations of domains to the SAXS data. For flexible systems, ensemble methods (EOM, MultiFoXS) generate thousands of conformations and select a weighted ensemble that best fits the experimental curve — characterizing not a single structure but the range of shapes the molecule adopts in solution. SAXS has become an essential complement to high-resolution methods, providing the solution-state context that crystallographic and cryo-EM structures need for complete biological interpretation.