Crystallographic Symmetry and Space Groups

A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Core Idea

Crystallographic symmetry describes how molecules are arranged in a crystal lattice, and this symmetry fundamentally determines what information the diffraction experiment provides. A crystal is defined by a unit cell (the smallest box that tiles space by translation to fill the crystal), which contains one or more copies of the molecule related by symmetry operations (rotations, screw axes, and translations). The set of all symmetry operations forms the space group — one of 65 possible space groups for biological macromolecules (which cannot have mirror planes or inversion centers because proteins and nucleic acids are chiral). The asymmetric unit is the smallest portion of the unit cell from which the entire crystal can be generated by applying the space group symmetry operations. Crucially, the asymmetric unit (what crystallography solves) is not necessarily the biologically relevant assembly — a dimer in the crystal may be a crystallographic artifact, or a monomer in the asymmetric unit may form a biological dimer across a symmetry axis. Distinguishing the crystallographic assembly from the biological assembly is essential for interpreting crystal structures correctly.

Explainer

When a protein crystallizes, its molecules arrange in a regular three-dimensional lattice — a pattern that repeats identically in all directions, extending across the entire crystal (which may contain billions of unit cells). This regularity is what makes diffraction possible: X-rays scattered by all the identical copies interfere constructively at specific angles, producing the sharp diffraction spots from which the structure is determined. Understanding the symmetry of this arrangement is not merely a mathematical formality — it determines how many molecules are in each repeating unit, how the diffraction data should be processed, how molecular replacement searches are conducted, and whether an observed protein-protein contact is biologically meaningful or an artifact of crystal packing.

The unit cell is the basic repeating box: its dimensions (a, b, c lengths and alpha, beta, gamma angles) define the lattice, and the entire crystal is generated by stacking unit cells in three dimensions. Within the unit cell, molecules may be related by symmetry operations — rotations (2-fold, 3-fold, 4-fold, or 6-fold axes) and screw axes (rotations combined with translations along the axis). The complete set of symmetry operations, together with the lattice translations, defines the space group. For biological macromolecules, only 65 of the 230 possible space groups are allowed, because proteins and nucleic acids are chiral — they cannot be superimposed on their mirror image, so symmetry operations that include reflection (mirror planes, glide planes, inversion) are physically impossible in protein crystals.

The asymmetric unit is the fundamental concept for interpreting crystal structures. It is the smallest region of the unit cell that, when all space group symmetry operations are applied, generates the complete unit cell contents. If the space group has 4-fold symmetry (multiplicity = 4), the asymmetric unit is one quarter of the unit cell. The asymmetric unit may contain one molecule, part of a molecule (if the molecule sits on a crystallographic symmetry axis), or multiple molecules (if more than one copy happens to be present in the asymmetric unit — called non-crystallographic symmetry, or NCS). The critical point for biologists is that the asymmetric unit is not the same as the biological assembly. A protein that functions as a homodimer may crystallize with one monomer in the asymmetric unit, with the biologically relevant dimer formed across a crystallographic two-fold axis. Conversely, a monomeric protein may have two copies in the asymmetric unit related by NCS — a packing arrangement with no biological significance.

Distinguishing the crystallographic assembly (what the crystal symmetry generates) from the biological assembly (the functional oligomeric state in solution) requires integrating multiple lines of evidence. The PISA server analyzes crystal interfaces by calculating buried surface area, solvation energy, and interface complementarity to predict which contacts represent stable biological assemblies versus crystal-packing artifacts. Solution experiments — analytical ultracentrifugation, size-exclusion chromatography, cross-linking mass spectrometry — provide independent evidence of oligomeric state. This analysis is critical because published crystal structures in the PDB are deposited as asymmetric unit contents, and the biological assembly must be generated by applying the appropriate symmetry operations. Misinterpreting a crystallographic dimer as a biological dimer (or missing a biological dimer because only one monomer is in the asymmetric unit) can lead to fundamentally wrong conclusions about mechanism, regulation, and drug targeting.

Crystallographic Symmetry and Space Groups

Core Idea

Explainer

Prerequisite Chain

Prerequisites (2)

Leads To (2)