Homology Modeling

Research Depth 190 in the knowledge graph I know this Set as goal
Unlocks 2 downstream topics
homology-modeling comparative-modeling template-based loop-modeling MODELLER

Core Idea

Homology modeling (comparative modeling) predicts a protein's three-dimensional structure based on its sequence similarity to one or more experimentally determined template structures. Because protein structure is more conserved than sequence during evolution, proteins sharing >30% sequence identity typically share the same overall fold, enabling structure prediction by copying the template's backbone and rebuilding side chains and loops. The process involves template identification (BLAST, HHpred), sequence-template alignment, model building (MODELLER, SWISS-MODEL), and model validation (DOPE score, Ramachandran analysis). Homology modeling was the dominant computational structural biology method before AlphaFold, and understanding its principles remains important for interpreting and evaluating all predicted structures.

Explainer

For decades before AlphaFold, the most reliable method for predicting a protein's structure was to find a relative with a known structure and copy it. This is homology modeling — the computational analog of the evolutionary insight that structure is more conserved than sequence. Two proteins that diverged from a common ancestor millions of years ago may share only 30% of their amino acid sequence, yet their three-dimensional folds are remarkably similar. Homology modeling exploits this conservation to build a structural model of a target protein using one or more experimentally determined template structures.

The workflow has four steps. Template identification: search the PDB for structures with sequence similarity to the target, using BLAST (sequence search) or HHpred (profile-profile search, more sensitive for distant homologs). Alignment: align the target sequence to the template sequence, determining which residues correspond. This is the most error-prone step — insertions, deletions, and ambiguous regions in the alignment directly produce errors in the model. Model building: programs like MODELLER or SWISS-MODEL copy the template backbone for aligned regions, model insertions/deletions (loops) using ab initio or knowledge-based methods, place side chains using rotamer libraries, and optimize the model through energy minimization. Validation: assess model quality using statistical potential scores (DOPE), stereochemical checks (Ramachandran plot), and comparison to experimental data if available.

The accuracy of homology models depends primarily on the sequence identity to the template. Above 50% identity, models are typically excellent — backbone RMSD to the true structure is ~1 Angstrom, and most side chain orientations are correct. Between 30-50%, the core fold is reliable but loops and surface features have significant errors. Between 20-30% (the "twilight zone"), alignment uncertainty becomes the dominant error source, and model quality varies widely. Below 20%, fold recognition itself becomes unreliable. The most problematic regions in any homology model are loops (connecting secondary structure elements), which diverge rapidly in evolution and are modeled ab initio rather than copied from the template — loop modeling remains one of the hardest unsolved problems in computational structural biology.

AlphaFold has transformed this landscape by using deep learning to predict structures from sequence with accuracy rivaling experimental structures for many proteins. But the conceptual framework of homology modeling remains relevant: AlphaFold's multi-sequence alignment input is effectively an automated version of homology modeling's template search, and its confidence scores (pLDDT, PAE) flag the same regions — loops, disordered segments, domains without homologs — that challenge traditional homology modeling. Understanding why some regions are hard to predict (lack of evolutionary conservation, conformational flexibility, sequence-structure ambiguity) is essential for interpreting any predicted structure, whether from MODELLER or AlphaFold.

Practice Questions 3 questions

Prerequisite Chain

Counting to 10Counting to 20Understanding ZeroThe Number ZeroCounting to FiveOne-to-One CorrespondenceCombining Small Groups Within 5Addition Within 10Addition Within 20Two-Digit Addition Without RegroupingTwo-Digit Addition with RegroupingAddition Within 100Repeated Addition as MultiplicationMultiplication Facts Within 100Division as Equal SharingDivision as Grouping (Measurement Division)Division: Grouping (Repeated Subtraction) ModelDivision: Fair Sharing ModelDivision as Equal SharingDivision as GroupingBasic Division FactsDivision Facts Within 100Two-Digit by One-Digit DivisionDivision with RemaindersRemainders and Quotients in DivisionDivision Word ProblemsIntroduction to Long DivisionFactors and MultiplesPrime and Composite NumbersEquivalent FractionsRelating Fractions and DecimalsDecimal Place ValueReading and Writing DecimalsComparing and Ordering DecimalsAdding and Subtracting DecimalsMultiplying DecimalsDividing DecimalsDividing FractionsMixed Number ArithmeticOrder of OperationsInteger Order of OperationsVariable ExpressionsCombining Like TermsOne-Step EquationsTwo-Step EquationsSolving Multi-Step EquationsEquations with Variables on Both SidesAngle Pairs: Complementary, Supplementary, and VerticalParallel Lines and TransversalsCorresponding AnglesAlternate Interior AnglesTriangle Angle Sum TheoremExterior Angle TheoremTriangle Inequality TheoremSimilar Triangles: AA SimilaritySimilar Triangles: SSS and SAS SimilarityProportions in Similar TrianglesRight Triangle Trigonometry IntroductionTrigonometric Ratios ReviewRadian MeasureConverting Between Degrees and RadiansThe Unit CircleGraphing Sine and CosineGraphing Tangent and Reciprocal Trigonometric FunctionsDerivatives of Trigonometric FunctionsAntiderivativesIterated Integrals and Fubini's TheoremDouble Integrals in Cartesian CoordinatesDouble Integrals over Rectangular RegionsDouble Integrals in Polar CoordinatesDouble Integrals: Definition and SetupIterated Integrals and Fubini's TheoremDouble Integrals over Rectangular RegionsDouble Integrals over General RegionsApplications of Double Integrals: Area, Mass, and MomentsTriple Integrals in Cartesian CoordinatesTriple Integrals in Cylindrical and Spherical CoordinatesChange of Variables and the Jacobian DeterminantApplications of Triple Integrals: Volume and MassVector Fields and Their RepresentationsLine Integrals of Vector FieldsGreen's TheoremSurface Integrals and Flux of Vector FieldsSurface Integrals and Flux of Vector FieldsDivergence Theorem: Flux and OutflowDivergence TheoremElectric FluxGauss's LawConductors in Electrostatic EquilibriumCapacitance and CapacitorsDielectricsDielectric Constant and Relative PermittivityElectric Field Inside Dielectric MaterialsDielectric Materials and PolarizationDielectric Susceptibility and PermittivityEnergy Density in Electric FieldsElectric Current and Current DensityElectrical Resistance and ResistivityOhm's Law and Circuit ElementsElectromotive Force (EMF) and BatteriesKirchhoff's Circuit Laws: Voltage and CurrentDC Circuit Network Analysis MethodsTransient Response in RC CircuitsRC CircuitsLC and RLC CircuitsAC Circuits: FundamentalsImpedance and ReactanceAC Power and ResonanceElectromagnetic WavesThe Electromagnetic SpectrumBlackbody Radiation and Planck's LawPhotoelectric EffectThe Photon: Light as QuantaCompton ScatteringWave-Particle Dualityde Broglie WavelengthHeisenberg Uncertainty PrincipleWavefunction and the Born RuleThe Schrödinger EquationState Vectors and WavefunctionsQuantum SuperpositionQuantum EntanglementBell Theorem and Bell InequalitiesPostulates of Quantum MechanicsScattering TheoryIntroduction to Scattering TheoryPartial Wave Analysis in ScatteringSpin Angular MomentumElectron Spin and Intrinsic Magnetic MomentStern-Gerlach Experiment: Spin Quantization and MeasurementElectron Diffraction and Matter Wave PropertiesDavisson-Germer Experiment: Crystal Diffraction of ElectronsElectron Diffraction and Matter Wave InterferenceWavefunctions and Probability Density InterpretationQuantum Superposition and Linear Combinations of StatesQuantum Operators and ObservablesCanonical Commutation Relations and UncertaintyHeisenberg Uncertainty Principle and Measurement LimitsTime-Independent Schrödinger Equation and EigenvaluesHydrogen Atom in Quantum MechanicsSpectral Lines and Energy TransitionsSelection Rules for Atomic TransitionsLS and jj Coupling Schemes in Multi-Electron AtomsPauli Exclusion Principle and Antisymmetric WavefunctionsElectron Configuration and the Aufbau PrincipleThe Periodic Table and Atomic Electronic StructureThe Periodic TableElectron ConfigurationPeriodic TrendsIonization EnergyIonic BondingLewis StructuresResonance Structures and Delocalized ElectronsResonance and Formal ChargeMolecular Polarity and Dipole MomentsIntermolecular ForcesStates of Matter and Phase Changes: Melting, Boiling, and SublimationGas Laws and the Ideal Gas EquationGas Stoichiometry and Volume-Volume CalculationsThermochemistry and EnthalpyHeat Capacity and CalorimetryEntropy and Molecular DisorderSpontaneity and ΔGEntropy and Gibbs Free EnergyChemical EquilibriumChemical KineticsRate Law DeterminationEnzyme KineticsCell Cycle Regulation and CheckpointsMitosisCytokinesisMeiosisChromosomal Theory of InheritanceMendelian GeneticsDominance, Recessiveness, and Allelic InteractionsSex-Linked InheritanceNon-Mendelian Inheritance PatternsPopulation Genetics and Hardy-Weinberg EquilibriumNatural SelectionGenetic DriftEvolutionary Genetics FoundationsAllele Frequency Change and Evolutionary DynamicsGene Flow and Population StructureGene Flow and Selection: Opposing ForcesGene FlowHardy-Weinberg EquilibriumSpeciationPhylogenetics and Evolutionary TreesMolecular Evolution and Molecular ClocksPairwise Sequence AlignmentHomology Modeling

Longest path: 191 steps · 1001 total prerequisite topics

Prerequisites (2)

Leads To (1)