A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Data Augmentation Techniques

Graduate Depth 83 in the knowledge graph ☐ I know this ☆ Set as goal

399prerequisites beneath it

Core Idea

Data augmentation generates synthetic training examples through domain-appropriate transformations (image rotations, text paraphrasing) without collecting new labels. This increases effective dataset size and improves robustness. Domain knowledge is critical: augmentations must preserve label semantics to avoid introducing noise.

Explainer

From supervised learning, you know that model performance depends heavily on having enough labeled training data. But collecting and labeling data is expensive — medical images need expert radiologists, speech data needs transcription, and rare events may simply not appear often enough in any dataset. Data augmentation offers a practical workaround: instead of collecting new data, generate synthetic training examples by applying transformations to existing data that change the input while preserving the correct label. A photo of a cat rotated 15 degrees is still a photo of a cat, so you can train on both the original and the rotated version, effectively doubling your dataset for free.

In computer vision, standard augmentation techniques include random cropping, horizontal flipping, rotation, color jittering (slight changes to brightness, contrast, and saturation), and scaling. These work because image classifiers should be invariant to these transformations — a dog is still a dog whether the photo is slightly brighter or the dog appears on the left side instead of the right. More aggressive techniques like cutout (masking random rectangular regions) and mixup (blending two images and their labels) push the model to rely on broader patterns rather than memorizing specific pixel arrangements. In natural language processing, augmentations include synonym replacement, random word insertion and deletion, back-translation (translating to another language and back), and paraphrasing. Audio augmentation adds background noise, changes pitch, or varies speed.

The critical constraint is that augmentations must preserve label semantics. Flipping a photo horizontally is fine for animal classification, but disastrous for text recognition — a mirror-image "b" becomes "d." Rotating a chest X-ray 180 degrees changes the clinical interpretation entirely. Replacing a word with a synonym works for sentiment analysis but can destroy meaning in a medical context where terminology is precise. This is where domain knowledge becomes essential: you must understand what transformations the task is invariant to and limit augmentation to those transformations. Applying inappropriate augmentations doesn't just fail to help — it actively injects incorrect training signal, teaching the model that wrong answers are right.

Data augmentation also functions as a form of regularization, reducing overfitting by making it harder for the model to memorize the training set. When every epoch presents slightly different versions of the same images, the model is forced to learn robust, generalizable features rather than pixel-level patterns specific to the training data. This is particularly valuable when working with small datasets, where overfitting is the primary failure mode. Modern approaches like AutoAugment and RandAugment take this further by learning or randomly sampling augmentation policies, removing the need for manual tuning of which transformations to apply and how aggressively. The combination of augmentation with other regularization techniques (dropout, weight decay) and transfer learning has made it possible to train effective models on datasets that would have been considered impossibly small a decade ago.

Practice Questions 5 questions

Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Number Bonds to 10 → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts (0s, 1s, 2s, 5s, 10s) → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Comparing and Ordering Integers → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Introduction to Exponents → Order of Operations → Integer Order of Operations → Variable Expressions → The Distributive Property → Variables and Expressions Review → Introduction to Polynomials → Adding and Subtracting Polynomials → Multiplying Polynomials → Factorial → Permutations → Combinations → Counting Principles: Addition and Multiplication Rules → Introduction to Graph Theory → Propositional Logic Foundations → Logical Equivalences → Boolean Algebra → Boolean Type and Truth Values → Comparison Operators and Boolean Tests → Logical Operators and Boolean Algebra → Conditional Statements → Defining and Calling Functions → Functions: Decomposing Problems → Function Parameters and Argument Passing → Return Values → Variable Scope → Introduction to Classes → Objects and Instances → Methods and Attributes → Algorithm Design Basics → Supervised Learning Fundamentals → Data Augmentation Techniques

Longest path: 84 steps · 399 total prerequisite topics

Prerequisites (1)

Supervised Learning Fundamentalshard

Leads To (0)

No topics depend on this one yet.